WorldWideScience

Sample records for protein folding protein

  1. PREFACE Protein folding: lessons learned and new frontiers Protein folding: lessons learned and new frontiers

    Science.gov (United States)

    Pappu, Rohit V.; Nussinov, Ruth

    2009-03-01

    In appropriate physiological milieux proteins spontaneously fold into their functional three-dimensional structures. The amino acid sequences of functional proteins contain all the information necessary to specify the folds. This remarkable observation has spawned research aimed at answering two major questions. (1) Of all the conceivable structures that a protein can adopt, why is the ensemble of native-like structures the most favorable? (2) What are the paths by which proteins manage to robustly and reproducibly fold into their native structures? Anfinsen's thermodynamic hypothesis has guided the pursuit of answers to the first question whereas Levinthal's paradox has influenced the development of models for protein folding dynamics. Decades of work have led to significant advances in the folding problem. Mean-field models have been developed to capture our current, coarse grain understanding of the driving forces for protein folding. These models are being used to predict three-dimensional protein structures from sequence and stability profiles as a function of thermodynamic and chemical perturbations. Impressive strides have also been made in the field of protein design, also known as the inverse folding problem, thereby testing our understanding of the determinants of the fold specificities of different sequences. Early work on protein folding pathways focused on the specific sequence of events that could lead to a simplification of the search process. However, unifying principles proved to be elusive. Proteins that show reversible two-state folding-unfolding transitions turned out to be a gift of natural selection. Focusing on these simple systems helped researchers to uncover general principles regarding the origins of cooperativity in protein folding thermodynamics and kinetics. On the theoretical front, concepts borrowed from polymer physics and the physics of spin glasses led to the development of a framework based on energy landscape theories. These

  2. Physics of protein folding

    Science.gov (United States)

    Finkelstein, A. V.; Galzitskaya, O. V.

    2004-04-01

    Protein physics is grounded on three fundamental experimental facts: protein, this long heteropolymer, has a well defined compact three-dimensional structure; this structure can spontaneously arise from the unfolded protein chain in appropriate environment; and this structure is separated from the unfolded state of the chain by the “all-or-none” phase transition, which ensures robustness of protein structure and therefore of its action. The aim of this review is to consider modern understanding of physical principles of self-organization of protein structures and to overview such important features of this process, as finding out the unique protein structure among zillions alternatives, nucleation of the folding process and metastable folding intermediates. Towards this end we will consider the main experimental facts and simple, mostly phenomenological theoretical models. We will concentrate on relatively small (single-domain) water-soluble globular proteins (whose structure and especially folding are much better studied and understood than those of large or membrane and fibrous proteins) and consider kinetic and structural aspects of transition of initially unfolded protein chains into their final solid (“native”) 3D structures.

  3. Nucleation phenomena in protein folding: the modulating role of protein sequence

    International Nuclear Information System (INIS)

    Travasso, Rui D M; FaIsca, Patricia F N; Gama, Margarida M Telo da

    2007-01-01

    For the vast majority of naturally occurring, small, single-domain proteins, folding is often described as a two-state process that lacks detectable intermediates. This observation has often been rationalized on the basis of a nucleation mechanism for protein folding whose basic premise is the idea that, after completion of a specific set of contacts forming the so-called folding nucleus, the native state is achieved promptly. Here we propose a methodology to identify folding nuclei in small lattice polymers and apply it to the study of protein molecules with a chain length of N = 48. To investigate the extent to which protein topology is a robust determinant of the nucleation mechanism, we compare the nucleation scenario of a native-centric model with that of a sequence-specific model sharing the same native fold. To evaluate the impact of the sequence's finer details in the nucleation mechanism, we consider the folding of two non-homologous sequences. We conclude that, in a sequence-specific model, the folding nucleus is, to some extent, formed by the most stable contacts in the protein and that the less stable linkages in the folding nucleus are solely determined by the fold's topology. We have also found that, independently of the protein sequence, the folding nucleus performs the same 'topological' function. This unifying feature of the nucleation mechanism results from the residues forming the folding nucleus being distributed along the protein chain in a similar and well-defined manner that is determined by the fold's topological features

  4. Kinetics and Thermodynamics of Membrane Protein Folding

    Directory of Open Access Journals (Sweden)

    Ernesto A. Roman

    2014-03-01

    Full Text Available Understanding protein folding has been one of the great challenges in biochemistry and molecular biophysics. Over the past 50 years, many thermodynamic and kinetic studies have been performed addressing the stability of globular proteins. In comparison, advances in the membrane protein folding field lag far behind. Although membrane proteins constitute about a third of the proteins encoded in known genomes, stability studies on membrane proteins have been impaired due to experimental limitations. Furthermore, no systematic experimental strategies are available for folding these biomolecules in vitro. Common denaturing agents such as chaotropes usually do not work on helical membrane proteins, and ionic detergents have been successful denaturants only in few cases. Refolding a membrane protein seems to be a craftsman work, which is relatively straightforward for transmembrane β-barrel proteins but challenging for α-helical membrane proteins. Additional complexities emerge in multidomain membrane proteins, data interpretation being one of the most critical. In this review, we will describe some recent efforts in understanding the folding mechanism of membrane proteins that have been reversibly refolded allowing both thermodynamic and kinetic analysis. This information will be discussed in the context of current paradigms in the protein folding field.

  5. Teaching computers to fold proteins

    DEFF Research Database (Denmark)

    Winther, Ole; Krogh, Anders Stærmose

    2004-01-01

    A new general algorithm for optimization of potential functions for protein folding is introduced. It is based upon gradient optimization of the thermodynamic stability of native folds of a training set of proteins with known structure. The iterative update rule contains two thermodynamic averages...

  6. Solvent Effects on Protein Folding/Unfolding

    Science.gov (United States)

    García, A. E.; Hillson, N.; Onuchic, J. N.

    Pressure effects on the hydrophobic potential of mean force led Hummer et al. to postulate a model for pressure denaturation of proteins in which denaturation occurs by means of water penetration into the protein interior, rather than by exposing the protein hydrophobic core to the solvent --- commonly used to describe temperature denaturation. We study the effects of pressure in protein folding/unfolding kinetics in an off-lattice minimalist model of a protein in which pressure effects have been incorporated by means of the pair-wise potential of mean force of hydrophobic groups in water. We show that pressure slows down the kinetics of folding by decreasing the reconfigurational diffusion coefficient and moves the location of the folding transition state.

  7. Understanding ensemble protein folding at atomic detail

    International Nuclear Information System (INIS)

    Wallin, Stefan; Shakhnovich, Eugene I

    2008-01-01

    Although far from routine, simulating the folding of specific short protein chains on the computer, at a detailed atomic level, is starting to become a reality. This remarkable progress, which has been made over the last decade or so, allows a fundamental aspect of the protein folding process to be addressed, namely its statistical nature. In order to make quantitative comparisons with experimental kinetic data a complete ensemble view of folding must be achieved, with key observables averaged over the large number of microscopically different folding trajectories available to a protein chain. Here we review recent advances in atomic-level protein folding simulations and the new insight provided by them into the protein folding process. An important element in understanding ensemble folding kinetics are methods for analyzing many separate folding trajectories, and we discuss techniques developed to condense the large amount of information contained in an ensemble of trajectories into a manageable picture of the folding process. (topical review)

  8. Protein folding and the organization of the protein topology universe

    DEFF Research Database (Denmark)

    Lindorff-Larsen,, Kresten; Røgen, Peter; Paci, Emanuele

    2005-01-01

    residues and, in addition, that the topology of the transition state is closer to that of the native state than to that of any other fold in the protein universe. Here, we review the evidence for these conclusions and suggest a molecular mechanism that rationalizes these findings by presenting a view...... of protein folds that is based on the topological features of the polypeptide backbone, rather than the conventional view that depends on the arrangement of different types of secondary-structure elements. By linking the folding process to the organization of the protein structure universe, we propose...

  9. Self-organized critical model for protein folding

    Science.gov (United States)

    Moret, M. A.

    2011-09-01

    The major factor that drives a protein toward collapse and folding is the hydrophobic effect. At the folding process a hydrophobic core is shielded by the solvent-accessible surface area of the protein. We study the fractal behavior of 5526 protein structures present in the Brookhaven Protein Data Bank. Power laws of protein mass, volume and solvent-accessible surface area are measured independently. The present findings indicate that self-organized criticality is an alternative explanation for the protein folding. Also we note that the protein packing is an independent and constant value because the self-similar behavior of the volumes and protein masses have the same fractal dimension. This power law guarantees that a protein is a complex system. From the analyzed data, q-Gaussian distributions seem to fit well this class of systems.

  10. Frustration in Condensed Matter and Protein Folding

    Science.gov (United States)

    Li, Z.; Tanner, S.; Conroy, B.; Owens, F.; Tran, M. M.; Boekema, C.

    2014-03-01

    By means of computer modeling, we are studying frustration in condensed matter and protein folding, including the influence of temperature and Thomson-figure formation. Frustration is due to competing interactions in a disordered state. The key issue is how the particles interact to reach the lowest frustration. The relaxation for frustration is mostly a power function (randomly assigned pattern) or an exponential function (regular patterns like Thomson figures). For the atomic Thomson model, frustration is predicted to decrease with the formation of Thomson figures at zero kelvin. We attempt to apply our frustration modeling to protein folding and dynamics. We investigate the homogeneous protein frustration that would cause the speed of the protein folding to increase. Increase of protein frustration (where frustration and hydrophobicity interplay with protein folding) may lead to a protein mutation. Research is supported by WiSE@SJSU and AFC San Jose.

  11. Improving Protein Fold Recognition by Deep Learning Networks

    Science.gov (United States)

    Jo, Taeho; Hou, Jie; Eickholt, Jesse; Cheng, Jianlin

    2015-12-01

    For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl’s benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold.

  12. Improving Protein Fold Recognition by Deep Learning Networks.

    Science.gov (United States)

    Jo, Taeho; Hou, Jie; Eickholt, Jesse; Cheng, Jianlin

    2015-12-04

    For accurate recognition of protein folds, a deep learning network method (DN-Fold) was developed to predict if a given query-template protein pair belongs to the same structural fold. The input used stemmed from the protein sequence and structural features extracted from the protein pair. We evaluated the performance of DN-Fold along with 18 different methods on Lindahl's benchmark dataset and on a large benchmark set extracted from SCOP 1.75 consisting of about one million protein pairs, at three different levels of fold recognition (i.e., protein family, superfamily, and fold) depending on the evolutionary distance between protein sequences. The correct recognition rate of ensembled DN-Fold for Top 1 predictions is 84.5%, 61.5%, and 33.6% and for Top 5 is 91.2%, 76.5%, and 60.7% at family, superfamily, and fold levels, respectively. We also evaluated the performance of single DN-Fold (DN-FoldS), which showed the comparable results at the level of family and superfamily, compared to ensemble DN-Fold. Finally, we extended the binary classification problem of fold recognition to real-value regression task, which also show a promising performance. DN-Fold is freely available through a web server at http://iris.rnet.missouri.edu/dnfold.

  13. Heterochiral Knottin Protein: Folding and Solution Structure.

    Science.gov (United States)

    Mong, Surin K; Cochran, Frank V; Yu, Hongtao; Graziano, Zachary; Lin, Yu-Shan; Cochran, Jennifer R; Pentelute, Bradley L

    2017-10-31

    Homochirality is a general feature of biological macromolecules, and Nature includes few examples of heterochiral proteins. Herein, we report on the design, chemical synthesis, and structural characterization of heterochiral proteins possessing loops of amino acids of chirality opposite to that of the rest of a protein scaffold. Using the protein Ecballium elaterium trypsin inhibitor II, we discover that selective β-alanine substitution favors the efficient folding of our heterochiral constructs. Solution nuclear magnetic resonance spectroscopy of one such heterochiral protein reveals a homogeneous global fold. Additionally, steered molecular dynamics simulation indicate β-alanine reduces the free energy required to fold the protein. We also find these heterochiral proteins to be more resistant to proteolysis than homochiral l-proteins. This work informs the design of heterochiral protein architectures containing stretches of both d- and l-amino acids.

  14. In vitro folding of inclusion body proteins.

    Science.gov (United States)

    Rudolph, R; Lilie, H

    1996-01-01

    Insoluble, inactive inclusion bodies are frequently formed upon recombinant protein production in transformed microorganisms. These inclusion bodies, which contain the recombinant protein in an highly enriched form, can be isolated by solid/liquid separation. After solubilization, native proteins can be generated from the inactive material by using in vitro folding techniques. New folding procedures have been developed for efficient in vitro reconstitution of complex hydrophobic, multidomain, oligomeric, or highly disulfide-bonded proteins. These protocols take into account process parameters such as protein concentration, catalysis of disulfide bond formation, temperature, pH, and ionic strength, as well as specific solvent ingredients that reduce unproductive side reactions. Modification of the protein sequence has been exploited to improve in vitro folding.

  15. Evolution of a protein folding nucleus.

    Science.gov (United States)

    Xia, Xue; Longo, Liam M; Sutherland, Mason A; Blaber, Michael

    2016-07-01

    The folding nucleus (FN) is a cryptic element within protein primary structure that enables an efficient folding pathway and is the postulated heritable element in the evolution of protein architecture; however, almost nothing is known regarding how the FN structurally changes as complex protein architecture evolves from simpler peptide motifs. We report characterization of the FN of a designed purely symmetric β-trefoil protein by ϕ-value analysis. We compare the structure and folding properties of key foldable intermediates along the evolutionary trajectory of the β-trefoil. The results show structural acquisition of the FN during gene fusion events, incorporating novel turn structure created by gene fusion. Furthermore, the FN is adjusted by circular permutation in response to destabilizing functional mutation. FN plasticity by way of circular permutation is made possible by the intrinsic C3 cyclic symmetry of the β-trefoil architecture, identifying a possible selective advantage that helps explain the prevalence of cyclic structural symmetry in the proteome. © 2015 The Protein Society.

  16. Roles of beta-turns in protein folding: from peptide models to protein engineering.

    Science.gov (United States)

    Marcelino, Anna Marie C; Gierasch, Lila M

    2008-05-01

    Reverse turns are a major class of protein secondary structure; they represent sites of chain reversal and thus sites where the globular character of a protein is created. It has been speculated for many years that turns may nucleate the formation of structure in protein folding, as their propensity to occur will favor the approximation of their flanking regions and their general tendency to be hydrophilic will favor their disposition at the solvent-accessible surface. Reverse turns are local features, and it is therefore not surprising that their structural properties have been extensively studied using peptide models. In this article, we review research on peptide models of turns to test the hypothesis that the propensities of turns to form in short peptides will relate to the roles of corresponding sequences in protein folding. Turns with significant stability as isolated entities should actively promote the folding of a protein, and by contrast, turn sequences that merely allow the chain to adopt conformations required for chain reversal are predicted to be passive in the folding mechanism. We discuss results of protein engineering studies of the roles of turn residues in folding mechanisms. Factors that correlate with the importance of turns in folding indeed include their intrinsic stability, as well as their topological context and their participation in hydrophobic networks within the protein's structure.

  17. Fluorescence of Alexa fluor dye tracks protein folding.

    Directory of Open Access Journals (Sweden)

    Simon Lindhoud

    Full Text Available Fluorescence spectroscopy is an important tool for the characterization of protein folding. Often, a protein is labeled with appropriate fluorescent donor and acceptor probes and folding-induced changes in Förster Resonance Energy Transfer (FRET are monitored. However, conformational changes of the protein potentially affect fluorescence properties of both probes, thereby profoundly complicating interpretation of FRET data. In this study, we assess the effects protein folding has on fluorescence properties of Alexa Fluor 488 (A488, which is commonly used as FRET donor. Here, A488 is covalently attached to Cys69 of apoflavodoxin from Azotobacter vinelandii. Although coupling of A488 slightly destabilizes apoflavodoxin, the three-state folding of this protein, which involves a molten globule intermediate, is unaffected. Upon folding of apoflavodoxin, fluorescence emission intensity of A488 changes significantly. To illuminate the molecular sources of this alteration, we applied steady state and time-resolved fluorescence techniques. The results obtained show that tryptophans cause folding-induced changes in quenching of Alexa dye. Compared to unfolded protein, static quenching of A488 is increased in the molten globule. Upon populating the native state both static and dynamic quenching of A488 decrease considerably. We show that fluorescence quenching of Alexa Fluor dyes is a sensitive reporter of conformational changes during protein folding.

  18. Solitons and protein folding: An In Silico experiment

    Energy Technology Data Exchange (ETDEWEB)

    Ilieva, N., E-mail: nevena.ilieva@parallel.bas.bg [Institute of Information and Communication Technologies, Bulgarian Aacademy of Sciences, Sofia (Bulgaria); Dai, J., E-mail: daijing491@gmail.com [School of Physics, Beijing Institute of Technology, Beijing (China); Sieradzan, A., E-mail: adams86@wp.pl [Faculty of Chemistry, University of Gdańsk, Gdańsk (Poland); Niemi, A., E-mail: Antti.Niemi@physics.uu.se [Department of Physics and Astronomy, Uppsala University, Uppsala (Sweden); LMPT–CNRS, Université de Tours, Tours (France)

    2015-10-28

    Protein folding [1] is the process of formation of a functional 3D structure from a random coil — the shape in which amino-acid chains leave the ribosome. Anfinsen’s dogma states that the native 3D shape of a protein is completely determined by protein’s amino acid sequence. Despite the progress in understanding the process rate and the success in folding prediction for some small proteins, with presently available physics-based methods it is not yet possible to reliably deduce the shape of a biologically active protein from its amino acid sequence. The protein-folding problem endures as one of the most important unresolved problems in science; it addresses the origin of life itself. Furthermore, a wrong fold is a common cause for a protein to lose its function or even endanger the living organism. Soliton solutions of a generalized discrete non-linear Schrödinger equation (GDNLSE) obtained from the energy function in terms of bond and torsion angles κ and τ provide a constructive theoretical framework for describing protein folds and folding patterns [2]. Here we study the dynamics of this process by means of molecular-dynamics simulations. The soliton manifestation is the pattern helix–loop–helix in the secondary structure of the protein, which explains the importance of understanding loop formation in helical proteins. We performed in silico experiments for unfolding one subunit of the core structure of gp41 from the HIV envelope glycoprotein (PDB ID: 1AIK [3]) by molecular-dynamics simulations with the MD package GROMACS. We analyzed 80 ns trajectories, obtained with one united-atom and two different all-atom force fields, to justify the side-chain orientation quantification scheme adopted in the studies and to eliminate force-field based artifacts. Our results are compatible with the soliton model of protein folding and provide first insight into soliton-formation dynamics.

  19. Solitons and protein folding: An In Silico experiment

    International Nuclear Information System (INIS)

    Ilieva, N.; Dai, J.; Sieradzan, A.; Niemi, A.

    2015-01-01

    Protein folding [1] is the process of formation of a functional 3D structure from a random coil — the shape in which amino-acid chains leave the ribosome. Anfinsen’s dogma states that the native 3D shape of a protein is completely determined by protein’s amino acid sequence. Despite the progress in understanding the process rate and the success in folding prediction for some small proteins, with presently available physics-based methods it is not yet possible to reliably deduce the shape of a biologically active protein from its amino acid sequence. The protein-folding problem endures as one of the most important unresolved problems in science; it addresses the origin of life itself. Furthermore, a wrong fold is a common cause for a protein to lose its function or even endanger the living organism. Soliton solutions of a generalized discrete non-linear Schrödinger equation (GDNLSE) obtained from the energy function in terms of bond and torsion angles κ and τ provide a constructive theoretical framework for describing protein folds and folding patterns [2]. Here we study the dynamics of this process by means of molecular-dynamics simulations. The soliton manifestation is the pattern helix–loop–helix in the secondary structure of the protein, which explains the importance of understanding loop formation in helical proteins. We performed in silico experiments for unfolding one subunit of the core structure of gp41 from the HIV envelope glycoprotein (PDB ID: 1AIK [3]) by molecular-dynamics simulations with the MD package GROMACS. We analyzed 80 ns trajectories, obtained with one united-atom and two different all-atom force fields, to justify the side-chain orientation quantification scheme adopted in the studies and to eliminate force-field based artifacts. Our results are compatible with the soliton model of protein folding and provide first insight into soliton-formation dynamics

  20. Protein solubility and folding enhancement by interaction with RNA.

    Directory of Open Access Journals (Sweden)

    Seong Il Choi

    Full Text Available While basic mechanisms of several major molecular chaperones are well understood, this machinery has been known to be involved in folding of only limited number of proteins inside the cells. Here, we report a chaperone type of protein folding facilitated by interaction with RNA. When an RNA-binding module is placed at the N-terminus of aggregation-prone target proteins, this module, upon binding with RNA, further promotes the solubility of passenger proteins, potentially leading to enhancement of proper protein folding. Studies on in vitro refolding in the presence of RNA, coexpression of RNA molecules in vivo and the mutants with impaired RNA binding ability suggests that RNA can exert chaperoning effect on their bound proteins. The results suggest that RNA binding could affect the overall kinetic network of protein folding pathway in favor of productive folding over off-pathway aggregation. In addition, the RNA binding-mediated solubility enhancement is extremely robust for increasing soluble yield of passenger proteins and could be usefully implemented for high-throughput protein expression for functional and structural genomic research initiatives. The RNA-mediated chaperone type presented here would give new insights into de novo folding in vivo.

  1. Microwave-enhanced folding and denaturation of globular proteins

    DEFF Research Database (Denmark)

    Bohr, Henrik; Bohr, Jakob

    2000-01-01

    It is shown that microwave irradiation can affect the kinetics of the folding process of some globular proteins, especially beta-lactoglobulin. At low temperature the folding from the cold denatured phase of the protein is enhanced, while at a higher temperature the denaturation of the protein from...... its folded state is enhanced. In the latter case, a negative temperature gradient is needed for the denaturation process, suggesting that the effects of the microwaves are nonthermal. This supports the notion that coherent topological excitations can exist in proteins. The application of microwaves...

  2. Protein folding and protein metallocluster studies using synchrotron small angler X-ray scattering

    International Nuclear Information System (INIS)

    Eliezer, D.

    1994-06-01

    Proteins, biological macromolecules composed of amino-acid building blocks, possess unique three dimensional shapes or conformations which are intimately related to their biological function. All of the information necessary to determine this conformation is stored in a protein's amino acid sequence. The problem of understanding the process by which nature maps protein amino-acid sequences to three-dimensional conformations is known as the protein folding problem, and is one of the central unsolved problems in biophysics today. The possible applications of a solution are broad, ranging from the elucidation of thousands of protein structures to the rational modification and design of protein-based drugs. The scattering of X-rays by matter has long been useful as a tool for the characterization of physical properties of materials, including biological samples. The high photon flux available at synchrotron X-ray sources allows for the measurement of scattering cross-sections of dilute and/or disordered samples. Such measurements do not yield the detailed geometrical information available from crystalline samples, but do allow for lower resolution studies of dynamical processes not observable in the crystalline state. The main focus of the work described here has been the study of the protein folding process using time-resolved small-angle x-ray scattering measurements. The original intention was to observe the decrease in overall size which must accompany the folding of a protein from an extended conformation to its compact native state. Although this process proved too fast for the current time-resolution of the technique, upper bounds were set on the probable compaction times of several small proteins. In addition, an interesting and unexpected process was detected, in which the folding protein passes through an intermediate state which shows a tendency to associate. This state is proposed to be a kinetic molten globule folding intermediate

  3. Experimental investigation of protein folding and misfolding.

    Science.gov (United States)

    Dobson, Christopher M

    2004-09-01

    Newly synthesised proteins need to fold, often to intricate and close-packed structures, in order to function. The underlying mechanism by which this complex process takes place both in vitro and in vivo is now becoming understood, at least in general terms, as a result of the application of a wide range of biophysical and computational methods used in combination with the techniques of biochemistry and protein engineering. It is increasingly apparent, however, that folding is not only crucial for generating biological activity, but that it is also coupled to a wide range of processes within the cell, ranging from the trafficking of proteins to specific organelles to the regulation of cell growth and differentiation. Not surprisingly, therefore, the failure of proteins to fold appropriately, or to remain correctly folded, is associated with a large number of cellular malfunctions that give rise to disease. Misfolding, and its consequences such as aggregation, can be investigated by extending the types of techniques used to study the normal folding process. Application of these techniques is enabling the development of a unified description of the interconversion and regulation of the different conformational states available to proteins in living systems. Such a description proves a generic basis for understanding the fundamental links between protein misfolding and its associated clinical disorders, such as Alzheimer's disease and Type II diabetes, and for exploring novel therapeutic strategies directed at their prevention and treatment on a rational basis.

  4. Protein folding and wring resonances

    DEFF Research Database (Denmark)

    Bohr, Jakob; Bohr, Henrik; Brunak, Søren

    1997-01-01

    The polypeptide chain of a protein is shown to obey topological contraints which enable long range excitations in the form of wring modes of the protein backbone. Wring modes of proteins of specific lengths can therefore resonate with molecular modes present in the cell. It is suggested that prot......The polypeptide chain of a protein is shown to obey topological contraints which enable long range excitations in the form of wring modes of the protein backbone. Wring modes of proteins of specific lengths can therefore resonate with molecular modes present in the cell. It is suggested...... that protein folding takes place when the amplitude of a wring excitation becomes so large that it is energetically favorable to bend the protein backbone. The condition under which such structural transformations can occur is found, and it is shown that both cold and hot denaturation (the unfolding...

  5. Impact of hydrodynamic interactions on protein folding rates depends on temperature

    Science.gov (United States)

    Zegarra, Fabio C.; Homouz, Dirar; Eliaz, Yossi; Gasic, Andrei G.; Cheung, Margaret S.

    2018-03-01

    We investigated the impact of hydrodynamic interactions (HI) on protein folding using a coarse-grained model. The extent of the impact of hydrodynamic interactions, whether it accelerates, retards, or has no effect on protein folding, has been controversial. Together with a theoretical framework of the energy landscape theory (ELT) for protein folding that describes the dynamics of the collective motion with a single reaction coordinate across a folding barrier, we compared the kinetic effects of HI on the folding rates of two protein models that use a chain of single beads with distinctive topologies: a 64-residue α /β chymotrypsin inhibitor 2 (CI2) protein, and a 57-residue β -barrel α -spectrin Src-homology 3 domain (SH3) protein. When comparing the protein folding kinetics simulated with Brownian dynamics in the presence of HI to that in the absence of HI, we find that the effect of HI on protein folding appears to have a "crossover" behavior about the folding temperature. This means that at a temperature greater than the folding temperature, the enhanced friction from the hydrodynamic solvents between the beads in an unfolded configuration results in lowered folding rate; conversely, at a temperature lower than the folding temperature, HI accelerates folding by the backflow of solvent toward the folded configuration of a protein. Additionally, the extent of acceleration depends on the topology of a protein: for a protein like CI2, where its folding nucleus is rather diffuse in a transition state, HI channels the formation of contacts by favoring a major folding pathway in a complex free energy landscape, thus accelerating folding. For a protein like SH3, where its folding nucleus is already specific and less diffuse, HI matters less at a temperature lower than the folding temperature. Our findings provide further theoretical insight to protein folding kinetic experiments and simulations.

  6. Flexibility damps macromolecular crowding effects on protein folding dynamics: Application to the murine prion protein (121-231)

    Science.gov (United States)

    Bergasa-Caceres, Fernando; Rabitz, Herschel A.

    2014-01-01

    A model of protein folding kinetics is applied to study the combined effects of protein flexibility and macromolecular crowding on protein folding rate and stability. It is found that the increase in stability and folding rate promoted by macromolecular crowding is damped for proteins with highly flexible native structures. The model is applied to the folding dynamics of the murine prion protein (121-231). It is found that the high flexibility of the native isoform of the murine prion protein (121-231) reduces the effects of macromolecular crowding on its folding dynamics. The relevance of these findings for the pathogenic mechanism are discussed.

  7. Improving protein fold recognition by extracting fold-specific features from predicted residue-residue contacts.

    Science.gov (United States)

    Zhu, Jianwei; Zhang, Haicang; Li, Shuai Cheng; Wang, Chao; Kong, Lupeng; Sun, Shiwei; Zheng, Wei-Mou; Bu, Dongbo

    2017-12-01

    Accurate recognition of protein fold types is a key step for template-based prediction of protein structures. The existing approaches to fold recognition mainly exploit the features derived from alignments of query protein against templates. These approaches have been shown to be successful for fold recognition at family level, but usually failed at superfamily/fold levels. To overcome this limitation, one of the key points is to explore more structurally informative features of proteins. Although residue-residue contacts carry abundant structural information, how to thoroughly exploit these information for fold recognition still remains a challenge. In this study, we present an approach (called DeepFR) to improve fold recognition at superfamily/fold levels. The basic idea of our approach is to extract fold-specific features from predicted residue-residue contacts of proteins using deep convolutional neural network (DCNN) technique. Based on these fold-specific features, we calculated similarity between query protein and templates, and then assigned query protein with fold type of the most similar template. DCNN has showed excellent performance in image feature extraction and image recognition; the rational underlying the application of DCNN for fold recognition is that contact likelihood maps are essentially analogy to images, as they both display compositional hierarchy. Experimental results on the LINDAHL dataset suggest that even using the extracted fold-specific features alone, our approach achieved success rate comparable to the state-of-the-art approaches. When further combining these features with traditional alignment-related features, the success rate of our approach increased to 92.3%, 82.5% and 78.8% at family, superfamily and fold levels, respectively, which is about 18% higher than the state-of-the-art approach at fold level, 6% higher at superfamily level and 1% higher at family level. An independent assessment on SCOP_TEST dataset showed consistent

  8. Intermediates and the folding of proteins L and G

    Energy Technology Data Exchange (ETDEWEB)

    Brown, Scott; Head-Gordon, Teresa

    2003-07-01

    We use a minimalist protein model, in combination with a sequence design strategy, to determine differences in primary structure for proteins L and G that are responsible for the two proteins folding through distinctly different folding mechanisms. We find that the folding of proteins L and G are consistent with a nucleation-condensation mechanism, each of which is described as helix-assisted {beta}-1 and {beta}-2 hairpin formation, respectively. We determine that the model for protein G exhibits an early intermediate that precedes the rate-limiting barrier of folding and which draws together misaligned secondary structure elements that are stabilized by hydrophobic core contacts involving the third {beta}-strand, and presages the later transition state in which the correct strand alignment of these same secondary structure elements is restored. Finally the validity of the targeted intermediate ensemble for protein G was analyzed by fitting the kinetic data to a two-step first order reversible reaction, proving that protein G folding involves an on-pathway early intermediate, and should be populated and therefore observable by experiment.

  9. Ligand-promoted protein folding by biased kinetic partitioning.

    Science.gov (United States)

    Hingorani, Karan S; Metcalf, Matthew C; Deming, Derrick T; Garman, Scott C; Powers, Evan T; Gierasch, Lila M

    2017-04-01

    Protein folding in cells occurs in the presence of high concentrations of endogenous binding partners, and exogenous binding partners have been exploited as pharmacological chaperones. A combined mathematical modeling and experimental approach shows that a ligand improves the folding of a destabilized protein by biasing the kinetic partitioning between folding and alternative fates (aggregation or degradation). Computationally predicted inhibition of test protein aggregation and degradation as a function of ligand concentration are validated by experiments in two disparate cellular systems.

  10. Coherent topological phenomena in protein folding

    DEFF Research Database (Denmark)

    Bohr, Henrik; Brunak, Søren; Bohr, Jakob

    1997-01-01

    A theory is presented for coherent topological phenomena in protein dynamics with implications for protein folding and stability. We discuss the relationship to the writhing number used in knot diagrams of DNA. The winding state defines a long-range order along the backbone of a protein with long......-range excitations, `wring' modes, that play an important role in protein denaturation and stability. Energy can be pumped into these excitations, either thermally or by an external force....

  11. Energetic frustrations in protein folding at residue resolution: a homologous simulation study of Im9 proteins.

    Directory of Open Access Journals (Sweden)

    Yunxiang Sun

    Full Text Available Energetic frustration is becoming an important topic for understanding the mechanisms of protein folding, which is a long-standing big biological problem usually investigated by the free energy landscape theory. Despite the significant advances in probing the effects of folding frustrations on the overall features of protein folding pathways and folding intermediates, detailed characterizations of folding frustrations at an atomic or residue level are still lacking. In addition, how and to what extent folding frustrations interact with protein topology in determining folding mechanisms remains unclear. In this paper, we tried to understand energetic frustrations in the context of protein topology structures or native-contact networks by comparing the energetic frustrations of five homologous Im9 alpha-helix proteins that share very similar topology structures but have a single hydrophilic-to-hydrophobic mutual mutation. The folding simulations were performed using a coarse-grained Gō-like model, while non-native hydrophobic interactions were introduced as energetic frustrations using a Lennard-Jones potential function. Energetic frustrations were then examined at residue level based on φ-value analyses of the transition state ensemble structures and mapped back to native-contact networks. Our calculations show that energetic frustrations have highly heterogeneous influences on the folding of the four helices of the examined structures depending on the local environment of the frustration centers. Also, the closer the introduced frustration is to the center of the native-contact network, the larger the changes in the protein folding. Our findings add a new dimension to the understanding of protein folding the topology determination in that energetic frustrations works closely with native-contact networks to affect the protein folding.

  12. Glycoprotein folding and quality-control mechanisms in protein-folding diseases

    Directory of Open Access Journals (Sweden)

    Sean P. Ferris

    2014-03-01

    Full Text Available Biosynthesis of proteins – from translation to folding to export – encompasses a complex set of events that are exquisitely regulated and scrutinized to ensure the functional quality of the end products. Cells have evolved to capitalize on multiple post-translational modifications in addition to primary structure to indicate the folding status of nascent polypeptides to the chaperones and other proteins that assist in their folding and export. These modifications can also, in the case of irreversibly misfolded candidates, signal the need for dislocation and degradation. The current Review focuses on the glycoprotein quality-control (GQC system that utilizes protein N-glycosylation and N-glycan trimming to direct nascent glycopolypeptides through the folding, export and dislocation pathways in the endoplasmic reticulum (ER. A diverse set of pathological conditions rooted in defective as well as over-vigilant ER quality-control systems have been identified, underlining its importance in human health and disease. We describe the GQC pathways and highlight disease and animal models that have been instrumental in clarifying our current understanding of these processes.

  13. Nanoscale Dewetting Transition in Protein Complex Folding

    Science.gov (United States)

    Hua, Lan; Huang, Xuhui; Liu, Pu; Zhou, Ruhong; Berne, Bruce J.

    2011-01-01

    In a previous study, a surprising drying transition was observed to take place inside the nanoscale hydrophobic channel in the tetramer of the protein melittin. The goal of this paper is to determine if there are other protein complexes capable of displaying a dewetting transition during their final stage of folding. We searched the entire protein data bank (PDB) for all possible candidates, including protein tetramers, dimers, and two-domain proteins, and then performed the molecular dynamics (MD) simulations on the top candidates identified by a simple hydrophobic scoring function based on aligned hydrophobic surface areas. Our large scale MD simulations found several more proteins, including three tetramers, six dimers, and two two-domain proteins, which display a nanoscale dewetting transition in their final stage of folding. Even though the scoring function alone is not sufficient (i.e., a high score is necessary but not sufficient) in identifying the dewetting candidates, it does provide useful insights into the features of complex interfaces needed for dewetting. All top candidates have two features in common: (1) large aligned (matched) hydrophobic areas between two corresponding surfaces, and (2) large connected hydrophobic areas on the same surface. We have also studied the effect on dewetting of different water models and different treatments of the long-range electrostatic interactions (cutoff vs PME), and found the dewetting phenomena is fairly robust. This work presents a few proteins other than melittin tetramer for further experimental studies of the role of dewetting in the end stages of protein folding. PMID:17608515

  14. A model in which heat shock protein 90 targets protein-folding clefts: rationale for a new approach to neuroprotective treatment of protein folding diseases.

    Science.gov (United States)

    Pratt, William B; Morishima, Yoshihiro; Gestwicki, Jason E; Lieberman, Andrew P; Osawa, Yoichi

    2014-11-01

    In an EBM Minireview published in 2010, we proposed that the heat shock protein (Hsp)90/Hsp70-based chaperone machinery played a major role in determining the selection of proteins that have undergone oxidative or other toxic damage for ubiquitination and proteasomal degradation. The proposal was based on a model in which the Hsp90 chaperone machinery regulates signaling by modulating ligand-binding clefts. The model provides a framework for thinking about the development of neuroprotective therapies for protein-folding diseases like Alzheimer's disease (AD), Parkinson's disease (PD), and the polyglutamine expansion disorders, such as Huntington's disease (HD) and spinal and bulbar muscular atrophy (SBMA). Major aberrant proteins that misfold and accumulate in these diseases are "client" proteins of the abundant and ubiquitous stress chaperone Hsp90. These Hsp90 client proteins include tau (AD), α-synuclein (PD), huntingtin (HD), and the expanded glutamine androgen receptor (polyQ AR) (SBMA). In this Minireview, we update our model in which Hsp90 acts on protein-folding clefts and show how it forms a rational basis for developing drugs that promote the targeted elimination of these aberrant proteins. © 2014 by the Society for Experimental Biology and Medicine.

  15. SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.

    Science.gov (United States)

    Melvin, Iain; Ie, Eugene; Kuang, Rui; Weston, Jason; Stafford, William Noble; Leslie, Christina

    2007-05-22

    Predicting a protein's structural class from its amino acid sequence is a fundamental problem in computational biology. Much recent work has focused on developing new representations for protein sequences, called string kernels, for use with support vector machine (SVM) classifiers. However, while some of these approaches exhibit state-of-the-art performance at the binary protein classification problem, i.e. discriminating between a particular protein class and all other classes, few of these studies have addressed the real problem of multi-class superfamily or fold recognition. Moreover, there are only limited software tools and systems for SVM-based protein classification available to the bioinformatics community. We present a new multi-class SVM-based protein fold and superfamily recognition system and web server called SVM-Fold, which can be found at http://svm-fold.c2b2.columbia.edu. Our system uses an efficient implementation of a state-of-the-art string kernel for sequence profiles, called the profile kernel, where the underlying feature representation is a histogram of inexact matching k-mer frequencies. We also employ a novel machine learning approach to solve the difficult multi-class problem of classifying a sequence of amino acids into one of many known protein structural classes. Binary one-vs-the-rest SVM classifiers that are trained to recognize individual structural classes yield prediction scores that are not comparable, so that standard "one-vs-all" classification fails to perform well. Moreover, SVMs for classes at different levels of the protein structural hierarchy may make useful predictions, but one-vs-all does not try to combine these multiple predictions. To deal with these problems, our method learns relative weights between one-vs-the-rest classifiers and encodes information about the protein structural hierarchy for multi-class prediction. In large-scale benchmark results based on the SCOP database, our code weighting approach

  16. Aggregation of natively folded proteins: a theoretical approach

    International Nuclear Information System (INIS)

    Trovato, Antonio; Maritan, Amos; Seno, Flavio

    2007-01-01

    The reliable identification of β-aggregating stretches in protein sequences is essential for the development of therapeutic agents for Alzheimer's and Parkinson's diseases, as well as other pathological conditions associated with protein deposition. While the list of aggregation related diseases is growing, it has also been shown that many proteins that are normally well behaved can be induced to aggregate in vitro. This fact suggests the existence of a unified framework that could explain both folding and aggregation. By assuming this universal behaviour, we have recently introduced an algorithm (PASTA: prediction of amyloid structure aggregation), which is based on a sequence-specific energy function derived from the propensity of two residue types to be found paired in neighbouring strands within β-sheets in globular proteins. The algorithm is able to predict the most aggregation-prone portions of several proteins initially unfolded, in excellent agreement with experimental results. Here, we apply the method to a set of proteins which are known to aggregate, but which are natively folded. The quality of the prediction is again very high, corroborating the hypothesis that the amyloid structure is stabilized by the same physico-chemical determinants as those operating in folded proteins

  17. Transiently disordered tails accelerate folding of globular proteins.

    Science.gov (United States)

    Mallik, Saurav; Ray, Tanaya; Kundu, Sudip

    2017-07-01

    Numerous biological proteins exhibit intrinsic disorder at their termini, which are associated with multifarious functional roles. Here, we show the surprising result that an increased percentage of terminal short transiently disordered regions with enhanced flexibility (TstDREF) is associated with accelerated folding rates of globular proteins. Evolutionary conservation of predicted disorder at TstDREFs and drastic alteration of folding rates upon point-mutations suggest critical regulatory role(s) of TstDREFs in shaping the folding kinetics. TstDREFs are associated with long-range intramolecular interactions and the percentage of native secondary structural elements physically contacted by TstDREFs exhibit another surprising positive correlation with folding kinetics. These results allow us to infer probable molecular mechanisms behind the TstDREF-mediated regulation of folding kinetics that challenge protein biochemists to assess by direct experimental testing. © 2017 Federation of European Biochemical Societies.

  18. Examining a Thermodynamic Order Parameter of Protein Folding.

    Science.gov (United States)

    Chong, Song-Ho; Ham, Sihyun

    2018-05-08

    Dimensionality reduction with a suitable choice of order parameters or reaction coordinates is commonly used for analyzing high-dimensional time-series data generated by atomistic biomolecular simulations. So far, geometric order parameters, such as the root mean square deviation, fraction of native amino acid contacts, and collective coordinates that best characterize rare or large conformational transitions, have been prevailing in protein folding studies. Here, we show that the solvent-averaged effective energy, which is a thermodynamic quantity but unambiguously defined for individual protein conformations, serves as a good order parameter of protein folding. This is illustrated through the application to the folding-unfolding simulation trajectory of villin headpiece subdomain. We rationalize the suitability of the effective energy as an order parameter by the funneledness of the underlying protein free energy landscape. We also demonstrate that an improved conformational space discretization is achieved by incorporating the effective energy. The most distinctive feature of this thermodynamic order parameter is that it works in pointing to near-native folded structures even when the knowledge of the native structure is lacking, and the use of the effective energy will also find applications in combination with methods of protein structure prediction.

  19. Protein fold recognition using geometric kernel data fusion.

    Science.gov (United States)

    Zakeri, Pooya; Jeuris, Ben; Vandebril, Raf; Moreau, Yves

    2014-07-01

    Various approaches based on features extracted from protein sequences and often machine learning methods have been used in the prediction of protein folds. Finding an efficient technique for integrating these different protein features has received increasing attention. In particular, kernel methods are an interesting class of techniques for integrating heterogeneous data. Various methods have been proposed to fuse multiple kernels. Most techniques for multiple kernel learning focus on learning a convex linear combination of base kernels. In addition to the limitation of linear combinations, working with such approaches could cause a loss of potentially useful information. We design several techniques to combine kernel matrices by taking more involved, geometry inspired means of these matrices instead of convex linear combinations. We consider various sequence-based protein features including information extracted directly from position-specific scoring matrices and local sequence alignment. We evaluate our methods for classification on the SCOP PDB-40D benchmark dataset for protein fold recognition. The best overall accuracy on the protein fold recognition test set obtained by our methods is ∼ 86.7%. This is an improvement over the results of the best existing approach. Moreover, our computational model has been developed by incorporating the functional domain composition of proteins through a hybridization model. It is observed that by using our proposed hybridization model, the protein fold recognition accuracy is further improved to 89.30%. Furthermore, we investigate the performance of our approach on the protein remote homology detection problem by fusing multiple string kernels. The MATLAB code used for our proposed geometric kernel fusion frameworks are publicly available at http://people.cs.kuleuven.be/∼raf.vandebril/homepage/software/geomean.php?menu=5/. © The Author 2014. Published by Oxford University Press.

  20. A simple quantitative model of macromolecular crowding effects on protein folding: Application to the murine prion protein(121-231)

    Science.gov (United States)

    Bergasa-Caceres, Fernando; Rabitz, Herschel A.

    2013-06-01

    A model of protein folding kinetics is applied to study the effects of macromolecular crowding on protein folding rate and stability. Macromolecular crowding is found to promote a decrease of the entropic cost of folding of proteins that produces an increase of both the stability and the folding rate. The acceleration of the folding rate due to macromolecular crowding is shown to be a topology-dependent effect. The model is applied to the folding dynamics of the murine prion protein (121-231). The differential effect of macromolecular crowding as a function of protein topology suffices to make non-native configurations relatively more accessible.

  1. Protein folding simulations: from coarse-grained model to all-atom model.

    Science.gov (United States)

    Zhang, Jian; Li, Wenfei; Wang, Jun; Qin, Meng; Wu, Lei; Yan, Zhiqiang; Xu, Weixin; Zuo, Guanghong; Wang, Wei

    2009-06-01

    Protein folding is an important and challenging problem in molecular biology. During the last two decades, molecular dynamics (MD) simulation has proved to be a paramount tool and was widely used to study protein structures, folding kinetics and thermodynamics, and structure-stability-function relationship. It was also used to help engineering and designing new proteins, and to answer even more general questions such as the minimal number of amino acid or the evolution principle of protein families. Nowadays, the MD simulation is still undergoing rapid developments. The first trend is to toward developing new coarse-grained models and studying larger and more complex molecular systems such as protein-protein complex and their assembling process, amyloid related aggregations, and structure and motion of chaperons, motors, channels and virus capsides; the second trend is toward building high resolution models and explore more detailed and accurate pictures of protein folding and the associated processes, such as the coordination bond or disulfide bond involved folding, the polarization, charge transfer and protonate/deprotonate process involved in metal coupled folding, and the ion permeation and its coupling with the kinetics of channels. On these new territories, MD simulations have given many promising results and will continue to offer exciting views. Here, we review several new subjects investigated by using MD simulations as well as the corresponding developments of appropriate protein models. These include but are not limited to the attempt to go beyond the topology based Gō-like model and characterize the energetic factors in protein structures and dynamics, the study of the thermodynamics and kinetics of disulfide bond involved protein folding, the modeling of the interactions between chaperonin and the encapsulated protein and the protein folding under this circumstance, the effort to clarify the important yet still elusive folding mechanism of protein BBL

  2. Cotranslational protein folding reveals the selective use of ...

    Indian Academy of Sciences (India)

    to fold properly by decelerating the translation rate at these sites. Thus the cotranslational protein folding is believed to be true for many proteins and is an important selection factor for the selective codon usage to optimize proper gene expres- sion and function (Komar 2009). A web server CS and S has been created by ...

  3. Water dynamics clue to key residues in protein folding

    International Nuclear Information System (INIS)

    Gao, Meng; Zhu, Huaiqiu; Yao, Xin-Qiu; She, Zhen-Su

    2010-01-01

    A computational method independent of experimental protein structure information is proposed to recognize key residues in protein folding, from the study of hydration water dynamics. Based on all-atom molecular dynamics simulation, two key residues are recognized with distinct water dynamical behavior in a folding process of the Trp-cage protein. The identified key residues are shown to play an essential role in both 3D structure and hydrophobic-induced collapse. With observations on hydration water dynamics around key residues, a dynamical pathway of folding can be interpreted.

  4. Protein folding: Over half a century lasting quest. Comment on "There and back again: Two views on the protein folding puzzle" by Alexei V. Finkelstein et al.

    Science.gov (United States)

    Krokhotin, Andrey; Dokholyan, Nikolay V.

    2017-07-01

    Most proteins fold into unique three-dimensional (3D) structures that determine their biological functions, such as catalytic activity or macromolecular binding. Misfolded proteins can pose a threat through aberrant interactions with other proteins leading to a number of diseases including Alzheimer's disease, Parkinson's disease, and amyotrophic lateral sclerosis [1,2]. What does determine 3D structure of proteins? The first clue to this question came more than fifty years ago when Anfinsen demonstrated that unfolded proteins can spontaneously fold to their native 3D structures [3,4]. Anfinsen's experiments lead to the conclusion that proteins fold to unique native structure corresponding to the stable and kinetically accessible free energy minimum, and protein native structure is solely determined by its amino acid sequence. The question of how exactly proteins find their free energy minimum proved to be a difficult problem. One of the puzzles, initially pointed out by Levinthal, was an inconsistency between observed protein folding times and theoretical estimates. A self-avoiding polymer model of a globular protein of 100-residues length on a cubic lattice can sample at least 1047 states. Based on the assumption that conformational sampling occurs at the highest vibrational mode of proteins (∼picoseconds), predicted folding time by searching among all the possible conformations leads to ∼1027 years (much larger than the age of the universe) [5]. In contrast, observed protein folding time range from microseconds to minutes. Due to tremendous theoretical progress in protein folding field that has been achieved in past decades, the source of this inconsistency is currently understood that is thoroughly described in the review by Finkelstein et al. [6].

  5. Protein Folding: Search for Basic Physical Models

    Directory of Open Access Journals (Sweden)

    Ivan Y. Torshin

    2003-01-01

    Full Text Available How a unique three-dimensional structure is rapidly formed from the linear sequence of a polypeptide is one of the important questions in contemporary science. Apart from biological context of in vivo protein folding (which has been studied only for a few proteins, the roles of the fundamental physical forces in the in vitro folding remain largely unstudied. Despite a degree of success in using descriptions based on statistical and/or thermodynamic approaches, few of the current models explicitly include more basic physical forces (such as electrostatics and Van Der Waals forces. Moreover, the present-day models rarely take into account that the protein folding is, essentially, a rapid process that produces a highly specific architecture. This review considers several physical models that may provide more direct links between sequence and tertiary structure in terms of the physical forces. In particular, elaboration of such simple models is likely to produce extremely effective computational techniques with value for modern genomics.

  6. When fast is better: protein folding fundamentals and mechanisms from ultrafast approaches.

    Science.gov (United States)

    Muñoz, Victor; Cerminara, Michele

    2016-09-01

    Protein folding research stalled for decades because conventional experiments indicated that proteins fold slowly and in single strokes, whereas theory predicted a complex interplay between dynamics and energetics resulting in myriad microscopic pathways. Ultrafast kinetic methods turned the field upside down by providing the means to probe fundamental aspects of folding, test theoretical predictions and benchmark simulations. Accordingly, experimentalists could measure the timescales for all relevant folding motions, determine the folding speed limit and confirm that folding barriers are entropic bottlenecks. Moreover, a catalogue of proteins that fold extremely fast (microseconds) could be identified. Such fast-folding proteins cross shallow free energy barriers or fold downhill, and thus unfold with minimal co-operativity (gradually). A new generation of thermodynamic methods has exploited this property to map folding landscapes, interaction networks and mechanisms at nearly atomic resolution. In parallel, modern molecular dynamics simulations have finally reached the timescales required to watch fast-folding proteins fold and unfold in silico All of these findings have buttressed the fundamentals of protein folding predicted by theory, and are now offering the first glimpses at the underlying mechanisms. Fast folding appears to also have functional implications as recent results connect downhill folding with intrinsically disordered proteins, their complex binding modes and ability to moonlight. These connections suggest that the coupling between downhill (un)folding and binding enables such protein domains to operate analogically as conformational rheostats. © 2016 The Author(s).

  7. BiP clustering facilitates protein folding in the endoplasmic reticulum.

    Directory of Open Access Journals (Sweden)

    Marc Griesemer

    2014-07-01

    Full Text Available The chaperone BiP participates in several regulatory processes within the endoplasmic reticulum (ER: translocation, protein folding, and ER-associated degradation. To facilitate protein folding, a cooperative mechanism known as entropic pulling has been proposed to demonstrate the molecular-level understanding of how multiple BiP molecules bind to nascent and unfolded proteins. Recently, experimental evidence revealed the spatial heterogeneity of BiP within the nuclear and peripheral ER of S. cerevisiae (commonly referred to as 'clusters'. Here, we developed a model to evaluate the potential advantages of accounting for multiple BiP molecules binding to peptides, while proposing that BiP's spatial heterogeneity may enhance protein folding and maturation. Scenarios were simulated to gauge the effectiveness of binding multiple chaperone molecules to peptides. Using two metrics: folding efficiency and chaperone cost, we determined that the single binding site model achieves a higher efficiency than models characterized by multiple binding sites, in the absence of cooperativity. Due to entropic pulling, however, multiple chaperones perform in concert to facilitate the resolubilization and ultimate yield of folded proteins. As a result of cooperativity, multiple binding site models used fewer BiP molecules and maintained a higher folding efficiency than the single binding site model. These insilico investigations reveal that clusters of BiP molecules bound to unfolded proteins may enhance folding efficiency through cooperative action via entropic pulling.

  8. There and back again: Two views on the protein folding puzzle.

    Science.gov (United States)

    Finkelstein, Alexei V; Badretdin, Azat J; Galzitskaya, Oxana V; Ivankov, Dmitry N; Bogatyreva, Natalya S; Garbuzynskiy, Sergiy O

    2017-07-01

    The ability of protein chains to spontaneously form their spatial structures is a long-standing puzzle in molecular biology. Experimentally measured folding times of single-domain globular proteins range from microseconds to hours: the difference (10-11 orders of magnitude) is the same as that between the life span of a mosquito and the age of the universe. This review describes physical theories of rates of overcoming the free-energy barrier separating the natively folded (N) and unfolded (U) states of protein chains in both directions: "U-to-N" and "N-to-U". In the theory of protein folding rates a special role is played by the point of thermodynamic (and kinetic) equilibrium between the native and unfolded state of the chain; here, the theory obtains the simplest form. Paradoxically, a theoretical estimate of the folding time is easier to get from consideration of protein unfolding (the "N-to-U" transition) rather than folding, because it is easier to outline a good unfolding pathway of any structure than a good folding pathway that leads to the stable fold, which is yet unknown to the folding protein chain. And since the rates of direct and reverse reactions are equal at the equilibrium point (as follows from the physical "detailed balance" principle), the estimated folding time can be derived from the estimated unfolding time. Theoretical analysis of the "N-to-U" transition outlines the range of protein folding rates in a good agreement with experiment. Theoretical analysis of folding (the "U-to-N" transition), performed at the level of formation and assembly of protein secondary structures, outlines the upper limit of protein folding times (i.e., of the time of search for the most stable fold). Both theories come to essentially the same results; this is not a surprise, because they describe overcoming one and the same free-energy barrier, although the way to the top of this barrier from the side of the unfolded state is very different from the way from the

  9. Quantification of Drive-Response Relationships Between Residues During Protein Folding.

    Science.gov (United States)

    Qi, Yifei; Im, Wonpil

    2013-08-13

    Mutual correlation and cooperativity are commonly used to describe residue-residue interactions in protein folding/function. However, these metrics do not provide any information on the causality relationships between residues. Such drive-response relationships are poorly studied in protein folding/function and difficult to measure experimentally due to technical limitations. In this study, using the information theory transfer entropy (TE) that provides a direct measurement of causality between two times series, we have quantified the drive-response relationships between residues in the folding/unfolding processes of four small proteins generated by molecular dynamics simulations. Instead of using a time-averaged single TE value, the time-dependent TE is measured with the Q-scores based on residue-residue contacts and with the statistical significance analysis along the folding/unfolding processes. The TE analysis is able to identify the driving and responding residues that are different from the highly correlated residues revealed by the mutual information analysis. In general, the driving residues have more regular secondary structures, are more buried, and show greater effects on the protein stability as well as folding and unfolding rates. In addition, the dominant driving and responding residues from the TE analysis on the whole trajectory agree with those on a single folding event, demonstrating that the drive-response relationships are preserved in the non-equilibrium process. Our study provides detailed insights into the protein folding process and has potential applications in protein engineering and interpretation of time-dependent residue-based experimental observables for protein function.

  10. Interferences of Silica Nanoparticles in Green Fluorescent Protein Folding Processes.

    Science.gov (United States)

    Klein, Géraldine; Devineau, Stéphanie; Aude, Jean Christophe; Boulard, Yves; Pasquier, Hélène; Labarre, Jean; Pin, Serge; Renault, Jean Philippe

    2016-01-12

    We investigated the relationship between unfolded proteins, silica nanoparticles and chaperonin to determine whether unfolded proteins could stick to silica surfaces and how this process could impair heat shock protein activity. The HSP60 catalyzed green fluorescent protein (GFP) folding was used as a model system. The adsorption isotherms and adsorption kinetics of denatured GFP were measured, showing that denaturation increases GFP affinity for silica surfaces. This affinity is maintained even if the surfaces are covered by a protein corona and allows silica NPs to interfere directly with GFP folding by trapping it in its unstructured state. We determined also the adsorption isotherms of HSP60 and its chaperonin activity once adsorbed, showing that SiO2 NP can interfere also indirectly with protein folding through chaperonin trapping and inhibition. This inhibition is specifically efficient when NPs are covered first with a layer of unfolded proteins. These results highlight for the first time the antichaperonin activity of silica NPs and ask new questions about the toxicity of such misfolded proteins/nanoparticles assembly toward cells.

  11. Mechanical Modeling and Computer Simulation of Protein Folding

    Science.gov (United States)

    Prigozhin, Maxim B.; Scott, Gregory E.; Denos, Sharlene

    2014-01-01

    In this activity, science education and modern technology are bridged to teach students at the high school and undergraduate levels about protein folding and to strengthen their model building skills. Students are guided from a textbook picture of a protein as a rigid crystal structure to a more realistic view: proteins are highly dynamic…

  12. Periodic and stochastic thermal modulation of protein folding kinetics.

    Science.gov (United States)

    Platkov, Max; Gruebele, Martin

    2014-07-21

    Chemical reactions are usually observed either by relaxation of a bulk sample after applying a sudden external perturbation, or by intrinsic fluctuations of a few molecules. Here we show that the two ideas can be combined to measure protein folding kinetics, either by periodic thermal modulation, or by creating artificial thermal noise that greatly exceeds natural thermal fluctuations. We study the folding reaction of the enzyme phosphoglycerate kinase driven by periodic temperature waveforms. As the temperature waveform unfolds and refolds the protein, its fluorescence color changes due to FRET (Förster resonant Energy Transfer) of two donor/acceptor fluorophores labeling the protein. We adapt a simple model of periodically driven kinetics that nicely fits the data at all temperatures and driving frequencies: The phase shifts of the periodic donor and acceptor fluorescence signals as a function of driving frequency reveal reaction rates. We also drive the reaction with stochastic temperature waveforms that produce thermal fluctuations much greater than natural fluctuations in the bulk. Such artificial thermal noise allows the recovery of weak underlying signals due to protein folding kinetics. This opens up the possibility for future detection of a stochastic resonance for protein folding subject to noise with controllable amplitude.

  13. Conformational dynamics of a protein in the folded and the unfolded state

    Energy Technology Data Exchange (ETDEWEB)

    Fitter, Joerg

    2003-08-01

    In a quasielastic neutron scattering experiment, the picosecond dynamics of {alpha}-amylase was investigated for the folded and the unfolded state of the protein. In order to ensure a reasonable interpretation of the internal protein dynamics, the protein was measured in D{sub 2}O-buffer solution. The much higher structural flexibility of the pH induced unfolded state as compared to the native folded state was quantified using a simple analytical model, describing a local diffusion inside a sphere. In terms of this model the conformational volume, which is explored mainly by confined protein side-chain movements, is parameterized by the radius of a sphere (folded state, r=1.2 A; unfolded state, 1.8 A). Differences in conformational dynamics between the folded and the unfolded state of a protein are of fundamental interest in the field of protein science, because they are assumed to play an important role for the thermodynamics of folding/unfolding transition and for protein stability.

  14. Detecting protein folding by thermal fluctuations of microcantilevers.

    Directory of Open Access Journals (Sweden)

    Romina Muñoz

    Full Text Available The accurate characterization of proteins in both their native and denatured states is essential to effectively understand protein function, folding and stability. As a proof of concept, a micro rheological method is applied, based on the characterization of thermal fluctuations of a micro cantilever immersed in a bovine serum albumin solution, to assess changes in the viscosity associated with modifications in the protein's structure under the denaturant effect of urea. Through modeling the power spectrum density of the cantilever's fluctuations over a broad frequency band, it is possible to implement a fitting procedure to accurately determine the viscosity of the fluid, even at low volumes. Increases in viscosity during the denaturant process are identified using the assumption that the protein is a hard sphere, with a hydrodynamic radius that increases during unfolding. This is modeled accordingly through the Einstein-Batchelor formula. The Einstein-Batchelor formula estimates are verified through dynamic light scattering, which measures the hydrodynamic radius of proteins. Thus, this methodology is proven to be suitable for the study of protein folding in samples of small size at vanishing shear stresses.

  15. Probabilistic analysis for identifying the driving force of protein folding

    Science.gov (United States)

    Tokunaga, Yoshihiko; Yamamori, Yu; Matubayasi, Nobuyuki

    2018-03-01

    Toward identifying the driving force of protein folding, energetics was analyzed in water for Trp-cage (20 residues), protein G (56 residues), and ubiquitin (76 residues) at their native (folded) and heat-denatured (unfolded) states. All-atom molecular dynamics simulation was conducted, and the hydration effect was quantified by the solvation free energy. The free-energy calculation was done by employing the solution theory in the energy representation, and it was seen that the sum of the protein intramolecular (structural) energy and the solvation free energy is more favorable for a folded structure than for an unfolded one generated by heat. Probabilistic arguments were then developed to determine which of the electrostatic, van der Waals, and excluded-volume components of the interactions in the protein-water system governs the relative stabilities between the folded and unfolded structures. It was found that the electrostatic interaction does not correspond to the preference order of the two structures. The van der Waals and excluded-volume components were shown, on the other hand, to provide the right order of preference at probabilities of almost unity, and it is argued that a useful modeling of protein folding is possible on the basis of the excluded-volume effect.

  16. Transient intermediates are populated in the folding pathways of single-domain two-state folding protein L

    Science.gov (United States)

    Maity, Hiranmay; Reddy, Govardhan

    2018-04-01

    Small single-domain globular proteins, which are believed to be dominantly two-state folders, played an important role in elucidating various aspects of the protein folding mechanism. However, recent single molecule fluorescence resonance energy transfer experiments [H. Y. Aviram et al. J. Chem. Phys. 148, 123303 (2018)] on a single-domain two-state folding protein L showed evidence for the population of an intermediate state and it was suggested that in this state, a β-hairpin present near the C-terminal of the native protein state is unfolded. We performed molecular dynamics simulations using a coarse-grained self-organized-polymer model with side chains to study the folding pathways of protein L. In agreement with the experiments, an intermediate is populated in the simulation folding pathways where the C-terminal β-hairpin detaches from the rest of the protein structure. The lifetime of this intermediate structure increased with the decrease in temperature. In low temperature conditions, we also observed a second intermediate state, which is globular with a significant fraction of the native-like tertiary contacts satisfying the features of a dry molten globule.

  17. Transferable coarse-grained potential for de novo protein folding and design.

    Directory of Open Access Journals (Sweden)

    Ivan Coluzza

    Full Text Available Protein folding and design are major biophysical problems, the solution of which would lead to important applications especially in medicine. Here we provide evidence of how a novel parametrization of the Caterpillar model may be used for both quantitative protein design and folding. With computer simulations it is shown that, for a large set of real protein structures, the model produces designed sequences with similar physical properties to the corresponding natural occurring sequences. The designed sequences require further experimental testing. For an independent set of proteins, previously used as benchmark, the correct folded structure of both the designed and the natural sequences is also demonstrated. The equilibrium folding properties are characterized by free energy calculations. The resulting free energy profiles not only are consistent among natural and designed proteins, but also show a remarkable precision when the folded structures are compared to the experimentally determined ones. Ultimately, the updated Caterpillar model is unique in the combination of its fundamental three features: its simplicity, its ability to produce natural foldable designed sequences, and its structure prediction precision. It is also remarkable that low frustration sequences can be obtained with such a simple and universal design procedure, and that the folding of natural proteins shows funnelled free energy landscapes without the need of any potentials based on the native structure.

  18. Thermodynamic properties of an extremely rapid protein folding reaction.

    Science.gov (United States)

    Schindler, T; Schmid, F X

    1996-12-24

    The cold-shock protein CspB from Bacillus subtilis is a very small beta-barrel protein, which folds with a time constant of 1 ms (at 25 degrees C) in a U reversible N two-state reaction. To elucidate the energetics of this extremely fast reaction we investigated the folding kinetics of CspB as a function of both temperature and denaturant concentration between 2 and 45 degrees C and between 1 and 8 M urea. Under all these conditions unfolding and refolding were reversible monoexponential reactions. By using transition state theory, data from 327 kinetic curves were jointly analyzed to determine the thermodynamic activation parameters delta H H2O++, delta S H2O++, delta G H2O++, and delta C p H2O++ for unfolding and refolding and their dependences on the urea concentration. 90% of the total change in heat capacity and 96% of the change in the m value (m = d delta G/d[urea]) occur between the unfolded state and the activated state. This suggests that for CspB the activated state of folding is unusually well structured and almost equivalent to the native protein in its interactions with the solvent. As a consequence of this native-like activated state a strong temperature-dependent enthalpy/entropy compensation is observed for the refolding kinetics, and the barrier to refolding shifts from being largely enthalpic at low temperature to largely entropic at high temperature. This shift originates not from the changes in the folding protein chains itself, but from the changes in the protein-solvent interactions. We speculate that the absence of intermediates and the native-like activated state in the folding of CspB are correlated with the small size and the structural type of this protein. The stabilization of a small beta-sheet as in CspB requires extensive non-local interactions, and therefore incomplete sheets are unstable. As a consequence, the critical activated state is reached only very late in folding. The instability of partially folded structure is a means to

  19. Can Natural Proteins Designed with ‘Inverted’ Peptide Sequences Adopt Native-Like Protein Folds?

    Science.gov (United States)

    Sridhar, Settu; Guruprasad, Kunchur

    2014-01-01

    We have carried out a systematic computational analysis on a representative dataset of proteins of known three-dimensional structure, in order to evaluate whether it would possible to ‘swap’ certain short peptide sequences in naturally occurring proteins with their corresponding ‘inverted’ peptides and generate ‘artificial’ proteins that are predicted to retain native-like protein fold. The analysis of 3,967 representative proteins from the Protein Data Bank revealed 102,677 unique identical inverted peptide sequence pairs that vary in sequence length between 5–12 and 18 amino acid residues. Our analysis illustrates with examples that such ‘artificial’ proteins may be generated by identifying peptides with ‘similar structural environment’ and by using comparative protein modeling and validation studies. Our analysis suggests that natural proteins may be tolerant to accommodating such peptides. PMID:25210740

  20. Multiple molecule effects on the cooperativity of protein folding transitions in simulations

    Science.gov (United States)

    Lewis, Jacob I.; Moss, Devin J.; Knotts, Thomas A.

    2012-06-01

    Though molecular simulation of proteins has made notable contributions to the study of protein folding and kinetics, disagreement between simulation and experiment still exists. One of the criticisms levied against simulation is its failure to reproduce cooperative protein folding transitions. This weakness has been attributed to many factors such as a lack of polarizability and adequate capturing of solvent effects. This work, however, investigates how increasing the number of proteins simulated simultaneously can affect the cooperativity of folding transitions — a topic that has received little attention previously. Two proteins are studied in this work: phage T4 lysozyme (Protein Data Bank (PDB) ID: 7LZM) and phage 434 repressor (PDB ID: 1R69). The results show that increasing the number of proteins molecules simulated simultaneously leads to an increase in the macroscopic cooperativity for transitions that are inherently cooperative on the molecular level but has little effect on the cooperativity of other transitions. Taken as a whole, the results identify one area of consideration to improving simulations of protein folding.

  1. Folding of multidomain proteins: biophysical consequences of tethering even in apparently independent folding.

    Science.gov (United States)

    Arviv, Oshrit; Levy, Yaakov

    2012-12-01

    Most eukaryotic and a substantial fraction of prokaryotic proteins are composed of more than one domain. The tethering of these evolutionary, structural, and functional units raises, among others, questions regarding the folding process of conjugated domains. Studying the folding of multidomain proteins in silico enables one to identify and isolate the tethering-induced biophysical determinants that govern crosstalks generated between neighboring domains. For this purpose, we carried out coarse-grained and atomistic molecular dynamics simulations of two two-domain constructs from the immunoglobulin-like β-sandwich fold. Each of these was experimentally shown to behave as the "sum of its parts," that is, the thermodynamic and kinetic folding behavior of the constituent domains of these constructs seems to occur independently, with the folding of each domain uncoupled from the folding of its partner in the two-domain construct. We show that the properties of the individual domains can be significantly affected by conjugation to another domain. The tethering may be accompanied by stabilizing as well as destabilizing factors whose magnitude depends on the size of the interface, the length, and the flexibility of the linker, and the relative stability of the domains. Accordingly, the folding of a multidomain protein should not be viewed as the sum of the folding patterns of each of its parts, but rather, it involves abrogating several effects that lead to this outcome. An imbalance between these effects may result in either stabilization or destabilization owing to the tethering. Copyright © 2012 Wiley Periodicals, Inc.

  2. Modulation of the maladaptive stress response to manage diseases of protein folding.

    Directory of Open Access Journals (Sweden)

    Daniela Martino Roth

    2014-11-01

    Full Text Available Diseases of protein folding arise because of the inability of an altered peptide sequence to properly engage protein homeostasis components that direct protein folding and function. To identify global principles of misfolding disease pathology we examined the impact of the local folding environment in alpha-1-antitrypsin deficiency (AATD, Niemann-Pick type C1 disease (NPC1, Alzheimer's disease (AD, and cystic fibrosis (CF. Using distinct models, including patient-derived cell lines and primary epithelium, mouse brain tissue, and Caenorhabditis elegans, we found that chronic expression of misfolded proteins not only triggers the sustained activation of the heat shock response (HSR pathway, but that this sustained activation is maladaptive. In diseased cells, maladaptation alters protein structure-function relationships, impacts protein folding in the cytosol, and further exacerbates the disease state. We show that down-regulation of this maladaptive stress response (MSR, through silencing of HSF1, the master regulator of the HSR, restores cellular protein folding and improves the disease phenotype. We propose that restoration of a more physiological proteostatic environment will strongly impact the management and progression of loss-of-function and gain-of-toxic-function phenotypes common in human disease.

  3. Melody discrimination and protein fold classification

    Directory of Open Access Journals (Sweden)

    Robert P. Bywater

    2016-10-01

    Full Text Available One of the greatest challenges in theoretical biophysics and bioinformatics is the identification of protein folds from sequence data. This can be regarded as a pattern recognition problem. In this paper we report the use of a melody generation software where the inputs are derived from calculations of evolutionary information, secondary structure, flexibility, hydropathy and solvent accessibility from multiple sequence alignment data. The melodies so generated are derived from the sequence, and by inference, of the fold, in ways that give each fold a sound representation that may facilitate analysis, recognition, or comparison with other sequences.

  4. A Particle Swarm Optimization-Based Approach with Local Search for Predicting Protein Folding.

    Science.gov (United States)

    Yang, Cheng-Hong; Lin, Yu-Shiun; Chuang, Li-Yeh; Chang, Hsueh-Wei

    2017-10-01

    The hydrophobic-polar (HP) model is commonly used for predicting protein folding structures and hydrophobic interactions. This study developed a particle swarm optimization (PSO)-based algorithm combined with local search algorithms; specifically, the high exploration PSO (HEPSO) algorithm (which can execute global search processes) was combined with three local search algorithms (hill-climbing algorithm, greedy algorithm, and Tabu table), yielding the proposed HE-L-PSO algorithm. By using 20 known protein structures, we evaluated the performance of the HE-L-PSO algorithm in predicting protein folding in the HP model. The proposed HE-L-PSO algorithm exhibited favorable performance in predicting both short and long amino acid sequences with high reproducibility and stability, compared with seven reported algorithms. The HE-L-PSO algorithm yielded optimal solutions for all predicted protein folding structures. All HE-L-PSO-predicted protein folding structures possessed a hydrophobic core that is similar to normal protein folding.

  5. Analysis of the free-energy surface of proteins from reversible folding simulations.

    Directory of Open Access Journals (Sweden)

    Lucy R Allen

    2009-07-01

    Full Text Available Computer generated trajectories can, in principle, reveal the folding pathways of a protein at atomic resolution and possibly suggest general and simple rules for predicting the folded structure of a given sequence. While such reversible folding trajectories can only be determined ab initio using all-atom transferable force-fields for a few small proteins, they can be determined for a large number of proteins using coarse-grained and structure-based force-fields, in which a known folded structure is by construction the absolute energy and free-energy minimum. Here we use a model of the fast folding helical lambda-repressor protein to generate trajectories in which native and non-native states are in equilibrium and transitions are accurately sampled. Yet, representation of the free-energy surface, which underlies the thermodynamic and dynamic properties of the protein model, from such a trajectory remains a challenge. Projections over one or a small number of arbitrarily chosen progress variables often hide the most important features of such surfaces. The results unequivocally show that an unprojected representation of the free-energy surface provides important and unbiased information and allows a simple and meaningful description of many-dimensional, heterogeneous trajectories, providing new insight into the possible mechanisms of fast-folding proteins.

  6. General Protein Data Bank-Based Collective Variables for Protein Folding.

    Science.gov (United States)

    Ardevol, Albert; Palazzesi, Ferruccio; Tribello, Gareth A; Parrinello, Michele

    2016-01-12

    New, automated forms of data analysis are required to understand the high-dimensional trajectories that are obtained from molecular dynamics simulations on proteins. Dimensionality reduction algorithms are particularly appealing in this regard as they allow one to construct unbiased, low-dimensional representations of the trajectory using only the information encoded in the trajectory. The downside of this approach is that a different set of coordinates are required for each different chemical system under study precisely because the coordinates are constructed using information from the trajectory. In this paper, we show how one can resolve this problem by using the sketch-map algorithm that we recently proposed to construct a low-dimensional representation of the structures contained in the protein data bank. We show that the resulting coordinates are as useful for analyzing trajectory data as coordinates constructed using landmark configurations taken from the trajectory and that these coordinates can thus be used for understanding protein folding across a range of systems.

  7. Variation in the Subcellular Localization and Protein Folding Activity among Arabidopsis thaliana Homologs of Protein Disulfide Isomerase

    Directory of Open Access Journals (Sweden)

    Christen Y. L. Yuen

    2013-10-01

    Full Text Available Protein disulfide isomerases (PDIs catalyze the formation, breakage, and rearrangement of disulfide bonds to properly fold nascent polypeptides within the endoplasmic reticulum (ER. Classical animal and yeast PDIs possess two catalytic thioredoxin-like domains (a, a′ and two non-catalytic domains (b, b′, in the order a-b-b′-a′. The model plant, Arabidopsis thaliana, encodes 12 PDI-like proteins, six of which possess the classical PDI domain arrangement (AtPDI1 through AtPDI6. Three additional AtPDIs (AtPDI9, AtPDI10, AtPDI11 possess two thioredoxin domains, but without intervening b-b′ domains. C-terminal green fluorescent protein (GFP fusions to each of the nine dual-thioredoxin PDI homologs localized predominantly to the ER lumen when transiently expressed in protoplasts. Additionally, expression of AtPDI9:GFP-KDEL and AtPDI10: GFP-KDDL was associated with the formation of ER bodies. AtPDI9, AtPDI10, and AtPDI11 mediated the oxidative folding of alkaline phosphatase when heterologously expressed in the Escherichia coli protein folding mutant, dsbA−. However, only three classical AtPDIs (AtPDI2, AtPDI5, AtPDI6 functionally complemented dsbA−. Interestingly, chemical inducers of the ER unfolded protein response were previously shown to upregulate most of the AtPDIs that complemented dsbA−. The results indicate that Arabidopsis PDIs differ in their localization and protein folding activities to fulfill distinct molecular functions in the ER.

  8. Degradation of extracytoplasmic catalysts for protein folding in Bacillus subtilis

    NARCIS (Netherlands)

    Krishnappa, Laxmi; Monteferrante, Carmine G; Neef, Jolanda; Dreisbach, Annette; van Dijl, Jan Maarten

    The general protein secretion pathway of Bacillus subtilis has a high capacity for protein export from the cytoplasm, which is exploited in the biotechnological production of a wide range of enzymes. These exported proteins pass the membrane in an unfolded state, and accordingly, they have to fold

  9. An update of the DEF database of protein fold class predictions

    DEFF Research Database (Denmark)

    Reczko, Martin; Karras, Dimitris; Bohr, Henrik

    1997-01-01

    An update is given on the Database of Expected Fold classes (DEF) that contains a collection of fold-class predictions made from protein sequences and a mail server that provides new predictions for new sequences. To any given sequence one of 49 fold-classes is chosen to classify the structure re...... related to the sequence with high accuracy. The updated predictions system is developed using data from the new version of the 3D-ALI database of aligned protein structures and thus is giving more reliable and more detailed predictions than the previous DEF system.......An update is given on the Database of Expected Fold classes (DEF) that contains a collection of fold-class predictions made from protein sequences and a mail server that provides new predictions for new sequences. To any given sequence one of 49 fold-classes is chosen to classify the structure...

  10. Peptide folding in the presence of interacting protein crowders

    Energy Technology Data Exchange (ETDEWEB)

    Bille, Anna, E-mail: anna.bille@thep.lu.se; Irbäck, Anders, E-mail: anders@thep.lu.se [Computational Biology and Biological Physics, Department of Astronomy and Theoretical Physics, Lund University, Sölvegatan 14A, SE-223 62 Lund (Sweden); Mohanty, Sandipan, E-mail: s.mohanty@fz-juelich.de [Jülich Supercomputing Centre, Institute for Advanced Simulation, Forschungszentrum Jülich, D-52425 Jülich (Germany)

    2016-05-07

    Using Monte Carlo methods, we explore and compare the effects of two protein crowders, BPTI and GB1, on the folding thermodynamics of two peptides, the compact helical trp-cage and the β-hairpin-forming GB1m3. The thermally highly stable crowder proteins are modeled using a fixed backbone and rotatable side-chains, whereas the peptides are free to fold and unfold. In the simulations, the crowder proteins tend to distort the trp-cage fold, while having a stabilizing effect on GB1m3. The extent of the effects on a given peptide depends on the crowder type. Due to a sticky patch on its surface, BPTI causes larger changes than GB1 in the melting properties of the peptides. The observed effects on the peptides stem largely from attractive and specific interactions with the crowder surfaces, and differ from those seen in reference simulations with purely steric crowder particles.

  11. Why and how does native topology dictate the folding speed of a protein?

    Science.gov (United States)

    Rustad, Mark; Ghosh, Kingshuk

    2012-11-01

    Since the pioneering work of Plaxco, Simons, and Baker, it is now well known that the rates of protein folding strongly correlate with the average sequence separation (absolute contact order (ACO)) of native contacts. In spite of multitude of papers, our understanding to the basis of the relation between folding speed and ACO is still lacking. We model the transition state as a Gaussian polymer chain decorated with weak springs between native contacts while the unfolded state is modeled as a Gaussian chain only. Using these hamiltonians, our perturbative calculation explicitly shows folding speed and ACO are linearly related when only the first order term in the series is considered. However, to the second order, we notice the existence of two new topological metrics, termed COC1 and COC2 (COC stands for contact order correction). These additional correction terms are needed to properly account for the entropy loss due to overlapping (nested or linked) loops that are not well described by simple addition of entropies in ACO. COC1 and COC2 are related to fluctuations and correlations among different sequence separations. The new metric combining ACO, COC1, and COC2 improves folding speed dependence on native topology when applied to three different databases: (i) two-state proteins with only α/β and β proteins, (ii) two-state proteins (α/β, β and purely helical proteins all combined), and (iii) master set (multi-state and two-state) folding proteins. Furthermore, the first principle calculation provides us direct physical insights to the meaning of the fit parameters. The coefficient of ACO, for example, is related to the average strength of the contacts, while the constant term is related to the protein folding speed limit. With the new scaling law, our estimate of the folding speed limit is in close agreement with the widely accepted value of 1 μs observed in proteins and RNA. Analyzing an exhaustive set (7367) of monomeric proteins from protein data bank

  12. The nature of folded states of globular proteins.

    Science.gov (United States)

    Honeycutt, J D; Thirumalai, D

    1992-06-01

    We suggest, using dynamical simulations of a simple heteropolymer modelling the alpha-carbon sequence in a protein, that generically the folded states of globular proteins correspond to statistically well-defined metastable states. This hypothesis, called the metastability hypothesis, states that there are several free energy minima separated by barriers of various heights such that the folded conformations of a polypeptide chain in each of the minima have similar structural characteristics but have different energies from one another. The calculated structural characteristics, such as bond angle and dihedral angle distribution functions, are assumed to arise from only those configurations belonging to a given minimum. The validity of this hypothesis is illustrated by simulations of a continuum model of a heteropolymer whose low temperature state is a well-defined beta-barrel structure. The simulations were done using a molecular dynamics algorithm (referred to as the "noisy" molecular dynamics method) containing both friction and noise terms. It is shown that for this model there are several distinct metastable minima in which the structural features are similar. Several new methods of analyzing fluctuations in structures belonging to two distinct minima are introduced. The most notable one is a dynamic measure of compactness that can in principle provide the time required for maximal compactness to be achieved. The analysis shows that for a given metastable state in which the protein has a well-defined folded structure the transition to a state of higher compactness occurs very slowly, lending credence to the notion that the system encounters a late barrier in the process of folding to the most compact structure. The examination of the fluctuations in the structures near the unfolding----folding transition temperature indicates that the transition state for the unfolding to folding process occurs closer to the folded state.

  13. The Folding of de Novo Designed Protein DS119 via Molecular Dynamics Simulations

    Directory of Open Access Journals (Sweden)

    Moye Wang

    2016-04-01

    Full Text Available As they are not subjected to natural selection process, de novo designed proteins usually fold in a manner different from natural proteins. Recently, a de novo designed mini-protein DS119, with a βαβ motif and 36 amino acids, has folded unusually slowly in experiments, and transient dimers have been detected in the folding process. Here, by means of all-atom replica exchange molecular dynamics (REMD simulations, several comparably stable intermediate states were observed on the folding free-energy landscape of DS119. Conventional molecular dynamics (CMD simulations showed that when two unfolded DS119 proteins bound together, most binding sites of dimeric aggregates were located at the N-terminal segment, especially residues 5–10, which were supposed to form β-sheet with its own C-terminal segment. Furthermore, a large percentage of individual proteins in the dimeric aggregates adopted conformations similar to those in the intermediate states observed in REMD simulations. These results indicate that, during the folding process, DS119 can easily become trapped in intermediate states. Then, with diffusion, a transient dimer would be formed and stabilized with the binding interface located at N-terminals. This means that it could not quickly fold to the native structure. The complicated folding manner of DS119 implies the important influence of natural selection on protein-folding kinetics, and more improvement should be achieved in rational protein design.

  14. CASP10-BCL::Fold efficiently samples topologies of large proteins.

    Science.gov (United States)

    Heinze, Sten; Putnam, Daniel K; Fischer, Axel W; Kohlmann, Tim; Weiner, Brian E; Meiler, Jens

    2015-03-01

    During CASP10 in summer 2012, we tested BCL::Fold for prediction of free modeling (FM) and template-based modeling (TBM) targets. BCL::Fold assembles the tertiary structure of a protein from predicted secondary structure elements (SSEs) omitting more flexible loop regions early on. This approach enables the sampling of conformational space for larger proteins with more complex topologies. In preparation of CASP11, we analyzed the quality of CASP10 models throughout the prediction pipeline to understand BCL::Fold's ability to sample the native topology, identify native-like models by scoring and/or clustering approaches, and our ability to add loop regions and side chains to initial SSE-only models. The standout observation is that BCL::Fold sampled topologies with a GDT_TS score > 33% for 12 of 18 and with a topology score > 0.8 for 11 of 18 test cases de novo. Despite the sampling success of BCL::Fold, significant challenges still exist in clustering and loop generation stages of the pipeline. The clustering approach employed for model selection often failed to identify the most native-like assembly of SSEs for further refinement and submission. It was also observed that for some β-strand proteins model refinement failed as β-strands were not properly aligned to form hydrogen bonds removing otherwise accurate models from the pool. Further, BCL::Fold samples frequently non-natural topologies that require loop regions to pass through the center of the protein. © 2015 Wiley Periodicals, Inc.

  15. An overlapping region between the two terminal folding units of the outer surface protein A (OspA) controls its folding behavior.

    Science.gov (United States)

    Makabe, Koki; Nakamura, Takashi; Dhar, Debanjan; Ikura, Teikichi; Koide, Shohei; Kuwajima, Kunihiro

    2018-04-27

    Although many naturally occurring proteins consist of multiple domains, most studies on protein folding to date deal with single-domain proteins or isolated domains of multi-domain proteins. Studies of multi-domain protein folding are required for further advancing our understanding of protein folding mechanisms. Borrelia outer surface protein A (OspA) is a β-rich two-domain protein, in which two globular domains are connected by a rigid and stable single-layer β-sheet. Thus, OspA is particularly suited as a model system for studying the interplays of domains in protein folding. Here, we studied the equilibria and kinetics of the urea-induced folding-unfolding reactions of OspA probed with tryptophan fluorescence and ultraviolet circular dichroism. Global analysis of the experimental data revealed compelling lines of evidence for accumulation of an on-pathway intermediate during kinetic refolding and for the identity between the kinetic intermediate and a previously described equilibrium unfolding intermediate. The results suggest that the intermediate has the fully native structure in the N-terminal domain and the single layer β-sheet, with the C-terminal domain still unfolded. The observation of the productive on-pathway folding intermediate clearly indicates substantial interactions between the two domains mediated by the single-layer β-sheet. We propose that a rigid and stable intervening region between two domains creates an overlap between two folding units and can energetically couple their folding reactions. Copyright © 2018. Published by Elsevier Ltd.

  16. How Many Protein Sequences Fold to a Given Structure? A Coevolutionary Analysis.

    Science.gov (United States)

    Tian, Pengfei; Best, Robert B

    2017-10-17

    Quantifying the relationship between protein sequence and structure is key to understanding the protein universe. A fundamental measure of this relationship is the total number of amino acid sequences that can fold to a target protein structure, known as the "sequence capacity," which has been suggested as a proxy for how designable a given protein fold is. Although sequence capacity has been extensively studied using lattice models and theory, numerical estimates for real protein structures are currently lacking. In this work, we have quantitatively estimated the sequence capacity of 10 proteins with a variety of different structures using a statistical model based on residue-residue co-evolution to capture the variation of sequences from the same protein family. Remarkably, we find that even for the smallest protein folds, such as the WW domain, the number of foldable sequences is extremely large, exceeding the Avogadro constant. In agreement with earlier theoretical work, the calculated sequence capacity is positively correlated with the size of the protein, or better, the density of contacts. This allows the absolute sequence capacity of a given protein to be approximately predicted from its structure. On the other hand, the relative sequence capacity, i.e., normalized by the total number of possible sequences, is an extremely tiny number and is strongly anti-correlated with the protein length. Thus, although there may be more foldable sequences for larger proteins, it will be much harder to find them. Lastly, we have correlated the evolutionary age of proteins in the CATH database with their sequence capacity as predicted by our model. The results suggest a trade-off between the opposing requirements of high designability and the likelihood of a novel fold emerging by chance. Published by Elsevier Inc.

  17. Participation of Low Molecular Weight Electron Carriers in Oxidative Protein Folding

    Directory of Open Access Journals (Sweden)

    József Mandl

    2009-03-01

    Full Text Available Oxidative protein folding is mediated by a proteinaceous electron relay system, in which the concerted action of protein disulfide isomerase and Ero1 delivers the electrons from thiol groups to the final acceptor. Oxygen appears to be the final oxidant in aerobic living organisms, although the existence of alternative electron acceptors, e.g. fumarate or nitrate, cannot be excluded. Whilst the protein components of the system are well-known, less attention has been turned to the role of low molecular weight electron carriers in the process. The function of ascorbate, tocopherol and vitamin K has been raised recently. In vitro and in vivo evidence suggests that these redox-active compounds can contribute to the functioning of oxidative folding. This review focuses on the participation of small molecular weight redox compounds in oxidative protein folding.

  18. Reliable protein folding on non-funneled energy landscapes: the free energy reaction path

    OpenAIRE

    Lois, Gregg; Blawzdziewicz, Jerzy; O'Hern, Corey S.

    2008-01-01

    A theoretical framework is developed to study the dynamics of protein folding. The key insight is that the search for the native protein conformation is influenced by the rate r at which external parameters, such as temperature, chemical denaturant or pH, are adjusted to induce folding. A theory based on this insight predicts that (1) proteins with non-funneled energy landscapes can fold reliably to their native state, (2) reliable folding can occur as an equilibrium or out-of-equilibrium pro...

  19. Effects of knot type in the folding of topologically complex lattice proteins

    Science.gov (United States)

    Soler, Miguel A.; Nunes, Ana; Faísca, Patrícia F. N.

    2014-07-01

    The folding properties of a protein whose native structure contains a 52 knot are investigated by means of extensive Monte Carlo simulations of a simple lattice model and compared with those of a 31 knot. A 52 knot embedded in the native structure enhances the kinetic stability of the carrier lattice protein in a way that is clearly more pronounced than in the case of the 31 knot. However, this happens at the expense of a severe loss in folding efficiency, an observation that is consistent with the relative abundance of 31 and 52 knots in the Protein Data Bank. The folding mechanism of the 52 knot shares with that of the 31 knot the occurrence of a threading movement of the chain terminus that lays closer to the knotted core. However, co-concomitant knotting and folding in the 52 knot occurs with negligible probability, in sharp contrast to what is observed for the 31 knot. The study of several single point mutations highlights the importance in the folding of knotted proteins of the so-called structural mutations (i.e., energetic perturbations of native interactions between residues that are critical for knotting but not for folding). On the other hand, the present study predicts that mutations that perturb the folding transition state may significantly enhance the kinetic stability of knotted proteins provided they involve residues located within the knotted core.

  20. High Pressure ZZ-Exchange NMR Reveals Key Features of Protein Folding Transition States.

    Science.gov (United States)

    Zhang, Yi; Kitazawa, Soichiro; Peran, Ivan; Stenzoski, Natalie; McCallum, Scott A; Raleigh, Daniel P; Royer, Catherine A

    2016-11-23

    Understanding protein folding mechanisms and their sequence dependence requires the determination of residue-specific apparent kinetic rate constants for the folding and unfolding reactions. Conventional two-dimensional NMR, such as HSQC experiments, can provide residue-specific information for proteins. However, folding is generally too fast for such experiments. ZZ-exchange NMR spectroscopy allows determination of folding and unfolding rates on much faster time scales, yet even this regime is not fast enough for many protein folding reactions. The application of high hydrostatic pressure slows folding by orders of magnitude due to positive activation volumes for the folding reaction. We combined high pressure perturbation with ZZ-exchange spectroscopy on two autonomously folding protein domains derived from the ribosomal protein, L9. We obtained residue-specific apparent rates at 2500 bar for the N-terminal domain of L9 (NTL9), and rates at atmospheric pressure for a mutant of the C-terminal domain (CTL9) from pressure dependent ZZ-exchange measurements. Our results revealed that NTL9 folding is almost perfectly two-state, while small deviations from two-state behavior were observed for CTL9. Both domains exhibited large positive activation volumes for folding. The volumetric properties of these domains reveal that their transition states contain most of the internal solvent excluded voids that are found in the hydrophobic cores of the respective native states. These results demonstrate that by coupling it with high pressure, ZZ-exchange can be extended to investigate a large number of protein conformational transitions.

  1. Folding propensity of intrinsically disordered proteins by osmotic stress

    International Nuclear Information System (INIS)

    Mansouri, Amanda L.; Grese, Laura N.; Rowe, Erica L.

    2016-01-01

    Proteins imparted with intrinsic disorder conduct a range of essential cellular functions. To better understand the folding and hydration properties of intrinsically disordered proteins (IDPs), we used osmotic stress to induce conformational changes in nuclear co-activator binding domain (NCBD) and activator for thyroid hormone and retinoid receptor (ACTR). Osmotic stress was applied by the addition of small and polymeric osmolytes, where we discovered that water contributions to NCBD folding always exceeded those for ACTR. Both NCBD and ACTR were found to gain a-helical structure with increasing osmotic stress, consistent with their folding upon NCBD/ACTR complex formation. Using small-angle neutron scattering (SANS), we further characterized NCBD structural changes with the osmolyte ethylene glycol. Here a large reduction in overall size initially occurred before substantial secondary structural change. In conclusion, by focusing on folding propensity, and linked hydration changes, we uncover new insights that may be important for how IDP folding contributes to binding.

  2. A partially folded intermediate species of the β-sheet protein apo-pseudoazurin ism trapped during proline-limited folding

    NARCIS (Netherlands)

    Reader, J.S.; van Nuland, N.A.J.; Thompson, G.S.; Ferguson, S.J.; Dobson, C.M.; Radford, S.E.

    2001-01-01

    The folding of apo-pseudoazurin, a 123-residue, predominantly -sheet protein with a complex Greek key topology, has been investigated using several biophysical techniques. Kinetic analysis of refolding using farand near-ultraviolet circular dichroism (UV CD) shows that the protein folds slowly to

  3. New insights into structural determinants of prion protein folding and stability.

    Science.gov (United States)

    Benetti, Federico; Legname, Giuseppe

    2015-01-01

    Prions are the etiological agent of fatal neurodegenerative diseases called prion diseases or transmissible spongiform encephalopathies. These maladies can be sporadic, genetic or infectious disorders. Prions are due to post-translational modifications of the cellular prion protein leading to the formation of a β-sheet enriched conformer with altered biochemical properties. The molecular events causing prion formation in sporadic prion diseases are still elusive. Recently, we published a research elucidating the contribution of major structural determinants and environmental factors in prion protein folding and stability. Our study highlighted the crucial role of octarepeats in stabilizing prion protein; the presence of a highly enthalpically stable intermediate state in prion-susceptible species; and the role of disulfide bridge in preserving native fold thus avoiding the misfolding to a β-sheet enriched isoform. Taking advantage from these findings, in this work we present new insights into structural determinants of prion protein folding and stability.

  4. Co-evolutionary constraints of globular proteins correlate with their folding rates.

    Science.gov (United States)

    Mallik, Saurav; Kundu, Sudip

    2015-08-04

    Folding rates (lnkf) of globular proteins correlate with their biophysical properties, but relationship between lnkf and patterns of sequence evolution remains elusive. We introduce 'relative co-evolution order' (rCEO) as length-normalized average primary chain separation of co-evolving pairs (CEPs), which negatively correlates with lnkf. In addition to pairs in native 3D contact, indirectly connected and structurally remote CEPs probably also play critical roles in protein folding. Correlation between rCEO and lnkf is stronger in multi-state proteins than two-state proteins, contrasting the case of contact order (co), where stronger correlation is found in two-state proteins. Finally, rCEO, co and lnkf are fitted into a 3D linear correlation. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  5. Coupling ligand recognition to protein folding in an engineered variant of rabbit ileal lipid binding protein.

    Science.gov (United States)

    Kouvatsos, Nikolaos; Meldrum, Jill K; Searle, Mark S; Thomas, Neil R

    2006-11-28

    We have engineered a variant of the beta-clam shell protein ILBP which lacks the alpha-helical motif that caps the central binding cavity; the mutant protein is sufficiently destabilised that it is unfolded under physiological conditions, however, it unexpectedly binds its natural bile acid substrates with high affinity forming a native-like beta-sheet rich structure and demonstrating strong thermodynamic coupling between ligand binding and protein folding.

  6. Towards a systematic classification of protein folds

    DEFF Research Database (Denmark)

    Lindgård, Per-Anker; Bohr, Henrik

    1997-01-01

    structures are given a unique name, which simultaneously represent a linear string of physical coupling constants describing hinge spin interactions. We have defined a metric and a precise distance measure between the fold classes. An automated procedure is constructed in which any protein structure...

  7. Predicting protein folding pathways at the mesoscopic level based on native interactions between secondary structure elements

    Directory of Open Access Journals (Sweden)

    Sze Sing-Hoi

    2008-07-01

    Full Text Available Abstract Background Since experimental determination of protein folding pathways remains difficult, computational techniques are often used to simulate protein folding. Most current techniques to predict protein folding pathways are computationally intensive and are suitable only for small proteins. Results By assuming that the native structure of a protein is known and representing each intermediate conformation as a collection of fully folded structures in which each of them contains a set of interacting secondary structure elements, we show that it is possible to significantly reduce the conformation space while still being able to predict the most energetically favorable folding pathway of large proteins with hundreds of residues at the mesoscopic level, including the pig muscle phosphoglycerate kinase with 416 residues. The model is detailed enough to distinguish between different folding pathways of structurally very similar proteins, including the streptococcal protein G and the peptostreptococcal protein L. The model is also able to recognize the differences between the folding pathways of protein G and its two structurally similar variants NuG1 and NuG2, which are even harder to distinguish. We show that this strategy can produce accurate predictions on many other proteins with experimentally determined intermediate folding states. Conclusion Our technique is efficient enough to predict folding pathways for both large and small proteins at the mesoscopic level. Such a strategy is often the only feasible choice for large proteins. A software program implementing this strategy (SSFold is available at http://faculty.cs.tamu.edu/shsze/ssfold.

  8. Prediction of the optimal set of contacts to fold the smallest knotted protein

    International Nuclear Information System (INIS)

    Dabrowski-Tumanski, P; Jarmolinska, A I; Sulkowska, J I

    2015-01-01

    Knotted protein chains represent a new motif in protein folds. They have been linked to various diseases, and recent extensive analysis of the Protein Data Bank shows that they constitute 1.5% of all deposited protein structures. Despite thorough theoretical and experimental investigations, the role of knots in proteins still remains elusive. Nonetheless, it is believed that knots play an important role in mechanical and thermal stability of proteins. Here, we perform a comprehensive analysis of native, shadow-specific and non-native interactions which describe free energy landscape of the smallest knotted protein (PDB id 2efv). We show that the addition of shadow-specific contacts in the loop region greatly enhances folding kinetics, while the addition of shadow-specific contacts along the C-terminal region (H3 or H4) results in a new folding route with slower kinetics. By means of direct coupling analysis (DCA) we predict non-native contacts which also can accelerate kinetics. Next, we show that the length of the C-terminal knot tail is responsible for the shape of the free energy barrier, while the influence of the elongation of the N-terminus is not significant. Finally, we develop a concept of a minimal contact map sufficient for 2efv protein to fold and analyze properties of this protein using this map. (paper)

  9. Prediction of the optimal set of contacts to fold the smallest knotted protein

    Science.gov (United States)

    Dabrowski-Tumanski, P.; Jarmolinska, A. I.; Sulkowska, J. I.

    2015-09-01

    Knotted protein chains represent a new motif in protein folds. They have been linked to various diseases, and recent extensive analysis of the Protein Data Bank shows that they constitute 1.5% of all deposited protein structures. Despite thorough theoretical and experimental investigations, the role of knots in proteins still remains elusive. Nonetheless, it is believed that knots play an important role in mechanical and thermal stability of proteins. Here, we perform a comprehensive analysis of native, shadow-specific and non-native interactions which describe free energy landscape of the smallest knotted protein (PDB id 2efv). We show that the addition of shadow-specific contacts in the loop region greatly enhances folding kinetics, while the addition of shadow-specific contacts along the C-terminal region (H3 or H4) results in a new folding route with slower kinetics. By means of direct coupling analysis (DCA) we predict non-native contacts which also can accelerate kinetics. Next, we show that the length of the C-terminal knot tail is responsible for the shape of the free energy barrier, while the influence of the elongation of the N-terminus is not significant. Finally, we develop a concept of a minimal contact map sufficient for 2efv protein to fold and analyze properties of this protein using this map.

  10. Protein folding and misfolding shining light by infrared spectroscopy

    CERN Document Server

    Fabian, Heinz

    2012-01-01

    Infrared spectroscopy is a new and innovative technology to study protein folding/misfolding events in the broad arsenal of techniques conventionally used in this field. The progress in understanding protein folding and misfolding is primarily due to the development of biophysical methods which permit to probe conformational changes with high kinetic and structural resolution. The most commonly used approaches rely on rapid mixing methods to initiate the folding event via a sudden change in solvent conditions. Traditionally, techniques such as fluorescence, circular dichroism or visible absorption are applied to probe the process. In contrast to these techniques, infrared spectroscopy came into play only very recently, and the progress made in this field up to date which now permits to probe folding events over the time scale from picoseconds to minutes has not yet been discussed in a book. The aim of this book is to provide an overview of the developments as seen by some of the main contributors to the field...

  11. Discovery of Proteomic Code with mRNA Assisted Protein Folding

    Directory of Open Access Journals (Sweden)

    Jan C. Biro

    2008-12-01

    Full Text Available The 3x redundancy of the Genetic Code is usually explained as a necessity to increase the mutation-resistance of the genetic information. However recent bioinformatical observations indicate that the redundant Genetic Code contains more biological information than previously known and which is additional to the 64/20 definition of amino acids. It might define the physico-chemical and structural properties of amino acids, the codon boundaries, the amino acid co-locations (interactions in the coded proteins and the free folding energy of mRNAs. This additional information, which seems to be necessary to determine the 3D structure of coding nucleic acids as well as the coded proteins, is known as the Proteomic Code and mRNA Assisted Protein Folding.

  12. Folding 19 proteins to their native state and stability of large proteins from a coarse-grained model.

    Science.gov (United States)

    Kapoor, Abhijeet; Travesset, Alex

    2014-03-01

    We develop an intermediate resolution model, where the backbone is modeled with atomic resolution but the side chain with a single bead, by extending our previous model (Proteins (2013) DOI: 10.1002/prot.24269) to properly include proline, preproline residues and backbone rigidity. Starting from random configurations, the model properly folds 19 proteins (including a mutant 2A3D sequence) into native states containing β sheet, α helix, and mixed α/β. As a further test, the stability of H-RAS (a 169 residue protein, critical in many signaling pathways) is investigated: The protein is stable, with excellent agreement with experimental B-factors. Despite that proteins containing only α helices fold to their native state at lower backbone rigidity, and other limitations, which we discuss thoroughly, the model provides a reliable description of the dynamics as compared with all atom simulations, but does not constrain secondary structures as it is typically the case in more coarse-grained models. Further implications are described. Copyright © 2013 Wiley Periodicals, Inc.

  13. Simulation of fluorescence resonance energy transfer experiments: effect of the dyes on protein folding

    International Nuclear Information System (INIS)

    Allen, Lucy R; Paci, Emanuele

    2010-01-01

    Fluorescence resonance energy transfer is a powerful technique which is often used to probe the properties of proteins and complex macromolecules. The technique relies on relatively large fluorescent dyes which are engineered into the molecule of interest. In the case of small proteins, these dyes may affect the stability of the protein, and modify the folding kinetics and the folding mechanisms which are being probed. Here we use atomistic simulation to investigate the effect that commonly used fluorescent dyes have on the folding of a four-helix bundle protein. We show that, depending on where the dyes are attached, their effect on the kinetic and thermodynamic properties of the protein may be significant. We find that, while the overall folding mechanism is not affected by the dyes, they can destabilize, or even stabilize, intermediate states.

  14. Denatured state is critical in determining the properties of model proteins designed on different folds

    DEFF Research Database (Denmark)

    Amatori, Andrea; Ferkinghoff-Borg, Jesper; Tiana, Guido

    2008-01-01

    The thermodynamics of proteins designed on three common folds (SH3, chymotrypsin inhibitor 2 [CI2], and protein G) is studied with a simplified C alpha, model and compared with the thermodynamics of proteins designed on random-generated folds. The model allows to design sequences to fold within a...

  15. GroEL-GroES assisted folding of multiple recombinant proteins simultaneously over-expressed in Escherichia coli.

    Science.gov (United States)

    Goyal, Megha; Chaudhuri, Tapan K

    2015-07-01

    Folding of aggregation prone recombinant proteins through co-expression of chaperonin GroEL and GroES has been a popular practice in the effort to optimize preparation of functional protein in Escherichia coli. Considering the demand for functional recombinant protein products, it is desirable to apply the chaperone assisted protein folding strategy for enhancing the yield of properly folded protein. Toward the same direction, it is also worth attempting folding of multiple recombinant proteins simultaneously over-expressed in E. coli through the assistance of co-expressed GroEL-ES. The genesis of this thinking was originated from the fact that cellular GroEL and GroES assist in the folding of several endogenous proteins expressed in the bacterial cell. Here we present the experimental findings from our study on co-expressed GroEL-GroES assisted folding of simultaneously over-expressed proteins maltodextrin glucosidase (MalZ) and yeast mitochondrial aconitase (mAco). Both proteins mentioned here are relatively larger and aggregation prone, mostly form inclusion bodies, and undergo GroEL-ES assisted folding in E. coli cells during over-expression. It has been reported that the relative yield of properly folded functional forms of MalZ and mAco with the exogenous GroEL-ES assistance were comparable with the results when these proteins were overexpressed alone. This observation is quite promising and highlights the fact that GroEL and GroES can assist in the folding of multiple substrate proteins simultaneously when over-expressed in E. coli. This method might be a potential tool for enhanced production of multiple functional recombinant proteins simultaneously in E. coli. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. Thermodynamics of protein folding: a random matrix formulation.

    Science.gov (United States)

    Shukla, Pragya

    2010-10-20

    The process of protein folding from an unfolded state to a biologically active, folded conformation is governed by many parameters, e.g. the sequence of amino acids, intermolecular interactions, the solvent, temperature and chaperon molecules. Our study, based on random matrix modeling of the interactions, shows, however, that the evolution of the statistical measures, e.g. Gibbs free energy, heat capacity, and entropy, is single parametric. The information can explain the selection of specific folding pathways from an infinite number of possible ways as well as other folding characteristics observed in computer simulation studies. © 2010 IOP Publishing Ltd

  17. Predicting protein folding rate change upon point mutation using residue-level coevolutionary information.

    Science.gov (United States)

    Mallik, Saurav; Das, Smita; Kundu, Sudip

    2016-01-01

    Change in folding kinetics of globular proteins upon point mutation is crucial to a wide spectrum of biological research, such as protein misfolding, toxicity, and aggregations. Here we seek to address whether residue-level coevolutionary information of globular proteins can be informative to folding rate changes upon point mutations. Generating residue-level coevolutionary networks of globular proteins, we analyze three parameters: relative coevolution order (rCEO), network density (ND), and characteristic path length (CPL). A point mutation is considered to be equivalent to a node deletion of this network and respective percentage changes in rCEO, ND, CPL are found linearly correlated (0.84, 0.73, and -0.61, respectively) with experimental folding rate changes. The three parameters predict the folding rate change upon a point mutation with 0.031, 0.045, and 0.059 standard errors, respectively. © 2015 Wiley Periodicals, Inc.

  18. An Intramolecular Chaperone Inserted in Bacteriophage P22 Coat Protein Mediates Its Chaperonin-independent Folding*

    Science.gov (United States)

    Suhanovsky, Margaret M.; Teschke, Carolyn M.

    2013-01-01

    The bacteriophage P22 coat protein has the common HK97-like fold but with a genetically inserted domain (I-domain). The role of the I-domain, positioned at the outermost surface of the capsid, is unknown. We hypothesize that the I-domain may act as an intramolecular chaperone because the coat protein folds independently, and many folding mutants are localized to the I-domain. The function of the I-domain was investigated by generating the coat protein core without its I-domain and the isolated I-domain. The core coat protein shows a pronounced folding defect. The isolated I-domain folds autonomously and has a high thermodynamic stability and fast folding kinetics in the presence of a peptidyl prolyl isomerase. Thus, the I-domain provides thermodynamic stability to the full-length coat protein so that it can fold reasonably efficiently while still allowing the HK97-like core to retain the flexibility required for conformational switching during procapsid assembly and maturation. PMID:24126914

  19. Structure of a Trypanosoma brucei α/β-hydrolase fold protein with unknown function

    International Nuclear Information System (INIS)

    Merritt, Ethan A.; Holmes, Margaret; Buckner, Frederick S.; Van Voorhis, Wesley C.; Quartly, Erin; Phizicky, Eric M.; Lauricella, Angela; Luft, Joseph; DeTitta, George; Neely, Helen; Zucker, Frank; Hol, Wim G. J.

    2008-01-01

    T. brucei gene Tb10.6k15.0140 codes for an α/β-hydrolase fold protein of unknown function. The 2.2 Å crystal structure shows that members of this sequence family retain a conserved Ser residue at the expected site of a catalytic nucleophile, but that trypanosomatid sequences lack structural homologs for the other expected residues of the catalytic triad. The structure of a structural genomics target protein, Tbru020260AAA from Trypanosoma brucei, has been determined to a resolution of 2.2 Å using multiple-wavelength anomalous diffraction at the Se K edge. This protein belongs to Pfam sequence family PF08538 and is only distantly related to previously studied members of the α/β-hydrolase fold family. Structural superposition onto representative α/β-hydrolase fold proteins of known function indicates that a possible catalytic nucleophile, Ser116 in the T. brucei protein, lies at the expected location. However, the present structure and by extension the other trypanosomatid members of this sequence family have neither sequence nor structural similarity at the location of other active-site residues typical for proteins with this fold. Together with the presence of an additional domain between strands β6 and β7 that is conserved in trypanosomatid genomes, this suggests that the function of these homologs has diverged from other members of the fold family

  20. Chloroplast Chaperonin: An Intricate Protein Folding Machine for Photosynthesis

    Directory of Open Access Journals (Sweden)

    Qian Zhao

    2018-01-01

    Full Text Available Group I chaperonins are large cylindrical-shaped nano-machines that function as a central hub in the protein quality control system in the bacterial cytosol, mitochondria and chloroplasts. In chloroplasts, proteins newly synthesized by chloroplast ribosomes, unfolded by diverse stresses, or translocated from the cytosol run the risk of aberrant folding and aggregation. The chloroplast chaperonin system assists these proteins in folding into their native states. A widely known protein folded by chloroplast chaperonin is the large subunit of ribulose 1,5-bisphosphate carboxylase/oxygenase (Rubisco, an enzyme responsible for the fixation of inorganic CO2 into organic carbohydrates during photosynthesis. Chloroplast chaperonin was initially identified as a Rubisco-binding protein. All photosynthetic eucaryotes genomes encode multiple chaperonin genes which can be divided into α and β subtypes. Unlike the homo-oligomeric chaperonins from bacteria and mitochondria, chloroplast chaperonins are more complex and exists as intricate hetero-oligomers containing both subtypes. The Group I chaperonin requires proper interaction with a detachable lid-like co-chaperonin in the presence of ATP and Mg2+ for substrate encapsulation and conformational transition. Besides the typical Cpn10-like co-chaperonin, a unique co-chaperonin consisting of two tandem Cpn10-like domains joined head-to-tail exists in chloroplasts. Since chloroplasts were proposed as sensors to various environmental stresses, this diversified chloroplast chaperonin system has the potential to adapt to complex conditions by accommodating specific substrates or through regulation at both the transcriptional and post-translational levels. In this review, we discuss recent progress on the unique structure and function of the chloroplast chaperonin system based on model organisms Chlamydomonas reinhardtii and Arabidopsis thaliana. Knowledge of the chloroplast chaperonin system may ultimately lead

  1. Targeting the OB-Folds of Replication Protein A with Small Molecules

    Directory of Open Access Journals (Sweden)

    Victor J. Anciano Granadillo

    2010-01-01

    Full Text Available Replication protein A (RPA is the main eukaryotic single-strand (ss DNA-binding protein involved in DNA replication and repair. We have identified and developed two classes of small molecule inhibitors (SMIs that show in vitro inhibition of the RPA-DNA interaction. We present further characterization of these SMIs with respect to their target binding, mechanism of action, and specificity. Both reversible and irreversible modes of inhibition are observed for the different classes of SMIs with one class found to specifically interact with DNA-binding domains A and B (DBD-A/B of RPA. In comparison with other oligonucleotide/oligosaccharide binding-fold (OB-fold containing ssDNA-binding proteins, one class of SMIs displayed specificity for the RPA protein. Together these data demonstrate that the specific targeting of a protein-DNA interaction can be exploited towards interrogating the cellular activity of RPA as well as increasing the efficacy of DNA-damaging chemotherapeutics used in cancer treatment.

  2. A multi-directional rapidly exploring random graph (mRRG) for protein folding

    KAUST Repository

    Nath, Shuvra Kanti; Thomas, Shawna; Ekenna, Chinwe; Amato, Nancy M.

    2012-01-01

    Modeling large-scale protein motions, such as those involved in folding and binding interactions, is crucial to better understanding not only how proteins move and interact with other molecules but also how proteins misfold, thus causing many devastating diseases. Robotic motion planning algorithms, such as Rapidly Exploring Random Trees (RRTs), have been successful in simulating protein folding pathways. Here, we propose a new multi-directional Rapidly Exploring Random Graph (mRRG) specifically tailored for proteins. Unlike traditional RRGs which only expand a parent conformation in a single direction, our strategy expands the parent conformation in multiple directions to generate new samples. Resulting samples are connected to the parent conformation and its nearest neighbors. By leveraging multiple directions, mRRG can model the protein motion landscape with reduced computational time compared to several other robotics-based methods for small to moderate-sized proteins. Our results on several proteins agree with experimental hydrogen out-exchange, pulse-labeling, and F-value analysis. We also show that mRRG covers the conformation space better as compared to the other computation methods. Copyright © 2012 ACM.

  3. Protein folding simulations by generalized-ensemble algorithms.

    Science.gov (United States)

    Yoda, Takao; Sugita, Yuji; Okamoto, Yuko

    2014-01-01

    In the protein folding problem, conventional simulations in physical statistical mechanical ensembles, such as the canonical ensemble with fixed temperature, face a great difficulty. This is because there exist a huge number of local-minimum-energy states in the system and the conventional simulations tend to get trapped in these states, giving wrong results. Generalized-ensemble algorithms are based on artificial unphysical ensembles and overcome the above difficulty by performing random walks in potential energy, volume, and other physical quantities or their corresponding conjugate parameters such as temperature, pressure, etc. The advantage of generalized-ensemble simulations lies in the fact that they not only avoid getting trapped in states of energy local minima but also allows the calculations of physical quantities as functions of temperature or other parameters from a single simulation run. In this article we review the generalized-ensemble algorithms. Four examples, multicanonical algorithm, replica-exchange method, replica-exchange multicanonical algorithm, and multicanonical replica-exchange method, are described in detail. Examples of their applications to the protein folding problem are presented.

  4. Fluorescent in situ folding control for rapid optimization of cell-free membrane protein synthesis.

    Directory of Open Access Journals (Sweden)

    Annika Müller-Lucks

    Full Text Available Cell-free synthesis is an open and powerful tool for high-yield protein production in small reaction volumes predestined for high-throughput structural and functional analysis. Membrane proteins require addition of detergents for solubilization, liposomes, or nanodiscs. Hence, the number of parameters to be tested is significantly higher than with soluble proteins. Optimization is commonly done with respect to protein yield, yet without knowledge of the protein folding status. This approach contains a large inherent risk of ending up with non-functional protein. We show that fluorophore formation in C-terminal fusions with green fluorescent protein (GFP indicates the folding state of a membrane protein in situ, i.e. within the cell-free reaction mixture, as confirmed by circular dichroism (CD, proteoliposome reconstitution and functional assays. Quantification of protein yield and in-gel fluorescence intensity imply suitability of the method for membrane proteins of bacterial, protozoan, plant, and mammalian origin, representing vacuolar and plasma membrane localization, as well as intra- and extracellular positioning of the C-terminus. We conclude that GFP-fusions provide an extension to cell-free protein synthesis systems eliminating the need for experimental folding control and, thus, enabling rapid optimization towards membrane protein quality.

  5. Computational Modeling of Proteins based on Cellular Automata: A Method of HP Folding Approximation.

    Science.gov (United States)

    Madain, Alia; Abu Dalhoum, Abdel Latif; Sleit, Azzam

    2018-06-01

    The design of a protein folding approximation algorithm is not straightforward even when a simplified model is used. The folding problem is a combinatorial problem, where approximation and heuristic algorithms are usually used to find near optimal folds of proteins primary structures. Approximation algorithms provide guarantees on the distance to the optimal solution. The folding approximation approach proposed here depends on two-dimensional cellular automata to fold proteins presented in a well-studied simplified model called the hydrophobic-hydrophilic model. Cellular automata are discrete computational models that rely on local rules to produce some overall global behavior. One-third and one-fourth approximation algorithms choose a subset of the hydrophobic amino acids to form H-H contacts. Those algorithms start with finding a point to fold the protein sequence into two sides where one side ignores H's at even positions and the other side ignores H's at odd positions. In addition, blocks or groups of amino acids fold the same way according to a predefined normal form. We intend to improve approximation algorithms by considering all hydrophobic amino acids and folding based on the local neighborhood instead of using normal forms. The CA does not assume a fixed folding point. The proposed approach guarantees one half approximation minus the H-H endpoints. This lower bound guaranteed applies to short sequences only. This is proved as the core and the folds of the protein will have two identical sides for all short sequences.

  6. Protein folding includes oligomerization – examples from the endoplasmic reticulum and cytosol

    NARCIS (Netherlands)

    Christis, C.; Lubsen, N.H.; Braakman, I.

    2008-01-01

    A correct three-dimensional structure is a prerequisite for protein functionality, and therefore for life. Thus, it is not surprising that our cells are packed with proteins that assist protein folding, the process in which the native three-dimensional structure is formed. In general, plasma

  7. Rapid measurement of residual dipolar couplings for fast fold elucidation of proteins

    Energy Technology Data Exchange (ETDEWEB)

    Rasia, Rodolfo M. [Jean-Pierre Ebel CNRS/CEA/UJF, Institut de Biologie Structurale (France); Lescop, Ewen [CNRS, Institut de Chimie des Substances Naturelles (France); Palatnik, Javier F. [Universidad Nacional de Rosario, Instituto de Biologia Molecular y Celular de Rosario, Facultad de Ciencias Bioquimicas y Farmaceuticas (Argentina); Boisbouvier, Jerome, E-mail: jerome.boisbouvier@ibs.fr; Brutscher, Bernhard, E-mail: Bernhard.brutscher@ibs.fr [Jean-Pierre Ebel CNRS/CEA/UJF, Institut de Biologie Structurale (France)

    2011-11-15

    It has been demonstrated that protein folds can be determined using appropriate computational protocols with NMR chemical shifts as the sole source of experimental restraints. While such approaches are very promising they still suffer from low convergence resulting in long computation times to achieve accurate results. Here we present a suite of time- and sensitivity optimized NMR experiments for rapid measurement of up to six RDCs per residue. Including such an RDC data set, measured in less than 24 h on a single aligned protein sample, greatly improves convergence of the Rosetta-NMR protocol, allowing for overnight fold calculation of small proteins. We demonstrate the performance of our fast fold calculation approach for ubiquitin as a test case, and for two RNA-binding domains of the plant protein HYL1. Structure calculations based on simulated RDC data highlight the importance of an accurate and precise set of several complementary RDCs as additional input restraints for high-quality de novo structure determination.

  8. Generic framework for mining cellular automata models on protein-folding simulations.

    Science.gov (United States)

    Diaz, N; Tischer, I

    2016-05-13

    Cellular automata model identification is an important way of building simplified simulation models. In this study, we describe a generic architectural framework to ease the development process of new metaheuristic-based algorithms for cellular automata model identification in protein-folding trajectories. Our framework was developed by a methodology based on design patterns that allow an improved experience for new algorithms development. The usefulness of the proposed framework is demonstrated by the implementation of four algorithms, able to obtain extremely precise cellular automata models of the protein-folding process with a protein contact map representation. Dynamic rules obtained by the proposed approach are discussed, and future use for the new tool is outlined.

  9. Synergistic cooperation of PDI family members in peroxiredoxin 4-driven oxidative protein folding.

    Science.gov (United States)

    Sato, Yoshimi; Kojima, Rieko; Okumura, Masaki; Hagiwara, Masatoshi; Masui, Shoji; Maegawa, Ken-ichi; Saiki, Masatoshi; Horibe, Tomohisa; Suzuki, Mamoru; Inaba, Kenji

    2013-01-01

    The mammalian endoplasmic reticulum (ER) harbors disulfide bond-generating enzymes, including Ero1α and peroxiredoxin 4 (Prx4), and nearly 20 members of the protein disulfide isomerase family (PDIs), which together constitute a suitable environment for oxidative protein folding. Here, we clarified the Prx4 preferential recognition of two PDI family proteins, P5 and ERp46, and the mode of interaction between Prx4 and P5 thioredoxin domain. Detailed analyses of oxidative folding catalyzed by the reconstituted Prx4-PDIs pathways demonstrated that, while P5 and ERp46 are dedicated to rapid, but promiscuous, disulfide introduction, PDI is an efficient proofreader of non-native disulfides. Remarkably, the Prx4-dependent formation of native disulfide bonds was accelerated when PDI was combined with ERp46 or P5, suggesting that PDIs work synergistically to increase the rate and fidelity of oxidative protein folding. Thus, the mammalian ER seems to contain highly systematized oxidative networks for the efficient production of large quantities of secretory proteins.

  10. Stabilities and Dynamics of Protein Folding Nuclei by Molecular Dynamics Simulation

    Science.gov (United States)

    Song, Yong-Shun; Zhou, Xin; Zheng, Wei-Mou; Wang, Yan-Ting

    2017-07-01

    To understand how the stabilities of key nuclei fragments affect protein folding dynamics, we simulate by molecular dynamics (MD) simulation in aqueous solution four fragments cut out of a protein G, including one α-helix (seqB: KVFKQYAN), two β-turns (seqA: LNGKTLKG and seqC: YDDATKTF), and one β-strand (seqD: DGEWTYDD). The Markov State Model clustering method combined with the coarse-grained conformation letters method are employed to analyze the data sampled from 2-μs equilibrium MD simulation trajectories. We find that seqA and seqB have more stable structures than their native structures which become metastable when cut out of the protein structure. As expected, seqD alone is flexible and does not have a stable structure. Throughout our simulations, the native structure of seqC is stable but cannot be reached if starting from a structure other than the native one, implying a funnel-shape free energy landscape of seqC in aqueous solution. All the above results suggest that different nuclei have different formation dynamics during protein folding, which may have a major contribution to the hierarchy of protein folding dynamics. Supported by the National Basic Research Program of China under Grant No. 2013CB932804, the National Natural Science Foundation of China under Grant No. 11421063, and the CAS Biophysics Interdisciplinary Innovation Team Project

  11. Sampling-based exploration of folded state of a protein under kinematic and geometric constraints

    KAUST Repository

    Yao, Peggy; Zhang, Liangjun; Latombe, Jean-Claude

    2011-01-01

    Flexibility is critical for a folded protein to bind to other molecules (ligands) and achieve its functions. The conformational selection theory suggests that a folded protein deforms continuously and its ligand selects the most favorable

  12. Sampling-based exploration of folded state of a protein under kinematic and geometric constraints

    KAUST Repository

    Yao, Peggy

    2011-10-04

    Flexibility is critical for a folded protein to bind to other molecules (ligands) and achieve its functions. The conformational selection theory suggests that a folded protein deforms continuously and its ligand selects the most favorable conformations to bind to. Therefore, one of the best options to study protein-ligand binding is to sample conformations broadly distributed over the protein-folded state. This article presents a new sampler, called kino-geometric sampler (KGS). This sampler encodes dominant energy terms implicitly by simple kinematic and geometric constraints. Two key technical contributions of KGS are (1) a robotics-inspired Jacobian-based method to simultaneously deform a large number of interdependent kinematic cycles without any significant break-up of the closure constraints, and (2) a diffusive strategy to generate conformation distributions that diffuse quickly throughout the protein folded state. Experiments on four very different test proteins demonstrate that KGS can efficiently compute distributions containing conformations close to target (e.g., functional) conformations. These targets are not given to KGS, hence are not used to bias the sampling process. In particular, for a lysine-binding protein, KGS was able to sample conformations in both the intermediate and functional states without the ligand, while previous work using molecular dynamics simulation had required the ligand to be taken into account in the potential function. Overall, KGS demonstrates that kino-geometric constraints characterize the folded subset of a protein conformation space and that this subset is small enough to be approximated by a relatively small distribution of conformations. © 2011 Wiley Periodicals, Inc.

  13. In vivo labelling of proteins associated with folded chromosomes of yeast

    International Nuclear Information System (INIS)

    Litske Petersen, J.G.; Pinon, R.

    1980-01-01

    Proteins associated with the pre-replicative (g 1 ) and post-replicative (g 2 ) folded chromosomes of Saccharomyces cerevisiae can be labelled in vivo by growing cells in acetate vegetative medium containing [ 35 S]methionine. In both sporulating (MATa/MATα) and non-sporulating (MATa/MATa, MATα/MATα) diploids proteins associated with the resting stage genome (g 0 ) can be labelled with [ 35 S]methionine during nitrogen starvation and in sporulation medium. In addition, in MATa/MATα diploids proteins associated with the meiotic replication form (r) can also be labelled. SDS-polyacrylamide gel electrophoresis and autoradiography of the labelled proteins from the various folded genome forms showed that the g 1 and g 2 patterns are, with the exception of one polypeptide band, essentially identical. Several differences distinguished the r and g 0 patterns from those of the g 1 and g 2 structures. At least four polypeptide bands distinguish the r and g 0 patterns. No significant differences were observed between the g 0 proteins of sporulating and non-sporulating diploids. (author)

  14. Autonomously folding protein fragments reveal differences in the energy landscapes of homologous RNases H.

    Directory of Open Access Journals (Sweden)

    Laura E Rosen

    Full Text Available An important approach to understanding how a protein sequence encodes its energy landscape is to compare proteins with different sequences that fold to the same general native structure. In this work, we compare E. coli and T. thermophilus homologs of the protein RNase H. Using protein fragments, we create equilibrium mimics of two different potential partially-folded intermediates (I(core and I(core+1 hypothesized to be present on the energy landscapes of these two proteins. We observe that both T. thermophilus RNase H (ttRNH fragments are folded and have distinct stabilities, indicating that both regions are capable of autonomous folding and that both intermediates are present as local minima on the ttRNH energy landscape. In contrast, the two E. coli RNase H (ecRNH fragments have very similar stabilities, suggesting that the presence of additional residues in the I(core+1 fragment does not affect the folding or structure as compared to I(core. NMR experiments provide additional evidence that only the I(core intermediate is populated by ecRNH. This is one of the biggest differences that has been observed between the energy landscapes of these two proteins. Additionally, we used a FRET experiment in the background of full-length ttRNH to specifically monitor the formation of the I(core+1 intermediate. We determine that the ttRNH I(core+1 intermediate is likely the intermediate populated prior to the rate-limiting barrier to global folding, in contrast to E. coli RNase H for which I(core is the folding intermediate. This result provides new insight into the nature of the rate-limiting barrier for the folding of RNase H.

  15. An evolutionarily conserved glycine-tyrosine motif forms a folding core in outer membrane proteins.

    Directory of Open Access Journals (Sweden)

    Marcin Michalik

    Full Text Available An intimate interaction between a pair of amino acids, a tyrosine and glycine on neighboring β-strands, has been previously reported to be important for the structural stability of autotransporters. Here, we show that the conservation of this interacting pair extends to nearly all major families of outer membrane β-barrel proteins, which are thought to have originated through duplication events involving an ancestral ββ hairpin. We analyzed the function of this motif using the prototypical outer membrane protein OmpX. Stopped-flow fluorescence shows that two folding processes occur in the millisecond time regime, the rates of which are reduced in the tyrosine mutant. Folding assays further demonstrate a reduction in the yield of folded protein for the mutant compared to the wild-type, as well as a reduction in thermal stability. Taken together, our data support the idea of an evolutionarily conserved 'folding core' that affects the folding, membrane insertion, and thermal stability of outer membrane protein β-barrels.

  16. Protein folding on the ribosome studied using NMR spectroscopy

    Science.gov (United States)

    Waudby, Christopher A.; Launay, Hélène; Cabrita, Lisa D.; Christodoulou, John

    2013-01-01

    NMR spectroscopy is a powerful tool for the investigation of protein folding and misfolding, providing a characterization of molecular structure, dynamics and exchange processes, across a very wide range of timescales and with near atomic resolution. In recent years NMR methods have also been developed to study protein folding as it might occur within the cell, in a de novo manner, by observing the folding of nascent polypeptides in the process of emerging from the ribosome during synthesis. Despite the 2.3 MDa molecular weight of the bacterial 70S ribosome, many nascent polypeptides, and some ribosomal proteins, have sufficient local flexibility that sharp resonances may be observed in solution-state NMR spectra. In providing information on dynamic regions of the structure, NMR spectroscopy is therefore highly complementary to alternative methods such as X-ray crystallography and cryo-electron microscopy, which have successfully characterized the rigid core of the ribosome particle. However, the low working concentrations and limited sample stability associated with ribosome–nascent chain complexes means that such studies still present significant technical challenges to the NMR spectroscopist. This review will discuss the progress that has been made in this area, surveying all NMR studies that have been published to date, and with a particular focus on strategies for improving experimental sensitivity. PMID:24083462

  17. Self-organization and mismatch tolerance in protein folding: General theory and an application

    Science.gov (United States)

    Fernández, Ariel; Berry, R. Stephen

    2000-03-01

    The folding of a protein is a process both expeditious and robust. The analysis of this process presented here uses a coarse, discretized representation of the evolving form of the backbone chain, based on its torsional states. This coarse description consists of discretizing the torsional coordinates modulo the Ramachandran basins in the local softmode dynamics. Whenever the representation exhibits "contact patterns" that correspond to topological compatibilities with particular structural forms, secondary and then tertiary, the elements constituting the pattern are effectively entrained by a reduction of their rates of exploration of their discretized configuration space. The properties "expeditious and robust" imply that the folding protein must have some tolerance to both torsional "frustrated" and side-chain contact mismatches which may occur during the folding process. The energy-entropy consequences of the staircase or funnel topography of the potential surface should allow the folding protein to correct these mismatches, eventually. This tolerance lends itself to an iterative pattern-recognition-and-feedback description of the folding process that reflects mismatched local torsional states and hydrophobic/polar contacts. The predictive potential of our algorithm is tested by application to the folding of bovine pancreatic trypsin inhibitor (BPTI), a protein whose ability to form its active structure is contingent upon its frustration tolerance.

  18. Bioinformatics analysis identify novel OB fold protein coding genes in C. elegans.

    Directory of Open Access Journals (Sweden)

    Daryanaz Dargahi

    Full Text Available BACKGROUND: The C. elegans genome has been extensively annotated by the WormBase consortium that uses state of the art bioinformatics pipelines, functional genomics and manual curation approaches. As a result, the identification of novel genes in silico in this model organism is becoming more challenging requiring new approaches. The Oligonucleotide-oligosaccharide binding (OB fold is a highly divergent protein family, in which protein sequences, in spite of having the same fold, share very little sequence identity (5-25%. Therefore, evidence from sequence-based annotation may not be sufficient to identify all the members of this family. In C. elegans, the number of OB-fold proteins reported is remarkably low (n=46 compared to other evolutionary-related eukaryotes, such as yeast S. cerevisiae (n=344 or fruit fly D. melanogaster (n=84. Gene loss during evolution or differences in the level of annotation for this protein family, may explain these discrepancies. METHODOLOGY/PRINCIPAL FINDINGS: This study examines the possibility that novel OB-fold coding genes exist in the worm. We developed a bioinformatics approach that uses the most sensitive sequence-sequence, sequence-profile and profile-profile similarity search methods followed by 3D-structure prediction as a filtering step to eliminate false positive candidate sequences. We have predicted 18 coding genes containing the OB-fold that have remarkably partially been characterized in C. elegans. CONCLUSIONS/SIGNIFICANCE: This study raises the possibility that the annotation of highly divergent protein fold families can be improved in C. elegans. Similar strategies could be implemented for large scale analysis by the WormBase consortium when novel versions of the genome sequence of C. elegans, or other evolutionary related species are being released. This approach is of general interest to the scientific community since it can be used to annotate any genome.

  19. Visualization of protein folding funnels in lattice models.

    Directory of Open Access Journals (Sweden)

    Antonio B Oliveira

    Full Text Available Protein folding occurs in a very high dimensional phase space with an exponentially large number of states, and according to the energy landscape theory it exhibits a topology resembling a funnel. In this statistical approach, the folding mechanism is unveiled by describing the local minima in an effective one-dimensional representation. Other approaches based on potential energy landscapes address the hierarchical structure of local energy minima through disconnectivity graphs. In this paper, we introduce a metric to describe the distance between any two conformations, which also allows us to go beyond the one-dimensional representation and visualize the folding funnel in 2D and 3D. In this way it is possible to assess the folding process in detail, e.g., by identifying the connectivity between conformations and establishing the paths to reach the native state, in addition to regions where trapping may occur. Unlike the disconnectivity maps method, which is based on the kinetic connections between states, our methodology is based on structural similarities inferred from the new metric. The method was developed in a 27-mer protein lattice model, folded into a 3×3×3 cube. Five sequences were studied and distinct funnels were generated in an analysis restricted to conformations from the transition-state to the native configuration. Consistent with the expected results from the energy landscape theory, folding routes can be visualized to probe different regions of the phase space, as well as determine the difficulty in folding of the distinct sequences. Changes in the landscape due to mutations were visualized, with the comparison between wild and mutated local minima in a single map, which serves to identify different trapping regions. The extension of this approach to more realistic models and its use in combination with other approaches are discussed.

  20. Protein folding optimization based on 3D off-lattice model via an improved artificial bee colony algorithm.

    Science.gov (United States)

    Li, Bai; Lin, Mu; Liu, Qiao; Li, Ya; Zhou, Changjun

    2015-10-01

    Protein folding is a fundamental topic in molecular biology. Conventional experimental techniques for protein structure identification or protein folding recognition require strict laboratory requirements and heavy operating burdens, which have largely limited their applications. Alternatively, computer-aided techniques have been developed to optimize protein structures or to predict the protein folding process. In this paper, we utilize a 3D off-lattice model to describe the original protein folding scheme as a simplified energy-optimal numerical problem, where all types of amino acid residues are binarized into hydrophobic and hydrophilic ones. We apply a balance-evolution artificial bee colony (BE-ABC) algorithm as the minimization solver, which is featured by the adaptive adjustment of search intensity to cater for the varying needs during the entire optimization process. In this work, we establish a benchmark case set with 13 real protein sequences from the Protein Data Bank database and evaluate the convergence performance of BE-ABC algorithm through strict comparisons with several state-of-the-art ABC variants in short-term numerical experiments. Besides that, our obtained best-so-far protein structures are compared to the ones in comprehensive previous literature. This study also provides preliminary insights into how artificial intelligence techniques can be applied to reveal the dynamics of protein folding. Graphical Abstract Protein folding optimization using 3D off-lattice model and advanced optimization techniques.

  1. Right- and left-handed three-helix proteins. I. Experimental and simulation analysis of differences in folding and structure.

    Science.gov (United States)

    Glyakina, Anna V; Pereyaslavets, Leonid B; Galzitskaya, Oxana V

    2013-09-01

    Despite the large number of publications on three-helix protein folding, there is no study devoted to the influence of handedness on the rate of three-helix protein folding. From the experimental studies, we make a conclusion that the left-handed three-helix proteins fold faster than the right-handed ones. What may explain this difference? An important question arising in this paper is whether the modeling of protein folding can catch the difference between the protein folding rates of proteins with similar structures but with different folding mechanisms. To answer this question, the folding of eight three-helix proteins (four right-handed and four left-handed), which are similar in size, was modeled using the Monte Carlo and dynamic programming methods. The studies allowed us to determine the orders of folding of the secondary-structure elements in these domains and amino acid residues which are important for the folding. The obtained data are in good correlation with each other and with the experimental data. Structural analysis of these proteins demonstrated that the left-handed domains have a lesser number of contacts per residue and a smaller radius of cross section than the right-handed domains. This may be one of the explanations of the observed fact. The same tendency is observed for the large dataset consisting of 332 three-helix proteins (238 right- and 94 left-handed). From our analysis, we found that the left-handed three-helix proteins have some less-dense packing that should result in faster folding for some proteins as compared to the case of right-handed proteins. Copyright © 2013 Wiley Periodicals, Inc.

  2. Two states or not two states: Single-molecule folding studies of protein L

    Science.gov (United States)

    Aviram, Haim Yuval; Pirchi, Menahem; Barak, Yoav; Riven, Inbal; Haran, Gilad

    2018-03-01

    Experimental tools of increasing sophistication have been employed in recent years to study protein folding and misfolding. Folding is considered a complex process, and one way to address it is by studying small proteins, which seemingly possess a simple energy landscape with essentially only two stable states, either folded or unfolded. The B1-IgG binding domain of protein L (PL) is considered a model two-state folder, based on measurements using a wide range of experimental techniques. We applied single-molecule fluorescence resonance energy transfer (FRET) spectroscopy in conjunction with a hidden Markov model analysis to fully characterize the energy landscape of PL and to extract the kinetic properties of individual molecules of the protein. Surprisingly, our studies revealed the existence of a third state, hidden under the two-state behavior of PL due to its small population, ˜7%. We propose that this minority intermediate involves partial unfolding of the two C-terminal β strands of PL. Our work demonstrates that single-molecule FRET spectroscopy can be a powerful tool for a comprehensive description of the folding dynamics of proteins, capable of detecting and characterizing relatively rare metastable states that are difficult to observe in ensemble studies.

  3. What determines the structures of native folds of proteins?

    International Nuclear Information System (INIS)

    Trovato, Antonio; Hoang, Trinh X; Banavar, Jayanth R; Maritan, Amos; Seno, Flavio

    2005-01-01

    We review a simple physical model (Hoang et al 2004 Proc. Natl Acad. Sci. USA 101 7960, Banavar et al 2004 Phys. Rev. E at press) which captures the essential physico-chemical ingredients that determine protein structure, such as the inherent anisotropy of a chain molecule, the geometrical and energetic constraints placed by hydrogen bonds, sterics, and hydrophobicity. Within this framework, marginally compact conformations resembling the native state folds of proteins emerge as competing minima in the free energy landscape. Here we demonstrate that a hydrophobic-polar (HP) sequence composed of regularly repeated patterns has as its ground state a β-helical structure remarkably similar to a known architecture in the Protein Data Bank

  4. Fast identification of folded human protein domains expressed in E. coli suitable for structural analysis

    Directory of Open Access Journals (Sweden)

    Schlegel Brigitte

    2004-03-01

    Full Text Available Abstract Background High-throughput protein structure analysis of individual protein domains requires analysis of large numbers of expression clones to identify suitable constructs for structure determination. For this purpose, methods need to be implemented for fast and reliable screening of the expressed proteins as early as possible in the overall process from cloning to structure determination. Results 88 different E. coli expression constructs for 17 human protein domains were analysed using high-throughput cloning, purification and folding analysis to obtain candidates suitable for structural analysis. After 96 deep-well microplate expression and automated protein purification, protein domains were directly analysed using 1D 1H-NMR spectroscopy. In addition, analytical hydrophobic interaction chromatography (HIC was used to detect natively folded protein. With these two analytical methods, six constructs (representing two domains were quickly identified as being well folded and suitable for structural analysis. Conclusion The described approach facilitates high-throughput structural analysis. Clones expressing natively folded proteins suitable for NMR structure determination were quickly identified upon small scale expression screening using 1D 1H-NMR and/or analytical HIC. This procedure is especially effective as a fast and inexpensive screen for the 'low hanging fruits' in structural genomics.

  5. Fast mapping of global protein folding states by multivariate NMR:

    DEFF Research Database (Denmark)

    Malmendal, Anders; Underhaug, Jarl; Otzen, Daniel

    2010-01-01

    To obtain insight into the functions of proteins and their specific roles, it is important to establish efficient procedures for exploring the states that encapsulate their conformational space. Global Protein folding State mapping by multivariate NMR (GPS NMR) is a powerful high-throughput method......-lactalbumin in the presence of the anionic surfactant sodium dodecyl sulfate, SDS, and compare these with other surfactants, acid, denaturants and heat....

  6. Equilibrium amide hydrogen exchange and protein folding kinetics

    International Nuclear Information System (INIS)

    Bai Yawen

    1999-01-01

    The classical Linderstrom-Lang hydrogen exchange (HX) model is extended to describe the relationship between the HX behaviors (EX1 and EX2) and protein folding kinetics for the amide protons that can only exchange by global unfolding in a three-state system including native (N), intermediate (I), and unfolded (U) states. For these slowly exchanging amide protons, it is shown that the existence of an intermediate (I) has no effect on the HX behavior in an off-pathway three-state system (I↔U↔N). On the other hand, in an on-pathway three-state system (U↔I↔N), the existence of a stable folding intermediate has profound effect on the HX behavior. It is shown that fast refolding from the unfolded state to the stable intermediate state alone does not guarantee EX2 behavior. The rate of refolding from the intermediate state to the native state also plays a crucial role in determining whether EX1 or EX2 behavior should occur. This is mainly due to the fact that only amide protons in the native state are observed in the hydrogen exchange experiment. These new concepts suggest that caution needs to be taken if one tries to derive the kinetic events of protein folding from equilibrium hydrogen exchange experiments

  7. Development and application of a free energy force field for all atom protein folding

    International Nuclear Information System (INIS)

    Verma, A.

    2007-11-01

    Proteins are the workhorses of all cellular life. They constitute the building blocks and the machinery of all cells and typically function in specific three-dimensional conformations into which each protein folds. Currently over one million protein sequences are known, compared to about 40,000 structures deposited in the Protein Data Bank (the world-wide database of protein structures). Reliable theoretical methods for protein structure prediction could help to reduce the gap between sequence and structural databases and elucidate the biological information in structurally unresolved sequences. In this thesis we explore an approach for protein structure prediction and folding that is based on the Anfinsen's hypothesis that most proteins in their native state are in thermodynamic equilibrium with their environment. We have developed a free energy forcefield (PFF02) that locates the native conformation of many proteins from all structural classes at the global minimum of the free-energy model. We have validated the forcefield against a large decoy set (Rosetta). The average root mean square deviation (RMSD) for the lowest energy structure for the 32 proteins of the decoy set was only 2.14 Aa from the experimental conformation. We have successfully implemented and used stochastic optimization methods, such as the basin hopping technique and evolutionary algorithms for all atom protein structure prediction. The evolutionary algorithm performs exceptionally well on large supercomputational architectures, such as BlueGene and MareNostrum. Using the PFF02 forcefield, we were able to fold 13 proteins (12-56 amino acids), which include helix, sheet and mixed secondary structure. On average the predicted structure of these proteins deviated from their experimental conformation by only 2.89 Aa RMSD. (orig.)

  8. PyFolding: Open-Source Graphing, Simulation, and Analysis of the Biophysical Properties of Proteins.

    Science.gov (United States)

    Lowe, Alan R; Perez-Riba, Albert; Itzhaki, Laura S; Main, Ewan R G

    2018-02-06

    For many years, curve-fitting software has been heavily utilized to fit simple models to various types of biophysical data. Although such software packages are easy to use for simple functions, they are often expensive and present substantial impediments to applying more complex models or for the analysis of large data sets. One field that is reliant on such data analysis is the thermodynamics and kinetics of protein folding. Over the past decade, increasingly sophisticated analytical models have been generated, but without simple tools to enable routine analysis. Consequently, users have needed to generate their own tools or otherwise find willing collaborators. Here we present PyFolding, a free, open-source, and extensible Python framework for graphing, analysis, and simulation of the biophysical properties of proteins. To demonstrate the utility of PyFolding, we have used it to analyze and model experimental protein folding and thermodynamic data. Examples include: 1) multiphase kinetic folding fitted to linked equations, 2) global fitting of multiple data sets, and 3) analysis of repeat protein thermodynamics with Ising model variants. Moreover, we demonstrate how PyFolding is easily extensible to novel functionality beyond applications in protein folding via the addition of new models. Example scripts to perform these and other operations are supplied with the software, and we encourage users to contribute notebooks and models to create a community resource. Finally, we show that PyFolding can be used in conjunction with Jupyter notebooks as an easy way to share methods and analysis for publication and among research teams. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  9. Some physical approaches to protein folding

    Science.gov (United States)

    Bascle, J.; Garel, T.; Orland, H.

    1993-02-01

    To understand how a protein folds is a problem which has important biological implications. In this article, we would like to present a physics-oriented point of view, which is twofold. First of all, we introduce simple statistical mechanics models which display, in the thermodynamic limit, folding and related transitions. These models can be divided into (i) crude spin glass-like models (with their Mattis analogs), where one may look for possible correlations between the chain self-interactions and the folded structure, (ii) glass-like models, where one emphasizes the geometrical competition between one- or two-dimensional local order (mimicking α helix or β sheet structures), and the requirement of global compactness. Both models are too simple to predict the spatial organization of a realistic protein, but are useful for the physicist and should have some feedback in other glassy systems (glasses, collapsed polymers .... ). These remarks lead us to the second physical approach, namely a new Monte-Carlo method, where one grows the protein atom-by-atom (or residue-by-residue), using a standard form (CHARMM .... ) for the total energy. A detailed comparison with other Monte-Carlo schemes, or Molecular Dynamics calculations, is then possible; we will sketch such a comparison for poly-alanines. Our twofold approach illustrates some of the difficulties one encounters in the protein folding problem, in particular those associated with the existence of a large number of metastable states. Le repliement des protéines est un problème qui a de nombreuses implications biologiques. Dans cet article, nous présentons, de deux façons différentes, un point de vue de physicien. Nous introduisons tout d'abord des modèles simples de mécanique statistique qui exhibent, à la limite thermodynamique, des transitions de repliement. Ces modèles peuvent être divisés en (i) verres de spin (éventuellement à la Mattis), où l'on peut chercher des corrélations entre les

  10. Electrostatics, structure prediction, and the energy landscapes for protein folding and binding.

    Science.gov (United States)

    Tsai, Min-Yeh; Zheng, Weihua; Balamurugan, D; Schafer, Nicholas P; Kim, Bobby L; Cheung, Margaret S; Wolynes, Peter G

    2016-01-01

    While being long in range and therefore weakly specific, electrostatic interactions are able to modulate the stability and folding landscapes of some proteins. The relevance of electrostatic forces for steering the docking of proteins to each other is widely acknowledged, however, the role of electrostatics in establishing specifically funneled landscapes and their relevance for protein structure prediction are still not clear. By introducing Debye-Hückel potentials that mimic long-range electrostatic forces into the Associative memory, Water mediated, Structure, and Energy Model (AWSEM), a transferable protein model capable of predicting tertiary structures, we assess the effects of electrostatics on the landscapes of thirteen monomeric proteins and four dimers. For the monomers, we find that adding electrostatic interactions does not improve structure prediction. Simulations of ribosomal protein S6 show, however, that folding stability depends monotonically on electrostatic strength. The trend in predicted melting temperatures of the S6 variants agrees with experimental observations. Electrostatic effects can play a range of roles in binding. The binding of the protein complex KIX-pKID is largely assisted by electrostatic interactions, which provide direct charge-charge stabilization of the native state and contribute to the funneling of the binding landscape. In contrast, for several other proteins, including the DNA-binding protein FIS, electrostatics causes frustration in the DNA-binding region, which favors its binding with DNA but not with its protein partner. This study highlights the importance of long-range electrostatics in functional responses to problems where proteins interact with their charged partners, such as DNA, RNA, as well as membranes. © 2015 The Protein Society.

  11. Introducing the Levinthal's Protein Folding Paradox and Its Solution

    Science.gov (United States)

    Martínez, Leandro

    2014-01-01

    The protein folding (Levinthal's) paradox states that it would not be possible in a physically meaningful time to a protein to reach the native (functional) conformation by a random search of the enormously large number of possible structures. This paradox has been solved: it was shown that small biases toward the native conformation result…

  12. Improving decoy databases for protein folding algorithms

    KAUST Repository

    Lindsey, Aaron

    2014-01-01

    Copyright © 2014 ACM. Predicting protein structures and simulating protein folding are two of the most important problems in computational biology today. Simulation methods rely on a scoring function to distinguish the native structure (the most energetically stable) from non-native structures. Decoy databases are collections of non-native structures used to test and verify these functions. We present a method to evaluate and improve the quality of decoy databases by adding novel structures and removing redundant structures. We test our approach on 17 different decoy databases of varying size and type and show significant improvement across a variety of metrics. We also test our improved databases on a popular modern scoring function and show that they contain a greater number of native-like structures than the original databases, thereby producing a more rigorous database for testing scoring functions.

  13. An essential nonredundant role for mycobacterial DnaK in native protein folding.

    Directory of Open Access Journals (Sweden)

    Allison Fay

    2014-07-01

    Full Text Available Protein chaperones are essential in all domains of life to prevent and resolve protein misfolding during translation and proteotoxic stress. HSP70 family chaperones, including E. coli DnaK, function in stress induced protein refolding and degradation, but are dispensable for cellular viability due to redundant chaperone systems that prevent global nascent peptide insolubility. However, the function of HSP70 chaperones in mycobacteria, a genus that includes multiple human pathogens, has not been examined. We find that mycobacterial DnaK is essential for cell growth and required for native protein folding in Mycobacterium smegmatis. Loss of DnaK is accompanied by proteotoxic collapse characterized by the accumulation of insoluble newly synthesized proteins. DnaK is required for solubility of large multimodular lipid synthases, including the essential lipid synthase FASI, and DnaK loss is accompanied by disruption of membrane structure and increased cell permeability. Trigger Factor is nonessential and has a minor role in native protein folding that is only evident in the absence of DnaK. In unstressed cells, DnaK localizes to multiple, dynamic foci, but relocalizes to focal protein aggregates during stationary phase or upon expression of aggregating peptides. Mycobacterial cells restart cell growth after proteotoxic stress by isolating persistent DnaK containing protein aggregates away from daughter cells. These results reveal unanticipated essential nonredunant roles for mycobacterial DnaK in mycobacteria and indicate that DnaK defines a unique susceptibility point in the mycobacterial proteostasis network.

  14. Chemical Ligation of Folded Recombinant Proteins: Segmental Isotopic Labeling of Domains for NMR Studies

    Science.gov (United States)

    Xu, Rong; Ayers, Brenda; Cowburn, David; Muir, Tom W.

    1999-01-01

    A convenient in vitro chemical ligation strategy has been developed that allows folded recombinant proteins to be joined together. This strategy permits segmental, selective isotopic labeling of the product. The src homology type 3 and 2 domains (SH3 and SH2) of Abelson protein tyrosine kinase, which constitute the regulatory apparatus of the protein, were individually prepared in reactive forms that can be ligated together under normal protein-folding conditions to form a normal peptide bond at the ligation junction. This strategy was used to prepare NMR sample quantities of the Abelson protein tyrosine kinase-SH(32) domain pair, in which only one of the domains was labeled with 15N Mass spectrometry and NMR analyses were used to confirm the structure of the ligated protein, which was also shown to have appropriate ligand-binding properties. The ability to prepare recombinant proteins with selectively labeled segments having a single-site mutation, by using a combination of expression of fusion proteins and chemical ligation in vitro, will increase the size limits for protein structural determination in solution with NMR methods. In vitro chemical ligation of expressed protein domains will also provide a combinatorial approach to the synthesis of linked protein domains.

  15. Atomic force microscopy and force spectroscopy on the assessment of protein folding and functionality.

    Science.gov (United States)

    Carvalho, Filomena A; Martins, Ivo C; Santos, Nuno C

    2013-03-01

    Atomic force microscopy (AFM) applied to biological systems can, besides generating high-quality and well-resolved images, be employed to study protein folding via AFM-based force spectroscopy. This approach allowed remarkable advances in the measurement of inter- and intramolecular interaction forces with piconewton resolution. The detection of specific interaction forces between molecules based on the AFM sensitivity and the manipulation of individual molecules greatly advanced the understanding of intra-protein and protein-ligand interactions. Apart from the academic interest in the resolution of basic scientific questions, this technique has also key importance on the clarification of several biological questions of immediate biomedical relevance. Force spectroscopy is an especially appropriate technique for "mechanical proteins" that can provide crucial information on single protein molecules and/or domains. Importantly, it also has the potential of combining in a single experiment spatial and kinetic measurements. Here, the main principles of this methodology are described, after which the ability to measure interactions at the single-molecule level is discussed, in the context of relevant protein-folding examples. We intend to demonstrate the potential of AFM-based force spectroscopy in the study of protein folding, especially since this technique is able to circumvent some of the difficulties typically encountered in classical thermal/chemical denaturation studies. Copyright © 2012 Elsevier Inc. All rights reserved.

  16. A replica exchange Monte Carlo algorithm for protein folding in the HP model

    Directory of Open Access Journals (Sweden)

    Shmygelska Alena

    2007-09-01

    Full Text Available Abstract Background The ab initio protein folding problem consists of predicting protein tertiary structure from a given amino acid sequence by minimizing an energy function; it is one of the most important and challenging problems in biochemistry, molecular biology and biophysics. The ab initio protein folding problem is computationally challenging and has been shown to be NP MathType@MTEF@5@5@+=feaafiart1ev1aaatCvAUfKttLearuWrP9MDH5MBPbIqV92AaeXatLxBI9gBaebbnrfifHhDYfgasaacH8akY=wiFfYdH8Gipec8Eeeu0xXdbba9frFj0=OqFfea0dXdd9vqai=hGuQ8kuc9pgc9s8qqaq=dirpe0xb9q8qiLsFr0=vr0=vr0dc8meaabaqaciaacaGaaeqabaqabeGadaaakeaat0uy0HwzTfgDPnwy1egaryqtHrhAL1wy0L2yHvdaiqaacqWFneVtcqqGqbauaaa@3961@-hard even when conformations are restricted to a lattice. In this work, we implement and evaluate the replica exchange Monte Carlo (REMC method, which has already been applied very successfully to more complex protein models and other optimization problems with complex energy landscapes, in combination with the highly effective pull move neighbourhood in two widely studied Hydrophobic Polar (HP lattice models. Results We demonstrate that REMC is highly effective for solving instances of the square (2D and cubic (3D HP protein folding problem. When using the pull move neighbourhood, REMC outperforms current state-of-the-art algorithms for most benchmark instances. Additionally, we show that this new algorithm provides a larger ensemble of ground-state structures than the existing state-of-the-art methods. Furthermore, it scales well with sequence length, and it finds significantly better conformations on long biological sequences and sequences with a provably unique ground-state structure, which is believed to be a characteristic of real proteins. We also present evidence that our REMC algorithm can fold sequences which exhibit significant interaction between termini in the hydrophobic core relatively easily. Conclusion We demonstrate that REMC utilizing the pull move

  17. A versatile selection system for folding competent proteins using genetic complementation in a eukaryotic host

    DEFF Research Database (Denmark)

    Lyngsø, C.; Kjaerulff, S.; Muller, S.

    2010-01-01

    in vivo selection system for folded proteins. It is based on genetic complementation of the Schizosaccharomyces pombe growth marker gene invertase fused C-terminally to a protein library. The fusion proteins are directed to the secretion system, utilizing the ability of the eukaryotic protein quality...

  18. Characterisation of transition state structures for protein folding using 'high', 'medium' and 'low' {Phi}-values.

    Science.gov (United States)

    Geierhaas, Christian D; Salvatella, Xavier; Clarke, Jane; Vendruscolo, Michele

    2008-03-01

    It has been suggested that Phi-values, which allow structural information about transition states (TSs) for protein folding to be obtained, are most reliably interpreted when divided into three classes (high, medium and low). High Phi-values indicate almost completely folded regions in the TS, intermediate Phi-values regions with a detectable amount of structure and low Phi-values indicate mostly unstructured regions. To explore the extent to which this classification can be used to characterise in detail the structure of TSs for protein folding, we used Phi-values divided into these classes as restraints in molecular dynamics simulations. This type of procedure is related to that used in NMR spectroscopy to define the structure of native proteins from the measurement of inter-proton distances derived from nuclear Overhauser effects. We illustrate this approach by determining the TS ensembles of five proteins and by showing that the results are similar to those obtained by using as restraints the actual numerical Phi-values measured experimentally. Our results indicate that the simultaneous consideration of a set of low-resolution Phi-values can provide sufficient information for characterising the architecture of a TS for folding of a protein.

  19. The ribosome can prevent aggregation of partially folded protein intermediates: studies using the Escherichia coli ribosome.

    Directory of Open Access Journals (Sweden)

    Bani Kumar Pathak

    Full Text Available BACKGROUND: Molecular chaperones that support de novo folding of proteins under non stress condition are classified as chaperone 'foldases' that are distinct from chaperone' holdases' that provide high affinity binding platform for unfolded proteins and prevent their aggregation specifically under stress conditions. Ribosome, the cellular protein synthesis machine can act as a foldase chaperone that can bind unfolded proteins and release them in folding competent state. The peptidyl transferase center (PTC located in the domain V of the 23S rRNA of Escherichia coli ribosome (bDV RNA is the chaperoning center of the ribosome. It has been proposed that via specific interactions between the RNA and refolding proteins, the chaperone provides information for the correct folding of unfolded polypeptide chains. RESULTS: We demonstrate using Escherichia coli ribosome and variants of its domain V RNA that the ribosome can bind to partially folded intermediates of bovine carbonic anhydrase II (BCAII and lysozyme and suppress aggregation during their refolding. Using mutants of domain V RNA we demonstrate that the time for which the chaperone retains the bound protein is an important factor in determining its ability to suppress aggregation and/or support reactivation of protein. CONCLUSION: The ribosome can behave like a 'holdase' chaperone and has the ability to bind and hold back partially folded intermediate states of proteins from participating in the aggregation process. Since the ribosome is an essential organelle that is present in large numbers in all living cells, this ability of the ribosome provides an energetically inexpensive way to suppress cellular aggregation. Further, this ability of the ribosome might also be crucial in the context that the ribosome is one of the first chaperones to be encountered by a large nascent polypeptide chains that have a tendency to form partially folded intermediates immediately following their synthesis.

  20. Classification of protein fold classes by knot theory and prediction of folds by neural networks: A combined theoretical and experimental approach

    DEFF Research Database (Denmark)

    Ramnarayan, K.; Bohr, Henrik; Jalkanen, Karl J.

    2008-01-01

    We present different means of classifying protein structure. One is made rigorous by mathematical knot invariants that coincide reasonably well with ordinary graphical fold classification and another classification is by packing analysis. Furthermore when constructing our mathematical fold...... classifications, we utilize standard neural network methods for predicting protein fold classes from amino acid sequences. We also make an analysis of the redundancy of the structural classifications in relation to function and ligand binding. Finally we advocate the use of combining the measurement of the VA...

  1. Effect of the geometry of confining media on the stability and folding rate of α -helix proteins

    Science.gov (United States)

    Wang, Congyue; Piroozan, Nariman; Javidpour, Leili; Sahimi, Muhammad

    2018-05-01

    Protein folding in confined media has attracted wide attention over the past 15 years due to its importance to both in vivo and in vitro applications. It is generally believed that protein stability increases by decreasing the size of the confining medium, if the medium's walls are repulsive, and that the maximum folding temperature in confinement is in a pore whose size D0 is only slightly larger than the smallest dimension of a protein's folded state. Until recently, the stability of proteins in pores with a size very close to that of the folded state has not received the attention it deserves. In a previous paper [L. Javidpour and M. Sahimi, J. Chem. Phys. 135, 125101 (2011)], we showed that, contrary to the current theoretical predictions, the maximum folding temperature occurs in larger pores for smaller α-helices. Moreover, in very tight pores, the free energy surface becomes rough, giving rise to a new barrier for protein folding close to the unfolded state. In contrast to unbounded domains, in small nanopores proteins with an α-helical native state that contain the β structures are entropically stabilized implying that folding rates decrease notably and that the free energy surface becomes rougher. In view of the potential significance of such results to interpretation of many sets of experimental data that could not be explained by the current theories, particularly the reported anomalously low rates of folding and the importance of entropic effects on proteins' misfolded states in highly confined environments, we address the following question in the present paper: To what extent the geometry of a confined medium affects the stability and folding rates of proteins? Using millisecond-long molecular dynamics simulations, we study the problem in three types of confining media, namely, cylindrical and slit pores and spherical cavities. Most importantly, we find that the prediction of the previous theories that the dependence of the maximum folding

  2. From the test tube to the cell: exploring the folding and aggregation of a beta-clam protein.

    Science.gov (United States)

    Ignatova, Zoya; Krishnan, Beena; Bombardier, Jeffrey P; Marcelino, Anna Marie C; Hong, Jiang; Gierasch, Lila M

    2007-01-01

    A crucial challenge in present biomedical research is the elucidation of how fundamental processes like protein folding and aggregation occur in the complex environment of the cell. Many new physico-chemical factors like crowding and confinement must be considered, and immense technical hurdles must be overcome in order to explore these processes in vivo. Understanding protein misfolding and aggregation diseases and developing therapeutic strategies to these diseases demand that we gain mechanistic insight into behaviors and misbehaviors of proteins as they fold in vivo. We have developed a fluorescence approach using FlAsH labeling to study the thermodynamics of folding of a model beta-rich protein, cellular retinoic acid binding protein (CRABP) in Escherichia coli cells. The labeling approach has also enabled us to follow aggregation of a modified version of CRABP and chimeras between CRABP and huntingtin exon 1 with its glutamine repeat tract. In this article, we review our recent results using FlAsH labeling to study in-vivo folding and present new observations that hint at fundamental differences between the thermodynamics and kinetics of protein folding in vivo and in vitro.

  3. Identification of a key structural element for protein folding within beta-hairpin turns.

    Science.gov (United States)

    Kim, Jaewon; Brych, Stephen R; Lee, Jihun; Logan, Timothy M; Blaber, Michael

    2003-05-09

    Specific residues in a polypeptide may be key contributors to the stability and foldability of the unique native structure. Identification and prediction of such residues is, therefore, an important area of investigation in solving the protein folding problem. Atypical main-chain conformations can help identify strains within a folded protein, and by inference, positions where unique amino acids may have a naturally high frequency of occurrence due to favorable contributions to stability and folding. Non-Gly residues located near the left-handed alpha-helical region (L-alpha) of the Ramachandran plot are a potential indicator of structural strain. Although many investigators have studied mutations at such positions, no consistent energetic or kinetic contributions to stability or folding have been elucidated. Here we report a study of the effects of Gly, Ala and Asn substitutions found within the L-alpha region at a characteristic position in defined beta-hairpin turns within human acidic fibroblast growth factor, and demonstrate consistent effects upon stability and folding kinetics. The thermodynamic and kinetic data are compared to available data for similar mutations in other proteins, with excellent agreement. The results have identified that Gly at the i+3 position within a subset of beta-hairpin turns is a key contributor towards increasing the rate of folding to the native state of the polypeptide while leaving the rate of unfolding largely unchanged.

  4. Constructing a folding model for protein S6 guided by native fluctuations deduced from NMR structures

    International Nuclear Information System (INIS)

    Lammert, Heiko; Noel, Jeffrey K.; Haglund, Ellinor; Onuchic, José N.; Schug, Alexander

    2015-01-01

    The diversity in a set of protein nuclear magnetic resonance (NMR) structures provides an estimate of native state fluctuations that can be used to refine and enrich structure-based protein models (SBMs). Dynamics are an essential part of a protein’s functional native state. The dynamics in the native state are controlled by the same funneled energy landscape that guides the entire folding process. SBMs apply the principle of minimal frustration, drawn from energy landscape theory, to construct a funneled folding landscape for a given protein using only information from the native structure. On an energy landscape smoothed by evolution towards minimal frustration, geometrical constraints, imposed by the native structure, control the folding mechanism and shape the native dynamics revealed by the model. Native-state fluctuations can alternatively be estimated directly from the diversity in the set of NMR structures for a protein. Based on this information, we identify a highly flexible loop in the ribosomal protein S6 and modify the contact map in a SBM to accommodate the inferred dynamics. By taking into account the probable native state dynamics, the experimental transition state is recovered in the model, and the correct order of folding events is restored. Our study highlights how the shared energy landscape connects folding and function by showing that a better description of the native basin improves the prediction of the folding mechanism

  5. Folding Membrane Proteins by Deep Transfer Learning

    KAUST Repository

    Wang, Sheng

    2017-08-29

    Computational elucidation of membrane protein (MP) structures is challenging partially due to lack of sufficient solved structures for homology modeling. Here, we describe a high-throughput deep transfer learning method that first predicts MP contacts by learning from non-MPs and then predicts 3D structure models using the predicted contacts as distance restraints. Tested on 510 non-redundant MPs, our method has contact prediction accuracy at least 0.18 better than existing methods, predicts correct folds for 218 MPs, and generates 3D models with root-mean-square deviation (RMSD) less than 4 and 5 Å for 57 and 108 MPs, respectively. A rigorous blind test in the continuous automated model evaluation project shows that our method predicted high-resolution 3D models for two recent test MPs of 210 residues with RMSD ∼2 Å. We estimated that our method could predict correct folds for 1,345–1,871 reviewed human multi-pass MPs including a few hundred new folds, which shall facilitate the discovery of drugs targeting at MPs.

  6. Steady-state structural fluctuation is a predictor of the necessity of pausing-mediated co-translational folding for small proteins.

    Science.gov (United States)

    Huang, Wenxi; Liu, Wanting; Jin, Jingjie; Xiao, Qilan; Lu, Ruibin; Chen, Wei; Xiong, Sheng; Zhang, Gong

    2018-03-25

    Translational pausing coordinates protein synthesis and co-translational folding. It is a common factor that facilitates the correct folding of large, multi-domain proteins. For small proteins, pausing sites rarely occurs in the gene body, and the 3'-end pausing sites are only essential for the folding of a fraction of proteins. The determinant of the necessity of the pausings remains obscure. In this study, we demonstrated that the steady-state structural fluctuation is a predictor of the necessity of pausing-mediated co-translational folding for small proteins. Validated by experiments with 5 model proteins, we found that the rigid protein structures do not, while the flexible structures do need 3'-end pausings to fold correctly. Therefore, rational optimization of translational pausing can improve soluble expression of small proteins with flexible structures, but not the rigid ones. The rigidity of the structure can be quantitatively estimated in silico using molecular dynamic simulation. Nevertheless, we also found that the translational pausing optimization increases the fitness of the expression host, and thus benefits the recombinant protein production, independent from the soluble expression. These results shed light on the structural basis of the translational pausing and provided a practical tool for industrial protein fermentation. Copyright © 2017. Published by Elsevier Inc.

  7. Recent developments in the theory of protein folding: searching for the global energy minimum.

    Science.gov (United States)

    Scheraga, H A

    1996-04-16

    Statistical mechanical theories and computer simulation are being used to gain an understanding of the fundamental features of protein folding. A major obstacle in the computation of protein structures is the multiple-minima problem arising from the existence of many local minima in the multidimensional energy landscape of the protein. This problem has been surmounted for small open-chain and cyclic peptides, and for regular-repeating sequences of models of fibrous proteins. Progress is being made in resolving this problem for globular proteins.

  8. Structural Conservation of the Myoviridae Phage Tail Sheath Protein Fold

    Energy Technology Data Exchange (ETDEWEB)

    Aksyuk, Anastasia A.; Kurochkina, Lidia P.; Fokine, Andrei; Forouhar, Farhad; Mesyanzhinov, Vadim V.; Tong, Liang; Rossmann, Michael G. (SOIBC); (Purdue); (Columbia)

    2012-02-21

    Bacteriophage phiKZ is a giant phage that infects Pseudomonas aeruginosa, a human pathogen. The phiKZ virion consists of a 1450 {angstrom} diameter icosahedral head and a 2000 {angstrom}-long contractile tail. The structure of the whole virus was previously reported, showing that its tail organization in the extended state is similar to the well-studied Myovirus bacteriophage T4 tail. The crystal structure of a tail sheath protein fragment of phiKZ was determined to 2.4 {angstrom} resolution. Furthermore, crystal structures of two prophage tail sheath proteins were determined to 1.9 and 3.3 {angstrom} resolution. Despite low sequence identity between these proteins, all of these structures have a similar fold. The crystal structure of the phiKZ tail sheath protein has been fitted into cryo-electron-microscopy reconstructions of the extended tail sheath and of a polysheath. The structural rearrangement of the phiKZ tail sheath contraction was found to be similar to that of phage T4.

  9. Systematic analysis of short internal indels and their impact on protein folding

    Directory of Open Access Journals (Sweden)

    Guo Jun-tao

    2010-08-01

    Full Text Available Abstract Background Protein sequence insertions/deletions (indels can be introduced during evolution or through alternative splicing (AS. Alternative splicing is an important biological phenomenon and is considered as the major means of expanding structural and functional diversity in eukaryotes. Knowledge of the structural changes due to indels is critical to our understanding of the evolution of protein structure and function. In addition, it can help us probe the evolution of alternative splicing and the diversity of functional isoforms. However, little is known about the effects of indels, in particular the ones involving core secondary structures, on the folding of protein structures. The long term goal of our study is to accurately predict the protein AS isoform structures. As a first step towards this goal, we performed a systematic analysis on the structural changes caused by short internal indels through mining highly homologous proteins in Protein Data Bank (PDB. Results We compiled a non-redundant dataset of short internal indels (2-40 amino acids from highly homologous protein pairs and analyzed the sequence and structural features of the indels. We found that about one third of indel residues are in disordered state and majority of the residues are exposed to solvent, suggesting that these indels are generally located on the surface of proteins. Though naturally occurring indels are fewer than engineered ones in the dataset, there are no statistically significant differences in terms of amino acid frequencies and secondary structure types between the "Natural" indels and "All" indels in the dataset. Structural comparisons show that all the protein pairs with short internal indels in the dataset preserve the structural folds and about 85% of protein pairs have global RMSDs (root mean square deviations of 2Å or less, suggesting that protein structures tend to be conserved and can tolerate short insertions and deletions. A few pairs

  10. Adaptive local learning in sampling based motion planning for protein folding.

    Science.gov (United States)

    Ekenna, Chinwe; Thomas, Shawna; Amato, Nancy M

    2016-08-01

    Simulating protein folding motions is an important problem in computational biology. Motion planning algorithms, such as Probabilistic Roadmap Methods, have been successful in modeling the folding landscape. Probabilistic Roadmap Methods and variants contain several phases (i.e., sampling, connection, and path extraction). Most of the time is spent in the connection phase and selecting which variant to employ is a difficult task. Global machine learning has been applied to the connection phase but is inefficient in situations with varying topology, such as those typical of folding landscapes. We develop a local learning algorithm that exploits the past performance of methods within the neighborhood of the current connection attempts as a basis for learning. It is sensitive not only to different types of landscapes but also to differing regions in the landscape itself, removing the need to explicitly partition the landscape. We perform experiments on 23 proteins of varying secondary structure makeup with 52-114 residues. We compare the success rate when using our methods and other methods. We demonstrate a clear need for learning (i.e., only learning methods were able to validate against all available experimental data) and show that local learning is superior to global learning producing, in many cases, significantly higher quality results than the other methods. We present an algorithm that uses local learning to select appropriate connection methods in the context of roadmap construction for protein folding. Our method removes the burden of deciding which method to use, leverages the strengths of the individual input methods, and it is extendable to include other future connection methods.

  11. Soft Computing Techniques for the Protein Folding Problem on High Performance Computing Architectures.

    Science.gov (United States)

    Llanes, Antonio; Muñoz, Andrés; Bueno-Crespo, Andrés; García-Valverde, Teresa; Sánchez, Antonia; Arcas-Túnez, Francisco; Pérez-Sánchez, Horacio; Cecilia, José M

    2016-01-01

    The protein-folding problem has been extensively studied during the last fifty years. The understanding of the dynamics of global shape of a protein and the influence on its biological function can help us to discover new and more effective drugs to deal with diseases of pharmacological relevance. Different computational approaches have been developed by different researchers in order to foresee the threedimensional arrangement of atoms of proteins from their sequences. However, the computational complexity of this problem makes mandatory the search for new models, novel algorithmic strategies and hardware platforms that provide solutions in a reasonable time frame. We present in this revision work the past and last tendencies regarding protein folding simulations from both perspectives; hardware and software. Of particular interest to us are both the use of inexact solutions to this computationally hard problem as well as which hardware platforms have been used for running this kind of Soft Computing techniques.

  12. Protein folding: Defining a standard set of experimental conditions and a preliminary kinetic data set of two-state proteins

    DEFF Research Database (Denmark)

    Maxwell, Karen L.; Wildes, D.; Zarrine-Afsar, A.

    2005-01-01

    Recent years have seen the publication of both empirical and theoretical relationships predicting the rates with which proteins fold. Our ability to test and refine these relationships has been limited, however, by a variety of difficulties associated with the comparison of folding and unfolding ...... efforts is to set uniform standards for the experimental community and to initiate an accumulating, self-consistent data set that will aid ongoing efforts to understand the folding process....... constructs. The lack of a single approach to data analysis and error estimation, or even of a common set of units and reporting standards, further hinders comparative studies of folding. In an effort to overcome these problems, we define here a consensus set of experimental conditions (25°C at pH 7.0, 50 m...... rates, thermodynamics, and structure across diverse sets of proteins. These difficulties include the wide, potentially confounding range of experimental conditions and methods employed to date and the difficulty of obtaining correct and complete sequence and structural details for the characterized...

  13. Chemical Denaturants Smoothen Ruggedness on the Free Energy Landscape of Protein Folding.

    Science.gov (United States)

    Malhotra, Pooja; Jethva, Prashant N; Udgaonkar, Jayant B

    2017-08-08

    To characterize experimentally the ruggedness of the free energy landscape of protein folding is challenging, because the distributed small free energy barriers are usually dominated by one, or a few, large activation free energy barriers. This study delineates changes in the roughness of the free energy landscape by making use of the observation that a decrease in ruggedness is accompanied invariably by an increase in folding cooperativity. Hydrogen exchange (HX) coupled to mass spectrometry was used to detect transient sampling of local energy minima and the global unfolded state on the free energy landscape of the small protein single-chain monellin. Under native conditions, local noncooperative openings result in interconversions between Boltzmann-distributed intermediate states, populated on an extremely rugged "uphill" energy landscape. The cooperativity of these interconversions was increased by selectively destabilizing the native state via mutations, and further by the addition of a chemical denaturant. The perturbation of stability alone resulted in seven backbone amide sites exchanging cooperatively. The size of the cooperatively exchanging and/or unfolding unit did not depend on the extent of protein destabilization. Only upon the addition of a denaturant to a destabilized mutant variant did seven additional backbone amide sites exchange cooperatively. Segmentwise analysis of the HX kinetics of the mutant variants further confirmed that the observed increase in cooperativity was due to the smoothing of the ruggedness of the free energy landscape of folding of the protein by the chemical denaturant.

  14. Exploring the Evolutionary Accident Hypothesis: Are Extant Protein Folds the Fittest or the Luckiest?

    Science.gov (United States)

    Shannon, G.; Wei, C.; Pohorille, A.

    2017-01-01

    Considering the range of functions proteins perform, it is surprising they fold into a relatively small set of structures or "folds" that facilitate such function. One explanation is that only a minority were fit enough to emerge from Darwinian selection during the early evolution of life. Alternatively, perhaps only a fraction of all possible folds were trialed. Understanding proto-catalyst selection will aid understanding of the origins and early evolution of life. To investigate which explanation is correct, we study a protein evolved in vitro to bind ATP by Jack Szostak (Fig. 1). This protein adopts a fold which is absent from nature. We are testing whether this fold would have possessed the capability to evolve that would have been essential to survive natural selection on early Earth. Folds that couldn't improve their fitness and evolve to perform new functions would have been replaced by rivals that could. To determine whether the fold is evolvable, we are attempting to change the function of the protein by rationally redesigning to bind GTP. Two design strategies in the region of the nucleobase have been implemented to provide hydrogen bonding partners for the ligand i) an insertion ii) a MET to ASN mutation. Redesigns are being studied computationally at Ames Research Center including free energy of binding calculations. Binding affinities of promising redesigns are to be validated by experimental collaborators at ForteBio using Super Streptavidin Biosensors. If the fold is found to be non-evolvable, this may suggest that many structures were trialed, but the majority were pruned on the basis of their evolvability. Alternatively, if the fold is demonstrated to be evolvable, it would be difficult to explain its absence from nature without considering the possibility that the fold simply wasn't sampled on early Earth. This would not only further our understanding of the origins of life on Earth but also suggest a common phe-nomenon of proto

  15. Recognition of secretory proteins in Escherichia coli requires signals in addition to the signal sequence and slow folding

    Directory of Open Access Journals (Sweden)

    Flower Ann M

    2002-11-01

    Full Text Available Abstract Background The Sec-dependent protein export apparatus of Escherichia coli is very efficient at correctly identifying proteins to be exported from the cytoplasm. Even bacterial strains that carry prl mutations, which allow export of signal sequence-defective precursors, accurately differentiate between cytoplasmic and mutant secretory proteins. It was proposed previously that the basis for this precise discrimination is the slow folding rate of secretory proteins, resulting in binding by the secretory chaperone, SecB, and subsequent targeting to translocase. Based on this proposal, we hypothesized that a cytoplasmic protein containing a mutation that slows its rate of folding would be recognized by SecB and therefore targeted to the Sec pathway. In a Prl suppressor strain the mutant protein would be exported to the periplasm due to loss of ability to reject non-secretory proteins from the pathway. Results In the current work, we tested this hypothesis using a mutant form of λ repressor that folds slowly. No export of the mutant protein was observed, even in a prl strain. We then examined binding of the mutant λ repressor to SecB. We did not observe interaction by either of two assays, indicating that slow folding is not sufficient for SecB binding and targeting to translocase. Conclusions These results strongly suggest that to be targeted to the export pathway, secretory proteins contain signals in addition to the canonical signal sequence and the rate of folding.

  16. Analysis of protein folds using protein contact networks

    Indian Academy of Sciences (India)

    is a well-recognized classification system of proteins, which is based on manual in- ... can easily correspond to the information in the 2D matrix. ..... [7] U K Muppirala and Zhijun Li, Protein Engineering, Design & Selection 19, 265 (2006).

  17. FRAN and RBF-PSO as two components of a hyper framework to recognize protein folds.

    Science.gov (United States)

    Abbasi, Elham; Ghatee, Mehdi; Shiri, M E

    2013-09-01

    In this paper, an intelligent hyper framework is proposed to recognize protein folds from its amino acid sequence which is a fundamental problem in bioinformatics. This framework includes some statistical and intelligent algorithms for proteins classification. The main components of the proposed framework are the Fuzzy Resource-Allocating Network (FRAN) and the Radial Bases Function based on Particle Swarm Optimization (RBF-PSO). FRAN applies a dynamic method to tune up the RBF network parameters. Due to the patterns complexity captured in protein dataset, FRAN classifies the proteins under fuzzy conditions. Also, RBF-PSO applies PSO to tune up the RBF classifier. Experimental results demonstrate that FRAN improves prediction accuracy up to 51% and achieves acceptable multi-class results for protein fold prediction. Although RBF-PSO provides reasonable results for protein fold recognition up to 48%, it is weaker than FRAN in some cases. However the proposed hyper framework provides an opportunity to use a great range of intelligent methods and can learn from previous experiences. Thus it can avoid the weakness of some intelligent methods in terms of memory, computational time and static structure. Furthermore, the performance of this system can be enhanced throughout the system life-cycle. Copyright © 2013 Elsevier Ltd. All rights reserved.

  18. Protein Folding Free Energy Landscape along the Committor - the Optimal Folding Coordinate.

    Science.gov (United States)

    Krivov, Sergei V

    2018-06-06

    Recent advances in simulation and experiment have led to dramatic increases in the quantity and complexity of produced data, which makes the development of automated analysis tools very important. A powerful approach to analyze dynamics contained in such data sets is to describe/approximate it by diffusion on a free energy landscape - free energy as a function of reaction coordinates (RC). For the description to be quantitatively accurate, RCs should be chosen in an optimal way. Recent theoretical results show that such an optimal RC exists; however, determining it for practical systems is a very difficult unsolved problem. Here we describe a solution to this problem. We describe an adaptive nonparametric approach to accurately determine the optimal RC (the committor) for an equilibrium trajectory of a realistic system. In contrast to alternative approaches, which require a functional form with many parameters to approximate an RC and thus extensive expertise with the system, the suggested approach is nonparametric and can approximate any RC with high accuracy without system specific information. To avoid overfitting for a realistically sampled system, the approach performs RC optimization in an adaptive manner by focusing optimization on less optimized spatiotemporal regions of the RC. The power of the approach is illustrated on a long equilibrium atomistic folding simulation of HP35 protein. We have determined the optimal folding RC - the committor, which was confirmed by passing a stringent committor validation test. It allowed us to determine a first quantitatively accurate protein folding free energy landscape. We have confirmed the recent theoretical results that diffusion on such a free energy profile can be used to compute exactly the equilibrium flux, the mean first passage times, and the mean transition path times between any two points on the profile. We have shown that the mean squared displacement along the optimal RC grows linear with time as for

  19. Earthworm Lumbricus rubellus MT-2: Metal Binding and Protein Folding of a True Cadmium-MT

    Directory of Open Access Journals (Sweden)

    Gregory R. Kowald

    2016-01-01

    Full Text Available Earthworms express, as most animals, metallothioneins (MTs—small, cysteine-rich proteins that bind d10 metal ions (Zn(II, Cd(II, or Cu(I in clusters. Three MT homologues are known for Lumbricus rubellus, the common red earthworm, one of which, wMT-2, is strongly induced by exposure of worms to cadmium. This study concerns composition, metal binding affinity and metal-dependent protein folding of wMT-2 expressed recombinantly and purified in the presence of Cd(II and Zn(II. Crucially, whilst a single Cd7wMT-2 species was isolated from wMT-2-expressing E. coli cultures supplemented with Cd(II, expressions in the presence of Zn(II yielded mixtures. The average affinities of wMT-2 determined for either Cd(II or Zn(II are both within normal ranges for MTs; hence, differential behaviour cannot be explained on the basis of overall affinity. Therefore, the protein folding properties of Cd- and Zn-wMT-2 were compared by 1H NMR spectroscopy. This comparison revealed that the protein fold is better defined in the presence of cadmium than in the presence of zinc. These differences in folding and dynamics may be at the root of the differential behaviour of the cadmium- and zinc-bound protein in vitro, and may ultimately also help in distinguishing zinc and cadmium in the earthworm in vivo.

  20. The Dynameomics Entropy Dictionary: A Large-Scale Assessment of Conformational Entropy across Protein Fold Space.

    Science.gov (United States)

    Towse, Clare-Louise; Akke, Mikael; Daggett, Valerie

    2017-04-27

    Molecular dynamics (MD) simulations contain considerable information with regard to the motions and fluctuations of a protein, the magnitude of which can be used to estimate conformational entropy. Here we survey conformational entropy across protein fold space using the Dynameomics database, which represents the largest existing data set of protein MD simulations for representatives of essentially all known protein folds. We provide an overview of MD-derived entropies accounting for all possible degrees of dihedral freedom on an unprecedented scale. Although different side chains might be expected to impose varying restrictions on the conformational space that the backbone can sample, we found that the backbone entropy and side chain size are not strictly coupled. An outcome of these analyses is the Dynameomics Entropy Dictionary, the contents of which have been compared with entropies derived by other theoretical approaches and experiment. As might be expected, the conformational entropies scale linearly with the number of residues, demonstrating that conformational entropy is an extensive property of proteins. The calculated conformational entropies of folding agree well with previous estimates. Detailed analysis of specific cases identifies deviations in conformational entropy from the average values that highlight how conformational entropy varies with sequence, secondary structure, and tertiary fold. Notably, α-helices have lower entropy on average than do β-sheets, and both are lower than coil regions.

  1. Mapping the Protein Fold Universe Using the CamTube Force Field in Molecular Dynamics Simulations.

    Science.gov (United States)

    Kukic, Predrag; Kannan, Arvind; Dijkstra, Maurits J J; Abeln, Sanne; Camilloni, Carlo; Vendruscolo, Michele

    2015-10-01

    It has been recently shown that the coarse-graining of the structures of polypeptide chains as self-avoiding tubes can provide an effective representation of the conformational space of proteins. In order to fully exploit the opportunities offered by such a 'tube model' approach, we present here a strategy to combine it with molecular dynamics simulations. This strategy is based on the incorporation of the 'CamTube' force field into the Gromacs molecular dynamics package. By considering the case of a 60-residue polyvaline chain, we show that CamTube molecular dynamics simulations can comprehensively explore the conformational space of proteins. We obtain this result by a 20 μs metadynamics simulation of the polyvaline chain that recapitulates the currently known protein fold universe. We further show that, if residue-specific interaction potentials are added to the CamTube force field, it is possible to fold a protein into a topology close to that of its native state. These results illustrate how the CamTube force field can be used to explore efficiently the universe of protein folds with good accuracy and very limited computational cost.

  2. Anisotropy of the Coulomb Interaction between Folded Proteins: Consequences for Mesoscopic Aggregation of Lysozyme

    Science.gov (United States)

    Chan, Ho Yin; Lankevich, Vladimir; Vekilov, Peter G.; Lubchenko, Vassiliy

    2012-01-01

    Toward quantitative description of protein aggregation, we develop a computationally efficient method to evaluate the potential of mean force between two folded protein molecules that allows for complete sampling of their mutual orientation. Our model is valid at moderate ionic strengths and accounts for the actual charge distribution on the surface of the molecules, the dielectric discontinuity at the protein-solvent interface, and the possibility of protonation or deprotonation of surface residues induced by the electric field due to the other protein molecule. We apply the model to the protein lysozyme, whose solutions exhibit both mesoscopic clusters of protein-rich liquid and liquid-liquid separation; the former requires that protein form complexes with typical lifetimes of approximately milliseconds. We find the electrostatic repulsion is typically lower than the prediction of the Derjaguin-Landau-Verwey-Overbeek theory. The Coulomb interaction in the lowest-energy docking configuration is nonrepulsive, despite the high positive charge on the molecules. Typical docking configurations barely involve protonation or deprotonation of surface residues. The obtained potential of mean force between folded lysozyme molecules is consistent with the location of the liquid-liquid coexistence, but produces dimers that are too short-lived for clusters to exist, suggesting lysozyme undergoes conformational changes during cluster formation. PMID:22768950

  3. Amino acid alphabet reduction preserves fold information contained in contact interactions in proteins.

    Science.gov (United States)

    Solis, Armando D

    2015-12-01

    To reduce complexity, understand generalized rules of protein folding, and facilitate de novo protein design, the 20-letter amino acid alphabet is commonly reduced to a smaller alphabet by clustering amino acids based on some measure of similarity. In this work, we seek the optimal alphabet that preserves as much of the structural information found in long-range (contact) interactions among amino acids in natively-folded proteins. We employ the Information Maximization Device, based on information theory, to partition the amino acids into well-defined clusters. Numbering from 2 to 19 groups, these optimal clusters of amino acids, while generated automatically, embody well-known properties of amino acids such as hydrophobicity/polarity, charge, size, and aromaticity, and are demonstrated to maintain the discriminative power of long-range interactions with minimal loss of mutual information. Our measurements suggest that reduced alphabets (of less than 10) are able to capture virtually all of the information residing in native contacts and may be sufficient for fold recognition, as demonstrated by extensive threading tests. In an expansive survey of the literature, we observe that alphabets derived from various approaches-including those derived from physicochemical intuition, local structure considerations, and sequence alignments of remote homologs-fare consistently well in preserving contact interaction information, highlighting a convergence in the various factors thought to be relevant to the folding code. Moreover, we find that alphabets commonly used in experimental protein design are nearly optimal and are largely coherent with observations that have arisen in this work. © 2015 Wiley Periodicals, Inc.

  4. Discrete Frenet frame, inflection point solitons, and curve visualization with applications to folded proteins

    Science.gov (United States)

    Hu, Shuangwei; Lundgren, Martin; Niemi, Antti J.

    2011-06-01

    We develop a transfer matrix formalism to visualize the framing of discrete piecewise linear curves in three-dimensional space. Our approach is based on the concept of an intrinsically discrete curve. This enables us to more effectively describe curves that in the limit where the length of line segments vanishes approach fractal structures in lieu of continuous curves. We verify that in the case of differentiable curves the continuum limit of our discrete equation reproduces the generalized Frenet equation. In particular, we draw attention to the conceptual similarity between inflection points where the curvature vanishes and topologically stable solitons. As an application we consider folded proteins, their Hausdorff dimension is known to be fractal. We explain how to employ the orientation of Cβ carbons of amino acids along a protein backbone to introduce a preferred framing along the backbone. By analyzing the experimentally resolved fold geometries in the Protein Data Bank we observe that this Cβ framing relates intimately to the discrete Frenet framing. We also explain how inflection points (a.k.a. soliton centers) can be located in the loops and clarify their distinctive rôle in determining the loop structure of folded proteins.

  5. Shedding Light on Protein Folding, Structural and Functional Dynamics by Single Molecule Studies

    Directory of Open Access Journals (Sweden)

    Krutika Bavishi

    2014-11-01

    Full Text Available The advent of advanced single molecule measurements unveiled a great wealth of dynamic information revolutionizing our understanding of protein dynamics and behavior in ways unattainable by conventional bulk assays. Equipped with the ability to record distribution of behaviors rather than the mean property of a population, single molecule measurements offer observation and quantification of the abundance, lifetime and function of multiple protein states. They also permit the direct observation of the transient and rarely populated intermediates in the energy landscape that are typically averaged out in non-synchronized ensemble measurements. Single molecule studies have thus provided novel insights about how the dynamic sampling of the free energy landscape dictates all aspects of protein behavior; from its folding to function. Here we will survey some of the state of the art contributions in deciphering mechanisms that underlie protein folding, structural and functional dynamics by single molecule fluorescence microscopy techniques. We will discuss a few selected examples highlighting the power of the emerging techniques and finally discuss the future improvements and directions.

  6. SHuffle, a novel Escherichia coli protein expression strain capable of correctly folding disulfide bonded proteins in its cytoplasm

    Directory of Open Access Journals (Sweden)

    Lobstein Julie

    2012-05-01

    Full Text Available Abstract Background Production of correctly disulfide bonded proteins to high yields remains a challenge. Recombinant protein expression in Escherichia coli is the popular choice, especially within the research community. While there is an ever growing demand for new expression strains, few strains are dedicated to post-translational modifications, such as disulfide bond formation. Thus, new protein expression strains must be engineered and the parameters involved in producing disulfide bonded proteins must be understood. Results We have engineered a new E. coli protein expression strain named SHuffle, dedicated to producing correctly disulfide bonded active proteins to high yields within its cytoplasm. This strain is based on the trxB gor suppressor strain SMG96 where its cytoplasmic reductive pathways have been diminished, allowing for the formation of disulfide bonds in the cytoplasm. We have further engineered a major improvement by integrating into its chromosome a signal sequenceless disulfide bond isomerase, DsbC. We probed the redox state of DsbC in the oxidizing cytoplasm and evaluated its role in assisting the formation of correctly folded multi-disulfide bonded proteins. We optimized protein expression conditions, varying temperature, induction conditions, strain background and the co-expression of various helper proteins. We found that temperature has the biggest impact on improving yields and that the E. coli B strain background of this strain was superior to the K12 version. We also discovered that auto-expression of substrate target proteins using this strain resulted in higher yields of active pure protein. Finally, we found that co-expression of mutant thioredoxins and PDI homologs improved yields of various substrate proteins. Conclusions This work is the first extensive characterization of the trxB gor suppressor strain. The results presented should help researchers design the appropriate protein expression conditions using

  7. Protein Structure Prediction by Protein Threading

    Science.gov (United States)

    Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong

    The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.

  8. Oxidative protein folding: from thiol-disulfide exchange reactions to the redox poise of the endoplasmic reticulum.

    Science.gov (United States)

    Hudson, Devin A; Gannon, Shawn A; Thorpe, Colin

    2015-03-01

    This review examines oxidative protein folding within the mammalian endoplasmic reticulum (ER) from an enzymological perspective. In protein disulfide isomerase-first (PDI-first) pathways of oxidative protein folding, PDI is the immediate oxidant of reduced client proteins and then addresses disulfide mispairings in a second isomerization phase. In PDI-second pathways the initial oxidation is PDI-independent. Evidence for the rapid reduction of PDI by reduced glutathione is presented in the context of PDI-first pathways. Strategies and challenges are discussed for determination of the concentrations of reduced and oxidized glutathione and of the ratios of PDI(red):PDI(ox). The preponderance of evidence suggests that the mammalian ER is more reducing than first envisaged. The average redox state of major PDI-family members is largely to almost totally reduced. These observations are consistent with model studies showing that oxidative protein folding proceeds most efficiently at a reducing redox poise consistent with a stoichiometric insertion of disulfides into client proteins. After a discussion of the use of natively encoded fluorescent probes to report the glutathione redox poise of the ER, this review concludes with an elaboration of a complementary strategy to discontinuously survey the redox state of as many redox-active disulfides as can be identified by ratiometric LC-MS-MS methods. Consortia of oxidoreductases that are in redox equilibrium can then be identified and compared to the glutathione redox poise of the ER to gain a more detailed understanding of the factors that influence oxidative protein folding within the secretory compartment. Copyright © 2014 Elsevier Inc. All rights reserved.

  9. Microsecond simulations of the folding/unfolding thermodynamics of the Trp-cage mini protein

    Science.gov (United States)

    Day, Ryan; Paschek, Dietmar; Garcia, Angel E.

    2012-01-01

    We study the unbiased folding/unfolding thermodynamics of the Trp-cage miniprotein using detailed molecular dynamics simulations of an all-atom model of the protein in explicit solvent, using the Amberff99SB force field. Replica-exchange molecular dynamics (REMD) simulations are used to sample the protein ensembles over a broad range of temperatures covering the folded and unfolded states, and at two densities. The obtained ensembles are shown to reach equilibrium in the 1 μs per replica timescale. The total simulation time employed in the calculations exceeds 100 μs. Ensemble averages of the fraction folded, pressure, and energy differences between the folded and unfolded states as a function of temperature are used to model the free energy of the folding transition, ΔG(P,T), over the whole region of temperature and pressures sampled in the simulations. The ΔG(P,T) diagram describes an ellipse over the range of temperatures and pressures sampled, predicting that the system can undergo pressure induced unfolding and cold denaturation at low temperatures and high pressures, and unfolding at low pressures and high temperatures. The calculated free energy function exhibits remarkably good agreement with the experimental folding transition temperature (Tf = 321 K), free energy and specific heat changes. However, changes in enthalpy and entropy are significantly different than the experimental values. We speculate that these differences may be due to the simplicity of the semi-empirical force field used in the simulations and that more elaborate force fields may be required to describe appropriately the thermodynamics of proteins. PMID:20408169

  10. BCL::MP-Fold: membrane protein structure prediction guided by EPR restraints

    Science.gov (United States)

    Fischer, Axel W.; Alexander, Nathan S.; Woetzel, Nils; Karakaş, Mert; Weiner, Brian E.; Meiler, Jens

    2016-01-01

    For many membrane proteins, the determination of their topology remains a challenge for methods like X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy. Electron paramagnetic resonance (EPR) spectroscopy has evolved as an alternative technique to study structure and dynamics of membrane proteins. The present study demonstrates the feasibility of membrane protein topology determination using limited EPR distance and accessibility measurements. The BCL::MP-Fold algorithm assembles secondary structure elements (SSEs) in the membrane using a Monte Carlo Metropolis (MCM) approach. Sampled models are evaluated using knowledge-based potential functions and agreement with the EPR data and a knowledge-based energy function. Twenty-nine membrane proteins of up to 696 residues are used to test the algorithm. The protein-size-normalized root-mean-square-deviation (RMSD100) value of the most accurate model is better than 8 Å for twenty-seven, better than 6 Å for twenty-two, and better than 4 Å for fifteen out of twenty-nine proteins, demonstrating the algorithm’s ability to sample the native topology. The average enrichment could be improved from 1.3 to 2.5, showing the improved discrimination power by using EPR data. PMID:25820805

  11. Protein kinesis: The dynamics of protein trafficking and stability

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1995-12-31

    The purpose of this conference is to provide a multidisciplinary forum for exchange of state-of-the-art information on protein kinesis. This volume contains abstracts of papers in the following areas: protein folding and modification in the endoplasmic reticulum; protein trafficking; protein translocation and folding; protein degradation; polarity; nuclear trafficking; membrane dynamics; and protein import into organelles.

  12. MFIB: a repository of protein complexes with mutual folding induced by binding.

    Science.gov (United States)

    Fichó, Erzsébet; Reményi, István; Simon, István; Mészáros, Bálint

    2017-11-15

    It is commonplace that intrinsically disordered proteins (IDPs) are involved in crucial interactions in the living cell. However, the study of protein complexes formed exclusively by IDPs is hindered by the lack of data and such analyses remain sporadic. Systematic studies benefited other types of protein-protein interactions paving a way from basic science to therapeutics; yet these efforts require reliable datasets that are currently lacking for synergistically folding complexes of IDPs. Here we present the Mutual Folding Induced by Binding (MFIB) database, the first systematic collection of complexes formed exclusively by IDPs. MFIB contains an order of magnitude more data than any dataset used in corresponding studies and offers a wide coverage of known IDP complexes in terms of flexibility, oligomeric composition and protein function from all domains of life. The included complexes are grouped using a hierarchical classification and are complemented with structural and functional annotations. MFIB is backed by a firm development team and infrastructure, and together with possible future community collaboration it will provide the cornerstone for structural and functional studies of IDP complexes. MFIB is freely accessible at http://mfib.enzim.ttk.mta.hu/. The MFIB application is hosted by Apache web server and was implemented in PHP. To enrich querying features and to enhance backend performance a MySQL database was also created. simon.istvan@ttk.mta.hu, meszaros.balint@ttk.mta.hu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press.

  13. Engineering Aromatic-Aromatic Interactions To Nucleate Folding in Intrinsically Disordered Regions of Proteins.

    Science.gov (United States)

    Balakrishnan, Swati; Sarma, Siddhartha P

    2017-08-22

    Aromatic interactions are an important force in protein folding as they combine the stability of a hydrophobic interaction with the selectivity of a hydrogen bond. Much of our understanding of aromatic interactions comes from "bioinformatics" based analyses of protein structures and from the contribution of these interactions to stabilizing secondary structure motifs in model peptides. In this study, the structural consequences of aromatic interactions on protein folding have been explored in engineered mutants of the molten globule protein apo-cytochrome b 5 . Structural changes from disorder to order due to aromatic interactions in two variants of the protein, viz., WF-cytb5 and FF-cytb5, result in significant long-range secondary and tertiary structure. The results show that 54 and 52% of the residues in WF-cytb5 and FF-cytb5, respectively, occupy ordered regions versus 26% in apo-cytochrome b 5 . The interactions between the aromatic groups are offset-stacked and edge-to-face for the Trp-Phe and Phe-Phe mutants, respectively. Urea denaturation studies indicate that both mutants have a C m higher than that of apo-cytochrome b 5 and are more stable to chaotropic agents than apo-cytochrome b 5 . The introduction of these aromatic residues also results in "trimer" interactions with existing aromatic groups, reaffirming the selectivity of the aromatic interactions. These studies provide insights into the aromatic interactions that drive disorder-to-order transitions in intrinsically disordered regions of proteins and will aid in de novo protein design beyond small peptide scaffolds.

  14. A first-principles model of early evolution: emergence of gene families, species, and preferred protein folds.

    Directory of Open Access Journals (Sweden)

    Konstantin B Zeldovich

    2007-07-01

    Full Text Available In this work we develop a microscopic physical model of early evolution where phenotype--organism life expectancy--is directly related to genotype--the stability of its proteins in their native conformations-which can be determined exactly in the model. Simulating the model on a computer, we consistently observe the "Big Bang" scenario whereby exponential population growth ensues as soon as favorable sequence-structure combinations (precursors of stable proteins are discovered. Upon that, random diversity of the structural space abruptly collapses into a small set of preferred proteins. We observe that protein folds remain stable and abundant in the population at timescales much greater than mutation or organism lifetime, and the distribution of the lifetimes of dominant folds in a population approximately follows a power law. The separation of evolutionary timescales between discovery of new folds and generation of new sequences gives rise to emergence of protein families and superfamilies whose sizes are power-law distributed, closely matching the same distributions for real proteins. On the population level we observe emergence of species--subpopulations that carry similar genomes. Further, we present a simple theory that relates stability of evolving proteins to the sizes of emerging genomes. Together, these results provide a microscopic first-principles picture of how first-gene families developed in the course of early evolution.

  15. Studies of protein structure in solution and protein folding using synchrotron small-angle x-ray scattering

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Lingling [Stanford Univ., CA (United States)

    1996-04-01

    Synchrotron small angle x-ray scattering (SAXS) has been applied to the structural study of several biological systems, including the nitrogenase complex, the heat shock cognate protein (hsc70), and lysozyme folding. The structural information revealed from the SAXS experiments is complementary to information obtained by other physical and biochemical methods, and adds to our knowledge and understanding of these systems.

  16. On the origins of the weak folding cooperativity of a designed ββα ultrafast protein FSD-1.

    Science.gov (United States)

    Wu, Chun; Shea, Joan-Emma

    2010-11-18

    FSD-1, a designed small ultrafast folder with a ββα fold, has been actively studied in the last few years as a model system for studying protein folding mechanisms and for testing of the accuracy of computational models. The suitability of this protein to describe the folding of naturally occurring α/β proteins has recently been challenged based on the observation that the melting transition is very broad, with ill-resolved baselines. Using molecular dynamics simulations with the AMBER protein force field (ff96) coupled with the implicit solvent model (IGB = 5), we shed new light into the nature of this transition and resolve the experimental controversies. We show that the melting transition corresponds to the melting of the protein as a whole, and not solely to the helix-coil transition. The breadth of the folding transition arises from the spread in the melting temperatures (from ∼325 K to ∼302 K) of the individual transitions: formation of the hydrophobic core, β-hairpin and tertiary fold, with the helix formed earlier. Our simulations initiated from an extended chain accurately predict the native structure, provide a reasonable estimate of the transition barrier height, and explicitly demonstrate the existence of multiple pathways and multiple transition states for folding. Our exhaustive sampling enables us to assess the quality of the Amber ff96/igb5 combination and reveals that while this force field can predict the correct native fold, it nonetheless overstabilizes the α-helix portion of the protein (Tm = ∼387K) as well as the denatured structures.

  17. Long range correlations and folding angle with applications to α-helical proteins

    Science.gov (United States)

    Krokhotin, Andrey; Nicolis, Stam; Niemi, Antti J.

    2014-03-01

    The conformational complexity of chain-like macromolecules such as proteins and other linear polymers is much larger than that of point-like atoms and molecules. Unlike particles, chains can bend, twist, and even become knotted. Thus chains might also display a much richer phase structure. Unfortunately, it is not very easy to characterize the phase of a long chain. Essentially, the only known attribute is the radius of gyration. The way how it changes when the degree of polymerization becomes different, and how it evolves when the ambient temperature and solvent properties change, is commonly used to disclose the phase. But in any finite length chain there are corrections to scaling that complicate the detailed analysis of the phase structure. Here we introduce a quantity that we call the folding angle to identify and scrutinize the phase structure, as a complement to the radius of gyration. We argue for a mean-field level relationship between the folding angle and the scaling exponent in the radius of gyration. We then estimate the value of the folding angle in the case of crystallographic α-helical protein structures in the Protein Data Bank. We also show how the experimental value of the folding angle can be obtained computationally, using a semiclassical Born-Oppenheimer description of α-helical chiral chains.

  18. TMFoldWeb: a web server for predicting transmembrane protein fold class.

    Science.gov (United States)

    Kozma, Dániel; Tusnády, Gábor E

    2015-09-17

    Here we present TMFoldWeb, the web server implementation of TMFoldRec, a transmembrane protein fold recognition algorithm. TMFoldRec uses statistical potentials and utilizes topology filtering and a gapless threading algorithm. It ranks template structures and selects the most likely candidates and estimates the reliability of the obtained lowest energy model. The statistical potential was developed in a maximum likelihood framework on a representative set of the PDBTM database. According to the benchmark test the performance of TMFoldRec is about 77 % in correctly predicting fold class for a given transmembrane protein sequence. An intuitive web interface has been developed for the recently published TMFoldRec algorithm. The query sequence goes through a pipeline of topology prediction and a systematic sequence to structure alignment (threading). Resulting templates are ordered by energy and reliability values and are colored according to their significance level. Besides the graphical interface, a programmatic access is available as well, via a direct interface for developers or for submitting genome-wide data sets. The TMFoldWeb web server is unique and currently the only web server that is able to predict the fold class of transmembrane proteins while assigning reliability scores for the prediction. This method is prepared for genome-wide analysis with its easy-to-use interface, informative result page and programmatic access. Considering the info-communication evolution in the last few years, the developed web server, as well as the molecule viewer, is responsive and fully compatible with the prevalent tablets and mobile devices.

  19. An overview on molecular chaperones enhancing solubility of expressed recombinant proteins with correct folding.

    Science.gov (United States)

    Mamipour, Mina; Yousefi, Mohammadreza; Hasanzadeh, Mohammad

    2017-09-01

    The majority of research topics declared that most of the recombinant proteins have been expressed by Escherichia coli in basic investigations. But the majority of high expressed proteins formed as inactive recombinant proteins that are called inclusion body. To overcome this problem, several methods have been used including suitable promoter, environmental factors, ladder tag to secretion of proteins into the periplasm, gene protein optimization, chemical chaperones and molecular chaperones sets. Co-expression of the interest protein with molecular chaperones is one of the common methods The chaperones are a group of proteins, which are involved in making correct folding of recombinant proteins. Chaperones are divided two groups including; cytoplasmic and periplasmic chaperones. Moreover, periplasmic chaperones and proteases can be manipulated to increase the yields of secreted proteins. In this article, we attempted to review cytoplasmic chaperones such as Hsp families and periplasmic chaperones including; generic chaperones, specialized chaperones, PPIases, and proteins involved in disulfide bond formation. Copyright © 2017 Elsevier B.V. All rights reserved.

  20. Entropic formulation for the protein folding process: Hydrophobic stability correlates with folding rates

    Science.gov (United States)

    Dal Molin, J. P.; Caliri, A.

    2018-01-01

    Here we focus on the conformational search for the native structure when it is ruled by the hydrophobic effect and steric specificities coming from amino acids. Our main tool of investigation is a 3D lattice model provided by a ten-letter alphabet, the stereochemical model. This minimalist model was conceived for Monte Carlo (MC) simulations when one keeps in mind the kinetic behavior of protein-like chains in solution. We have three central goals here. The first one is to characterize the folding time (τ) by two distinct sampling methods, so we present two sets of 103 MC simulations for a fast protein-like sequence. The resulting sets of characteristic folding times, τ and τq were obtained by the application of the standard Metropolis algorithm (MA), as well as by an enhanced algorithm (Mq A). The finding for τq shows two things: (i) the chain-solvent hydrophobic interactions {hk } plus a set of inter-residues steric constraints {ci,j } are able to emulate the conformational search for the native structure. For each one of the 103MC performed simulations, the target is always found within a finite time window; (ii) the ratio τq / τ ≅ 1 / 10 suggests that the effect of local thermal fluctuations, encompassed by the Tsallis weight, provides to the chain an innate efficiency to escape from energetic and steric traps. We performed additional MC simulations with variations of our design rule to attest this first result, both algorithms the MA and the Mq A were applied to a restricted set of targets, a physical insight is provided. Our second finding was obtained by a set of 600 independent MC simulations, only performed with the Mq A applied to an extended set of 200 representative targets, our native structures. The results show how structural patterns should modulate τq, which cover four orders of magnitude; this finding is our second goal. The third, and last result, was obtained with a special kind of simulation performed with the purpose to explore a

  1. In silico insights into protein-protein interactions and folding dynamics of the saposin-like domain of Solanum tuberosum aspartic protease.

    Directory of Open Access Journals (Sweden)

    Dref C De Moura

    Full Text Available The plant-specific insert is an approximately 100-residue domain found exclusively within the C-terminal lobe of some plant aspartic proteases. Structurally, this domain is a member of the saposin-like protein family, and is involved in plant pathogen defense as well as vacuolar targeting of the parent protease molecule. Similar to other members of the saposin-like protein family, most notably saposins A and C, the recently resolved crystal structure of potato (Solanum tuberosum plant-specific insert has been shown to exist in a substrate-bound open conformation in which the plant-specific insert oligomerizes to form homodimers. In addition to the open structure, a closed conformation also exists having the classic saposin fold of the saposin-like protein family as observed in the crystal structure of barley (Hordeum vulgare L. plant-specific insert. In the present study, the mechanisms of tertiary and quaternary conformation changes of potato plant-specific insert were investigated in silico as a function of pH. Umbrella sampling and determination of the free energy change of dissociation of the plant-specific insert homodimer revealed that increasing the pH of the system to near physiological levels reduced the free energy barrier to dissociation. Furthermore, principal component analysis was used to characterize conformational changes at both acidic and neutral pH. The results indicated that the plant-specific insert may adopt a tertiary structure similar to the characteristic saposin fold and suggest a potential new structural motif among saposin-like proteins. To our knowledge, this acidified PSI structure presents the first example of an alternative saposin-fold motif for any member of the large and diverse SAPLIP family.

  2. The Role of Backbone Hydrogen Bonds in the Transition State for Protein Folding of a PDZ Domain.

    Directory of Open Access Journals (Sweden)

    Søren W. Pedersen

    Full Text Available Backbone hydrogen bonds are important for the structure and stability of proteins. However, since conventional site-directed mutagenesis cannot be applied to perturb the backbone, the contribution of these hydrogen bonds in protein folding and stability has been assessed only for a very limited set of small proteins. We have here investigated effects of five amide-to-ester mutations in the backbone of a PDZ domain, a 90-residue globular protein domain, to probe the influence of hydrogen bonds in a β-sheet for folding and stability. The amide-to-ester mutation removes NH-mediated hydrogen bonds and destabilizes hydrogen bonds formed by the carbonyl oxygen. The overall stability of the PDZ domain generally decreased for all amide-to-ester mutants due to an increase in the unfolding rate constant. For this particular region of the PDZ domain, it is therefore clear that native hydrogen bonds are formed after crossing of the rate-limiting barrier for folding. Moreover, three of the five amide-to-ester mutants displayed an increase in the folding rate constant suggesting that the hydrogen bonds are involved in non-native interactions in the transition state for folding.

  3. Structure of the thioredoxin-fold domain of human phosducin-like protein 2

    International Nuclear Information System (INIS)

    Lou, Xiaochu; Bao, Rui; Zhou, Cong-Zhao; Chen, Yuxing

    2009-01-01

    The X-ray crystal structure of the Trx-fold domain of hPDCL2 was solved at 2.70 Å resolution and resembled the Trx-fold domain of rat phosducin. Human phosducin-like protein 2 (hPDCL2) has been identified as belonging to subgroup II of the phosducin (Pdc) family. The members of this family share an N-terminal helix domain and a C-terminal thioredoxin-fold (Trx-fold) domain. The X-ray crystal structure of the Trx-fold domain of hPDCL2 was solved at 2.70 Å resolution and resembled the Trx-fold domain of rat phosducin. Comparative structural analysis revealed the structural basis of their putative functional divergence

  4. On the origins of the weak folding cooperativity of a designed ββα ultrafast protein FSD-1.

    Directory of Open Access Journals (Sweden)

    Chun Wu

    Full Text Available FSD-1, a designed small ultrafast folder with a ββα fold, has been actively studied in the last few years as a model system for studying protein folding mechanisms and for testing of the accuracy of computational models. The suitability of this protein to describe the folding of naturally occurring α/β proteins has recently been challenged based on the observation that the melting transition is very broad, with ill-resolved baselines. Using molecular dynamics simulations with the AMBER protein force field (ff96 coupled with the implicit solvent model (IGB = 5, we shed new light into the nature of this transition and resolve the experimental controversies. We show that the melting transition corresponds to the melting of the protein as a whole, and not solely to the helix-coil transition. The breadth of the folding transition arises from the spread in the melting temperatures (from ∼325 K to ∼302 K of the individual transitions: formation of the hydrophobic core, β-hairpin and tertiary fold, with the helix formed earlier. Our simulations initiated from an extended chain accurately predict the native structure, provide a reasonable estimate of the transition barrier height, and explicitly demonstrate the existence of multiple pathways and multiple transition states for folding. Our exhaustive sampling enables us to assess the quality of the Amber ff96/igb5 combination and reveals that while this force field can predict the correct native fold, it nonetheless overstabilizes the α-helix portion of the protein (Tm = ∼387K as well as the denatured structures.

  5. The Role of Short-Chain Conjugated Poly-(R-3-Hydroxybutyrate (cPHB in Protein Folding

    Directory of Open Access Journals (Sweden)

    Rosetta N. Reusch

    2013-05-01

    Full Text Available Poly-(R-3-hydroxybutyrate (PHB, a linear polymer of R-3-hydroxybutyrate (R-3HB, is a fundamental constituent of biological cells. Certain prokaryotes accumulate PHB of very high molecular weight (10,000 to >1,000,000 residues, which is segregated within granular deposits in the cytoplasm; however, all prokaryotes and all eukaryotes synthesize PHB of medium-chain length (~100–200 residues which resides within lipid bilayers or lipid vesicles, and PHB of short-chain length (<12 residues which is conjugated to proteins (cPHB, primarily proteins in membranes and organelles. The physical properties of cPHB indicate it plays important roles in the targeting and folding of cPHB-proteins. Here we review the occurrence, physical properties and molecular characteristics of cPHB, and discuss its influence on the folding and structure of outer membrane protein A (OmpA of Escherichia coli.

  6. MICROFLUIDIC MIXERS FOR THE INVESTIGATION OF PROTEIN FOLDING USING SYNCHROTRON RADIATION CIRCULAR DICHROISM SPECTROSCOPY

    International Nuclear Information System (INIS)

    Kane, A; Hertzog, D; Baumgartel, P; Lengefeld, J; Horsley, D; Schuler, B; Bakajin, O

    2006-01-01

    The purpose of this study is to design, fabricate and optimize microfluidic mixers to investigate the kinetics of protein secondary structure formation with Synchrotron Radiation Circular Dichroism (SRCD) spectroscopy. The mixers are designed to rapidly initiate protein folding reaction through the dilution of denaturant. The devices are fabricated out of fused silica, so that they are transparent in the UV. We present characterization of mixing in the fabricated devices, as well as the initial SRCD data on proteins inside the mixers

  7. Protein folding kinetics by combined use of rapid mixing techniques and NMR observation of individual amide protons

    International Nuclear Information System (INIS)

    Roder, H.; Wuethrich, K.

    1986-01-01

    A method to be used for experimental studies of protein folding introduced by Schmid and Baldwin, which is based on the competition between amide hydrogen exchange and protein refolding, was extended by using rapid mixing techniques and 1 H NMR to provide site-resolved kinetic information on the early phases of protein structure acquisition. In this method, a protonated solution of the unfolded protein is rapidly mixed with a deuterated buffer solution at conditions assuring protein refolding in the mixture. This simultaneously initiates the exchange of unprotected amide protons with solvent deuterium and the refolding of protein segments which can protect amide groups from further exchange. After variable reaction times the amide proton exchange is quenched while folding to the native form continues to completion. By using 1 H NMR, the extent of exchange at individual amide sites is then measured in the refolded protein. Competition experiments at variable reaction times or variable pH indicate the time at which each amide group is protected in the refolding process. This technique was applied to the basic pancreatic trypsin inhibitor, for which sequence-specific assignments of the amide proton NMR lines had previously been obtained. For eight individual amide protons located in the beta-sheet and the C-terminal alpha-helix of this protein, apparent refolding rates in the range from 15 s-1 to 60 s-1 were observed. These rates are on the time scale of the fast folding phase observed with optical probes

  8. The energy landscapes of repeat-containing proteins: topology, cooperativity, and the folding funnels of one-dimensional architectures.

    Directory of Open Access Journals (Sweden)

    Diego U Ferreiro

    2008-05-01

    Full Text Available Repeat-proteins are made up of near repetitions of 20- to 40-amino acid stretches. These polypeptides usually fold up into non-globular, elongated architectures that are stabilized by the interactions within each repeat and those between adjacent repeats, but that lack contacts between residues distant in sequence. The inherent symmetries both in primary sequence and three-dimensional structure are reflected in a folding landscape that may be analyzed as a quasi-one-dimensional problem. We present a general description of repeat-protein energy landscapes based on a formal Ising-like treatment of the elementary interaction energetics in and between foldons, whose collective ensemble are treated as spin variables. The overall folding properties of a complete "domain" (the stability and cooperativity of the repeating array can be derived from this microscopic description. The one-dimensional nature of the model implies there are simple relations for the experimental observables: folding free-energy (DeltaG(water and the cooperativity of denaturation (m-value, which do not ordinarily apply for globular proteins. We show how the parameters for the "coarse-grained" description in terms of foldon spin variables can be extracted from more detailed folding simulations on perfectly funneled landscapes. To illustrate the ideas, we present a case-study of a family of tetratricopeptide (TPR repeat proteins and quantitatively relate the results to the experimentally observed folding transitions. Based on the dramatic effect that single point mutations exert on the experimentally observed folding behavior, we speculate that natural repeat proteins are "poised" at particular ratios of inter- and intra-element interaction energetics that allow them to readily undergo structural transitions in physiologically relevant conditions, which may be intrinsically related to their biological functions.

  9. Supramolecular Architectures and Mimics of Complex Natural Folds Derived from Rationally Designed alpha-Helical Protein Structures

    Science.gov (United States)

    Tavenor, Nathan Albert

    Protein-based supramolecular polymers (SMPs) are a class of biomaterials which draw inspiration from and expand upon the many examples of complex protein quaternary structures observed in nature: collagen, microtubules, viral capsids, etc. Designing synthetic supramolecular protein scaffolds both increases our understanding of natural superstructures and allows for the creation of novel materials. Similar to small-molecule SMPs, protein-based SMPs form due to self-assembly driven by intermolecular interactions between monomers, and monomer structure determines the properties of the overall material. Using protein-based monomers takes advantage of the self-assembly and highly specific molecular recognition properties encodable in polypeptide sequences to rationally design SMP architectures. The central hypothesis underlying our work is that alpha-helical coiled coils, a well-studied protein quaternary folding motif, are well-suited to SMP design through the addition of synthetic linkers at solvent-exposed sites. Through small changes in the structures of the cross-links and/or peptide sequence, we have been able to control both the nanoscale organization and the macroscopic properties of the SMPs. Changes to the linker and hydrophobic core of the peptide can be used to control polymer rigidity, stability, and dimensionality. The gaps in knowledge that this thesis sought to fill on this project were 1) the relationship between the molecular structure of the cross-linked polypeptides and the macroscopic properties of the SMPs and 2) a means of creating materials exhibiting multi-dimensional net or framework topologies. Separate from the above efforts on supramolecular architectures was work on improving backbone modification strategies for an alpha-helix in the context of a complex protein tertiary fold. Earlier work in our lab had successfully incorporated unnatural building blocks into every major secondary structure (beta-sheet, alpha-helix, loops and beta

  10. Unique Features of Halophilic Proteins.

    Science.gov (United States)

    Arakawa, Tsutomu; Yamaguchi, Rui; Tokunaga, Hiroko; Tokunaga, Masao

    2017-01-01

    Proteins from moderate and extreme halophiles have unique characteristics. They are highly acidic and hydrophilic, similar to intrinsically disordered proteins. These characteristics make the halophilic proteins soluble in water and fold reversibly. In addition to reversible folding, the rate of refolding of halophilic proteins from denatured structure is generally slow, often taking several days, for example, for extremely halophilic proteins. This slow folding rate makes the halophilic proteins a novel model system for folding mechanism analysis. High solubility and reversible folding also make the halophilic proteins excellent fusion partners for soluble expression of recombinant proteins.

  11. Localizing internal friction along the reaction coordinate of protein folding by combining ensemble and single-molecule fluorescence spectroscopy

    Science.gov (United States)

    Borgia, Alessandro; Wensley, Beth G.; Soranno, Andrea; Nettels, Daniel; Borgia, Madeleine B.; Hoffmann, Armin; Pfeil, Shawn H.; Lipman, Everett A.; Clarke, Jane; Schuler, Benjamin

    2012-01-01

    Theory, simulations and experimental results have suggested an important role of internal friction in the kinetics of protein folding. Recent experiments on spectrin domains provided the first evidence for a pronounced contribution of internal friction in proteins that fold on the millisecond timescale. However, it has remained unclear how this contribution is distributed along the reaction and what influence it has on the folding dynamics. Here we use a combination of single-molecule Förster resonance energy transfer, nanosecond fluorescence correlation spectroscopy, microfluidic mixing and denaturant- and viscosity-dependent protein-folding kinetics to probe internal friction in the unfolded state and at the early and late transition states of slow- and fast-folding spectrin domains. We find that the internal friction affecting the folding rates of spectrin domains is highly localized to the early transition state, suggesting an important role of rather specific interactions in the rate-limiting conformational changes. PMID:23149740

  12. Polymer collapse, protein folding, and the percolation threshold.

    Science.gov (United States)

    Meirovitch, Hagai

    2002-01-15

    We study the transition of polymers in the dilute regime from a swollen shape at high temperatures to their low-temperature structures. The polymers are modeled by a single self-avoiding walk (SAW) on a lattice for which l of the monomers (the H monomers) are self-attracting, i.e., if two nonbonded H monomers become nearest neighbors on the lattice they gain energy of interaction (epsilon = -/epsilon/); the second type of monomers, denoted P, are neutral. This HP model was suggested by Lau and Dill (Macromolecules 1989, 22, 3986-3997) to study protein folding, where H and P are the hydrophobic and polar amino acid residues, respectively. The model is simulated on the square and simple cubic (SC) lattices using the scanning method. We show that the ground state and the sharpness of the transition depend on the lattice, the fraction g of the H monomers, as well as on their arrangement along the chain. In particular, if the H monomers are distributed at random and g is larger than the site percolation threshold of the lattice, a collapsed transition is very likely to occur. This conclusion, drawn for the lattice models, is also applicable to proteins where an effective lattice with coordination number between that of the SC lattice and the body centered cubic lattice is defined. Thus, the average fraction of hydrophobic amino acid residues in globular proteins is found to be close to the percolation threshold of the effective lattice.

  13. Cofactor-binding sites in proteins of deviating sequence: comparative analysis and clustering in torsion angle, cavity, and fold space.

    Science.gov (United States)

    Stegemann, Björn; Klebe, Gerhard

    2012-02-01

    Small molecules are recognized in protein-binding pockets through surface-exposed physicochemical properties. To optimize binding, they have to adopt a conformation corresponding to a local energy minimum within the formed protein-ligand complex. However, their conformational flexibility makes them competent to bind not only to homologous proteins of the same family but also to proteins of remote similarity with respect to the shape of the binding pockets and folding pattern. Considering drug action, such observations can give rise to unexpected and undesired cross reactivity. In this study, datasets of six different cofactors (ADP, ATP, NAD(P)(H), FAD, and acetyl CoA, sharing an adenosine diphosphate moiety as common substructure), observed in multiple crystal structures of protein-cofactor complexes exhibiting sequence identity below 25%, have been analyzed for the conformational properties of the bound ligands, the distribution of physicochemical properties in the accommodating protein-binding pockets, and the local folding patterns next to the cofactor-binding site. State-of-the-art clustering techniques have been applied to group the different protein-cofactor complexes in the different spaces. Interestingly, clustering in cavity (Cavbase) and fold space (DALI) reveals virtually the same data structuring. Remarkable relationships can be found among the different spaces. They provide information on how conformations are conserved across the host proteins and which distinct local cavity and fold motifs recognize the different portions of the cofactors. In those cases, where different cofactors are found to be accommodated in a similar fashion to the same fold motifs, only a commonly shared substructure of the cofactors is used for the recognition process. Copyright © 2011 Wiley Periodicals, Inc.

  14. The ModFOLD4 server for the quality assessment of 3D protein models

    OpenAIRE

    McGuffin, Liam J.; Buenavista, Maria T.; Roche, Daniel B.

    2013-01-01

    Once you have generated a 3D model of a protein,\\ud how do you know whether it bears any resemblance\\ud to the actual structure? To determine the usefulness\\ud of 3D models of proteins, they must be assessed in\\ud terms of their quality by methods that predict their\\ud similarity to the native structure. The ModFOLD4\\ud server is the latest version of our leading independent\\ud server for the estimation of both the global and\\ud local (per-residue) quality of 3D protein models. The\\ud server ...

  15. Probing slowly exchanging protein systems via {sup 13}C{sup {alpha}}-CEST: monitoring folding of the Im7 protein

    Energy Technology Data Exchange (ETDEWEB)

    Hansen, Alexandar L.; Bouvignies, Guillaume; Kay, Lewis E., E-mail: kay@pound.med.utoronto.ca [University of Toronto, Departments of Molecular Genetics, Biochemistry and Chemistry (Canada)

    2013-03-15

    A {sup 13}C{sup {alpha}} chemical exchange saturation transfer based experiment is presented for the study of protein systems undergoing slow interconversion between an 'observable' ground state and one or more 'invisible' excited states. Here a labeling strategy whereby [2-{sup 13}C]-glucose is the sole carbon source is exploited, producing proteins with {sup 13}C at the C{sup {alpha}} position, while the majority of residues remain unlabeled at CO or C{sup {beta}}. The new experiment is demonstrated with an application to the folding reaction of the Im7 protein that involves an on-pathway excited state. The obtained excited state {sup 13}C{sup {alpha}} chemical shifts are cross validated by comparison to values extracted from analysis of CPMG relaxation dispersion profiles, establishing the utility of the methodology.

  16. Lipid-protein nanodiscs for cell-free production of integral membrane proteins in a soluble and folded state: comparison with detergent micelles, bicelles and liposomes.

    Science.gov (United States)

    Lyukmanova, E N; Shenkarev, Z O; Khabibullina, N F; Kopeina, G S; Shulepko, M A; Paramonov, A S; Mineev, K S; Tikhonov, R V; Shingarova, L N; Petrovskaya, L E; Dolgikh, D A; Arseniev, A S; Kirpichnikov, M P

    2012-03-01

    Production of integral membrane proteins (IMPs) in a folded state is a key prerequisite for their functional and structural studies. In cell-free (CF) expression systems membrane mimicking components could be added to the reaction mixture that promotes IMP production in a soluble form. Here lipid-protein nanodiscs (LPNs) of different lipid compositions (DMPC, DMPG, POPC, POPC/DOPG) have been compared with classical membrane mimicking media such as detergent micelles, lipid/detergent bicelles and liposomes by their ability to support CF synthesis of IMPs in a folded and soluble state. Three model membrane proteins of different topology were used: homodimeric transmembrane (TM) domain of human receptor tyrosine kinase ErbB3 (TM-ErbB3, 1TM); voltage-sensing domain of K(+) channel KvAP (VSD, 4TM); and bacteriorhodopsin from Exiguobacterium sibiricum (ESR, 7TM). Structural and/or functional properties of the synthesized proteins were analyzed. LPNs significantly enhanced synthesis of the IMPs in a soluble form regardless of the lipid composition. A partial disintegration of LPNs composed of unsaturated lipids was observed upon co-translational IMP incorporation. Contrary to detergents the nanodiscs resulted in the synthesis of ~80% active ESR and promoted correct folding of the TM-ErbB3. None of the tested membrane mimetics supported CF synthesis of correctly folded VSD, and the protocol of the domain refolding was developed. The use of LPNs appears to be the most promising approach to CF production of IMPs in a folded state. NMR analysis of (15)N-Ile-TM-ErbB3 co-translationally incorporated into LPNs shows the great prospects of this membrane mimetics for structural studies of IMPs produced by CF systems. Copyright © 2011 Elsevier B.V. All rights reserved.

  17. CATHEDRAL: a fast and effective algorithm to predict folds and domain boundaries from multidomain protein structures.

    Directory of Open Access Journals (Sweden)

    Oliver C Redfern

    2007-11-01

    Full Text Available We present CATHEDRAL, an iterative protocol for determining the location of previously observed protein folds in novel multidomain protein structures. CATHEDRAL builds on the features of a fast secondary-structure-based method (using graph theory to locate known folds within a multidomain context and a residue-based, double-dynamic programming algorithm, which is used to align members of the target fold groups against the query protein structure to identify the closest relative and assign domain boundaries. To increase the fidelity of the assignments, a support vector machine is used to provide an optimal scoring scheme. Once a domain is verified, it is excised, and the search protocol is repeated in an iterative fashion until all recognisable domains have been identified. We have performed an initial benchmark of CATHEDRAL against other publicly available structure comparison methods using a consensus dataset of domains derived from the CATH and SCOP domain classifications. CATHEDRAL shows superior performance in fold recognition and alignment accuracy when compared with many equivalent methods. If a novel multidomain structure contains a known fold, CATHEDRAL will locate it in 90% of cases, with <1% false positives. For nearly 80% of assigned domains in a manually validated test set, the boundaries were correctly delineated within a tolerance of ten residues. For the remaining cases, previously classified domains were very remotely related to the query chain so that embellishments to the core of the fold caused significant differences in domain sizes and manual refinement of the boundaries was necessary. To put this performance in context, a well-established sequence method based on hidden Markov models was only able to detect 65% of domains, with 33% of the subsequent boundaries assigned within ten residues. Since, on average, 50% of newly determined protein structures contain more than one domain unit, and typically 90% or more of these

  18. Protein folding on a chip

    CERN Multimedia

    2004-01-01

    "Scientists at the U.S. Department of Energy's Brookhaven National Laboratory are proposing to use a super- computer originally developed to simulate elementary particles in high- energy physics to help determine the structures and functions of proteins, including, for example, the 30,000 or so proteins encoded by the human genome" (1 page)

  19. Essential roles of protein-solvent many-body correlation in solvent-entropy effect on protein folding and denaturation: Comparison between hard-sphere solvent and water

    International Nuclear Information System (INIS)

    Oshima, Hiraku; Kinoshita, Masahiro

    2015-01-01

    In earlier works, we showed that the entropic effect originating from the translational displacement of water molecules plays the pivotal role in protein folding and denaturation. The two different solvent models, hard-sphere solvent and model water, were employed in theoretical methods wherein the entropic effect was treated as an essential factor. However, there were similarities and differences in the results obtained from the two solvent models. In the present work, to unveil the physical origins of the similarities and differences, we simultaneously consider structural transition, cold denaturation, and pressure denaturation for the same protein by employing the two solvent models and considering three different thermodynamic states for each solvent model. The solvent-entropy change upon protein folding/unfolding is decomposed into the protein-solvent pair (PA) and many-body (MB) correlation components using the integral equation theories. Each component is further decomposed into the excluded-volume (EV) and solvent-accessible surface (SAS) terms by applying the morphometric approach. The four physically insightful constituents, (PA, EV), (PA, SAS), (MB, EV), and (MB, SAS), are thus obtained. Moreover, (MB, SAS) is discussed by dividing it into two factors. This all-inclusive investigation leads to the following results: (1) the protein-water many-body correlation always plays critical roles in a variety of folding/unfolding processes; (2) the hard-sphere solvent model fails when it does not correctly reproduce the protein-water many-body correlation; (3) the hard-sphere solvent model becomes problematic when the dependence of the many-body correlation on the solvent number density and temperature is essential: it is not quite suited to studies on cold and pressure denaturating of a protein; (4) when the temperature and solvent number density are limited to the ambient values, the hard-sphere solvent model is usually successful; and (5) even at the ambient

  20. Ab initio folding of mixed-fold FSD-EY protein using formula-based polarizable hydrogen bond (PHB) charge model

    Science.gov (United States)

    Zhang, Dawei; Lazim, Raudah; Mun Yip, Yew

    2017-09-01

    We conducted an all-atom ab initio folding of FSD-EY, a protein with a ββα configuration using non-polarizable (AMBER) and polarizable force fields (PHB designed by Gao et al.) in implicit solvent. The effect of reducing the polarization effect integrated into the force field by the PHB model, termed the PHB0.7 was also examined in the folding of FSD-EY. This model incorporates into the force field 70% of the original polarization effect to minimize the likelihood of over-stabilizing the backbone hydrogen bonds. Precise folding of the β-sheet of FSD-EY was further achieved by relaxing the REMD structure obtained in explicit water.

  1. Determination of protein global folds using backbone residual dipolar coupling and long-range NOE restraints

    International Nuclear Information System (INIS)

    Giesen, Alexander W.; Homans, Steve W.; Brown, Jonathan Miles

    2003-01-01

    We report the determination of the global fold of human ubiquitin using protein backbone NMR residual dipolar coupling and long-range nuclear Overhauser effect (NOE) data as conformational restraints. Specifically, by use of a maximum of three backbone residual dipolar couplings per residue (N i -H N i , N i -C' i-1 , H N i - C' i-1 ) in two tensor frames and only backbone H N -H N NOEs, a global fold of ubiquitin can be derived with a backbone root-mean-square deviation of 1.4 A with respect to the crystal structure. This degree of accuracy is more than adequate for use in databases of structural motifs, and suggests a general approach for the determination of protein global folds using conformational restraints derived only from backbone atoms

  2. Production, purification and oxidative folding of the mouse recombinant prion protein

    Czech Academy of Sciences Publication Activity Database

    Pavlíček, A.; Bednárová, Lucie; Holada, K.

    2007-01-01

    Roč. 52, č. 4 (2007), s. 391-397 ISSN 0015-5632 R&D Projects: GA ČR GD310/05/H533 Grant - others:GA ČR(CZ) GA310/04/0419 Institutional research plan: CEZ:AV0Z40550506 Keywords : recombinant prion protein * production * purification * folding Subject RIV: CE - Biochemistry Impact factor: 0.989, year: 2007 http://www.biomed.cas.cz/mbu/folia/

  3. Changing folding and binding stability in a viral coat protein: a comparison between substitutions accessible through mutation and those fixed by natural selection.

    Science.gov (United States)

    Miller, Craig R; Lee, Kuo Hao; Wichman, Holly A; Ytreberg, F Marty

    2014-01-01

    Previous studies have shown that most random amino acid substitutions destabilize protein folding (i.e. increase the folding free energy). No analogous studies have been carried out for protein-protein binding. Here we use a structure-based model of the major coat protein in a simple virus, bacteriophage φX174, to estimate the free energy of folding of a single coat protein and binding of five coat proteins within a pentameric unit. We confirm and extend previous work in finding that most accessible substitutions destabilize both protein folding and protein-protein binding. We compare the pool of accessible substitutions with those observed among the φX174-like wild phage and in experimental evolution with φX174. We find that observed substitutions have smaller effects on stability than expected by chance. An analysis of adaptations at high temperatures suggests that selection favors either substitutions with no effect on stability or those that simultaneously stabilize protein folding and slightly destabilize protein binding. We speculate that these mutations might involve adjusting the rate of capsid assembly. At normal laboratory temperature there is little evidence of directional selection. Finally, we show that cumulative changes in stability are highly variable; sometimes they are well beyond the bounds of single substitution changes and sometimes they are not. The variation leads us to conclude that phenotype selection acts on more than just stability. Instances of larger cumulative stability change (never via a single substitution despite their availability) lead us to conclude that selection views stability at a local, not a global, level.

  4. Theory of the Protein Equilibrium Population Snapshot by H/D Exchange Electrospray Ionization Mass Spectrometry (PEPS-HDX-ESI-MS) Method used to obtain Protein Folding Energies/Rates and Selected Supporting Experimental Evidence.

    Science.gov (United States)

    Liyanage, Rohana; Devarapalli, Nagarjuna; Pyland, Derek B; Puckett, Latisha M; Phan, N H; Starch, Joel A; Okimoto, Mark R; Gidden, Jennifer; Stites, Wesley E; Lay, Jackson O

    2012-12-15

    Protein equilibrium snapshot by hydrogen/deuterium exchange electrospray ionization mass spectrometry (PEPS-HDX-ESI-MS or PEPS) is a method recently introduced for estimating protein folding energies and rates. Herein we describe the basis for this method using both theory and new experiments. Benchmark experiments were conducted using ubiquitin because of the availability of reference data for folding and unfolding rates from NMR studies. A second set of experiments was also conducted to illustrate the surprising resilience of the PEPS to changes in HDX time, using staphylococcal nuclease and time frames ranging from a few seconds to several minutes. Theory suggests that PEPS experiments should be conducted at relatively high denaturant concentrations, where the protein folding/unfolding rates are slow with respect to HDX and the life times of both the closed and open states are long enough to be sampled experimentally. Upon deliberate denaturation, changes in folding/unfolding are correlated with associated changes in the ESI-MS signal upon fast HDX. When experiments are done quickly, typically within a few seconds, ESI-MS signals, corresponding to the equilibrium population of the native (closed) and denatured (open) states can both be detected. The interior of folded proteins remains largely un-exchanged. Amongst MS methods, the simultaneous detection of both states in the spectrum is unique to PEPS and provides a "snapshot" of these populations. The associated ion intensities are used to estimate the protein folding equilibrium constant (or the free energy change, ΔG). Linear extrapolation method (LEM) plots of derived ΔG values for each denaturant concentration can then be used to calculate ΔG in the absence of denaturant, ΔG(H(2)O). In accordance with the requirement for detection of signals for both the folded and unfolded states, this theoretical framework predicts that PEPS experiments work best at the middle of the denaturation curve where natured

  5. ProteinShop: A tool for interactive protein manipulation and steering

    Energy Technology Data Exchange (ETDEWEB)

    Crivelli, Silvia; Kreylos, Oliver; Max, Nelson; Hamann, Bernd; Bethel, Wes

    2004-05-25

    We describe ProteinShop, a new visualization tool that streamlines and simplifies the process of determining optimal protein folds. ProteinShop may be used at different stages of a protein structure prediction process. First, it can create protein configurations containing secondary structures specified by the user. Second, it can interactively manipulate protein fragments to achieve desired folds by adjusting the dihedral angles of selected coil regions using an Inverse Kinematics method. Last, it serves as a visual framework to monitor and steer a protein structure prediction process that may be running on a remote machine. ProteinShop was used to create initial configurations for a protein structure prediction method developed by a team that competed in CASP5. ProteinShop's use accelerated the process of generating initial configurations, reducing the time required from days to hours. This paper describes the structure of ProteinShop and discusses its main features.

  6. Ancylostoma ceylanicum Excretory-Secretory Protein 2 Adopts a Netrin-Like Fold and Defines a Novel Family of Nematode Proteins

    Energy Technology Data Exchange (ETDEWEB)

    K Kucera; L Harrison; M Cappello; Y Modis

    2011-12-31

    Hookworms are human parasites that have devastating effects on global health, particularly in underdeveloped countries. Ancylostoma ceylanicum infects humans and animals, making it a useful model organism to study disease pathogenesis. A. ceylanicum excretory-secretory protein 2 (AceES-2), a highly immunoreactive molecule secreted by adult worms at the site of intestinal attachment, is partially protective when administered as a mucosal vaccine against hookworm anemia. The crystal structure of AceES-2 determined at 1.75 {angstrom} resolution shows that it adopts a netrin-like fold similar to that found in tissue inhibitors of matrix metalloproteases (TIMPs) and in complement factors C3 and C5. However, recombinant AceES-2 does not significantly inhibit the 10 most abundant human matrix metalloproteases or complement-mediated cell lysis. The presence of a highly acidic surface on AceES-2 suggests that it may function as a cytokine decoy receptor. Several small nematode proteins that have been annotated as TIMPs or netrin-domain-containing proteins display sequence homology in structurally important regions of AceES-2's netrin-likefold. Together, our results suggest that AceES-2 defines a novel family of nematode netrin-like proteins, which may function to modulate the host immune response to hookworm and other parasites.

  7. Improving protein fold recognition and structural class prediction accuracies using physicochemical properties of amino acids.

    Science.gov (United States)

    Raicar, Gaurav; Saini, Harsh; Dehzangi, Abdollah; Lal, Sunil; Sharma, Alok

    2016-08-07

    Predicting the three-dimensional (3-D) structure of a protein is an important task in the field of bioinformatics and biological sciences. However, directly predicting the 3-D structure from the primary structure is hard to achieve. Therefore, predicting the fold or structural class of a protein sequence is generally used as an intermediate step in determining the protein's 3-D structure. For protein fold recognition (PFR) and structural class prediction (SCP), two steps are required - feature extraction step and classification step. Feature extraction techniques generally utilize syntactical-based information, evolutionary-based information and physicochemical-based information to extract features. In this study, we explore the importance of utilizing the physicochemical properties of amino acids for improving PFR and SCP accuracies. For this, we propose a Forward Consecutive Search (FCS) scheme which aims to strategically select physicochemical attributes that will supplement the existing feature extraction techniques for PFR and SCP. An exhaustive search is conducted on all the existing 544 physicochemical attributes using the proposed FCS scheme and a subset of physicochemical attributes is identified. Features extracted from these selected attributes are then combined with existing syntactical-based and evolutionary-based features, to show an improvement in the recognition and prediction performance on benchmark datasets. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. The formation of a native-like structure containing eight conserved hydrophobic residues is rate limiting in two-state protein folding of ACBP

    DEFF Research Database (Denmark)

    Kragelund, Birthe Brandt; Osmark, Peter; Neergaard, Thomas B.

    1999-01-01

    The acyl-coenzyme A-binding proteins (ACBPs) contain 26 highly conserved sequence positions. The majority of these have been mutated in the bovine protein, and their influence on the rate of two-state folding and unfolding has been measured. The results identify eight sequence positions, out of 24...... probed, that are critical for fast productive folding. The residues are all hydrophobic and located in the interface between the N- and C-terminal helices. The results suggest that one specific site dominated by conserved hydrophobic residues forms the structure of the productive rate-determining folding...... step and that a sequential framework model can describe the protein folding reaction....

  9. Protein P7 of the cystovirus φ6 is located at the three-fold axis of the unexpanded procapsid.

    Directory of Open Access Journals (Sweden)

    Garrett Katz

    Full Text Available The objective of this study was to determine the location of protein P7, the RNA packaging factor, in the procapsid of the φ6 cystovirus. A comparison of cryo-electron microscopy high-resolution single particle reconstructions of the φ6 complete unexpanded procapsid, the protein P2-minus procapsid (P2 is the RNA directed RNA-polymerase, and the P7-minus procapsid, show that prior to RNA packaging the P7 protein is located near the three-fold axis of symmetry. Difference maps highlight the precise position of P7 and demonstrate that in P7-minus particles the P2 proteins are less localized with reduced densities at the three-fold axes. We propose that P7 performs the mechanical function of stabilizing P2 on the inner protein P1 shell which ensures that entering viral single-stranded RNA is replicated.

  10. Multiple scales and phases in discrete chains with application to folded proteins

    Science.gov (United States)

    Sinelnikova, A.; Niemi, A. J.; Nilsson, Johan; Ulybyshev, M.

    2018-05-01

    Chiral heteropolymers such as large globular proteins can simultaneously support multiple length scales. The interplay between the different scales brings about conformational diversity, determines the phase properties of the polymer chain, and governs the structure of the energy landscape. Most importantly, multiple scales produce complex dynamics that enable proteins to sustain live matter. However, at the moment there is incomplete understanding of how to identify and distinguish the various scales that determine the structure and dynamics of a complex protein. Here we address this impending problem. We develop a methodology with the potential to systematically identify different length scales, in the general case of a linear polymer chain. For this we introduce and analyze the properties of an order parameter that can both reveal the presence of different length scales and can also probe the phase structure. We first develop our concepts in the case of chiral homopolymers. We introduce a variant of Kadanoff's block-spin transformation to coarse grain piecewise linear chains, such as the C α backbone of a protein. We derive analytically, and then verify numerically, a number of properties that the order parameter can display, in the case of a chiral polymer chain. In particular, we propose that in the case of a chiral heteropolymer the order parameter can reveal traits of several different phases, contingent on the length scale at which it is scrutinized. We confirm that this is the case with crystallographic protein structures in the Protein Data Bank. Thus our results suggest relations between the scales, the phases, and the complexity of folding pathways.

  11. Folding behavior of four silks of giant honey bee reflects the evolutionary conservation of aculeate silk proteins.

    Science.gov (United States)

    Maitip, Jakkrawut; Trueman, Holly E; Kaehler, Benjamin D; Huttley, Gavin A; Chantawannakul, Panuwan; Sutherland, Tara D

    2015-04-01

    Multiple gene duplication events in the precursor of the Aculeata (bees, ants, hornets) gave rise to four silk genes. Whilst these homologs encode proteins with similar amino acid composition and coiled coil structure, the retention of all four homologs implies they each are important. In this study we identified, produced and characterized the four silk proteins from Apis dorsata, the giant Asian honeybee. The proteins were readily purified, allowing us to investigate the folding behavior of solutions of individual proteins in comparison to mixtures of all four proteins at concentrations where they assemble into their native coiled coil structure. In contrast to solutions of any one protein type, solutions of a mixture of the four proteins formed coiled coils that were stable against dilution and detergent denaturation. The results are consistent with the formation of a heteromeric coiled coil protein complex. The mechanism of silk protein coiled coil formation and evolution is discussed in light of these results. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. Folding machineries displayed on a cation-exchanger for the concerted refolding of cysteine- or proline-rich proteins

    Directory of Open Access Journals (Sweden)

    Lee Dae-Hee

    2009-03-01

    Full Text Available Abstract Background Escherichia coli has been most widely used for the production of valuable recombinant proteins. However, over-production of heterologous proteins in E. coli frequently leads to their misfolding and aggregation yielding inclusion bodies. Previous attempts to refold the inclusion bodies into bioactive forms usually result in poor recovery and account for the major cost in industrial production of desired proteins from recombinant E. coli. Here, we describe the successful use of the immobilized folding machineries for in vitro refolding with the examples of high yield refolding of a ribonuclease A (RNase A and cyclohexanone monooxygenase (CHMO. Results We have generated refolding-facilitating media immobilized with three folding machineries, mini-chaperone (a monomeric apical domain consisting of residues 191–345 of GroEL and two foldases (DsbA and human peptidyl-prolyl cis-trans isomerase by mimicking oxidative refolding chromatography. For efficient and simple purification and immobilization simultaneously, folding machineries were fused with the positively-charged consecutive 10-arginine tag at their C-terminal. The immobilized folding machineries were fully functional when assayed in a batch mode. When the refolding-facilitating matrices were applied to the refolding of denatured and reduced RNase A and CHMO, both of which contain many cysteine and proline residues, RNase A and CHMO were recovered in 73% and 53% yield of soluble protein with full enzyme activity, respectively. Conclusion The refolding-facilitating media presented here could be a cost-efficient platform and should be applicable to refold a wide range of E. coli inclusion bodies in high yield with biological function.

  13. A novel member of the split betaalphabeta fold: Solution structure of the hypothetical protein YML108W from Saccharomyces cerevisiae

    International Nuclear Information System (INIS)

    Pineda-Lucena, Antonio; Liao, Jack; Cort, John R.; Yee, Adelinda; Kennedy, Michael A.; Edwards, Aled M.

    2003-05-01

    As part of the Northeast Structural Genomics Consortium pilot project focused on small eukaryotic proteins and protein domains, we have determined the NMR structure of the protein encoded by open reading frame YML108W from Saccharomyces cerevisiae. YML108W belongs to one of the numerous structural proteomics targets whose biological function is unknown. Moreover, this protein does not have sequence similarity to any other protein. The NMR structure of YML108W consists of a four-stranded b-sheet with strand order 2143 and two a-helices, with an overall topology of bbabba. Strand b1 runs parallel to b4, and b2:b1 and b4:b3 pairs are arranged in an antiparallel fashion. While this fold belongs to the split bab family, it appears to be unique among this family; it is a novel arrangement of secondary structure, thereby expanding the universe of protein folds

  14. Thermodynamic Stabilization of the Folded Domain of Prion Protein Inhibits Prion Infection in Vivo

    Directory of Open Access Journals (Sweden)

    Qingzhong Kong

    2013-07-01

    Full Text Available Prion diseases, or transmissible spongiform encephalopathies (TSEs, are associated with the conformational conversion of the cellular prion protein, PrPC, into a protease-resistant form, PrPSc. Here, we show that mutation-induced thermodynamic stabilization of the folded, α-helical domain of PrPC has a dramatic inhibitory effect on the conformational conversion of prion protein in vitro, as well as on the propagation of TSE disease in vivo. Transgenic mice expressing a human prion protein variant with increased thermodynamic stability were found to be much more resistant to infection with the TSE agent than those expressing wild-type human prion protein, in both the primary passage and three subsequent subpassages. These findings not only provide a line of evidence in support of the protein-only model of TSEs but also yield insight into the molecular nature of the PrPC→PrPSc conformational transition, and they suggest an approach to the treatment of prion diseases.

  15. Course 12: Proteins: Structural, Thermodynamic and Kinetic Aspects

    Science.gov (United States)

    Finkelstein, A. V.

    1 Introduction 2 Overview of protein architectures and discussion of physical background of their natural selection 2.1 Protein structures 2.2 Physical selection of protein structures 3 Thermodynamic aspects of protein folding 3.1 Reversible denaturation of protein structures 3.2 What do denatured proteins look like? 3.3 Why denaturation of a globular protein is the first-order phase transition 3.4 "Gap" in energy spectrum: The main characteristic that distinguishes protein chains from random polymers 4 Kinetic aspects of protein folding 4.1 Protein folding in vivo 4.2 Protein folding in vitro (in the test-tube) 4.3 Theory of protein folding rates and solution of the Levinthal paradox

  16. Dependence of α-helical and β-sheet amino acid propensities on the overall protein fold type

    Directory of Open Access Journals (Sweden)

    Fujiwara Kazuo

    2012-08-01

    Full Text Available Abstract Background A large number of studies have been carried out to obtain amino acid propensities for α-helices and β-sheets. The obtained propensities for α-helices are consistent with each other, and the pair-wise correlation coefficient is frequently high. On the other hand, the β-sheet propensities obtained by several studies differed significantly, indicating that the context significantly affects β-sheet propensity. Results We calculated amino acid propensities for α-helices and β-sheets for 39 and 24 protein folds, respectively, and addressed whether they correlate with the fold. The propensities were also calculated for exposed and buried sites, respectively. Results showed that α-helix propensities do not differ significantly by fold, but β-sheet propensities are diverse and depend on the fold. The propensities calculated for exposed sites and buried sites are similar for α-helix, but such is not the case for the β-sheet propensities. We also found some fold dependence on amino acid frequency in β-strands. Folds with a high Ser, Thr and Asn content at exposed sites in β-strands tend to have a low Leu, Ile, Glu, Lys and Arg content (correlation coefficient = −0.90 and to have flat β-sheets. At buried sites in β-strands, the content of Tyr, Trp, Gln and Ser correlates negatively with the content of Val, Ile and Leu (correlation coefficient = −0.93. "All-β" proteins tend to have a higher content of Tyr, Trp, Gln and Ser, whereas "α/β" proteins tend to have a higher content of Val, Ile and Leu. Conclusions The α-helix propensities are similar for all folds and for exposed and buried residues. However, β-sheet propensities calculated for exposed residues differ from those for buried residues, indicating that the exposed-residue fraction is one of the major factors governing amino acid composition in β-strands. Furthermore, the correlations we detected suggest that amino acid composition is related to folding

  17. Imbalance of heterologous protein folding and disulfide bond formation rates yields runaway oxidative stress

    Directory of Open Access Journals (Sweden)

    Tyo Keith EJ

    2012-03-01

    Full Text Available Abstract Background The protein secretory pathway must process a wide assortment of native proteins for eukaryotic cells to function. As well, recombinant protein secretion is used extensively to produce many biologics and industrial enzymes. Therefore, secretory pathway dysfunction can be highly detrimental to the cell and can drastically inhibit product titers in biochemical production. Because the secretory pathway is a highly-integrated, multi-organelle system, dysfunction can happen at many levels and dissecting the root cause can be challenging. In this study, we apply a systems biology approach to analyze secretory pathway dysfunctions resulting from heterologous production of a small protein (insulin precursor or a larger protein (α-amylase. Results HAC1-dependent and independent dysfunctions and cellular responses were apparent across multiple datasets. In particular, processes involving (a degradation of protein/recycling amino acids, (b overall transcription/translation repression, and (c oxidative stress were broadly associated with secretory stress. Conclusions Apparent runaway oxidative stress due to radical production observed here and elsewhere can be explained by a futile cycle of disulfide formation and breaking that consumes reduced glutathione and produces reactive oxygen species. The futile cycle is dominating when protein folding rates are low relative to disulfide bond formation rates. While not strictly conclusive with the present data, this insight does provide a molecular interpretation to an, until now, largely empirical understanding of optimizing heterologous protein secretion. This molecular insight has direct implications on engineering a broad range of recombinant proteins for secretion and provides potential hypotheses for the root causes of several secretory-associated diseases.

  18. The Multiple-Minima Problem in Protein Folding

    Science.gov (United States)

    Scheraga, Harold A.

    1991-10-01

    The conformational energy surface of a polypeptide or protein has many local minima, and conventional energy minimization procedures reach only a local minimum (near the starting point of the optimization algorithm) instead of the global minimum (the multiple-minima problem). Several procedures have been developed to surmount this problem, the most promising of which are: (a) build up procedure, (b) optimization of electrostatics, (c) Monte Carlo-plus-energy minimization, (d) electrostatically-driven Monte Carlo, (e) inclusion of distance restraints, (f) adaptive importance-sampling Monte Carlo, (g) relaxation of dimensionality, (h) pattern-recognition, and (i) diffusion equation method. These procedures have been applied to a variety of polypeptide structural problems, and the results of such computations are presented. These include the computation of the structures of open-chain and cyclic peptides, fibrous proteins and globular proteins. Present efforts are being devoted to scaling up these procedures from small polypeptides to proteins, to try to compute the three-dimensional structure of a protein from its amino sequence.

  19. A Method for Extracting the Free Energy Surface and Conformational Dynamics of Fast-Folding Proteins from Single Molecule Photon Trajectories

    Science.gov (United States)

    2015-01-01

    Single molecule fluorescence spectroscopy holds the promise of providing direct measurements of protein folding free energy landscapes and conformational motions. However, fulfilling this promise has been prevented by technical limitations, most notably, the difficulty in analyzing the small packets of photons per millisecond that are typically recorded from individual biomolecules. Such limitation impairs the ability to accurately determine conformational distributions and resolve sub-millisecond processes. Here we develop an analytical procedure for extracting the conformational distribution and dynamics of fast-folding proteins directly from time-stamped photon arrival trajectories produced by single molecule FRET experiments. Our procedure combines the maximum likelihood analysis originally developed by Gopich and Szabo with a statistical mechanical model that describes protein folding as diffusion on a one-dimensional free energy surface. Using stochastic kinetic simulations, we thoroughly tested the performance of the method in identifying diverse fast-folding scenarios, ranging from two-state to one-state downhill folding, as a function of relevant experimental variables such as photon count rate, amount of input data, and background noise. The tests demonstrate that the analysis can accurately retrieve the original one-dimensional free energy surface and microsecond folding dynamics in spite of the sub-megahertz photon count rates and significant background noise levels of current single molecule fluorescence experiments. Therefore, our approach provides a powerful tool for the quantitative analysis of single molecule FRET experiments of fast protein folding that is also potentially extensible to the analysis of any other biomolecular process governed by sub-millisecond conformational dynamics. PMID:25988351

  20. Protein folding pathology in domestic animals.

    Science.gov (United States)

    Gruys, Erik

    2004-10-01

    Fibrillar proteins form structural elements of cells and the extracellular matrix. Pathological lesions of fibrillar microanatomical structures, or secondary fibrillar changes in globular proteins are well known. A special group concerns histologically amorphous deposits, amyloid. The major characteristics of amyloid are: apple green birefringence after Congo red staining of histological sections, and non-branching 7-10 nm thick fibrils on electron microscopy revealing a high content of cross beta pleated sheets. About 25 different types of amyloid have been characterised. In animals, AA-amyloid is the most frequent type. Other types of amyloid in animals represent: AIAPP (in cats), AApoAI, AApoAII, localised AL-amyloid, amyloid in odontogenic or mammary tumors and amyloid in the brain. In old dogs Abeta and in sheep APrPsc-amyloid can be encountered. AA-amyloidosis is a systemic disorder with a precursor in blood, acute phase serum amyloid A (SAA). In chronic inflammatory processes AA-amyloid can be deposited. A rapid crystallization of SAA to amyloid fibrils on small beta-sheeted fragments, the 'amyloid enhancing factor' (AEF), is known and the AEF has been shown to penetrate the enteric barrier. Amyloid fibrils can aggregate from various precursor proteins in vitro in particular at acidic pH and when proteolytic fragments are formed. Molecular chaperones influence this process. Tissue data point to amyloid fibrillogenesis in lysosomes and near cell surfaces. A comparison can be made of the fibrillogenesis in prion diseases and in enhanced AA-amyloidosis. In the reactive form, acute phase SAA is the supply of the precursor protein, whereas in the prion diseases, cell membrane proteins form a structural source. Abeta-amyloid in brain tissue of aged dogs showing signs of dementia forms a canine counterpart of senile dementia of the Alzheimer type (ccSDAT) in man. Misfolded proteins remain potential food hazards. Developments concerning prevention of amyloidogenesis

  1. SAAFEC: Predicting the Effect of Single Point Mutations on Protein Folding Free Energy Using a Knowledge-Modified MM/PBSA Approach.

    Science.gov (United States)

    Getov, Ivan; Petukh, Marharyta; Alexov, Emil

    2016-04-07

    Folding free energy is an important biophysical characteristic of proteins that reflects the overall stability of the 3D structure of macromolecules. Changes in the amino acid sequence, naturally occurring or made in vitro, may affect the stability of the corresponding protein and thus could be associated with disease. Several approaches that predict the changes of the folding free energy caused by mutations have been proposed, but there is no method that is clearly superior to the others. The optimal goal is not only to accurately predict the folding free energy changes, but also to characterize the structural changes induced by mutations and the physical nature of the predicted folding free energy changes. Here we report a new method to predict the Single Amino Acid Folding free Energy Changes (SAAFEC) based on a knowledge-modified Molecular Mechanics Poisson-Boltzmann (MM/PBSA) approach. The method is comprised of two main components: a MM/PBSA component and a set of knowledge based terms delivered from a statistical study of the biophysical characteristics of proteins. The predictor utilizes a multiple linear regression model with weighted coefficients of various terms optimized against a set of experimental data. The aforementioned approach yields a correlation coefficient of 0.65 when benchmarked against 983 cases from 42 proteins in the ProTherm database. the webserver can be accessed via http://compbio.clemson.edu/SAAFEC/.

  2. Pharmacological chaperone reshapes the energy landscape for folding and aggregation of the prion protein

    Science.gov (United States)

    Gupta, Amar Nath; Neupane, Krishna; Rezajooei, Negar; Cortez, Leonardo M.; Sim, Valerie L.; Woodside, Michael T.

    2016-06-01

    The development of small-molecule pharmacological chaperones as therapeutics for protein misfolding diseases has proven challenging, partly because their mechanism of action remains unclear. Here we study Fe-TMPyP, a tetrapyrrole that binds to the prion protein PrP and inhibits misfolding, examining its effects on PrP folding at the single-molecule level with force spectroscopy. Single PrP molecules are unfolded with and without Fe-TMPyP present using optical tweezers. Ligand binding to the native structure increases the unfolding force significantly and alters the transition state for unfolding, making it more brittle and raising the barrier height. Fe-TMPyP also binds the unfolded state, delaying native refolding. Furthermore, Fe-TMPyP binding blocks the formation of a stable misfolded dimer by interfering with intermolecular interactions, acting in a similar manner to some molecular chaperones. The ligand thus promotes native folding by stabilizing the native state while also suppressing interactions driving aggregation.

  3. RECOVERY ACT - Thylakoid Assembly and Folded Protein Transport by the Tat Pathway

    Energy Technology Data Exchange (ETDEWEB)

    Dabney-Smith, Carole [Miami Univ., Oxford, OH (United States)

    2016-07-18

    Assembly of functional photosystems complete with necessary intrinsic (membrane-bound) and extrinsic proteins requires the function of at least 3 protein transport pathways in thylakoid membranes. Our research focuses on one of those pathways, a unique and essential protein transport pathway found in the chloroplasts of plants, bacteria, and some archaebacteria, the Twin arginine translocation (Tat) system. The chloroplast Tat (cpTat) system is thought to be responsible for the proper location of ~50% of thylakoid lumen proteins, several of which are necessary for proper photosystem assembly, maintenance, and function. Specifically, cpTat systems are unique because they transport fully folded and assembled proteins across ion tight membranes using only three membrane components, Tha4, Hcf106, and cpTatC, and the protonmotive force generated by photosynthesis. Despite the importance of the cpTat system in plants, the mechanism of transport of a folded precursor is not well known. Our long-term goal is to investigate the role protein transport systems have on organelle biogenesis, particularly the assembly of membrane protein complexes in thylakoids of chloroplasts. The objective of this proposal is to correlate structural changes in the membrane-bound cpTat component, Tha4, to the mechanism of translocation of folded-precursor substrates across the membrane bilayer by using a cysteine accessibility and crosslinking approach. Our central hypothesis is that the precursor passes through a proteinaceous pore of assembled Tha4 protomers that have undergone a conformational or topological change in response to transport. This research is predicated upon the observations that Tha4 exists in molar excess in the membrane relative to the other cpTat components; its regulated assembly to the precursor-bound receptor; and our data showing oligomerization of Tha4 into very large complexes in response to transport. Our rationale for these studies is that understanding cp

  4. N-Terminal Domains in Two-Domain Proteins Are Biased to Be Shorter and Predicted to Fold Faster Than Their C-Terminal Counterparts

    Directory of Open Access Journals (Sweden)

    Etai Jacob

    2013-04-01

    Full Text Available Computational analysis of proteomes in all kingdoms of life reveals a strong tendency for N-terminal domains in two-domain proteins to have shorter sequences than their neighboring C-terminal domains. Given that folding rates are affected by chain length, we asked whether the tendency for N-terminal domains to be shorter than their neighboring C-terminal domains reflects selection for faster-folding N-terminal domains. Calculations of absolute contact order, another predictor of folding rate, provide additional evidence that N-terminal domains tend to fold faster than their neighboring C-terminal domains. A possible explanation for this bias, which is more pronounced in prokaryotes than in eukaryotes, is that faster folding of N-terminal domains reduces the risk for protein aggregation during folding by preventing formation of nonnative interdomain interactions. This explanation is supported by our finding that two-domain proteins with a shorter N-terminal domain are much more abundant than those with a shorter C-terminal domain.

  5. Web-based computational chemistry education with CHARMMing II: Coarse-grained protein folding.

    Directory of Open Access Journals (Sweden)

    Frank C Pickard

    2014-07-01

    Full Text Available A lesson utilizing a coarse-grained (CG Gō-like model has been implemented into the CHARMM INterface and Graphics (CHARMMing web portal (www.charmming.org to the Chemistry at HARvard Macromolecular Mechanics (CHARMM molecular simulation package. While widely used to model various biophysical processes, such as protein folding and aggregation, CG models can also serve as an educational tool because they can provide qualitative descriptions of complex biophysical phenomena for a relatively cheap computational cost. As a proof of concept, this lesson demonstrates the construction of a CG model of a small globular protein, its simulation via Langevin dynamics, and the analysis of the resulting data. This lesson makes connections between modern molecular simulation techniques and topics commonly presented in an advanced undergraduate lecture on physical chemistry. It culminates in a straightforward analysis of a short dynamics trajectory of a small fast folding globular protein; we briefly describe the thermodynamic properties that can be calculated from this analysis. The assumptions inherent in the model and the data analysis are laid out in a clear, concise manner, and the techniques used are consistent with those employed by specialists in the field of CG modeling. One of the major tasks in building the Gō-like model is determining the relative strength of the nonbonded interactions between coarse-grained sites. New functionality has been added to CHARMMing to facilitate this process. The implementation of these features into CHARMMing helps automate many of the tedious aspects of constructing a CG Gō model. The CG model builder and its accompanying lesson should be a valuable tool to chemistry students, teachers, and modelers in the field.

  6. Solution structure of an archaeal DNA binding protein with an eukaryotic zinc finger fold.

    Directory of Open Access Journals (Sweden)

    Florence Guillière

    Full Text Available While the basal transcription machinery in archaea is eukaryal-like, transcription factors in archaea and their viruses are usually related to bacterial transcription factors. Nevertheless, some of these organisms show predicted classical zinc fingers motifs of the C2H2 type, which are almost exclusively found in proteins of eukaryotes and most often associated with transcription regulators. In this work, we focused on the protein AFV1p06 from the hyperthermophilic archaeal virus AFV1. The sequence of the protein consists of the classical eukaryotic C2H2 motif with the fourth histidine coordinating zinc missing, as well as of N- and C-terminal extensions. We showed that the protein AFV1p06 binds zinc and solved its solution structure by NMR. AFV1p06 displays a zinc finger fold with a novel structure extension and disordered N- and C-termini. Structure calculations show that a glutamic acid residue that coordinates zinc replaces the fourth histidine of the C2H2 motif. Electromobility gel shift assays indicate that the protein binds to DNA with different affinities depending on the DNA sequence. AFV1p06 is the first experimentally characterised archaeal zinc finger protein with a DNA binding activity. The AFV1p06 protein family has homologues in diverse viruses of hyperthermophilic archaea. A phylogenetic analysis points out a common origin of archaeal and eukaryotic C2H2 zinc fingers.

  7. Evolution of an intricate J-protein network driving protein disaggregation in eukaryotes.

    Science.gov (United States)

    Nillegoda, Nadinath B; Stank, Antonia; Malinverni, Duccio; Alberts, Niels; Szlachcic, Anna; Barducci, Alessandro; De Los Rios, Paolo; Wade, Rebecca C; Bukau, Bernd

    2017-05-15

    Hsp70 participates in a broad spectrum of protein folding processes extending from nascent chain folding to protein disaggregation. This versatility in function is achieved through a diverse family of J-protein cochaperones that select substrates for Hsp70. Substrate selection is further tuned by transient complexation between different classes of J-proteins, which expands the range of protein aggregates targeted by metazoan Hsp70 for disaggregation. We assessed the prevalence and evolutionary conservation of J-protein complexation and cooperation in disaggregation. We find the emergence of a eukaryote-specific signature for interclass complexation of canonical J-proteins. Consistently, complexes exist in yeast and human cells, but not in bacteria, and correlate with cooperative action in disaggregation in vitro. Signature alterations exclude some J-proteins from networking, which ensures correct J-protein pairing, functional network integrity and J-protein specialization. This fundamental change in J-protein biology during the prokaryote-to-eukaryote transition allows for increased fine-tuning and broadening of Hsp70 function in eukaryotes.

  8. A Stevedore's protein knot.

    Directory of Open Access Journals (Sweden)

    Daniel Bölinger

    2010-04-01

    Full Text Available Protein knots, mostly regarded as intriguing oddities, are gradually being recognized as significant structural motifs. Seven distinctly knotted folds have already been identified. It is by and large unclear how these exceptional structures actually fold, and only recently, experiments and simulations have begun to shed some light on this issue. In checking the new protein structures submitted to the Protein Data Bank, we encountered the most complex and the smallest knots to date: A recently uncovered alpha-haloacid dehalogenase structure contains a knot with six crossings, a so-called Stevedore knot, in a projection onto a plane. The smallest protein knot is present in an as yet unclassified protein fragment that consists of only 92 amino acids. The topological complexity of the Stevedore knot presents a puzzle as to how it could possibly fold. To unravel this enigma, we performed folding simulations with a structure-based coarse-grained model and uncovered a possible mechanism by which the knot forms in a single loop flip.

  9. Can a pairwise contact potential stabilize native protein folds against decoys obtained by threading?

    Science.gov (United States)

    Vendruscolo, M; Najmanovich, R; Domany, E

    2000-02-01

    We present a method to derive contact energy parameters from large sets of proteins. The basic requirement on which our method is based is that for each protein in the database the native contact map has lower energy than all its decoy conformations that are obtained by threading. Only when this condition is satisfied one can use the proposed energy function for fold identification. Such a set of parameters can be found (by perceptron learning) if Mp, the number of proteins in the database, is not too large. Other aspects that influence the existence of such a solution are the exact definition of contact and the value of the critical distance Rc, below which two residues are considered to be in contact. Another important novel feature of our approach is its ability to determine whether an energy function of some suitable proposed form can or cannot be parameterized in a way that satisfies our basic requirement. As a demonstration of this, we determine the region in the (Rc, Mp) plane in which the problem is solvable, i.e., we can find a set of contact parameters that stabilize simultaneously all the native conformations. We show that for large enough databases the contact approximation to the energy cannot stabilize all the native folds even against the decoys obtained by gapless threading.

  10. Rapid expansion of the protein disulfide isomerase gene family facilitates the folding of venom peptides

    DEFF Research Database (Denmark)

    Safavi-Hemami, Helena; Li, Qing; Jackson, Ronneshia L.

    2016-01-01

    Formation of correct disulfide bonds in the endoplasmic reticulum is a crucial step for folding proteins destined for secretion. Protein disulfide isomerases (PDIs) play a central role in this process. We report a previously unidentified, hypervariable family of PDIs that represents the most...... diverse gene family of oxidoreductases described in a single genus to date. These enzymes are highly expressed specifically in the venom glands of predatory cone snails, animals that synthesize a remarkably diverse set of cysteine-rich peptide toxins (conotoxins). Enzymes in this PDI family, termed...

  11. Multi-scaled explorations of binding-induced folding of intrinsically disordered protein inhibitor IA3 to its target enzyme.

    Directory of Open Access Journals (Sweden)

    Jin Wang

    2011-04-01

    Full Text Available Biomolecular function is realized by recognition, and increasing evidence shows that recognition is determined not only by structure but also by flexibility and dynamics. We explored a biomolecular recognition process that involves a major conformational change - protein folding. In particular, we explore the binding-induced folding of IA3, an intrinsically disordered protein that blocks the active site cleft of the yeast aspartic proteinase saccharopepsin (YPrA by folding its own N-terminal residues into an amphipathic alpha helix. We developed a multi-scaled approach that explores the underlying mechanism by combining structure-based molecular dynamics simulations at the residue level with a stochastic path method at the atomic level. Both the free energy profile and the associated kinetic paths reveal a common scheme whereby IA3 binds to its target enzyme prior to folding itself into a helix. This theoretical result is consistent with recent time-resolved experiments. Furthermore, exploration of the detailed trajectories reveals the important roles of non-native interactions in the initial binding that occurs prior to IA3 folding. In contrast to the common view that non-native interactions contribute only to the roughness of landscapes and impede binding, the non-native interactions here facilitate binding by reducing significantly the entropic search space in the landscape. The information gained from multi-scaled simulations of the folding of this intrinsically disordered protein in the presence of its binding target may prove useful in the design of novel inhibitors of aspartic proteinases.

  12. Fiscal 1999 achievement report on research and development project on intellectual infrastructure creation and utilization technologies. Development of efficient protein expression system (Development of efficient protein expression system utilizing protein folding mechanism of hyperthermophilic bacteria); 1999 nendo kokoritsu tanpakushitsu hatsugen system no kaihatsu seika hokokusho. Chokonetsukin no tanpakushitsu oritatami kiko wo riyoshita kokoritsu tanpakushitsu hatsugen system no kaihatsu

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2000-03-01

    Efforts were exerted to achieve efficient expression of proteins of hyperthermophilic bacteria, hyperthermophilic archaeabacteria in particular, using a heterogene expression system in which Escherichia coli was the host. In an effort to search for genes related to protein folding and to elucidate the mechanism of folding, chaperonin and prefoldin subunit genes, out of various factors participating in protein folding in hyperthermophilic archaeabacteria, were cloned, and expressed in Escherichia coli. As a system for analyzing protein folding reaction, an experimental system was established on a substrate comprising isopropyl malate dehydrogenase, citrate synthase, glucose dehydrogenase, and a green fluorescent protein. Studies were further conducted to elucidate the mechanism of expression of enzyme genes in Escherichia coli for the establishment of a mass production method for useful enzymes. Also carried out was the research and development of an element technology evaluation system involving protein expression. (NEDO)

  13. Protein- protein interaction detection system using fluorescent protein microdomains

    Science.gov (United States)

    Waldo, Geoffrey S.; Cabantous, Stephanie

    2010-02-23

    The invention provides a protein labeling and interaction detection system based on engineered fragments of fluorescent and chromophoric proteins that require fused interacting polypeptides to drive the association of the fragments, and further are soluble and stable, and do not change the solubility of polypeptides to which they are fused. In one embodiment, a test protein X is fused to a sixteen amino acid fragment of GFP (.beta.-strand 10, amino acids 198-214), engineered to not perturb fusion protein solubility. A second test protein Y is fused to a sixteen amino acid fragment of GFP (.beta.-strand 11, amino acids 215-230), engineered to not perturb fusion protein solubility. When X and Y interact, they bring the GFP strands into proximity, and are detected by complementation with a third GFP fragment consisting of GFP amino acids 1-198 (strands 1-9). When GFP strands 10 and 11 are held together by interaction of protein X and Y, they spontaneous association with GFP strands 1-9, resulting in structural complementation, folding, and concomitant GFP fluorescence.

  14. Conserved nucleation sites reinforce the significance of Phi value analysis in protein-folding studies.

    Science.gov (United States)

    Gianni, Stefano; Jemth, Per

    2014-07-01

    The only experimental strategy to address the structure of folding transition states, the so-called Φ value analysis, relies on the synergy between site directed mutagenesis and the measurement of reaction kinetics. Despite its importance, the Φ value analysis has been often criticized and its power to pinpoint structural information has been questioned. In this hypothesis, we demonstrate that comparing the Φ values between proteins not only allows highlighting the robustness of folding pathways but also provides per se a strong validation of the method. © 2014 International Union of Biochemistry and Molecular Biology.

  15. Exploring the universe of protein structures beyond the Protein Data Bank.

    Science.gov (United States)

    Cossio, Pilar; Trovato, Antonio; Pietrucci, Fabio; Seno, Flavio; Maritan, Amos; Laio, Alessandro

    2010-11-04

    It is currently believed that the atlas of existing protein structures is faithfully represented in the Protein Data Bank. However, whether this atlas covers the full universe of all possible protein structures is still a highly debated issue. By using a sophisticated numerical approach, we performed an exhaustive exploration of the conformational space of a 60 amino acid polypeptide chain described with an accurate all-atom interaction potential. We generated a database of around 30,000 compact folds with at least of secondary structure corresponding to local minima of the potential energy. This ensemble plausibly represents the universe of protein folds of similar length; indeed, all the known folds are represented in the set with good accuracy. However, we discover that the known folds form a rather small subset, which cannot be reproduced by choosing random structures in the database. Rather, natural and possible folds differ by the contact order, on average significantly smaller in the former. This suggests the presence of an evolutionary bias, possibly related to kinetic accessibility, towards structures with shorter loops between contacting residues. Beside their conceptual relevance, the new structures open a range of practical applications such as the development of accurate structure prediction strategies, the optimization of force fields, and the identification and design of novel folds.

  16. A structural basis for cellular uptake of GST-fold proteins.

    Directory of Open Access Journals (Sweden)

    Melanie J Morris

    Full Text Available It has recently emerged that glutathione transferase enzymes (GSTs and other structurally related molecules can be translocated from the external medium into many different cell types. In this study we aim to explore in detail, the structural features that govern cell translocation and by dissecting the human GST enzyme GSTM2-2 we quantatively demonstrate that the α-helical C-terminal domain (GST-C is responsible for this property. Attempts to further examine the constituent helices within GST-C resulted in a reduction in cell translocation efficiency, indicating that the intrinsic GST-C domain structure is necessary for maximal cell translocation capacity. In particular, it was noted that the α-6 helix of GST-C plays a stabilising role in the fold of this domain. By destabilising the conformation of GST-C, an increase in cell translocation efficiency of up to ∼2-fold was observed. The structural stability profiles of these protein constructs have been investigated by circular dichroism and differential scanning fluorimetry measurements and found to impact upon their cell translocation efficiency. These experiments suggest that the globular, helical domain in the 'GST-fold' structural motif plays a role in influencing cellular uptake, and that changes that affect the conformational stability of GST-C can significantly influence cell translocation efficiency.

  17. Architectures and Functional Coverage of Protein-Protein Interfaces

    Science.gov (United States)

    Tuncbag, Nurcan; Gursoy, Attila; Guney, Emre; Nussinov, Ruth; Keskin, Ozlem

    2008-01-01

    The diverse range of cellular functions is performed by a limited number of protein folds existing in nature. One may similarly expect that cellular functional diversity would be covered by a limited number of protein-protein interface architectures. Here, we present 8205 interface clusters, each representing unique interface architecture. This dataset of protein-protein interfaces is analyzed and compared with older datasets. We observe that the number of both biological and crystal interfaces increase significantly compared to the number of PDB entries. Further, we find that the number of distinct interface architectures grows at a much faster rate than the number of folds and is yet to level off. We further analyze the growth trend of the functional coverage by constructing functional interaction networks from interfaces. The functional coverage is also found to steadily increase. Interestingly, we also observe that despite the diversity of interface architectures, some are more favorable and frequently used, and of particular interest, those are the ones which are also preferred in single chains. PMID:18620705

  18. Improved in Vitro Folding of the Y2 G Protein-Coupled Receptor into Bicelles

    Directory of Open Access Journals (Sweden)

    Peter Schmidt

    2018-01-01

    Full Text Available Prerequisite for structural studies on G protein-coupled receptors is the preparation of highly concentrated, stable, and biologically active receptor samples in milligram amounts of protein. Here, we present an improved protocol for Escherichia coli expression, functional refolding, and reconstitution into bicelles of the human neuropeptide Y receptor type 2 (Y2R for solution and solid-state NMR experiments. The isotopically labeled receptor is expressed in inclusion bodies and purified using SDS. We studied the details of an improved preparation protocol including the in vitro folding of the receptor, e.g., the native disulfide bridge formation, the exchange of the denaturating detergent SDS, and the functional reconstitution into bicelle environments of varying size. Full pharmacological functionality of the Y2R preparation was shown by a ligand affinity of 4 nM and G-protein activation. Further, simple NMR experiments are used to test sample quality in high micromolar concentration.

  19. The unique fold and lability of the [2Fe-2S] clusters of NEET proteins mediate their key functions in health and disease.

    Science.gov (United States)

    Karmi, Ola; Marjault, Henri-Baptiste; Pesce, Luca; Carloni, Paolo; Onuchic, Jose' N; Jennings, Patricia A; Mittler, Ron; Nechushtai, Rachel

    2018-02-12

    NEET proteins comprise a new class of [2Fe-2S] cluster proteins. In human, three genes encode for NEET proteins: cisd1 encodes mitoNEET (mNT), cisd2 encodes the Nutrient-deprivation autophagy factor-1 (NAF-1) and cisd3 encodes MiNT (Miner2). These recently discovered proteins play key roles in many processes related to normal metabolism and disease. Indeed, NEET proteins are involved in iron, Fe-S, and reactive oxygen homeostasis in cells and play an important role in regulating apoptosis and autophagy. mNT and NAF-1 are homodimeric and reside on the outer mitochondrial membrane. NAF-1 also resides in the membranes of the ER associated mitochondrial membranes (MAM) and the ER. MiNT is a monomer with distinct asymmetry in the molecular surfaces surrounding the clusters. Unlike its paralogs mNT and NAF-1, it resides within the mitochondria. NAF-1 and mNT share similar backbone folds to the plant homodimeric NEET protein (At-NEET), while MiNT's backbone fold resembles a bacterial MiNT protein. Despite the variation of amino acid composition among these proteins, all NEET proteins retained their unique CDGSH domain harboring their unique 3Cys:1His [2Fe-2S] cluster coordination through evolution. The coordinating exposed His was shown to convey the lability to the NEET proteins' [2Fe-2S] clusters. In this minireview, we discuss the NEET fold and its structural elements. Special attention is given to the unique lability of the NEETs' [2Fe-2S] cluster and the implication of the latter to the NEET proteins' cellular and systemic function in health and disease.

  20. Exploring the universe of protein structures beyond the Protein Data Bank.

    Directory of Open Access Journals (Sweden)

    Pilar Cossio

    Full Text Available It is currently believed that the atlas of existing protein structures is faithfully represented in the Protein Data Bank. However, whether this atlas covers the full universe of all possible protein structures is still a highly debated issue. By using a sophisticated numerical approach, we performed an exhaustive exploration of the conformational space of a 60 amino acid polypeptide chain described with an accurate all-atom interaction potential. We generated a database of around 30,000 compact folds with at least of secondary structure corresponding to local minima of the potential energy. This ensemble plausibly represents the universe of protein folds of similar length; indeed, all the known folds are represented in the set with good accuracy. However, we discover that the known folds form a rather small subset, which cannot be reproduced by choosing random structures in the database. Rather, natural and possible folds differ by the contact order, on average significantly smaller in the former. This suggests the presence of an evolutionary bias, possibly related to kinetic accessibility, towards structures with shorter loops between contacting residues. Beside their conceptual relevance, the new structures open a range of practical applications such as the development of accurate structure prediction strategies, the optimization of force fields, and the identification and design of novel folds.

  1. A Library of Plasmodium vivax Recombinant Merozoite Proteins Reveals New Vaccine Candidates and Protein-Protein Interactions

    Science.gov (United States)

    Hostetler, Jessica B.; Sharma, Sumana; Bartholdson, S. Josefin; Wright, Gavin J.; Fairhurst, Rick M.; Rayner, Julian C.

    2015-01-01

    Background A vaccine targeting Plasmodium vivax will be an essential component of any comprehensive malaria elimination program, but major gaps in our understanding of P. vivax biology, including the protein-protein interactions that mediate merozoite invasion of reticulocytes, hinder the search for candidate antigens. Only one ligand-receptor interaction has been identified, that between P. vivax Duffy Binding Protein (PvDBP) and the erythrocyte Duffy Antigen Receptor for Chemokines (DARC), and strain-specific immune responses to PvDBP make it a complex vaccine target. To broaden the repertoire of potential P. vivax merozoite-stage vaccine targets, we exploited a recent breakthrough in expressing full-length ectodomains of Plasmodium proteins in a functionally-active form in mammalian cells and initiated a large-scale study of P. vivax merozoite proteins that are potentially involved in reticulocyte binding and invasion. Methodology/Principal Findings We selected 39 P. vivax proteins that are predicted to localize to the merozoite surface or invasive secretory organelles, some of which show homology to P. falciparum vaccine candidates. Of these, we were able to express 37 full-length protein ectodomains in a mammalian expression system, which has been previously used to express P. falciparum invasion ligands such as PfRH5. To establish whether the expressed proteins were correctly folded, we assessed whether they were recognized by antibodies from Cambodian patients with acute vivax malaria. IgG from these samples showed at least a two-fold change in reactivity over naïve controls in 27 of 34 antigens tested, and the majority showed heat-labile IgG immunoreactivity, suggesting the presence of conformation-sensitive epitopes and native tertiary protein structures. Using a method specifically designed to detect low-affinity, extracellular protein-protein interactions, we confirmed a predicted interaction between P. vivax 6-cysteine proteins P12 and P41, further

  2. A Soluble, Folded Protein without Charged Amino Acid Residues

    DEFF Research Database (Denmark)

    Højgaard, Casper; Kofoed, Christian; Espersen, Roall

    2016-01-01

    side chains can maintain solubility, stability, and function. As a model, we used a cellulose-binding domain from Cellulomonas fimi, which, among proteins of more than 100 amino acids, presently is the least charged in the Protein Data Bank, with a total of only four titratable residues. We find......Charges are considered an integral part of protein structure and function, enhancing solubility and providing specificity in molecular interactions. We wished to investigate whether charged amino acids are indeed required for protein biogenesis and whether a protein completely free of titratable...... that the protein shows a surprising resilience toward extremes of pH, demonstrating stability and function (cellulose binding) in the pH range from 2 to 11. To ask whether the four charged residues present were required for these properties of this protein, we altered them to nontitratable ones. Remarkably...

  3. Casein and soya-bean protein have different effects on whole body protein turnover at the same nitrogen balance

    DEFF Research Database (Denmark)

    Nielsen, K; Kondrup, J; Elsner, Petteri

    1994-01-01

    , or hydrolysed soya-bean protein at a level of 9.1 g/kg BW per d. The diets, which were isoenergetic with the same carbohydrate: fat ratio, were given as a continuous intragastric infusion for at least 4 d. During the last 19 h 15N-glycine (a primed continuous infusion) was given intragastrically and 15N...... synthesis. The protein diets produced a positive N balance which was independent of the protein source. Intact and hydrolysed casein increased protein synthesis 2.6- and 2.0-fold respectively, compared with the protein-free diet. Protein degradation increased 1.4- and 1.2-fold respectively. Hydrolysed soya-bean...... protein did not increase protein synthesis but decreased protein degradation by 35% compared with the protein-free diet. Compared with the hydrolysed soya-bean protein, intact casein resulted in 2.2- and 2.8-fold higher rates of protein synthesis and degradation respectively. These results are not easily...

  4. Conformational disorder in folded and intrinsically disordered proteins from nuclear magnetic resonance

    International Nuclear Information System (INIS)

    Salmon, Loic

    2010-01-01

    Biological macromolecules are, by essence, dynamical systems. While the importance of this flexibility is nowadays well established, the accurate characterization of the conformational disorder of these systems remains an important challenge. Nuclear magnetic resonance spectroscopy is a unique tool to probe these motions at atomic level, through the analysis of spin relaxation or residual dipolar couplings. The latter allows all motions occurring at timescales faster than the millisecond to be investigated, including physiologically important timescales. The information presents in those couplings is interpreted here using mainly analytical approaches in order to quantify the amounts of dynamics present in folded protein, to determine the direction of those motions and to obtain structural information within this conformational disorder. These analytical approaches are complemented by numerical methods, that allowed the observation of phenomena from a different point of view or the investigation of other systems such as intrinsically disordered proteins. All of these studies demonstrate an important complementarity between structural order and conformational disorder. (author)

  5. A thermodynamic definition of protein domains.

    Science.gov (United States)

    Porter, Lauren L; Rose, George D

    2012-06-12

    Protein domains are conspicuous structural units in globular proteins, and their identification has been a topic of intense biochemical interest dating back to the earliest crystal structures. Numerous disparate domain identification algorithms have been proposed, all involving some combination of visual intuition and/or structure-based decomposition. Instead, we present a rigorous, thermodynamically-based approach that redefines domains as cooperative chain segments. In greater detail, most small proteins fold with high cooperativity, meaning that the equilibrium population is dominated by completely folded and completely unfolded molecules, with a negligible subpopulation of partially folded intermediates. Here, we redefine structural domains in thermodynamic terms as cooperative folding units, based on m-values, which measure the cooperativity of a protein or its substructures. In our analysis, a domain is equated to a contiguous segment of the folded protein whose m-value is largely unaffected when that segment is excised from its parent structure. Defined in this way, a domain is a self-contained cooperative unit; i.e., its cooperativity depends primarily upon intrasegment interactions, not intersegment interactions. Implementing this concept computationally, the domains in a large representative set of proteins were identified; all exhibit consistency with experimental findings. Specifically, our domain divisions correspond to the experimentally determined equilibrium folding intermediates in a set of nine proteins. The approach was also proofed against a representative set of 71 additional proteins, again with confirmatory results. Our reframed interpretation of a protein domain transforms an indeterminate structural phenomenon into a quantifiable molecular property grounded in solution thermodynamics.

  6. A Soluble, Folded Protein without Charged Amino Acid Residues

    DEFF Research Database (Denmark)

    Højgaard, Casper; Kofoed, Christian; Espersen, Roall

    2016-01-01

    Charges are considered an integral part of protein structure and function, enhancing solubility and providing specificity in molecular interactions. We wished to investigate whether charged amino acids are indeed required for protein biogenesis and whether a protein completely free of titratable...... side chains can maintain solubility, stability, and function. As a model, we used a cellulose-binding domain from Cellulomonas fimi, which, among proteins of more than 100 amino acids, presently is the least charged in the Protein Data Bank, with a total of only four titratable residues. We find...

  7. Rapid protein fold determination using secondary chemical shifts and cross-hydrogen bond 15N-13C’ scalar couplings (3hbJNC’)

    NARCIS (Netherlands)

    Bonvin, A.M.J.J.; Houben, K.; Guenneugues, M.N.L.; Kaptein, R.; Boelens, R.

    2001-01-01

    The possibility of generating protein folds at the stage of backbone assignment using structural restraints derived from experimentally measured cross-hydrogen bond scalar couplings and secondary chemical shift information is investigated using as a test case the small alpha/beta protein

  8. 'Let the phage do the work': Using the phage P22 coat protein structures as a framework to understand its folding and assembly mutants

    International Nuclear Information System (INIS)

    Teschke, Carolyn M.; Parent, Kristin N.

    2010-01-01

    The amino acid sequence of viral capsid proteins contains information about their folding, structure and self-assembly processes. While some viruses assemble from small preformed oligomers of coat proteins, other viruses such as phage P22 and herpesvirus assemble from monomeric proteins (Fuller and King, 1980). The subunit assembly process is strictly controlled through protein:protein interactions such that icosahedral structures are formed with specific symmetries, rather than aberrant structures. dsDNA viruses commonly assemble by first forming a precursor capsid that serves as a DNA packaging machine. DNA packaging is accompanied by a conformational transition of the small precursor procapsid into a larger capsid for isometric viruses. Here we highlight the pseudo-atomic structures of phage P22 coat protein and rationalize several decades of data about P22 coat protein folding, assembly and maturation generated from a combination of genetics and biochemistry.

  9. Comparative analysis of the folding dynamics and kinetics of an engineered knotted protein and its variants derived from HP0242 of Helicobacter pylori

    Science.gov (United States)

    Wang, Liang-Wei; Liu, Yu-Nan; Lyu, Ping-Chiang; Jackson, Sophie E.; Hsu, Shang-Te Danny

    2015-09-01

    Understanding the mechanism by which a polypeptide chain thread itself spontaneously to attain a knotted conformation has been a major challenge in the field of protein folding. HP0242 is a homodimeric protein from Helicobacter pylori with intertwined helices to form a unique pseudo-knotted folding topology. A tandem HP0242 repeat has been constructed to become the first engineered trefoil-knotted protein. Its small size renders it a model system for computational analyses to examine its folding and knotting pathways. Here we report a multi-parametric study on the folding stability and kinetics of a library of HP0242 variants, including the trefoil-knotted tandem HP0242 repeat, using far-UV circular dichroism and fluorescence spectroscopy. Equilibrium chemical denaturation of HP0242 variants shows the presence of highly populated dimeric and structurally heterogeneous folding intermediates. Such equilibrium folding intermediates retain significant amount of helical structures except those at the N- and C-terminal regions in the native structure. Stopped-flow fluorescence measurements of HP0242 variants show that spontaneous refolding into knotted structures can be achieved within seconds, which is several orders of magnitude faster than previously observed for other knotted proteins. Nevertheless, the complex chevron plots indicate that HP0242 variants are prone to misfold into kinetic traps, leading to severely rolled-over refolding arms. The experimental observations are in general agreement with the previously reported molecular dynamics simulations. Based on our results, kinetic folding pathways are proposed to qualitatively describe the complex folding processes of HP0242 variants.

  10. Engineering and Characterization of a Superfolder Green Fluorescent Protein

    International Nuclear Information System (INIS)

    Pedelacq, J.; Cabantous, S.; Tran, T.; Terwilliger, T.; Waldo, G.

    2006-01-01

    Existing variants of green fluorescent protein (GFP) often misfold when expressed as fusions with other proteins. We have generated a robustly folded version of GFP, called 'superfolder' GFP, that folds well even when fused to poorly folded polypeptides. Compared to 'folding reporter' GFP, a folding-enhanced GFP containing the 'cycle-3' mutations and the 'enhanced GFP' mutations F64L and S65T, superfolder GFP shows improved tolerance of circular permutation, greater resistance to chemical denaturants and improved folding kinetics. The fluorescence of Escherichia coli cells expressing each of eighteen proteins from Pyrobaculum aerophilum as fusions with superfolder GFP was proportional to total protein expression. In contrast, fluorescence of folding reporter GFP fusion proteins was strongly correlated with the productive folding yield of the passenger protein. X-ray crystallographic structural analyses helped explain the enhanced folding of superfolder GFP relative to folding reporter GFP

  11. The porous borders of the protein world.

    Science.gov (United States)

    Cordes, Matthew H J; Stewart, Katie L

    2012-02-08

    Fold switching may play a role in the evolution of new protein folds and functions. He et al., in this issue of Structure, use protein design to illustrate that the same drastic change in a protein fold can occur via multiple different mutational pathways. Copyright © 2012 Elsevier Ltd. All rights reserved.

  12. ModFOLD6: an accurate web server for the global and local quality estimation of 3D protein models.

    Science.gov (United States)

    Maghrabi, Ali H A; McGuffin, Liam J

    2017-07-03

    Methods that reliably estimate the likely similarity between the predicted and native structures of proteins have become essential for driving the acceptance and adoption of three-dimensional protein models by life scientists. ModFOLD6 is the latest version of our leading resource for Estimates of Model Accuracy (EMA), which uses a pioneering hybrid quasi-single model approach. The ModFOLD6 server integrates scores from three pure-single model methods and three quasi-single model methods using a neural network to estimate local quality scores. Additionally, the server provides three options for producing global score estimates, depending on the requirements of the user: (i) ModFOLD6_rank, which is optimized for ranking/selection, (ii) ModFOLD6_cor, which is optimized for correlations of predicted and observed scores and (iii) ModFOLD6 global for balanced performance. The ModFOLD6 methods rank among the top few for EMA, according to independent blind testing by the CASP12 assessors. The ModFOLD6 server is also continuously automatically evaluated as part of the CAMEO project, where significant performance gains have been observed compared to our previous server and other publicly available servers. The ModFOLD6 server is freely available at: http://www.reading.ac.uk/bioinf/ModFOLD/. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Design of an Efficient Turbulent Micro-Mixer for Protein Folding Experiments

    Science.gov (United States)

    Inguva, Venkatesh; Perot, Blair

    2015-11-01

    Protein folding studies require the development of micro-mixers that require less sample, mix at faster rates, and still provide a high signal to noise ratio. Chaotic to marginally turbulent micro-mixers are promising candidates for this application. In this study, various turbulence and unsteadiness generation concepts are explored that avoid cavitation. The mixing enhancements include flow turning regions, flow splitters, and vortex shedding. The relative effectiveness of these different approaches for rapid micro-mixing is discussed. Simulations found that flow turning regions provided the best mixing profile. Experimental validation of the optimal design is verified through laser confocal microscopy experiments. This work is support by the National Science Foundation.

  14. Chaotic Multiquenching Annealing Applied to the Protein Folding Problem

    Directory of Open Access Journals (Sweden)

    Juan Frausto-Solis

    2014-01-01

    Full Text Available The Chaotic Multiquenching Annealing algorithm (CMQA is proposed. CMQA is a new algorithm, which is applied to protein folding problem (PFP. This algorithm is divided into three phases: (i multiquenching phase (MQP, (ii annealing phase (AP, and (iii dynamical equilibrium phase (DEP. MQP enforces several stages of quick quenching processes that include chaotic functions. The chaotic functions can increase the exploration potential of solutions space of PFP. AP phase implements a simulated annealing algorithm (SA with an exponential cooling function. MQP and AP are delimited by different ranges of temperatures; MQP is applied for a range of temperatures which goes from extremely high values to very high values; AP searches for solutions in a range of temperatures from high values to extremely low values. DEP phase finds the equilibrium in a dynamic way by applying least squares method. CMQA is tested with several instances of PFP.

  15. Kinks, loops, and protein folding, with protein A as an example

    International Nuclear Information System (INIS)

    Krokhotin, Andrey; Liwo, Adam; Maisuradze, Gia G.; Scheraga, Harold A.; Niemi, Antti J.

    2014-01-01

    The dynamics and energetics of formation of loops in the 46-residue N-terminal fragment of the B-domain of staphylococcal protein A has been studied. Numerical simulations have been performed using coarse-grained molecular dynamics with the united-residue (UNRES) force field. The results have been analyzed in terms of a kink (heteroclinic standing wave solution) of a generalized discrete nonlinear Schrödinger (DNLS) equation. In the case of proteins, the DNLS equation arises from a C α -trace-based energy function. Three individual kink profiles were identified in the experimental three-α-helix structure of protein A, in the range of the Glu16-Asn29, Leu20-Asn29, and Gln33-Asn44 residues, respectively; these correspond to two loops in the native structure. UNRES simulations were started from the full right-handed α-helix to obtain a clear picture of kink formation, which would otherwise be blurred by helix formation. All three kinks emerged during coarse-grained simulations. It was found that the formation of each is accompanied by a local free energy increase; this is expressed as the change of UNRES energy which has the physical sense of the potential of mean force of a polypeptide chain. The increase is about 7 kcal/mol. This value can thus be considered as the free energy barrier to kink formation in full α-helical segments of polypeptide chains. During the simulations, the kinks emerge, disappear, propagate, and annihilate each other many times. It was found that the formation of a kink is initiated by an abrupt change in the orientation of a pair of consecutive side chains in the loop region. This resembles the formation of a Bloch wall along a spin chain, where the C α backbone corresponds to the chain, and the amino acid side chains are interpreted as the spin variables. This observation suggests that nearest-neighbor side chain–side chain interactions are responsible for initiation of loop formation. It was also found that the individual kinks are

  16. Topologies to geometries in protein folding: Hierarchical and nonhierarchical scenarios

    Science.gov (United States)

    Fernández, Ariel; Colubri, Andrés; Berry, R. Stephen

    2001-04-01

    This work presents a method to portray protein folding dynamics at a coarse resolution, based on a pattern-recognition-and-feedback description of the evolution of torsional motions of the backbone chain in the hydrophobic collapse of the protein. The approach permits theory and computation to treat the search of conformation space from picoseconds to the millisecond time scale or longer, the time scales of adiabatic evolution of soft-mode dynamics. The procedure tracks the backbone torsional coordinates modulo the basins of attraction to which they belong in the Ramachandran maps. The state and history of the backbone are represented in a map of local torsional states and hydrophobicity/hydrophilicity matching of the residues comprising the chain, the local topology matrix (LTM). From this map, we infer allowable structural features by recognizing patterns in the LTM as topologically compatible with particular structural forms within a level of frustration tolerance. Each such 3D realization of an LTM leads to a contact map, from which one can infer one or more structures. Introduction of energetic and entropic terms allow elimination of all but the most favored of these structures at each new juncture. The method's predictive power is first established by comparing "final," stable LTMs for natural sequences of intermediate length (N⩽120) with PDB data. The method is extended further to β-lactoglobulin (β-LG, N=162), the quintessential nonhierarchical folder.

  17. My 65 years in protein chemistry.

    Science.gov (United States)

    Scheraga, Harold A

    2015-05-01

    This is a tour of a physical chemist through 65 years of protein chemistry from the time when emphasis was placed on the determination of the size and shape of the protein molecule as a colloidal particle, with an early breakthrough by James Sumner, followed by Linus Pauling and Fred Sanger, that a protein was a real molecule, albeit a macromolecule. It deals with the recognition of the nature and importance of hydrogen bonds and hydrophobic interactions in determining the structure, properties, and biological function of proteins until the present acquisition of an understanding of the structure, thermodynamics, and folding pathways from a linear array of amino acids to a biological entity. Along the way, with a combination of experiment and theoretical interpretation, a mechanism was elucidated for the thrombin-induced conversion of fibrinogen to a fibrin blood clot and for the oxidative-folding pathways of ribonuclease A. Before the atomic structure of a protein molecule was determined by x-ray diffraction or nuclear magnetic resonance spectroscopy, experimental studies of the fundamental interactions underlying protein structure led to several distance constraints which motivated the theoretical approach to determine protein structure, and culminated in the Empirical Conformational Energy Program for Peptides (ECEPP), an all-atom force field, with which the structures of fibrous collagen-like proteins and the 46-residue globular staphylococcal protein A were determined. To undertake the study of larger globular proteins, a physics-based coarse-grained UNited-RESidue (UNRES) force field was developed, and applied to the protein-folding problem in terms of structure, thermodynamics, dynamics, and folding pathways. Initially, single-chain and, ultimately, multiple-chain proteins were examined, and the methodology was extended to protein-protein interactions and to nucleic acids and to protein-nucleic acid interactions. The ultimate results led to an understanding

  18. Supersymmetric quantum mechanics method for the Fokker-Planck equation with applications to protein folding dynamics

    Science.gov (United States)

    Polotto, Franciele; Drigo Filho, Elso; Chahine, Jorge; Oliveira, Ronaldo Junio de

    2018-03-01

    This work developed analytical methods to explore the kinetics of the time-dependent probability distributions over thermodynamic free energy profiles of protein folding and compared the results with simulation. The Fokker-Planck equation is mapped onto a Schrödinger-type equation due to the well-known solutions of the latter. Through a semi-analytical description, the supersymmetric quantum mechanics formalism is invoked and the time-dependent probability distributions are obtained with numerical calculations by using the variational method. A coarse-grained structure-based model of the two-state protein Tm CSP was simulated at a Cα level of resolution and the thermodynamics and kinetics were fully characterized. Analytical solutions from non-equilibrium conditions were obtained with the simulated double-well free energy potential and kinetic folding times were calculated. It was found that analytical folding time as a function of temperature agrees, quantitatively, with simulations and experiments from the literature of Tm CSP having the well-known 'U' shape of the Chevron Plots. The simple analytical model developed in this study has a potential to be used by theoreticians and experimentalists willing to explore, quantitatively, rates and the kinetic behavior of their system by informing the thermally activated barrier. The theory developed describes a stochastic process and, therefore, can be applied to a variety of biological as well as condensed-phase two-state systems.

  19. Functional structural motifs for protein-ligand, protein-protein, and protein-nucleic acid interactions and their connection to supersecondary structures.

    Science.gov (United States)

    Kinjo, Akira R; Nakamura, Haruki

    2013-01-01

    Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has particularly high propensity to be found interaction interfaces in general, indicating any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.

  20. Evidence for close side-chain packing in an early protein folding intermediate previously assumed to be a molten globule.

    Science.gov (United States)

    Rosen, Laura E; Connell, Katelyn B; Marqusee, Susan

    2014-10-14

    The molten globule, a conformational ensemble with significant secondary structure but only loosely packed tertiary structure, has been suggested to be a ubiquitous intermediate in protein folding. However, it is difficult to assess the tertiary packing of transiently populated species to evaluate this hypothesis. Escherichia coli RNase H is known to populate an intermediate before the rate-limiting barrier to folding that has long been thought to be a molten globule. We investigated this hypothesis by making mimics of the intermediate that are the ground-state conformation at equilibrium, using two approaches: a truncation to generate a fragment mimic of the intermediate, and selective destabilization of the native state using point mutations. Spectroscopic characterization and the response of the mimics to further mutation are consistent with studies on the transient kinetic intermediate, indicating that they model the early intermediate. Both mimics fold cooperatively and exhibit NMR spectra indicative of a closely packed conformation, in contrast to the hypothesis of molten tertiary packing. This result is important for understanding the nature of the subsequent rate-limiting barrier to folding and has implications for the assumption that many other proteins populate molten globule folding intermediates.

  1. Theory and simulation of explicit solvent effects on protein folding in vitro and in vivo

    Science.gov (United States)

    England, Jeremy L.

    The aim of this work is to develop theoretical tools for understanding what happens to water that is confined in amphipathic cavities, and for testing the consequences of this understanding for protein folding in vitro and in vivo. We begin in the first chapter with a brief review of the theoretical and simulation literature on the hydrophobic effect and the aqueous solvation of charged species that also puts forward a simple theoretical framework within which various solvation phenomena reported in past studies may be unified. Subsequently, in the second chapter we also review past computational and theoretical work on the specific question of how chaperonin complexes assist the folding of their substrates. With the context set, we turn in Chapter 3 to the case of an open system with water trapped between hydrophobic plates that experiences a uniform electric field normal to and between the plates. Classic bulk theory of electrostriction in polarizable fluids tells us that the electric field should cause an increase in local water density as it rises, yet some simulations have suggested the opposite. We present a mean-field Potts model we have developed to explain this discrepancy, and show how such a simple, coarse-grained lattice description can capture the fundamental consequences of the fact that external electric fields can frustrate the hydrogen bond network in confined water. Chapter 4 continues to pursue the issue of solvent evacuation between hydrophobic plates, but focuses on the impact of chemical denaturants on hydrophobic effects using molecular dynamics simulations of hydrophobic dewetting. We find that while urea and guanidinium have similar qualitative effects at the bulk level, they seem to differ in the microscopic mechanism by which they denature proteins, although both inhibit the onset of dewetting. Lastly, Chapters 5 and 6 examine the potential importance of solvent-mediated forces to protein folding in vivo. Chapter 5 develops a Landau

  2. The role of atomic level steric effects and attractive forces in protein folding.

    Science.gov (United States)

    Lammert, Heiko; Wolynes, Peter G; Onuchic, José N

    2012-02-01

    Protein folding into tertiary structures is controlled by an interplay of attractive contact interactions and steric effects. We investigate the balance between these contributions using structure-based models using an all-atom representation of the structure combined with a coarse-grained contact potential. Tertiary contact interactions between atoms are collected into a single broad attractive well between the C(β) atoms between each residue pair in a native contact. Through the width of these contact potentials we control their tolerance for deviations from the ideal structure and the spatial range of attractive interactions. In the compact native state dominant packing constraints limit the effects of a coarse-grained contact potential. During folding, however, the broad attractive potentials allow an early collapse that starts before the native local structure is completely adopted. As a consequence the folding transition is broadened and the free energy barrier is decreased. Eventually two-state folding behavior is lost completely for systems with very broad attractive potentials. The stabilization of native-like residue interactions in non-perfect geometries early in the folding process frequently leads to structural traps. Global mirror images are a notable example. These traps are penalized by the details of the repulsive interactions only after further collapse. Successful folding to the native state requires simultaneous guidance from both attractive and repulsive interactions. Copyright © 2011 Wiley Periodicals, Inc.

  3. Modulation of the Extent of Cooperative Structural Change During Protein Folding by Chemical Denaturant.

    Science.gov (United States)

    Jethva, Prashant N; Udgaonkar, Jayant B

    2017-09-07

    Protein folding and unfolding reactions invariably appear to be highly cooperative reactions, but the structural and sequence determinants of cooperativity are poorly understood. Importantly, it is not known whether cooperative structural change occurs throughout the protein, or whether some parts change cooperatively and other parts change noncooperatively. In the current study, hydrogen exchange mass spectrometry has been used to show that the mechanism of unfolding of the PI3K SH3 domain is similar in the absence and presence of 5 M urea. The data are well described by a four state N ↔ I N ↔ I 2 ↔ U model, in which structural changes occur noncooperatively during the N ↔ I N and I N ↔ I 2 transitions, and occur cooperatively during the I 2 ↔ U transition. The nSrc-loop and RT-loop, as well as β strands 4 and 5 undergo noncooperative unfolding, while β strands 1, 2, and 3 unfold cooperatively in the absence of urea. However, in the presence of 5 M urea, the unfolding of β strand 4 switches to become cooperative, leading to an increase in the extent of cooperative structural change. The current study highlights the relationship between protein stability and cooperativity, by showing how the extent of cooperativity can be varied, using chemical denaturant to alter protein stability.

  4. Protein degradation and protection against misfolded or damaged proteins

    Science.gov (United States)

    Goldberg, Alfred L.

    2003-12-01

    The ultimate mechanism that cells use to ensure the quality of intracellular proteins is the selective destruction of misfolded or damaged polypeptides. In eukaryotic cells, the large ATP-dependent proteolytic machine, the 26S proteasome, prevents the accumulation of non-functional, potentially toxic proteins. This process is of particular importance in protecting cells against harsh conditions (for example, heat shock or oxidative stress) and in a variety of diseases (for example, cystic fibrosis and the major neurodegenerative diseases). A full understanding of the pathogenesis of the protein-folding diseases will require greater knowledge of how misfolded proteins are recognized and selectively degraded.

  5. Structure-based barcoding of proteins.

    Science.gov (United States)

    Metri, Rahul; Jerath, Gaurav; Kailas, Govind; Gacche, Nitin; Pal, Adityabarna; Ramakrishnan, Vibin

    2014-01-01

    A reduced representation in the format of a barcode has been developed to provide an overview of the topological nature of a given protein structure from 3D coordinate file. The molecular structure of a protein coordinate file from Protein Data Bank is first expressed in terms of an alpha-numero code and further converted to a barcode image. The barcode representation can be used to compare and contrast different proteins based on their structure. The utility of this method has been exemplified by comparing structural barcodes of proteins that belong to same fold family, and across different folds. In addition to this, we have attempted to provide an illustration to (i) the structural changes often seen in a given protein molecule upon interaction with ligands and (ii) Modifications in overall topology of a given protein during evolution. The program is fully downloadable from the website http://www.iitg.ac.in/probar/. © 2013 The Protein Society.

  6. ProteinSplit: splitting of multi-domain proteins using prediction of ordered and disordered regions in protein sequences for virtual structural genomics

    International Nuclear Information System (INIS)

    Wyrwicz, Lucjan S; Koczyk, Grzegorz; Rychlewski, Leszek; Plewczynski, Dariusz

    2007-01-01

    The annotation of protein folds within newly sequenced genomes is the main target for semi-automated protein structure prediction (virtual structural genomics). A large number of automated methods have been developed recently with very good results in the case of single-domain proteins. Unfortunately, most of these automated methods often fail to properly predict the distant homology between a given multi-domain protein query and structural templates. Therefore a multi-domain protein should be split into domains in order to overcome this limitation. ProteinSplit is designed to identify protein domain boundaries using a novel algorithm that predicts disordered regions in protein sequences. The software utilizes various sequence characteristics to assess the local propensity of a protein to be disordered or ordered in terms of local structure stability. These disordered parts of a protein are likely to create interdomain spacers. Because of its speed and portability, the method was successfully applied to several genome-wide fold annotation experiments. The user can run an automated analysis of sets of proteins or perform semi-automated multiple user projects (saving the results on the server). Additionally the sequences of predicted domains can be sent to the Bioinfo.PL Protein Structure Prediction Meta-Server for further protein three-dimensional structure and function prediction. The program is freely accessible as a web service at http://lucjan.bioinfo.pl/proteinsplit together with detailed benchmark results on the critical assessment of a fully automated structure prediction (CAFASP) set of sequences. The source code of the local version of protein domain boundary prediction is available upon request from the authors

  7. Ricinus communis cyclophilin: functional characterisation of a sieve tube protein involved in protein folding.

    Science.gov (United States)

    Gottschalk, Maren; Dolgener, Elmar; Xoconostle-Cázares, Beatriz; Lucas, William J; Komor, Ewald; Schobert, Christian

    2008-09-01

    The phloem translocation stream of the angiosperms contains a special population of proteins and RNA molecules which appear to be produced in the companion cells prior to being transported into the sieve tube system through the interconnecting plasmodesmata. During this process, these non-cell-autonomous proteins are thought to undergo partial unfolding. Recent mass spectroscopy studies identified peptidyl-prolyl cis-trans isomerase (PPIases) as potential molecular chaperones functioning in the phloem translocation stream (Giavalisco et al. 2006). In the present study, we describe the cloning and characterisation of a castor bean phloem cyclophilin, RcCYP1 that has high peptidyl-prolyl cis-trans isomerase activity. Equivalent enzymatic activity was detected with phloem sap or purified recombinant (His)(6)-tagged RcCYP1. Mass spectrometry analysis of proteolytic peptides, derived from a 22 kDa band in HPLC-fractionated phloem sap, immunolocalisation studies and Western analysis of proteins extracted from castor bean tissues/organs indicated that RcCYP1 is an abundant protein in the companion cell-sieve element complex. Microinjection experiments established that purified recombinant (His)(6)-RcCYP1 can interact with plasmodesmata to both induce an increase in size exclusion limit and mediate its own cell-to-cell trafficking. Collectively, these findings support the hypothesis that RcCYP1 plays a role in the refolding of non-cell-autonomous proteins after their entry into the phloem translocation stream.

  8. Protein folding and non-conventional drug design: a primer for nuclear structure physicists

    International Nuclear Information System (INIS)

    Broglia, R.A.; Tiana, G.; Provasi, D.

    2004-01-01

    Some of the paradigms emerging from the study of the phenomena of phase transitions in finite many-body systems, like e.g. the atomic nucleus can be used at profit to solve the protein folding problem within the framework of simple (although not oversimplified) models. From this solution a paradigm emerges for the design of non-conventional drugs, which inhibit enzymatic action without inducing resistance (mutations). The application of these concepts to the design of an inhibitor to the HIV-protease central in the life cycle of the HIV virus is discussed

  9. The adsorption and unfolding kinetics determines the folding state of proteins at the air-water interface and thereby the equation of state

    NARCIS (Netherlands)

    Wierenga, P.A.; Egmond, M.R.; Voragen, A.G.J.; Jongh, H.H.J.de

    2006-01-01

    Unfolding of proteins has often been mentioned as an important factor during the adsorption process at air-water interfaces and in the increase of surface pressure at later stages of the adsorption process. This work focuses on the question whether the folding state of the adsorbed protein depends

  10. Biophysics of protein evolution and evolutionary protein biophysics

    Science.gov (United States)

    Sikosek, Tobias; Chan, Hue Sun

    2014-01-01

    The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence–structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by ‘hidden’ conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution. PMID:25165599

  11. Basic Tilted Helix Bundle – A new protein fold in human FKBP25/FKBP3 and HectD1

    International Nuclear Information System (INIS)

    Helander, Sara; Montecchio, Meri; Lemak, Alexander; Farès, Christophe; Almlöf, Jonas; Li, Yanjun; Yee, Adelinda; Arrowsmith, Cheryl H.; Dhe-Paganon, Sirano; Sunnerhagen, Maria

    2014-01-01

    Highlights: • We describe the structure of a novel fold in FKBP25 and HectD. • The new fold is named the Basic Tilted Helix Bundle (BTHB) domain. • A conserved basic surface patch is presented, suggesting a functional role. - Abstract: In this paper, we describe the structure of a N-terminal domain motif in nuclear-localized FKBP25 1–73 , a member of the FKBP family, together with the structure of a sequence-related subdomain of the E3 ubiquitin ligase HectD1 that we show belongs to the same fold. This motif adopts a compact 5-helix bundle which we name the Basic Tilted Helix Bundle (BTHB) domain. A positively charged surface patch, structurally centered around the tilted helix H4, is present in both FKBP25 and HectD1 and is conserved in both proteins, suggesting a conserved functional role. We provide detailed comparative analysis of the structures of the two proteins and their sequence similarities, and analysis of the interaction of the proposed FKBP25 binding protein YY1. We suggest that the basic motif in BTHB is involved in the observed DNA binding of FKBP25, and that the function of this domain can be affected by regulatory YY1 binding and/or interactions with adjacent domains

  12. Basic Tilted Helix Bundle – A new protein fold in human FKBP25/FKBP3 and HectD1

    Energy Technology Data Exchange (ETDEWEB)

    Helander, Sara; Montecchio, Meri [Department of Physics, Chemistry and Biology, Division of Chemistry, Linköping University, SE-58183 Linköping (Sweden); Lemak, Alexander [Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7 (Canada); Northeast Structural Genomics Consortium, Toronto, Ontario (Canada); Farès, Christophe [Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7 (Canada); Almlöf, Jonas [Department of Physics, Chemistry and Biology, Division of Chemistry, Linköping University, SE-58183 Linköping (Sweden); Li, Yanjun [Structural Genomics Consortium, University of Toronto, 101 College St, Toronto, Ontario M5G 1L7 (Canada); Yee, Adelinda [Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7 (Canada); Northeast Structural Genomics Consortium, Toronto, Ontario (Canada); Arrowsmith, Cheryl H. [Princess Margaret Cancer Centre and Department of Medical Biophysics, University of Toronto, Toronto, Ontario M5G 1L7 (Canada); Northeast Structural Genomics Consortium, Toronto, Ontario (Canada); Structural Genomics Consortium, University of Toronto, 101 College St, Toronto, Ontario M5G 1L7 (Canada); Dhe-Paganon, Sirano [Structural Genomics Consortium, University of Toronto, 101 College St, Toronto, Ontario M5G 1L7 (Canada); Sunnerhagen, Maria, E-mail: maria.sunnerhagen@liu.se [Department of Physics, Chemistry and Biology, Division of Chemistry, Linköping University, SE-58183 Linköping (Sweden)

    2014-04-25

    Highlights: • We describe the structure of a novel fold in FKBP25 and HectD. • The new fold is named the Basic Tilted Helix Bundle (BTHB) domain. • A conserved basic surface patch is presented, suggesting a functional role. - Abstract: In this paper, we describe the structure of a N-terminal domain motif in nuclear-localized FKBP25{sub 1–73}, a member of the FKBP family, together with the structure of a sequence-related subdomain of the E3 ubiquitin ligase HectD1 that we show belongs to the same fold. This motif adopts a compact 5-helix bundle which we name the Basic Tilted Helix Bundle (BTHB) domain. A positively charged surface patch, structurally centered around the tilted helix H4, is present in both FKBP25 and HectD1 and is conserved in both proteins, suggesting a conserved functional role. We provide detailed comparative analysis of the structures of the two proteins and their sequence similarities, and analysis of the interaction of the proposed FKBP25 binding protein YY1. We suggest that the basic motif in BTHB is involved in the observed DNA binding of FKBP25, and that the function of this domain can be affected by regulatory YY1 binding and/or interactions with adjacent domains.

  13. NAPS: Network Analysis of Protein Structures

    Science.gov (United States)

    Chakrabarty, Broto; Parekh, Nita

    2016-01-01

    Traditionally, protein structures have been analysed by the secondary structure architecture and fold arrangement. An alternative approach that has shown promise is modelling proteins as a network of non-covalent interactions between amino acid residues. The network representation of proteins provide a systems approach to topological analysis of complex three-dimensional structures irrespective of secondary structure and fold type and provide insights into structure-function relationship. We have developed a web server for network based analysis of protein structures, NAPS, that facilitates quantitative and qualitative (visual) analysis of residue–residue interactions in: single chains, protein complex, modelled protein structures and trajectories (e.g. from molecular dynamics simulations). The user can specify atom type for network construction, distance range (in Å) and minimal amino acid separation along the sequence. NAPS provides users selection of node(s) and its neighbourhood based on centrality measures, physicochemical properties of amino acids or cluster of well-connected residues (k-cliques) for further analysis. Visual analysis of interacting domains and protein chains, and shortest path lengths between pair of residues are additional features that aid in functional analysis. NAPS support various analyses and visualization views for identifying functional residues, provide insight into mechanisms of protein folding, domain-domain and protein–protein interactions for understanding communication within and between proteins. URL:http://bioinf.iiit.ac.in/NAPS/. PMID:27151201

  14. Prediction of heterodimeric protein complexes from weighted protein-protein interaction networks using novel features and kernel functions.

    Directory of Open Access Journals (Sweden)

    Peiying Ruan

    Full Text Available Since many proteins express their functional activity by interacting with other proteins and forming protein complexes, it is very useful to identify sets of proteins that form complexes. For that purpose, many prediction methods for protein complexes from protein-protein interactions have been developed such as MCL, MCODE, RNSC, PCP, RRW, and NWE. These methods have dealt with only complexes with size of more than three because the methods often are based on some density of subgraphs. However, heterodimeric protein complexes that consist of two distinct proteins occupy a large part according to several comprehensive databases of known complexes. In this paper, we propose several feature space mappings from protein-protein interaction data, in which each interaction is weighted based on reliability. Furthermore, we make use of prior knowledge on protein domains to develop feature space mappings, domain composition kernel and its combination kernel with our proposed features. We perform ten-fold cross-validation computational experiments. These results suggest that our proposed kernel considerably outperforms the naive Bayes-based method, which is the best existing method for predicting heterodimeric protein complexes.

  15. Extracting rate coefficients from single-molecule photon trajectories and FRET efficiency histograms for a fast-folding protein.

    Science.gov (United States)

    Chung, Hoi Sung; Gopich, Irina V; McHale, Kevin; Cellmer, Troy; Louis, John M; Eaton, William A

    2011-04-28

    Recently developed statistical methods by Gopich and Szabo were used to extract folding and unfolding rate coefficients from single-molecule Förster resonance energy transfer (FRET) data for proteins with kinetics too fast to measure waiting time distributions. Two types of experiments and two different analyses were performed. In one experiment bursts of photons were collected from donor and acceptor fluorophores attached to a 73-residue protein, α(3)D, freely diffusing through the illuminated volume of a confocal microscope system. In the second, the protein was immobilized by linkage to a surface, and photons were collected until one of the fluorophores bleached. Folding and unfolding rate coefficients and mean FRET efficiencies for the folded and unfolded subpopulations were obtained from a photon by photon analysis of the trajectories using a maximum likelihood method. The ability of the method to describe the data in terms of a two-state model was checked by recoloring the photon trajectories with the extracted parameters and comparing the calculated FRET efficiency histograms with the measured histograms. The sum of the rate coefficients for the two-state model agreed to within 30% with the relaxation rate obtained from the decay of the donor-acceptor cross-correlation function, confirming the high accuracy of the method. Interestingly, apparently reliable rate coefficients could be extracted using the maximum likelihood method, even at low (rate coefficients and mean FRET efficiencies were also obtained in an approximate procedure by simply fitting the FRET efficiency histograms, calculated by binning the donor and acceptor photons, with a sum of three-Gaussian functions. The kinetics are exposed in these histograms by the growth of a FRET efficiency peak at values intermediate between the folded and unfolded peaks as the bin size increases, a phenomenon with similarities to NMR exchange broadening. When comparable populations of folded and unfolded

  16. Quantifying why urea is a protein denaturant, whereas glycine betaine is a protein stabilizer

    Science.gov (United States)

    Guinn, Emily J.; Pegram, Laurel M.; Capp, Michael W.; Pollock, Michelle N.; Record, M. Thomas

    2011-01-01

    To explain the large, opposite effects of urea and glycine betaine (GB) on stability of folded proteins and protein complexes, we quantify and interpret preferential interactions of urea with 45 model compounds displaying protein functional groups and compare with a previous analysis of GB. This information is needed to use urea as a probe of coupled folding in protein processes and to tune molecular dynamics force fields. Preferential interactions between urea and model compounds relative to their interactions with water are determined by osmometry or solubility and dissected using a unique coarse-grained analysis to obtain interaction potentials quantifying the interaction of urea with each significant type of protein surface (aliphatic, aromatic hydrocarbon (C); polar and charged N and O). Microscopic local-bulk partition coefficients Kp for the accumulation or exclusion of urea in the water of hydration of these surfaces relative to bulk water are obtained. Kp values reveal that urea accumulates moderately at amide O and weakly at aliphatic C, whereas GB is excluded from both. These results provide both thermodynamic and molecular explanations for the opposite effects of urea and glycine betaine on protein stability, as well as deductions about strengths of amide NH—amide O and amide NH—amide N hydrogen bonds relative to hydrogen bonds to water. Interestingly, urea, like GB, is moderately accumulated at aromatic C surface. Urea m-values for protein folding and other protein processes are quantitatively interpreted and predicted using these urea interaction potentials or Kp values. PMID:21930943

  17. The functional significance of the autolysis loop in protein C and activated protein C.

    Science.gov (United States)

    Yang, Likui; Manithody, Chandrashekhara; Rezaie, Alireza R

    2005-07-01

    The autolysis loop of activated protein C (APC) is five residues longer than the autolysis loop of other vitamin K-dependent coagulation proteases. To investigate the role of this loop in the zymogenic and anticoagulant properties of the molecule, a protein C mutant was constructed in which the autolysis loop of the protein was replaced with the corresponding loop of factor X. The protein C mutant was activated by thrombin with approximately 5-fold higher rate in the presence of Ca2+. Both kinetics and direct binding studies revealed that the Ca2+ affinity of the mutant has been impaired approximately 3-fold. The result of a factor Va degradation assay revealed that the anticoagulant function of the mutant has been improved 4-5-fold in the absence but not in the presence of protein S. The improvement was due to a better recognition of both the P1-Arg506 and P1-Arg306 cleavage sites by the mutant protease. However, the plasma half-life of the mutant was markedly shortened due to faster inactivation by plasma serpins. These results suggest that the autolysis loop of protein C is critical for the Ca(2+)-dependence of activation by thrombin. Moreover, a longer autolysis loop in APC is not optimal for interaction with factor Va in the absence of protein S, but it contributes to the lack of serpin reactivity and longer half-life of the protease in plasma.

  18. Loss of conformational entropy in protein folding calculated using realistic ensembles and its implications for NMR-based calculations

    Science.gov (United States)

    Baxa, Michael C.; Haddadian, Esmael J.; Jumper, John M.; Freed, Karl F.; Sosnick, Tobin R.

    2014-01-01

    The loss of conformational entropy is a major contribution in the thermodynamics of protein folding. However, accurate determination of the quantity has proven challenging. We calculate this loss using molecular dynamic simulations of both the native protein and a realistic denatured state ensemble. For ubiquitin, the total change in entropy is TΔSTotal = 1.4 kcal⋅mol−1 per residue at 300 K with only 20% from the loss of side-chain entropy. Our analysis exhibits mixed agreement with prior studies because of the use of more accurate ensembles and contributions from correlated motions. Buried side chains lose only a factor of 1.4 in the number of conformations available per rotamer upon folding (ΩU/ΩN). The entropy loss for helical and sheet residues differs due to the smaller motions of helical residues (TΔShelix−sheet = 0.5 kcal⋅mol−1), a property not fully reflected in the amide N-H and carbonyl C=O bond NMR order parameters. The results have implications for the thermodynamics of folding and binding, including estimates of solvent ordering and microscopic entropies obtained from NMR. PMID:25313044

  19. High-Pressure NMR and SAXS Reveals How Capping Modulates Folding Cooperativity of the pp32 Leucine-rich Repeat Protein.

    Science.gov (United States)

    Zhang, Yi; Berghaus, Melanie; Klein, Sean; Jenkins, Kelly; Zhang, Siwen; McCallum, Scott A; Morgan, Joel E; Winter, Roland; Barrick, Doug; Royer, Catherine A

    2018-04-27

    Many repeat proteins contain capping motifs, which serve to shield the hydrophobic core from solvent and maintain structural integrity. While the role of capping motifs in enhancing the stability and structural integrity of repeat proteins is well documented, their contribution to folding cooperativity is not. Here we examined the role of capping motifs in defining the folding cooperativity of the leucine-rich repeat protein, pp32, by monitoring the pressure- and urea-induced unfolding of an N-terminal capping motif (N-cap) deletion mutant, pp32-∆N-cap, and a C-terminal capping motif destabilization mutant pp32-Y131F/D146L, using residue-specific NMR and small-angle X-ray scattering. Destabilization of the C-terminal capping motif resulted in higher cooperativity for the unfolding transition compared to wild-type pp32, as these mutations render the stability of the C-terminus similar to that of the rest of the protein. In contrast, deletion of the N-cap led to strong deviation from two-state unfolding. In both urea- and pressure-induced unfolding, residues in repeats 1-3 of pp32-ΔN-cap lost their native structure first, while the C-terminal half was more stable. The residue-specific free energy changes in all regions of pp32-ΔN-cap were larger in urea compared to high pressure, indicating a less cooperative destabilization by pressure. Moreover, in contrast to complete structural disruption of pp32-ΔN-cap at high urea concentration, its pressure unfolded state remained compact. The contrasting effects of the capping motifs on folding cooperativity arise from the differential local stabilities of pp32, whereas the contrasting effects of pressure and urea on the pp32-ΔN-cap variant arise from their distinct mechanisms of action. Copyright © 2018 Elsevier Ltd. All rights reserved.

  20. Protein Polymers and Amyloids

    DEFF Research Database (Denmark)

    Risør, Michael Wulff

    2014-01-01

    Several human disorders are caused by a common general disease mechanism arising from abnormal folding and aggregation of the underlying protein. These include the prevalent dementias like Alzheimer’s and Parkinson’s, where accumulation of protein fibrillar structures, known as amyloid fibrils......, is a general hallmark. They also include the α1-antitrypsin deficiency, where disease-causing mutations in the serine protease inhibitor, α1-antitrypsin (α1AT), leads to accumulation of the aberrant protein in the liver of these patients. The native metastable structure of α1AT constitutes a molecular trap...... that inhibits its target protease through a large conformational change but mutations compromise this function and cause premature structural collapse into hyperstable polymers. Understanding the conformational disorders at a molecular level is not only important for our general knowledge on protein folding...

  1. Evolutionary diversification of protein-protein interactions by interface add-ons.

    Science.gov (United States)

    Plach, Maximilian G; Semmelmann, Florian; Busch, Florian; Busch, Markus; Heizinger, Leonhard; Wysocki, Vicki H; Merkl, Rainer; Sterner, Reinhard

    2017-10-03

    Cells contain a multitude of protein complexes whose subunits interact with high specificity. However, the number of different protein folds and interface geometries found in nature is limited. This raises the question of how protein-protein interaction specificity is achieved on the structural level and how the formation of nonphysiological complexes is avoided. Here, we describe structural elements called interface add-ons that fulfill this function and elucidate their role for the diversification of protein-protein interactions during evolution. We identified interface add-ons in 10% of a representative set of bacterial, heteromeric protein complexes. The importance of interface add-ons for protein-protein interaction specificity is demonstrated by an exemplary experimental characterization of over 30 cognate and hybrid glutamine amidotransferase complexes in combination with comprehensive genetic profiling and protein design. Moreover, growth experiments showed that the lack of interface add-ons can lead to physiologically harmful cross-talk between essential biosynthetic pathways. In sum, our complementary in silico, in vitro, and in vivo analysis argues that interface add-ons are a practical and widespread evolutionary strategy to prevent the formation of nonphysiological complexes by specializing protein-protein interactions.

  2. Mapping monomeric threading to protein-protein structure prediction.

    Science.gov (United States)

    Guerler, Aysam; Govindarajoo, Brandon; Zhang, Yang

    2013-03-25

    The key step of template-based protein-protein structure prediction is the recognition of complexes from experimental structure libraries that have similar quaternary fold. Maintaining two monomer and dimer structure libraries is however laborious, and inappropriate library construction can degrade template recognition coverage. We propose a novel strategy SPRING to identify complexes by mapping monomeric threading alignments to protein-protein interactions based on the original oligomer entries in the PDB, which does not rely on library construction and increases the efficiency and quality of complex template recognitions. SPRING is tested on 1838 nonhomologous protein complexes which can recognize correct quaternary template structures with a TM score >0.5 in 1115 cases after excluding homologous proteins. The average TM score of the first model is 60% and 17% higher than that by HHsearch and COTH, respectively, while the number of targets with an interface RMSD benchmark proteins. Although the relative performance of SPRING and ZDOCK depends on the level of homology filters, a combination of the two methods can result in a significantly higher model quality than ZDOCK at all homology thresholds. These data demonstrate a new efficient approach to quaternary structure recognition that is ready to use for genome-scale modeling of protein-protein interactions due to the high speed and accuracy.

  3. Protein disulfide isomerase-like protein 1-1 controls endosperm development through regulation of the amount and composition of seed proteins in rice.

    Directory of Open Access Journals (Sweden)

    Yeon Jeong Kim

    Full Text Available Protein disulfide isomerase (PDI is a chaperone protein involved in oxidative protein folding by acting as a catalyst and assisting folding in the endoplasmic reticulum (ER. A genome database search showed that rice contains 19 PDI-like genes. However, their functions are not clearly identified. This paper shows possible functions of rice PDI-like protein 1-1 (PDIL1-1 during seed development. Seeds of the T-DNA insertion PDIL1-1 mutant, PDIL1-1Δ, identified by genomic DNA PCR and western blot analysis, display a chalky phenotype and a thick aleurone layer. Protein content per seed was significantly lower and free sugar content higher in PDIL1-1Δ mutant seeds than in the wild type. Proteomic analysis of PDIL1-1Δ mutant seeds showed that PDIL1-1 is post-translationally regulated, and its loss causes accumulation of many types of seed proteins including glucose/starch metabolism- and ROS (reactive oxygen species scavenging-related proteins. In addition, PDIL1-1 strongly interacts with the cysteine protease OsCP1. Our data indicate that the opaque phenotype of PDIL1-1Δ mutant seeds results from production of irregular starch granules and protein body through loss of regulatory activity for various proteins involved in the synthesis of seed components.

  4. Structural anatomy of telomere OB proteins.

    Science.gov (United States)

    Horvath, Martin P

    2011-10-01

    Telomere DNA-binding proteins protect the ends of chromosomes in eukaryotes. A subset of these proteins are constructed with one or more OB folds and bind with G+T-rich single-stranded DNA found at the extreme termini. The resulting DNA-OB protein complex interacts with other telomere components to coordinate critical telomere functions of DNA protection and DNA synthesis. While the first crystal and NMR structures readily explained protection of telomere ends, the picture of how single-stranded DNA becomes available to serve as primer and template for synthesis of new telomere DNA is only recently coming into focus. New structures of telomere OB fold proteins alongside insights from genetic and biochemical experiments have made significant contributions towards understanding how protein-binding OB proteins collaborate with DNA-binding OB proteins to recruit telomerase and DNA polymerase for telomere homeostasis. This review surveys telomere OB protein structures alongside highly comparable structures derived from replication protein A (RPA) components, with the goal of providing a molecular context for understanding telomere OB protein evolution and mechanism of action in protection and synthesis of telomere DNA.

  5. Hydrophobic patches on protein surfaces

    NARCIS (Netherlands)

    Lijnzaad, P.

    2007-01-01

    Hydrophobicity is a prime determinant of the structure and function of proteins. It is the driving force behind the folding of soluble proteins, and when exposed on the surface, it is frequently involved in recognition and binding of ligands and other proteins. The energetic cost of

  6. The mitochondrial translocator protein, TSPO, inhibits HIV-1 envelope glycoprotein biosynthesis via the endoplasmic reticulum-associated protein degradation pathway.

    Science.gov (United States)

    Zhou, Tao; Dang, Ying; Zheng, Yong-Hui

    2014-03-01

    The HIV-1 Env glycoprotein is folded in the endoplasmic reticulum (ER), which is necessary for viral entry and replication. Currently, it is still unclear how this process is regulated. The glycoprotein folding in the ER is controlled by the ER-associated protein degradation (ERAD) pathway, which specifically targets misfolded proteins for degradation. Previously, we reported that HIV-1 replication is restricted in the human CD4(+) T cell line CEM.NKR (NKR). To understand this mechanism, we first analyzed cellular protein expression in NKR cells and discovered that levels of the mitochondrial translocator protein TSPO were upregulated by ∼64-fold. Notably, when NKR cells were treated with TSPO antagonist PK-11195, Ro5-4864, or diazepam, HIV restriction was completely disrupted, and TSPO knockdown by short hairpin RNAs (shRNAs) achieved a similar effect. We next analyzed viral protein expression, and, interestingly, we discovered that Env expression was specifically inhibited. Both TSPO knockdown and treatment with TSPO antagonist could restore Env expression in NKR cells. We further discovered that Env proteins were rapidly degraded and that kifunensine, an ERAD pathway inhibitor, could restore Env expression and viral replication, indicating that Env proteins were misfolded and degraded through the ERAD pathway in NKR cells. We also knocked out the TSPO gene in 293T cells using CRISPR/Cas9 (clustered, regularly interspaced, short palindromic repeat [CRISPR]/CRISPR-associated-9) technology and found that TSPO could similarly inhibit Env expression in these cells. Taken together, these results demonstrate that TSPO inhibits Env protein expression through the ERAD pathway and suggest that mitochondria play an important role in regulating the Env folding process. The HIV-1 Env glycoprotein is absolutely required for viral infection, and an understanding of its expression pathway in infected cells will identify new targets for antiretroviral therapies. Env proteins

  7. Mis-translation of a Computationally Designed Protein Yields an Exceptionally Stable Homodimer: Implications for Protein Engineering and Evolution.

    Energy Technology Data Exchange (ETDEWEB)

    Dantas, Gautam; Watters, Alexander L.; Lunde, Bradley; Eletr, Ziad; Isern, Nancy G.; Roseman, Toby; Lipfert, Jan; Doniach, Sebastian; Tompa, Martin; Kuhlman, Brian; Stoddard, Barry L.; Varani, Gabriele; Baker, David

    2006-10-06

    We recently used computational protein design to create an extremely stable, globular protein, Top7, with a sequence and fold not observed previously in nature. Since Top7 was created in the absence of genetic selection, it provides a rare opportunity to investigate aspects of the cellular protein production and surveillance machinery that are subject to natural selection. Here we show that a portion of the Top7 protein corresponding to the final 49 C-terminal residues is efficiently mistranslated and accumulates at high levels in E. coli. We used circular dichroism spectroscopy, size-exclusion chromatography, small-angle x-ray scattering, analytical ultra-centrifugation, and NMR spectroscopy to show that the resulting CFr protein adopts a compact, extremely-stable, obligate, symmetric, homo-dimeric structure. Based on the solution structure, we engineered an even more stable variant of CFr by disulfide-induced covalent circularisation that should be an excellent platform for design of novel functions. The accumulation of high levels of CFr exposes the high error rate of the protein translation machinery, and the rarity of correspondingly stable fragments in natural proteins implies a stringent evolutionary pressure against protein sub-fragments that can independently fold into stable structures. The symmetric self-association between two identical mistranslated CFr sub-units to generate an extremely stable structure parallels a mechanism for natural protein-fold evolution by modular recombination of stable protein sub-structures.

  8. Mechanisms of protein misfolding in conformational lung diseases.

    LENUS (Irish Health Repository)

    McElvaney, N G

    2012-08-01

    Genetic or environmentally-induced alterations in protein structure interfere with the correct folding, assembly and trafficking of proteins. In the lung the expression of misfolded proteins can induce a variety of pathogenetic effects. Cystic fibrosis (CF) and alpha-1 antitrypsin (AAT) deficiency are two major clinically relevant pulmonary disorders associated with protein misfolding. Both are genetic diseases the primary causes of which are expression of mutant alleles of the cystic fibrosis transmembrane conductance regulator (CFTR) and SERPINA1, respectively. The most common and best studied mutant forms of CFTR and AAT are ΔF508 CFTR and the Glu342Lys mutant of AAT called ZAAT, respectively. Non-genetic mechanisms can also damage protein structure and induce protein misfolding in the lung. Cigarette-smoke contains oxidants and other factors that can modify a protein\\'s structure, and is one of the most significant environmental causes of protein damage within the lung. Herein we describe the mechanisms controlling the folding of wild type and mutant versions of CFTR and AAT proteins, and explore the consequences of cigarette-smoke-induced effects on the protein folding machinery in the lung.

  9. Structural and biochemical characterization of the cell fate determining nucleotidyltransferase fold protein MAB21L1.

    Science.gov (United States)

    de Oliveira Mann, Carina C; Kiefersauer, Reiner; Witte, Gregor; Hopfner, Karl-Peter

    2016-06-08

    The exceptionally conserved metazoan MAB21 proteins are implicated in cell fate decisions and share considerable sequence homology with the cyclic GMP-AMP synthase. cGAS is the major innate immune sensor for cytosolic DNA and produces the second messenger 2'-5', 3'-5' cyclic GMP-AMP. Little is known about the structure and biochemical function of other proteins of the cGAS-MAB21 subfamily, such as MAB21L1, MAB21L2 and MAB21L3. We have determined the crystal structure of human full-length MAB21L1. Our analysis reveals high structural conservation between MAB21L1 and cGAS but also uncovers important differences. Although monomeric in solution, MAB21L1 forms a highly symmetric double-pentameric oligomer in the crystal, raising the possibility that oligomerization could be a feature of MAB21L1. In the crystal, MAB21L1 is in an inactive conformation requiring a conformational change - similar to cGAS - to develop any nucleotidyltransferase activity. Co-crystallization with NTP identified a putative ligand binding site of MAB21 proteins that corresponds to the DNA binding site of cGAS. Finally, we offer a structure-based explanation for the effects of MAB21L2 mutations in patients with eye malformations. The underlying residues participate in fold-stabilizing interaction networks and mutations destabilize the protein. In summary, we provide a first structural framework for MAB21 proteins.

  10. Structure of the N-terminal domain of the protein Expansion: an ‘Expansion’ to the Smad MH2 fold

    International Nuclear Information System (INIS)

    Beich-Frandsen, Mads; Aragón, Eric; Llimargas, Marta; Benach, Jordi; Riera, Antoni; Pous, Joan; Macias, Maria J.

    2015-01-01

    Expansion is a modular protein that is conserved in protostomes. The first structure of the N-terminal domain of Expansion has been determined at 1.6 Å resolution and the new Nα-MH2 domain was found to belong to the Smad/FHA superfamily of structures. Gene-expression changes observed in Drosophila embryos after inducing the transcription factor Tramtrack led to the identification of the protein Expansion. Expansion contains an N-terminal domain similar in sequence to the MH2 domain characteristic of Smad proteins, which are the central mediators of the effects of the TGF-β signalling pathway. Apart from Smads and Expansion, no other type of protein belonging to the known kingdoms of life contains MH2 domains. To compare the Expansion and Smad MH2 domains, the crystal structure of the Expansion domain was determined at 1.6 Å resolution, the first structure of a non-Smad MH2 domain to be characterized to date. The structure displays the main features of the canonical MH2 fold with two main differences: the addition of an α-helical region and the remodelling of a protein-interaction site that is conserved in the MH2 domain of Smads. Owing to these differences, to the new domain was referred to as Nα-MH2. Despite the presence of the Nα-MH2 domain, Expansion does not participate in TGF-β signalling; instead, it is required for other activities specific to the protostome phyla. Based on the structural similarities to the MH2 fold, it is proposed that the Nα-MH2 domain should be classified as a new member of the Smad/FHA superfamily

  11. Structure of the N-terminal domain of the protein Expansion: an ‘Expansion’ to the Smad MH2 fold

    Energy Technology Data Exchange (ETDEWEB)

    Beich-Frandsen, Mads; Aragón, Eric [Institute for Research in Biomedicine (IRB Barcelona), Baldiri Reixac 10, 08028 Barcelona (Spain); Llimargas, Marta [Institut de Biologia Molecular de Barcelona, IBMB–CSIC, Baldiri Reixac 10, 08028 Barcelona (Spain); Benach, Jordi [ALBA Synchrotron, BP 1413, km 3.3, Cerdanyola del Vallès (Spain); Riera, Antoni [Institute for Research in Biomedicine (IRB Barcelona), Baldiri Reixac 10, 08028 Barcelona (Spain); Universitat de Barcelona, Martí i Franqués 1-11, 08028 Barcelona (Spain); Pous, Joan [Institute for Research in Biomedicine (IRB Barcelona), Baldiri Reixac 10, 08028 Barcelona (Spain); Platform of Crystallography IBMB–CSIC, Baldiri Reixac 10, 08028 Barcelona (Spain); Macias, Maria J., E-mail: maria.macias@irbbarcelona.org [Institute for Research in Biomedicine (IRB Barcelona), Baldiri Reixac 10, 08028 Barcelona (Spain); Catalan Institution for Research and Advanced Studies (ICREA), Passeig Lluís Companys 23, 08010 Barcelona (Spain)

    2015-04-01

    Expansion is a modular protein that is conserved in protostomes. The first structure of the N-terminal domain of Expansion has been determined at 1.6 Å resolution and the new Nα-MH2 domain was found to belong to the Smad/FHA superfamily of structures. Gene-expression changes observed in Drosophila embryos after inducing the transcription factor Tramtrack led to the identification of the protein Expansion. Expansion contains an N-terminal domain similar in sequence to the MH2 domain characteristic of Smad proteins, which are the central mediators of the effects of the TGF-β signalling pathway. Apart from Smads and Expansion, no other type of protein belonging to the known kingdoms of life contains MH2 domains. To compare the Expansion and Smad MH2 domains, the crystal structure of the Expansion domain was determined at 1.6 Å resolution, the first structure of a non-Smad MH2 domain to be characterized to date. The structure displays the main features of the canonical MH2 fold with two main differences: the addition of an α-helical region and the remodelling of a protein-interaction site that is conserved in the MH2 domain of Smads. Owing to these differences, to the new domain was referred to as Nα-MH2. Despite the presence of the Nα-MH2 domain, Expansion does not participate in TGF-β signalling; instead, it is required for other activities specific to the protostome phyla. Based on the structural similarities to the MH2 fold, it is proposed that the Nα-MH2 domain should be classified as a new member of the Smad/FHA superfamily.

  12. Annotating the protein-RNA interaction sites in proteins using evolutionary information and protein backbone structure.

    Science.gov (United States)

    Li, Tao; Li, Qian-Zhong

    2012-11-07

    RNA-protein interactions play important roles in various biological processes. The precise detection of RNA-protein interaction sites is very important for understanding essential biological processes and annotating the function of the proteins. In this study, based on various features from amino acid sequence and structure, including evolutionary information, solvent accessible surface area and torsion angles (φ, ψ) in the backbone structure of the polypeptide chain, a computational method for predicting RNA-binding sites in proteins is proposed. When the method is applied to predict RNA-binding sites in three datasets: RBP86 containing 86 protein chains, RBP107 containing 107 proteins chains and RBP109 containing 109 proteins chains, better sensitivities and specificities are obtained compared to previously published methods in five-fold cross-validation tests. In order to make further examination for the efficiency of our method, the RBP107 dataset is used as training set, RBP86 and RBP109 datasets are used as the independent test sets. In addition, as examples of our prediction, RNA-binding sites in a few proteins are presented. The annotated results are consistent with the PDB annotation. These results show that our method is useful for annotating RNA binding sites of novel proteins.

  13. Heavy metal ions are potent inhibitors of protein folding

    International Nuclear Information System (INIS)

    Sharma, Sandeep K.; Goloubinoff, Pierre; Christen, Philipp

    2008-01-01

    Environmental and occupational exposure to heavy metals such as cadmium, mercury and lead results in severe health hazards including prenatal and developmental defects. The deleterious effects of heavy metal ions have hitherto been attributed to their interactions with specific, particularly susceptible native proteins. Here, we report an as yet undescribed mode of heavy metal toxicity. Cd 2+ , Hg 2+ and Pb 2+ proved to inhibit very efficiently the spontaneous refolding of chemically denatured proteins by forming high-affinity multidentate complexes with thiol and other functional groups (IC 50 in the nanomolar range). With similar efficacy, the heavy metal ions inhibited the chaperone-assisted refolding of chemically denatured and heat-denatured proteins. Thus, the toxic effects of heavy metal ions may result as well from their interaction with the more readily accessible functional groups of proteins in nascent and other non-native form. The toxic scope of heavy metals seems to be substantially larger than assumed so far

  14. Protein misfolding disorders: pathogenesis and intervention

    DEFF Research Database (Denmark)

    Gregersen, Niels

    2006-01-01

    of the functional structure of cellular proteins. Aberrant proteins, the result of production errors, inherited or acquired amino acid substitutions or damage, especially oxidative modifications, can in many cases not fold correctly and will be trapped in misfolded conformations. To rid the cell of misfolded...... be accompanied by a gain-of-function pathogenesis, which in many cases determines the pathological and clinical features. Examples are Parkinson and Huntington diseases. Although a number of strategies have been tried to decrease the amounts of accumulated and aggregated proteins, a likely future strategy seems......Newly synthesized proteins in the living cell must go through a folding process to attain their functional structure. To achieve this in an efficient fashion, all organisms, including humans, have evolved a large set of molecular chaperones that assist the folding as well as the maintenance...

  15. Mitochondrial associated ubiquitin fold modifier-1 mediated protein conjugation in Leishmania donovani.

    Directory of Open Access Journals (Sweden)

    Sreenivas Gannavaram

    2011-01-01

    Full Text Available In this report, we demonstrate the existence of the ubiquitin fold modifier-1 (Ufm1 and its conjugation pathway in trypanosomatid parasite Leishmania donovani. LdUfm1 is activated by E1-like enzyme LdUba5. LdUfc1 (E2 specifically interacted with LdUfm1 and LdUba5 to conjugate LdUfm1 to proteinaceous targets. Mass spectrometry analysis revealed that LdUfm1 is conjugated to Leishmania protein targets that are associated with mitochondria. Immunofluorescence experiments showed that Leishmania Ufm1, Uba5 and Ufc1 are associated with the mitochondria. The demonstration that all the components of this system as well as the substrates are associated with mitochondrion suggests it may have physiological roles not yet described in any other organism. Overexpression of a non-conjugatable form of LdUfm1 and an active site mutant of LdUba5 resulted in reduced survival of Leishmania in the macrophage. Since mitochondrial activities are developmentally regulated in the life cycle of trypanosomatids, Ufm1 mediated modifications of mitochondrial proteins may be important in such regulation. Thus, Ufm1 conjugation pathway in Leishmania could be explored as a potential drug target in the control of Leishmaniasis.

  16. Using protein design algorithms to understand the molecular basis of disease caused by protein-DNA interactions: the Pax6 example

    DEFF Research Database (Denmark)

    Alibes, A.; Nadra, A.; De Masi, Federico

    2010-01-01

    diseases such as aniridia. The validity of FoldX to deal with protein-DNA interactions was demonstrated by showing that high levels of accuracy can be achieved for mutations affecting these interactions. Also we showed that protein-design algorithms can accurately reproduce experimental DNA-binding logos......Quite often a single or a combination of protein mutations is linked to specific diseases. However, distinguishing from sequence information which mutations have real effects in the protein's function is not trivial. Protein design tools are commonly used to explain mutations that affect protein...... stability, or protein-protein interaction, but not for mutations that could affect protein-DNA binding. Here, we used the protein design algorithm FoldX to model all known missense mutations in the paired box domain of Pax6, a highly conserved transcription factor involved in eye development and in several...

  17. Scale-free behaviour of amino acid pair interactions in folded proteins

    DEFF Research Database (Denmark)

    Petersen, Steffen B.; Neves-Petersen, Maria Teresa; Mortensen, Rasmus J.

    2012-01-01

    The protein structure is a cumulative result of interactions between amino acid residues interacting with each other through space and/or chemical bonds. Despite the large number of high resolution protein structures, the ‘‘protein structure code’’ has not been fully identified. Our manuscript...... presents a novel approach to protein structure analysis in order to identify rules for spatial packing of amino acid pairs in proteins. We have investigated 8706 high resolution non-redundant protein chains and quantified amino acid pair interactions in terms of solvent accessibility, spatial and sequence...... which amino acid paired residues contributed to the cells with a population above 50, pairs of Ala, Ile, Leu and Val dominate the results. This result is statistically highly significant. We postulate that such pairs form ‘‘structural stability points’’ in the protein structure. Our data shows...

  18. Molecular nonlinear dynamics and protein thermal uncertainty quantification

    Science.gov (United States)

    Xia, Kelin; Wei, Guo-Wei

    2014-01-01

    This work introduces molecular nonlinear dynamics (MND) as a new approach for describing protein folding and aggregation. By using a mode system, we show that the MND of disordered proteins is chaotic while that of folded proteins exhibits intrinsically low dimensional manifolds (ILDMs). The stability of ILDMs is found to strongly correlate with protein energies. We propose a novel method for protein thermal uncertainty quantification based on persistently invariant ILDMs. Extensive comparison with experimental data and the state-of-the-art methods in the field validate the proposed new method for protein B-factor prediction. PMID:24697365

  19. Investigating homology between proteins using energetic profiles.

    Science.gov (United States)

    Wrabl, James O; Hilser, Vincent J

    2010-03-26

    Accumulated experimental observations demonstrate that protein stability is often preserved upon conservative point mutation. In contrast, less is known about the effects of large sequence or structure changes on the stability of a particular fold. Almost completely unknown is the degree to which stability of different regions of a protein is generally preserved throughout evolution. In this work, these questions are addressed through thermodynamic analysis of a large representative sample of protein fold space based on remote, yet accepted, homology. More than 3,000 proteins were computationally analyzed using the structural-thermodynamic algorithm COREX/BEST. Estimated position-specific stability (i.e., local Gibbs free energy of folding) and its component enthalpy and entropy were quantitatively compared between all proteins in the sample according to all-vs.-all pairwise structural alignment. It was discovered that the local stabilities of homologous pairs were significantly more correlated than those of non-homologous pairs, indicating that local stability was indeed generally conserved throughout evolution. However, the position-specific enthalpy and entropy underlying stability were less correlated, suggesting that the overall regional stability of a protein was more important than the thermodynamic mechanism utilized to achieve that stability. Finally, two different types of statistically exceptional evolutionary structure-thermodynamic relationships were noted. First, many homologous proteins contained regions of similar thermodynamics despite localized structure change, suggesting a thermodynamic mechanism enabling evolutionary fold change. Second, some homologous proteins with extremely similar structures nonetheless exhibited different local stabilities, a phenomenon previously observed experimentally in this laboratory. These two observations, in conjunction with the principal conclusion that homologous proteins generally conserved local stability, may

  20. Investigating homology between proteins using energetic profiles.

    Directory of Open Access Journals (Sweden)

    James O Wrabl

    2010-03-01

    Full Text Available Accumulated experimental observations demonstrate that protein stability is often preserved upon conservative point mutation. In contrast, less is known about the effects of large sequence or structure changes on the stability of a particular fold. Almost completely unknown is the degree to which stability of different regions of a protein is generally preserved throughout evolution. In this work, these questions are addressed through thermodynamic analysis of a large representative sample of protein fold space based on remote, yet accepted, homology. More than 3,000 proteins were computationally analyzed using the structural-thermodynamic algorithm COREX/BEST. Estimated position-specific stability (i.e., local Gibbs free energy of folding and its component enthalpy and entropy were quantitatively compared between all proteins in the sample according to all-vs.-all pairwise structural alignment. It was discovered that the local stabilities of homologous pairs were significantly more correlated than those of non-homologous pairs, indicating that local stability was indeed generally conserved throughout evolution. However, the position-specific enthalpy and entropy underlying stability were less correlated, suggesting that the overall regional stability of a protein was more important than the thermodynamic mechanism utilized to achieve that stability. Finally, two different types of statistically exceptional evolutionary structure-thermodynamic relationships were noted. First, many homologous proteins contained regions of similar thermodynamics despite localized structure change, suggesting a thermodynamic mechanism enabling evolutionary fold change. Second, some homologous proteins with extremely similar structures nonetheless exhibited different local stabilities, a phenomenon previously observed experimentally in this laboratory. These two observations, in conjunction with the principal conclusion that homologous proteins generally conserved

  1. Modular protein switches derived from antibody mimetic proteins.

    Science.gov (United States)

    Nicholes, N; Date, A; Beaujean, P; Hauk, P; Kanwar, M; Ostermeier, M

    2016-02-01

    Protein switches have potential applications as biosensors and selective protein therapeutics. Protein switches built by fusion of proteins with the prerequisite input and output functions are currently developed using an ad hoc process. A modular switch platform in which existing switches could be readily adapted to respond to any ligand would be advantageous. We investigated the feasibility of a modular protein switch platform based on fusions of the enzyme TEM-1 β-lactamase (BLA) with two different antibody mimetic proteins: designed ankyrin repeat proteins (DARPins) and monobodies. We created libraries of random insertions of the gene encoding BLA into genes encoding a DARPin or a monobody designed to bind maltose-binding protein (MBP). From these libraries, we used a genetic selection system for β-lactamase activity to identify genes that conferred MBP-dependent ampicillin resistance to Escherichia coli. Some of these selected genes encoded switch proteins whose enzymatic activity increased up to 14-fold in the presence of MBP. We next introduced mutations into the antibody mimetic domain of these switches that were known to cause binding to different ligands. To different degrees, introduction of the mutations resulted in switches with the desired specificity, illustrating the potential modularity of these platforms. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  2. Prediction of Protein Configurational Entropy (Popcoen).

    Science.gov (United States)

    Goethe, Martin; Gleixner, Jan; Fita, Ignacio; Rubi, J Miguel

    2018-03-13

    A knowledge-based method for configurational entropy prediction of proteins is presented; this methodology is extremely fast, compared to previous approaches, because it does not involve any type of configurational sampling. Instead, the configurational entropy of a query fold is estimated by evaluating an artificial neural network, which was trained on molecular-dynamics simulations of ∼1000 proteins. The predicted entropy can be incorporated into a large class of protein software based on cost-function minimization/evaluation, in which configurational entropy is currently neglected for performance reasons. Software of this type is used for all major protein tasks such as structure predictions, proteins design, NMR and X-ray refinement, docking, and mutation effect predictions. Integrating the predicted entropy can yield a significant accuracy increase as we show exemplarily for native-state identification with the prominent protein software FoldX. The method has been termed Popcoen for Prediction of Protein Configurational Entropy. An implementation is freely available at http://fmc.ub.edu/popcoen/ .

  3. Aquaporin Protein-Protein Interactions

    Directory of Open Access Journals (Sweden)

    Jennifer Virginia Roche

    2017-10-01

    Full Text Available Aquaporins are tetrameric membrane-bound channels that facilitate transport of water and other small solutes across cell membranes. In eukaryotes, they are frequently regulated by gating or trafficking, allowing for the cell to control membrane permeability in a specific manner. Protein–protein interactions play crucial roles in both regulatory processes and also mediate alternative functions such as cell adhesion. In this review, we summarize recent knowledge about aquaporin protein–protein interactions; dividing the interactions into three types: (1 interactions between aquaporin tetramers; (2 interactions between aquaporin monomers within a tetramer (hetero-tetramerization; and (3 transient interactions with regulatory proteins. We particularly focus on the structural aspects of the interactions, discussing the small differences within a conserved overall fold that allow for aquaporins to be differentially regulated in an organism-, tissue- and trigger-specific manner. A deep knowledge about these differences is needed to fully understand aquaporin function and regulation in many physiological processes, and may enable design of compounds targeting specific aquaporins for treatment of human disease.

  4. Cell-free system for synthesizing membrane proteins cell free method for synthesizing membrane proteins

    Science.gov (United States)

    Laible, Philip D; Hanson, Deborah K

    2013-06-04

    The invention provides an in vitro method for producing proteins, membrane proteins, membrane-associated proteins, and soluble proteins that interact with membrane-associated proteins for assembly into an oligomeric complex or that require association with a membrane for proper folding. The method comprises, supplying intracytoplasmic membranes from organisms; modifying protein composition of intracytoplasmic membranes from organism by modifying DNA to delete genes encoding functions of the organism not associated with the formation of the intracytoplasmic membranes; generating appropriate DNA or RNA templates that encode the target protein; and mixing the intracytoplasmic membranes with the template and a transcription/translation-competent cellular extract to cause simultaneous production of the membrane proteins and encapsulation of the membrane proteins within the intracytoplasmic membranes.

  5. Improved protein extraction and protein identification from archival formalin-fixed paraffin-embedded human aortas.

    Science.gov (United States)

    Fu, Zongming; Yan, Kun; Rosenberg, Avraham; Jin, Zhicheng; Crain, Barbara; Athas, Grace; Heide, Richard S Vander; Howard, Timothy; Everett, Allen D; Herrington, David; Van Eyk, Jennifer E

    2013-04-01

    Evaluate combination of heat and elevated pressure to enhance protein extraction and quality of formalin-fixed (FF), and FF paraffin-embedded (FFPE) aorta for proteomics. Proteins were extracted from fresh frozen aorta at room temperature (RT). FF and FFPE aortas (3 months and 15 years) were extracted at RT, heat alone, or a combination of heat and high pressure. Protein yields were compared, and digested peptides from the extracts were analyzed with MS. Combined heat and elevated pressure increased protein yield from human FF or FFPE aorta compared to matched tissues with heat alone (1.5-fold) or at RT (8.3-fold), resulting in more proteins identified and with more sequence coverage. The length of storage did adversely affect the quality of proteins from FF tissue. For long-term storage, aorta was preserved better with FFPE than FF alone. Periostin and MGF-E8 were demonstrated suitable for MRM assays from FFPE aorta. Combination of heat and high pressure is an effective method to extract proteins from FFPE aorta for downstream proteomics. This method opens the possibility for use of archival and often rare FFPE aortas and possibly other tissues available to proteomics for biomarker discovery and quantification. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. Distinctive serum protein profiles involving abundant proteins in lung cancer patients based upon antibody microarray analysis

    International Nuclear Information System (INIS)

    Gao, Wei-Min; Haab, Brian B; Hanash, Samir M; Kuick, Rork; Orchekowski, Randal P; Misek, David E; Qiu, Ji; Greenberg, Alissa K; Rom, William N; Brenner, Dean E; Omenn, Gilbert S

    2005-01-01

    Cancer serum protein profiling by mass spectrometry has uncovered mass profiles that are potentially diagnostic for several common types of cancer. However, direct mass spectrometric profiling has a limited dynamic range and difficulties in providing the identification of the distinctive proteins. We hypothesized that distinctive profiles may result from the differential expression of relatively abundant serum proteins associated with the host response. Eighty-four antibodies, targeting a wide range of serum proteins, were spotted onto nitrocellulose-coated microscope slides. The abundances of the corresponding proteins were measured in 80 serum samples, from 24 newly diagnosed subjects with lung cancer, 24 healthy controls, and 32 subjects with chronic obstructive pulmonary disease (COPD). Two-color rolling-circle amplification was used to measure protein abundance. Seven of the 84 antibodies gave a significant difference (p < 0.01) for the lung cancer patients as compared to healthy controls, as well as compared to COPD patients. Proteins that exhibited higher abundances in the lung cancer samples relative to the control samples included C-reactive protein (CRP; a 13.3 fold increase), serum amyloid A (SAA; a 2.0 fold increase), mucin 1 and α-1-antitrypsin (1.4 fold increases). The increased expression levels of CRP and SAA were validated by Western blot analysis. Leave-one-out cross-validation was used to construct Diagonal Linear Discriminant Analysis (DLDA) classifiers. At a cutoff where all 56 of the non-tumor samples were correctly classified, 15/24 lung tumor patient sera were correctly classified. Our results suggest that a distinctive serum protein profile involving abundant proteins may be observed in lung cancer patients relative to healthy subjects or patients with chronic disease and may have utility as part of strategies for detecting lung cancer

  7. GRP94: An HSP90-like protein specialized for protein folding and quality control in the endoplasmic reticulum

    DEFF Research Database (Denmark)

    Marzec, Michal; Eletto, Davide; Argon, Yair

    2012-01-01

    Glucose-regulated protein 94 is the HSP90-like protein in the lumen of the endoplasmic reticulum and therefore it chaperones secreted and membrane proteins. It has essential functions in development and physiology of multicellular organisms, at least in part because of this unique clientele. GRP94...

  8. Escherichia coli fusion carrier proteins act as solubilizing agents for recombinant uncoupling protein 1 through interactions with GroEL

    International Nuclear Information System (INIS)

    Douette, Pierre; Navet, Rachel; Gerkens, Pascal; Galleni, Moreno; Levy, Daniel; Sluse, Francis E.

    2005-01-01

    Fusing recombinant proteins to highly soluble partners is frequently used to prevent aggregation of recombinant proteins in Escherichia coli. Moreover, co-overexpression of prokaryotic chaperones can increase the amount of properly folded recombinant proteins. To understand the solubility enhancement of fusion proteins, we designed two recombinant proteins composed of uncoupling protein 1 (UCP1), a mitochondrial membrane protein, in fusion with MBP or NusA. We were able to express soluble forms of MBP-UCP1 and NusA-UCP1 despite the high hydrophobicity of UCP1. Furthermore, the yield of soluble fusion proteins depended on co-overexpression of GroEL that catalyzes folding of polypeptides. MBP-UCP1 was expressed in the form of a non-covalent complex with GroEL. MBP-UCP1/GroEL was purified and characterized by dynamic light scattering, gel filtration, and electron microscopy. Our findings suggest that MBP and NusA act as solubilizing agents by forcing the recombinant protein to pass through the bacterial chaperone pathway in the context of fusion protein

  9. Analysis of Translocation-Competent Secretory Proteins by HDX-MS

    DEFF Research Database (Denmark)

    Tsirigotaki, A.; Papanastasiou, M.; Trelle, M. B.

    2017-01-01

    Protein folding is an intricate and precise process in living cells. Most exported proteins evade cytoplasmic folding, become targeted to the membrane, and then trafficked into/across membranes. Their targeting and translocation-competent states are nonnatively folded. However, once they reach...... the appropriate cellular compartment, they can fold to their native states. The nonnative states of preproteins remain structurally poorly characterized since increased disorder, protein sizes, aggregation propensity, and the observation timescale are often limiting factors for typical structural approaches...... such as X-ray crystallography and NMR. Here, we present an alternative approach for the in vitro analysis of nonfolded translocation-competent protein states and their comparison with their native states. We make use of hydrogen/deuterium exchange coupled with mass spectrometry (HDX-MS), a method based...

  10. The protein side of the central dogma: permanence and change.

    Science.gov (United States)

    Morange, Michel

    2006-01-01

    There are two facets to the central dogma proposed by Francis Crick in 1957. One concerns the relation between the sequence of nucleotides and the sequence of amino acids, the second is devoted to the relation between the sequence of amino acids and the native three-dimensional structure of proteins. 'Folding is simply a function of the order of the amino acids,' i.e. no information is required for the proper folding of a protein other than the information contained in its sequence. This protein side of the central dogma was elaborated in a scientific context in which the characteristics and functions of proteins, and the mechanisms of protein folding, were seen very differently. This context, which made the folding problem a simple one, supported the bold proposition of Francis Crick. The protein side of the central dogma was not challenged by the discovery of prions if one adopts the definition of information given by Francis Crick. It might have been challenged by the discovery that regulatory enzymes exist in different conformations, and the evidence for the existence of chaperones assisting protein folding. But it was not, and folding remains what it was for Francis Crick, 'simply a function of the order of amino acids'. But the meaning of 'function' has dramatically changed. It is no longer the result of simple physicochemical laws, but that of a long evolutionary process which has optimized protein folding. Molecular mechanistic explanations have to be allied with evolutionary explanations, in a way characteristic of present biology.

  11. Foldability of a Natural De Novo Evolved Protein.

    Science.gov (United States)

    Bungard, Dixie; Copple, Jacob S; Yan, Jing; Chhun, Jimmy J; Kumirov, Vlad K; Foy, Scott G; Masel, Joanna; Wysocki, Vicki H; Cordes, Matthew H J

    2017-11-07

    The de novo evolution of protein-coding genes from noncoding DNA is emerging as a source of molecular innovation in biology. Studies of random sequence libraries, however, suggest that young de novo proteins will not fold into compact, specific structures typical of native globular proteins. Here we show that Bsc4, a functional, natural de novo protein encoded by a gene that evolved recently from noncoding DNA in the yeast S. cerevisiae, folds to a partially specific three-dimensional structure. Bsc4 forms soluble, compact oligomers with high β sheet content and a hydrophobic core, and undergoes cooperative, reversible denaturation. Bsc4 lacks a specific quaternary state, however, existing instead as a continuous distribution of oligomer sizes, and binds dyes indicative of amyloid oligomers or molten globules. The combination of native-like and non-native-like properties suggests a rudimentary fold that could potentially act as a functional intermediate in the emergence of new folded proteins de novo. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Chaperoning Roles of Macromolecules Interacting with Proteins in Vivo

    Directory of Open Access Journals (Sweden)

    Baik L. Seong

    2011-03-01

    Full Text Available The principles obtained from studies on molecular chaperones have provided explanations for the assisted protein folding in vivo. However, the majority of proteins can fold without the assistance of the known molecular chaperones, and little attention has been paid to the potential chaperoning roles of other macromolecules. During protein biogenesis and folding, newly synthesized polypeptide chains interact with a variety of macromolecules, including ribosomes, RNAs, cytoskeleton, lipid bilayer, proteolytic system, etc. In general, the hydrophobic interactions between molecular chaperones and their substrates have been widely believed to be mainly responsible for the substrate stabilization against aggregation. Emerging evidence now indicates that other features of macromolecules such as their surface charges, probably resulting in electrostatic repulsions, and steric hindrance, could play a key role in the stabilization of their linked proteins against aggregation. Such stabilizing mechanisms are expected to give new insights into our understanding of the chaperoning functions for de novo protein folding. In this review, we will discuss the possible chaperoning roles of these macromolecules in de novo folding, based on their charge and steric features.

  13. Soliton concepts and protein structure

    Science.gov (United States)

    Krokhotin, Andrei; Niemi, Antti J.; Peng, Xubiao

    2012-03-01

    Structural classification shows that the number of different protein folds is surprisingly small. It also appears that proteins are built in a modular fashion from a relatively small number of components. Here we propose that the modular building blocks are made of the dark soliton solution of a generalized discrete nonlinear Schrödinger equation. We find that practically all protein loops can be obtained simply by scaling the size and by joining together a number of copies of the soliton, one after another. The soliton has only two loop-specific parameters, and we compute their statistical distribution in the Protein Data Bank (PDB). We explicitly construct a collection of 200 sets of parameters, each determining a soliton profile that describes a different short loop. The ensuing profiles cover practically all those proteins in PDB that have a resolution which is better than 2.0 Å, with a precision such that the average root-mean-square distance between the loop and its soliton is less than the experimental B-factor fluctuation distance. We also present two examples that describe how the loop library can be employed both to model and to analyze folded proteins.

  14. Disaggregases, molecular chaperones that resolubilize protein aggregates

    Directory of Open Access Journals (Sweden)

    David Z. Mokry

    2015-08-01

    Full Text Available The process of folding is a seminal event in the life of a protein, as it is essential for proper protein function and therefore cell physiology. Inappropriate folding, or misfolding, can not only lead to loss of function, but also to the formation of protein aggregates, an insoluble association of polypeptides that harm cell physiology, either by themselves or in the process of formation. Several biological processes have evolved to prevent and eliminate the existence of non-functional and amyloidogenic aggregates, as they are associated with several human pathologies. Molecular chaperones and heat shock proteins are specialized in controlling the quality of the proteins in the cell, specifically by aiding proper folding, and dissolution and clearance of already formed protein aggregates. The latter is a function of disaggregases, mainly represented by the ClpB/Hsp104 subfamily of molecular chaperones, that are ubiquitous in all organisms but, surprisingly, have no orthologs in the cytosol of metazoan cells. This review aims to describe the characteristics of disaggregases and to discuss the function of yeast Hsp104, a disaggregase that is also involved in prion propagation and inheritance.

  15. Protein folding and translocation : single-molecule investigations

    NARCIS (Netherlands)

    Leeuwen, Rudolphus Gerardus Henricus van

    2006-01-01

    This thesis describes experiments, in which we used an optical-tweezers setup to study a number of biological systems. We studied the interaction between the E. coli molecular chaperone SecB and a protein that was being unfolded and refolded using our optical tweezers setup. Our measurements clearly

  16. A new protein-protein interaction sensor based on tripartite split-GFP association.

    Science.gov (United States)

    Cabantous, Stéphanie; Nguyen, Hau B; Pedelacq, Jean-Denis; Koraïchi, Faten; Chaudhary, Anu; Ganguly, Kumkum; Lockard, Meghan A; Favre, Gilles; Terwilliger, Thomas C; Waldo, Geoffrey S

    2013-10-04

    Monitoring protein-protein interactions in living cells is key to unraveling their roles in numerous cellular processes and various diseases. Previously described split-GFP based sensors suffer from poor folding and/or self-assembly background fluorescence. Here, we have engineered a micro-tagging system to monitor protein-protein interactions in vivo and in vitro. The assay is based on tripartite association between two twenty amino-acids long GFP tags, GFP10 and GFP11, fused to interacting protein partners, and the complementary GFP1-9 detector. When proteins interact, GFP10 and GFP11 self-associate with GFP1-9 to reconstitute a functional GFP. Using coiled-coils and FRB/FKBP12 model systems we characterize the sensor in vitro and in Escherichia coli. We extend the studies to mammalian cells and examine the FK-506 inhibition of the rapamycin-induced association of FRB/FKBP12. The small size of these tags and their minimal effect on fusion protein behavior and solubility should enable new experiments for monitoring protein-protein association by fluorescence.

  17. High-resolution nuclear magnetic resonance studies of proteins.

    Science.gov (United States)

    Jonas, Jiri

    2002-03-25

    The combination of advanced high-resolution nuclear magnetic resonance (NMR) techniques with high-pressure capability represents a powerful experimental tool in studies of protein folding. This review is organized as follows: after a general introduction of high-pressure, high-resolution NMR spectroscopy of proteins, the experimental part deals with instrumentation. The main section of the review is devoted to NMR studies of reversible pressure unfolding of proteins with special emphasis on pressure-assisted cold denaturation and the detection of folding intermediates. Recent studies investigating local perturbations in proteins and the experiments following the effects of point mutations on pressure stability of proteins are also discussed. Ribonuclease A, lysozyme, ubiquitin, apomyoglobin, alpha-lactalbumin and troponin C were the model proteins investigated.

  18. Inferring repeat-protein energetics from evolutionary information.

    Directory of Open Access Journals (Sweden)

    Rocío Espada

    2017-06-01

    Full Text Available Natural protein sequences contain a record of their history. A common constraint in a given protein family is the ability to fold to specific structures, and it has been shown possible to infer the main native ensemble by analyzing covariations in extant sequences. Still, many natural proteins that fold into the same structural topology show different stabilization energies, and these are often related to their physiological behavior. We propose a description for the energetic variation given by sequence modifications in repeat proteins, systems for which the overall problem is simplified by their inherent symmetry. We explicitly account for single amino acid and pair-wise interactions and treat higher order correlations with a single term. We show that the resulting evolutionary field can be interpreted with structural detail. We trace the variations in the energetic scores of natural proteins and relate them to their experimental characterization. The resulting energetic evolutionary field allows the prediction of the folding free energy change for several mutants, and can be used to generate synthetic sequences that are statistically indistinguishable from the natural counterparts.

  19. A Self-Assisting Protein Folding Model for Teaching Structural Molecular Biology.

    Science.gov (United States)

    Davenport, Jodi; Pique, Michael; Getzoff, Elizabeth; Huntoon, Jon; Gardner, Adam; Olson, Arthur

    2017-04-04

    Structural molecular biology is now becoming part of high school science curriculum thus posing a challenge for teachers who need to convey three-dimensional (3D) structures with conventional text and pictures. In many cases even interactive computer graphics does not go far enough to address these challenges. We have developed a flexible model of the polypeptide backbone using 3D printing technology. With this model we have produced a polypeptide assembly kit to create an idealized model of the Triosephosphate isomerase mutase enzyme (TIM), which forms a structure known as TIM barrel. This kit has been used in a laboratory practical where students perform a step-by-step investigation into the nature of protein folding, starting with the handedness of amino acids to the formation of secondary and tertiary structure. Based on the classroom evidence we collected, we conclude that these models are valuable and inexpensive resource for teaching structural molecular biology. Copyright © 2017 Elsevier Ltd. All rights reserved.

  20. Enhancing the productivity of soluble green fluorescent protein ...

    African Journals Online (AJOL)

    Protein sequences might have been evolved against different environmental pressures, which results in non-optimum properties in their stability, activity and folding efficiency. Directed evolution and consensus-based engineering of proteins are the protein engineering principles for the re-evolution of such natural proteins ...

  1. Origin and Evolution of Protein Fold Designs Inferred from Phylogenomic Analysis of CATH Domain Structures in Proteomes

    Science.gov (United States)

    Bukhari, Syed Abbas; Caetano-Anollés, Gustavo

    2013-01-01

    The spatial arrangements of secondary structures in proteins, irrespective of their connectivity, depict the overall shape and organization of protein domains. These features have been used in the CATH and SCOP classifications to hierarchically partition fold space and define the architectural make up of proteins. Here we use phylogenomic methods and a census of CATH structures in hundreds of genomes to study the origin and diversification of protein architectures (A) and their associated topologies (T) and superfamilies (H). Phylogenies that describe the evolution of domain structures and proteomes were reconstructed from the structural census and used to generate timelines of domain discovery. Phylogenies of CATH domains at T and H levels of structural abstraction and associated chronologies revealed patterns of reductive evolution, the early rise of Archaea, three epochs in the evolution of the protein world, and patterns of structural sharing between superkingdoms. Phylogenies of proteomes confirmed the early appearance of Archaea. While these findings are in agreement with previous phylogenomic studies based on the SCOP classification, phylogenies unveiled sharing patterns between Archaea and Eukarya that are recent and can explain the canonical bacterial rooting typically recovered from sequence analysis. Phylogenies of CATH domains at A level uncovered general patterns of architectural origin and diversification. The tree of A structures showed that ancient structural designs such as the 3-layer (αβα) sandwich (3.40) or the orthogonal bundle (1.10) are comparatively simpler in their makeup and are involved in basic cellular functions. In contrast, modern structural designs such as prisms, propellers, 2-solenoid, super-roll, clam, trefoil and box are not widely distributed and were probably adopted to perform specialized functions. Our timelines therefore uncover a universal tendency towards protein structural complexity that is remarkable. PMID:23555236

  2. Light-activated control of protein channel assembly mediated by membrane mechanics

    Science.gov (United States)

    Miller, David M.; Findlay, Heather E.; Ces, Oscar; Templer, Richard H.; Booth, Paula J.

    2016-12-01

    Photochemical processes provide versatile triggers of chemical reactions. Here, we use a photoactivated lipid switch to modulate the folding and assembly of a protein channel within a model biological membrane. In contrast to the information rich field of water-soluble protein folding, there is only a limited understanding of the assembly of proteins that are integral to biological membranes. It is however possible to exploit the foreboding hydrophobic lipid environment and control membrane protein folding via lipid bilayer mechanics. Mechanical properties such as lipid chain lateral pressure influence the insertion and folding of proteins in membranes, with different stages of folding having contrasting sensitivities to the bilayer properties. Studies to date have relied on altering bilayer properties through lipid compositional changes made at equilibrium, and thus can only be made before or after folding. We show that light-activation of photoisomerisable di-(5-[[4-(4-butylphenyl)azo]phenoxy]pentyl)phosphate (4-Azo-5P) lipids influences the folding and assembly of the pentameric bacterial mechanosensitive channel MscL. The use of a photochemical reaction enables the bilayer properties to be altered during folding, which is unprecedented. This mechanical manipulation during folding, allows for optimisation of different stages of the component insertion, folding and assembly steps within the same lipid system. The photochemical approach offers the potential to control channel assembly when generating synthetic devices that exploit the mechanosensitive protein as a nanovalve.

  3. Use of designed sequences in protein structure recognition.

    Science.gov (United States)

    Kumar, Gayatri; Mudgal, Richa; Srinivasan, Narayanaswamy; Sandhya, Sankaran

    2018-05-09

    Knowledge of the protein structure is a pre-requisite for improved understanding of molecular function. The gap in the sequence-structure space has increased in the post-genomic era. Grouping related protein sequences into families can aid in narrowing the gap. In the Pfam database, structure description is provided for part or full-length proteins of 7726 families. For the remaining 52% of the families, information on 3-D structure is not yet available. We use the computationally designed sequences that are intermediately related to two protein domain families, which are already known to share the same fold. These strategically designed sequences enable detection of distant relationships and here, we have employed them for the purpose of structure recognition of protein families of yet unknown structure. We first measured the success rate of our approach using a dataset of protein families of known fold and achieved a success rate of 88%. Next, for 1392 families of yet unknown structure, we made structural assignments for part/full length of the proteins. Fold association for 423 domains of unknown function (DUFs) are provided as a step towards functional annotation. The results indicate that knowledge-based filling of gaps in protein sequence space is a lucrative approach for structure recognition. Such sequences assist in traversal through protein sequence space and effectively function as 'linkers', where natural linkers between distant proteins are unavailable. This article was reviewed by Oliviero Carugo, Christine Orengo and Srikrishna Subramanian.

  4. Protein Design Using Unnatural Amino Acids

    Science.gov (United States)

    Bilgiçer, Basar; Kumar, Krishna

    2003-11-01

    With the increasing availability of whole organism genome sequences, understanding protein structure and function is of capital importance. Recent developments in the methodology of incorporation of unnatural amino acids into proteins allow the exploration of proteins at a very detailed level. Furthermore, de novo design of novel protein structures and function is feasible with unprecedented sophistication. Using examples from the literature, this article describes the available methods for unnatural amino acid incorporation and highlights some recent applications including the design of hyperstable protein folds.

  5. Arraying proteins by cell-free synthesis.

    Science.gov (United States)

    He, Mingyue; Wang, Ming-Wei

    2007-10-01

    Recent advances in life science have led to great motivation for the development of protein arrays to study functions of genome-encoded proteins. While traditional cell-based methods have been commonly used for generating protein arrays, they are usually a time-consuming process with a number of technical challenges. Cell-free protein synthesis offers an attractive system for making protein arrays, not only does it rapidly converts the genetic information into functional proteins without the need for DNA cloning, but also presents a flexible environment amenable to production of folded proteins or proteins with defined modifications. Recent advancements have made it possible to rapidly generate protein arrays from PCR DNA templates through parallel on-chip protein synthesis. This article reviews current cell-free protein array technologies and their proteomic applications.

  6. Using linear algebra for protein structural comparison and classification.

    Science.gov (United States)

    Gomide, Janaína; Melo-Minardi, Raquel; Dos Santos, Marcos Augusto; Neshich, Goran; Meira, Wagner; Lopes, Júlio César; Santoro, Marcelo

    2009-07-01

    In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.

  7. Using linear algebra for protein structural comparison and classification

    Directory of Open Access Journals (Sweden)

    Janaína Gomide

    2009-01-01

    Full Text Available In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD and Latent Semantic Indexing (LSI techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.

  8. Chaperone-protease networks in mitochondrial protein homeostasis.

    Science.gov (United States)

    Voos, Wolfgang

    2013-02-01

    As essential organelles, mitochondria are intimately integrated into the metabolism of a eukaryotic cell. The maintenance of the functional integrity of the mitochondrial proteome, also termed protein homeostasis, is facing many challenges both under normal and pathological conditions. First, since mitochondria are derived from bacterial ancestor cells, the proteins in this endosymbiotic organelle have a mixed origin. Only a few proteins are encoded on the mitochondrial genome, most genes for mitochondrial proteins reside in the nuclear genome of the host cell. This distribution requires a complex biogenesis of mitochondrial proteins, which are mostly synthesized in the cytosol and need to be imported into the organelle. Mitochondrial protein biogenesis usually therefore comprises complex folding and assembly processes to reach an enzymatically active state. In addition, specific protein quality control (PQC) processes avoid an accumulation of damaged or surplus polypeptides. Mitochondrial protein homeostasis is based on endogenous enzymatic components comprising a diverse set of chaperones and proteases that form an interconnected functional network. This review describes the different types of mitochondrial proteins with chaperone functions and covers the current knowledge of their roles in protein biogenesis, folding, proteolytic removal and prevention of aggregation, the principal reactions of protein homeostasis. This article is part of a Special Issue entitled: Protein Import and Quality Control in Mitochondria and Plastids. Copyright © 2012 Elsevier B.V. All rights reserved.

  9. Thermodynamics of protein folding using a modified Wako-Saitô-Muñoz-Eaton model.

    Science.gov (United States)

    Tsai, Min-Yeh; Yuan, Jian-Min; Teranishi, Yoshiaki; Lin, Sheng Hsien

    2012-09-01

    Herein, we propose a modified version of the Wako-Saitô-Muñoz-Eaton (WSME) model. The proposed model introduces an empirical temperature parameter for the hypothetical structural units (i.e., foldons) in proteins to include site-dependent thermodynamic behavior. The thermodynamics for both our proposed model and the original WSME model were investigated. For a system with beta-hairpin topology, a mathematical treatment (contact-pair treatment) to facilitate the calculation of its partition function was developed. The results show that the proposed model provides better insight into the site-dependent thermodynamic behavior of the system, compared with the original WSME model. From this site-dependent point of view, the relationship between probe-dependent experimental results and model's thermodynamic predictions can be explained. The model allows for suggesting a general principle to identify foldon behavior. We also find that the backbone hydrogen bonds may play a role of structural constraints in modulating the cooperative system. Thus, our study may contribute to the understanding of the fundamental principles for the thermodynamics of protein folding.

  10. Structure-function correlations of pulmonary surfactant protein SP-B and the saposin-like family of proteins.

    Science.gov (United States)

    Olmeda, Bárbara; García-Álvarez, Begoña; Pérez-Gil, Jesús

    2013-03-01

    Pulmonary surfactant is a lipid-protein complex secreted by the respiratory epithelium of mammalian lungs, which plays an essential role in stabilising the alveolar surface and so reducing the work of breathing. The surfactant protein SP-B is part of this complex, and is strictly required for the assembly of pulmonary surfactant and its extracellular development to form stable surface-active films at the air-liquid alveolar interface, making the lack of SP-B incompatible with life. In spite of its physiological importance, a model for the structure and the mechanism of action of SP-B is still needed. The sequence of SP-B is homologous to that of the saposin-like family of proteins, which are membrane-interacting polypeptides with apparently diverging activities, from the co-lipase action of saposins to facilitate the degradation of sphingolipids in the lysosomes to the cytolytic actions of some antibiotic proteins, such as NK-lysin and granulysin or the amoebapore of Entamoeba histolytica. Numerous studies on the interactions of these proteins with membranes have still not explained how a similar sequence and a potentially related fold can sustain such apparently different activities. In the present review, we have summarised the most relevant features of the structure, lipid-protein and protein-protein interactions of SP-B and the saposin-like family of proteins, as a basis to propose an integrated model and a common mechanistic framework of the apparent functional versatility of the saposin fold.

  11. Method of generating ploynucleotides encoding enhanced folding variants

    Energy Technology Data Exchange (ETDEWEB)

    Bradbury, Andrew M.; Kiss, Csaba; Waldo, Geoffrey S.

    2017-05-02

    The invention provides directed evolution methods for improving the folding, solubility and stability (including thermostability) characteristics of polypeptides. In one aspect, the invention provides a method for generating folding and stability-enhanced variants of proteins, including but not limited to fluorescent proteins, chromophoric proteins and enzymes. In another aspect, the invention provides methods for generating thermostable variants of a target protein or polypeptide via an internal destabilization baiting strategy. Internally destabilization a protein of interest is achieved by inserting a heterologous, folding-destabilizing sequence (folding interference domain) within DNA encoding the protein of interest, evolving the protein sequences adjacent to the heterologous insertion to overcome the destabilization (using any number of mutagenesis methods), thereby creating a library of variants. The variants in the library are expressed, and those with enhanced folding characteristics selected.

  12. Classification of proteins: available structural space for molecular modeling.

    Science.gov (United States)

    Andreeva, Antonina

    2012-01-01

    The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.

  13. Rapid protein fold determination using secondary chemical shifts and cross-hydrogen bond 15N-13C' scalar couplings (3hbJNC')

    Energy Technology Data Exchange (ETDEWEB)

    Bonvin, Alexandre M.J.J.; Houben, Klaartje; Guenneugues, Marc; Kaptein, Robert; Boelens, Rolf [Utrecht University, Bijvoet Center for Biomolecular Research, NMR Spectroscopy (Netherlands)

    2001-11-15

    The possibility of generating protein folds at the stage of backbone assignment using structural restraints derived from experimentally measured cross-hydrogen bond scalar couplings and secondary chemical shift information is investigated using as a test case the small {alpha}/{beta} protein chymotrypsin inhibitor 2. Dihedral angle restraints for the {phi} and {psi} angles of 32 out of 64 residues could be obtained from secondary chemical shift analysis with the TALOS program (Corneliscu et al., 1999a). This information was supplemented by 18 hydrogen-bond restraints derived from experimentally measured cross-hydrogen bond {sup 3hb}J{sub NC'} coupling constants. These experimental data were sufficient to generate structures that are as close as 1.0 A backbone rmsd from the crystal structure. The fold is, however, not uniquely defined and several solutions are generated that cannot be distinguished on the basis of violations or energetic considerations. Correct folds could be identified by combining clustering methods with knowledge-based potentials derived from structural databases.

  14. Simulation of Protein and Peptide-Based Biomaterials

    National Research Council Canada - National Science Library

    Daggett, Valerie

    2002-01-01

    The overall goal of the proposed research is to pursue realistic molecular modeling studies of the stability, dynamics, structure, function, and folding of proteins and protein-based biomaterials in solution...

  15. Dancing Protein Clouds: The Strange Biology and Chaotic Physics of Intrinsically Disordered Proteins.

    Science.gov (United States)

    Uversky, Vladimir N

    2016-03-25

    Biologically active but floppy proteins represent a new reality of modern protein science. These intrinsically disordered proteins (IDPs) and hybrid proteins containing ordered and intrinsically disordered protein regions (IDPRs) constitute a noticeable part of any given proteome. Functionally, they complement ordered proteins, and their conformational flexibility and structural plasticity allow them to perform impossible tricks and be engaged in biological activities that are inaccessible to well folded proteins with their unique structures. The major goals of this minireview are to show that, despite their simplified amino acid sequences, IDPs/IDPRs are complex entities often resembling chaotic systems, are structurally and functionally heterogeneous, and can be considered an important part of the structure-function continuum. Furthermore, IDPs/IDPRs are everywhere, and are ubiquitously engaged in various interactions characterized by a wide spectrum of binding scenarios and an even wider spectrum of structural and functional outputs. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  16. Small Scaffolds, Big Potential: Developing Miniature Proteins as Therapeutic Agents.

    Science.gov (United States)

    Holub, Justin M

    2017-09-01

    Preclinical Research Miniature proteins are a class of oligopeptide characterized by their short sequence lengths and ability to adopt well-folded, three-dimensional structures. Because of their biomimetic nature and synthetic tractability, miniature proteins have been used to study a range of biochemical processes including fast protein folding, signal transduction, catalysis and molecular transport. Recently, miniature proteins have been gaining traction as potential therapeutic agents because their small size and ability to fold into defined tertiary structures facilitates their development as protein-based drugs. This research overview discusses emerging developments involving the use of miniature proteins as scaffolds to design novel therapeutics for the treatment and study of human disease. Specifically, this review will explore strategies to: (i) stabilize miniature protein tertiary structure; (ii) optimize biomolecular recognition by grafting functional epitopes onto miniature protein scaffolds; and (iii) enhance cytosolic delivery of miniature proteins through the use of cationic motifs that facilitate endosomal escape. These objectives are discussed not only to address challenges in developing effective miniature protein-based drugs, but also to highlight the tremendous potential miniature proteins hold for combating and understanding human disease. Drug Dev Res 78 : 268-282, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.

  17. The human protein disulfide isomerase gene family

    Directory of Open Access Journals (Sweden)

    Galligan James J

    2012-07-01

    Full Text Available Abstract Enzyme-mediated disulfide bond formation is a highly conserved process affecting over one-third of all eukaryotic proteins. The enzymes primarily responsible for facilitating thiol-disulfide exchange are members of an expanding family of proteins known as protein disulfide isomerases (PDIs. These proteins are part of a larger superfamily of proteins known as the thioredoxin protein family (TRX. As members of the PDI family of proteins, all proteins contain a TRX-like structural domain and are predominantly expressed in the endoplasmic reticulum. Subcellular localization and the presence of a TRX domain, however, comprise the short list of distinguishing features required for gene family classification. To date, the PDI gene family contains 21 members, varying in domain composition, molecular weight, tissue expression, and cellular processing. Given their vital role in protein-folding, loss of PDI activity has been associated with the pathogenesis of numerous disease states, most commonly related to the unfolded protein response (UPR. Over the past decade, UPR has become a very attractive therapeutic target for multiple pathologies including Alzheimer disease, Parkinson disease, alcoholic and non-alcoholic liver disease, and type-2 diabetes. Understanding the mechanisms of protein-folding, specifically thiol-disulfide exchange, may lead to development of a novel class of therapeutics that would help alleviate a wide range of diseases by targeting the UPR.

  18. Two-dimensional NMR and photo-CIDNP studies of the insulin monomer: Assignment of aromatic resonances with application to protein folding, structure, and dynamics

    International Nuclear Information System (INIS)

    Weiss, M.A.; Shoelson, S.E.; Nguyen, D.T.; O'Shea, E.; Karplus, M.; Khait, I.; Neuringer, L.J.; Inouye, K.; Frank, B.H.; Beckage, M.

    1989-01-01

    The aromatic 1 H NMR resonances of the insulin monomer are assigned at 500 MHz by comparative studies of chemically modified and genetically altered variants, including a mutant insulin (PheB25 → Leu) associated with diabetes mellitus. The two histidines, three phenylalanines, and four tyrosines are observed to be in distinct local environments; their assignment provides sensitive markers for studies of tertiary structure, protein dynamics, and protein folding. The environments of the tyrosine residues have also been investigated by photochemically induced dynamic nuclear polarization (photo-CIDNP) and analyzed in relation to packing constrains in the crystal structures of insulin. Dimerization involving specific B-chain interactions is observed with increasing protein concentration and is shown to depend on temperature, pH, and solvent composition. The differences between proinsulin and mini-proinsulin suggest a structural mechanism for the observation that the fully reduced B29-A1 analogue folds more efficiently than proinsulin to form the correct pattern of disulfide bonds. These results are discussed in relation to molecular mechanics calculations of insulin based on the available crystal structures

  19. Neural Networks for protein Structure Prediction

    DEFF Research Database (Denmark)

    Bohr, Henrik

    1998-01-01

    This is a review about neural network applications in bioinformatics. Especially the applications to protein structure prediction, e.g. prediction of secondary structures, prediction of surface structure, fold class recognition and prediction of the 3-dimensional structure of protein backbones...

  20. Protein Attachment on Nanodiamonds.

    Science.gov (United States)

    Lin, Chung-Lun; Lin, Cheng-Huang; Chang, Huan-Cheng; Su, Meng-Chih

    2015-07-16

    A recent advance in nanotechnology is the scale-up production of small and nonaggregated diamond nanoparticles suitable for biological applications. Using detonation nanodiamonds (NDs) with an average diameter of ∼4 nm as the adsorbents, we have studied the static attachment of three proteins (myoglobin, bovine serum albumin, and insulin) onto the nanoparticles by optical spectroscopy, mass spectrometry, and dynamic light scattering, and electrophoretic zeta potential measurements. Results show that the protein surface coverage is predominantly determined by the competition between protein-protein and protein-ND interactions, giving each protein a unique and characteristic structural configuration in its own complex. Specifically, both myoglobin and bovine serum albumin show a Langmuir-type adsorption behavior, forming 1:1 complexes at saturation, whereas insulin folds into a tightly bound multimer before adsorption. The markedly different adsorption patterns appear to be independent of the protein concentration and are closely related to the affinity of the individual proteins for the NDs. The present study provides a fundamental understanding for the use of NDs as a platform for nanomedical drug delivery.

  1. A probabilistic fragment-based protein structure prediction algorithm.

    Directory of Open Access Journals (Sweden)

    David Simoncini

    Full Text Available Conformational sampling is one of the bottlenecks in fragment-based protein structure prediction approaches. They generally start with a coarse-grained optimization where mainchain atoms and centroids of side chains are considered, followed by a fine-grained optimization with an all-atom representation of proteins. It is during this coarse-grained phase that fragment-based methods sample intensely the conformational space. If the native-like region is sampled more, the accuracy of the final all-atom predictions may be improved accordingly. In this work we present EdaFold, a new method for fragment-based protein structure prediction based on an Estimation of Distribution Algorithm. Fragment-based approaches build protein models by assembling short fragments from known protein structures. Whereas the probability mass functions over the fragment libraries are uniform in the usual case, we propose an algorithm that learns from previously generated decoys and steers the search toward native-like regions. A comparison with Rosetta AbInitio protocol shows that EdaFold is able to generate models with lower energies and to enhance the percentage of near-native coarse-grained decoys on a benchmark of [Formula: see text] proteins. The best coarse-grained models produced by both methods were refined into all-atom models and used in molecular replacement. All atom decoys produced out of EdaFold's decoy set reach high enough accuracy to solve the crystallographic phase problem by molecular replacement for some test proteins. EdaFold showed a higher success rate in molecular replacement when compared to Rosetta. Our study suggests that improving low resolution coarse-grained decoys allows computational methods to avoid subsequent sampling issues during all-atom refinement and to produce better all-atom models. EdaFold can be downloaded from http://www.riken.jp/zhangiru/software.html [corrected].

  2. Is a malleable protein necessarily highly dynamic?

    DEFF Research Database (Denmark)

    Kjærgaard, Magnus; Poulsen, Flemming Martin; Teilum, Kaare

    2012-01-01

    core of NCBD in the ligand-free state and in a well-folded complex with the ligand activator for thyroid hormone and retinoid receptors using multiple NMR methods including methyl chemical shifts, coupling constants, and methyl order parameters. From all NMR measures, the aliphatic side chains...... in the hydrophobic core are slightly more dynamic in the free protein than in the complex, but have mobility comparable to the hydrophobic cores of average folded proteins. Urea titration monitored by NMR reveals that all parts of the protein, including the side-chain packing in the hydrophobic core, denatures...

  3. Protein unfolding with a steric trap.

    Science.gov (United States)

    Blois, Tracy M; Hong, Heedeok; Kim, Tae H; Bowie, James U

    2009-10-07

    The study of protein folding requires a method to drive unfolding, which is typically accomplished by altering solution conditions to favor the denatured state. This has the undesirable consequence that the molecular forces responsible for configuring the polypeptide chain are also changed. It would therefore be useful to develop methods that can drive unfolding without the need for destabilizing solvent conditions. Here we introduce a new method to accomplish this goal, which we call steric trapping. In the steric trap method, the target protein is labeled with two biotin tags placed close in space so that both biotin tags can only be bound by streptavidin when the protein unfolds. Thus, binding of the second streptavidin is energetically coupled to unfolding of the target protein. Testing the method on a model protein, dihydrofolate reductase (DHFR), we find that streptavidin binding can drive unfolding and that the apparent binding affinity reports on changes in DHFR stability. Finally, by employing the slow off-rate of wild-type streptavidin, we find that DHFR can be locked in the unfolded state. The steric trap method provides a simple method for studying aspects of protein folding and stability in native solvent conditions, could be used to specifically unfold selected domains, and could be applicable to membrane proteins.

  4. Increased nitration and carbonylation of proteins in MRL +/+ mice exposed to trichloroethene: Potential role of protein oxidation in autoimmunity

    International Nuclear Information System (INIS)

    Wang Gangduo; Wang Jianling; Ma Huaxian; Khan, M. Firoze

    2009-01-01

    Even though reactive oxygen and nitrogen species (RONS) are implicated as mediators of autoimmune diseases (ADs), little is known about contribution of protein oxidation (carbonylation and nitration) in the pathogenesis of such diseases. The focus of this study was, therefore, to establish a link between protein oxidation and induction and/or exacerbation of autoimmunity. To achieve this, female MRL +/+ mice were treated with trichloroethene (TCE), an environmental contaminant known to induce autoimmune response, for 6 or 12 weeks (10 mmol/kg, i.p., every 4 th day). TCE treatment resulted in significantly increased formation of nitrotyrosine (NT) and induction of iNOS in the serum at both 6 and 12 weeks of treatment, but the response was greater at 12 weeks. Likewise, TCE treatment led to greater NT formation, and iNOS protein and mRNA expression in the livers and kidneys. Moreover, TCE treatment also caused significant increases (∼3 fold) in serum protein carbonyls (a marker of protein oxidation) at both 6 and 12 weeks. Significantly increased protein carbonyls were also observed in the livers and kidneys (2.1 and 1.3 fold, respectively) at 6 weeks, and to a greater extent at 12 weeks (3.5 and 2.1 fold, respectively) following TCE treatment. The increases in TCE-induced protein oxidation (carbonylation and nitration) were associated with significant increases in Th1 specific cytokine (IL-2, IFN-γ) release into splenocyte cultures. These results suggest an association between protein oxidation and induction/exacerbation of autoimmune response. The results present a potential mechanism by which oxidatively modified proteins could contribute to TCE-induced autoimmune response and necessitates further investigations for clearly establishing the role of protein oxidation in the pathogenesis of ADs.

  5. The IntFOLD server: an integrated web resource for protein fold recognition, 3D model quality assessment, intrinsic disorder prediction, domain prediction and ligand binding site prediction.

    Science.gov (United States)

    Roche, Daniel B; Buenavista, Maria T; Tetchner, Stuart J; McGuffin, Liam J

    2011-07-01

    The IntFOLD server is a novel independent server that integrates several cutting edge methods for the prediction of structure and function from sequence. Our guiding principles behind the server development were as follows: (i) to provide a simple unified resource that makes our prediction software accessible to all and (ii) to produce integrated output for predictions that can be easily interpreted. The output for predictions is presented as a simple table that summarizes all results graphically via plots and annotated 3D models. The raw machine readable data files for each set of predictions are also provided for developers, which comply with the Critical Assessment of Methods for Protein Structure Prediction (CASP) data standards. The server comprises an integrated suite of five novel methods: nFOLD4, for tertiary structure prediction; ModFOLD 3.0, for model quality assessment; DISOclust 2.0, for disorder prediction; DomFOLD 2.0 for domain prediction; and FunFOLD 1.0, for ligand binding site prediction. Predictions from the IntFOLD server were found to be competitive in several categories in the recent CASP9 experiment. The IntFOLD server is available at the following web site: http://www.reading.ac.uk/bioinf/IntFOLD/.

  6. Complex folding and misfolding effects of deer-specific amino acid substitutions in the β2-α2 loop of murine prion protein

    Science.gov (United States)

    Agarwal, Sonya; Döring, Kristina; Gierusz, Leszek A.; Iyer, Pooja; Lane, Fiona M.; Graham, James F.; Goldmann, Wilfred; Pinheiro, Teresa J. T.; Gill, Andrew C.

    2015-10-01

    The β2-α2 loop of PrPC is a key modulator of disease-associated prion protein misfolding. Amino acids that differentiate mouse (Ser169, Asn173) and deer (Asn169, Thr173) PrPC appear to confer dramatically different structural properties in this region and it has been suggested that amino acid sequences associated with structural rigidity of the loop also confer susceptibility to prion disease. Using mouse recombinant PrP, we show that mutating residue 173 from Asn to Thr alters protein stability and misfolding only subtly, whilst changing Ser to Asn at codon 169 causes instability in the protein, promotes oligomer formation and dramatically potentiates fibril formation. The doubly mutated protein exhibits more complex folding and misfolding behaviour than either single mutant, suggestive of differential effects of the β2-α2 loop sequence on both protein stability and on specific misfolding pathways. Molecular dynamics simulation of protein structure suggests a key role for the solvent accessibility of Tyr168 in promoting molecular interactions that may lead to prion protein misfolding. Thus, we conclude that ‘rigidity’ in the β2-α2 loop region of the normal conformer of PrP has less effect on misfolding than other sequence-related effects in this region.

  7. Dissecting Protein Configurational Entropy into Conformational and Vibrational Contributions.

    Science.gov (United States)

    Chong, Song-Ho; Ham, Sihyun

    2015-10-01

    Quantifying how the rugged nature of the underlying free-energy landscape determines the entropic cost a protein must incur upon folding and ligand binding is a challenging problem. Here, we present a novel computational approach that dissects the protein configurational entropy on the basis of the classification of protein dynamics on the landscape into two separate components: short-term vibrational dynamics related to individual free-energy wells and long-term conformational dynamics associated with transitions between wells. We apply this method to separate the configurational entropy of the protein villin headpiece subdomain into its conformational and vibrational components. We find that the change in configurational entropy upon folding is dominated by the conformational entropy despite the fact that the magnitude of the vibrational entropy is the significantly larger component in each of the folded and unfolded states, which is in accord with the previous empirical estimations. The straightforward applicability of our method to unfolded proteins promises a wide range of applications, including those related to intrinsically disordered proteins.

  8. Blind Test of Physics-Based Prediction of Protein Structures

    Science.gov (United States)

    Shell, M. Scott; Ozkan, S. Banu; Voelz, Vincent; Wu, Guohong Albert; Dill, Ken A.

    2009-01-01

    We report here a multiprotein blind test of a computer method to predict native protein structures based solely on an all-atom physics-based force field. We use the AMBER 96 potential function with an implicit (GB/SA) model of solvation, combined with replica-exchange molecular-dynamics simulations. Coarse conformational sampling is performed using the zipping and assembly method (ZAM), an approach that is designed to mimic the putative physical routes of protein folding. ZAM was applied to the folding of six proteins, from 76 to 112 monomers in length, in CASP7, a community-wide blind test of protein structure prediction. Because these predictions have about the same level of accuracy as typical bioinformatics methods, and do not utilize information from databases of known native structures, this work opens up the possibility of predicting the structures of membrane proteins, synthetic peptides, or other foldable polymers, for which there is little prior knowledge of native structures. This approach may also be useful for predicting physical protein folding routes, non-native conformations, and other physical properties from amino acid sequences. PMID:19186130

  9. Differential Precipitation and Solubilization of Proteins.

    Science.gov (United States)

    Ryan, Barry J; Kinsella, Gemma K

    2017-01-01

    Differential protein precipitation is a rapid and economical step in protein purification and is based on exploiting the inherent physicochemical properties of the polypeptide. Precipitation of recombinant proteins, lysed from the host cell, is commonly used to concentrate the protein of choice before further polishing steps with more selective purification columns (e.g., His-Tag, Size Exclusion, etc.). Recombinant proteins can also precipitate naturally as inclusion bodies due to various influences during overexpression in the host cell. Although this phenomenon permits easier initial separation from native proteins, these inclusion bodies must carefully be differentially solubilized so as to reform functional, correctly folded proteins. Here, appropriate bioinformatics tools to aid in understanding a protein's propensity to aggregate and solubilize are explored as a backdrop for a typical protein extraction, precipitation, and selective resolubilization procedure, based on a recombinantly expressed protein.

  10. Golf-course and funnel energy landscapes: Protein folding concepts in martensites

    Science.gov (United States)

    Shankaraiah, N.

    2017-06-01

    We use protein folding energy landscape concepts such as golf course and funnel to study re-equilibration in athermal martensites under systematic temperature quench Monte Carlo simulations. On quenching below a transition temperature, the seeded high-symmetry parent-phase austenite that converts to the low-symmetry product-phase martensite, through autocatalytic twinning or elastic photocopying, has both rapid conversions and incubation delays in the temperature-time-transformation phase diagram. We find the rapid (incubation delays) conversions at low (high) temperatures arises from the presence of large (small) size of golf-course edge that has the funnel inside for negative energy states. In the incubating state, the strain structure factor enters into the Brillouin-zone golf course through searches for finite transitional pathways which close off at the transition temperature with Vogel-Fulcher divergences that are insensitive to Hamiltonian energy scales and log-normal distributions, as signatures of dominant entropy barriers. The crossing of the entropy barrier is identified through energy occupancy distributions, Monte Carlo acceptance fractions, heat emission, and internal work.

  11. [In vitro renaturation of proteins from inclusion bodies].

    Science.gov (United States)

    Porowińska, Dorota; Marszałek, Ewelina; Wardęcka, Paulina; Komoszyński, Michał

    2012-06-11

    Recombinant proteins and enzymes are commonly used in many areas of our life, such as diagnostics, industry and medicine, due to heterologous synthesis in prokaryotic expression systems. However, a high expression level of foreign protein in bacteria cells results in formation of inactive and insoluble aggregates--inclusion bodies. Reactivation of aggregated proteins is a complex and time-consuming process. Every protein requires experimental optimization of the process conditions. The choice of the refolding method depends on the type of recombinant protein and its physical, chemical and biological properties. Recovery of the activity of proteins accumulated in inclusion bodies can be divided into 4 steps: 1) inclusion bodies isolation, 2) solubilization of aggregates, 3) renaturation, 4) purification of catalytically active molecules. Efficiency of the refolding process depends on many physical factors and chemical and biological agents. The above parameters determine the time of the folding and prevent protein aggregation. They also assist the folding and have an influence on the solubility and stability of native molecules. To date, dilution, dialysis and chromatography are the most often used methods for protein refolding.

  12. Benchmarking protein classification algorithms via supervised cross-validation

    NARCIS (Netherlands)

    Kertész-Farkas, A.; Dhir, S.; Sonego, P.; Pacurar, M.; Netoteia, S.; Nijveen, H.; Kuzniar, A.; Leunissen, J.A.M.; Kocsor, A.; Pongor, S.

    2008-01-01

    Development and testing of protein classification algorithms are hampered by the fact that the protein universe is characterized by groups vastly different in the number of members, in average protein size, similarity within group, etc. Datasets based on traditional cross-validation (k-fold,

  13. Beta-structures in fibrous proteins.

    Science.gov (United States)

    Kajava, Andrey V; Squire, John M; Parry, David A D

    2006-01-01

    The beta-form of protein folding, one of the earliest protein structures to be defined, was originally observed in studies of silks. It was then seen in early studies of synthetic polypeptides and, of course, is now known to be present in a variety of guises as an essential component of globular protein structures. However, in the last decade or so it has become clear that the beta-conformation of chains is present not only in many of the amyloid structures associated with, for example, Alzheimer's Disease, but also in the prion structures associated with the spongiform encephalopathies. Furthermore, X-ray crystallography studies have revealed the high incidence of the beta-fibrous proteins among virulence factors of pathogenic bacteria and viruses. Here we describe the basic forms of the beta-fold, summarize the many different new forms of beta-structural fibrous arrangements that have been discovered, and review advances in structural studies of amyloid and prion fibrils. These and other issues are described in detail in later chapters.

  14. Shortening a loop can increase protein native state entropy.

    Science.gov (United States)

    Gavrilov, Yulian; Dagan, Shlomi; Levy, Yaakov

    2015-12-01

    Protein loops are essential structural elements that influence not only function but also protein stability and folding rates. It was recently reported that shortening a loop in the AcP protein may increase its native state conformational entropy. This effect on the entropy of the folded state can be much larger than the lower entropic penalty of ordering a shorter loop upon folding, and can therefore result in a more pronounced stabilization than predicted by polymer model for loop closure entropy. In this study, which aims at generalizing the effect of loop length shortening on native state dynamics, we use all-atom molecular dynamics simulations to study how gradual shortening a very long or solvent-exposed loop region in four different proteins can affect their stability. For two proteins, AcP and Ubc7, we show an increase in native state entropy in addition to the known effect of the loop length on the unfolded state entropy. However, for two permutants of SH3 domain, shortening a loop results only with the expected change in the entropy of the unfolded state, which nicely reproduces the observed experimental stabilization. Here, we show that an increase in the native state entropy following loop shortening is not unique to the AcP protein, yet nor is it a general rule that applies to all proteins following the truncation of any loop. This modification of the loop length on the folded state and on the unfolded state may result with a greater effect on protein stability. © 2015 Wiley Periodicals, Inc.

  15. Cytosolic protein quality control of the orphan protein Fas2, a novel physiological substrate of the E3 ligase Ubr1

    OpenAIRE

    Scazzari, Mario

    2013-01-01

    Cellular protein quality control (PQC) monitors the proper folding of polypeptides, assembly of protein subunits into protein complexes as well as the delivery of terminally misfolded proteins to degradation. The components of PQC known best at the moment are molecular chaperones and the ubiquitin proteasome system. In contrast to the well-described protein quality control system of the endoplasmic reticulum (ERAD), less is known about how misfolded proteins in the cytosol are recognized and ...

  16. NMR Studies of Protein Hydration and Protein-Ligand Interactions

    Science.gov (United States)

    Chong, Yuan

    Water on the surface of a protein is called hydration water. Hydration water is known to play a crucial role in a variety of biological processes including protein folding, enzymatic activation, and drug binding. Although the significance of hydration water has been recognized, the underlying mechanism remains far from being understood. This dissertation employs a unique in-situ nuclear magnetic resonance (NMR) technique to study the mechanism of protein hydration and the role of hydration in alcohol-protein interactions. Water isotherms in proteins are measured at different temperatures via the in-situ NMR technique. Water is found to interact differently with hydrophilic and hydrophobic groups on the protein. Water adsorption on hydrophilic groups is hardly affected by the temperature, while water adsorption on hydrophobic groups strongly depends on the temperature around 10 C, below which the adsorption is substantially reduced. This effect is induced by the dramatic decrease in the protein flexibility below 10 C. Furthermore, nanosecond to microsecond protein dynamics and the free energy, enthalpy, and entropy of protein hydration are studied as a function of hydration level and temperature. A crossover at 10 C in protein dynamics and thermodynamics is revealed. The effect of water at hydrophilic groups on protein dynamics and thermodynamics shows little temperature dependence, whereas water at hydrophobic groups has stronger effect above 10 C. In addition, I investigate the role of water in alcohol binding to the protein using the in-situ NMR detection. The isotherms of alcohols are first measured on dry proteins, then on proteins with a series of controlled hydration levels. The free energy, enthalpy, and entropy of alcohol binding are also determined. Two distinct types of alcohol binding are identified. On the one hand, alcohols can directly bind to a few specific sites on the protein. This type of binding is independent of temperature and can be

  17. Coarsely resolved topography along protein folding pathways

    Science.gov (United States)

    Fernández, Ariel; Kostov, Konstantin S.; Berry, R. Stephen

    2000-03-01

    The kinetic data from the coarse representation of polypeptide torsional dynamics described in the preceding paper [Fernandez and Berry, J. Chem. Phys. 112, 5212 (2000), preceding paper] is inverted by using detailed balance to obtain a topographic description of the potential-energy surface (PES) along the dominant folding pathway of the bovine pancreatic trypsin inhibitor (BPTI). The topography is represented as a sequence of minima and effective saddle points. The dominant folding pathway displays an overall monotonic decrease in energy with a large number of staircaselike steps, a clear signature of a good structure-seeker. The diversity and availability of alternative folding pathways is analyzed in terms of the Shannon entropy σ(t) associated with the time-dependent probability distribution over the kinetic ensemble of contact patterns. Several stages in the folding process are evident. Initially misfolded states form and dismantle revealing no definite pattern in the topography and exhibiting high Shannon entropy. Passage down a sequence of staircase steps then leads to the formation of a nativelike intermediate, for which σ(t) is much lower and fairly constant. Finally, the structure of the intermediate is refined to produce the native state of BPTI. We also examine how different levels of tolerance to mismatches of side chain contacts influence the folding kinetics, the topography of the dominant folding pathway, and the Shannon entropy. This analysis yields upper and lower bounds of the frustration tolerance required for the expeditious and robust folding of BPTI.

  18. Protein-protein interactions within late pre-40S ribosomes.

    Directory of Open Access Journals (Sweden)

    Melody G Campbell

    2011-01-01

    Full Text Available Ribosome assembly in eukaryotic organisms requires more than 200 assembly factors to facilitate and coordinate rRNA transcription, processing, and folding with the binding of the ribosomal proteins. Many of these assembly factors bind and dissociate at defined times giving rise to discrete assembly intermediates, some of which have been partially characterized with regards to their protein and RNA composition. Here, we have analyzed the protein-protein interactions between the seven assembly factors bound to late cytoplasmic pre-40S ribosomes using recombinant proteins in binding assays. Our data show that these factors form two modules: one comprising Enp1 and the export adaptor Ltv1 near the beak structure, and the second comprising the kinase Rio2, the nuclease Nob1, and a regulatory RNA binding protein Dim2/Pno1 on the front of the head. The GTPase-like Tsr1 and the universally conserved methylase Dim1 are also peripherally connected to this second module. Additionally, in an effort to further define the locations for these essential proteins, we have analyzed the interactions between these assembly factors and six ribosomal proteins: Rps0, Rps3, Rps5, Rps14, Rps15 and Rps29. Together, these results and previous RNA-protein crosslinking data allow us to propose a model for the binding sites of these seven assembly factors. Furthermore, our data show that the essential kinase Rio2 is located at the center of the pre-ribosomal particle and interacts, directly or indirectly, with every other assembly factor, as well as three ribosomal proteins required for cytoplasmic 40S maturation. These data suggest that Rio2 could play a central role in regulating cytoplasmic maturation steps.

  19. On the Trails of the Proteasome Fold: Structural and Functional Analysis of the Ancestral β-Subunit Protein Anbu.

    Science.gov (United States)

    Vielberg, Marie-Theres; Bauer, Verena C; Groll, Michael

    2018-03-02

    The 20S proteasome is a key player in eukaryotic and archaeal protein degradation, but its progenitor in eubacteria is unknown. Recently, the ancestral β-subunit protein (Anbu) was predicted to be the evolutionary precursor of the proteasome. We crystallized Anbu from Hyphomicrobium sp. strain MC1 in four different space groups and solved the structures by SAD-phasing and Patterson search calculation techniques. Our data reveal that Anbu adopts the classical fold of Ntn-hydrolases, but its oligomeric state differs from that of barrel-shaped proteases. In contrast to their typical architecture, the Anbu protomer is a tightly interacting dimer that can assemble into a helical superstructure. Although Anbu features a catalytic triad of Thr1O γ , Asp17O δ1 and Lys32N ε , it is unable to hydrolyze standard protease substrates. The lack of activity might be caused by the incapacity of Thr1NH 2 to function as a Brønsted acid during substrate cleavage due to its missing activation via hydrogen bonding. Altogether, we demonstrate that the topology of the proteasomal fold is conserved in Anbu, but whether it acts as a protease still needs to be clarified. Copyright © 2018 Elsevier Ltd. All rights reserved.

  20. The Leptospiral Antigen Lp49 is a Two-Domain Protein with Putative Protein Binding Function

    Energy Technology Data Exchange (ETDEWEB)

    Oliveira Giuseppe,P.; Oliveira Neves, F.; Nascimento, A.; Gomes Guimaraes, B.

    2008-01-01

    Pathogenic Leptospira is the etiological agent of leptospirosis, a life-threatening disease that affects populations worldwide. Currently available vaccines have limited effectiveness and therapeutic interventions are complicated by the difficulty in making an early diagnosis of leptospirosis. The genome of Leptospira interrogans was recently sequenced and comparative genomic analysis contributed to the identification of surface antigens, potential candidates for development of new vaccines and serodiagnosis. Lp49 is a membrane-associated protein recognized by antibodies present in sera from early and convalescent phases of leptospirosis patients. Its crystal structure was determined by single-wavelength anomalous diffraction using selenomethionine-labelled crystals and refined at 2.0 Angstroms resolution. Lp49 is composed of two domains and belongs to the all-beta-proteins class. The N-terminal domain folds in an immunoglobulin-like beta-sandwich structure, whereas the C-terminal domain presents a seven-bladed beta-propeller fold. Structural analysis of Lp49 indicates putative protein-protein binding sites, suggesting a role in Leptospira-host interaction. This is the first crystal structure of a leptospiral antigen described to date.

  1. The Inner Membrane Complex Sub-compartment Proteins Critical for Replication of the Apicomplexan Parasite Toxoplasma gondii Adopt a Pleckstrin Homology Fold*

    Science.gov (United States)

    Tonkin, Michelle L.; Beck, Josh R.; Bradley, Peter J.; Boulanger, Martin J.

    2014-01-01

    Toxoplasma gondii, an apicomplexan parasite prevalent in developed nations, infects up to one-third of the human population. The success of this parasite depends on several unique structures including an inner membrane complex (IMC) that lines the interior of the plasma membrane and contains proteins important for gliding motility and replication. Of these proteins, the IMC sub-compartment proteins (ISPs) have recently been shown to play a role in asexual T. gondii daughter cell formation, yet the mechanism is unknown. Complicating mechanistic characterization of the ISPs is a lack of sequence identity with proteins of known structure or function. In support of elucidating the function of ISPs, we first determined the crystal structures of representative members TgISP1 and TgISP3 to a resolution of 2.10 and 2.32 Å, respectively. Structural analysis revealed that both ISPs adopt a pleckstrin homology fold often associated with phospholipid binding or protein-protein interactions. Substitution of basic for hydrophobic residues in the region that overlays with phospholipid binding in related pleckstrin homology domains, however, suggests that ISPs do not retain phospholipid binding activity. Consistent with this observation, biochemical assays revealed no phospholipid binding activity. Interestingly, mapping of conserved surface residues combined with crystal packing analysis indicates that TgISPs have functionally repurposed the phospholipid-binding site likely to coordinate protein partners. Recruitment of larger protein complexes may also be aided through avidity-enhanced interactions resulting from multimerization of the ISPs. Overall, we propose a model where TgISPs recruit protein partners to the IMC to ensure correct progression of daughter cell formation. PMID:24675080

  2. Hidden Structural Codes in Protein Intrinsic Disorder.

    Science.gov (United States)

    Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo

    2017-10-17

    Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovers the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could have not been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.

  3. Aging Is Accompanied by a Blunted Muscle Protein Synthetic Response to Protein Ingestion.

    Directory of Open Access Journals (Sweden)

    Benjamin Toby Wall

    Full Text Available Progressive loss of skeletal muscle mass with aging (sarcopenia forms a global health concern. It has been suggested that an impaired capacity to increase muscle protein synthesis rates in response to protein intake is a key contributor to sarcopenia. We assessed whether differences in post-absorptive and/or post-prandial muscle protein synthesis rates exist between large cohorts of healthy young and older men.We performed a cross-sectional, retrospective study comparing in vivo post-absorptive muscle protein synthesis rates determined with stable isotope methodologies between 34 healthy young (22±1 y and 72 older (75±1 y men, and post-prandial muscle protein synthesis rates between 35 healthy young (22±1 y and 40 older (74±1 y men.Post-absorptive muscle protein synthesis rates did not differ significantly between the young and older group. Post-prandial muscle protein synthesis rates were 16% lower in the older subjects when compared with the young. Muscle protein synthesis rates were >3 fold more responsive to dietary protein ingestion in the young. Irrespective of age, there was a strong negative correlation between post-absorptive muscle protein synthesis rates and the increase in muscle protein synthesis rate following protein ingestion.Aging is associated with the development of muscle anabolic inflexibility which represents a key physiological mechanism underpinning sarcopenia.

  4. Comparative sensitivity of 125I-protein A and enzyme-conjugated antibodies for detection of immunoblotted proteins

    International Nuclear Information System (INIS)

    Bernstein, J.M.; Stokes, C.E.; Fernie, B.

    1987-01-01

    Immunoblotting is a powerful technique for the detection of small amounts of immunologically interesting proteins in unpurified preparations. Iodinated protein A (PA) has been widely used as a second antibody for detection of proteins; however, it does not bind equally well to immunoglobulins from different species nor does it bind to all subclasses of immunoglobulin G (IgG). We compared the sensitivity of [ 125 I]PA with those of both horseradish peroxidase-conjugated second antibodies (HRP) and glucose oxidase-anti-glucose oxidase (GAG) soluble complexes for visualizing bovine serum albumin, human IgG, or human C3 which was either dot blotted or electroblotted to nitrocellulose. [ 125 I]PA was uniformly 10- to 100-fold less sensitive than either HRP or GAG. GAG was more sensitive than HRP except for C3 (electroblotting) and bovine serum albumin and IgG (dot blotting), in which they were equivalent. In general, dot blotting was 10- to 1000-fold more sensitive than electroblotting. Although relative sensitivities varied depending on the proteins analyzed and the antisera used, GAG appeared to be superior to [ 125 I]PA and HRP for detection of immunoblotted proteins

  5. A collaborative visual analytics suite for protein folding research.

    Science.gov (United States)

    Harvey, William; Park, In-Hee; Rübel, Oliver; Pascucci, Valerio; Bremer, Peer-Timo; Li, Chenglong; Wang, Yusu

    2014-09-01

    Molecular dynamics (MD) simulation is a crucial tool for understanding principles behind important biochemical processes such as protein folding and molecular interaction. With the rapidly increasing power of modern computers, large-scale MD simulation experiments can be performed regularly, generating huge amounts of MD data. An important question is how to analyze and interpret such massive and complex data. One of the (many) challenges involved in analyzing MD simulation data computationally is the high-dimensionality of such data. Given a massive collection of molecular conformations, researchers typically need to rely on their expertise and prior domain knowledge in order to retrieve certain conformations of interest. It is not easy to make and test hypotheses as the data set as a whole is somewhat "invisible" due to its high dimensionality. In other words, it is hard to directly access and examine individual conformations from a sea of molecular structures, and to further explore the entire data set. There is also no easy and convenient way to obtain a global view of the data or its various modalities of biochemical information. To this end, we present an interactive, collaborative visual analytics tool for exploring massive, high-dimensional molecular dynamics simulation data sets. The most important utility of our tool is to provide a platform where researchers can easily and effectively navigate through the otherwise "invisible" simulation data sets, exploring and examining molecular conformations both as a whole and at individual levels. The visualization is based on the concept of a topological landscape, which is a 2D terrain metaphor preserving certain topological and geometric properties of the high dimensional protein energy landscape. In addition to facilitating easy exploration of conformations, this 2D terrain metaphor also provides a platform where researchers can visualize and analyze various properties (such as contact density) overlayed on the

  6. Quality control mechanisms of protein biogenesis: proteostasis dies hard

    Directory of Open Access Journals (Sweden)

    Timothy Jan Bergmann

    2016-10-01

    Full Text Available The biosynthesis of proteins entails a complex series of chemical reactions that transform the information stored in the nucleic acid sequence into a polypeptide chain that needs to properly fold and reach its functional location in or outside the cell. It is of no surprise that errors might occur that alter the polypeptide sequence leading to a non-functional proteins or that impede delivery of proteins at the appropriate site of activity. In order to minimize such mistakes and guarantee the synthesis of the correct amount and quality of the proteome, cells have developed folding, quality control, degradation and transport mechanisms that ensure and tightly regulate protein biogenesis. Genetic mutations, harsh environmental conditions or attack by pathogens can subvert the cellular quality control machineries and perturb cellular proteostasis leading to pathological conditions. This review summarizes basic concepts of the flow of information from DNA to folded and active proteins and to the variable fidelity (from incredibly high to quite sloppy characterizing these processes. We will give particular emphasis on events that maintain or recover the homeostasis of the endoplasmic reticulum (ER, a major site of proteins synthesis and folding in eukaryotic cells. Finally, we will report on how cells can adapt to stressful conditions, how perturbation of ER homeostasis may result in diseases and how these can be treated.

  7. Chemical shift homology in proteins

    International Nuclear Information System (INIS)

    Potts, Barbara C.M.; Chazin, Walter J.

    1998-01-01

    The degree of chemical shift similarity for homologous proteins has been determined from a chemical shift database of over 50 proteins representing a variety of families and folds, and spanning a wide range of sequence homologies. After sequence alignment, the similarity of the secondary chemical shifts of C α protons was examined as a function of amino acid sequence identity for 37 pairs of structurally homologous proteins. A correlation between sequence identity and secondary chemical shift rmsd was observed. Important insights are provided by examining the sequence identity of homologous proteins versus percentage of secondary chemical shifts that fall within 0.1 and 0.3 ppm thresholds. These results begin to establish practical guidelines for the extent of chemical shift similarity to expect among structurally homologous proteins

  8. Identification of membrane proteins by tandem mass spectrometry of protein ions

    Science.gov (United States)

    Carroll, Joe; Altman, Matthew C.; Fearnley, Ian M.; Walker, John E.

    2007-01-01

    The most common way of identifying proteins in proteomic analyses is to use short segments of sequence (“tags”) determined by mass spectrometric analysis of proteolytic fragments. The approach is effective with globular proteins and with membrane proteins with significant polar segments between membrane-spanning α-helices, but it is ineffective with other hydrophobic proteins where protease cleavage sites are either infrequent or absent. By developing methods to purify hydrophobic proteins in organic solvents and by fragmenting ions of these proteins by collision induced dissociation with argon, we have shown that partial sequences of many membrane proteins can be deduced easily by manual inspection. The spectra from small proteolipids (1–4 transmembrane α-helices) are dominated usually by fragment ions arising from internal amide cleavages, from which internal sequences can be obtained, whereas the spectra from larger membrane proteins (5–18 transmembrane α-helices) often contain fragment ions from N- and/or C-terminal parts yielding sequences in those regions. With these techniques, we have, for example, identified an abundant protein of unknown function from inner membranes of mitochondria that to our knowledge has escaped detection in proteomic studies, and we have produced sequences from 10 of 13 proteins encoded in mitochondrial DNA. They include the ND6 subunit of complex I, the last of its 45 subunits to be analyzed. The procedures have the potential to be developed further, for example by using newly introduced methods for protein ion dissociation to induce fragmentation of internal regions of large membrane proteins, which may remain partially folded in the gas phase. PMID:17720804

  9. Heavy metals and metalloids as a cause for protein misfolding and aggregation.

    Science.gov (United States)

    Tamás, Markus J; Sharma, Sandeep K; Ibstedt, Sebastian; Jacobson, Therese; Christen, Philipp

    2014-02-25

    While the toxicity of metals and metalloids, like arsenic, cadmium, mercury, lead and chromium, is undisputed, the underlying molecular mechanisms are not entirely clear. General consensus holds that proteins are the prime targets; heavy metals interfere with the physiological activity of specific, particularly susceptible proteins, either by forming a complex with functional side chain groups or by displacing essential metal ions in metalloproteins. Recent studies have revealed an additional mode of metal action targeted at proteins in a non-native state; certain heavy metals and metalloids have been found to inhibit the in vitro refolding of chemically denatured proteins, to interfere with protein folding in vivo and to cause aggregation of nascent proteins in living cells. Apparently, unfolded proteins with motile backbone and side chains are considerably more prone to engage in stable, pluridentate metal complexes than native proteins with their well-defined 3D structure. By interfering with the folding process, heavy metal ions and metalloids profoundly affect protein homeostasis and cell viability. This review describes how heavy metals impede protein folding and promote protein aggregation, how cells regulate quality control systems to protect themselves from metal toxicity and how metals might contribute to protein misfolding disorders.

  10. MOLECULAR DOCKING AND DYNAMICS STUDIES ON THE PROTEIN-PROTEIN INTERACTIONS OF ELECTRICALLY ACTIVE PILIN NANOWIRES OF GEOBACTER SULFURREDUCENS.

    Directory of Open Access Journals (Sweden)

    D. Jeya Sundara Sharmila1 *

    2017-06-01

    Full Text Available Molecular interactions are key aspects in biological recognitions applicable in nano/micro systems. Bacterial nanowires are pilus filament based structures that can conduct electrons. The transport of electron is proposed to be facilitated by filamentous fibers made up of polymeric assemblies of proteins called pilin. Geobacter sulfurreducens is capable of delivering electrons through extracellular electron transport (EET by employing conductive nanowires, which are pilin proteins composed of type IV subunit PilA. Protein-protein interactions play an important role in the stabilization of the pilin nanowire assembly complex and it contains transmembrane (TM domain. In current study, protein-protein docking and multiple molecular dynamic (MD simulations were performed to understand the binding mode of pilin nanowires. The MD result explains the conformational behavior and folding of pilin nanowires in water environment in different time scale duration 20, 5, 5, 10 and 20ns (total of 60ns. Direct hydrogen bonds and water mediated hydrogen bonds that play a crucial role during the simulation were investigated. The conformational state, folding, end-toend distance profile and hydrogen bonding behavior had indicated that the Geobacter sulfurreducens pilin nanowires have electrical conductivity properties.

  11. Why are proteins with glutamine- and asparagine-rich regions associated with protein misfolding diseases?

    Energy Technology Data Exchange (ETDEWEB)

    Cruzeiro, Leonor [CCMAR and FCT, University of Algarve, Campus de Gambelas, 8000 Faro (Portugal)

    2005-12-21

    The possibility that vibrational excited states (VESs) are the drivers of protein folding and function (the VES hypothesis) is explored to explain the reason why Gln- and Asn-rich proteins are associated with degenerative diseases. The Davydov/Scott model is extended to describe energy transfer from the water solution to the protein and vice versa. Computer simulations show that, on average, Gln and Asn residues lead to an initial larger absorption of energy from the environment to the protein, something that can explain the greater structural instability of prions. The sporadic, inherited and infectious character of prion diseases is discussed in the light of the VES hypothesis. An alternative treatment for prion diseases is suggested.

  12. Analysis of residuals from enzyme kinetic and protein folding experiments in the presence of correlated experimental noise.

    Science.gov (United States)

    Kuzmic, Petr; Lorenz, Thorsten; Reinstein, Jochen

    2009-12-01

    Experimental data from continuous enzyme assays or protein folding experiments often contain hundreds, or even thousands, of densely spaced data points. When the sampling interval is extremely short, the experimental data points might not be statistically independent. The resulting neighborhood correlation invalidates important theoretical assumptions of nonlinear regression analysis. As a consequence, certain goodness-of-fit criteria, such as the runs-of-signs test and the autocorrelation function, might indicate a systematic lack of fit even if the experiment does agree very well with the underlying theoretical model. A solution to this problem is to analyze only a subset of the residuals of fit, such that any excessive neighborhood correlation is eliminated. Substrate kinetics of the HIV protease and the unfolding kinetics of UMP/CMP kinase, a globular protein from Dictyostelium discoideum, serve as two illustrative examples. A suitable data-reduction algorithm has been incorporated into software DYNAFIT [P. Kuzmic, Anal. Biochem. 237 (1996) 260-273], freely available to all academic researchers from http://www.biokin.com.

  13. Chaperoning Proteins for Destruction: Diverse Roles of Hsp70 Chaperones and their Co-Chaperones in Targeting Misfolded Proteins to the Proteasome

    Directory of Open Access Journals (Sweden)

    Ayala Shiber

    2014-07-01

    Full Text Available Molecular chaperones were originally discovered as heat shock-induced proteins that facilitate proper folding of proteins with non-native conformations. While the function of chaperones in protein folding has been well documented over the last four decades, more recent studies have shown that chaperones are also necessary for the clearance of terminally misfolded proteins by the Ub-proteasome system. In this capacity, chaperones protect misfolded degradation substrates from spontaneous aggregation, facilitate their recognition by the Ub ligation machinery and finally shuttle the ubiquitylated substrates to the proteasome. The physiological importance of these functions is manifested by inefficient proteasomal degradation and the accumulation of protein aggregates during ageing or in certain neurodegenerative diseases, when chaperone levels decline. In this review, we focus on the diverse roles of stress-induced chaperones in targeting misfolded proteins to the proteasome and the consequences of their compromised activity. We further discuss the implications of these findings to the identification of new therapeutic targets for the treatment of amyloid diseases.

  14. Dry molten globule intermediates and the mechanism of protein unfolding.

    Science.gov (United States)

    Baldwin, Robert L; Frieden, Carl; Rose, George D

    2010-10-01

    New experimental results show that either gain or loss of close packing can be observed as a discrete step in protein folding or unfolding reactions. This finding poses a significant challenge to the conventional two-state model of protein folding. Results of interest involve dry molten globule (DMG) intermediates, an expanded form of the protein that lacks appreciable solvent. When an unfolding protein expands to the DMG state, side chains unlock and gain conformational entropy, while liquid-like van der Waals interactions persist. Four unrelated proteins are now known to form DMGs as the first step of unfolding, suggesting that such an intermediate may well be commonplace in both folding and unfolding. Data from the literature show that peptide amide protons are protected in the DMG, indicating that backbone structure is intact despite loss of side-chain close packing. Other complementary evidence shows that secondary structure formation provides a major source of compaction during folding. In our model, the major free-energy barrier separating unfolded from native states usually occurs during the transition between the unfolded state and the DMG. The absence of close packing at this barrier provides an explanation for why phi-values, derived from a Brønsted-Leffler plot, depend primarily on structure at the mutational site and not on specific side-chain interactions. The conventional two-state folding model breaks down when there are DMG intermediates, a realization that has major implications for future experimental work on the mechanism of protein folding. 2010 Wiley-Liss, Inc.

  15. Advanced path sampling of the kinetic network of small proteins

    NARCIS (Netherlands)

    Du, W.

    2014-01-01

    This thesis is focused on developing advanced path sampling simulation methods to study protein folding and unfolding, and to build kinetic equilibrium networks describing these processes. In Chapter 1 the basic knowledge of protein structure and folding theories were introduced and a brief overview

  16. Expansion of the octarepeat domain alters the misfolding pathway but not the folding pathway of the prion protein.

    Science.gov (United States)

    Leliveld, S Rutger; Stitz, Lothar; Korth, Carsten

    2008-06-10

    A misfolded conformation of the prion protein (PrP), PrP (Sc), is the essential component of prions, the infectious agents that cause transmissible neurodegenerative diseases. Insertional mutations that lead to an increase in the number of octarepeats (ORs) in PrP are linked to familial human prion disease. In this study, we investigated how expansion of the OR domain causes PrP to favor a prion-like conformation. Therefore, we compared the conformational and aggregation modulating properties of wild-type versus expanded OR domains, either as a fusion construct with the protein G B1 domain (GB1-OR) or as an integral part of full-length mouse PrP (MoPrP). Using circular dichroism spectroscopy, we first demonstrated that ORs are not unfolded but exist as an ensemble of three distinct conformers: polyproline helix-like, beta-turn, and "Trp-related". Domain expansion had little effect on the conformation of GB1-OR fusion proteins. When part of MoPrP however, OR domain expansion changed PrP's folding landscape, not by hampering the production of native alpha-helical monomers but by greatly reducing the propensity to form amyloid and by altering the assembly of misfolded, beta-rich aggregates. These features may relate to subtle pH-dependent conformational differences between wild-type and mutant monomers. In conclusion, we propose that PrP insertional mutations are pathogenic because they enhance specific misfolding pathways of PrP rather than by undermining native folding. This idea was supported by a trial bioassay in transgenic mice overexpressing wild-type MoPrP, where intracerebral injection of recombinant MoPrP with an expanded OR domain but not wild-type MoPrP caused prion disease.

  17. Evidence of non-coincidence of normalized sigmoidal curves of two different structural properties for two-state protein folding/unfolding

    International Nuclear Information System (INIS)

    Rahaman, Hamidur; Khan, Md. Khurshid Alam; Hassan, Md. Imtaiyaz; Islam, Asimul; Moosavi-Movahedi, Ali Akbar; Ahmad, Faizan

    2013-01-01

    Highlights: ► Non-coincidence of normalized sigmoidal curves of two different structural properties is consistence with the two-state protein folding/unfolding. ► DSC measurements of denaturation show a two-state behavior of g-cyt-c at pH 6.0. ► Urea-induced denaturation of g-cyt-c is a variable two- state process at pH 6.0. ► GdmCl-induced denaturation of g-cyt-c is a fixed two- state process at pH 6.0. -- Abstract: In practice, the observation of non-coincidence of normalized sigmoidal transition curves measured by two different structural properties constitutes a proof of existence of thermodynamically stable intermediate(s) on the folding ↔ unfolding pathway of a protein. Here we give first experimental evidence that this non-coincidence is also observed for a two-state protein denaturation. Proof of this evidence comes from our studies of denaturation of goat cytochrome-c (g-cyt-c) at pH 6.0. These studies involve differential scanning calorimetry (DSC) measurements in the absence of urea and measurements of urea-induced denaturation curves monitored by observing changes in absorbance at 405, 530, and 695 nm and circular dichroism (CD) at 222, 405, and 416 nm. DSC measurements showed that denaturation of the protein is a two-state process, for calorimetric and van’t Hoff enthalpy changes are, within experimental errors, identical. Normalization of urea-induced denaturation curves monitored by optical properties leads to noncoincident sigmoidal curves. Heat-induced transition of g-cyt-c in the presence of different urea concentrations was monitored by CD at 222 nm and absorption at 405 nm. It was observed that these two different structural probes gave not only identical values of T m (transition temperature), ΔH m (change in enthalpy at T m ) and ΔC p (constant-pressure heat capacity change), but these thermodynamic parameters in the absence of urea are also in agreement with those obtained from DSC measurements

  18. Method-Unifying View of Loop-Formation Kinetics in Peptide and Protein Folding.

    Science.gov (United States)

    Jacob, Maik H; D'Souza, Roy N; Schwarzlose, Thomas; Wang, Xiaojuan; Huang, Fang; Haas, Elisha; Nau, Werner M

    2018-04-26

    Protein folding can be described as a probabilistic succession of events in which the peptide chain forms loops closed by specific amino acid residue contacts, herein referred to as loop nodes. To measure loop rates, several photophysical methods have been introduced where a pair of optically active probes is incorporated at selected chain positions and the excited probe undergoes contact quenching (CQ) upon collision with the second probe. The quenching mechanisms involved triplet-triplet energy transfer, photoinduced electron transfer, and collision-induced fluorescence quenching, where the fluorescence of Dbo, an asparagine residue conjugated to 2,3-diazabicyclo[2.2.2]octane, is quenched by tryptophan. The discrepancy between the loop rates afforded from these three CQ techniques has, however, remained unresolved. In analyzing this discrepancy, we now report two short-distance FRET methods where Dbo acts as an energy acceptor in combination with tryptophan and naphtylalanine, two donors with largely different fluorescence lifetimes of 1.3 and 33 ns, respectively. Despite the different quenching mechanisms, the rates from FRET and CQ methods were, surprisingly, of comparable magnitude. This combination of FRET and CQ data led to a unifying physical model and to the conclusion that the rate of loop formation in folding reactions varies not only with the kind and number of residues that constitute the chain but also in particular with the size and properties of the residues that constitute the loop node.

  19. Optimization of translation profiles enhances protein expression and solubility.

    Directory of Open Access Journals (Sweden)

    Anne-Katrin Hess

    Full Text Available mRNA is translated with a non-uniform speed that actively coordinates co-translational folding of protein domains. Using structure-based homology we identified the structural domains in epoxide hydrolases (EHs and introduced slow-translating codons to delineate the translation of single domains. These changes in translation speed dramatically improved the solubility of two EHs of metagenomic origin in Escherichia coli. Conversely, the importance of transient attenuation for the folding, and consequently solubility, of EH was evidenced with a member of the EH family from Agrobacterium radiobacter, which partitions in the soluble fraction when expressed in E. coli. Synonymous substitutions of codons shaping the slow-transiting regions to fast-translating codons render this protein insoluble. Furthermore, we show that low protein yield can be enhanced by decreasing the free folding energy of the initial 5'-coding region, which can disrupt mRNA secondary structure and enhance ribosomal loading. This study provides direct experimental evidence that mRNA is not a mere messenger for translation of codons into amino acids but bears an additional layer of information for folding, solubility and expression level of the encoded protein. Furthermore, it provides a general frame on how to modulate and fine-tune gene expression of a target protein.

  20. Protein-protein interaction inference based on semantic similarity of Gene Ontology terms.

    Science.gov (United States)

    Zhang, Shu-Bo; Tang, Qiang-Rong

    2016-07-21

    Identifying protein-protein interactions is important in molecular biology. Experimental methods to this issue have their limitations, and computational approaches have attracted more and more attentions from the biological community. The semantic similarity derived from the Gene Ontology (GO) annotation has been regarded as one of the most powerful indicators for protein interaction. However, conventional methods based on GO similarity fail to take advantage of the specificity of GO terms in the ontology graph. We proposed a GO-based method to predict protein-protein interaction by integrating different kinds of similarity measures derived from the intrinsic structure of GO graph. We extended five existing methods to derive the semantic similarity measures from the descending part of two GO terms in the GO graph, then adopted a feature integration strategy to combines both the ascending and the descending similarity scores derived from the three sub-ontologies to construct various kinds of features to characterize each protein pair. Support vector machines (SVM) were employed as discriminate classifiers, and five-fold cross validation experiments were conducted on both human and yeast protein-protein interaction datasets to evaluate the performance of different kinds of integrated features, the experimental results suggest the best performance of the feature that combines information from both the ascending and the descending parts of the three ontologies. Our method is appealing for effective prediction of protein-protein interaction. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. COMe: the ontology of bioinorganic proteins

    Directory of Open Access Journals (Sweden)

    Contrino Sergio

    2004-02-01

    Full Text Available Abstract Background Many characterised proteins contain metal ions, small organic molecules or modified residues. In contrast, the huge amount of data generated by genome projects consists exclusively of sequences with almost no annotation. One of the goals of the structural genomics initiative is to provide representative three-dimensional (3-D structures for as many protein/domain folds as possible to allow successful homology modelling. However, important functional features such as metal co-ordination or a type of prosthetic group are not always conserved in homologous proteins. So far, the problem of correct annotation of bioinorganic proteins has been largely ignored by the bioinformatics community and information on bioinorganic centres obtained by methods other than crystallography or NMR is only available in literature databases. Results COMe (Co-Ordination of Metals represents the ontology for bioinorganic and other small molecule centres in complex proteins. COMe consists of three types of entities: 'bioinorganic motif' (BIM, 'molecule' (MOL, and 'complex proteins' (PRX, with each entity being assigned a unique identifier. A BIM consists of at least one centre (metal atom, inorganic cluster, organic molecule and two or more endogenous and/or exogenous ligands. BIMs are represented as one-dimensional (1-D strings and 2-D diagrams. A MOL entity represents a 'small molecule' which, when in complex with one or more polypeptides, forms a functional protein. The PRX entities refer to the functional proteins as well as to separate protein domains and subunits. The complex proteins in COMe are subdivided into three categories: (i metalloproteins, (ii organic prosthetic group proteins and (iii modified amino acid proteins. The data are currently stored in both XML format and a relational database and are available at http://www.ebi.ac.uk/come/. Conclusion COMe provides the classification of proteins according to their 'bioinorganic' features

  2. Random copolymers that protect proteins

    Science.gov (United States)

    Alexander-Katz, Alfredo; Van Lehn, Reid C.

    2018-03-01

    Scientists have tried and in some limited cases succeeded to harness proteins to do chemistry (1) or use them in functional materials. However, most proteins only function correctly if they fold into specific conformations, which typically occurs with the assistance of other proteins (such as chaperones, translocons, or transporters) that mediate structure formation, membrane insertion, and intracellular trafficking (2, 3). Several methods have been used to improve protein stability in nonbiological environments—including micelle encapsulation, polymer conjugation, and sol-gel trapping (4)—but for most intended applications, they suffer from low levels of functionality, difficult chemical postfunctionalization, or the requirement of very specific solvent environments. On page 1239 of this issue, Panganiban et al. (5) introduce an approach for stabilizing proteins in disparate solvent environments that does not suffer from these drawbacks.

  3. TDP-43 Proteinopathies: A New Player in Neurodegenerative Diseases with Defective Protein Folding

    Directory of Open Access Journals (Sweden)

    Suna Lahut

    2012-03-01

    Full Text Available The proteome is the sum of all proteins inside a cell, and proteostasis (protein homeostasis is the stable condition of the proteome. Proteostasis is essential for the cellular and organismal health. Stress, aging and the chronic expression of misfolded proteins challenge the proteostasis machinery and the vitality of the cell. There is increasing evidence that the accumulation of damaged proteins not only has direct consequences on the efficiency and fidelity of cellular processes but, when not corrected, that they initiate a cascade of dysfunction, which in humans is associated with a plethora of diseases of protein conformation, referred to as proteinopathies. Alzheimer’s Disease (AD, Parkinson’s Disease (PD, Huntington’s Disease (HD, Amyotrophic Lateral Sclerosis (ALS, cancer and diabetes, whose frequencies have drastically increased in countries with aging populations, are all consequences of misfolded proteins. This paper focuses on TDP-43, which excelled as a key protein in neurodegenerative processes because of its association with different diseases, especially with ALS and Frontotemporal Lobar Dementia (FTLD, the two best studied examples of TDP-43 proteinopathies

  4. TDP-43 Proteinopathies: A New Player in Neurodegenerative Diseases with Defective Protein Folding

    Directory of Open Access Journals (Sweden)

    Suna Lahut

    2012-03-01

    Full Text Available The proteome is the sum of all proteins inside a cell, and proteostasis (protein homeostasis is the stable condition of the proteome. Proteostasis is essential for the cellular and organismal health. Stress, aging and the chronic expression of misfolded proteins challenge the proteostasis machinery and the vitality of the cell. There is increasing evidence that the accumulation of damaged proteins not only has direct consequences on the efficiency and fidelity of cellular processes but, when not corrected, that they initiate a cascade of dysfunction, which in humans is associated with a plethora of diseases of protein conformation, referred to as proteinopathies. Alzheimer’s Disease (AD, Parkinson’s Disease (PD, Huntington’s Disease (HD, Amyotrophic Lateral Sclerosis (ALS, cancer and diabetes, whose frequencies have drastically increased in countries with aging populations, are all consequences of misfolded proteins. This paper focuses on TDP-43, which excelled as a key protein in neurodegenerative processes because of its association with different diseases, especially with ALS and Frontotemporal Lobar Dementia (FTLD, the two best studied examples of TDP-43 proteinopathies.

  5. A nonadaptive origin of a beneficial trait: in silico selection for free energy of folding leads to the neutral emergence of mutational robustness in single domain proteins.

    Science.gov (United States)

    Pagan, Rafael F; Massey, Steven E

    2014-02-01

    Proteins are regarded as being robust to the deleterious effects of mutations. Here, the neutral emergence of mutational robustness in a population of single domain proteins is explored using computer simulations. A pairwise contact model was used to calculate the ΔG of folding (ΔG folding) using the three dimensional protein structure of leech eglin C. A random amino acid sequence with low mutational robustness, defined as the average ΔΔG resulting from a point mutation (ΔΔG average), was threaded onto the structure. A population of 1,000 threaded sequences was evolved under selection for stability, using an upper and lower energy threshold. Under these conditions, mutational robustness increased over time in the most common sequence in the population. In contrast, when the wild type sequence was used it did not show an increase in robustness. This implies that the emergence of mutational robustness is sequence specific and that wild type sequences may be close to maximal robustness. In addition, an inverse relationship between ∆∆G average and protein stability is shown, resulting partly from a larger average effect of point mutations in more stable proteins. The emergence of mutational robustness was also observed in the Escherichia coli colE1 Rop and human CD59 proteins, implying that the property may be common in single domain proteins under certain simulation conditions. The results indicate that at least a portion of mutational robustness in small globular proteins might have arisen by a process of neutral emergence, and could be an example of a beneficial trait that has not been directly selected for, termed a "pseudaptation."

  6. How Robust Is the Mechanism of Folding-Upon-Binding for an Intrinsically Disordered Protein?

    Science.gov (United States)

    Bonetti, Daniela; Troilo, Francesca; Brunori, Maurizio; Longhi, Sonia; Gianni, Stefano

    2018-04-24

    The mechanism of interaction of an intrinsically disordered protein (IDP) with its physiological partner is characterized by a disorder-to-order transition in which a recognition and a binding step take place. Even if the mechanism is quite complex, IDPs tend to bind their partner in a cooperative manner such that it is generally possible to detect experimentally only the disordered unbound state and the structured complex. The interaction between the disordered C-terminal domain of the measles virus nucleoprotein (N TAIL ) and the X domain (XD) of the viral phosphoprotein allows us to detect and quantify the two distinct steps of the overall reaction. Here, we analyze the robustness of the folding of N TAIL upon binding to XD by measuring the effect on both the folding and binding steps of N TAIL when the structure of XD is modified. Because it has been shown that wild-type XD is structurally heterogeneous, populating an on-pathway intermediate under native conditions, we investigated the binding to 11 different site-directed variants of N TAIL of one particular variant of XD (I504A XD) that populates only the native state. Data reveal that the recognition and the folding steps are both affected by the structure of XD, indicating a highly malleable pathway. The experimental results are briefly discussed in the light of previous experiments on other IDPs. Copyright © 2018 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  7. Origins of pressure-induced protein transitions.

    Science.gov (United States)

    Chalikian, Tigran V; Macgregor, Robert B

    2009-12-18

    The molecular mechanisms underlying pressure-induced protein denaturation can be analyzed based on the pressure-dependent differences in the apparent volume occupied by amino acids inside the protein and when they are exposed to water in an unfolded conformation. We present here an analysis for the peptide group and the 20 naturally occurring amino acid side chains based on volumetric parameters for the amino acids in the interior of the native state, the micelle-like interior of the pressure-induced denatured state, and the unfolded conformation modeled by N-acetyl amino acid amides. The transfer of peptide groups from the protein interior to water becomes increasingly favorable as pressure increases. Thus, solvation of peptide groups represents a major driving force in pressure-induced protein denaturation. Polar side chains do not appear to exhibit significant pressure-dependent changes in their preference for the protein interior or solvent. The transfer of nonpolar side chains from the protein interior to water becomes more unfavorable as pressure increases. We conclude that a sizeable population of nonpolar side chains remains buried inside a solvent-inaccessible core of the pressure-induced denatured state. At elevated pressures, this core may become packed almost as tightly as the interior of the native state. The presence and partial disappearance of large intraglobular voids is another driving force facilitating pressure-induced denaturation of individual proteins. Our data also have implications for the kinetics of protein folding and shed light on the nature of the folding transition state ensemble.

  8. New force replica exchange method and protein folding pathways probed by force-clamp technique.

    Science.gov (United States)

    Kouza, Maksim; Hu, Chin-Kun; Li, Mai Suan

    2008-01-28

    We have developed a new extended replica exchange method to study thermodynamics of a system in the presence of external force. Our idea is based on the exchange between different force replicas to accelerate the equilibrium process. This new approach was applied to obtain the force-temperature phase diagram and other thermodynamical quantities of the three-domain ubiquitin. Using the C(alpha)-Go model and the Langevin dynamics, we have shown that the refolding pathways of single ubiquitin depend on which terminus is fixed. If the N end is fixed then the folding pathways are different compared to the case when both termini are free, but fixing the C terminal does not change them. Surprisingly, we have found that the anchoring terminal does not affect the pathways of individual secondary structures of three-domain ubiquitin, indicating the important role of the multidomain construction. Therefore, force-clamp experiments, in which one end of a protein is kept fixed, can probe the refolding pathways of a single free-end ubiquitin if one uses either the polyubiquitin or a single domain with the C terminus anchored. However, it is shown that anchoring one end does not affect refolding pathways of the titin domain I27, and the force-clamp spectroscopy is always capable to predict folding sequencing of this protein. We have obtained the reasonable estimate for unfolding barrier of ubiquitin, using the microscopic theory for the dependence of unfolding time on the external force. The linkage between residue Lys48 and the C terminal of ubiquitin is found to have the dramatic effect on the location of the transition state along the end-to-end distance reaction coordinate, but the multidomain construction leaves the transition state almost unchanged. We have found that the maximum force in the force-extension profile from constant velocity force pulling simulations depends on temperature nonlinearly. However, for some narrow temperature interval this dependence becomes

  9. Protein Topology Determines Cysteine Oxidation Fate: The Case of Sulfenyl Amide Formation among Protein Families

    Science.gov (United States)

    Defelipe, Lucas A.; Lanzarotti, Esteban; Gauto, Diego; Marti, Marcelo A.; Turjanski, Adrián G.

    2015-01-01

    Cysteine residues have a rich chemistry and play a critical role in the catalytic activity of a plethora of enzymes. However, cysteines are susceptible to oxidation by Reactive Oxygen and Nitrogen Species, leading to a loss of their catalytic function. Therefore, cysteine oxidation is emerging as a relevant physiological regulatory mechanism. Formation of a cyclic sulfenyl amide residue at the active site of redox-regulated proteins has been proposed as a protection mechanism against irreversible oxidation as the sulfenyl amide intermediate has been identified in several proteins. However, how and why only some specific cysteine residues in particular proteins react to form this intermediate is still unknown. In the present work using in-silico based tools, we have identified a constrained conformation that accelerates sulfenyl amide formation. By means of combined MD and QM/MM calculation we show that this conformation positions the NH backbone towards the sulfenic acid and promotes the reaction to yield the sulfenyl amide intermediate, in one step with the concomitant release of a water molecule. Moreover, in a large subset of the proteins we found a conserved beta sheet-loop-helix motif, which is present across different protein folds, that is key for sulfenyl amide production as it promotes the previous formation of sulfenic acid. For catalytic activity, in several cases, proteins need the Cysteine to be in the cysteinate form, i.e. a low pKa Cys. We found that the conserved motif stabilizes the cysteinate by hydrogen bonding to several NH backbone moieties. As cysteinate is also more reactive toward ROS we propose that the sheet-loop-helix motif and the constraint conformation have been selected by evolution for proteins that need a reactive Cys protected from irreversible oxidation. Our results also highlight how fold conservation can be correlated to redox chemistry regulation of protein function. PMID:25741692

  10. Protein Misfolding Cyclic Amplification of Infectious Prions.

    Science.gov (United States)

    Moda, Fabio

    2017-01-01

    Transmissible spongiform encephalopathies, or prion diseases, are a group of incurable disorders caused by the accumulation of an abnormally folded prion protein (PrP Sc ) in the brain. According to the "protein-only" hypothesis, PrP Sc is the infectious agent able to propagate the disease by acting as a template for the conversion of the correctly folded prion protein (PrP C ) into the pathological isoform. Recently, the mechanism of PrP C conversion has been mimicked in vitro using an innovative technique named protein misfolding cyclic amplification (PMCA). This technology represents a great tool for studying diverse aspects of prion biology in the field of basic research and diagnosis. Moreover, PMCA can be expanded for the study of the misfolding process associated to other neurodegenerative diseases, including Alzheimer's disease, Parkinson's disease, and frontotemporal lobar degeneration. © 2017 Elsevier Inc. All rights reserved.

  11. Interplay between chaperones and protein disorder promotes the evolution of protein networks.

    Directory of Open Access Journals (Sweden)

    Sebastian Pechmann

    2014-06-01

    Full Text Available Evolution is driven by mutations, which lead to new protein functions but come at a cost to protein stability. Non-conservative substitutions are of interest in this regard because they may most profoundly affect both function and stability. Accordingly, organisms must balance the benefit of accepting advantageous substitutions with the possible cost of deleterious effects on protein folding and stability. We here examine factors that systematically promote non-conservative mutations at the proteome level. Intrinsically disordered regions in proteins play pivotal roles in protein interactions, but many questions regarding their evolution remain unanswered. Similarly, whether and how molecular chaperones, which have been shown to buffer destabilizing mutations in individual proteins, generally provide robustness during proteome evolution remains unclear. To this end, we introduce an evolutionary parameter λ that directly estimates the rate of non-conservative substitutions. Our analysis of λ in Escherichia coli, Saccharomyces cerevisiae, and Homo sapiens sequences reveals how co- and post-translationally acting chaperones differentially promote non-conservative substitutions in their substrates, likely through buffering of their destabilizing effects. We further find that λ serves well to quantify the evolution of intrinsically disordered proteins even though the unstructured, thus generally variable regions in proteins are often flanked by very conserved sequences. Crucially, we show that both intrinsically disordered proteins and highly re-wired proteins in protein interaction networks, which have evolved new interactions and functions, exhibit a higher λ at the expense of enhanced chaperone assistance. Our findings thus highlight an intricate interplay of molecular chaperones and protein disorder in the evolvability of protein networks. Our results illuminate the role of chaperones in enabling protein evolution, and underline the

  12. Automated design evolution of stereochemically randomized protein foldamers

    Science.gov (United States)

    Ranbhor, Ranjit; Kumar, Anil; Patel, Kirti; Ramakrishnan, Vibin; Durani, Susheel

    2018-05-01

    Diversification of chain stereochemistry opens up the possibilities of an ‘in principle’ increase in the design space of proteins. This huge increase in the sequence and consequent structural variation is aimed at the generation of smart materials. To diversify protein structure stereochemically, we introduced L- and D-α-amino acids as the design alphabet. With a sequence design algorithm, we explored the usage of specific variables such as chirality and the sequence of this alphabet in independent steps. With molecular dynamics, we folded stereochemically diverse homopolypeptides and evaluated their ‘fitness’ for possible design as protein-like foldamers. We propose a fitness function to prune the most optimal fold among 1000 structures simulated with an automated repetitive simulated annealing molecular dynamics (AR-SAMD) approach. The highly scored poly-leucine fold with sequence lengths of 24 and 30 amino acids were later sequence-optimized using a Dead End Elimination cum Monte Carlo based optimization tool. This paper demonstrates a novel approach for the de novo design of protein-like foldamers.

  13. In-cell thermodynamics and a new role for protein surfaces.

    Science.gov (United States)

    Smith, Austin E; Zhou, Larry Z; Gorensek, Annelise H; Senske, Michael; Pielak, Gary J

    2016-02-16

    There is abundant, physiologically relevant knowledge about protein cores; they are hydrophobic, exquisitely well packed, and nearly all hydrogen bonds are satisfied. An equivalent understanding of protein surfaces has remained elusive because proteins are almost exclusively studied in vitro in simple aqueous solutions. Here, we establish the essential physiological roles played by protein surfaces by measuring the equilibrium thermodynamics and kinetics of protein folding in the complex environment of living Escherichia coli cells, and under physiologically relevant in vitro conditions. Fluorine NMR data on the 7-kDa globular N-terminal SH3 domain of Drosophila signal transduction protein drk (SH3) show that charge-charge interactions are fundamental to protein stability and folding kinetics in cells. Our results contradict predictions from accepted theories of macromolecular crowding and show that cosolutes commonly used to mimic the cellular interior do not yield physiologically relevant information. As such, we provide the foundation for a complete picture of protein chemistry in cells.

  14. Novel fusion protein approach for efficient high-throughput screening of small molecule-mediating protein-protein interactions in cells and living animals.

    Science.gov (United States)

    Paulmurugan, Ramasamy; Gambhir, Sanjiv S

    2005-08-15

    Networks of protein interactions execute many different intracellular pathways. Small molecules either synthesized within the cell or obtained from the external environment mediate many of these protein-protein interactions. The study of these small molecule-mediated protein-protein interactions is important in understanding abnormal signal transduction pathways in a variety of disorders, as well as in optimizing the process of drug development and validation. In this study, we evaluated the rapamycin-mediated interaction of the human proteins FK506-binding protein (FKBP12) rapamycin-binding domain (FRB) and FKBP12 by constructing a fusion of these proteins with a split-Renilla luciferase or a split enhanced green fluorescent protein (split-EGFP) such that complementation of the reporter fragments occurs in the presence of rapamycin. Different linker peptides in the fusion protein were evaluated for the efficient maintenance of complemented reporter activity. This system was studied in both cell culture and xenografts in living animals. We found that peptide linkers with two or four EAAAR repeat showed higher protein-protein interaction-mediated signal with lower background signal compared with having no linker or linkers with amino acid sequences GGGGSGGGGS, ACGSLSCGSF, and ACGSLSCGSFACGSLSCGSF. A 9 +/- 2-fold increase in signal intensity both in cell culture and in living mice was seen compared with a system that expresses both reporter fragments and the interacting proteins separately. In this fusion system, rapamycin induced heterodimerization of the FRB and FKBP12 moieties occurred rapidly even at very lower concentrations (0.00001 nmol/L) of rapamycin. For a similar fusion system employing split-EGFP, flow cytometry analysis showed significant level of rapamycin-induced complementation.

  15. Can misfolded proteins be beneficial? The HAMLET case.

    Science.gov (United States)

    Pettersson-Kastberg, Jenny; Aits, Sonja; Gustafsson, Lotta; Mossberg, Anki; Storm, Petter; Trulsson, Maria; Persson, Filip; Mok, K Hun; Svanborg, Catharina

    2009-01-01

    By changing the three-dimensional structure, a protein can attain new functions, distinct from those of the native protein. Amyloid-forming proteins are one example, in which conformational change may lead to fibril formation and, in many cases, neurodegenerative disease. We have proposed that partial unfolding provides a mechanism to generate new and useful functional variants from a given polypeptide chain. Here we present HAMLET (Human Alpha-lactalbumin Made LEthal to Tumor cells) as an example where partial unfolding and the incorporation of cofactor create a complex with new, beneficial properties. Native alpha-lactalbumin functions as a substrate specifier in lactose synthesis, but when partially unfolded the protein binds oleic acid and forms the tumoricidal HAMLET complex. When the properties of HAMLET were first described they were surprising, as protein folding intermediates and especially amyloid-forming protein intermediates had been regarded as toxic conformations, but since then structural studies have supported functional diversity arising from a change in fold. The properties of HAMLET suggest a mechanism of structure-function variation, which might help the limited number of human protein genes to generate sufficient structural diversity to meet the diverse functional demands of complex organisms.

  16. Folding DNA into a Lipid-Conjugated Nanobarrel for Controlled Reconstitution of Membrane Proteins.

    Science.gov (United States)

    Dong, Yuanchen; Chen, Shuobing; Zhang, Shijian; Sodroski, Joseph; Yang, Zhongqiang; Liu, Dongsheng; Mao, Youdong

    2018-02-19

    Building upon DNA origami technology, we introduce a method to reconstitute a single membrane protein into a self-assembled DNA nanobarrel that scaffolds a nanodisc-like lipid environment. Compared with the membrane-scaffolding-protein nanodisc technique, our approach gives rise to defined stoichiometry, controlled sizes, as well as enhanced stability and homogeneity in membrane protein reconstitution. We further demonstrate potential applications of the DNA nanobarrels in the structural analysis of membrane proteins. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  17. Applications of Protein Thermodynamic Database for Understanding Protein Mutant Stability and Designing Stable Mutants.

    Science.gov (United States)

    Gromiha, M Michael; Anoosha, P; Huang, Liang-Tsung

    2016-01-01

    Protein stability is the free energy difference between unfolded and folded states of a protein, which lies in the range of 5-25 kcal/mol. Experimentally, protein stability is measured with circular dichroism, differential scanning calorimetry, and fluorescence spectroscopy using thermal and denaturant denaturation methods. These experimental data have been accumulated in the form of a database, ProTherm, thermodynamic database for proteins and mutants. It also contains sequence and structure information of a protein, experimental methods and conditions, and literature information. Different features such as search, display, and sorting options and visualization tools have been incorporated in the database. ProTherm is a valuable resource for understanding/predicting the stability of proteins and it can be accessed at http://www.abren.net/protherm/ . ProTherm has been effectively used to examine the relationship among thermodynamics, structure, and function of proteins. We describe the recent progress on the development of methods for understanding/predicting protein stability, such as (1) general trends on mutational effects on stability, (2) relationship between the stability of protein mutants and amino acid properties, (3) applications of protein three-dimensional structures for predicting their stability upon point mutations, (4) prediction of protein stability upon single mutations from amino acid sequence, and (5) prediction methods for addressing double mutants. A list of online resources for predicting has also been provided.

  18. Unraveling a phosphorylation event in a folded protein by NMR spectroscopy: phosphorylation of the Pin1 WW domain by PKA

    International Nuclear Information System (INIS)

    Smet-Nocca, Caroline; Launay, Hélène; Wieruszeski, Jean-Michel; Lippens, Guy; Landrieu, Isabelle

    2013-01-01

    The Pin1 protein plays a critical role in the functional regulation of the hyperphosphorylated neuronal Tau protein in Alzheimer’s disease and is by itself regulated by phosphorylation. We have used Nuclear Magnetic Resonance (NMR) spectroscopy to both identify the PKA phosphorylation site in the Pin1 WW domain and investigate the functional consequences of this phosphorylation. Detection and identification of phosphorylation on serine/threonine residues in a globular protein, while mostly occurring in solvent-exposed flexible loops, does not lead to chemical shift changes as obvious as in disordered proteins and hence does not necessarily shift the resonances outside the spectrum of the folded protein. Other complications were encountered to characterize the extent of the phosphorylation, as part of the 1 H, 15 N amide resonances around the phosphorylation site are specifically broadened in the unphosphorylated state. Despite these obstacles, NMR spectroscopy was an efficient tool to confirm phosphorylation on S16 of the WW domain and to quantify the level of phosphorylation. Based on this analytical characterization, we show that WW phosphorylation on S16 abolishes its binding capacity to a phosphorylated Tau peptide. A reduced conformational heterogeneity and flexibility of the phospho-binding loop upon S16 phosphorylation could account for part of the decreased affinity for its phosphorylated partner. Additionally, a structural model of the phospho-WW obtained by molecular dynamics simulation and energy minimization suggests that the phosphate moiety of phospho-S16 could compete with the phospho-substrate.

  19. Exploring protein dynamics space: the dynasome as the missing link between protein structure and function.

    Directory of Open Access Journals (Sweden)

    Ulf Hensen

    Full Text Available Proteins are usually described and classified according to amino acid sequence, structure or function. Here, we develop a minimally biased scheme to compare and classify proteins according to their internal mobility patterns. This approach is based on the notion that proteins not only fold into recurring structural motifs but might also be carrying out only a limited set of recurring mobility motifs. The complete set of these patterns, which we tentatively call the dynasome, spans a multi-dimensional space with axes, the dynasome descriptors, characterizing different aspects of protein dynamics. The unique dynamic fingerprint of each protein is represented as a vector in the dynasome space. The difference between any two vectors, consequently, gives a reliable measure of the difference between the corresponding protein dynamics. We characterize the properties of the dynasome by comparing the dynamics fingerprints obtained from molecular dynamics simulations of 112 proteins but our approach is, in principle, not restricted to any specific source of data of protein dynamics. We conclude that: 1. the dynasome consists of a continuum of proteins, rather than well separated classes. 2. For the majority of proteins we observe strong correlations between structure and dynamics. 3. Proteins with similar function carry out similar dynamics, which suggests a new method to improve protein function annotation based on protein dynamics.

  20. Using the Computer Game "FoldIt" to Entice Students to Explore External Representations of Protein Structure in a Biochemistry Course for Nonmajors

    Science.gov (United States)

    Farley, Peter C.

    2013-01-01

    This article describes a novel approach to teaching novice Biochemistry students visual literacy skills and understanding of some aspects of protein structure using the internet resource FoldIt and a worksheet based on selected Introductory Puzzles from this computer game. In responding to a questionnaire, students indicated that they (94%)…

  1. Illuminating structural proteins in viral "dark matter" with metaproteomics.

    Science.gov (United States)

    Brum, Jennifer R; Ignacio-Espinoza, J Cesar; Kim, Eun-Hae; Trubl, Gareth; Jones, Robert M; Roux, Simon; VerBerkmoes, Nathan C; Rich, Virginia I; Sullivan, Matthew B

    2016-03-01

    Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional dark matter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over one-half of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.

  2. Effects of Polymer Hydrophobicity on Protein Structure and Aggregation Kinetics in Crowded Milieu.

    Science.gov (United States)

    Breydo, Leonid; Sales, Amanda E; Frege, Telma; Howell, Mark C; Zaslavsky, Boris Y; Uversky, Vladimir N

    2015-05-19

    We examined the effects of water-soluble polymers of various degrees of hydrophobicity on the folding and aggregation of proteins. The polymers we chose were polyethylene glycol (PEG) and UCON (1:1 copolymer of ethylene glycol and propylene glycol). The presence of additional methyl groups in UCON makes it more hydrophobic than PEG. Our earlier analysis revealed that similarly sized PEG and UCON produced different changes in the solvent properties of water in their solutions and induced morphologically different α-synuclein aggregates [Ferreira, L. A., et al. (2015) Role of solvent properties of aqueous media in macromolecular crowding effects. J. Biomol. Struct. Dyn., in press]. To improve our understanding of molecular mechanisms defining behavior of proteins in a crowded environment, we tested the effects of these polymers on secondary and tertiary structure and aromatic residue solvent accessibility of 10 proteins [five folded proteins, two hybrid proteins; i.e., protein containing ordered and disordered domains, and three intrinsically disordered proteins (IDPs)] and on the aggregation kinetics of insulin and α-synuclein. We found that effects of both polymers on secondary and tertiary structures of folded and hybrid proteins were rather limited with slight unfolding observed in some cases. Solvent accessibility of aromatic residues was significantly increased for the majority of the studied proteins in the presence of UCON but not PEG. PEG also accelerated the aggregation of protein into amyloid fibrils, whereas UCON promoted aggregation to amyloid oligomers instead. These results indicate that even a relatively small change in polymer structure leads to a significant change in the effect of this polymer on protein folding and aggregation. This is an indication that protein folding and especially aggregation are highly sensitive to the presence of other macromolecules, and an excluded volume effect is insufficient to describe their effect.

  3. Functional studies on the phosphatidychloride transfer protein

    NARCIS (Netherlands)

    Brouwer, A.P.M. de

    2002-01-01

    The phosphatidylcholine transfer protein (PC-TP) has been studied for over 30 years now. Despite extensive research concerning the biochemical, biophysical and structural properties of PC-TP, the function of this protein is still elusive. We have studied in vitro the folding and the mechanism of PC

  4. Regulation of the endoplasmic reticulum calcium storage during the unfolded protein response--significance in tissue ischemia?

    DEFF Research Database (Denmark)

    Treiman, Marek

    2002-01-01

    for the protein folding pathway require Ca(2+) binding for their activity. A number of factors, including Ca(2+) depletion, may interfere with the folding pathway within the ER, with a potential for cell injury through an accumulation of malfolded protein aggregates. The Unfolded Protein Response involves...... a transcriptional upregulation of a number of the ER-resident folding helper proteins and becomes triggered when the folding pathway is blocked. To be effective, these upregulated proteins require a sufficient supply of Ca(2+) cofactor within the ER lumen. In tissue ischemia, where the availablity of this cofactor...

  5. Protein structure recognition: From eigenvector analysis to structural threading method

    Science.gov (United States)

    Cao, Haibo

    In this work, we try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. We found a strong correlation between amino acid sequence and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, we give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part include discussions of interactions among amino acids residues, lattice HP model, and the designablity principle. In the second part, we try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in our eigenvector study of protein contact matrix. We believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, we discuss a threading method based on the correlation between amino acid sequence and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, we list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches.

  6. Protein Structure Recognition: From Eigenvector Analysis to Structural Threading Method

    Energy Technology Data Exchange (ETDEWEB)

    Cao, Haibo [Iowa State Univ., Ames, IA (United States)

    2003-01-01

    In this work, they try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. They found a strong correlation between amino acid sequences and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, they give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part includes discussions of interactions among amino acids residues, lattice HP model, and the design ability principle. In the second part, they try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in the eigenvector study of protein contact matrix. They believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, they discuss a threading method based on the correlation between amino acid sequences and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, they list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches.

  7. Protein Structure Recognition: From Eigenvector Analysis to Structural Threading Method

    International Nuclear Information System (INIS)

    Haibo Cao

    2003-01-01

    In this work, they try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. They found a strong correlation between amino acid sequences and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, they give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part includes discussions of interactions among amino acids residues, lattice HP model, and the design ability principle. In the second part, they try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in the eigenvector study of protein contact matrix. They believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, they discuss a threading method based on the correlation between amino acid sequences and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, they list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches

  8. Bactericidal/Permeability-increasing protein fold-containing family member A1 in airway host protection and respiratory disease.

    Science.gov (United States)

    Britto, Clemente J; Cohn, Lauren

    2015-05-01

    Bactericidal/permeability-increasing protein fold-containing family member A1 (BPIFA1), formerly known as SPLUNC1, is one of the most abundant proteins in respiratory secretions and has been identified with increasing frequency in studies of pulmonary disease. Its expression is largely restricted to the respiratory tract, being highly concentrated in the upper airways and proximal trachea. BPIFA1 is highly responsive to airborne pathogens, allergens, and irritants. BPIFA1 actively participates in host protection through antimicrobial, surfactant, airway surface liquid regulation, and immunomodulatory properties. Its expression is modulated in multiple lung diseases, including cystic fibrosis, chronic obstructive pulmonary disease, respiratory malignancies, and idiopathic pulmonary fibrosis. However, the role of BPIFA1 in pulmonary pathogenesis remains to be elucidated. This review highlights the versatile properties of BPIFA1 in antimicrobial protection and its roles as a sensor of environmental exposure and regulator of immune cell function. A greater understanding of the contribution of BPIFA1 to disease pathogenesis and activity may clarify if BPIFA1 is a biomarker and potential drug target in pulmonary disease.

  9. Structural Basis for Target Protein Regcognition by Thiredoxin

    DEFF Research Database (Denmark)

    Maeda, Kenji

    2007-01-01

    Ser) and a mutant of an in vitro substrate alpha-amylase/subtilisin inhibitor (BASI) (Cys144Ser), as a reaction intermediate-mimic of Trx-catalyzed disulfide reduction. The resultant structure showed a sequence of BASI residues along a conserved hydrophobic groove constituted of three loop segments...... of Trx-fold proteins glutaredoxin and glutathione transferase. This study suggests that the features of main chain conformation as well as charge property around disulfide bonds in protein substrates are important factors for interaction with Trx. Moreover, this study describes a detailed structural......Thioredoxin (Trx) is an ubiquitous protein disulfide reductase that possesses two redox active cysteines in the conserved active site sequence motif, Trp-CysN-Gly/Pro-Pro-CysC situated in the so called Trx-fold. The lack of insight into the protein substrate recognition mechanism of Trx has to date...

  10. Tracking of protein folding by chiral spectroscopic methods

    Czech Academy of Sciences Publication Activity Database

    Krupová, Monika; Andrushchenko, Valery; Bouř, Petr

    2016-01-01

    Roč. 23, č. 1 (2016), s. 36 ISSN 1211-5894. [Discussions in Structural Molecular Biology /14./. 17.03.2016-19.03.2016, Nové Hrady] R&D Projects: GA ČR(CZ) GA16-04902S; GA ČR GA15-09072S Institutional support: RVO:61388963 Keywords : proteins * fibrils * lanthanides * vibrational circular dichroism * circularly polarised luminescence Subject RIV: CF - Physical ; Theoretical Chemistry

  11. Integrated Structural Biology for α-Helical Membrane Protein Structure Determination.

    Science.gov (United States)

    Xia, Yan; Fischer, Axel W; Teixeira, Pedro; Weiner, Brian; Meiler, Jens

    2018-04-03

    While great progress has been made, only 10% of the nearly 1,000 integral, α-helical, multi-span membrane protein families are represented by at least one experimentally determined structure in the PDB. Previously, we developed the algorithm BCL::MP-Fold, which samples the large conformational space of membrane proteins de novo by assembling predicted secondary structure elements guided by knowledge-based potentials. Here, we present a case study of rhodopsin fold determination by integrating sparse and/or low-resolution restraints from multiple experimental techniques including electron microscopy, electron paramagnetic resonance spectroscopy, and nuclear magnetic resonance spectroscopy. Simultaneous incorporation of orthogonal experimental restraints not only significantly improved the sampling accuracy but also allowed identification of the correct fold, which is demonstrated by a protein size-normalized transmembrane root-mean-square deviation as low as 1.2 Å. The protocol developed in this case study can be used for the determination of unknown membrane protein folds when limited experimental restraints are available. Copyright © 2018 Elsevier Ltd. All rights reserved.

  12. Effect of Addition of Concentrated Proteins and Seminal Plasma Low Molecular Weight Proteins in Freezing and Thawing of Equine Semen

    Directory of Open Access Journals (Sweden)

    Fagundes, B.

    2011-07-01

    Full Text Available Difficulties in obtaining equine frozen semen with potential fertility are recognized. This study was designed to investigate the effect of seminal plasma on frozen/thawing of eight stallion semen from different breed using the following treatments: Seminal plasma with ten-fold concentrated proteins with molecular weight above 10 kDa on frozen extender; Part of seminal plasma with proteins under 10 kDa on frozen extender; Conventional freezing, using whole seminal plasma on frozen extender. Using the parameter of 30% of seminal motility post-thawing as index of good freezability, it was verified an increased percentage of stallions that presented good freezability when semen was frozen with seminal plasma containing ten-fold concentrated proteins with molecular weight above 10 kDa on frozen extender. These results, suggested the use of seminal plasma concentrated proteins from own stallion to freezing/thawing semen.

  13. Active site mutations in yeast protein disulfide isomerase cause dithiothreitol sensitivity and a reduced rate of protein folding in the endoplasmic reticulum

    DEFF Research Database (Denmark)

    Holst, B; Tachibana, C; Winther, Jakob R.

    1997-01-01

    Aspects of protein disulfide isomerase (PDI) function have been studied in yeast in vivo. PDI contains two thioredoxin-like domains, a and a', each of which contains an active-site CXXC motif. The relative importance of the two domains was analyzed by rendering each one inactive by mutation to SGAS....... Such mutations had no significant effect on growth. The domains however, were not equivalent since the rate of folding of carboxypeptidase Y (CPY) in vivo was reduced by inactivation of the a domain but not the a' domain. To investigate the relevance of PDI redox potential, the G and H positions of each CGHC......-deleted strains overexpressing the yeast PDI homologue EUG1 are viable. Exchanging the wild-type Eug1p C(L/I)HS active site sequences for C(L/I)HC increased the growth rate significantly, however, further highlighting the importance of the oxidizing function for optimal growth....

  14. An approach for high-throughput structure determination of proteins by NMR spectroscopy

    Energy Technology Data Exchange (ETDEWEB)

    Medek, Ales; Olejniczak, Edward T.; Meadows, Robert P.; Fesik, Stephen W. [Abbott Laboratories, Pharmaceutical Discovery Division (United States)

    2000-11-15

    An approach is described for rapidly determining protein structures by NMR that utilizes proteins containing {sup 13}C-methyl labeled Val, Leu, and Ile ({delta}1) and protonated Phe and Tyr in a deuterated background. Using this strategy, the key NOEs that define the hydrophobic core and overall fold of the protein are easily obtained. NMR data are acquired using cryogenic probe technology which markedly reduces the spectrometer time needed for data acquisition. The approach is demonstrated by determining the overall fold of the antiapoptotic protein, Bcl-xL, from data collected in only 4 days. Refinement of the Bcl-xL structure to a backbone rmsd of 0.95 A was accomplished with data collected in an additional 3 days. A distance analysis of 180 different proteins and structure calculations using simulated data suggests that our method will allow the global folds of a wide variety of proteins to be determined.

  15. A de novo designed monomeric, compact three helix bundle protein on a carbohydrate template

    DEFF Research Database (Denmark)

    Malik, Leila; Nygård, Jesper; Christensen, Niels Johan

    2015-01-01

    De novo design and chemical synthesis of proteins and of other artificial structures, which mimic them, is a central strategy for understanding protein folding and for accessing proteins with novel functions. We have previously described carbohydrates as templates for the assembly of artificial...... the template could facilitate protein folding. Here we report the design and synthesis of 3-helix bundle carboproteins on deoxy-hexopyranosides. The carboproteins were analyzed by CD, AUC, SAXS, and NMR, which revealed the formation of the first compact, and folded monomeric carboprotein distinctly different...

  16. Cyanobacteria contain a structural homologue of the Hfq protein with altered RNA binding properties

    DEFF Research Database (Denmark)

    Bøggild, Andreas; Overgaard, Martin; Valentin-Hansen, Poul

    2009-01-01

    Hfq proteins are common in many species of enterobacteria, where they participate in RNA folding and translational regulation through pairing of small RNAs and messenger RNAs. Hfq proteins share the distinctive Sm fold, and form ring-shaped structures similar to those of the Sm/Lsm proteins...... proteins from the cyanobacteria Synechocystis sp. PCC 6803 and Anabaena PCC 7120 at 1.3 and 2.3 A resolution, respectively, and show that they retain the classic Sm fold despite low sequence conservation. In addition, the intersubunit contacts and RNA-binding site are divergent, and we show biochemically...

  17. Cyanobacteria contain a structural homologue of the Hfq protein with altered RNA-binding properties

    DEFF Research Database (Denmark)

    Bøggild, Andreas; Overgaard, Martin; Valentin-Hansen, Poul

    2009-01-01

    Hfq proteins are common in many species of enterobacteria, where they participate in RNA folding and translational regulation through pairing of small RNAs and messenger RNAs. Hfq proteins share the distinctive Sm fold, and form ring-shaped structures similar to those of the Sm/Lsm proteins...... proteins from the cyanobacteria Synechocystis sp. PCC 6803 and Anabaena PCC 7120 at 1.3 and 2.3 A resolution, respectively, and show that they retain the classic Sm fold despite low sequence conservation. In addition, the intersubunit contacts and RNA-binding site are divergent, and we show biochemically...

  18. Protein 3D structure computed from evolutionary sequence variation.

    Directory of Open Access Journals (Sweden)

    Debora S Marks

    Full Text Available The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing.In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy.We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues, including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7-4.8 Å C(α-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org. This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of

  19. Adjustable chain trees for proteins

    DEFF Research Database (Denmark)

    Winter, Pawel; Fonseca, Rasmus

    2012-01-01

    A chain tree is a data structure for changing protein conformations. It enables very fast detection of clashes and free energy potential calculations. A modified version of chain trees that adjust themselves to the changing conformations of folding proteins is introduced. This results in much...... tighter bounding volume hierarchies and therefore fewer intersection checks. Computational results indicate that the efficiency of the adjustable chain trees is significantly improved compared to the traditional chain trees....

  20. Protein flexibility in the light of structural alphabets. \

    Czech Academy of Sciences Publication Activity Database

    Craveur, P.; Joseph, A,P.; Esque, J.; Narwani, T.J.; Noël, F.; Shinada, N.; Goguet, M.; Leonard, S.; Poulain, P.; Bertrand, O.; Faure, G.; Rebehmed, J.; Ghozlane, A.; Swapna, L.S.; Bhaskara, R.M.; Barnoud, J.; Téletchéa, S.; Jallu, V.; Černý, Jiří; Schneider, Bohdan; Etchebest, C.; Srinivasan, N.; Gelly, J.-Ch.; deBrevern, A.G.

    2015-01-01

    Roč. 2, 27 May 2015 (2015) ISSN 2296-889X R&D Projects: GA MŠk(CZ) ED1.1.00/02.0109 Institutional support: RVO:86652036 Keywords : allostery * protein complexes * protein folding Subject RIV: EB - Genetics ; Molecular Biology

  1. Toxicological relationships between proteins obtained from protein target predictions of large toxicity databases

    International Nuclear Information System (INIS)

    Nigsch, Florian; Mitchell, John B.O.

    2008-01-01

    The combination of models for protein target prediction with large databases containing toxicological information for individual molecules allows the derivation of 'toxiclogical' profiles, i.e., to what extent are molecules of known toxicity predicted to interact with a set of protein targets. To predict protein targets of drug-like and toxic molecules, we built a computational multiclass model using the Winnow algorithm based on a dataset of protein targets derived from the MDL Drug Data Report. A 15-fold Monte Carlo cross-validation using 50% of each class for training, and the remaining 50% for testing, provided an assessment of the accuracy of that model. We retained the 3 top-ranking predictions and found that in 82% of all cases the correct target was predicted within these three predictions. The first prediction was the correct one in almost 70% of cases. A model built on the whole protein target dataset was then used to predict the protein targets for 150 000 molecules from the MDL Toxicity Database. We analysed the frequency of the predictions across the panel of protein targets for experimentally determined toxicity classes of all molecules. This allowed us to identify clusters of proteins related by their toxicological profiles, as well as toxicities that are related. Literature-based evidence is provided for some specific clusters to show the relevance of the relationships identified

  2. Reticulomics: Protein-Protein Interaction Studies with Two Plasmodesmata-Localized Reticulon Family Proteins Identify Binding Partners Enriched at Plasmodesmata, Endoplasmic Reticulum, and the Plasma Membrane.

    Science.gov (United States)

    Kriechbaumer, Verena; Botchway, Stanley W; Slade, Susan E; Knox, Kirsten; Frigerio, Lorenzo; Oparka, Karl; Hawes, Chris

    2015-11-01

    The endoplasmic reticulum (ER) is a ubiquitous organelle that plays roles in secretory protein production, folding, quality control, and lipid biosynthesis. The cortical ER in plants is pleomorphic and structured as a tubular network capable of morphing into flat cisternae, mainly at three-way junctions, and back to tubules. Plant reticulon family proteins (RTNLB) tubulate the ER by dimerization and oligomerization, creating localized ER membrane tensions that result in membrane curvature. Some RTNLB ER-shaping proteins are present in the plasmodesmata (PD) proteome and may contribute to the formation of the desmotubule, the axial ER-derived structure that traverses primary PD. Here, we investigate the binding partners of two PD-resident reticulon proteins, RTNLB3 and RTNLB6, that are located in primary PD at cytokinesis in tobacco (Nicotiana tabacum). Coimmunoprecipitation of green fluorescent protein-tagged RTNLB3 and RTNLB6 followed by mass spectrometry detected a high percentage of known PD-localized proteins as well as plasma membrane proteins with putative membrane-anchoring roles. Förster resonance energy transfer by fluorescence lifetime imaging microscopy assays revealed a highly significant interaction of the detected PD proteins with the bait RTNLB proteins. Our data suggest that RTNLB proteins, in addition to a role in ER modeling, may play important roles in linking the cortical ER to the plasma membrane. © 2015 American Society of Plant Biologists. All Rights Reserved.

  3. Circular Permutation of a Chaperonin Protein: Biophysics and Application to Nanotechnology

    Science.gov (United States)

    Paavola, Chad; Chan, Suzanne; Li, Yi-Fen; McMillan, R. Andrew; Trent, Jonathan

    2004-01-01

    We have designed five circular permutants of a chaperonin protein derived from the hyperthermophilic organism Sulfolobus shibatae. These permuted proteins were expressed in E. coli and are well-folded. Furthermore, all the permutants assemble into 18-mer double rings of the same form as the wild-type protein. We characterized the thermodynamics of folding for each permutant by both guanidine denaturation and differential scanning calorimetry. We also examined the assembly of chaperonin rings into higher order structures that may be used as nanoscale templates. The results show that circular permutation can be used to tune the thermodynamic properties of a protein template as well as facilitating the fusion of peptides, binding proteins or enzymes onto nanostructured templates.

  4. The small angle x-ray scattering of globular proteins in solution during heat denaturation

    Science.gov (United States)

    Banuelos, Jose; Urquidi, Jacob

    2008-10-01

    The ability of proteins to change their conformation in response to changes in their environment has consequences in biological processes like metabolism, chemical regulation in cells, and is believed to play a role in the onset of several neurodegenerative diseases. Factors such as a change in temperature, pressure, and the introduction of ions into the aqueous environment of a protein can give rise to the folding/unfolding of a protein. As a protein unfolds, the ratio of nonpolar to polar groups exposed to water changes, affecting a protein's thermodynamic properties. Using small angle x-ray scattering (SAXS), we are currently studying the intermediate protein conformations that arise during the folding/unfolding process as a function of temperature for five globular proteins. Trends in the observed intermediate structures of these globular proteins, along with correlations with data on protein thermodynamics may help elucidate shared characteristics between all proteins in the folding/unfolding process. Experimental design considerations will be discussed and preliminary results for some of these systems will be presented.

  5. False positive reduction in protein-protein interaction predictions using gene ontology annotations

    Directory of Open Access Journals (Sweden)

    Lin Yen-Han

    2007-07-01

    Full Text Available Abstract Background Many crucial cellular operations such as metabolism, signalling, and regulations are based on protein-protein interactions. However, the lack of robust protein-protein interaction information is a challenge. One reason for the lack of solid protein-protein interaction information is poor agreement between experimental findings and computational sets that, in turn, comes from huge false positive predictions in computational approaches. Reduction of false positive predictions and enhancing true positive fraction of computationally predicted protein-protein interaction datasets based on highly confident experimental results has not been adequately investigated. Results Gene Ontology (GO annotations were used to reduce false positive protein-protein interactions (PPI pairs resulting from computational predictions. Using experimentally obtained PPI pairs as a training dataset, eight top-ranking keywords were extracted from GO molecular function annotations. The sensitivity of these keywords is 64.21% in the yeast experimental dataset and 80.83% in the worm experimental dataset. The specificities, a measure of recovery power, of these keywords applied to four predicted PPI datasets for each studied organisms, are 48.32% and 46.49% (by average of four datasets in yeast and worm, respectively. Based on eight top-ranking keywords and co-localization of interacting proteins a set of two knowledge rules were deduced and applied to remove false positive protein pairs. The 'strength', a measure of improvement provided by the rules was defined based on the signal-to-noise ratio and implemented to measure the applicability of knowledge rules applying to the predicted PPI datasets. Depending on the employed PPI-predicting methods, the strength varies between two and ten-fold of randomly removing protein pairs from the datasets. Conclusion Gene Ontology annotations along with the deduced knowledge rules could be implemented to partially

  6. Direct Detection of Biotinylated Proteins by Mass Spectrometry

    Science.gov (United States)

    2015-01-01

    Mass spectrometric strategies to identify protein subpopulations involved in specific biological functions rely on covalently tagging biotin to proteins using various chemical modification methods. The biotin tag is primarily used for enrichment of the targeted subpopulation for subsequent mass spectrometry (MS) analysis. A limitation of these strategies is that MS analysis does not easily discriminate unlabeled contaminants from the labeled protein subpopulation under study. To solve this problem, we developed a flexible method that only relies on direct MS detection of biotin-tagged proteins called “Direct Detection of Biotin-containing Tags” (DiDBiT). Compared with conventional targeted proteomic strategies, DiDBiT improves direct detection of biotinylated proteins ∼200 fold. We show that DiDBiT is applicable to several protein labeling protocols in cell culture and in vivo using cell permeable NHS-biotin and incorporation of the noncanonical amino acid, azidohomoalanine (AHA), into newly synthesized proteins, followed by click chemistry tagging with biotin. We demonstrate that DiDBiT improves the direct detection of biotin-tagged newly synthesized peptides more than 20-fold compared to conventional methods. With the increased sensitivity afforded by DiDBiT, we demonstrate the MS detection of newly synthesized proteins labeled in vivo in the rodent nervous system with unprecedented temporal resolution as short as 3 h. PMID:25117199

  7. Unraveling a phosphorylation event in a folded protein by NMR spectroscopy: phosphorylation of the Pin1 WW domain by PKA

    Energy Technology Data Exchange (ETDEWEB)

    Smet-Nocca, Caroline, E-mail: caroline.smet@univ-lille1.fr; Launay, Helene; Wieruszeski, Jean-Michel; Lippens, Guy; Landrieu, Isabelle, E-mail: isabelle.landrieu@univ-lille1.fr [Universite de Lille-Nord de France, Institut Federatif de Recherches 147, CNRS UMR 8576 (France)

    2013-04-15

    The Pin1 protein plays a critical role in the functional regulation of the hyperphosphorylated neuronal Tau protein in Alzheimer's disease and is by itself regulated by phosphorylation. We have used Nuclear Magnetic Resonance (NMR) spectroscopy to both identify the PKA phosphorylation site in the Pin1 WW domain and investigate the functional consequences of this phosphorylation. Detection and identification of phosphorylation on serine/threonine residues in a globular protein, while mostly occurring in solvent-exposed flexible loops, does not lead to chemical shift changes as obvious as in disordered proteins and hence does not necessarily shift the resonances outside the spectrum of the folded protein. Other complications were encountered to characterize the extent of the phosphorylation, as part of the {sup 1}H,{sup 15}N amide resonances around the phosphorylation site are specifically broadened in the unphosphorylated state. Despite these obstacles, NMR spectroscopy was an efficient tool to confirm phosphorylation on S16 of the WW domain and to quantify the level of phosphorylation. Based on this analytical characterization, we show that WW phosphorylation on S16 abolishes its binding capacity to a phosphorylated Tau peptide. A reduced conformational heterogeneity and flexibility of the phospho-binding loop upon S16 phosphorylation could account for part of the decreased affinity for its phosphorylated partner. Additionally, a structural model of the phospho-WW obtained by molecular dynamics simulation and energy minimization suggests that the phosphate moiety of phospho-S16 could compete with the phospho-substrate.

  8. Understanding curcumin-induced modulation of protein aggregation.

    Science.gov (United States)

    Ahmad, Basir; Borana, Mohanish S; Chaudhary, Ankur P

    2017-07-01

    Curcumin, a diarylheptanoid compound, found in spice turmeric is known to alter the aggregation of proteins and reduce the toxicity of the aggregates. This review looks at the molecular basis of modulating protein aggregation and toxicity of the aggregates. Foremost, we identify the interaction of curcumin and its derivatives with proteins/peptides and the effect of their interaction on the conformational stability and unfolding/folding pathway(s). The unfolding/folding processes generate partially folded/unfolded intermediate, which serve as aggregation precursor state. Secondly, we discuss the effect of curcumin binding on the kinetics parameters of the aggregation process, which give information about the mechanism of the aggregation inhibition. We describe, in addition, that curcumin can accelerate/promote fibril formation by binding to oligomeric intermediate(s) accumulated in the aggregation pathway. Finally, we discuss the correlation of curcumin-induced monomeric and/or oligomeric precursor states with aggregate structure and toxicity. On the basis of these discussions, we propose a model describing curcumin-induced inhibition/promotion of formation of amyloid-like fibrils. Copyright © 2016 Elsevier B.V. All rights reserved.

  9. Parabolic section and distance excess of space curves applied to protein structure classification

    DEFF Research Database (Denmark)

    Røgen, Peter; Karlsson, Per W.

    2008-01-01

    Proteins are long chain molecules that fold up into beautiful and complicated three-dimensional structures before fulfilling their biological functions in the living organisms. With the aim of providing an efficient tool for describing the proteins' native folds, we present a global geometric mea...... measure of a space curve. This geometric measure allows us to define descriptors of protein structures that quantify how parallel the secondary structure elements of a protein are. These descriptors are C-2 in deformations of the protein structure, are evaluated very fast and reliably...

  10. Design of tryptophan-containing mutants of the symmetrical Pizza protein for biophysical studies.

    Science.gov (United States)

    Noguchi, Hiroki; Mylemans, Bram; De Zitter, Elke; Van Meervelt, Luc; Tame, Jeremy R H; Voet, Arnout

    2018-03-18

    β-propeller proteins are highly symmetrical, being composed of a repeated motif with four anti-parallel β-sheets arranged around a central axis. Recently we designed the first completely symmetrical β-propeller protein, Pizza6, consisting of six identical tandem repeats. Pizza6 is expected to prove a useful building block for bionanotechnology, and also a tool to investigate the folding and evolution of β-propeller proteins. Folding studies are made difficult by the high stability and the lack of buried Trp residues to act as monitor fluorophores, so we have designed and characterized several Trp-containing Pizza6 derivatives. In total four proteins were designed, of which three could be purified and characterized. Crystal structures confirm these mutant proteins maintain the expected structure, and a clear redshift of Trp fluorescence emission could be observed upon denaturation. Among the derivative proteins, Pizza6-AYW appears to be the most suitable model protein for future folding/unfolding kinetics studies as it has a comparable stability as natural β-propeller proteins. Copyright © 2018 Elsevier Inc. All rights reserved.

  11. Structures of two Arabidopsis thaliana major latex proteins represent novel helix-grip folds

    Energy Technology Data Exchange (ETDEWEB)

    Lytle, Betsy L.; Song, Jikui; de la Cruz, Norberto B.; Peterson, Francis C.; Johnson, Kenneth A.; Bingman, Craig A.; Phillips, Jr., George N.; Volkman, Brian F.; (MCW); (UW)

    2009-06-02

    Here we report the first structures of two major latex proteins (MLPs) which display unique structural differences from the canonical Bet v 1 fold described earlier. MLP28 (SwissProt/TrEMBL ID Q9SSK9), the product of gene At1g70830.1, and the At1g24000.1 gene product (Swiss- Prot/TrEMBL ID P0C0B0), proteins which share 32% sequence identity, were independently selected as foldspace targets by the Center for Eukaryotic Structural Genomics. The structure of a single domain (residues 17-173) of MLP28 was solved by NMR spectroscopy, while the full-length At1g24000.1 structure was determined by X-ray crystallography. MLP28 displays greater than 30% sequence identity to at least eight MLPs from other species. For example, the MLP28 sequence shares 64% identity to peach Pp-MLP119 and 55% identity to cucumber Csf2.20 In contrast, the At1g24000.1 sequence is highly divergent (see Fig. 1), containing a gap of 33 amino acids when compared with all other known MLPs. Even when the gap is excluded, the sequence identity with MLPs from other species is less than 30%. Unlike some of the MLPs from other species, none of the A. thaliana MLPs have been characterized biochemically. We show by NMR chemical shift mapping that At1g24000.1 binds progesterone, demonstrating that despite its sequence dissimilarity, the hydrophobic binding pocket is conserved and, therefore, may play a role in its biological function and that of the MLP family in general.

  12. Random heteropolymers preserve protein function in foreign environments

    Science.gov (United States)

    Panganiban, Brian; Qiao, Baofu; Jiang, Tao; DelRe, Christopher; Obadia, Mona M.; Nguyen, Trung Dac; Smith, Anton A. A.; Hall, Aaron; Sit, Izaac; Crosby, Marquise G.; Dennis, Patrick B.; Drockenmuller, Eric; Olvera de la Cruz, Monica; Xu, Ting

    2018-03-01

    The successful incorporation of active proteins into synthetic polymers could lead to a new class of materials with functions found only in living systems. However, proteins rarely function under the conditions suitable for polymer processing. On the basis of an analysis of trends in protein sequences and characteristic chemical patterns on protein surfaces, we designed four-monomer random heteropolymers to mimic intrinsically disordered proteins for protein solubilization and stabilization in non-native environments. The heteropolymers, with optimized composition and statistical monomer distribution, enable cell-free synthesis of membrane proteins with proper protein folding for transport and enzyme-containing plastics for toxin bioremediation. Controlling the statistical monomer distribution in a heteropolymer, rather than the specific monomer sequence, affords a new strategy to interface with biological systems for protein-based biomaterials.

  13. Functional advantages of dynamic protein disorder.

    Science.gov (United States)

    Berlow, Rebecca B; Dyson, H Jane; Wright, Peter E

    2015-09-14

    Intrinsically disordered proteins participate in many important cellular regulatory processes. The absence of a well-defined structure in the free state of a disordered domain, and even on occasion when it is bound to physiological partners, is fundamental to its function. Disordered domains are frequently the location of multiple sites for post-translational modification, the key element of metabolic control in the cell. When a disordered domain folds upon binding to a partner, the resulting complex buries a far greater surface area than in an interaction of comparably-sized folded proteins, thus maximizing specificity at modest protein size. Disorder also maintains accessibility of sites for post-translational modification. Because of their inherent plasticity, disordered domains frequently adopt entirely different structures when bound to different partners, increasing the repertoire of available interactions without the necessity for expression of many different proteins. This feature also adds to the faithfulness of cellular regulation, as the availability of a given disordered domain depends on competition between various partners relevant to different cellular processes. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  14. Step-wise refolding of recombinant proteins.

    Science.gov (United States)

    Tsumoto, Kouhei; Arakawa, Tsutomu; Chen, Linda

    2010-04-01

    Protein refolding is still on trial-and-error basis. Here we describe step-wise dialysis refolding, in which denaturant concentration is altered in step-wise fashion. This technology controls the folding pathway by adjusting the concentrations of the denaturant and other solvent additives to induce sequential folding or disulfide formation.

  15. Biophysical and structural considerations for protein sequence evolution

    Directory of Open Access Journals (Sweden)

    Grahnen Johan A

    2011-12-01

    Full Text Available Abstract Background Protein sequence evolution is constrained by the biophysics of folding and function, causing interdependence between interacting sites in the sequence. However, current site-independent models of sequence evolutions do not take this into account. Recent attempts to integrate the influence of structure and biophysics into phylogenetic models via statistical/informational approaches have not resulted in expected improvements in model performance. This suggests that further innovations are needed for progress in this field. Results Here we develop a coarse-grained physics-based model of protein folding and binding function, and compare it to a popular informational model. We find that both models violate the assumption of the native sequence being close to a thermodynamic optimum, causing directional selection away from the native state. Sampling and simulation show that the physics-based model is more specific for fold-defining interactions that vary less among residue type. The informational model diffuses further in sequence space with fewer barriers and tends to provide less support for an invariant sites model, although amino acid substitutions are generally conservative. Both approaches produce sequences with natural features like dN/dS Conclusions Simple coarse-grained models of protein folding can describe some natural features of evolving proteins but are currently not accurate enough to use in evolutionary inference. This is partly due to improper packing of the hydrophobic core. We suggest possible improvements on the representation of structure, folding energy, and binding function, as regards both native and non-native conformations, and describe a large number of possible applications for such a model.

  16. Molecular mechanics work station for protein conformational studies

    International Nuclear Information System (INIS)

    Fine, R.; Levinthal, C.; Schoenborn, B.; Dimmier, G.; Rankowitz, C.

    1984-01-01

    Interest in computational problems in Biology has intensified over the last few years, partly due to the development of techniques for the rapid cloning, sequencing, and mutagenesis of genes from organisims ranging from E. coli to Man. The central dogma of molecular biology; that DNA codes for mRNA which codes for protein, has been understood in a linear programming sense since the genetic code was cracked. But what is not understood at present is how a protein, once assembled as a long sequence of amino acids, folds back on itself to produce a three-dimensional structure which is unique to that protein and which dictates its chemical and biological activity. This folding process is purely physics, and involves the time evolution of a system of several thousand atoms which interact with each other and with atoms from the surrounding solvent. Molecular dynamics simulations on smaller molecules suggest that approaches which treat the protein as a classical ensemble of atoms interacting with each other via an empirical Hamiltonian can yield the kind of predictive results one would like when applied to proteins

  17. Context-dependent protein folding of a virulence peptide in the bacterial and host environments: structure of an SycH–YopH chaperone–effector complex

    International Nuclear Information System (INIS)

    Vujanac, Milos; Stebbins, C. Erec

    2013-01-01

    The structure of a SycH–YopH chaperone–effector complex from Yersinia reveals the bacterial state of a protein that adopts different folds in the host and pathogen environments. Yersinia pestis injects numerous bacterial proteins into host cells through an organic nanomachine called the type 3 secretion system. One such substrate is the tyrosine phosphatase YopH, which requires an interaction with a cognate chaperone in order to be effectively injected. Here, the first crystal structure of a SycH–YopH complex is reported, determined to 1.9 Å resolution. The structure reveals the presence of (i) a nonglobular polypeptide in YopH, (ii) a so-called β-motif in YopH and (iii) a conserved hydrophobic patch in SycH that recognizes the β-motif. Biochemical studies establish that the β-motif is critical to the stability of this complex. Finally, since previous work has shown that the N-terminal portion of YopH adopts a globular fold that is functional in the host cell, aspects of how this polypeptide adopts radically different folds in the host and in the bacterial environments are analysed

  18. The simulation approach to lipid-protein interactions.

    Science.gov (United States)

    Paramo, Teresa; Garzón, Diana; Holdbrook, Daniel A; Khalid, Syma; Bond, Peter J

    2013-01-01

    The interactions between lipids and proteins are crucial for a range of biological processes, from the folding and stability of membrane proteins to signaling and metabolism facilitated by lipid-binding proteins. However, high-resolution structural details concerning functional lipid/protein interactions are scarce due to barriers in both experimental isolation of native lipid-bound complexes and subsequent biophysical characterization. The molecular dynamics (MD) simulation approach provides a means to complement available structural data, yielding dynamic, structural, and thermodynamic data for a protein embedded within a physiologically realistic, modelled lipid environment. In this chapter, we provide a guide to current methods for setting up and running simulations of membrane proteins and soluble, lipid-binding proteins, using standard atomistically detailed representations, as well as simplified, coarse-grained models. In addition, we outline recent studies that illustrate the power of the simulation approach in the context of biologically relevant lipid/protein interactions.

  19. A highly compliant protein native state with a spontaneous-like mechanical unfolding pathway

    DEFF Research Database (Denmark)

    Heiðarsson, Pétur Orri; Valpapuram, Immanuel; Camilloni, Carlo

    2012-01-01

    The mechanical properties of proteins and their force-induced structural changes play key roles in many biological processes. Previous studies have shown that natively folded proteins are brittle under tension, unfolding after small mechanical deformations, while partially folded intermediate...... states, such as molten globules, are compliant and can deform elastically a great amount before crossing the transition state barrier. Moreover, under tension proteins appear to unfold through a different sequence of events than during spontaneous unfolding. Here, we describe the response to force...... of the four-α-helix acyl-CoA binding protein (ACBP) in the low-force regime using optical tweezers and ratcheted molecular dynamics simulations. The results of our studies reveal an unprecedented mechanical behavior of a natively folded protein. ACBP displays an atypical compliance along two nearly orthogonal...

  20. Systematic identification of yeast proteins extracted into model wine during aging on the yeast lees.

    Science.gov (United States)

    Rowe, Jeffrey D; Harbertson, James F; Osborne, James P; Freitag, Michael; Lim, Juyun; Bakalinsky, Alan T

    2010-02-24

    Total protein and protein-associated mannan concentrations were measured, and individual proteins were identified during extraction into model wines over 9 months of aging on the yeast lees following completion of fermentations by seven wine strains of Saccharomyces cerevisiae. In aged wines, protein-associated mannan increased about 6-fold (+/-66%), while total protein only increased 2-fold (+/-20%), which resulted in a significantly greater protein-associated mannan/total protein ratio for three strains. A total of 219 proteins were identified among all wine samples taken over the entire time course. Of the 17 "long-lived" proteins detected in all 9 month samples, 13 were cell wall mannoproteins, and four were glycolytic enzymes. Most cytosolic proteins were not detected after 6 months. Native mannosylated yeast invertase was assayed for binding to wine tannin and was found to have a 10-fold lower affinity than nonglycosylated bovine serum albumin. Enrichment of mannoproteins in the aged model wines implies greater solution stability than other yeast proteins and the possibility that their contributions to wine quality may persist long after bottling.

  1. WW Domain Folding Complexity Revealed by Infrared Spectroscopy

    OpenAIRE

    Davis, Caitlin M.; Dyer, R. Brian

    2014-01-01

    Although the intrinsic tryptophan fluorescence of proteins offers a convenient probe of protein folding, interpretation of the fluorescence spectrum is often difficult because it is sensitive to both global and local changes. Infrared (IR) spectroscopy offers a complementary measure of structural changes involved in protein folding, because it probes changes in the secondary structure of the protein backbone. Here we demonstrate the advantages of using multiple probes, infrared and fluorescen...

  2. The dynamic multisite interactions between two intrinsically disordered proteins

    KAUST Repository

    Wu, Shaowen; Wang, Dongdong; Liu, Jin; Feng, Yitao; Weng, Jingwei; Li, Yu; Gao, Xin; Liu, Jianwei; Wang, Wenning

    2017-01-01

    Protein interactions involving intrinsically disordered proteins (IDPs) comprise a variety of binding modes, from the well characterized folding upon binding to dynamic fuzzy complex. To date, most studies concern the binding of an IDP to a

  3. Specific interaction of central nervous system myelin basic protein with lipids effects of basic protein on glucose leakage from liposomes

    NARCIS (Netherlands)

    Gould, R.M.; London, Y.

    1972-01-01

    The leakage from liposomes preloaded with glucose was continuously monitored in a Perkin-Elmer Model 356 dual beam spectrophotometer using an enzyme-linked assay system. The central nervous system myelin basic protein (A1 protein) caused a 3–4-fold increase in the rate of leakage from liposomes

  4. Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces

    Directory of Open Access Journals (Sweden)

    Gorin Andrey A

    2008-05-01

    Full Text Available Abstract Background Protein-protein interactions are ubiquitous and essential for all cellular processes. High-resolution X-ray crystallographic structures of protein complexes can reveal the details of their function and provide a basis for many computational and experimental approaches. Differentiation between biological and non-biological contacts and reconstruction of the intact complex is a challenging computational problem. A successful solution can provide additional insights into the fundamental principles of biological recognition and reduce errors in many algorithms and databases utilizing interaction information extracted from the Protein Data Bank (PDB. Results We have developed a method for identifying protein complexes in the PDB X-ray structures by a four step procedure: (1 comprehensively collecting all protein-protein interfaces; (2 clustering similar protein-protein interfaces together; (3 estimating the probability that each cluster is relevant based on a diverse set of properties; and (4 combining these scores for each PDB entry in order to predict the complex structure. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionary conserved protein-protein interactions. These interfaces, as well as the predicted protein complexes, are available from the Protein Interface Server (PInS website (see Availability and requirements section. Conclusion Our method demonstrates an almost two-fold reduction of the annotation error rate as evaluated on a large benchmark set of complexes validated from the literature. We also estimate relative contributions of each interface property to the accurate discrimination of biologically relevant interfaces and discuss possible directions for further improving the prediction method.

  5. Comparative analysis of twin-arginine (Tat)-dependent protein secretion of a heterologous model protein (GFP) in three different Gram-positive bacteria

    NARCIS (Netherlands)

    Meissner, Daniel; Vollstedt, Angela; van Dijl, Jan Maarten; Freudl, Roland

    In contrast to the general protein secretion (Sec) system, the twin-arginine translocation (Tat) export pathway allows the translocation of proteins across the bacterial plasma membrane in a fully folded conformation. Due to this feature, the Tat pathway provides an attractive alternative to the

  6. The N-terminal sequence of ribosomal protein L10 from the archaebacterium Halobacterium marismortui and its relationship to eubacterial protein L6 and other ribosomal proteins.

    Science.gov (United States)

    Dijk, J; van den Broek, R; Nasiulas, G; Beck, A; Reinhardt, R; Wittmann-Liebold, B

    1987-08-01

    The amino-terminal sequence of ribosomal protein L10 from Halobacterium marismortui has been determined up to residue 54, using both a liquid- and a gas-phase sequenator. The two sequences are in good agreement. The protein is clearly homologous to protein HcuL10 from the related strain Halobacterium cutirubrum. Furthermore, a weaker but distinct homology to ribosomal protein L6 from Escherichia coli and Bacillus stearothermophilus can be detected. In addition to 7 identical amino acids in the first 36 residues in all four sequences a number of conservative replacements occurs, of mainly hydrophobic amino acids. In this common region the pattern of conserved amino acids suggests the presence of a beta-alpha fold as it occurs in ribosomal proteins L12 and L30. Furthermore, several potential cases of homology to other ribosomal components of the three ur-kingdoms have been found.

  7. Knotting and unknotting proteins in the chaperonin cage: Effects of the excluded volume.

    Directory of Open Access Journals (Sweden)

    Szymon Niewieczerzal

    Full Text Available Molecular dynamics simulations are used to explore the effects of chaperonin-like cages on knotted proteins with very low sequence similarity, different depths of a knot but with a similar fold, and the same type of topology. The investigated proteins are VirC2, DndE and MJ0366 with two depths of a knot. A comprehensive picture how encapsulation influences folding rates is provided based on the analysis of different cage sizes and temperature conditions. Neither of these two effects with regard to knotted proteins has been studied by means of molecular dynamics simulations with coarse-grained structure-based models before. We show that encapsulation in a chaperonin is sufficient to self-tie and untie small knotted proteins (VirC2, DndE, for which the equilibrium process is not accessible in the bulk solvent. Furthermore, we find that encapsulation reduces backtracking that arises from the destabilisation of nucleation sites, smoothing the free energy landscape. However, this effect can also be coupled with temperature rise. Encapsulation facilitates knotting at the early stage of folding and can enhance an alternative folding route. Comparison to unknotted proteins with the same fold shows directly how encapsulation influences the free energy landscape. In addition, we find that as the size of the cage decreases, folding times increase almost exponentially in a certain range of cage sizes, in accordance with confinement theory and experimental data for unknotted proteins.

  8. New variants of known folds: do they bring new biology?

    International Nuclear Information System (INIS)

    Koonin, Eugene V.

    2010-01-01

    New distinct versions of known protein folds provide a powerful means of protein-function prediction that complements sequence and genomic context analysis. New distinct versions of known protein folds provide a powerful means of protein-function prediction that complements sequence and genomic context analysis. These structures do not supplant direct biochemical experiments, but are indispensable for the complete characterization of proteins

  9. Energy landscape of all-atom protein-protein interactions revealed by multiscale enhanced sampling.

    Directory of Open Access Journals (Sweden)

    Kei Moritsugu

    2014-10-01

    Full Text Available Protein-protein interactions are regulated by a subtle balance of complicated atomic interactions and solvation at the interface. To understand such an elusive phenomenon, it is necessary to thoroughly survey the large configurational space from the stable complex structure to the dissociated states using the all-atom model in explicit solvent and to delineate the energy landscape of protein-protein interactions. In this study, we carried out a multiscale enhanced sampling (MSES simulation of the formation of a barnase-barstar complex, which is a protein complex characterized by an extraordinary tight and fast binding, to determine the energy landscape of atomistic protein-protein interactions. The MSES adopts a multicopy and multiscale scheme to enable for the enhanced sampling of the all-atom model of large proteins including explicit solvent. During the 100-ns MSES simulation of the barnase-barstar system, we observed the association-dissociation processes of the atomistic protein complex in solution several times, which contained not only the native complex structure but also fully non-native configurations. The sampled distributions suggest that a large variety of non-native states went downhill to the stable complex structure, like a fast folding on a funnel-like potential. This funnel landscape is attributed to dominant configurations in the early stage of the association process characterized by near-native orientations, which will accelerate the native inter-molecular interactions. These configurations are guided mostly by the shape complementarity between barnase and barstar, and lead to the fast formation of the final complex structure along the downhill energy landscape.

  10. Cell penetrating peptides to dissect host-pathogen protein-protein interactions in Theileria -transformed leukocytes

    KAUST Repository

    Haidar, Malak

    2017-09-08

    One powerful application of cell penetrating peptides is the delivery into cells of molecules that function as specific competitors or inhibitors of protein-protein interactions. Ablating defined protein-protein interactions is a refined way to explore their contribution to a particular cellular phenotype in a given disease context. Cell-penetrating peptides can be synthetically constrained through various chemical modifications that stabilize a given structural fold with the potential to improve competitive binding to specific targets. Theileria-transformed leukocytes display high PKA activity, but PKAis an enzyme that plays key roles in multiple cellular processes; consequently genetic ablation of kinase activity gives rise to a myriad of confounding phenotypes. By contrast, ablation of a specific kinase-substrate interaction has the potential to give more refined information and we illustrate this here by describing how surgically ablating PKA interactions with BAD gives precise information on the type of glycolysis performed by Theileria-transformed leukocytes. In addition, we provide two other examples of how ablating specific protein-protein interactions in Theileria-infected leukocytes leads to precise phenotypes and argue that constrained penetrating peptides have great therapeutic potential to combat infectious diseases in general.

  11. Dissociation of activated protein C functions by elimination of protein S cofactor enhancement.

    LENUS (Irish Health Repository)

    Harmon, Shona

    2008-11-07

    Activated protein C (APC) plays a critical anticoagulant role in vivo by inactivating procoagulant factor Va and factor VIIIa and thus down-regulating thrombin generation. In addition, APC bound to the endothelial cell protein C receptor can initiate protease-activated receptor-1 (PAR-1)-mediated cytoprotective signaling. Protein S constitutes a critical cofactor for the anticoagulant function of APC but is not known to be involved in regulating APC-mediated protective PAR-1 signaling. In this study we utilized a site-directed mutagenesis strategy to characterize a putative protein S binding region within the APC Gla domain. Three single amino acid substitutions within the APC Gla domain (D35T, D36A, and A39V) were found to mildly impair protein S-dependent anticoagulant activity (<2-fold) but retained entirely normal cytoprotective activity. However, a single amino acid substitution (L38D) ablated the ability of protein S to function as a cofactor for this APC variant. Consequently, in assays of protein S-dependent factor Va proteolysis using purified proteins or in the plasma milieu, APC-L38D variant exhibited minimal residual anticoagulant activity compared with wild type APC. Despite the location of Leu-38 in the Gla domain, APC-L38D interacted normally with endothelial cell protein C receptor and retained its ability to trigger PAR-1 mediated cytoprotective signaling in a manner indistinguishable from that of wild type APC. Consequently, elimination of protein S cofactor enhancement of APC anticoagulant function represents a novel and effective strategy by which to separate the anticoagulant and cytoprotective functions of APC for potential therapeutic gain.

  12. Perceptron learning of pairwise contact energies for proteins incorporating the amino acid environment

    Science.gov (United States)

    Heo, Muyoung; Kim, Suhkmann; Moon, Eun-Joung; Cheon, Mookyung; Chung, Kwanghoon; Chang, Iksoo

    2005-07-01

    Although a coarse-grained description of proteins is a simple and convenient way to attack the protein folding problem, the construction of a global pairwise energy function which can simultaneously recognize the native folds of many proteins has resulted in partial success. We have sought the possibility of a systematic improvement of this pairwise-contact energy function as we extended the parameter space of amino acids, incorporating local environments of amino acids, beyond a 20×20 matrix. We have studied the pairwise contact energy functions of 20×20 , 60×60 , and 180×180 matrices depending on the extent of parameter space, and compared their effect on the learnability of energy parameters in the context of a gapless threading, bearing in mind that a 20×20 pairwise contact matrix has been shown to be too simple to recognize the native folds of many proteins. In this paper, we show that the construction of a global pairwise energy function was achieved using 1006 training proteins of a homology of less than 30%, which include all representatives of different protein classes. After parametrizing the local environments of the amino acids into nine categories depending on three secondary structures and three kinds of hydrophobicity (desolvation), the 16290 pairwise contact energies (scores) of the amino acids could be determined by perceptron learning and protein threading. These could simultaneously recognize all the native folds of the 1006 training proteins. When these energy parameters were tested on the 382 test proteins of a homology of less than 90%, 370 (96.9%) proteins could recognize their native folds. We set up a simple thermodynamic framework in the conformational space of decoys to calculate the unfolded fraction and the specific heat of real proteins. The different thermodynamic stabilities of E.coli ribonuclease H (RNase H) and its mutants were well described in our calculation, agreeing with the experiment.

  13. Measurement of glutathione-protein mixed disulfides

    International Nuclear Information System (INIS)

    Livesey, J.C.; Reed, D.J.

    1984-01-01

    The development of a sensitive and highly specific assay for the presence of mixed disulfides between protein thiol groups and endogenous thiols has been undertaken. Previous investigations on the concentrations of glutathione (GSH), glutathione disulfide (GSSG) and protein glutathione mixed disulfides (ProSSG) have been of limited usefulness because of the poor specificity of the assays used. Our assay for these forms of glutathione is based on high performance liquid chromatography (HPLC) and is an extension of an earlier method. After perchloric acid precipitation, the protein sample is washed with an organic solvent to fully denature the protein. Up to a 10-fold increase in GSH released from fetal bovine serum (FBS) protein has been found when the protein precipitate is washed with ethanol rather than ether, as earlier suggested. Similar effects have been observed with an as yet unidentified thiol which elutes in the chromatography system with a retention volume similar to cysteine

  14. Structures composing protein domains.

    Science.gov (United States)

    Kubrycht, Jaroslav; Sigler, Karel; Souček, Pavel; Hudeček, Jiří

    2013-08-01

    This review summarizes available data concerning intradomain structures (IS) such as functionally important amino acid residues, short linear motifs, conserved or disordered regions, peptide repeats, broadly occurring secondary structures or folds, etc. IS form structural features (units or elements) necessary for interactions with proteins or non-peptidic ligands, enzyme reactions and some structural properties of proteins. These features have often been related to a single structural level (e.g. primary structure) mostly requiring certain structural context of other levels (e.g. secondary structures or supersecondary folds) as follows also from some examples reported or demonstrated here. In addition, we deal with some functionally important dynamic properties of IS (e.g. flexibility and different forms of accessibility), and more special dynamic changes of IS during enzyme reactions and allosteric regulation. Selected notes concern also some experimental methods, still more necessary tools of bioinformatic processing and clinically interesting relationships. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  15. Endoplasmic reticulum proteins SDF2 and SDF2L1 act as components of the BiP chaperone cycle to prevent protein aggregation.

    Science.gov (United States)

    Fujimori, Tsutomu; Suno, Ryoji; Iemura, Shun-Ichiro; Natsume, Tohru; Wada, Ikuo; Hosokawa, Nobuko

    2017-08-01

    The folding of newly synthesized proteins in the endoplasmic reticulum (ER) is assisted by ER-resident chaperone proteins. BiP (immunoglobulin heavy-chain-binding protein), a member of the HSP70 family, plays a central role in protein quality control. The chaperone function of BiP is regulated by its intrinsic ATPase activity, which is stimulated by ER-resident proteins of the HSP40/DnaJ family, including ERdj3. Here, we report that two closely related proteins, SDF2 and SDF2L1, regulate the BiP chaperone cycle. Both are ER-resident, but SDF2 is constitutively expressed, whereas SDF2L1 expression is induced by ER stress. Both luminal proteins formed a stable complex with ERdj3 and potently inhibited the aggregation of different types of misfolded ER cargo. These proteins associated with non-native proteins, thus promoting the BiP-substrate interaction cycle. A dominant-negative ERdj3 mutant that inhibits the interaction between ERdj3 and BiP prevented the dissociation of misfolded cargo from the ERdj3-SDF2L1 complex. Our findings indicate that SDF2 and SDF2L1 associate with ERdj3 and act as components in the BiP chaperone cycle to prevent the aggregation of misfolded proteins, partly explaining the broad folding capabilities of the ER under various physiological conditions. © 2017 Molecular Biology Society of Japan and John Wiley & Sons Australia, Ltd.

  16. PROTEINS IN VACUO . A MORE EFFICIENT MEANS OF ...

    African Journals Online (AJOL)

    With the aim of understanding solvent effects in protein folding, unfolding, stability and dynamic behavior, studies of protein ions in vacuo have become popular in recent years. One experimental descriptor which gives a general overview of ionic structure is the orientationally-averaged collision cross section , which is ...

  17. Fragment-Based Protein-Protein Interaction Antagonists of a Viral Dimeric Protease.

    Science.gov (United States)

    Gable, Jonathan E; Lee, Gregory M; Acker, Timothy M; Hulce, Kaitlin R; Gonzalez, Eric R; Schweigler, Patrick; Melkko, Samu; Farady, Christopher J; Craik, Charles S

    2016-04-19

    Fragment-based drug discovery has shown promise as an approach for challenging targets such as protein-protein interfaces. We developed and applied an activity-based fragment screen against dimeric Kaposi's sarcoma-associated herpesvirus protease (KSHV Pr) using an optimized fluorogenic substrate. Dose-response determination was performed as a confirmation screen, and NMR spectroscopy was used to map fragment inhibitor binding to KSHV Pr. Kinetic assays demonstrated that several initial hits also inhibit human cytomegalovirus protease (HCMV Pr). Binding of these hits to HCMV Pr was also confirmed by NMR spectroscopy. Despite the use of a target-agnostic fragment library, more than 80 % of confirmed hits disrupted dimerization and bound to a previously reported pocket at the dimer interface of KSHV Pr, not to the active site. One class of fragments, an aminothiazole scaffold, was further explored using commercially available analogues. These compounds demonstrated greater than 100-fold improvement of inhibition. This study illustrates the power of fragment-based screening for these challenging enzymatic targets and provides an example of the potential druggability of pockets at protein-protein interfaces. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Radioimmunoassay for pregnancy-associated plasma protein A

    International Nuclear Information System (INIS)

    Sinosich, M.J.; Teisner, B.; Folkerson, J.; Saunders, D.M.; Grudzinskas, J.G.

    1982-01-01

    A specific and highly sensitive radioimmunoassay for determination of pregnancy-associated plasma protein A in human serum is described. The minimum detection limit for this protein was 2.9 μg/L. The within- and between-assay coefficients of variation were 4.0 and 4.5%, respectively. The circulating protein was detected within 32 days of conception in eight normal pregnancies and within 21 days in a twin pregnancy. Circulating concentrations in the mother at term were consistently higher (10-fold) than in matched amniotic fluid; none was detected in the umbilical circulation. This protein was also detected in the circulation of patients with hydatidiform mole. This assay will permit investigations into the clinical evaluation of measurements of the protein during early pregnancy and trophoblastic disease

  19. Recognition determinants for proteins and antibiotics within 23S rRNA

    DEFF Research Database (Denmark)

    Douthwaite, Stephen Roger; Voldborg, Bjørn Gunnar Rude; Hansen, Lykke Haastrup

    1995-01-01

    Ribosomal RNAs fold into phylogenetically conserved secondary and tertiary structures that determine their function in protein synthesis. We have investigated Escherichia coli 23S rRNA to identify structural elements that interact with antibiotic and protein ligands. Using a combination of molecu......Ribosomal RNAs fold into phylogenetically conserved secondary and tertiary structures that determine their function in protein synthesis. We have investigated Escherichia coli 23S rRNA to identify structural elements that interact with antibiotic and protein ligands. Using a combination......-proteins L10.(L12)4 and L11 and is inhibited by interaction with the antibiotic thiostrepton. The peptidyltransferase center within domain V is inhibited by macrolide, lincosamide, and streptogramin B antibiotics, which interact with the rRNA around nucleotide A2058. Drug resistance is conferred by mutations...

  20. Heat shock response improves heterologous protein secretion in Saccharomyces cerevisiae

    DEFF Research Database (Denmark)

    Hou, Jin; Österlund, Tobias; Liu, Zihe

    2013-01-01

    The yeast Saccharomyces cerevisiae is a widely used platform for the production of heterologous proteins of medical or industrial interest. However, heterologous protein productivity is often low due to limitations of the host strain. Heat shock response (HSR) is an inducible, global, cellular...... stress response, which facilitates the cell recovery from many forms of stress, e.g., heat stress. In S. cerevisiae, HSR is regulated mainly by the transcription factor heat shock factor (Hsf1p) and many of its targets are genes coding for molecular chaperones that promote protein folding and prevent...... the accumulation of mis-folded or aggregated proteins. In this work, we over-expressed a mutant HSF1 gene HSF1-R206S which can constitutively activate HSR, so the heat shock response was induced at different levels, and we studied the impact of HSR on heterologous protein secretion. We found that moderate and high...

  1. Effect of the unfolded protein response on ER protein export: a potential new mechanism to relieve ER stress.

    Science.gov (United States)

    Shaheen, Alaa

    2018-05-05

    The unfolded protein response (UPR) is an adaptive cellular response that aims to relieve endoplasmic reticulum (ER) stress via several mechanisms, including inhibition of protein synthesis and enhancement of protein folding and degradation. There is a controversy over the effect of the UPR on ER protein export. While some investigators suggested that ER export is inhibited during ER stress, others suggested the opposite. In this article, their conflicting studies are analyzed and compared in attempt to solve this controversy. The UPR appears indeed to enhance ER export, possibly via multiple mechanisms. However, another factor, which is the integrity of the folding machinery/environment inside ER, determines whether ER export will appear increased or decreased during experimentation. Also, different methods of stress induction appear to have different effects on ER export. Thus, improvement of ER export may represent a new mechanism by which the UPR alleviates ER stress. This may help researchers to understand how the UPR works inside cells and how to manipulate it to alter cell fate during stress, either to promote cell survival or death. This may open up new approaches for the treatment of ER stress-related diseases.

  2. RosettaTMH: a method for membrane protein structure elucidation combining EPR distance restraints with assembly of transmembrane helices

    Directory of Open Access Journals (Sweden)

    Andrew Leaver-Fay

    2015-12-01

    Full Text Available Membrane proteins make up approximately one third of all proteins, and they play key roles in a plethora of physiological processes. However, membrane proteins make up less than 2% of experimentally determined structures, despite significant advances in structure determination methods, such as X-ray crystallography, nuclear magnetic resonance spectroscopy, and cryo-electron microscopy. One potential alternative means of structure elucidation is to combine computational methods with experimental EPR data. In 2011, Hirst and others introduced RosettaEPR and demonstrated that this approach could be successfully applied to fold soluble proteins. Furthermore, few computational methods for de novo folding of integral membrane proteins have been presented. In this work, we present RosettaTMH, a novel algorithm for structure prediction of helical membrane proteins. A benchmark set of 34 proteins, in which the proteins ranged in size from 91 to 565 residues, was used to compare RosettaTMH to Rosetta’s two existing membrane protein folding protocols: the published RosettaMembrane folding protocol (“MembraneAbinitio” and folding from an extended chain (“ExtendedChain”. When EPR distance restraints are used, RosettaTMH+EPR outperforms ExtendedChain+EPR for 11 proteins, including the largest six proteins tested. RosettaTMH+EPR is capable of achieving native-like folds for 30 of 34 proteins tested, including receptors and transporters. For example, the average RMSD100SSE relative to the crystal structure for rhodopsin was 6.1 ± 0.4 Å and 6.5 ± 0.6 Å for the 449-residue nitric oxide reductase subunit B, where the standard deviation reflects variance in RMSD100SSE values across ten different EPR distance restraint sets. The addition of RosettaTMH and RosettaTMH+EPR to the Rosetta family of de novo folding methods broadens the scope of helical membrane proteins that can be accurately modeled with this software suite.

  3. Probing Protein Structure and Folding in the Gas Phase by Electron Capture Dissociation

    Science.gov (United States)

    Schennach, Moritz; Breuker, Kathrin

    2015-07-01

    The established methods for the study of atom-detailed protein structure in the condensed phases, X-ray crystallography and nuclear magnetic resonance spectroscopy, have recently been complemented by new techniques by which nearly or fully desolvated protein structures are probed in gas-phase experiments. Electron capture dissociation (ECD) is unique among these as it provides residue-specific, although indirect, structural information. In this Critical Insight article, we discuss the development of ECD for the structural probing of gaseous protein ions, its potential, and limitations.

  4. Cellular Handling of Protein Aggregates by Disaggregation Machines.

    Science.gov (United States)

    Mogk, Axel; Bukau, Bernd; Kampinga, Harm H

    2018-01-18

    Both acute proteotoxic stresses that unfold proteins and expression of disease-causing mutant proteins that expose aggregation-prone regions can promote protein aggregation. Protein aggregates can interfere with cellular processes and deplete factors crucial for protein homeostasis. To cope with these challenges, cells are equipped with diverse folding and degradation activities to rescue or eliminate aggregated proteins. Here, we review the different chaperone disaggregation machines and their mechanisms of action. In all these machines, the coating of protein aggregates by Hsp70 chaperones represents the conserved, initializing step. In bacteria, fungi, and plants, Hsp70 recruits and activates Hsp100 disaggregases to extract aggregated proteins. In the cytosol of metazoa, Hsp70 is empowered by a specific cast of J-protein and Hsp110 co-chaperones allowing for standalone disaggregation activity. Both types of disaggregation machines are supported by small Hsps that sequester misfolded proteins. Copyright © 2018 Elsevier Inc. All rights reserved.

  5. Comparing side chain packing in soluble proteins, protein-protein interfaces, and transmembrane proteins.

    Science.gov (United States)

    Gaines, J C; Acebes, S; Virrueta, A; Butler, M; Regan, L; O'Hern, C S

    2018-05-01

    We compare side chain prediction and packing of core and non-core regions of soluble proteins, protein-protein interfaces, and transmembrane proteins. We first identified or created comparable databases of high-resolution crystal structures of these 3 protein classes. We show that the solvent-inaccessible cores of the 3 classes of proteins are equally densely packed. As a result, the side chains of core residues at protein-protein interfaces and in the membrane-exposed regions of transmembrane proteins can be predicted by the hard-sphere plus stereochemical constraint model with the same high prediction accuracies (>90%) as core residues in soluble proteins. We also find that for all 3 classes of proteins, as one moves away from the solvent-inaccessible core, the packing fraction decreases as the solvent accessibility increases. However, the side chain predictability remains high (80% within 30°) up to a relative solvent accessibility, rSASA≲0.3, for all 3 protein classes. Our results show that ≈40% of the interface regions in protein complexes are "core", that is, densely packed with side chain conformations that can be accurately predicted using the hard-sphere model. We propose packing fraction as a metric that can be used to distinguish real protein-protein interactions from designed, non-binding, decoys. Our results also show that cores of membrane proteins are the same as cores of soluble proteins. Thus, the computational methods we are developing for the analysis of the effect of hydrophobic core mutations in soluble proteins will be equally applicable to analyses of mutations in membrane proteins. © 2018 Wiley Periodicals, Inc.

  6. Malfolded protein structure and proteostasis in lung diseases.

    Science.gov (United States)

    Balch, William E; Sznajder, Jacob I; Budinger, Scott; Finley, Daniel; Laposky, Aaron D; Cuervo, Ana Maria; Benjamin, Ivor J; Barreiro, Esther; Morimoto, Richard I; Postow, Lisa; Weissman, Allan M; Gail, Dorothy; Banks-Schlegel, Susan; Croxton, Thomas; Gan, Weiniu

    2014-01-01

    Recent discoveries indicate that disorders of protein folding and degradation play a particularly important role in the development of lung diseases and their associated complications. The overarching purpose of the National Heart, Lung, and Blood Institute workshop on "Malformed Protein Structure and Proteostasis in Lung Diseases" was to identify mechanistic and clinical research opportunities indicated by these recent discoveries in proteostasis science that will advance our molecular understanding of lung pathobiology and facilitate the development of new diagnostic and therapeutic strategies for the prevention and treatment of lung disease. The workshop's discussion focused on identifying gaps in scientific knowledge with respect to proteostasis and lung disease, discussing new research advances and opportunities in protein folding science, and highlighting novel technologies with potential therapeutic applications for diagnosis and treatment.

  7. Intrinsically Disordered Proteins in a Physics-Based World

    Directory of Open Access Journals (Sweden)

    Jianhan Chen

    2010-12-01

    Full Text Available Intrinsically disordered proteins (IDPs are a newly recognized class of functional proteins that rely on a lack of stable structure for function. They are highly prevalent in biology, play fundamental roles, and are extensively involved in human diseases. For signaling and regulation, IDPs often fold into stable structures upon binding to specific targets. The mechanisms of these coupled binding and folding processes are of significant importance because they underlie the organization of regulatory networks that dictate various aspects of cellular decision-making. This review first discusses the challenge in detailed experimental characterization of these heterogeneous and dynamics proteins and the unique and exciting opportunity for physics-based modeling to make crucial contributions, and then summarizes key lessons from recent de novo simulations of the structure and interactions of several regulatory IDPs.

  8. [L-arginine metabolism enzyme activities in rat liver subcellular fractions under condition of protein deprivation].

    Science.gov (United States)

    Kopyl'chuk, G P; Buchkovskaia, I M

    2014-01-01

    The features of arginase and NO-synthase pathways of arginine's metabolism have been studied in rat liver subcellular fractions under condition of protein deprivation. During the experimental period (28 days) albino male rats were kept on semi synthetic casein diet AIN-93. The protein deprivation conditions were designed as total absence of protein in the diet and consumption of the diet partially deprived with 1/2 of the casein amount compared to in the regular diet. Daily diet consumption was regulated according to the pair feeding approach. It has been shown that the changes of enzyme activities, involved in L-arginine metabolism, were characterized by 1.4-1.7 fold decrease in arginase activity, accompanied with unchanged NO-synthase activity in cytosol. In mitochondrial fraction the unchanged arginase activity was accompanied by 3-5 fold increase of NO-synthase activity. At the terminal stages of the experiment the monodirectional dynamics in the studied activities have been observed in the mitochondrial and cytosolfractions in both experimental groups. In the studied subcellular fractions arginase activity decreased (2.4-2.7 fold with no protein in the diet and 1.5 fold with partly supplied protein) and was accompanied by NO-synthase activity increase by 3.8 fold in cytosole fraction, by 7.2 fold in mitochondrial fraction in the group with no protein in the diet and by 2.2 and 3.5 fold in the group partialy supplied with protein respectively. The observed tendency is presumably caused by the switch of L-arginine metabolism from arginase into oxidizing NO-synthase parthway.

  9. Evaluating the effects of cutoffs and treatment of long-range electrostatics in protein folding simulations.

    Directory of Open Access Journals (Sweden)

    Stefano Piana

    Full Text Available The use of molecular dynamics simulations to provide atomic-level descriptions of biological processes tends to be computationally demanding, and a number of approximations are thus commonly employed to improve computational efficiency. In the past, the effect of these approximations on macromolecular structure and stability has been evaluated mostly through quantitative studies of small-molecule systems or qualitative observations of short-timescale simulations of biological macromolecules. Here we present a quantitative evaluation of two commonly employed approximations, using a test system that has been the subject of a number of previous protein folding studies--the villin headpiece. In particular, we examined the effect of (i the use of a cutoff-based force-shifting technique rather than an Ewald summation for the treatment of electrostatic interactions, and (ii the length of the cutoff used to determine how many pairwise interactions are included in the calculation of both electrostatic and van der Waals forces. Our results show that the free energy of folding is relatively insensitive to the choice of cutoff beyond 9 Å, and to whether an Ewald method is used to account for long-range electrostatic interactions. In contrast, we find that the structural properties of the unfolded state depend more strongly on the two approximations examined here.

  10. Adaptive enhanced sampling with a path-variable for the simulation of protein folding and aggregation

    Science.gov (United States)

    Peter, Emanuel K.

    2017-12-01

    In this article, we present a novel adaptive enhanced sampling molecular dynamics (MD) method for the accelerated simulation of protein folding and aggregation. We introduce a path-variable L based on the un-biased momenta p and displacements dq for the definition of the bias s applied to the system and derive 3 algorithms: general adaptive bias MD, adaptive path-sampling, and a hybrid method which combines the first 2 methodologies. Through the analysis of the correlations between the bias and the un-biased gradient in the system, we find that the hybrid methodology leads to an improved force correlation and acceleration in the sampling of the phase space. We apply our method on SPC/E water, where we find a conservation of the average water structure. We then use our method to sample dialanine and the folding of TrpCage, where we find a good agreement with simulation data reported in the literature. Finally, we apply our methodologies on the initial stages of aggregation of a hexamer of Alzheimer's amyloid β fragment 25-35 (Aβ 25-35) and find that transitions within the hexameric aggregate are dominated by entropic barriers, while we speculate that especially the conformation entropy plays a major role in the formation of the fibril as a rate limiting factor.

  11. Amino acid code of protein secondary structure.

    Science.gov (United States)

    Shestopalov, B V

    2003-01-01

    -dimensional structure from the amino acid sequence, and the calculated secondary structure and codon strength distribution can be used for simulating the next step of protein folding; b) one can propose that the same secondary structures can be folded into different tertiary structures and, vice versa, different secondary structures can be folded into the same tertiary structures, provided codon distributions are considered also; c) codons can be considered as first elements of protein three-dimensional structure language.

  12. Proteomic data from human cell cultures refine mechanisms of chaperone-mediated protein homeostasis.

    Science.gov (United States)

    Finka, Andrija; Goloubinoff, Pierre

    2013-09-01

    In the crowded environment of human cells, folding of nascent polypeptides and refolding of stress-unfolded proteins is error prone. Accumulation of cytotoxic misfolded and aggregated species may cause cell death, tissue loss, degenerative conformational diseases, and aging. Nevertheless, young cells effectively express a network of molecular chaperones and folding enzymes, termed here "the chaperome," which can prevent formation of potentially harmful misfolded protein conformers and use the energy of adenosine triphosphate (ATP) to rehabilitate already formed toxic aggregates into native functional proteins. In an attempt to extend knowledge of chaperome mechanisms in cellular proteostasis, we performed a meta-analysis of human chaperome using high-throughput proteomic data from 11 immortalized human cell lines. Chaperome polypeptides were about 10% of total protein mass of human cells, half of which were Hsp90s and Hsp70s. Knowledge of cellular concentrations and ratios among chaperome polypeptides provided a novel basis to understand mechanisms by which the Hsp60, Hsp70, Hsp90, and small heat shock proteins (HSPs), in collaboration with cochaperones and folding enzymes, assist de novo protein folding, import polypeptides into organelles, unfold stress-destabilized toxic conformers, and control the conformal activity of native proteins in the crowded environment of the cell. Proteomic data also provided means to distinguish between stable components of chaperone core machineries and dynamic regulatory cochaperones.

  13. Competition between folding and glycosylation in the endoplasmic reticulum

    DEFF Research Database (Denmark)

    Holst, B; Bruun, A W; Kielland-Brandt, Morten

    1996-01-01

    Using carboxypeptidase Y in Saccharomyces cerevisiae as a model system, the in vivo relationship between protein folding and N-glycosylation was studied. Seven new sites for N-glycosylation were introduced at positions buried in the folded protein structure. The level of glycosylation of such new...... acceptor sites. In some cases, all the newly synthesized mutant protein was modified at the novel site while in others no modification took place. In the most interesting category of mutants, the level of glycosylation was dependent on the conditions for folding. This shows that folding and glycosylation...

  14. Structural entanglements in protein complexes

    Science.gov (United States)

    Zhao, Yani; Chwastyk, Mateusz; Cieplak, Marek

    2017-06-01

    We consider multi-chain protein native structures and propose a criterion that determines whether two chains in the system are entangled or not. The criterion is based on the behavior observed by pulling at both termini of each chain simultaneously in the two chains. We have identified about 900 entangled systems in the Protein Data Bank and provided a more detailed analysis for several of them. We argue that entanglement enhances the thermodynamic stability of the system but it may have other functions: burying the hydrophobic residues at the interface and increasing the DNA or RNA binding area. We also study the folding and stretching properties of the knotted dimeric proteins MJ0366, YibK, and bacteriophytochrome. These proteins have been studied theoretically in their monomeric versions so far. The dimers are seen to separate on stretching through the tensile mechanism and the characteristic unraveling force depends on the pulling direction.

  15. Pierced Lasso Proteins

    Science.gov (United States)

    Jennings, Patricia

    topologies, the threaded topology is formed by a covalent loop where part of the polypeptide chain is threaded through, forming what we term a PL. The advantage of a PL topology for fundamental studies, compared to other knotted proteins, is that the threaded topology can easily be manipulated to yield an unknotted state. Exploiting the oxidative state of the cysteines, the building blocks that form the disulphide bridge generating the covalent loop, through altering the chemical environment, and thereby controlling the formation of the covalent loop, easily generates unknotted protein. The biological advantage, we have found, is that the PL can exert allosteric control through this on/off mechanism in a target protein. Most significantly, as the disulphide bridge acts as an on/off switch in knotting, the biophysical investigation of PL topologies can provide a new tool to steer folding and function in proteins, as disulphide bridges are commonly used in protein engineering and therapeutics.

  16. Peroxisome protein import: a complex journey.

    Science.gov (United States)

    Baker, Alison; Lanyon-Hogg, Thomas; Warriner, Stuart L

    2016-06-15

    The import of proteins into peroxisomes possesses many unusual features such as the ability to import folded proteins, and a surprising diversity of targeting signals with differing affinities that can be recognized by the same receptor. As understanding of the structure and function of many components of the protein import machinery has grown, an increasingly complex network of factors affecting each step of the import pathway has emerged. Structural studies have revealed the presence of additional interactions between cargo proteins and the PEX5 receptor that affect import potential, with a subtle network of cargo-induced conformational changes in PEX5 being involved in the import process. Biochemical studies have also indicated an interdependence of receptor-cargo import with release of unloaded receptor from the peroxisome. Here, we provide an update on recent literature concerning mechanisms of protein import into peroxisomes. © 2016 The Author(s).

  17. Trade-off between positive and negative design of protein stability: from lattice models to real proteins.

    Directory of Open Access Journals (Sweden)

    Orly Noivirt-Brik

    2009-12-01

    Full Text Available Two different strategies for stabilizing proteins are (i positive design in which the native state is stabilized and (ii negative design in which competing non-native conformations are destabilized. Here, the circumstances under which one strategy might be favored over the other are explored in the case of lattice models of proteins and then generalized and discussed with regard to real proteins. The balance between positive and negative design of proteins is found to be determined by their average "contact-frequency", a property that corresponds to the fraction of states in the conformational ensemble of the sequence in which a pair of residues is in contact. Lattice model proteins with a high average contact-frequency are found to use negative design more than model proteins with a low average contact-frequency. A mathematical derivation of this result indicates that it is general and likely to hold also for real proteins. Comparison of the results of correlated mutation analysis for real proteins with typical contact-frequencies to those of proteins likely to have high contact-frequencies (such as disordered proteins and proteins that are dependent on chaperonins for their folding indicates that the latter tend to have stronger interactions between residues that are not in contact in their native conformation. Hence, our work indicates that negative design is employed when insufficient stabilization is achieved via positive design owing to high contact-frequencies.

  18. Peripheral Protein Unfolding Drives Membrane Bending.

    Science.gov (United States)

    Siaw, Hew Ming Helen; Raghunath, Gokul; Dyer, R Brian

    2018-06-20

    Dynamic modulation of lipid membrane curvature can be achieved by a number of peripheral protein binding mechanisms such as hy-drophobic insertion of amphipathic helices and membrane scaffolding. Recently, an alternative mechanism was proposed in which crowding of peripherally bound proteins induces membrane curvature through steric pressure generated by lateral collisions. This effect was enhanced using intrinsically disordered proteins that possess high hydrodynamic radii, prompting us to explore whether membrane bending can be triggered by the folding-unfolding transition of surface-bound proteins. We utilized histidine-tagged human serum albumin bound to Ni-NTA-DGS containing liposomes as our model system to test this hypothesis. We found that reduction of the disulfide bonds in the protein resulted in unfolding of HSA, which subsequently led to membrane tubule formation. The frequency of tubule formation was found to be significantly higher when the proteins were unfolded while being localized to a phase-separated domain as opposed to randomly distributed in fluid phase liposomes, indicating that the steric pressure generated from protein unfolding is directly responsible for membrane deformation. Our results are critical for the design of peripheral membrane protein-immobilization strategies and open new avenues for exploring mechanisms of membrane bending driven by conformational changes of peripheral membrane proteins.

  19. Sulfated glycopeptide nanostructures for multipotent protein activation

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Sungsoo S.; Fyrner, Timmy; Chen, Feng; Álvarez, Zaida; Sleep, Eduard; Chun, Danielle S.; Weiner, Joseph A.; Cook, Ralph W.; Freshman, Ryan D.; Schallmo, Michael S.; Katchko, Karina M.; Schneider, Andrew D.; Smith, Justin T.; Yun, Chawon; Singh, Gurmit; Hashmi, Sohaib Z.; McClendon, Mark T.; Yu, Zhilin; Stock, Stuart R.; Hsu, Wellington K.; Hsu, Erin L.; Stupp , Samuel I. (NWU)

    2017-06-19

    Biological systems have evolved to utilize numerous proteins with capacity to bind polysaccharides for the purpose of optimizing their function. A well-known subset of these proteins with binding domains for the highly diverse sulfated polysaccharides are important growth factors involved in biological development and tissue repair. We report here on supramolecular sulfated glycopeptide nanostructures, which display a trisulfated monosaccharide on their surfaces and bind five critical proteins with different polysaccharide-binding domains. Binding does not disrupt the filamentous shape of the nanostructures or their internal β-sheet backbone, but must involve accessible adaptive configurations to interact with such different proteins. The glycopeptide nanostructures amplified signalling of bone morphogenetic protein 2 significantly more than the natural sulfated polysaccharide heparin, and promoted regeneration of bone in the spine with a protein dose that is 100-fold lower than that required in the animal model. These highly bioactive nanostructures may enable many therapies in the future involving proteins.

  20. Comparison of three methods for determination of protein ...

    African Journals Online (AJOL)

    However, a six fold greater amount of protein was obtained when FastPrep was applied to lyse LAB cells. Our results also indicate that, this fast and easy extraction method allows more spot-abundant polyacrylamide gels. More clear and consistent strips were detected by SDS-PAGE when proteins were extracted by ...

  1. Different secondary structure elements as scaffolds for protein folding transition states of two homologous four-helix bundles.

    Science.gov (United States)

    Teilum, Kaare; Thormann, Thorsten; Caterer, Nigel R; Poulsen, Heidi I; Jensen, Peter H; Knudsen, Jens; Kragelund, Birthe B; Poulsen, Flemming M

    2005-04-01

    Comparison of the folding processes for homologue proteins can provide valuable information about details in the interactions leading to the formation of the folding transition state. Here the folding kinetics of 18 variants of yACBP and 3 variants of bACBP have been studied by Phi-value analysis. In combination with Phi-values from previous work, detailed insight into the transition states for folding of both yACBP and bACBP has been obtained. Of the 16 sequence positions that have been studied in both yACBP and bACBP, 5 (V12, I/L27, Y73, V77, and L80) have high Phi-values and appear to be important for the transition state formation in both homologues. Y31, A34, and A69 have high Phi-values only in yACBP, while F5, A9, and I74 have high Phi-values only in bACBP. Thus, additional interactions between helices A2 and A4 appear to be important for the transition state of yACBP, whereas additional interactions between helices A1 and A4 appear to be important for the transition state of bACBP. To examine whether these differences could be assigned to different packing of the residues in the native state, a solution structure of yACBP was determined by NMR. Small changes in the packing of the hydrophobic side-chains, which strengthen the interactions between helices A2 and A4, are observed in yACBP relative to bACBP. It is suggested that different structure elements serve as scaffolds for the folding of the 2 ACBP homologues. (c) 2005 Wiley-Liss, Inc.

  2. RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae.

    Science.gov (United States)

    Ivanyi-Nagy, Roland; Lavergne, Jean-Pierre; Gabus, Caroline; Ficheux, Damien; Darlix, Jean-Luc

    2008-02-01

    RNA chaperone proteins are essential partners of RNA in living organisms and viruses. They are thought to assist in the correct folding and structural rearrangements of RNA molecules by resolving misfolded RNA species in an ATP-independent manner. RNA chaperoning is probably an entropy-driven process, mediated by the coupled binding and folding of intrinsically disordered protein regions and the kinetically trapped RNA. Previously, we have shown that the core protein of hepatitis C virus (HCV) is a potent RNA chaperone that can drive profound structural modifications of HCV RNA in vitro. We now examined the RNA chaperone activity and the disordered nature of core proteins from different Flaviviridae genera, namely that of HCV, GBV-B (GB virus B), WNV (West Nile virus) and BVDV (bovine viral diarrhoea virus). Despite low-sequence similarities, all four proteins demonstrated general nucleic acid annealing and RNA chaperone activities. Furthermore, heat resistance of core proteins, as well as far-UV circular dichroism spectroscopy suggested that a well-defined 3D protein structure is not necessary for core-induced RNA structural rearrangements. These data provide evidence that RNA chaperoning-possibly mediated by intrinsically disordered protein segments-is conserved in Flaviviridae core proteins. Thus, besides nucleocapsid formation, core proteins may function in RNA structural rearrangements taking place during virus replication.

  3. Proteins aggregation and human diseases

    International Nuclear Information System (INIS)

    Hu, Chin-Kun

    2015-01-01

    Many human diseases and the death of most supercentenarians are related to protein aggregation. Neurodegenerative diseases include Alzheimer's disease (AD), Huntington's disease (HD), Parkinson's disease (PD), frontotemporallobar degeneration, etc. Such diseases are due to progressive loss of structure or function of neurons caused by protein aggregation. For example, AD is considered to be related to aggregation of Aβ40 (peptide with 40 amino acids) and Aβ42 (peptide with 42 amino acids) and HD is considered to be related to aggregation of polyQ (polyglutamine) peptides. In this paper, we briefly review our recent discovery of key factors for protein aggregation. We used a lattice model to study the aggregation rates of proteins and found that the probability for a protein sequence to appear in the conformation of the aggregated state can be used to determine the temperature at which proteins can aggregate most quickly. We used molecular dynamics and simple models of polymer chains to study relaxation and aggregation of proteins under various conditions and found that when the bending-angle dependent and torsion-angle dependent interactions are zero or very small, then protein chains tend to aggregate at lower temperatures. All atom models were used to identify a key peptide chain for the aggregation of insulin chains and to find that two polyQ chains prefer anti-parallel conformation. It is pointed out that in many cases, protein aggregation does not result from protein mis-folding. A potential drug from Chinese medicine was found for Alzheimer's disease. (paper)

  4. Proteins aggregation and human diseases

    Science.gov (United States)

    Hu, Chin-Kun

    2015-04-01

    Many human diseases and the death of most supercentenarians are related to protein aggregation. Neurodegenerative diseases include Alzheimer's disease (AD), Huntington's disease (HD), Parkinson's disease (PD), frontotemporallobar degeneration, etc. Such diseases are due to progressive loss of structure or function of neurons caused by protein aggregation. For example, AD is considered to be related to aggregation of Aβ40 (peptide with 40 amino acids) and Aβ42 (peptide with 42 amino acids) and HD is considered to be related to aggregation of polyQ (polyglutamine) peptides. In this paper, we briefly review our recent discovery of key factors for protein aggregation. We used a lattice model to study the aggregation rates of proteins and found that the probability for a protein sequence to appear in the conformation of the aggregated state can be used to determine the temperature at which proteins can aggregate most quickly. We used molecular dynamics and simple models of polymer chains to study relaxation and aggregation of proteins under various conditions and found that when the bending-angle dependent and torsion-angle dependent interactions are zero or very small, then protein chains tend to aggregate at lower temperatures. All atom models were used to identify a key peptide chain for the aggregation of insulin chains and to find that two polyQ chains prefer anti-parallel conformation. It is pointed out that in many cases, protein aggregation does not result from protein mis-folding. A potential drug from Chinese medicine was found for Alzheimer's disease.

  5. Structure of PIN-domain protein PH0500 from Pyrococcus horikoshii

    International Nuclear Information System (INIS)

    Jeyakanthan, Jeyaraman; Inagaki, Eiji; Kuroishi, Chizu; Tahirov, Tahir H.

    2005-01-01

    The structure of P. horikoshii OT3 protein PH0500 was determined by the multiple anomalous dispersion method and refined in two crystal forms. The protein is a dimer and has a PIN-domain fold. The Pyrococcus horikoshii OT3 protein PH0500 is highly conserved within the Pyrococcus genus of hyperthermophilic archaea and shows low amino-acid sequence similarity with a family of PIN-domain proteins. The protein has been expressed, purified and crystallized in two crystal forms: PH0500-I and PH0500-II. The structure was determined at 2.0 Å by the multiple anomalous dispersion method using a selenomethionyl derivative of crystal form PH0500-I (PH0500-I-Se). The structure of PH0500-I has been refined at 1.75 Å resolution to an R factor of 20.9% and the structure of PH0500-II has been refined at 2.0 Å resolution to an R factor of 23.4%. In both crystal forms as well as in solution the molecule appears to be a dimer. Searches of the databases for protein-fold similarities confirmed that the PH0500 protein is a PIN-domain protein with possible exonuclease activity and involvement in DNA or RNA editing

  6. SHEETSPAIR: A Database of Amino Acid Pairs in Protein Sheet Structures

    Directory of Open Access Journals (Sweden)

    Ning Zhang

    2007-10-01

    Full Text Available Within folded strands of a protein, amino acids (AAs on every adjacent two strands form a pair of AAs. To explore the interactions between strands in a protein sheet structure, we have established an Internet-accessible relational database named SheetsPairs based on SQL Server 2000. The database has collected AAs pairs in proteins with detailed information. Furthermore, it utilizes a non-freetext database structure to store protein sequences and a specific database table with a unique number to store strands, which provides more searching options and rapid and accurate access to data queries. An IIS web server has been set up for data retrieval through a custom web interface, which enables complex data queries. Also searchable are parallel or anti-parallel folded strands and the list of strands in a specified protein.

  7. Effect of the level of dietary protein on the utilization of alpha-ketoisocaproate for protein synthesis

    Energy Technology Data Exchange (ETDEWEB)

    Kang, C.W.; Tungsanga, K.; Walser, M.

    1986-04-01

    The efficiency of alpha-ketoisocaproate (KIC) as a dietary substitute for leucine in rats on varying protein intake was estimated by an isotopic method, previously shown to yield the same results as comparative growth experiments. /sup 14/C-KIC and /sup 3/H-leucine are injected orally. Six hours later the ratio, R, of /sup 14/C//sup 3/H in isolated proteins, divided by the same ratio in the injectate is measured. This ratio has been shown to be approximately equal to nutritional efficiency of KIC relative to leucine. As dietary protein increased from 6.3% to 48.3%, whole body protein R decreased from 0.515 +/- 0.045 to 0.299 +/- 0.016. Variations with protein intake were noted in R of protein isolated from individual organs. The magnitude of R in these organs varied two-fold, in the following sequence: brain greater than heart greater than or equal to skeletal muscle greater than or equal to salivary gland greater than or equal to kidney greater than liver. Whole body protein R could be confidently predicted (r2 = 0.992) from R in the protein of kidney and muscle. Thus, the nutritional efficiency of KIC as a dietary substitute for leucine in individual organs as well as in the whole animal is strongly dependent on the level of protein intake.

  8. Effect of the level of dietary protein on the utilization of alpha-ketoisocaproate for protein synthesis

    International Nuclear Information System (INIS)

    Kang, C.W.; Tungsanga, K.; Walser, M.

    1986-01-01

    The efficiency of alpha-ketoisocaproate (KIC) as a dietary substitute for leucine in rats on varying protein intake was estimated by an isotopic method, previously shown to yield the same results as comparative growth experiments. 14 C-KIC and 3 H-leucine are injected orally. Six hours later the ratio, R, of 14 C/ 3 H in isolated proteins, divided by the same ratio in the injectate is measured. This ratio has been shown to be approximately equal to nutritional efficiency of KIC relative to leucine. As dietary protein increased from 6.3% to 48.3%, whole body protein R decreased from 0.515 +/- 0.045 to 0.299 +/- 0.016. Variations with protein intake were noted in R of protein isolated from individual organs. The magnitude of R in these organs varied two-fold, in the following sequence: brain greater than heart greater than or equal to skeletal muscle greater than or equal to salivary gland greater than or equal to kidney greater than liver. Whole body protein R could be confidently predicted (r2 = 0.992) from R in the protein of kidney and muscle. Thus, the nutritional efficiency of KIC as a dietary substitute for leucine in individual organs as well as in the whole animal is strongly dependent on the level of protein intake

  9. Production of membrane proteins without cells or detergents.

    Science.gov (United States)

    Rajesh, Sundaresan; Knowles, Timothy; Overduin, Michael

    2011-04-30

    The production of membrane proteins in cellular systems is besieged by several problems due to their hydrophobic nature which often causes misfolding, protein aggregation and cytotoxicity, resulting in poor yields of stable proteins. Cell-free expression has emerged as one of the most versatile alternatives for circumventing these obstacles by producing membrane proteins directly into designed hydrophobic environments. Efficient optimisation of expression and solubilisation conditions using a variety of detergents, membrane mimetics and lipids has yielded structurally and functionally intact membrane proteins, with yields several fold above the levels possible from cell-based systems. Here we review recently developed techniques available to produce functional membrane proteins, and discuss amphipols, nanodisc and styrene maleic acid lipid particle (SMALP) technologies that can be exploited alongside cell-free expression of membrane proteins. Copyright © 2010 Elsevier B.V. All rights reserved.

  10. Effects of punctal occlusion on global tear proteins in patients with dry eye.

    Science.gov (United States)

    Tong, Louis; Zhou, Lei; Beuerman, Roger; Simonyi, Susan; Hollander, David A; Stern, Michael E

    2017-10-01

    To investigate effects of punctal occlusion on global tear protein levels in patients with dry eye. In this prospective, longitudinal, single-center study, nonabsorbable punctal plugs were inserted bilaterally into the lower punctum of 30 patients with moderate dry eye. Dry eye symptoms, fluorescein corneal staining, Schirmer I test, tear film break-up time, and safety were assessed in the more severely affected eye. Tear proteins at weeks 1 and 3 were quantified by iTRAQ relative to baseline preocclusion levels. Of 29 patients who completed the study, 23 (mean age 49.8 years) had sufficient tear samples for analysis. After 3 weeks, punctal occlusion significantly upregulated tear proteins, including glutathione synthase (mean of 1.6-fold, P = 0.01) and interleukin-1 receptor antagonist (1.7-fold, P = 0.032) and downregulated cholinergic receptor (neuronal) alpha-7 (0.79-fold, P = 0.039) and lymphocyte cytosolic protein-1 (0.66-fold, P = 0.012). Clustering analysis of global tear proteins revealed two clear profile changes; the first group of patients (cluster 1, n = 10) had a reduction in the inflammatory proteins (e.g., S100A8) and rise in lacrimal proteins supporting the ocular surface (e.g., lysozyme), whereas the second group (cluster 2, n = 13) had an increase in inflammatory proteins and a decrease in lacrimal proteins. Logistic regression analysis revealed that cluster 1 patients had significantly (P = 0.006) lower Schirmer scores at baseline (mean [standard deviation]: 4.3 [4.3] mm) than cluster 2 (6.8 [2.6] mm). Punctal plugs produced a beneficial pattern of tear protein change in patients with relatively low Schirmer scores within 3 weeks of punctal occlusion. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  11. Protein docking prediction using predicted protein-protein interface

    Directory of Open Access Journals (Sweden)

    Li Bin

    2012-01-01

    Full Text Available Abstract Background Many important cellular processes are carried out by protein complexes. To provide physical pictures of interacting proteins, many computational protein-protein prediction methods have been developed in the past. However, it is still difficult to identify the correct docking complex structure within top ranks among alternative conformations. Results We present a novel protein docking algorithm that utilizes imperfect protein-protein binding interface prediction for guiding protein docking. Since the accuracy of protein binding site prediction varies depending on cases, the challenge is to develop a method which does not deteriorate but improves docking results by using a binding site prediction which may not be 100% accurate. The algorithm, named PI-LZerD (using Predicted Interface with Local 3D Zernike descriptor-based Docking algorithm, is based on a pair wise protein docking prediction algorithm, LZerD, which we have developed earlier. PI-LZerD starts from performing docking prediction using the provided protein-protein binding interface prediction as constraints, which is followed by the second round of docking with updated docking interface information to further improve docking conformation. Benchmark results on bound and unbound cases show that PI-LZerD consistently improves the docking prediction accuracy as compared with docking without using binding site prediction or using the binding site prediction as post-filtering. Conclusion We have developed PI-LZerD, a pairwise docking algorithm, which uses imperfect protein-protein binding interface prediction to improve docking accuracy. PI-LZerD consistently showed better prediction accuracy over alternative methods in the series of benchmark experiments including docking using actual docking interface site predictions as well as unbound docking cases.

  12. Protein docking prediction using predicted protein-protein interface.

    Science.gov (United States)

    Li, Bin; Kihara, Daisuke

    2012-01-10

    Many important cellular processes are carried out by protein complexes. To provide physical pictures of interacting proteins, many computational protein-protein prediction methods have been developed in the past. However, it is still difficult to identify the correct docking complex structure within top ranks among alternative conformations. We present a novel protein docking algorithm that utilizes imperfect protein-protein binding interface prediction for guiding protein docking. Since the accuracy of protein binding site prediction varies depending on cases, the challenge is to develop a method which does not deteriorate but improves docking results by using a binding site prediction which may not be 100% accurate. The algorithm, named PI-LZerD (using Predicted Interface with Local 3D Zernike descriptor-based Docking algorithm), is based on a pair wise protein docking prediction algorithm, LZerD, which we have developed earlier. PI-LZerD starts from performing docking prediction using the provided protein-protein binding interface prediction as constraints, which is followed by the second round of docking with updated docking interface information to further improve docking conformation. Benchmark results on bound and unbound cases show that PI-LZerD consistently improves the docking prediction accuracy as compared with docking without using binding site prediction or using the binding site prediction as post-filtering. We have developed PI-LZerD, a pairwise docking algorithm, which uses imperfect protein-protein binding interface prediction to improve docking accuracy. PI-LZerD consistently showed better prediction accuracy over alternative methods in the series of benchmark experiments including docking using actual docking interface site predictions as well as unbound docking cases.

  13. Genetic analysis of RPA single-stranded DNA binding protein in Haloferax volcanii

    OpenAIRE

    Stroud, A. L.

    2012-01-01

    Replication protein A (RPA) is a single-stranded DNA-binding protein that is present in all three domains of life. The roles of RPA include stabilising and protecting single- stranded DNA from nuclease degradation during DNA replication and repair. To achieve this, RPA uses an oligosaccharide-binding fold (OB fold) to bind single- stranded DNA. Haloferax volcanii encodes three RPAs – RPA1, RPA2 and RPA3, of which rpa1 and rpa3 are in operons with genes encoding associated proteins (APs). ...

  14. Computational design of proteins with novel structure and functions

    International Nuclear Information System (INIS)

    Yang Wei; Lai Lu-Hua

    2016-01-01

    Computational design of proteins is a relatively new field, where scientists search the enormous sequence space for sequences that can fold into desired structure and perform desired functions. With the computational approach, proteins can be designed, for example, as regulators of biological processes, novel enzymes, or as biotherapeutics. These approaches not only provide valuable information for understanding of sequence–structure–function relations in proteins, but also hold promise for applications to protein engineering and biomedical research. In this review, we briefly introduce the rationale for computational protein design, then summarize the recent progress in this field, including de novo protein design, enzyme design, and design of protein–protein interactions. Challenges and future prospects of this field are also discussed. (topical review)

  15. Prediction of Protein-Protein Interactions Related to Protein Complexes Based on Protein Interaction Networks

    Directory of Open Access Journals (Sweden)

    Peng Liu

    2015-01-01

    Full Text Available A method for predicting protein-protein interactions based on detected protein complexes is proposed to repair deficient interactions derived from high-throughput biological experiments. Protein complexes are pruned and decomposed into small parts based on the adaptive k-cores method to predict protein-protein interactions associated with the complexes. The proposed method is adaptive to protein complexes with different structure, number, and size of nodes in a protein-protein interaction network. Based on different complex sets detected by various algorithms, we can obtain different prediction sets of protein-protein interactions. The reliability of the predicted interaction sets is proved by using estimations with statistical tests and direct confirmation of the biological data. In comparison with the approaches which predict the interactions based on the cliques, the overlap of the predictions is small. Similarly, the overlaps among the predicted sets of interactions derived from various complex sets are also small. Thus, every predicted set of interactions may complement and improve the quality of the original network data. Meanwhile, the predictions from the proposed method replenish protein-protein interactions associated with protein complexes using only the network topology.

  16. The 'tubulin-like' S1 protein of Spirochaeta is a member of the hsp65 stress protein family

    Science.gov (United States)

    Munson, D.; Obar, R.; Tzertzinis, G.; Margulis, L.

    1993-01-01

    A 65-kDa protein (called S1) from Spirochaeta bajacaliforniensis was identified as 'tubulin-like' because it cross-reacted with at least four different antisera raised against tubulin and was isolated, with a co-polymerizing 45-kDa protein, by warm-cold cycling procedures used to purify tubulin from mammalian brain. Furthermore, at least three genera of non-cultivable symbiotic spirochetes (Pillotina, Diplocalyx, and Hollandina) that contain conspicuous 24-nm cytoplasmic tubules displayed a strong fluorescence in situ when treated with polyclonal antisera raised against tubulin. Here we summarize results that lead to the conclusion that this 65-kDa protein has no homology to tubulin. S1 is an hsp65 stress protein homologue. Hsp65 is a highly immunogenic family of hsp60 proteins which includes the 65-kDa antigens of Mycobacterium tuberculosis (an active component of Freund's complete adjuvant), Borrelia, Treponema, Chlamydia, Legionella, and Salmonella. The hsp60s, also known as chaperonins, include E. coli GroEL, mitochondrial and chloroplast chaperonins, the pea aphid 'symbionin' and many other proteins involved in protein folding and the stress response.

  17. Chaperone activity of human small heat shock protein-GST fusion proteins.

    Science.gov (United States)

    Arbach, Hannah; Butler, Caley; McMenimen, Kathryn A

    2017-07-01

    Small heat shock proteins (sHsps) are a ubiquitous part of the machinery that maintains cellular protein homeostasis by acting as molecular chaperones. sHsps bind to and prevent the aggregation of partially folded substrate proteins in an ATP-independent manner. sHsps are dynamic, forming an ensemble of structures from dimers to large oligomers through concentration-dependent equilibrium dissociation. Based on structural studies and mutagenesis experiments, it is proposed that the dimer is the smallest active chaperone unit, while larger oligomers may act as storage depots for sHsps or play additional roles in chaperone function. The complexity and dynamic nature of their structural organization has made elucidation of their chaperone function challenging. HspB1 and HspB5 are two canonical human sHsps that vary in sequence and are expressed in a wide variety of tissues. In order to determine the role of the dimer in chaperone activity, glutathione-S-transferase (GST) was genetically linked as a fusion protein to the N-terminus regions of both HspB1 and HspB5 (also known as Hsp27 and αB-crystallin, respectively) proteins in order to constrain oligomer formation of HspB1 and HspB5, by using GST, since it readily forms a dimeric structure. We monitored the chaperone activity of these fusion proteins, which suggest they primarily form dimers and monomers and function as active molecular chaperones. Furthermore, the two different fusion proteins exhibit different chaperone activity for two model substrate proteins, citrate synthase (CS) and malate dehydrogenase (MDH). GST-HspB1 prevents more aggregation of MDH compared to GST-HspB5 and wild type HspB1. However, when CS is the substrate, both GST-HspB1 and GST-HspB5 are equally effective chaperones. Furthermore, wild type proteins do not display equal activity toward the substrates, suggesting that each sHsp exhibits different substrate specificity. Thus, substrate specificity, as described here for full-length GST

  18. Cellular proteostasis: degradation of misfolded proteins by lysosomes

    Science.gov (United States)

    Jackson, Matthew P.

    2016-01-01

    Proteostasis refers to the regulation of the cellular concentration, folding, interactions and localization of each of the proteins that comprise the proteome. One essential element of proteostasis is the disposal of misfolded proteins by the cellular pathways of protein degradation. Lysosomes are an important site for the degradation of misfolded proteins, which are trafficked to this organelle by the pathways of macroautophagy, chaperone-mediated autophagy and endocytosis. Conversely, amyloid diseases represent a failure in proteostasis, in which proteins misfold, forming amyloid deposits that are not degraded effectively by cells. Amyloid may then exacerbate this failure by disrupting autophagy and lysosomal proteolysis. However, targeting the pathways that regulate autophagy and the biogenesis of lysosomes may present approaches that can rescue cells from the deleterious effects of amyloidogenic proteins. PMID:27744333

  19. The virion N protein of infectious bronchitis virus is more phosphorylated than the N protein from infected cell lysates

    International Nuclear Information System (INIS)

    Jayaram, Jyothi; Youn, Soonjeon; Collisson, Ellen W.

    2005-01-01

    Because phosphorylation of the infectious bronchitis virus (IBV) nucleocapsid protein (N) may regulate its multiple roles in viral replication, the dynamics of N phosphorylation were examined. 32 P-orthophosphate labeling and Western blot analyses confirmed that N was the only viral protein that was phosphorylated. Pulse labeling with 32 P-orthophosphate indicated that the IBV N protein was phosphorylated in the virion, as well as at all times during infection in either chicken embryo kidney cells or Vero cells. Pulse-chase analyses followed by immunoprecipitation of IBV N proteins using rabbit anti-IBV N polyclonal antibody demonstrated that the phosphate on the N protein was stable for at least 1 h. Simultaneous labeling with 32 P-orthophosphate and 3 H-leucine identified a 3.5-fold increase in the 32 P: 3 H counts per minute (cpm) ratio of N in the virion as compared to the 32 P: 3 H cpm ratio of N in the cell lysates from chicken embryo kidney cells, whereas in Vero cells the 32 P: 3 H cpm ratio of N from the virion was 10.5-fold greater than the 32 P: 3 H cpm ratio of N from the cell lysates. These studies are consistent with the phosphorylation of the IBV N playing a role in assembly or maturation of the viral particle

  20. Effect of Processing Intensity on Immunologically Active Bovine Milk Serum Proteins.

    Science.gov (United States)

    Brick, Tabea; Ege, Markus; Boeren, Sjef; Böck, Andreas; von Mutius, Erika; Vervoort, Jacques; Hettinga, Kasper

    2017-08-31

    Consumption of raw cow's milk instead of industrially processed milk has been reported to protect children from developing asthma, allergies, and respiratory infections. Several heat-sensitive milk serum proteins have been implied in this effect though unbiased assessment of milk proteins in general is missing. The aim of this study was to compare the native milk serum proteome between raw cow's milk and various industrially applied processing methods, i.e., homogenization, fat separation, pasteurization, ultra-heat treatment (UHT), treatment for extended shelf-life (ESL), and conventional boiling. Each processing method was applied to the same three pools of raw milk. Levels of detectable proteins were quantified by liquid chromatography/tandem mass spectrometry following filter aided sample preparation. In total, 364 milk serum proteins were identified. The 140 proteins detectable in 66% of all samples were entered in a hierarchical cluster analysis. The resulting proteomics pattern separated mainly as high (boiling, UHT, ESL) versus no/low heat treatment (raw, skimmed, pasteurized). Comparing these two groups revealed 23 individual proteins significantly reduced by heating, e.g., lactoferrin (log2-fold change = -0.37, p = 0.004), lactoperoxidase (log2-fold change = -0.33, p = 0.001), and lactadherin (log2-fold change = -0.22, p = 0.020). The abundance of these heat sensitive proteins found in higher quantity in native cow's milk compared to heat treated milk, renders them potential candidates for protection from asthma, allergies, and respiratory infections.