WorldWideScience

Sample records for backbone protein structure

  1. Annotating the protein-RNA interaction sites in proteins using evolutionary information and protein backbone structure.

    Science.gov (United States)

    Li, Tao; Li, Qian-Zhong

    2012-11-07

    RNA-protein interactions play important roles in various biological processes. The precise detection of RNA-protein interaction sites is very important for understanding essential biological processes and annotating the function of the proteins. In this study, based on various features from amino acid sequence and structure, including evolutionary information, solvent accessible surface area and torsion angles (φ, ψ) in the backbone structure of the polypeptide chain, a computational method for predicting RNA-binding sites in proteins is proposed. When the method is applied to predict RNA-binding sites in three datasets: RBP86 containing 86 protein chains, RBP107 containing 107 proteins chains and RBP109 containing 109 proteins chains, better sensitivities and specificities are obtained compared to previously published methods in five-fold cross-validation tests. In order to make further examination for the efficiency of our method, the RBP107 dataset is used as training set, RBP86 and RBP109 datasets are used as the independent test sets. In addition, as examples of our prediction, RNA-binding sites in a few proteins are presented. The annotated results are consistent with the PDB annotation. These results show that our method is useful for annotating RNA binding sites of novel proteins.

  2. Structural test of the parameterized-backbone method for protein design.

    Science.gov (United States)

    Plecs, Joseph J; Harbury, Pehr B; Kim, Peter S; Alber, Tom

    2004-09-03

    Designing new protein folds requires a method for simultaneously optimizing the conformation of the backbone and the side-chains. One approach to this problem is the use of a parameterized backbone, which allows the systematic exploration of families of structures. We report the crystal structure of RH3, a right-handed, three-helix coiled coil that was designed using a parameterized backbone and detailed modeling of core packing. This crystal structure was determined using another rationally designed feature, a metal-binding site that permitted experimental phasing of the X-ray data. RH3 adopted the intended fold, which has not been observed previously in biological proteins. Unanticipated structural asymmetry in the trimer was a principal source of variation within the RH3 structure. The sequence of RH3 differs from that of a previously characterized right-handed tetramer, RH4, at only one position in each 11 amino acid sequence repeat. This close similarity indicates that the design method is sensitive to the core packing interactions that specify the protein structure. Comparison of the structures of RH3 and RH4 indicates that both steric overlap and cavity formation provide strong driving forces for oligomer specificity.

  3. Prediction of backbone dihedral angles and protein secondary structure using support vector machines

    Directory of Open Access Journals (Sweden)

    Hirst Jonathan D

    2009-12-01

    Full Text Available Abstract Background The prediction of the secondary structure of a protein is a critical step in the prediction of its tertiary structure and, potentially, its function. Moreover, the backbone dihedral angles, highly correlated with secondary structures, provide crucial information about the local three-dimensional structure. Results We predict independently both the secondary structure and the backbone dihedral angles and combine the results in a loop to enhance each prediction reciprocally. Support vector machines, a state-of-the-art supervised classification technique, achieve secondary structure predictive accuracy of 80% on a non-redundant set of 513 proteins, significantly higher than other methods on the same dataset. The dihedral angle space is divided into a number of regions using two unsupervised clustering techniques in order to predict the region in which a new residue belongs. The performance of our method is comparable to, and in some cases more accurate than, other multi-class dihedral prediction methods. Conclusions We have created an accurate predictor of backbone dihedral angles and secondary structure. Our method, called DISSPred, is available online at http://comp.chem.nottingham.ac.uk/disspred/.

  4. Correlation between protein secondary structure, backbone bond angles, and side-chain orientations

    Science.gov (United States)

    Lundgren, Martin; Niemi, Antti J.

    2012-08-01

    We investigate the fine structure of the sp3 hybridized covalent bond geometry that governs the tetrahedral architecture around the central Cα carbon of a protein backbone, and for this we develop new visualization techniques to analyze high-resolution x-ray structures in the Protein Data Bank. We observe that there is a correlation between the deformations of the ideal tetrahedral symmetry and the local secondary structure of the protein. We propose a universal coarse-grained energy function to describe the ensuing side-chain geometry in terms of the Cβ carbon orientations. The energy function can model the side-chain geometry with a subatomic precision. As an example we construct the Cα-Cβ structure of HP35 chicken villin headpiece. We obtain a configuration that deviates less than 0.4 Å in root-mean-square distance from the experimental x-ray structure.

  5. APSY-NMR for protein backbone assignment in high-throughput structural biology

    Energy Technology Data Exchange (ETDEWEB)

    Dutta, Samit Kumar; Serrano, Pedro; Proudfoot, Andrew; Geralt, Michael [The Scripps Research Institute, Department of Integrative Structural and Computational Biology (United States); Pedrini, Bill [Paul Scherrer Institute (PSI), SwissFEL Project (Switzerland); Herrmann, Torsten [Université de Lyon, Institut des Sciences Analytiques, Centre de RMN à Très Hauts Champs, UMR 5280 CNRS, ENS Lyon, UCB Lyon 1 (France); Wüthrich, Kurt, E-mail: wuthrich@scripps.edu [The Scripps Research Institute, Department of Integrative Structural and Computational Biology (United States)

    2015-01-15

    A standard set of three APSY-NMR experiments has been used in daily practice to obtain polypeptide backbone NMR assignments in globular proteins with sizes up to about 150 residues, which had been identified as targets for structure determination by the Joint Center for Structural Genomics (JCSG) under the auspices of the Protein Structure Initiative (PSI). In a representative sample of 30 proteins, initial fully automated data analysis with the software UNIO-MATCH-2014 yielded complete or partial assignments for over 90 % of the residues. For most proteins the APSY data acquisition was completed in less than 30 h. The results of the automated procedure provided a basis for efficient interactive validation and extension to near-completion of the assignments by reference to the same 3D heteronuclear-resolved [{sup 1}H,{sup 1}H]-NOESY spectra that were subsequently used for the collection of conformational constraints. High-quality structures were obtained for all 30 proteins, using the J-UNIO protocol, which includes extensive automation of NMR structure determination.

  6. Orientation-dependent backbone-only residue pair scoring functions for fixed backbone protein design

    Directory of Open Access Journals (Sweden)

    Bordner Andrew J

    2010-04-01

    Full Text Available Abstract Background Empirical scoring functions have proven useful in protein structure modeling. Most such scoring functions depend on protein side chain conformations. However, backbone-only scoring functions do not require computationally intensive structure optimization and so are well suited to protein design, which requires fast score evaluation. Furthermore, scoring functions that account for the distinctive relative position and orientation preferences of residue pairs are expected to be more accurate than those that depend only on the separation distance. Results Residue pair scoring functions for fixed backbone protein design were derived using only backbone geometry. Unlike previous studies that used spherical harmonics to fit 2D angular distributions, Gaussian Mixture Models were used to fit the full 3D (position only and 6D (position and orientation distributions of residue pairs. The performance of the 1D (residue separation only, 3D, and 6D scoring functions were compared by their ability to identify correct threading solutions for a non-redundant benchmark set of protein backbone structures. The threading accuracy was found to steadily increase with increasing dimension, with the 6D scoring function achieving the highest accuracy. Furthermore, the 3D and 6D scoring functions were shown to outperform side chain-dependent empirical potentials from three other studies. Next, two computational methods that take advantage of the speed and pairwise form of these new backbone-only scoring functions were investigated. The first is a procedure that exploits available sequence data by averaging scores over threading solutions for homologs. This was evaluated by applying it to the challenging problem of identifying interacting transmembrane alpha-helices and found to further improve prediction accuracy. The second is a protein design method for determining the optimal sequence for a backbone structure by applying Belief Propagation

  7. High-resolution protein design with backbone freedom.

    Science.gov (United States)

    Harbury, P B; Plecs, J J; Tidor, B; Alber, T; Kim, P S

    1998-11-20

    Recent advances in computational techniques have allowed the design of precise side-chain packing in proteins with predetermined, naturally occurring backbone structures. Because these methods do not model protein main-chain flexibility, they lack the breadth to explore novel backbone conformations. Here the de novo design of a family of alpha-helical bundle proteins with a right-handed superhelical twist is described. In the design, the overall protein fold was specified by hydrophobic-polar residue patterning, whereas the bundle oligomerization state, detailed main-chain conformation, and interior side-chain rotamers were engineered by computational enumerations of packing in alternate backbone structures. Main-chain flexibility was incorporated through an algebraic parameterization of the backbone. The designed peptides form alpha-helical dimers, trimers, and tetramers in accord with the design goals. The crystal structure of the tetramer matches the designed structure in atomic detail.

  8. Backbone assignment and secondary structure of the PsbQ protein from Photosystem II

    Czech Academy of Sciences Publication Activity Database

    Horničáková, M.; Kohoutová, Jaroslava; Schlagnitweit, J.; Wohlschlager, Ch.; Ettrich, Rüdiger; Fiala, R.; Schoefberger, W.; Müller, N.

    2011-01-01

    Roč. 5, č. 2 (2011), s. 169-175 ISSN 1874-2718 R&D Projects: GA MŠk(CZ) LC06010 Institutional research plan: CEZ:AV0Z60870520 Keywords : Photosystem II * PsbQ * Missing link * NMR resonance assignment * Protein-protein interaction Subject RIV: BO - Biophysics Impact factor: 0.720, year: 2011 http://www.springerlink.com/content/3n38075w5h1l1082/fulltext.pdf

  9. Exact Solutions for Internuclear Vectors and Backbone Dihedral Angles from NH Residual Dipolar Couplings in Two Media, and their Application in a Systematic Search Algorithm for Determining Protein Backbone Structure

    International Nuclear Information System (INIS)

    Wang Lincong; Donald, Bruce Randall

    2004-01-01

    We have derived a quartic equation for computing the direction of an internuclear vector from residual dipolar couplings (RDCs) measured in two aligning media, and two simple trigonometric equations for computing the backbone (φ,ψ) angles from two backbone vectors in consecutive peptide planes. These equations make it possible to compute, exactly and in constant time, the backbone (φ,ψ) angles for a residue from RDCs in two media on any single backbone vector type. Building upon these exact solutions we have designed a novel algorithm for determining a protein backbone substructure consisting of α-helices and β-sheets. Our algorithm employs a systematic search technique to refine the conformation of both α-helices and β-sheets and to determine their orientations using exclusively the angular restraints from RDCs. The algorithm computes the backbone substructure employing very sparse distance restraints between pairs of α-helices and β-sheets refined by the systematic search. The algorithm has been demonstrated on the protein human ubiquitin using only backbone NH RDCs, plus twelve hydrogen bonds and four NOE distance restraints. Further, our results show that both the global orientations and the conformations of α-helices and β-strands can be determined with high accuracy using only two RDCs per residue. The algorithm requires, as its input, backbone resonance assignments, the identification of α-helices and β-sheets as well as sparse NOE distance and hydrogen bond restraints.Abbreviations: NMR - nuclear magnetic resonance; RDC - residual dipolar coupling; NOE - nuclear Overhauser effect; SVD - singular value decomposition; DFS - depth-first search; RMSD - root mean square deviation; POF - principal order frame; PDB - protein data bank; SA - simulated annealing; MD - molecular dynamics

  10. Mars - robust automatic backbone assignment of proteins

    International Nuclear Information System (INIS)

    Jung, Young-Sang; Zweckstetter, Markus

    2004-01-01

    MARS a program for robust automatic backbone assignment of 13 C/ 15 N labeled proteins is presented. MARS does not require tight thresholds for establishing sequential connectivity or detailed adjustment of these thresholds and it can work with a wide variety of NMR experiments. Using only 13 C α / 13 C β connectivity information, MARS allows automatic, error-free assignment of 96% of the 370-residue maltose-binding protein. MARS can successfully be used when data are missing for a substantial portion of residues or for proteins with very high chemical shift degeneracy such as partially or fully unfolded proteins. Other sources of information, such as residue specific information or known assignments from a homologues protein, can be included into the assignment process. MARS exports its result in SPARKY format. This allows visual validation and integration of automated and manual assignment

  11. CSSI-PRO: a method for secondary structure type editing, assignment and estimation in proteins using linear combination of backbone chemical shifts

    International Nuclear Information System (INIS)

    Swain, Monalisa; Atreya, Hanudatta S.

    2009-01-01

    Estimation of secondary structure in polypeptides is important for studying their structure, folding and dynamics. In NMR spectroscopy, such information is generally obtained after sequence specific resonance assignments are completed. We present here a new methodology for assignment of secondary structure type to spin systems in proteins directly from NMR spectra, without prior knowledge of resonance assignments. The methodology, named Combination of Shifts for Secondary Structure Identification in Proteins (CSSI-PRO), involves detection of specific linear combination of backbone 1 H α and 13 C' chemical shifts in a two-dimensional (2D) NMR experiment based on G-matrix Fourier transform (GFT) NMR spectroscopy. Such linear combinations of shifts facilitate editing of residues belonging to α-helical/β-strand regions into distinct spectral regions nearly independent of the amino acid type, thereby allowing the estimation of overall secondary structure content of the protein. Comparison of the predicted secondary structure content with those estimated based on their respective 3D structures and/or the method of Chemical Shift Index for 237 proteins gives a correlation of more than 90% and an overall rmsd of 7.0%, which is comparable to other biophysical techniques used for structural characterization of proteins. Taken together, this methodology has a wide range of applications in NMR spectroscopy such as rapid protein structure determination, monitoring conformational changes in protein-folding/ligand-binding studies and automated resonance assignment

  12. Structural insights into the evolution of a sexy protein: novel topology and restricted backbone flexibility in a hypervariable pheromone from the red-legged salamander, Plethodon shermani.

    Science.gov (United States)

    Wilburn, Damien B; Bowen, Kathleen E; Doty, Kari A; Arumugam, Sengodagounder; Lane, Andrew N; Feldhoff, Pamela W; Feldhoff, Richard C

    2014-01-01

    In response to pervasive sexual selection, protein sex pheromones often display rapid mutation and accelerated evolution of corresponding gene sequences. For proteins, the general dogma is that structure is maintained even as sequence or function may rapidly change. This phenomenon is well exemplified by the three-finger protein (TFP) superfamily: a diverse class of vertebrate proteins co-opted for many biological functions - such as components of snake venoms, regulators of the complement system, and coordinators of amphibian limb regeneration. All of the >200 structurally characterized TFPs adopt the namesake "three-finger" topology. In male red-legged salamanders, the TFP pheromone Plethodontid Modulating Factor (PMF) is a hypervariable protein such that, through extensive gene duplication and pervasive sexual selection, individual male salamanders express more than 30 unique isoforms. However, it remained unclear how this accelerated evolution affected the protein structure of PMF. Using LC/MS-MS and multidimensional NMR, we report the 3D structure of the most abundant PMF isoform, PMF-G. The high resolution structural ensemble revealed a highly modified TFP structure, including a unique disulfide bonding pattern and loss of secondary structure, that define a novel protein topology with greater backbone flexibility in the third peptide finger. Sequence comparison, models of molecular evolution, and homology modeling together support that this flexible third finger is the most rapidly evolving segment of PMF. Combined with PMF sequence hypervariability, this structural flexibility may enhance the plasticity of PMF as a chemical signal by permitting potentially thousands of structural conformers. We propose that the flexible third finger plays a critical role in PMF:receptor interactions. As female receptors co-evolve, this flexibility may allow PMF to still bind its receptor(s) without the immediate need for complementary mutations. Consequently, this unique

  13. Structural insights into the evolution of a sexy protein: novel topology and restricted backbone flexibility in a hypervariable pheromone from the red-legged salamander, Plethodon shermani.

    Directory of Open Access Journals (Sweden)

    Damien B Wilburn

    Full Text Available In response to pervasive sexual selection, protein sex pheromones often display rapid mutation and accelerated evolution of corresponding gene sequences. For proteins, the general dogma is that structure is maintained even as sequence or function may rapidly change. This phenomenon is well exemplified by the three-finger protein (TFP superfamily: a diverse class of vertebrate proteins co-opted for many biological functions - such as components of snake venoms, regulators of the complement system, and coordinators of amphibian limb regeneration. All of the >200 structurally characterized TFPs adopt the namesake "three-finger" topology. In male red-legged salamanders, the TFP pheromone Plethodontid Modulating Factor (PMF is a hypervariable protein such that, through extensive gene duplication and pervasive sexual selection, individual male salamanders express more than 30 unique isoforms. However, it remained unclear how this accelerated evolution affected the protein structure of PMF. Using LC/MS-MS and multidimensional NMR, we report the 3D structure of the most abundant PMF isoform, PMF-G. The high resolution structural ensemble revealed a highly modified TFP structure, including a unique disulfide bonding pattern and loss of secondary structure, that define a novel protein topology with greater backbone flexibility in the third peptide finger. Sequence comparison, models of molecular evolution, and homology modeling together support that this flexible third finger is the most rapidly evolving segment of PMF. Combined with PMF sequence hypervariability, this structural flexibility may enhance the plasticity of PMF as a chemical signal by permitting potentially thousands of structural conformers. We propose that the flexible third finger plays a critical role in PMF:receptor interactions. As female receptors co-evolve, this flexibility may allow PMF to still bind its receptor(s without the immediate need for complementary mutations. Consequently

  14. Optimized set of two-dimensional experiments for fast sequential assignment, secondary structure determination, and backbone fold validation of 13C/15N-labelled proteins

    International Nuclear Information System (INIS)

    Bersch, Beate; Rossy, Emmanuel; Coves, Jacques; Brutscher, Bernhard

    2003-01-01

    NMR experiments are presented which allow backbone resonance assignment, secondary structure identification, and in favorable cases also molecular fold topology determination from a series of two-dimensional 1 H- 15 N HSQC-like spectra. The 1 H- 15 N correlation peaks are frequency shifted by an amount ± ω X along the 15 N dimension, where ω X is the C α , C β , or H α frequency of the same or the preceding residue. Because of the low dimensionality (2D) of the experiments, high-resolution spectra are obtained in a short overall experimental time. The whole series of seven experiments can be performed in typically less than one day. This approach significantly reduces experimental time when compared to the standard 3D-based methods. The here presented methodology is thus especially appealing in the context of high-throughput NMR studies of protein structure, dynamics or molecular interfaces

  15. Backbone dynamics of the EIAV-Tat protein from 15N relaxation studies

    International Nuclear Information System (INIS)

    Ejchart, A.; Herrmann, F.; Roesch, P.; Sticht, H.; Willbold, D.

    1994-01-01

    The work investigates the mobility of EIAV-Tat protein backbone by measuring the relaxation parameters of the 15 N nitrogens. High degree of the flexibility, non-typical of rigid, well structured proteins was shown

  16. Underestimated Halogen Bonds Forming with Protein Backbone in Protein Data Bank.

    Science.gov (United States)

    Zhang, Qian; Xu, Zhijian; Shi, Jiye; Zhu, Weiliang

    2017-07-24

    Halogen bonds (XBs) are attracting increasing attention in biological systems. Protein Data Bank (PDB) archives experimentally determined XBs in biological macromolecules. However, no software for structure refinement in X-ray crystallography takes into account XBs, which might result in the weakening or even vanishing of experimentally determined XBs in PDB. In our previous study, we showed that side-chain XBs forming with protein side chains are underestimated in PDB on the basis of the phenomenon that the proportion of side-chain XBs to overall XBs decreases as structural resolution becomes lower and lower. However, whether the dominant backbone XBs forming with protein backbone are overlooked is still a mystery. Here, with the help of the ratio (R F ) of the observed XBs' frequency of occurrence to their frequency expected at random, we demonstrated that backbone XBs are largely overlooked in PDB, too. Furthermore, three cases were discovered possessing backbone XBs in high resolution structures while losing the XBs in low resolution structures. In the last two cases, even at 1.80 Å resolution, the backbone XBs were lost, manifesting the urgent need to consider XBs in the refinement process during X-ray crystallography study.

  17. Determination of protein global folds using backbone residual dipolar coupling and long-range NOE restraints

    International Nuclear Information System (INIS)

    Giesen, Alexander W.; Homans, Steve W.; Brown, Jonathan Miles

    2003-01-01

    We report the determination of the global fold of human ubiquitin using protein backbone NMR residual dipolar coupling and long-range nuclear Overhauser effect (NOE) data as conformational restraints. Specifically, by use of a maximum of three backbone residual dipolar couplings per residue (N i -H N i , N i -C' i-1 , H N i - C' i-1 ) in two tensor frames and only backbone H N -H N NOEs, a global fold of ubiquitin can be derived with a backbone root-mean-square deviation of 1.4 A with respect to the crystal structure. This degree of accuracy is more than adequate for use in databases of structural motifs, and suggests a general approach for the determination of protein global folds using conformational restraints derived only from backbone atoms

  18. Predicting the tolerated sequences for proteins and protein interfaces using RosettaBackrub flexible backbone design.

    Directory of Open Access Journals (Sweden)

    Colin A Smith

    Full Text Available Predicting the set of sequences that are tolerated by a protein or protein interface, while maintaining a desired function, is useful for characterizing protein interaction specificity and for computationally designing sequence libraries to engineer proteins with new functions. Here we provide a general method, a detailed set of protocols, and several benchmarks and analyses for estimating tolerated sequences using flexible backbone protein design implemented in the Rosetta molecular modeling software suite. The input to the method is at least one experimentally determined three-dimensional protein structure or high-quality model. The starting structure(s are expanded or refined into a conformational ensemble using Monte Carlo simulations consisting of backrub backbone and side chain moves in Rosetta. The method then uses a combination of simulated annealing and genetic algorithm optimization methods to enrich for low-energy sequences for the individual members of the ensemble. To emphasize certain functional requirements (e.g. forming a binding interface, interactions between and within parts of the structure (e.g. domains can be reweighted in the scoring function. Results from each backbone structure are merged together to create a single estimate for the tolerated sequence space. We provide an extensive description of the protocol and its parameters, all source code, example analysis scripts and three tests applying this method to finding sequences predicted to stabilize proteins or protein interfaces. The generality of this method makes many other applications possible, for example stabilizing interactions with small molecules, DNA, or RNA. Through the use of within-domain reweighting and/or multistate design, it may also be possible to use this method to find sequences that stabilize particular protein conformations or binding interactions over others.

  19. Structural insights into the backbone-circularized granulocyte colony-stimulating factor containing a short connector.

    Science.gov (United States)

    Miyafusa, Takamitsu; Shibuya, Risa; Honda, Shinya

    2018-06-02

    Backbone circularization is a powerful approach for enhancing the structural stability of polypeptides. Herein, we present the crystal structure of the circularized variant of the granulocyte colony-stimulating factor (G-CSF) in which the terminal helical region was circularized using a short, two-amino acid connector. The structure revealed that the N- and C-termini were indeed connected by a peptide bond. The local structure of the C-terminal region transited from an α helix to 3 10 helix with a bend close to the N-terminal region, indicating that the structural change offset the insufficient length of the connector. This is the first-ever report of a crystal structure of the backbone of a circularized protein. It will facilitate the development of backbone circularization methodology. Copyright © 2018 Elsevier Inc. All rights reserved.

  20. Protein backbone angle restraints from searching a database for chemical shift and sequence homology

    Energy Technology Data Exchange (ETDEWEB)

    Cornilescu, Gabriel; Delaglio, Frank; Bax, Ad [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    1999-03-15

    Chemical shifts of backbone atoms in proteins are exquisitely sensitive to local conformation, and homologous proteins show quite similar patterns of secondary chemical shifts. The inverse of this relation is used to search a database for triplets of adjacent residues with secondary chemical shifts and sequence similarity which provide the best match to the query triplet of interest. The database contains 13C{alpha}, 13C{beta}, 13C', 1H{alpha} and 15N chemical shifts for 20 proteins for which a high resolution X-ray structure is available. The computer program TALOS was developed to search this database for strings of residues with chemical shift and residue type homology. The relative importance of the weighting factors attached to the secondary chemical shifts of the five types of resonances relative to that of sequence similarity was optimized empirically. TALOS yields the 10 triplets which have the closest similarity in secondary chemical shift and amino acid sequence to those of the query sequence. If the central residues in these 10 triplets exhibit similar {phi} and {psi} backbone angles, their averages can reliably be used as angular restraints for the protein whose structure is being studied. Tests carried out for proteins of known structure indicate that the root-mean-square difference (rmsd) between the output of TALOS and the X-ray derived backbone angles is about 15 deg. Approximately 3% of the predictions made by TALOS are found to be in error.

  1. Automated backbone assignment of labeled proteins using the threshold accepting algorithm

    International Nuclear Information System (INIS)

    Leutner, Michael; Gschwind, Ruth M.; Liermann, Jens; Schwarz, Christian; Gemmecker, Gerd; Kessler, Horst

    1998-01-01

    The sequential assignment of backbone resonances is the first step in the structure determination of proteins by heteronuclear NMR. For larger proteins, an assignment strategy based on proton side-chain information is no longer suitable for the use in an automated procedure. Our program PASTA (Protein ASsignment by Threshold Accepting) is therefore designed to partially or fully automate the sequential assignment of proteins, based on the analysis of NMR backbone resonances plus C β information. In order to overcome the problems caused by peak overlap and missing signals in an automated assignment process, PASTA uses threshold accepting, a combinatorial optimization strategy, which is superior to simulated annealing due to generally faster convergence and better solutions. The reliability of this algorithm is shown by reproducing the complete sequential backbone assignment of several proteins from published NMR data. The robustness of the algorithm against misassigned signals, noise, spectral overlap and missing peaks is shown by repeating the assignment with reduced sequential information and increased chemical shift tolerances. The performance of the program on real data is finally demonstrated with automatically picked peak lists of human nonpancreatic synovial phospholipase A 2 , a protein with 124 residues

  2. Protein backbone and sidechain torsion angles predicted from NMR chemical shifts using artificial neural networks

    Energy Technology Data Exchange (ETDEWEB)

    Shen Yang; Bax, Ad, E-mail: bax@nih.gov [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    2013-07-15

    A new program, TALOS-N, is introduced for predicting protein backbone torsion angles from NMR chemical shifts. The program relies far more extensively on the use of trained artificial neural networks than its predecessor, TALOS+. Validation on an independent set of proteins indicates that backbone torsion angles can be predicted for a larger, {>=}90 % fraction of the residues, with an error rate smaller than ca 3.5 %, using an acceptance criterion that is nearly two-fold tighter than that used previously, and a root mean square difference between predicted and crystallographically observed ({phi}, {psi}) torsion angles of ca 12 Masculine-Ordinal-Indicator . TALOS-N also reports sidechain {chi}{sup 1} rotameric states for about 50 % of the residues, and a consistency with reference structures of 89 %. The program includes a neural network trained to identify secondary structure from residue sequence and chemical shifts.

  3. On the relationship between NMR-derived amide order parameters and protein backbone entropy changes.

    Science.gov (United States)

    Sharp, Kim A; O'Brien, Evan; Kasinath, Vignesh; Wand, A Joshua

    2015-05-01

    Molecular dynamics simulations are used to analyze the relationship between NMR-derived squared generalized order parameters of amide NH groups and backbone entropy. Amide order parameters (O(2) NH ) are largely determined by the secondary structure and average values appear unrelated to the overall flexibility of the protein. However, analysis of the more flexible subset (O(2) NH  entropy than that reported by the side chain methyl axis order parameters, O(2) axis . A calibration curve for backbone entropy vs. O(2) NH is developed, which accounts for both correlations between amide group motions of different residues, and correlations between backbone and side chain motions. This calibration curve can be used with experimental values of O(2) NH changes obtained by NMR relaxation measurements to extract backbone entropy changes, for example, upon ligand binding. In conjunction with our previous calibration for side chain entropy derived from measured O(2) axis values this provides a prescription for determination of the total protein conformational entropy changes from NMR relaxation measurements. © 2015 Wiley Periodicals, Inc.

  4. Solution Structure and Backbone Dynamics of the Pleckstrin Homology Domain of the Human Protein Kinase B (PKB/Akt). Interaction with Inositol Phosphates

    International Nuclear Information System (INIS)

    Auguin, Daniel; Barthe, Philippe; Auge-Senegas, Marie-Therese; Stern, Marc-Henri; Noguchi, Masayuki; Roumestand, Christian

    2004-01-01

    The programmed cell death occurs as part of normal mammalian development. The induction of developmental cell death is a highly regulated process and can be suppressed by a variety of extracellular stimuli. Recently, the ability of trophic factors to promote survival have been attributed, at least in part, to the phosphatidylinositide 3'-OH kinase (PI3K)/Protein Kinase B (PKB, also named Akt) cascade. Several targets of the PI3K/PKB signaling pathway have been identified that may underlie the ability of this regulatory cascade to promote cell survival. PKB possesses a N-terminal Pleckstrin Homology (PH) domain that binds specifically and with high affinity to PtIns(3,4,5)P 3 and PtIns(3,4)P 2 , the PI3K second messengers. PKB is then recruited to the plasma membrane by virtue of its interaction with 3'-OH phosphatidylinositides and activated. Recent evidence indicates that PKB is active in various types of human cancer; constitutive PKB signaling activation is believed to promote proliferation and increased cell survival, thereby contributing to cancer progression. Thus, it has been shown that induction of PKB activity is augmented by the TCL1/MTCP1 oncoproteins through a physical association requiring the PKB PH domain. Here we present the three-dimensional solution structure of the PH domain of the human protein PKB (isoform β). PKBβ-PH is an electrostatically polarized molecule that adopts the same fold and topology as other PH-domains, consisting of a β-sandwich of seven strands capped on one top by an α-helix. The opposite face presents three variable loops that appear poorly defined in the NMR structure. Measurements of 15 N spin relaxation times and heteronuclear 15 N{ 1 H}NOEs showed that this poor definition is due to intrinsic flexibility, involving complex motions on different time scales. Chemical shift mapping studies correctly defined the binding site of Ins(1,3,4,5)P 4 (the head group of PtIns(3,4,5)P 3 ), as was previously proposed from a

  5. Protein backbone chemical shifts predicted from searching a database for torsion angle and sequence homology

    International Nuclear Information System (INIS)

    Shen Yang; Bax, Ad

    2007-01-01

    Chemical shifts of nuclei in or attached to a protein backbone are exquisitely sensitive to their local environment. A computer program, SPARTA, is described that uses this correlation with local structure to predict protein backbone chemical shifts, given an input three-dimensional structure, by searching a newly generated database for triplets of adjacent residues that provide the best match in φ/ψ/χ 1 torsion angles and sequence similarity to the query triplet of interest. The database contains 15 N, 1 H N , 1 H α , 13 C α , 13 C β and 13 C' chemical shifts for 200 proteins for which a high resolution X-ray (≤2.4 A) structure is available. The relative importance of the weighting factors for the φ/ψ/χ 1 angles and sequence similarity was optimized empirically. The weighted, average secondary shifts of the central residues in the 20 best-matching triplets, after inclusion of nearest neighbor, ring current, and hydrogen bonding effects, are used to predict chemical shifts for the protein of known structure. Validation shows good agreement between the SPARTA-predicted and experimental shifts, with standard deviations of 2.52, 0.51, 0.27, 0.98, 1.07 and 1.08 ppm for 15 N, 1 H N , 1 H α , 13 C α , 13 C β and 13 C', respectively, including outliers

  6. TALOS+: a hybrid method for predicting protein backbone torsion angles from NMR chemical shifts

    Energy Technology Data Exchange (ETDEWEB)

    Shen Yang; Delaglio, Frank [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States); Cornilescu, Gabriel [National Magnetic Resonance Facility (United States); Bax, Ad [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)], E-mail: bax@nih.gov

    2009-08-15

    NMR chemical shifts in proteins depend strongly on local structure. The program TALOS establishes an empirical relation between {sup 13}C, {sup 15}N and {sup 1}H chemical shifts and backbone torsion angles {phi} and {psi} (Cornilescu et al. J Biomol NMR 13 289-302, 1999). Extension of the original 20-protein database to 200 proteins increased the fraction of residues for which backbone angles could be predicted from 65 to 74%, while reducing the error rate from 3 to 2.5%. Addition of a two-layer neural network filter to the database fragment selection process forms the basis for a new program, TALOS+, which further enhances the prediction rate to 88.5%, without increasing the error rate. Excluding the 2.5% of residues for which TALOS+ makes predictions that strongly differ from those observed in the crystalline state, the accuracy of predicted {phi} and {psi} angles, equals {+-}13{sup o}. Large discrepancies between predictions and crystal structures are primarily limited to loop regions, and for the few cases where multiple X-ray structures are available such residues are often found in different states in the different structures. The TALOS+ output includes predictions for individual residues with missing chemical shifts, and the neural network component of the program also predicts secondary structure with good accuracy.

  7. Capturing non-local interactions by long short-term memory bidirectional recurrent neural networks for improving prediction of protein secondary structure, backbone angles, contact numbers and solvent accessibility.

    Science.gov (United States)

    Heffernan, Rhys; Yang, Yuedong; Paliwal, Kuldip; Zhou, Yaoqi

    2017-09-15

    The accuracy of predicting protein local and global structural properties such as secondary structure and solvent accessible surface area has been stagnant for many years because of the challenge of accounting for non-local interactions between amino acid residues that are close in three-dimensional structural space but far from each other in their sequence positions. All existing machine-learning techniques relied on a sliding window of 10-20 amino acid residues to capture some 'short to intermediate' non-local interactions. Here, we employed Long Short-Term Memory (LSTM) Bidirectional Recurrent Neural Networks (BRNNs) which are capable of capturing long range interactions without using a window. We showed that the application of LSTM-BRNN to the prediction of protein structural properties makes the most significant improvement for residues with the most long-range contacts (|i-j| >19) over a previous window-based, deep-learning method SPIDER2. Capturing long-range interactions allows the accuracy of three-state secondary structure prediction to reach 84% and the correlation coefficient between predicted and actual solvent accessible surface areas to reach 0.80, plus a reduction of 5%, 10%, 5% and 10% in the mean absolute error for backbone ϕ , ψ , θ and τ angles, respectively, from SPIDER2. More significantly, 27% of 182724 40-residue models directly constructed from predicted C α atom-based θ and τ have similar structures to their corresponding native structures (6Å RMSD or less), which is 3% better than models built by ϕ and ψ angles. We expect the method to be useful for assisting protein structure and function prediction. The method is available as a SPIDER3 server and standalone package at http://sparks-lab.org . yaoqi.zhou@griffith.edu.au or yuedong.yang@griffith.edu.au. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email

  8. Hidden Markov model approach for identifying the modular framework of the protein backbone.

    Science.gov (United States)

    Camproux, A C; Tuffery, P; Chevrolat, J P; Boisvieux, J F; Hazout, S

    1999-12-01

    The hidden Markov model (HMM) was used to identify recurrent short 3D structural building blocks (SBBs) describing protein backbones, independently of any a priori knowledge. Polypeptide chains are decomposed into a series of short segments defined by their inter-alpha-carbon distances. Basically, the model takes into account the sequentiality of the observed segments and assumes that each one corresponds to one of several possible SBBs. Fitting the model to a database of non-redundant proteins allowed us to decode proteins in terms of 12 distinct SBBs with different roles in protein structure. Some SBBs correspond to classical regular secondary structures. Others correspond to a significant subdivision of their bounding regions previously considered to be a single pattern. The major contribution of the HMM is that this model implicitly takes into account the sequential connections between SBBs and thus describes the most probable pathways by which the blocks are connected to form the framework of the protein structures. Validation of the SBBs code was performed by extracting SBB series repeated in recoding proteins and examining their structural similarities. Preliminary results on the sequence specificity of SBBs suggest promising perspectives for the prediction of SBBs or series of SBBs from the protein sequences.

  9. Selective backbone labelling of ILV methyl labelled proteins

    International Nuclear Information System (INIS)

    Sibille, Nathalie; Hanoulle, Xavier; Bonachera, Fanny; Verdegem, Dries; Landrieu, Isabelle; Wieruszeski, Jean-Michel; Lippens, Guy

    2009-01-01

    Adding the 13 C labelled 2-keto-isovalerate and 2-oxobutanoate precursors to a minimal medium composed of 12 C labelled glucose instead of the commonly used ( 2 D, 13 C) glucose leads not only to the 13 C labelling of (I, L, V) methyls but also to the selective 13 C labelling of the backbone C α and CO carbons of the Ile and Val residues. As a result, the backbone ( 1 H, 15 N) correlations of the Ile and Val residues and their next neighbours in the (i + 1) position can be selectively identified in HN(CA) and HN(CO) planes. The availability of a selective HSQC spectrum corresponding to the sole amide resonances of the Ile and Val residues allows connecting them to their corresponding methyls by the intra-residue NOE effect, and should therefore be applicable to larger systems

  10. Assignment of protein backbone resonances using connectivity, torsion angles and 13Cα chemical shifts

    International Nuclear Information System (INIS)

    Morris, Laura C.; Valafar, Homayoun; Prestegard, James H.

    2004-01-01

    A program is presented which will return the most probable sequence location for a short connected set of residues in a protein given just 13 C α chemical shifts (δ( 13 C α )) and data restricting the φ and ψ backbone angles. Data taken from both the BioMagResBank and the Protein Data Bank were used to create a probability density function (PDF) using a multivariate normal distribution in δ( 13 C α ), φ, and ψ space for each amino acid residue. Extracting and combining probabilities for particular amino acid residues in a short proposed sequence yields a score indicative of the correctness of the proposed assignment. The program is illustrated using several proteins for which structure and 13 C α chemical shift data are available

  11. Wetting of nonconserved residue-backbones: A feature indicative of aggregation associated regions of proteins.

    Science.gov (United States)

    Pradhan, Mohan R; Pal, Arumay; Hu, Zhongqiao; Kannan, Srinivasaraghavan; Chee Keong, Kwoh; Lane, David P; Verma, Chandra S

    2016-02-01

    Aggregation is an irreversible form of protein complexation and often toxic to cells. The process entails partial or major unfolding that is largely driven by hydration. We model the role of hydration in aggregation using "Dehydrons." "Dehydrons" are unsatisfied backbone hydrogen bonds in proteins that seek shielding from water molecules by associating with ligands or proteins. We find that the residues at aggregation interfaces have hydrated backbones, and in contrast to other forms of protein-protein interactions, are under less evolutionary pressure to be conserved. Combining evolutionary conservation of residues and extent of backbone hydration allows us to distinguish regions on proteins associated with aggregation (non-conserved dehydron-residues) from other interaction interfaces (conserved dehydron-residues). This novel feature can complement the existing strategies used to investigate protein aggregation/complexation. © 2015 Wiley Periodicals, Inc.

  12. On Backbone Structure for a Future Multipurpose Network

    DEFF Research Database (Denmark)

    Gutierrez Lopez, Jose Manuel; Cuevas, Ruben; Riaz, M. Tahir

    2008-01-01

    Telecommunications are evolving towards the unification of services and infrastructures. This unification must be achieved at the highest hierarchical level for a complete synergy of services. Therefore, one of the requirements is a multipurpose backbone network capable of supporting all the curr......Telecommunications are evolving towards the unification of services and infrastructures. This unification must be achieved at the highest hierarchical level for a complete synergy of services. Therefore, one of the requirements is a multipurpose backbone network capable of supporting all...

  13. The determinants of bond angle variability in protein/peptide backbones: A comprehensive statistical/quantum mechanics analysis.

    Science.gov (United States)

    Improta, Roberto; Vitagliano, Luigi; Esposito, Luciana

    2015-11-01

    The elucidation of the mutual influence between peptide bond geometry and local conformation has important implications for protein structure refinement, validation, and prediction. To gain insights into the structural determinants and the energetic contributions associated with protein/peptide backbone plasticity, we here report an extensive analysis of the variability of the peptide bond angles by combining statistical analyses of protein structures and quantum mechanics calculations on small model peptide systems. Our analyses demonstrate that all the backbone bond angles strongly depend on the peptide conformation and unveil the existence of regular trends as function of ψ and/or φ. The excellent agreement of the quantum mechanics calculations with the statistical surveys of protein structures validates the computational scheme here employed and demonstrates that the valence geometry of protein/peptide backbone is primarily dictated by local interactions. Notably, for the first time we show that the position of the H(α) hydrogen atom, which is an important parameter in NMR structural studies, is also dependent on the local conformation. Most of the trends observed may be satisfactorily explained by invoking steric repulsive interactions; in some specific cases the valence bond variability is also influenced by hydrogen-bond like interactions. Moreover, we can provide a reliable estimate of the energies involved in the interplay between geometry and conformations. © 2015 Wiley Periodicals, Inc.

  14. Cross-correlated relaxation rates between protein backbone H–X dipolar interactions

    International Nuclear Information System (INIS)

    Vögeli, Beat

    2017-01-01

    The relaxation interference between dipole–dipole interactions of two separate spin pairs carries structural and dynamics information. In particular, when compared to individual dynamic behavior of those spin pairs, such cross-correlated relaxation (CCR) rates report on the correlation between the spin pairs. We have recently mapped out correlated motion along the backbone of the protein GB3, using CCR rates among and between consecutive H N –N and H α –C α dipole–dipole interactions. Here, we provide a detailed account of the measurement of the four types of CCR rates. All rates were obtained from at least two different pulse sequences, of which the yet unpublished ones are presented. Detailed comparisons between the different methods and corrections for unwanted pathways demonstrate that the averaged CCR rates are highly accurate and precise with errors of 1.5–3% of the entire value ranges.

  15. Cross-correlated relaxation rates between protein backbone H–X dipolar interactions

    Energy Technology Data Exchange (ETDEWEB)

    Vögeli, Beat, E-mail: beat.vogeli@ucdenver.edu [University of Colorado Denver, Department of Biochemistry and Molecular Genetics (United States)

    2017-03-15

    The relaxation interference between dipole–dipole interactions of two separate spin pairs carries structural and dynamics information. In particular, when compared to individual dynamic behavior of those spin pairs, such cross-correlated relaxation (CCR) rates report on the correlation between the spin pairs. We have recently mapped out correlated motion along the backbone of the protein GB3, using CCR rates among and between consecutive H{sup N}–N and H{sup α}–C{sup α} dipole–dipole interactions. Here, we provide a detailed account of the measurement of the four types of CCR rates. All rates were obtained from at least two different pulse sequences, of which the yet unpublished ones are presented. Detailed comparisons between the different methods and corrections for unwanted pathways demonstrate that the averaged CCR rates are highly accurate and precise with errors of 1.5–3% of the entire value ranges.

  16. Quantification of protein backbone hydrogen-deuterium exchange rates by solid state NMR spectroscopy

    International Nuclear Information System (INIS)

    Lopez del Amo, Juan-Miguel; Fink, Uwe; Reif, Bernd

    2010-01-01

    We present the quantification of backbone amide hydrogen-deuterium exchange rates (HDX) for immobilized proteins. The experiments make use of the deuterium isotope effect on the amide nitrogen chemical shift, as well as on proton dilution by deuteration. We find that backbone amides in the microcrystalline α-spectrin SH3 domain exchange rather slowly with the solvent (with exchange rates negligible within the individual 15 N-T 1 timescales). We observed chemical exchange for 6 residues with HDX exchange rates in the range from 0.2 to 5 s -1 . Backbone amide 15 N longitudinal relaxation times that we determined previously are not significantly affected for most residues, yielding no systematic artifacts upon quantification of backbone dynamics (Chevelkov et al. 2008b). Significant exchange was observed for the backbone amides of R21, S36 and K60, as well as for the sidechain amides of N38, N35 and for W41ε. These residues could not be fit in our previous motional analysis, demonstrating that amide proton chemical exchange needs to be considered in the analysis of protein dynamics in the solid-state, in case D 2 O is employed as a solvent for sample preparation. Due to the intrinsically long 15 N relaxation times in the solid-state, the approach proposed here can expand the range of accessible HDX rates in the intermediate regime that is not accessible so far with exchange quench and MEXICO type experiments.

  17. The Role of Backbone Hydrogen Bonds in the Transition State for Protein Folding of a PDZ Domain.

    Directory of Open Access Journals (Sweden)

    Søren W. Pedersen

    Full Text Available Backbone hydrogen bonds are important for the structure and stability of proteins. However, since conventional site-directed mutagenesis cannot be applied to perturb the backbone, the contribution of these hydrogen bonds in protein folding and stability has been assessed only for a very limited set of small proteins. We have here investigated effects of five amide-to-ester mutations in the backbone of a PDZ domain, a 90-residue globular protein domain, to probe the influence of hydrogen bonds in a β-sheet for folding and stability. The amide-to-ester mutation removes NH-mediated hydrogen bonds and destabilizes hydrogen bonds formed by the carbonyl oxygen. The overall stability of the PDZ domain generally decreased for all amide-to-ester mutants due to an increase in the unfolding rate constant. For this particular region of the PDZ domain, it is therefore clear that native hydrogen bonds are formed after crossing of the rate-limiting barrier for folding. Moreover, three of the five amide-to-ester mutants displayed an increase in the folding rate constant suggesting that the hydrogen bonds are involved in non-native interactions in the transition state for folding.

  18. Solution, solid phase and computational structures of apicidin and its backbone-reduced analogs.

    Science.gov (United States)

    Kranz, Michael; Murray, Peter John; Taylor, Stephen; Upton, Richard J; Clegg, William; Elsegood, Mark R J

    2006-06-01

    The recently isolated broad-spectrum antiparasitic apicidin (1) is one of the few naturally occurring cyclic tetrapeptides (CTP). Depending on the solvent, the backbone of 1 exhibits two gamma-turns (in CH(2)Cl(2)) or a beta-turn (in DMSO), differing solely in the rotation of the plane of one of the amide bonds. In the X-ray crystal structure, the peptidic C==Os and NHs are on opposite sides of the backbone plane, giving rise to infinite stacks of cyclotetrapeptides connected by three intermolecular hydrogen bonds between the backbones. Conformational searches (Amber force field) on a truncated model system of 1 confirm all three backbone conformations to be low-energy states. The previously synthesized analogs of 1 containing a reduced amide bond exhibit the same backbone conformation as 1 in DMSO, which is confirmed further by the X-ray crystal structure of a model system of the desoxy analogs of 1. This similarity helps in explaining why the desoxy analogs retain some of the antiprotozoal activities of apicidin. The backbone-reduction approach designed to facilitate the cyclization step of the acyclic precursors of the CTPs seems to retain the conformational preferences of the parent peptide backbone.

  19. 4D experiments measured with APSY for automated backbone resonance assignments of large proteins

    International Nuclear Information System (INIS)

    Krähenbühl, Barbara; Boudet, Julien; Wider, Gerhard

    2013-01-01

    Detailed structural and functional characterization of proteins by solution NMR requires sequence-specific resonance assignment. We present a set of transverse relaxation optimization (TROSY) based four-dimensional automated projection spectroscopy (APSY) experiments which are designed for resonance assignments of proteins with a size up to 40 kDa, namely HNCACO, HNCOCA, HNCACB and HN(CO)CACB. These higher-dimensional experiments include several sensitivity-optimizing features such as multiple quantum parallel evolution in a ‘just-in-time’ manner, aliased off-resonance evolution, evolution-time optimized APSY acquisition, selective water-handling and TROSY. The experiments were acquired within the concept of APSY, but they can also be used within the framework of sparsely sampled experiments. The multidimensional peak lists derived with APSY provided chemical shifts with an approximately 20 times higher precision than conventional methods usually do, and allowed the assignment of 90 % of the backbone resonances of the perdeuterated primase-polymerase ORF904, which contains 331 amino acid residues and has a molecular weight of 38.4 kDa.

  20. PASA - A Program for Automated Protein NMR Backbone Signal Assignment by Pattern-Filtering Approach

    International Nuclear Information System (INIS)

    Xu Yizhuang; Wang Xiaoxia; Yang Jun; Vaynberg, Julia; Qin Jun

    2006-01-01

    We present a new program, PASA (Program for Automated Sequential Assignment), for assigning protein backbone resonances based on multidimensional heteronuclear NMR data. Distinct from existing programs, PASA emphasizes a per-residue-based pattern-filtering approach during the initial stage of the automated 13 C α and/or 13 C β chemical shift matching. The pattern filter employs one or multiple constraints such as 13 C α /C β chemical shift ranges for different amino acid types and side-chain spin systems, which helps to rule out, in a stepwise fashion, improbable assignments as resulted from resonance degeneracy or missing signals. Such stepwise filtering approach substantially minimizes early false linkage problems that often propagate, amplify, and ultimately cause complication or combinatorial explosion of the automation process. Our program (http://www.lerner.ccf.org/moleccard/qin/) was tested on four representative small-large sized proteins with various degrees of resonance degeneracy and missing signals, and we show that PASA achieved the assignments efficiently and rapidly that are fully consistent with those obtained by laborious manual protocols. The results demonstrate that PASA may be a valuable tool for NMR-based structural analyses, genomics, and proteomics

  1. Predicting backbone Cα angles and dihedrals from protein sequences by stacked sparse auto-encoder deep neural network.

    Science.gov (United States)

    Lyons, James; Dehzangi, Abdollah; Heffernan, Rhys; Sharma, Alok; Paliwal, Kuldip; Sattar, Abdul; Zhou, Yaoqi; Yang, Yuedong

    2014-10-30

    Because a nearly constant distance between two neighbouring Cα atoms, local backbone structure of proteins can be represented accurately by the angle between C(αi-1)-C(αi)-C(αi+1) (θ) and a dihedral angle rotated about the C(αi)-C(αi+1) bond (τ). θ and τ angles, as the representative of structural properties of three to four amino-acid residues, offer a description of backbone conformations that is complementary to φ and ψ angles (single residue) and secondary structures (>3 residues). Here, we report the first machine-learning technique for sequence-based prediction of θ and τ angles. Predicted angles based on an independent test have a mean absolute error of 9° for θ and 34° for τ with a distribution on the θ-τ plane close to that of native values. The average root-mean-square distance of 10-residue fragment structures constructed from predicted θ and τ angles is only 1.9Å from their corresponding native structures. Predicted θ and τ angles are expected to be complementary to predicted ϕ and ψ angles and secondary structures for using in model validation and template-based as well as template-free structure prediction. The deep neural network learning technique is available as an on-line server called Structural Property prediction with Integrated DEep neuRal network (SPIDER) at http://sparks-lab.org. Copyright © 2014 Wiley Periodicals, Inc.

  2. A density functional study of backbone structures of polydiacetylene: destabilization of butatriene structure

    International Nuclear Information System (INIS)

    Katagiri, Hideki; Shimoi, Yukihiro; Abe, Shuji

    2004-01-01

    Backbone structures of polydiacetylene are studied with first-principles electronic structure method using plane-waves within generalized gradient approximation (GGA) of density functional theory. In spin-restricted calculations a coarse k-point sampling gives a potential energy curve with two local minima corresponding to acetylene and butatriene structures. However, the potential barrier between the two structures rapidly decreases with increasing number of k-points, which results in destabilization of the butatriene structure. Spin polarization effects also destabilize the butatriene structure, inducing atom-centered spin-density-wave state. These potential energies were compared with those obtained by Hartree-Fock, density functional within local density approximation (LDA) and GGA, and hybrid density functional methods using a gaussian basis set. The comparison shows that the density functional methods within LDA and GGA favor the destabilization of the butatriene structure in contrast to the Hartree-Fock method

  3. Influence of structures of polymer backbones on cooperative photoreorientation behavior of p-cyanoazobenzene side chains

    DEFF Research Database (Denmark)

    Han, Mina; Kidowaki, Masatoshi; Ichimura, Kunihiro

    2001-01-01

    Photoinduced orientational behavior of a polymethacrylate (CN6) and a polyester (p6a12) with p-cyanoazobenzene side chains was studied to reveal the structural effect of the liquid crystalline polymer backbones. Irradiation with linearly polarized W light resulted in the reorientation of the azob...

  4. Protein backbone motions viewed by intraresidue and sequential H{sup N}-H{sup {alpha}} residual dipolar couplings

    Energy Technology Data Exchange (ETDEWEB)

    Voegeli, Beat; Yao Lishan; Bax, Ad [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)], E-mail: bax@nih.gov

    2008-05-15

    Triple resonance E.COSY-based techniques were used to measure intra-residue and sequential H{sup N}-H{sup {alpha}} residual dipolar couplings (RDCs) for the third IgG-binding domain of protein G (GB3), aligned in Pf1 medium. Measurements closely correlate with values predicted on the basis of an NMR structure, previously determined on the basis of a large number of one-bond backbone RDCs measured in five alignment media. However, in particular the sequential H{sup N}-H{sup {alpha}} RDCs are smaller than predicted for a static structure, suggesting a degree of motion for these internuclear vectors that exceeds that of the backbone amide N-H vectors. Of all experimentally determined GB3 structures available, the best correlation between experimental {sup 1}H-{sup 1}H couplings is observed for a GB3 ensemble, previously derived to generate a realistic picture of the conformational space sampled by GB3 (Clore and Schwieters, J Mol Biol 355:879-886, 2006). However, for both NMR and X-ray-derived structures the {sup 1}H-{sup 1}H couplings are found to be systematically smaller than expected on the basis of alignment tensors derived from {sup 15}N-{sup 1}H amide RDCs, assuming librationally corrected N-H bond lengths of 1.041 A.

  5. Three-Dimensional Protein Fold Determination from Backbone Amide Pseudocontact Shifts Generated by Lanthanide Tags at Multiple Sites

    KAUST Repository

    Yagi, Hiromasa

    2013-06-01

    Site-specific attachment of paramagnetic lanthanide ions to a protein generates pseudocontact shifts (PCS) in the nuclear magnetic resonance (NMR) spectra of the protein that are easily measured as changes in chemical shifts. By labeling the protein with lanthanide tags at four different sites, PCSs are observed for most amide protons and accurate information is obtained about their coordinates in three-dimensional space. The approach is demonstrated with the chaperone ERp29, for which large differences have been reported between X-ray and NMR structures of the C-terminal domain, ERp29-C. The results unambiguously show that the structure of rat ERp29-C in solution is similar to the crystal structure of human ERp29-C. PCSs of backbone amides were the only structural restraints required. Because these can be measured for more dilute protein solutions than other NMR restraints, the approach greatly widens the range of proteins amenable to structural studies in solution. © 2013 Elsevier Ltd. All rights reserved.

  6. Backbone structure of Yersinia pestis Ail determined in micelles by NMR-restrained simulated annealing with implicit membrane solvation

    International Nuclear Information System (INIS)

    Marassi, Francesca M.; Ding, Yi; Schwieters, Charles D.; Tian, Ye; Yao, Yong

    2015-01-01

    The outer membrane protein Ail (attachment invasion locus) is a virulence factor of Yersinia pestis that mediates cell invasion, cell attachment and complement resistance. Here we describe its three-dimensional backbone structure determined in decyl-phosphocholine (DePC) micelles by NMR spectroscopy. The NMR structure was calculated using the membrane function of the implicit solvation potential, eefxPot, which we have developed to facilitate NMR structure calculations in a physically realistic environment. We show that the eefxPot force field guides the protein towards its native fold. The resulting structures provide information about the membrane-embedded global position of Ail, and have higher accuracy, higher precision and improved conformational properties, compared to the structures calculated with the standard repulsive potential

  7. A Bayesian-probability-based method for assigning protein backbone dihedral angles based on chemical shifts and local sequences

    Energy Technology Data Exchange (ETDEWEB)

    Wang Jun; Liu Haiyan [University of Science and Technology of China, Hefei National Laboratory for Physical Sciences at the Microscale, and Key Laboratory of Structural Biology, School of Life Sciences (China)], E-mail: hyliu@ustc.edu.cn

    2007-01-15

    Chemical shifts contain substantial information about protein local conformations. We present a method to assign individual protein backbone dihedral angles into specific regions on the Ramachandran map based on the amino acid sequences and the chemical shifts of backbone atoms of tripeptide segments. The method uses a scoring function derived from the Bayesian probability for the central residue of a query tripeptide segment to have a particular conformation. The Ramachandran map is partitioned into representative regions at two levels of resolution. The lower resolution partitioning is equivalent to the conventional definitions of different secondary structure regions on the map. At the higher resolution level, the {alpha} and {beta} regions are further divided into subregions. Predictions are attempted at both levels of resolution. We compared our method with TALOS using the original TALOS database, and obtained comparable results. Although TALOS may produce the best results with currently available databases which are much enlarged, the Bayesian-probability-based approach can provide a quantitative measure for the reliability of predictions.

  8. On the purported "backbone fluorescence" in protein three-dimensional fluorescence spectra

    DEFF Research Database (Denmark)

    Bortolotti, Annalisa; Wong, Yin How; Korsholm, Stine S.

    2016-01-01

    In this study, several proteins (albumin, lysozyme, insulin) and model compounds (Trp, Tyr, homopolypeptides) were used to demonstrate the origin of the fluorescence observed upon their excitation at 220-230 nm. In the last 10 years we have observed a worrying increase in the number of articles...... as any traditional protein emission spectrum. The many papers in reputable journals erroneously reporting this peak assignment, contradicting 5 decades of prior knowledge, have led to the creation of a new dogma, where many authors and reviewers now take the purported backbone fluorescence...... as an established fact. We hope the current paper helps counter this new situation and leads to a reassessment of those papers that make this erroneous claim....

  9. Neural Networks for protein Structure Prediction

    DEFF Research Database (Denmark)

    Bohr, Henrik

    1998-01-01

    This is a review about neural network applications in bioinformatics. Especially the applications to protein structure prediction, e.g. prediction of secondary structures, prediction of surface structure, fold class recognition and prediction of the 3-dimensional structure of protein backbones...

  10. Probing the role of backbone hydrogen bonds in protein-peptide interactions by amide-to-ester mutations

    DEFF Research Database (Denmark)

    Eildal, Jonas N N; Hultqvist, Greta; Balle, Thomas

    2013-01-01

    -protein interactions, those of the PDZ domain family involve formation of intermolecular hydrogen bonds: C-termini or internal linear motifs of proteins bind as β-strands to form an extended antiparallel β-sheet with the PDZ domain. Whereas extensive work has focused on the importance of the amino acid side chains...... of the protein ligand, the role of the backbone hydrogen bonds in the binding reaction is not known. Using amide-to-ester substitutions to perturb the backbone hydrogen-bonding pattern, we have systematically probed putative backbone hydrogen bonds between four different PDZ domains and peptides corresponding...... to natural protein ligands. Amide-to-ester mutations of the three C-terminal amides of the peptide ligand severely affected the affinity with the PDZ domain, demonstrating that hydrogen bonds contribute significantly to ligand binding (apparent changes in binding energy, ΔΔG = 1.3 to >3.8 kcal mol(-1...

  11. Chemical synthesis of membrane proteins by the removable backbone modification method.

    Science.gov (United States)

    Tang, Shan; Zuo, Chao; Huang, Dong-Liang; Cai, Xiao-Ying; Zhang, Long-Hua; Tian, Chang-Lin; Zheng, Ji-Shen; Liu, Lei

    2017-12-01

    Chemical synthesis can produce membrane proteins bearing specifically designed modifications (e.g., phosphorylation, isotope labeling) that are difficult to obtain through recombinant protein expression approaches. The resulting homogeneously modified synthetic membrane proteins are valuable tools for many advanced biochemical and biophysical studies. This protocol describes the chemical synthesis of membrane proteins by condensation of transmembrane peptide segments through native chemical ligation. To avoid common problems encountered due to the poor solubility of transmembrane peptides in almost any solvent, we describe an effective procedure for the chemical synthesis of membrane proteins through the removable-backbone modification (RBM) strategy. Two key steps of this protocol are: (i) installation of solubilizing Arg4-tagged RBM groups into the transmembrane peptides at any primary amino acid through Fmoc (9-fluorenylmethyloxycarbonyl) solid-phase peptide synthesis and (ii) native ligation of the full-length sequence, followed by removal of the RBM tags by TFA (trifluoroacetic acid) cocktails to afford the native protein. The installation of RBM groups is achieved by using 4-methoxy-5-nitrosalicyladehyde by reduction amination to incorporate an activated O-to-N acyl transfer auxiliary. The Arg4-tag-modified membrane-spanning peptide segments behave like water-soluble peptides to facilitate their purification, ligation and mass characterization.

  12. Optical alignment control of polyimide molecules containing azobenzene in the backbone structure

    International Nuclear Information System (INIS)

    Sakamoto, Kenji; Usami, Kiyoaki; Sasaki, Toru; Kanayama, Takashi; Ushioda, Sukekatsu

    2004-01-01

    Using polarized infrared absorption spectroscopy, we have determined the orientation of the polyimide backbone structure in photo-alignment films for liquid crystals (LC). The polyimide used in this study contains azobenzene in the backbone structure. Photo-alignment treatment was performed on the corresponding polyamic acid film, using a light source of wavelength 340-500 nm. The polyamic acid film (∼16 nm thick) was first irradiated at normal incidence with linearly polarized light (LP-light) of 156 J/cm 2 , and then oblique angle irradiation of unpolarized light (UP-light) was performed in the plane of incidence perpendicular to the polarization direction of the LP-light. The UP-light exposure was varied up to 882 J/cm 2 . We found that the average inclination angle of the polyimide backbone structure, measured from the surface plane, increases almost linearly with UP-light exposure. On the other hand, the in-plane anisotropy induced by the first irradiation with LP-light decreases with the increase of UP-light exposure

  13. Effect of backbone structure on charge transport along isolated conjugated polymer chains

    International Nuclear Information System (INIS)

    Siebbeles, Laurens D.A.; Grozema, Ferdinand C.; Haas, Matthijs P. de; Warman, John M.

    2005-01-01

    Fast charge transport in conjugated polymers is essential for their application in opto-electronic devices. In the present paper, measurements and theoretical modeling of the mobility of excess charges along isolated chains of conjugated polymers in dilute solution are presented. Charge carriers were produced by irradiation of the polymer solution with 3-MeV electrons from a Van de Graaff accelerator. The mobilities of the charges along the polymer chains were obtained from time-resolved microwave conductivity measurements. The mobilities are strongly dependent on the chemical nature of the polymer backbone. Comparison of the experimental data with results from ab initio quantum mechanical calculations shows that the measured mobilities are strongly limited by torsional disorder, chemical defects and chain ends. Improvement of the structure of polymer backbones is therefore expected to significantly enhance the performance of these materials in 'plastic electronics'

  14. Systematic determination of the mosaic structure of bacterial genomes: species backbone versus strain-specific loops

    Directory of Open Access Journals (Sweden)

    Gendrault-Jacquemard A

    2005-07-01

    Full Text Available Abstract Background Public databases now contain multitude of complete bacterial genomes, including several genomes of the same species. The available data offers new opportunities to address questions about bacterial genome evolution, a task that requires reliable fine comparison data of closely related genomes. Recent analyses have shown, using pairwise whole genome alignments, that it is possible to segment bacterial genomes into a common conserved backbone and strain-specific sequences called loops. Results Here, we generalize this approach and propose a strategy that allows systematic and non-biased genome segmentation based on multiple genome alignments. Segmentation analyses, as applied to 13 different bacterial species, confirmed the feasibility of our approach to discern the 'mosaic' organization of bacterial genomes. Segmentation results are available through a Web interface permitting functional analysis, extraction and visualization of the backbone/loops structure of documented genomes. To illustrate the potential of this approach, we performed a precise analysis of the mosaic organization of three E. coli strains and functional characterization of the loops. Conclusion The segmentation results including the backbone/loops structure of 13 bacterial species genomes are new and available for use by the scientific community at the URL: http://genome.jouy.inra.fr/mosaic.

  15. Solution NMR Structures of Oxidized and Reduced Ehrlichia chaffeensis thioredoxin: NMR-Invisible Structure Owing to Backbone Dynamics

    Energy Technology Data Exchange (ETDEWEB)

    Buchko, Garry W.; Hewitt, Stephen N.; Van Voorhis, Wesley C.; Myler, Peter J.

    2018-01-02

    Thioredoxins (Trxs) are small ubiquitous proteins that participate in a diverse variety of redox reactions via the reversible oxidation of two cysteine thiol groups in a structurally conserved active site, CGPC. Here, we describe the NMR solution structures of a Trx from Ehrlichia chaffeensis (Ec-Trx, ECH_0218), the etiological agent responsible for human monocytic ehrlichiosis, in both the oxidized and reduced states. The overall topology of the calculated structures is similar in both redox states and similar to other Trx structures, a five-strand, mixed -sheet (1:3:2:4:5) surrounded by four -helices. Unlike other Trxs studied by NMR in both redox states, the 1H-15N HSQC spectra of reduced Ec-Trx was missing eight amide cross peaks relative to the spectra of oxidized Ec-Trx. These missing amides correspond to residues C32-E39 in the active site containing helix (2) and S72-I75 in a loop near the active site and suggest a substantial change in the backbone dynamics associated with the formation of an intramolecular C32-C35 disulfide bond.

  16. A new strategy for backbone resonance assignment in large proteins using a MQ-HACACO experiment

    International Nuclear Information System (INIS)

    Pervushin, Konstantin; Eletsky, Alexander

    2003-01-01

    A new strategy of backbone resonance assignment is proposed based on a combination of the most sensitive TROSY-type triple resonance experiments such as TROSY-HNCA and TROSY-HNCO with a new 3D multiple-quantum HACACO experiment. The favourable relaxation properties of the multiple-quantum coherences and signal detection using the 13 C' antiphase coherences optimize the performance of the proposed experiment for application to larger proteins. In addition to the 1 H N , 15 N, 13 C α and 13 C' chemical shifts the 3D multiple-quantum HACACO experiment provides assignment for the 1 H α resonances in contrast to previously proposed experiments for large proteins. The strategy is demonstrated with the 44 kDa uniformly 15 N, 13 C-labeled and fractionally 35% deuterated trimeric B. subtilis Chorismate Mutase measured at 20 deg. C and 9 deg. C. Measurements at the lower temperature indicate that the new strategy can be applied to even larger proteins with molecular weights up to 80 kDa

  17. Backbone resonance assignments for G protein α(i3) subunit in the GDP-bound state.

    Science.gov (United States)

    Mase, Yoko; Yokogawa, Mariko; Osawa, Masanori; Shimada, Ichio

    2014-10-01

    Guanine-nucleotide binding proteins (G proteins) serve as molecular switches in signaling pathways, by coupling the activation of G protein-coupled receptors (GPCRs) at the cell surface to intracellular responses. In the resting state, G protein forms a heterotrimer, consisting of the G protein α subunit with GDP (Gα·GDP) and the G protein βγ subunit (Gβγ). Ligand binding to GPCRs promotes the GDP-GTP exchange on Gα, leading to the dissociation of the GTP-bound form of Gα (Gα·GTP) and Gβγ. Then, Gα·GTP and Gβγ bind to their downstream effector enzymes or ion channels and regulate their activities, leading to a variety of cellular responses. Finally, Gα hydrolyzes the bound GTP to GDP and returns to the resting state by re-associating with Gβγ. The G proteins are classified with four major families based on the amino acid sequences of Gα: i/o, s, q/11, and 12/13. Here, we established the backbone resonance assignments of human Gαi3, a member of the i/o family with a molecular weight of 41 K, in complex with GDP. The chemical shifts were compared with those of Gα(i3) in complex with a GTP-analogue, GTPγS, which we recently reported, indicating that the residues with significant chemical shift differences are mostly consistent with the regions with the structural differences between the GDP- and GTPγS-bound states, as indicated in the crystal structures. The assignments of Gα(i3)·GDP would be useful for the analyses of the dynamics of Gα(i3) and its interactions with various target molecules.

  18. Trappist: european project dedicated to an open backbone structure for NDT expertise

    International Nuclear Information System (INIS)

    Nouailhas, B.; Vailhen, O.

    1993-01-01

    Non Destructive Testing (NDT) on critical components such as the reactor vessel, primary coolant pipes and steam generators have already been, and are still the subject of many development concerning the improvement of measuring techniques, data processing and on site operation. The tools developed for these tests are generally closed, difficult to extend and of proprietary type. Productivity could be increased if an open backbone structure common to several types of test were available. Moreover, these components are generally submitted to a test involving a single method. In certain cases, the produced information is an insufficient basis for drawing up a satisfactory diagnosis: the test operator or expert often faces problems in extracting more information from signals that are generally noisy. It may prove necessary to complete the inspection with another NDT method based on different principles in order to obtain better performances. It is then by combining the information obtained by two complementary methods that it will be possible to draw up a more reliable diagnosis. These components have also a complex shape. In the case of ultrasonic testing, the accurate following of probe paths requires 3D representation of the geometry, as it is built, to position and display the data obtained from the inspection. To take these geometric constraints into account, it is imperative to use computer tools allowing the three-dimensional representation of the reconstructed information on the components' actual geometry. This specific difficulty, which has long been appreciated, is the subject of developments resulting to industrial products that are more or less satisfactory. The aim of the European Project TRAPPIST (Race Program) is to study an open backbone structure. A mock-up of an analysis station dedicated to NDT expertise will be built and evaluated with specific examples. (authors). 6 figs., 1 ref

  19. Combining automated peak tracking in SAR by NMR with structure-based backbone assignment from 15N-NOESY

    KAUST Repository

    Jang, Richard; Gao, Xin; Li, Ming

    2012-01-01

    Background: Chemical shift mapping is an important technique in NMR-based drug screening for identifying the atoms of a target protein that potentially bind to a drug molecule upon the molecule's introduction in increasing concentrations. The goal is to obtain a mapping of peaks with known residue assignment from the reference spectrum of the unbound protein to peaks with unknown assignment in the target spectrum of the bound protein. Although a series of perturbed spectra help to trace a path from reference peaks to target peaks, a one-to-one mapping generally is not possible, especially for large proteins, due to errors, such as noise peaks, missing peaks, missing but then reappearing, overlapped, and new peaks not associated with any peaks in the reference. Due to these difficulties, the mapping is typically done manually or semi-automatically, which is not efficient for high-throughput drug screening.Results: We present PeakWalker, a novel peak walking algorithm for fast-exchange systems that models the errors explicitly and performs many-to-one mapping. On the proteins: hBclXL, UbcH5B, and histone H1, it achieves an average accuracy of over 95% with less than 1.5 residues predicted per target peak. Given these mappings as input, we present PeakAssigner, a novel combined structure-based backbone resonance and NOE assignment algorithm that uses just 15N-NOESY, while avoiding TOCSY experiments and 13C-labeling, to resolve the ambiguities for a one-to-one mapping. On the three proteins, it achieves an average accuracy of 94% or better.Conclusions: Our mathematical programming approach for modeling chemical shift mapping as a graph problem, while modeling the errors directly, is potentially a time- and cost-effective first step for high-throughput drug screening based on limited NMR data and homologous 3D structures. 2012 Jang et al.; licensee BioMed Central Ltd.

  20. Combining automated peak tracking in SAR by NMR with structure-based backbone assignment from 15N-NOESY

    KAUST Repository

    Jang, Richard

    2012-03-21

    Background: Chemical shift mapping is an important technique in NMR-based drug screening for identifying the atoms of a target protein that potentially bind to a drug molecule upon the molecule\\'s introduction in increasing concentrations. The goal is to obtain a mapping of peaks with known residue assignment from the reference spectrum of the unbound protein to peaks with unknown assignment in the target spectrum of the bound protein. Although a series of perturbed spectra help to trace a path from reference peaks to target peaks, a one-to-one mapping generally is not possible, especially for large proteins, due to errors, such as noise peaks, missing peaks, missing but then reappearing, overlapped, and new peaks not associated with any peaks in the reference. Due to these difficulties, the mapping is typically done manually or semi-automatically, which is not efficient for high-throughput drug screening.Results: We present PeakWalker, a novel peak walking algorithm for fast-exchange systems that models the errors explicitly and performs many-to-one mapping. On the proteins: hBclXL, UbcH5B, and histone H1, it achieves an average accuracy of over 95% with less than 1.5 residues predicted per target peak. Given these mappings as input, we present PeakAssigner, a novel combined structure-based backbone resonance and NOE assignment algorithm that uses just 15N-NOESY, while avoiding TOCSY experiments and 13C-labeling, to resolve the ambiguities for a one-to-one mapping. On the three proteins, it achieves an average accuracy of 94% or better.Conclusions: Our mathematical programming approach for modeling chemical shift mapping as a graph problem, while modeling the errors directly, is potentially a time- and cost-effective first step for high-throughput drug screening based on limited NMR data and homologous 3D structures. 2012 Jang et al.; licensee BioMed Central Ltd.

  1. Beta-scission of alkoxyl radicals on peptides and proteins can give rise to backbone cleavage and loss of side-chains

    International Nuclear Information System (INIS)

    Headlam, H.A.; Davies, M.J.; Mortimer, A.; Easton, C.J.

    2000-01-01

    Full text: Exposure of proteins to radicals in the presence of O 2 brings about multiple changes including side-chain oxidation, backbone fragmentation, cross-linking, unfolding, changes in hydrophobicity and conformation, altered susceptibility to proteolytic enzymes and formation of new reactive groups (e.g. hydroperoxides and 3,4-dihydroxyphenylalanine). All of these processes can result in loss of structural or enzymatic activity. The mechanisms that give rise to backbone cleavage are only partly understood. Whilst it is known that direct hydrogen atom abstraction at a-carbon sites gives backbone cleavages it has also been proposed that initial attack at side-chain sites might also give rise to backbone cleavage. In this study we have examined whether initial attack at the β- (C-3) position can give rise to α-carbon radicals (and hence backbone cleavage) via the formation, and subsequent β- scission, of C-3 alkoxyl radicals. This process has been observed previously with protected amino acids in organic solvents (J. Chem. Soc. Perkin Trans. 2, 1997, 503-507) but the occurrence of such reactions with proteins in aqueous solution has not been explored. Alkoxyl radicals were generated at the C-3 position of a variety of protected amino acids and small peptides by two methods: metal-ion catalysed decomposition of hydroperoxides formed as a result of γ-radiolysis in the presence of O 2 , and UV photolysis of nitrate esters. In most cases radicals have been detected by EPR spectroscopy using nitroso and nitrone spin traps, which can be assigned by comparison with literature data to α-carbon radicals; in some case assignments were confirmed by the generation of the putative species by other routes. With Ala peptide hydroperoxides and nitrate esters, and MNP as the spin trap, the major radical detected in each case has been assigned to the adduct of an α-carbon radical with partial structure - NH- . CH-C(O) - consistent with the rapid occurrence of the above

  2. Prediction of mutational tolerance in HIV-1 protease and reverse transcriptase using flexible backbone protein design.

    Directory of Open Access Journals (Sweden)

    Elisabeth Humphris-Narayanan

    Full Text Available Predicting which mutations proteins tolerate while maintaining their structure and function has important applications for modeling fundamental properties of proteins and their evolution; it also drives progress in protein design. Here we develop a computational model to predict the tolerated sequence space of HIV-1 protease reachable by single mutations. We assess the model by comparison to the observed variability in more than 50,000 HIV-1 protease sequences, one of the most comprehensive datasets on tolerated sequence space. We then extend the model to a second protein, reverse transcriptase. The model integrates multiple structural and functional constraints acting on a protein and uses ensembles of protein conformations. We find the model correctly captures a considerable fraction of protease and reverse-transcriptase mutational tolerance and shows comparable accuracy using either experimentally determined or computationally generated structural ensembles. Predictions of tolerated sequence space afforded by the model provide insights into stability-function tradeoffs in the emergence of resistance mutations and into strengths and limitations of the computational model.

  3. Coupling between myosin head conformation and the thick filament backbone structure.

    Science.gov (United States)

    Hu, Zhongjun; Taylor, Dianne W; Edwards, Robert J; Taylor, Kenneth A

    2017-12-01

    The recent high-resolution structure of the thick filament from Lethocerus asynchronous flight muscle shows aspects of thick filament structure never before revealed that may shed some light on how striated muscles function. The phenomenon of stretch activation underlies the function of asynchronous flight muscle. It is most highly developed in flight muscle, but is also observed in other striated muscles such as cardiac muscle. Although stretch activation is likely to be complex, involving more than a single structural aspect of striated muscle, the thick filament itself, would be a prime site for regulatory function because it must bear all of the tension produced by both its associated myosin motors and any externally applied force. Here we show the first structural evidence that the arrangement of myosin heads within the interacting heads motif is coupled to the structure of the thick filament backbone. We find that a change in helical angle of 0.16° disorders the blocked head preferentially within the Lethocerus interacting heads motif. This observation suggests a mechanism for how tension affects the dynamics of the myosin heads leading to a detailed hypothesis for stretch activation and shortening deactivation, in which the blocked head preferentially binds the thin filament followed by the free head when force production occurs. Copyright © 2017 Elsevier Inc. All rights reserved.

  4. Combining ambiguous chemical shift mapping with structure-based backbone and NOE assignment from 15N-NOESY

    KAUST Repository

    Jang, Richard

    2011-01-01

    Chemical shift mapping is an important technique in NMRbased drug screening for identifying the atoms of a target protein that potentially bind to a drug molecule upon the molecule\\'s introduction in increasing concentrations. The goal is to obtain a mapping of peaks with known residue assignment from the reference spectrum of the unbound protein to peaks with unknown assignment in the target spectrum of the bound protein. Although a series of perturbed spectra help to trace a path from reference peaks to target peaks, a one-to-one mapping generally is not possible, especially for large proteins, due to errors, such as noise peaks, missing peaks, missing but then reappearing, overlapped, and new peaks not associated with any peaks in the reference. Due to these difficulties, the mapping is typically done manually or semi-automatically. However, automated methods are necessary for high-throughput drug screening. We present PeakWalker, a novel peak walking algorithm for fast-exchange systems that models the errors explicitly and performs many-to-one mapping. On the proteins: hBclXL, UbcH5B, and histone H1, it achieves an average accuracy of over 95% with less than 1.5 residues predicted per target peak. Given these mappings as input, we present PeakAssigner, a novel combined structure-based backbone resonance and NOE assignment algorithm that uses just 15N-NOESY, while avoiding TOCSY experiments and 13C- labeling, to resolve the ambiguities for a one-toone mapping. On the three proteins, it achieves an average accuracy of 94% or better. Copyright © 2011 ACM.

  5. Five and four dimensional experiments for robust backbone resonance assignment of large intrinsically disordered proteins: application to Tau3x protein

    International Nuclear Information System (INIS)

    Żerko, Szymon; Byrski, Piotr; Włodarczyk-Pruszyński, Paweł; Górka, Michał; Ledolter, Karin; Masliah, Eliezer; Konrat, Robert; Koźmiński, Wiktor

    2016-01-01

    New experiments dedicated for large IDPs backbone resonance assignment are presented. The most distinctive feature of all described techniques is the employment of MOCCA-XY16 mixing sequences to obtain effective magnetization transfers between carbonyl carbon backbone nuclei. The proposed 4 and 5 dimensional experiments provide a high dispersion of obtained signals making them suitable for use in the case of large IDPs (application to 354 a. a. residues of Tau protein 3x isoform is presented) as well as provide both forward and backward connectivities. What is more, connecting short chains interrupted with proline residues is also possible. All the experiments employ non-uniform sampling.

  6. Five and four dimensional experiments for robust backbone resonance assignment of large intrinsically disordered proteins: application to Tau3x protein

    Energy Technology Data Exchange (ETDEWEB)

    Żerko, Szymon; Byrski, Piotr; Włodarczyk-Pruszyński, Paweł; Górka, Michał [University of Warsaw, Faculty of Chemistry, Biological and Chemical Research Centre (Poland); Ledolter, Karin [University of Vienna, Department of Computational and Structural Biology, Max F. Perutz Laboratories (Austria); Masliah, Eliezer [University of California, San Diego, Departments of Neuroscience and Pathology (United States); Konrat, Robert [University of Vienna, Department of Computational and Structural Biology, Max F. Perutz Laboratories (Austria); Koźmiński, Wiktor, E-mail: kozmin@chem.uw.edu.pl [University of Warsaw, Faculty of Chemistry, Biological and Chemical Research Centre (Poland)

    2016-08-15

    New experiments dedicated for large IDPs backbone resonance assignment are presented. The most distinctive feature of all described techniques is the employment of MOCCA-XY16 mixing sequences to obtain effective magnetization transfers between carbonyl carbon backbone nuclei. The proposed 4 and 5 dimensional experiments provide a high dispersion of obtained signals making them suitable for use in the case of large IDPs (application to 354 a. a. residues of Tau protein 3x isoform is presented) as well as provide both forward and backward connectivities. What is more, connecting short chains interrupted with proline residues is also possible. All the experiments employ non-uniform sampling.

  7. Structural conservation, variability, and immunogenicity of the T6 backbone pilin of serotype M6 Streptococcus pyogenes.

    Science.gov (United States)

    Young, Paul G; Moreland, Nicole J; Loh, Jacelyn M; Bell, Anita; Atatoa Carr, Polly; Proft, Thomas; Baker, Edward N

    2014-07-01

    Group A streptococcus (GAS; Streptococcus pyogenes) is a Gram-positive human pathogen that causes a broad range of diseases ranging from acute pharyngitis to the poststreptococcal sequelae of acute rheumatic fever. GAS pili are highly diverse, long protein polymers that extend from the cell surface. They have multiple roles in infection and are promising candidates for vaccine development. This study describes the structure of the T6 backbone pilin (BP; Lancefield T-antigen) from the important M6 serotype. The structure reveals a modular arrangement of three tandem immunoglobulin-like domains, two with internal isopeptide bonds. The T6 pilin lysine, essential for polymerization, is located in a novel VAKS motif that is structurally homologous to the canonical YPKN pilin lysine in other three- and four-domain Gram-positive pilins. The T6 structure also highlights a conserved pilin core whose surface is decorated with highly variable loops and extensions. Comparison to other Gram-positive BPs shows that many of the largest variable extensions are found in conserved locations. Studies with sera from patients diagnosed with GAS-associated acute rheumatic fever showed that each of the three T6 domains, and the largest of the variable extensions (V8), are targeted by IgG during infection in vivo. Although the GAS BP show large variations in size and sequence, the modular nature of the pilus proteins revealed by the T6 structure may aid the future design of a pilus-based vaccine. Copyright © 2014, American Society for Microbiology. All Rights Reserved.

  8. 5D {sup 13}C-detected experiments for backbone assignment of unstructured proteins with a very low signal dispersion

    Energy Technology Data Exchange (ETDEWEB)

    Novacek, Jiri [Masaryk University, Faculty of Science, NCBR, and CEITEC (Czech Republic); Zawadzka-Kazimierczuk, Anna [University of Warsaw, Faculty of Chemistry (Poland); Papouskova, Veronika; Zidek, Lukas, E-mail: lzidek@chemi.muni.cz [Masaryk University, Faculty of Science, NCBR, and CEITEC (Czech Republic); Sanderova, Hana; Krasny, Libor [Institute of Microbiology, Academy of Sciences of the Czech Republic, Laboratory of Molecular Genetics of Bacteria and Department of Bacteriology (Czech Republic); Kozminski, Wiktor [University of Warsaw, Faculty of Chemistry (Poland); Sklenar, Vladimir [Masaryk University, Faculty of Science, NCBR, and CEITEC (Czech Republic)

    2011-05-15

    Two novel 5D NMR experiments (CACONCACO, NCOCANCO) for backbone assignment of disordered proteins are presented. The pulse sequences exploit relaxation properties of the unstructured proteins and combine the advantages of {sup 13}C-direct detection, non-uniform sampling, and longitudinal relaxation optimization to maximize the achievable resolution and minimize the experimental time. The pulse sequences were successfully tested on the sample of partially disordered delta subunit from RNA polymerase from Bacillus subtilis. The unstructured part of this 20 kDa protein consists of 81 amino acids with frequent sequential repeats. A collection of 0.0003% of the data needed for a conventional experiment with linear sampling was sufficient to perform an unambiguous assignment of the disordered part of the protein from a single 5D spectrum.

  9. Assessing protein conformational sampling methods based on bivariate lag-distributions of backbone angles

    KAUST Repository

    Maadooliat, Mehdi; Gao, Xin; Huang, Jianhua Z.

    2012-01-01

    Despite considerable progress in the past decades, protein structure prediction remains one of the major unsolved problems in computational biology. Angular-sampling-based methods have been extensively studied recently due to their ability to capture the continuous conformational space of protein structures. The literature has focused on using a variety of parametric models of the sequential dependencies between angle pairs along the protein chains. In this article, we present a thorough review of angular-sampling-based methods by assessing three main questions: What is the best distribution type to model the protein angles? What is a reasonable number of components in a mixture model that should be considered to accurately parameterize the joint distribution of the angles? and What is the order of the local sequence-structure dependency that should be considered by a prediction method? We assess the model fits for different methods using bivariate lag-distributions of the dihedral/planar angles. Moreover, the main information across the lags can be extracted using a technique called Lag singular value decomposition (LagSVD), which considers the joint distribution of the dihedral/planar angles over different lags using a nonparametric approach and monitors the behavior of the lag-distribution of the angles using singular value decomposition. As a result, we developed graphical tools and numerical measurements to compare and evaluate the performance of different model fits. Furthermore, we developed a web-tool (http://www.stat.tamu. edu/~madoliat/LagSVD) that can be used to produce informative animations. © The Author 2012. Published by Oxford University Press.

  10. Assessing protein conformational sampling methods based on bivariate lag-distributions of backbone angles

    KAUST Repository

    Maadooliat, Mehdi

    2012-08-27

    Despite considerable progress in the past decades, protein structure prediction remains one of the major unsolved problems in computational biology. Angular-sampling-based methods have been extensively studied recently due to their ability to capture the continuous conformational space of protein structures. The literature has focused on using a variety of parametric models of the sequential dependencies between angle pairs along the protein chains. In this article, we present a thorough review of angular-sampling-based methods by assessing three main questions: What is the best distribution type to model the protein angles? What is a reasonable number of components in a mixture model that should be considered to accurately parameterize the joint distribution of the angles? and What is the order of the local sequence-structure dependency that should be considered by a prediction method? We assess the model fits for different methods using bivariate lag-distributions of the dihedral/planar angles. Moreover, the main information across the lags can be extracted using a technique called Lag singular value decomposition (LagSVD), which considers the joint distribution of the dihedral/planar angles over different lags using a nonparametric approach and monitors the behavior of the lag-distribution of the angles using singular value decomposition. As a result, we developed graphical tools and numerical measurements to compare and evaluate the performance of different model fits. Furthermore, we developed a web-tool (http://www.stat.tamu. edu/~madoliat/LagSVD) that can be used to produce informative animations. © The Author 2012. Published by Oxford University Press.

  11. Empirical correlation between protein backbone {sup 15}N and {sup 13}C secondary chemical shifts and its application to nitrogen chemical shift re-referencing

    Energy Technology Data Exchange (ETDEWEB)

    Wang Liya [Cold Spring Harbor Laboratory (United States); Markley, John L. [University of Wisconsin, Biochemistry Department (United States)], E-mail: markley@nmrfam.wisc.edu

    2009-06-15

    The linear analysis of chemical shifts (LACS) has provided a robust method for identifying and correcting {sup 13}C chemical shift referencing problems in data from protein NMR spectroscopy. Unlike other approaches, LACS does not require prior knowledge of the three-dimensional structure or inference of the secondary structure of the protein. It also does not require extensive assignment of the NMR data. We report here a way of extending the LACS approach to {sup 15}N NMR data from proteins, so as to enable the detection and correction of inconsistencies in chemical shift referencing for this nucleus. The approach is based on our finding that the secondary {sup 15}N chemical shift of the backbone nitrogen atom of residue i is strongly correlated with the secondary chemical shift difference (experimental minus random coil) between the alpha and beta carbons of residue i - 1. Thus once alpha and beta {sup 13}C chemical shifts are available (their difference is referencing error-free), the {sup 15}N referencing can be validated, and an appropriate offset correction can be derived. This approach can be implemented prior to a structure determination and can be used to analyze potential referencing problems in database data not associated with three-dimensional structure. Application of the LACS algorithm to the current BMRB protein chemical shift database, revealed that nearly 35% of the BMRB entries have {delta}{sup 15}N values mis-referenced by over 0.7 ppm and over 25% of them have {delta}{sup 1}H{sup N} values mis-referenced by over 0.12 ppm. One implication of the findings reported here is that a backbone {sup 15}N chemical shift provides a better indicator of the conformation of the preceding residue than of the residue itself.

  12. Solution structure and backbone dynamics of recombinant Cucurbita maxima trypsin inhibitor-V determined by NMR spectroscopy.

    Science.gov (United States)

    Liu, J; Prakash, O; Cai, M; Gong, Y; Huang, Y; Wen, L; Wen, J J; Huang, J K; Krishnamoorthi, R

    1996-02-06

    The solution structure of recombinant Cucurbita maxima trypsin inhibitor-V (rCMTI-V), whose N-terminal is unacetylated and carries an extra glycine residue, was determined by means of two-dimensional (2D) homo and 3D hetero NMR experiments in combination with a distance geometry and simulated annealing algorithm. A total of 927 interproton distances and 123 torsion angle constraints were utilized to generate 18 structures. The root mean squared deviation (RMSD) of the mean structure is 0.53 A for main-chain atoms and 0.95 A for all the non-hydrogen atoms of residues 3-40 and 49-67. The average structure of rCMTI-V is found to be almost the same as that of the native protein [Cai, M., Gong, Y., Kao, J.-L., & Krishnamoorthi, R. (1995) Biochemistry 34, 5201-5211]. The backbone dynamics of uniformly 15N-labeled rCMTI-V were characterized by 2D 1H-15N NMR methods. 15N spin-lattice and spin-spin relaxation rate constants (R1 and R2, respectively) and [1H]-15N steady-state heteronuclear Overhauser effect enhancements were measured for the peptide NH units and, using the model-free formalism [Lipari, G., & Szabo, A. (1982) J. Am. Chem. Soc. 104, 4546-4559, 4559-4570], the following parameters were determined: overall tumbling correlation time for the protein molecule (tau m), generalized order parameters for the individual N-H vectors (S2), effective correlation times for their internal motions (tau e), and terms to account for motions on a slower time scale (second) due to chemical exchange and/or conformational averaging (R(ex)). Most of the backbone NH groups of rCMTI-V are found to be highly constrained ((S2) = 0.83) with the exception of those in the binding loop (residues 41-48, (S2) = 0.71) and the N-terminal region ((S2) = 0.73). Main-chain atoms in these regions show large RMSD values in the average NMR structure. Residues involved in turns also appear to have more mobility ((S2) = 0.80). Dynamical properties of rCMTI-V were compared with those of two other

  13. Nuclear Magnetic Resonance-Based Structural Characterization and Backbone Dynamics of Recombinant Bee Venom Melittin.

    Science.gov (United States)

    Ramirez, Lisa; Shekhtman, Alexander; Pande, Jayanti

    2018-04-30

    In recent years, there has been a resurgence of interest in melittin and its variants as their therapeutic potential has become increasingly evident. Melittin is a 26-residue peptide and a toxic component of honey bee venom. The versatility of melittin in interacting with various biological substrates, such as membranes, glycosaminoglycans, and a variety of proteins, has inspired a slew of studies that aim to improve our understanding of the structural basis of such interactions. However, these studies have largely focused on melittin solutions at high concentrations (>1 mM), even though melittin is generally effective at lower (micromolar) concentrations. Here we present high-resolution nuclear magnetic resonance studies in the lower-concentration regime using a novel method to produce isotope-labeled ( 15 N and 13 C) recombinant melittin. We provide residue-specific structural characterization of melittin in dilute aqueous solution and in 2,2,2-trifluoroethanol/water mixtures, which mimic melittin structure-function and interactions in aqueous and membrane-like environments, respectively. We find that the cis-trans isomerization of Pro14 is key to changes in the secondary structure of melittin. Thus, this study provides residue-specific structural information about melittin in the free state and in a model of the substrate-bound state. These results, taken together with published work from other laboratories, reveal the peptide's structural versatility that resembles that of intrinsically disordered proteins and peptides.

  14. MERA: a webserver for evaluating backbone torsion angle distributions in dynamic and disordered proteins from NMR data

    Energy Technology Data Exchange (ETDEWEB)

    Mantsyzov, Alexey B. [M.V. Lomonosov Moscow State University, Faculty of Fundamental Medicine (Russian Federation); Shen, Yang; Lee, Jung Ho [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States); Hummer, Gerhard [Max Planck Institute of Biophysics (Germany); Bax, Ad, E-mail: bax@nih.gov [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    2015-09-15

    MERA (Maximum Entropy Ramachandran map Analysis from NMR data) is a new webserver that generates residue-by-residue Ramachandran map distributions for disordered proteins or disordered regions in proteins on the basis of experimental NMR parameters. As input data, the program currently utilizes up to 12 different parameters. These include three different types of short-range NOEs, three types of backbone chemical shifts ({sup 15}N, {sup 13}C{sup α}, and {sup 13}C′), six types of J couplings ({sup 3}J{sub HNHα}, {sup 3}J{sub C′C′}, {sup 3}J{sub C′Hα}, {sup 1}J{sub HαCα}, {sup 2}J{sub CαN} and {sup 1}J{sub CαN}), as well as the {sup 15}N-relaxation derived J(0) spectral density. The Ramachandran map distributions are reported in terms of populations of their 15° × 15° voxels, and an adjustable maximum entropy weight factor is available to ensure that the obtained distributions will not deviate more from a newly derived coil library distribution than required to account for the experimental data. MERA output includes the agreement between each input parameter and its distribution-derived value. As an application, we demonstrate performance of the program for several residues in the intrinsically disordered protein α-synuclein, as well as for several static and dynamic residues in the folded protein GB3.

  15. Predicting beta-turns and their types using predicted backbone dihedral angles and secondary structures.

    Science.gov (United States)

    Kountouris, Petros; Hirst, Jonathan D

    2010-07-31

    Beta-turns are secondary structure elements usually classified as coil. Their prediction is important, because of their role in protein folding and their frequent occurrence in protein chains. We have developed a novel method that predicts beta-turns and their types using information from multiple sequence alignments, predicted secondary structures and, for the first time, predicted dihedral angles. Our method uses support vector machines, a supervised classification technique, and is trained and tested on three established datasets of 426, 547 and 823 protein chains. We achieve a Matthews correlation coefficient of up to 0.49, when predicting the location of beta-turns, the highest reported value to date. Moreover, the additional dihedral information improves the prediction of beta-turn types I, II, IV, VIII and "non-specific", achieving correlation coefficients up to 0.39, 0.33, 0.27, 0.14 and 0.38, respectively. Our results are more accurate than other methods. We have created an accurate predictor of beta-turns and their types. Our method, called DEBT, is available online at http://comp.chem.nottingham.ac.uk/debt/.

  16. Density functional calculations of backbone 15N shielding tensors in beta-sheet and turn residues of protein G

    International Nuclear Information System (INIS)

    Cai Ling; Kosov, Daniel S.; Fushman, David

    2011-01-01

    We performed density functional calculations of backbone 15 N shielding tensors in the regions of beta-sheet and turns of protein G. The calculations were carried out for all twenty-four beta-sheet residues and eight beta-turn residues in the protein GB3 and the results were compared with the available experimental data from solid-state and solution NMR measurements. Together with the alpha-helix data, our calculations cover 39 out of the 55 residues (or 71%) in GB3. The applicability of several computational models developed previously (Cai et al. in J Biomol NMR 45:245–253, 2009) to compute 15 N shielding tensors of alpha-helical residues is assessed. We show that the proposed quantum chemical computational model is capable of predicting isotropic 15 N chemical shifts for an entire protein that are in good correlation with experimental data. However, the individual components of the predicted 15 N shielding tensor agree with experiment less well: the computed values show much larger spread than the experimental data, and there is a profound difference in the behavior of the tensor components for alpha-helix/turns and beta-sheet residues. We discuss possible reasons for this.

  17. (nBuCp)2ZrCl2-catalyzed Ethylene-4M1P Copolymerization: Copolymer Backbone Structure, Melt Behavior, and Crystallization

    KAUST Repository

    Atiqullah, Muhammad; Adamu, Sagir; Malaibari, Zuhair O.; Al-Harthi, Mamdouh A.; Emwas, Abdul-Hamid M.

    2016-01-01

    The judicious design of methylaluminoxane (MAO) anions expands the scope for developing industrial metallocene catalysts. Therefore, the effects of MAO anion design on the backbone structure, melt behavior, and crystallization of ethylene−4-methyl-1

  18. High dimensional and high resolution pulse sequences for backbone resonance assignment of intrinsically disordered proteins

    Energy Technology Data Exchange (ETDEWEB)

    Zawadzka-Kazimierczuk, Anna; Kozminski, Wiktor, E-mail: kozmin@chem.uw.edu.pl [University of Warsaw, Faculty of Chemistry (Poland); Sanderova, Hana; Krasny, Libor [Institute of Microbiology, Academy of Sciences of the Czech Republic, Laboratory of Molecular Genetics of Bacteria, Department of Bacteriology (Czech Republic)

    2012-04-15

    Four novel 5D (HACA(N)CONH, HNCOCACB, (HACA)CON(CA)CONH, (H)NCO(NCA)CONH), and one 6D ((H)NCO(N)CACONH) NMR pulse sequences are proposed. The new experiments employ non-uniform sampling that enables achieving high resolution in indirectly detected dimensions. The experiments facilitate resonance assignment of intrinsically disordered proteins. The novel pulse sequences were successfully tested using {delta} subunit (20 kDa) of Bacillus subtilis RNA polymerase that has an 81-amino acid disordered part containing various repetitive sequences.

  19. Computing a new family of shape descriptors for protein structures

    DEFF Research Database (Denmark)

    Røgen, Peter; Sinclair, Robert

    2003-01-01

    The large-scale 3D structure of a protein can be represented by the polygonal curve through the carbon a atoms of the protein backbone. We introduce an algorithm for computing the average number of times that a given configuration of crossings on such polygonal curves is seen, the average being...

  20. Backbone and sidechain methyl Ile (δ1), Leu and Val chemical shift assignments of RDE-4 (1-243), an RNA interference initiation protein in C. elegans.

    Science.gov (United States)

    Chiliveri, Sai Chaitanya; Kumar, Sonu; Marelli, Udaya Kiran; Deshmukh, Mandar V

    2012-10-01

    The RNAi pathway of several organisms requires presence of double stranded RNA binding proteins for functioning of Dicer in gene regulation. In C. elegans, a double stranded RNA binding protein, RDE-4 (385 aa, 44 kDa) recognizes long exogenous dsRNA and initiates the RNAi pathway. We have achieved complete backbone and stereospecific methyl sidechain Ile (δ1), Leu and Val chemical shifts of first 243 amino acids of RDE-4, namely RDE-4ΔC.

  1. Accurate protein structure modeling using sparse NMR data and homologous structure information.

    Science.gov (United States)

    Thompson, James M; Sgourakis, Nikolaos G; Liu, Gaohua; Rossi, Paolo; Tang, Yuefeng; Mills, Jeffrey L; Szyperski, Thomas; Montelione, Gaetano T; Baker, David

    2012-06-19

    While information from homologous structures plays a central role in X-ray structure determination by molecular replacement, such information is rarely used in NMR structure determination because it can be incorrect, both locally and globally, when evolutionary relationships are inferred incorrectly or there has been considerable evolutionary structural divergence. Here we describe a method that allows robust modeling of protein structures of up to 225 residues by combining (1)H(N), (13)C, and (15)N backbone and (13)Cβ chemical shift data, distance restraints derived from homologous structures, and a physically realistic all-atom energy function. Accurate models are distinguished from inaccurate models generated using incorrect sequence alignments by requiring that (i) the all-atom energies of models generated using the restraints are lower than models generated in unrestrained calculations and (ii) the low-energy structures converge to within 2.0 Å backbone rmsd over 75% of the protein. Benchmark calculations on known structures and blind targets show that the method can accurately model protein structures, even with very remote homology information, to a backbone rmsd of 1.2-1.9 Å relative to the conventional determined NMR ensembles and of 0.9-1.6 Å relative to X-ray structures for well-defined regions of the protein structures. This approach facilitates the accurate modeling of protein structures using backbone chemical shift data without need for side-chain resonance assignments and extensive analysis of NOESY cross-peak assignments.

  2. Microsecond molecular dynamics simulation shows effect of slow loop dynamics on backbone amide order parameters of proteins

    DEFF Research Database (Denmark)

    Maragakis, Paul; Lindorff-Larsen, Kresten; Eastwood, Michael P

    2008-01-01

    . Molecular dynamics (MD) simulation provides a complementary approach to the study of protein dynamics on similar time scales. Comparisons between NMR spectroscopy and MD simulations can be used to interpret experimental results and to improve the quality of simulation-related force fields and integration......A molecular-level understanding of the function of a protein requires knowledge of both its structural and dynamic properties. NMR spectroscopy allows the measurement of generalized order parameters that provide an atomistic description of picosecond and nanosecond fluctuations in protein structure...... methods. However, apparent systematic discrepancies between order parameters extracted from simulations and experiments are common, particularly for elements of noncanonical secondary structure. In this paper, results from a 1.2 micros explicit solvent MD simulation of the protein ubiquitin are compared...

  3. Backbone cup – a structure design competition based on topology optimization and 3D printing

    Directory of Open Access Journals (Sweden)

    Zhu Ji-Hong

    2016-01-01

    Full Text Available This paper addresses a structure design competition based on topology optimization and 3D Printing, and proposes an experimental approach to efficiently and quickly measure the mechanical performance of the structures designed using topology optimization. Since the topology optimized structure designs are prone to be geometrically complex, it is extremely inconvenient to fabricate these designs with traditional machining. In this study, we not only fabricated the topology optimized structure designs using one kind of 3D Printing technology known as stereolithography (SLA, but also tested the mechanical performance of the produced prototype parts. The finite element method is used to analyze the structure responses, and the consistent results of the numerical simulations and structure experiments prove the validity of this new structure testing approach. This new approach will not only provide a rapid access to topology optimized structure designs verifying, but also cut the turnaround time of structure design significantly.

  4. Analysis of the HIV-2 protease's adaptation to various ligands: characterization of backbone asymmetry using a structural alphabet.

    Science.gov (United States)

    Triki, Dhoha; Cano Contreras, Mario Enrique; Flatters, Delphine; Visseaux, Benoit; Descamps, Diane; Camproux, Anne-Claude; Regad, Leslie

    2018-01-15

    The HIV-2 protease (PR2) is a homodimer of 99 residues with asymmetric assembly and binding various ligands. We propose an exhaustive study of the local structural asymmetry between the two monomers of all available PR2 structures complexed with various inhibitors using a structural alphabet approach. On average, PR2 exhibits asymmetry in 31% of its positions-i.e., exhibiting different backbone local conformations in the two monomers. This asymmetry was observed all along its structure, particularly in the elbow and flap regions. We first differentiated structural asymmetry conserved in most PR2 structures from the one specific to some PR2. Then, we explored the origin of the detected asymmetry in PR2. We localized asymmetry that could be induced by PR2's flexibility, allowing transition from the semi-open to closed conformations and the asymmetry potentially induced by ligand binding. This latter could be important for the PR2's adaptation to diverse ligands. Our results highlighted some differences between asymmetry of PR2 bound to darunavir and amprenavir that could explain their differences of affinity. This knowledge is critical for a better description of PR2's recognition and adaptation to various ligands and for a better understanding of the resistance of PR2 to most PR2 inhibitors, a major antiretroviral class.

  5. Structural basis for target protein recognition by the protein disulfide reductase thioredoxin

    DEFF Research Database (Denmark)

    Maeda, Kenji; Hägglund, Per; Finnie, Christine

    2006-01-01

    Thioredoxin is ubiquitous and regulates various target proteins through disulfide bond reduction. We report the structure of thioredoxin (HvTrxh2 from barley) in a reaction intermediate complex with a protein substrate, barley alpha-amylase/subtilisin inhibitor (BASI). The crystal structure...... of this mixed disulfide shows a conserved hydrophobic motif in thioredoxin interacting with a sequence of residues from BASI through van der Waals contacts and backbone-backbone hydrogen bonds. The observed structural complementarity suggests that the recognition of features around protein disulfides plays...... a major role in the specificity and protein disulfide reductase activity of thioredoxin. This novel insight into the function of thioredoxin constitutes a basis for comprehensive understanding of its biological role. Moreover, comparison with structurally related proteins shows that thioredoxin shares...

  6. Building alternate protein structures using the elastic network model.

    Science.gov (United States)

    Yang, Qingyi; Sharp, Kim A

    2009-02-15

    We describe a method for efficiently generating ensembles of alternate, all-atom protein structures that (a) differ significantly from the starting structure, (b) have good stereochemistry (bonded geometry), and (c) have good steric properties (absence of atomic overlap). The method uses reconstruction from a series of backbone framework structures that are obtained from a modified elastic network model (ENM) by perturbation along low-frequency normal modes. To ensure good quality backbone frameworks, the single force parameter ENM is modified by introducing two more force parameters to characterize the interaction between the consecutive carbon alphas and those within the same secondary structure domain. The relative stiffness of the three parameters is parameterized to reproduce B-factors, while maintaining good bonded geometry. After parameterization, violations of experimental Calpha-Calpha distances and Calpha-Calpha-Calpha pseudo angles along the backbone are reduced to less than 1%. Simultaneously, the average B-factor correlation coefficient improves to R = 0.77. Two applications illustrate the potential of the approach. (1) 102,051 protein backbones spanning a conformational space of 15 A root mean square deviation were generated from 148 nonredundant proteins in the PDB database, and all-atom models with minimal bonded and nonbonded violations were produced from this ensemble of backbone structures using the SCWRL side chain building program. (2) Improved backbone templates for homology modeling. Fifteen query sequences were each modeled on two targets. For each of the 30 target frameworks, dozens of improved templates could be produced In all cases, improved full atom homology models resulted, of which 50% could be identified blind using the D-Fire statistical potential. (c) 2008 Wiley-Liss, Inc.

  7. Identifying secondary structures in proteins using NMR chemical shift 3D correlation maps

    Science.gov (United States)

    Kumari, Amrita; Dorai, Kavita

    2013-06-01

    NMR chemical shifts are accurate indicators of molecular environment and have been extensively used as aids in protein structure determination. This work focuses on creating empirical 3D correlation maps of backbone chemical shift nuclei for use as identifiers of secondary structure elements in proteins. A correlated database of backbone nuclei chemical shifts was constructed from experimental structural data gathered from entries in the Protein Data Bank (PDB) as well as isotropic chemical shift values from the RefDB database. Rigorous statistical analysis of the maps led to the conclusion that specific correlations between triplets of backbone chemical shifts are best able to differentiate between different secondary structures such as α-helices, β-strands and turns. The method is compared with similar techniques that use NMR chemical shift information as aids in biomolecular structure determination and performs well in tests done on experimental data determined for different types of proteins, including large multi-domain proteins and membrane proteins.

  8. Protein structure database search and evolutionary classification.

    Science.gov (United States)

    Yang, Jinn-Moon; Tung, Chi-Hua

    2006-01-01

    As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at http://3d-blast.life.nctu.edu.tw].

  9. Structural deformation upon protein-protein interaction: a structural alphabet approach.

    Science.gov (United States)

    Martin, Juliette; Regad, Leslie; Lecornet, Hélène; Camproux, Anne-Claude

    2008-02-28

    In a number of protein-protein complexes, the 3D structures of bound and unbound partners significantly differ, supporting the induced fit hypothesis for protein-protein binding. In this study, we explore the induced fit modifications on a set of 124 proteins available in both bound and unbound forms, in terms of local structure. The local structure is described thanks to a structural alphabet of 27 structural letters that allows a detailed description of the backbone. Using a control set to distinguish induced fit from experimental error and natural protein flexibility, we show that the fraction of structural letters modified upon binding is significantly greater than in the control set (36% versus 28%). This proportion is even greater in the interface regions (41%). Interface regions preferentially involve coils. Our analysis further reveals that some structural letters in coil are not favored in the interface. We show that certain structural letters in coil are particularly subject to modifications at the interface, and that the severity of structural change also varies. These information are used to derive a structural letter substitution matrix that summarizes the local structural changes observed in our data set. We also illustrate the usefulness of our approach to identify common binding motifs in unrelated proteins. Our study provides qualitative information about induced fit. These results could be of help for flexible docking.

  10. Structural deformation upon protein-protein interaction: A structural alphabet approach

    Directory of Open Access Journals (Sweden)

    Lecornet Hélène

    2008-02-01

    Full Text Available Abstract Background In a number of protein-protein complexes, the 3D structures of bound and unbound partners significantly differ, supporting the induced fit hypothesis for protein-protein binding. Results In this study, we explore the induced fit modifications on a set of 124 proteins available in both bound and unbound forms, in terms of local structure. The local structure is described thanks to a structural alphabet of 27 structural letters that allows a detailed description of the backbone. Using a control set to distinguish induced fit from experimental error and natural protein flexibility, we show that the fraction of structural letters modified upon binding is significantly greater than in the control set (36% versus 28%. This proportion is even greater in the interface regions (41%. Interface regions preferentially involve coils. Our analysis further reveals that some structural letters in coil are not favored in the interface. We show that certain structural letters in coil are particularly subject to modifications at the interface, and that the severity of structural change also varies. These information are used to derive a structural letter substitution matrix that summarizes the local structural changes observed in our data set. We also illustrate the usefulness of our approach to identify common binding motifs in unrelated proteins. Conclusion Our study provides qualitative information about induced fit. These results could be of help for flexible docking.

  11. Adding diverse noncanonical backbones to rosetta: enabling peptidomimetic design.

    Directory of Open Access Journals (Sweden)

    Kevin Drew

    Full Text Available Peptidomimetics are classes of molecules that mimic structural and functional attributes of polypeptides. Peptidomimetic oligomers can frequently be synthesized using efficient solid phase synthesis procedures similar to peptide synthesis. Conformationally ordered peptidomimetic oligomers are finding broad applications for molecular recognition and for inhibiting protein-protein interactions. One critical limitation is the limited set of design tools for identifying oligomer sequences that can adopt desired conformations. Here, we present expansions to the ROSETTA platform that enable structure prediction and design of five non-peptidic oligomer scaffolds (noncanonical backbones, oligooxopiperazines, oligo-peptoids, [Formula: see text]-peptides, hydrogen bond surrogate helices and oligosaccharides. This work is complementary to prior additions to model noncanonical protein side chains in ROSETTA. The main purpose of our manuscript is to give a detailed description to current and future developers of how each of these noncanonical backbones was implemented. Furthermore, we provide a general outline for implementation of new backbone types not discussed here. To illustrate the utility of this approach, we describe the first tests of the ROSETTA molecular mechanics energy function in the context of oligooxopiperazines, using quantum mechanical calculations as comparison points, scanning through backbone and side chain torsion angles for a model peptidomimetic. Finally, as an example of a novel design application, we describe the automated design of an oligooxopiperazine that inhibits the p53-MDM2 protein-protein interaction. For the general biological and bioengineering community, several noncanonical backbones have been incorporated into web applications that allow users to freely and rapidly test the presented protocols (http://rosie.rosettacommons.org. This work helps address the peptidomimetic community's need for an automated and expandable

  12. Local Backbone Flexibility as a Determinant of the Apparent pKa Values of Buried Ionizable Groups in Proteins.

    Science.gov (United States)

    Peck, Meredith T; Ortega, Gabriel; De Luca-Johnson, Javier N; Schlessman, Jamie L; Robinson, Aaron C; García-Moreno E, Bertrand

    2017-10-10

    Ionizable groups buried in the hydrophobic interior of proteins are essential for energy transduction. These groups can have highly anomalous pK a values that reflect the incompatibility between charges and dehydrated environments. A systematic study of pK a values of buried ionizable groups in staphylococcal nuclease (SNase) suggests that these pK a values are determined in part by conformational reorganization of the protein. Lys-66 is one of the most deeply buried residues in SNase. We show that its apparent pK a of 5.7 reflects the average of the pK a values of Lys-66 in different conformational states of the protein. In the fully folded state, Lys-66 is deeply buried in the hydrophobic core of SNase and must titrate with a pK a of ≪5.7. In other states, the side chain of Lys-66 is fully solvent-exposed and has a normal pK a of ≈10.4. We show that the pK a of Lys-66 can be shifted from 5.7 toward a more normal value of 7.1 via the insertion of flanking Gly residues at positions 64 and 67 to promote an "open" conformation of SNase. Crystal structures and nuclear magnetic resonance spectroscopy show that in these Gly-containing variants Lys-66 can access bulk water as a consequence of overwinding of the C-terminal end of helix 1. These data illustrate that the apparent pK a values of buried groups in proteins are governed in part by the difference in free energy between different conformational states of the protein and by differences in the pK a values of the buried groups in the different conformations.

  13. Solution structure and dynamics of melanoma inhibitory activity protein

    International Nuclear Information System (INIS)

    Lougheed, Julie C.; Domaille, Peter J.; Handel, Tracy M.

    2002-01-01

    Melanoma inhibitory activity (MIA) is a small secreted protein that is implicated in cartilage cell maintenance and melanoma metastasis. It is representative of a recently discovered family of proteins that contain a Src Homologous 3 (SH3) subdomain. While SH3 domains are normally found in intracellular proteins and mediate protein-protein interactions via recognition of polyproline helices, MIA is single-domain extracellular protein, and it probably binds to a different class of ligands.Here we report the assignments, solution structure, and dynamics of human MIA determined by heteronuclear NMR methods. The structures were calculated in a semi-automated manner without manual assignment of NOE crosspeaks, and have a backbone rmsd of 0.38 A over the ordered regions of the protein. The structure consists of an SH3-like subdomain with N- and C-terminal extensions of approximately 20 amino acids each that together form a novel fold. The rmsd between the solution structure and our recently reported crystal structure is 0.86 A over the ordered regions of the backbone, and the main differences are localized to the most dynamic regions of the protein. The similarity between the NMR and crystal structures supports the use of automated NOE assignments and ambiguous restraints to accelerate the calculation of NMR structures

  14. Structures composing protein domains.

    Science.gov (United States)

    Kubrycht, Jaroslav; Sigler, Karel; Souček, Pavel; Hudeček, Jiří

    2013-08-01

    This review summarizes available data concerning intradomain structures (IS) such as functionally important amino acid residues, short linear motifs, conserved or disordered regions, peptide repeats, broadly occurring secondary structures or folds, etc. IS form structural features (units or elements) necessary for interactions with proteins or non-peptidic ligands, enzyme reactions and some structural properties of proteins. These features have often been related to a single structural level (e.g. primary structure) mostly requiring certain structural context of other levels (e.g. secondary structures or supersecondary folds) as follows also from some examples reported or demonstrated here. In addition, we deal with some functionally important dynamic properties of IS (e.g. flexibility and different forms of accessibility), and more special dynamic changes of IS during enzyme reactions and allosteric regulation. Selected notes concern also some experimental methods, still more necessary tools of bioinformatic processing and clinically interesting relationships. Copyright © 2013 Elsevier Masson SAS. All rights reserved.

  15. Solution structure, copper binding and backbone dynamics of recombinant Ber e 1-the major allergen from Brazil nut.

    Directory of Open Access Journals (Sweden)

    Louise Rundqvist

    Full Text Available BACKGROUND: The 2S albumin Ber e 1 is the major allergen in Brazil nuts. Previous findings indicated that the protein alone does not cause an allergenic response in mice, but the addition of components from a Brazil nut lipid fraction were required. Structural details of Ber e 1 may contribute to the understanding of the allergenic properties of the protein and its potential interaction partners. METHODOLOGY/PRINCIPAL FINDINGS: The solution structure of recombinant Ber e 1 was solved using NMR spectroscopy and measurements of the protein back bone dynamics at a residue-specific level were extracted using (15N-spin relaxation. A hydrophobic cavity was identified in the structure of Ber e 1. Using the paramagnetic relaxation enhancement property of Cu(2+ in conjunction with NMR, it was shown that Ber e 1 is able to specifically interact with the divalent copper ion and the binding site was modeled into the structure. The IgE binding region as well as the copper binding site show increased dynamics on both fast ps-ns timescale as well as slower µs-ms timescale. CONCLUSIONS/SIGNIFICANCE: The overall fold of Ber e 1 is similar to other 2S albumins, but the hydrophobic cavity resembles that of a homologous non-specific lipid transfer protein. Ber e 1 is the first 2S albumin shown to interact with Cu(2+ ions. This Cu(2+ binding has minimal effect on the electrostatic potential on the surface of the protein, but the charge distribution within the hydrophobic cavity is significantly altered. As the hydrophobic cavity is likely to be involved in a putative lipid interaction the Cu(2+ can in turn affect the interaction that is essential to provoke an allergenic response.

  16. Testing Backbone.js

    CERN Document Server

    Roemer, Ryan

    2013-01-01

    This book is packed with the step by step tutorial and instructions in recipe format helping you setup test infrastructure and gradually advance your skills to plan, develop, and test your backbone applications.If you are a JavaScript developer looking for recipes to create and implement test support for your backbone application, then this book is ideal for you.

  17. Backbone amide linker strategy

    DEFF Research Database (Denmark)

    Shelton, Anne Pernille Tofteng; Jensen, Knud Jørgen

    2013-01-01

    In the backbone amide linker (BAL) strategy, the peptide is anchored not at the C-terminus but through a backbone amide, which leaves the C-terminal available for various modifications. This is thus a very general strategy for the introduction of C-terminal modifications. The BAL strategy...

  18. Detection of closed influenza virus hemagglutinin fusion peptide structures in membranes by backbone {sup 13}CO-{sup 15}N rotational-echo double-resonance solid-state NMR

    Energy Technology Data Exchange (ETDEWEB)

    Ghosh, Ujjayini; Xie Li; Weliky, David P., E-mail: weliky@chemistry.msu.edu [Michigan State University, Department of Chemistry (United States)

    2013-02-15

    The influenza virus fusion peptide is the N-terminal {approx}20 residues of the HA2 subunit of the hemagglutinin protein and this peptide plays a key role in the fusion of the viral and endosomal membranes during initial infection of a cell. The fusion peptide adopts N-helix/turn/C-helix structure in both detergent and membranes with reports of both open and closed interhelical topologies. In the present study, backbone {sup 13}CO-{sup 15}N REDOR solid-state NMR was applied to the membrane-associated fusion peptide to detect the distribution of interhelical distances. The data clearly showed a large fraction of closed and semi-closed topologies and were best-fitted to a mixture of two structures that do not exchange. One of the earlier open structural models may have incorrect G13 dihedral angles derived from TALOS analysis of experimentally correct {sup 13}C shifts.

  19. Protein structure validation and refinement using amide proton chemical shifts derived from quantum mechanics

    DEFF Research Database (Denmark)

    Christensen, Anders Steen; Linnet, Troels Emtekær; Borg, Mikael

    2013-01-01

    We present the ProCS method for the rapid and accurate prediction of protein backbone amide proton chemical shifts - sensitive probes of the geometry of key hydrogen bonds that determine protein structure. ProCS is parameterized against quantum mechanical (QM) calculations and reproduces high level...

  20. New insights into structure and function of the different types of fatty acid-binding protein

    NARCIS (Netherlands)

    Zimmerman, Augusta Wilhelmina

    2002-01-01

    Fatty acid binding proteins (FABPs) are small cytosolic proteins with virtually identical backbone structures that facilitate the solubility and intracellular transport of fatty acids. They may also modulate the effect of fatty acids on various metabolic enzymes and receptors and on cellular

  1. Discrete Haar transform and protein structure.

    Science.gov (United States)

    Morosetti, S

    1997-12-01

    The discrete Haar transform of the sequence of the backbone dihedral angles (phi and psi) was performed over a set of X-ray protein structures of high resolution from the Brookhaven Protein Data Bank. Afterwards, the new dihedral angles were calculated by the inverse transform, using a growing number of Haar functions, from the lower to the higher degree. New structures were obtained using these dihedral angles, with standard values for bond lengths and angles, and with omega = 0 degree. The reconstructed structures were compared with the experimental ones, and analyzed by visual inspection and statistical analysis. When half of the Haar coefficients were used, all the reconstructed structures were not yet collapsed to a tertiary folding, but they showed yet realized most of the secondary motifs. These results indicate a substantial separation of structural information in the space of Haar transform, with the secondary structural information mainly present in the Haar coefficients of lower degrees, and the tertiary one present in the higher degree coefficients. Because of this separation, the representation of the folded structures in the space of Haar transform seems a promising candidate to encompass the problem of premature convergence in genetic algorithms.

  2. Phosphorus Binding Sites in Proteins: Structural Preorganization and Coordination

    DEFF Research Database (Denmark)

    Gruber, Mathias Felix; Greisen, Per Junior; Junker, Märta Caroline

    2014-01-01

    to individual structures that bind to phosphate groups; here, we investigate a total of 8307 structures obtained from the RCSB Protein Data Bank (PDB). An analysis of the binding site amino acid propensities reveals very characteristic first shell residue distributions, which are found to be influenced...... by the characteristics of the phosphorus compound and by the presence of cobound cations. The second shell, which supports the coordinating residues in the first shell, is found to consist mainly of protein backbone groups. Our results show how the second shell residue distribution is dictated mainly by the first shell...

  3. Improved protein surface comparison and application to low-resolution protein structure data

    Directory of Open Access Journals (Sweden)

    Kihara Daisuke

    2010-12-01

    Full Text Available Abstract Background Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM, which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs. The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. Results The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Conclusions Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.

  4. Improved protein surface comparison and application to low-resolution protein structure data.

    Science.gov (United States)

    Sael, Lee; Kihara, Daisuke

    2010-12-14

    Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.

  5. A hidden markov model derived structural alphabet for proteins.

    Science.gov (United States)

    Camproux, A C; Gautier, R; Tufféry, P

    2004-06-04

    Understanding and predicting protein structures depends on the complexity and the accuracy of the models used to represent them. We have set up a hidden Markov model that discretizes protein backbone conformation as series of overlapping fragments (states) of four residues length. This approach learns simultaneously the geometry of the states and their connections. We obtain, using a statistical criterion, an optimal systematic decomposition of the conformational variability of the protein peptidic chain in 27 states with strong connection logic. This result is stable over different protein sets. Our model fits well the previous knowledge related to protein architecture organisation and seems able to grab some subtle details of protein organisation, such as helix sub-level organisation schemes. Taking into account the dependence between the states results in a description of local protein structure of low complexity. On an average, the model makes use of only 8.3 states among 27 to describe each position of a protein structure. Although we use short fragments, the learning process on entire protein conformations captures the logic of the assembly on a larger scale. Using such a model, the structure of proteins can be reconstructed with an average accuracy close to 1.1A root-mean-square deviation and for a complexity of only 3. Finally, we also observe that sequence specificity increases with the number of states of the structural alphabet. Such models can constitute a very relevant approach to the analysis of protein architecture in particular for protein structure prediction.

  6. Protein Structure Prediction by Protein Threading

    Science.gov (United States)

    Xu, Ying; Liu, Zhijie; Cai, Liming; Xu, Dong

    The seminal work of Bowie, Lüthy, and Eisenberg (Bowie et al., 1991) on "the inverse protein folding problem" laid the foundation of protein structure prediction by protein threading. By using simple measures for fitness of different amino acid types to local structural environments defined in terms of solvent accessibility and protein secondary structure, the authors derived a simple and yet profoundly novel approach to assessing if a protein sequence fits well with a given protein structural fold. Their follow-up work (Elofsson et al., 1996; Fischer and Eisenberg, 1996; Fischer et al., 1996a,b) and the work by Jones, Taylor, and Thornton (Jones et al., 1992) on protein fold recognition led to the development of a new brand of powerful tools for protein structure prediction, which we now term "protein threading." These computational tools have played a key role in extending the utility of all the experimentally solved structures by X-ray crystallography and nuclear magnetic resonance (NMR), providing structural models and functional predictions for many of the proteins encoded in the hundreds of genomes that have been sequenced up to now.

  7. A ‘just-in-time’ HN(CA)CO experiment for the backbone assignment of large proteins with high sensitivity

    Science.gov (United States)

    Werner-Allen, Jon W.; Jiang, Ling; Zhou, Pei

    2006-07-01

    Among the suite of commonly used backbone experiments, HNCACO presents an unresolved sensitivity limitation due to fast 13CO transverse relaxation and passive 13Cα-13Cβ coupling. Here, we present a high-sensitivity 'just-in-time' (JIT) HN(CA)CO pulse sequence that uniformly refocuses 13Cα-13Cβ coupling while collecting 13CO shifts in real time. Sensitivity comparisons of the 3-D JIT HN(CA)CO, a CT-HMQC-based control, and a HSQC-based control with selective 13Cα inversion pulses were performed using a 2H/13C/15N labeled sample of the 29 kDa HCA II protein at 15 °C. The JIT experiment shows a 42% signal enhancement over the CT-HMQC-based experiment. Compared to the HSQC-based experiment, the JIT experiment is 16% less sensitive for residues experiencing proper 13Cα refocusing and 13Cα-13Cβ decoupling. However, for the remaining residues, the JIT spectrum shows a 106% average sensitivity gain over the HSQC-based experiment. The high-sensitivity JIT HNCACO experiment should be particularly beneficial for studies of large proteins to provide 13CO resonance information regardless of residue type.

  8. Pairwise NMR experiments for the determination of protein backbone dihedral angle Φ based on cross-correlated spin relaxation

    International Nuclear Information System (INIS)

    Takahashi, Hideo; Shimada, Ichio

    2007-01-01

    Novel cross-correlated spin relaxation (CCR) experiments are described, which measure pairwise CCR rates for obtaining peptide dihedral angles Φ. The experiments utilize intra-HNCA type coherence transfer to refocus 2-bond J NCα coupling evolution and generate the N (i)-C α (i) or C'(i-1)-C α (i) multiple quantum coherences which are required for measuring the desired CCR rates. The contribution from other coherences is also discussed and an appropriate setting of the evolution delays is presented. These CCR experiments were applied to 15 N- and 13 C-labeled human ubiquitin. The relevant CCR rates showed a high degree of correlation with the Φ angles observed in the X-ray structure. By utilizing these CCR experiments in combination with those previously established for obtaining dihedral angle Ψ, we can determine high resolution structures of peptides that bind weakly to large target molecules

  9. Easy and unambiguous sequential assignments of intrinsically disordered proteins by correlating the backbone 15N or 13C′ chemical shifts of multiple contiguous residues in highly resolved 3D spectra

    International Nuclear Information System (INIS)

    Yoshimura, Yuichi; Kulminskaya, Natalia V.; Mulder, Frans A. A.

    2015-01-01

    Sequential resonance assignment strategies are typically based on matching one or two chemical shifts of adjacent residues. However, resonance overlap often leads to ambiguity in resonance assignments in particular for intrinsically disordered proteins. We investigated the potential of establishing connectivity through the three-bond couplings between sequentially adjoining backbone carbonyl carbon nuclei, combined with semi-constant time chemical shift evolution, for resonance assignments of small folded and larger unfolded proteins. Extended sequential connectivity strongly lifts chemical shift degeneracy of the backbone nuclei in disordered proteins. We show here that 3D (H)N(COCO)NH and (HN)CO(CO)NH experiments with relaxation-optimized multiple pulse mixing correlate up to seven adjacent backbone amide nitrogen or carbonyl carbon nuclei, respectively, and connections across proline residues are also obtained straightforwardly. Multiple, recurrent long-range correlations with ultra-high resolution allow backbone 1 H N , 15 N H , and 13 C′ resonance assignments to be completed from a single pair of 3D experiments

  10. Perspective: Structural fluctuation of protein and Anfinsen's thermodynamic hypothesis

    Science.gov (United States)

    Hirata, Fumio; Sugita, Masatake; Yoshida, Masasuke; Akasaka, Kazuyuki

    2018-01-01

    The thermodynamics hypothesis, casually referred to as "Anfinsen's dogma," is described theoretically in terms of a concept of the structural fluctuation of protein or the first moment (average structure) and the second moment (variance and covariance) of the structural distribution. The new theoretical concept views the unfolding and refolding processes of protein as a shift of the structural distribution induced by a thermodynamic perturbation, with the variance-covariance matrix varying. Based on the theoretical concept, a method to characterize the mechanism of folding (or unfolding) is proposed. The transition state, if any, between two stable states is interpreted as a gap in the distribution, which is created due to an extensive reorganization of hydrogen bonds among back-bone atoms of protein and with water molecules in the course of conformational change. Further perspective to applying the theory to the computer-aided drug design, and to the material science, is briefly discussed.

  11. Tertiary alphabet for the observable protein structural universe.

    Science.gov (United States)

    Mackenzie, Craig O; Zhou, Jianfu; Grigoryan, Gevorg

    2016-11-22

    Here, we systematically decompose the known protein structural universe into its basic elements, which we dub tertiary structural motifs (TERMs). A TERM is a compact backbone fragment that captures the secondary, tertiary, and quaternary environments around a given residue, comprising one or more disjoint segments (three on average). We seek the set of universal TERMs that capture all structure in the Protein Data Bank (PDB), finding remarkable degeneracy. Only ∼600 TERMs are sufficient to describe 50% of the PDB at sub-Angstrom resolution. However, more rare geometries also exist, and the overall structural coverage grows logarithmically with the number of TERMs. We go on to show that universal TERMs provide an effective mapping between sequence and structure. We demonstrate that TERM-based statistics alone are sufficient to recapitulate close-to-native sequences given either NMR or X-ray backbones. Furthermore, sequence variability predicted from TERM data agrees closely with evolutionary variation. Finally, locations of TERMs in protein chains can be predicted from sequence alone based on sequence signatures emergent from TERM instances in the PDB. For multisegment motifs, this method identifies spatially adjacent fragments that are not contiguous in sequence-a major bottleneck in structure prediction. Although all TERMs recur in diverse proteins, some appear specialized for certain functions, such as interface formation, metal coordination, or even water binding. Structural biology has benefited greatly from previously observed degeneracies in structure. The decomposition of the known structural universe into a finite set of compact TERMs offers exciting opportunities toward better understanding, design, and prediction of protein structure.

  12. A rapid NMR-based method for discrimination of strain-specific cell wall teichoic acid structures reveals a third backbone type in Lactobacillus plantarum.

    Science.gov (United States)

    Tomita, Satoru; Tanaka, Naoto; Okada, Sanae

    2017-03-01

    The lactic acid bacterium Lactobacillus plantarum is capable of producing strain-specific structures of cell wall teichoic acid (WTA), an anionic polysaccharide found in the Gram-positive bacterial cell wall. In this study, we established a rapid, NMR-based procedure to discriminate WTA structures in this species, and applied it to 94 strains of L. plantarum. Six previously reported glycerol- and ribitol-containing WTA subtypes were successfully identified from 78 strains, suggesting that these were the dominant structures. However, the level of structural variety differed markedly among bacterial sources, possibly reflecting differences in strain-level microbial diversity. WTAs from eight strains were not identified based on NMR spectra and were classified into three groups. Structural analysis of a partial degradation product of an unidentified WTA produced by strain TUA 1496L revealed that the WTA was 1-O-β-d-glucosylglycerol. Two-dimensional NMR analysis of the polymer structure showed phosphodiester bonds between C-3 and C-6 of the glycerol and glucose residues, suggesting a polymer structure of 3,6΄-linked poly(1-O-β-d-glucosyl-sn-glycerol phosphate). This is the third WTA backbone structure in L. plantarum, following 3,6΄-linked poly(1-O-α-d-glucosyl-sn-glycerol phosphate) and 1,5-linked poly(ribitol phosphate). © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  13. Integral membrane protein structure determination using pseudocontact shifts

    Energy Technology Data Exchange (ETDEWEB)

    Crick, Duncan J.; Wang, Jue X. [University of Cambridge, Department of Biochemistry (United Kingdom); Graham, Bim; Swarbrick, James D. [Monash University, Monash Institute of Pharmaceutical Sciences (Australia); Mott, Helen R.; Nietlispach, Daniel, E-mail: dn206@cam.ac.uk [University of Cambridge, Department of Biochemistry (United Kingdom)

    2015-04-15

    Obtaining enough experimental restraints can be a limiting factor in the NMR structure determination of larger proteins. This is particularly the case for large assemblies such as membrane proteins that have been solubilized in a membrane-mimicking environment. Whilst in such cases extensive deuteration strategies are regularly utilised with the aim to improve the spectral quality, these schemes often limit the number of NOEs obtainable, making complementary strategies highly beneficial for successful structure elucidation. Recently, lanthanide-induced pseudocontact shifts (PCSs) have been established as a structural tool for globular proteins. Here, we demonstrate that a PCS-based approach can be successfully applied for the structure determination of integral membrane proteins. Using the 7TM α-helical microbial receptor pSRII, we show that PCS-derived restraints from lanthanide binding tags attached to four different positions of the protein facilitate the backbone structure determination when combined with a limited set of NOEs. In contrast, the same set of NOEs fails to determine the correct 3D fold. The latter situation is frequently encountered in polytopical α-helical membrane proteins and a PCS approach is thus suitable even for this particularly challenging class of membrane proteins. The ease of measuring PCSs makes this an attractive route for structure determination of large membrane proteins in general.

  14. Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

    KAUST Repository

    Najibi, Seyed Morteza

    2017-02-08

    Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.

  15. Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

    KAUST Repository

    Najibi, Seyed Morteza; Maadooliat, Mehdi; Zhou, Lan; Huang, Jianhua Z.; Gao, Xin

    2017-01-01

    Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.

  16. 1990s: High Capacity Backbones

    Indian Academy of Sciences (India)

    First page Back Continue Last page Overview Graphics. 1990s: High Capacity Backbones. Backbone capacities increased from 2.5 Gb/s to 100s of Gb/s during the 1990's. Wavelength division multiplexing with 160 waves of 10 Gb/s was commercially available. Several high-capacity backbones built in the US and Europe.

  17. Sub-nanoscale surface ruggedness provides a water-tight seal for exposed regions in soluble protein structure.

    Directory of Open Access Journals (Sweden)

    Erica Schulz

    2010-09-01

    Full Text Available Soluble proteins must maintain backbone hydrogen bonds (BHBs water-tight to ensure structural integrity. This protection is often achieved by burying the BHBs or wrapping them through intermolecular associations. On the other hand, water has low coordination resilience, with loss of hydrogen-bonding partnerships carrying significant thermodynamic cost. Thus, a core problem in structural biology is whether natural design actually exploits the water coordination stiffness to seal the backbone in regions that are exposed to the solvent. This work explores the molecular design features that make this type of seal operative, focusing on the side-chain arrangements that shield the protein backbone. We show that an efficient sealing is achieved by adapting the sub-nanoscale surface topography to the stringency of water coordination: an exposed BHB may be kept dry if the local concave curvature is small enough to impede formation of the coordination shell of a penetrating water molecule. Examination of an exhaustive database of uncomplexed proteins reveals that exposed BHBs invariably occur within such sub-nanoscale cavities in native folds, while this level of local ruggedness is absent in other regions. By contrast, BHB exposure in misfolded proteins occurs with larger local curvature promoting backbone hydration and consequently, structure disruption. These findings unravel physical constraints fitting a spatially dependent least-action for water coordination, introduce a molecular design concept, and herald the advent of water-tight peptide-based materials with sufficient backbone exposure to remain flexible.

  18. Quantitative assessments of the distinct contributions of polypeptide backbone amides versus sidechain groups to chain expansion via chemical denaturation

    Science.gov (United States)

    Holehouse, Alex S.; Garai, Kanchan; Lyle, Nicholas; Vitalis, Andreas; Pappu, Rohit V.

    2015-01-01

    In aqueous solutions with high concentrations of chemical denaturants such as urea and guanidinium chloride (GdmCl) proteins expand to populate heterogeneous conformational ensembles. These denaturing environments are thought to be good solvents for generic protein sequences because properties of conformational distributions align with those of canonical random coils. Previous studies showed that water is a poor solvent for polypeptide backbones and therefore backbones form collapsed globular structures in aqueous solvents. Here, we ask if polypeptide backbones can intrinsically undergo the requisite chain expansion in aqueous solutions with high concentrations of urea and GdmCl. We answer this question using a combination of molecular dynamics simulations and fluorescence correlation spectroscopy. We find that the degree of backbone expansion is minimal in aqueous solutions with high concentrations denaturants. Instead, polypeptide backbones sample conformations that are denaturant-specific mixtures of coils and globules, with a persistent preference for globules. Therefore, typical denaturing environments cannot be classified as good solvents for polypeptide backbones. How then do generic protein sequences expand in denaturing environments? To answer this question, we investigated the effects of sidechains using simulations of two archetypal sequences with amino acid compositions that are mixtures of charged, hydrophobic, and polar groups. We find that sidechains lower the effective concentration of backbone amides in water leading to an intrinsic expansion of polypeptide backbones in the absence of denaturants. Additional dilution of the effective concentration of backbone amides is achieved through preferential interactions with denaturants. These effects lead to conformational statistics in denaturing environments that are congruent with those of canonical random coils. Our results highlight the role of sidechain-mediated interactions as determinants of the

  19. An Algebro-Topological Description of Protein Domain Structure

    Science.gov (United States)

    Penner, Robert Clark; Knudsen, Michael; Wiuf, Carsten; Andersen, Jørgen Ellegaard

    2011-01-01

    The space of possible protein structures appears vast and continuous, and the relationship between primary, secondary and tertiary structure levels is complex. Protein structure comparison and classification is therefore a difficult but important task since structure is a determinant for molecular interaction and function. We introduce a novel mathematical abstraction based on geometric topology to describe protein domain structure. Using the locations of the backbone atoms and the hydrogen bonds, we build a combinatorial object – a so-called fatgraph. The description is discrete yet gives rise to a 2-dimensional mathematical surface. Thus, each protein domain corresponds to a particular mathematical surface with characteristic topological invariants, such as the genus (number of holes) and the number of boundary components. Both invariants are global fatgraph features reflecting the interconnectivity of the domain by hydrogen bonds. We introduce the notion of robust variables, that is variables that are robust towards minor changes in the structure/fatgraph, and show that the genus and the number of boundary components are robust. Further, we invesigate the distribution of different fatgraph variables and show how only four variables are capable of distinguishing different folds. We use local (secondary) and global (tertiary) fatgraph features to describe domain structures and illustrate that they are useful for classification of domains in CATH. In addition, we combine our method with two other methods thereby using primary, secondary, and tertiary structure information, and show that we can identify a large percentage of new and unclassified structures in CATH. PMID:21629687

  20. (nBuCp)2ZrCl2-catalyzed Ethylene-4M1P Copolymerization: Copolymer Backbone Structure, Melt Behavior, and Crystallization

    KAUST Repository

    Atiqullah, Muhammad

    2016-01-08

    The judicious design of methylaluminoxane (MAO) anions expands the scope for developing industrial metallocene catalysts. Therefore, the effects of MAO anion design on the backbone structure, melt behavior, and crystallization of ethylene−4-methyl-1-pentene (E−4M1P) copolymer were investigated. Ethylene was homopolymerized, as well as copolymerized with 4M1P, using (i) MAO anion A (unsupported [MAOCl2]−) premixed with dehydroxylated silica, (nBuCp)2ZrCl2, and Me2SiCl2; and (ii) MAO anion B (Si−O−Me2Si−[MAOCl2]−) supported with (nBuCp)2ZrCl2 on Me2SiCl2-functionalized silica. Unsupported Me2SiCl2, opposite to the supported analogue, acted as a co-chain transfer agent with 4M1P. The modeling of polyethylene melting and crystallization kinetics, including critical crystallite stability, produced insightful results. This study especially illustrates how branched polyethylene can be prepared from ethylene alone using particularly one metallocene-MAO ion pair, and how a compound, that functionalizes silica as well as terminates the chain, can synthesize ethylene−α-olefin copolymers with novel structures. Hence, it unfolds prospective future research niches in polyethyne systhesis. This article is protected by copyright. All rights reserved.

  1. Formatt: Correcting protein multiple structural alignments by incorporating sequence alignment

    Directory of Open Access Journals (Sweden)

    Daniels Noah M

    2012-10-01

    Full Text Available Abstract Background The quality of multiple protein structure alignments are usually computed and assessed based on geometric functions of the coordinates of the backbone atoms from the protein chains. These purely geometric methods do not utilize directly protein sequence similarity, and in fact, determining the proper way to incorporate sequence similarity measures into the construction and assessment of protein multiple structure alignments has proved surprisingly difficult. Results We present Formatt, a multiple structure alignment based on the Matt purely geometric multiple structure alignment program, that also takes into account sequence similarity when constructing alignments. We show that Formatt outperforms Matt and other popular structure alignment programs on the popular HOMSTRAD benchmark. For the SABMark twilight zone benchmark set that captures more remote homology, Formatt and Matt outperform other programs; depending on choice of embedded sequence aligner, Formatt produces either better sequence and structural alignments with a smaller core size than Matt, or similarly sized alignments with better sequence similarity, for a small cost in average RMSD. Conclusions Considering sequence information as well as purely geometric information seems to improve quality of multiple structure alignments, though defining what constitutes the best alignment when sequence and structural measures would suggest different alignments remains a difficult open question.

  2. Oligomeric protein structure networks: insights into protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Brinda KV

    2005-12-01

    Full Text Available Abstract Background Protein-protein association is essential for a variety of cellular processes and hence a large number of investigations are being carried out to understand the principles of protein-protein interactions. In this study, oligomeric protein structures are viewed from a network perspective to obtain new insights into protein association. Structure graphs of proteins have been constructed from a non-redundant set of protein oligomer crystal structures by considering amino acid residues as nodes and the edges are based on the strength of the non-covalent interactions between the residues. The analysis of such networks has been carried out in terms of amino acid clusters and hubs (highly connected residues with special emphasis to protein interfaces. Results A variety of interactions such as hydrogen bond, salt bridges, aromatic and hydrophobic interactions, which occur at the interfaces are identified in a consolidated manner as amino acid clusters at the interface, from this study. Moreover, the characterization of the highly connected hub-forming residues at the interfaces and their comparison with the hubs from the non-interface regions and the non-hubs in the interface regions show that there is a predominance of charged interactions at the interfaces. Further, strong and weak interfaces are identified on the basis of the interaction strength between amino acid residues and the sizes of the interface clusters, which also show that many protein interfaces are stronger than their monomeric protein cores. The interface strengths evaluated based on the interface clusters and hubs also correlate well with experimentally determined dissociation constants for known complexes. Finally, the interface hubs identified using the present method correlate very well with experimentally determined hotspots in the interfaces of protein complexes obtained from the Alanine Scanning Energetics database (ASEdb. A few predictions of interface hot

  3. A 3D-structural model of unsulfated chondroitin from high-field NMR: 4-sulfation has little effect on backbone conformation

    Science.gov (United States)

    Sattelle, Benedict M.; Shakeri, Javad; Roberts, Ian S.; Almond, Andrew

    2010-01-01

    The glycosaminoglycan chondroitin sulfate is essential in human health and disease but exactly how sulfation dictates its 3D-strucutre at the atomic level is unclear. To address this, we have purified homogenous oligosaccharides of unsulfated chondroitin (with and without 15N-enrichment) and analysed them by high-field NMR to make a comparison published chondroitin sulfate and hyaluronan 3D-structures. The result is the first full assignment of the tetrasaccharide and an experimental 3D-model of the hexasaccharide (PDB code 2KQO). In common with hyaluronan, we confirm that the amide proton is not involved in strong, persistent inter-residue hydrogen bonds. However, in contrast to hyaluronan, a hydrogen bond is not inferred between the hexosamine OH-4 and the glucuronic acid O5 atoms across the β(1→3) glycosidic linkage. The unsulfated chondroitin bond geometry differs slightly from hyaluronan by rotation about the β(1→3) ψ dihedral (as previously predicted by simulation), while the β(1→4) linkage is unaffected. Furthermore, comparison shows that this glycosidic linkage geometry is similar in chondroitin-4-sulfate. We therefore hypothesise that both hexosamine OH-4 and OH-6 atoms are solvent exposed in chondroitin, explaining why it is amenable to sulfation and hyaluronan is not, and also that 4-sulfation has little effect on backbone conformation. Our conclusions exemplify the value of the 3D-model presented here and progress our understanding of glycosaminoglycan molecular properties. PMID:20022001

  4. Fragger: a protein fragment picker for structural queries.

    Science.gov (United States)

    Berenger, Francois; Simoncini, David; Voet, Arnout; Shrestha, Rojan; Zhang, Kam Y J

    2017-01-01

    Protein modeling and design activities often require querying the Protein Data Bank (PDB) with a structural fragment, possibly containing gaps. For some applications, it is preferable to work on a specific subset of the PDB or with unpublished structures. These requirements, along with specific user needs, motivated the creation of a new software to manage and query 3D protein fragments. Fragger is a protein fragment picker that allows protein fragment databases to be created and queried. All fragment lengths are supported and any set of PDB files can be used to create a database. Fragger can efficiently search a fragment database with a query fragment and a distance threshold. Matching fragments are ranked by distance to the query. The query fragment can have structural gaps and the allowed amino acid sequences matching a query can be constrained via a regular expression of one-letter amino acid codes. Fragger also incorporates a tool to compute the backbone RMSD of one versus many fragments in high throughput. Fragger should be useful for protein design, loop grafting and related structural bioinformatics tasks.

  5. Ribosomal proteins L11 and L10.(L12)4 and the antibiotic thiostrepton interact with overlapping regions of the 23 S rRNA backbone in the ribosomal GTPase centre

    DEFF Research Database (Denmark)

    Rosendahl, G; Douthwaite, S

    1993-01-01

    RNA, and to investigate how this interaction is influenced by other ribosomal components. Complexes were characterized in both naked 23 S rRNA and ribosomes from an E. coli L11-minus strain, before and after reconstitution with L11. The protein protects 17 riboses between positions 1058 and 1085 in the naked 23 S r......The Escherichia coli ribosomal protein (r-protein) L11 and its binding site on 23 S ribosomal RNA (rRNA) are associated with ribosomal hydrolysis of guanosine 5'-triphosphate (GTP). We have used hydroxyl radical footprinting to map the contacts between L11 and the backbone riboses in 23 S r......)4 and other proteins within the ribosome. The antibiotics thiostrepton and micrococcin inhibit the catalytic functions of this region by slotting in between the accessible loops and interacting with nucleotides there....

  6. Compare local pocket and global protein structure models by small structure patterns

    KAUST Repository

    Cui, Xuefeng

    2015-09-09

    Researchers proposed several criteria to assess the quality of predicted protein structures because it is one of the essential tasks in the Critical Assessment of Techniques for Protein Structure Prediction (CASP) competitions. Popular criteria include root mean squared deviation (RMSD), MaxSub score, TM-score, GDT-TS and GDT-HA scores. All these criteria require calculation of rigid transformations to superimpose the the predicted protein structure to the native protein structure. Yet, how to obtain the rigid transformations is unknown or with high time complexity, and, hence, heuristic algorithms were proposed. In this work, we carefully design various small structure patterns, including the ones specifically tuned for local pockets. Such structure patterns are biologically meaningful, and address the issue of relying on a sufficient number of backbone residue fragments for existing methods. We sample the rigid transformations from these small structure patterns; and the optimal superpositions yield by these small structures are refined and reported. As a result, among 11; 669 pairs of predicted and native local protein pocket models from the CASP10 dataset, the GDT-TS scores calculated by our method are significantly higher than those calculated by LGA. Moreover, our program is computationally much more efficient. Source codes and executables are publicly available at http://www.cbrc.kaust.edu.sa/prosta/

  7. Deuterium isotope shifts for backbone {sup 1}H, {sup 15}N and {sup 13}C nuclei in intrinsically disordered protein {alpha}-synuclein

    Energy Technology Data Exchange (ETDEWEB)

    Maltsev, Alexander S.; Ying Jinfa; Bax, Ad, E-mail: bax@nih.gov [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    2012-10-15

    Intrinsically disordered proteins (IDPs) are abundant in nature and characterization of their potential structural propensities remains a widely pursued but challenging task. Analysis of NMR secondary chemical shifts plays an important role in such studies, but the output of such analyses depends on the accuracy of reference random coil chemical shifts. Although uniform perdeuteration of IDPs can dramatically increase spectral resolution, a feature particularly important for the poorly dispersed IDP spectra, the impact of deuterium isotope shifts on random coil values has not yet been fully characterized. Very precise {sup 2}H isotope shift measurements for {sup 13}C{sup {alpha}}, {sup 13}C{sup {beta}}, {sup 13}C Prime , {sup 15}N, and {sup 1}H{sup N} have been obtained by using a mixed sample of protonated and uniformly perdeuterated {alpha}-synuclein, a protein with chemical shifts exceptionally close to random coil values. Decomposition of these isotope shifts into one-bond, two-bond and three-bond effects as well as intra- and sequential residue contributions shows that such an analysis, which ignores conformational dependence, is meaningful but does not fully describe the total isotope shift to within the precision of the measurements. Random coil {sup 2}H isotope shifts provide an important starting point for analysis of such shifts in structural terms in folded proteins, where they are known to depend strongly on local geometry.

  8. Optimization of amino acid type-specific 13C and 15N labeling for the backbone assignment of membrane proteins by solution- and solid-state NMR with the UPLABEL algorithm

    International Nuclear Information System (INIS)

    Hefke, Frederik; Bagaria, Anurag; Reckel, Sina; Ullrich, Sandra Johanna; Dötsch, Volker; Glaubitz, Clemens; Güntert, Peter

    2011-01-01

    We present a computational method for finding optimal labeling patterns for the backbone assignment of membrane proteins and other large proteins that cannot be assigned by conventional strategies. Following the approach of Kainosho and Tsuji (Biochemistry 21:6273–6279 (1982)), types of amino acids are labeled with 13 C or/and 15 N such that cross peaks between 13 CO(i – 1) and 15 NH(i) result only for pairs of sequentially adjacent amino acids of which the first is labeled with 13 C and the second with 15 N. In this way, unambiguous sequence-specific assignments can be obtained for unique pairs of amino acids that occur exactly once in the sequence of the protein. To be practical, it is crucial to limit the number of differently labeled protein samples that have to be prepared while obtaining an optimal extent of labeled unique amino acid pairs. Our computer algorithm UPLABEL for optimal unique pair labeling, implemented in the program CYANA and in a standalone program, and also available through a web portal, uses combinatorial optimization to find for a given amino acid sequence labeling patterns that maximize the number of unique pair assignments with a minimal number of differently labeled protein samples. Various auxiliary conditions, including labeled amino acid availability and price, previously known partial assignments, and sequence regions of particular interest can be taken into account when determining optimal amino acid type-specific labeling patterns. The method is illustrated for the assignment of the human G-protein coupled receptor bradykinin B2 (B 2 R) and applied as a starting point for the backbone assignment of the membrane protein proteorhodopsin.

  9. Protein interfacial structure and nanotoxicology

    International Nuclear Information System (INIS)

    White, John W.; Perriman, Adam W.; McGillivray, Duncan J.; Lin, J.-M.

    2009-01-01

    Here we briefly recapitulate the use of X-ray and neutron reflectometry at the air-water interface to find protein structures and thermodynamics at interfaces and test a possibility for understanding those interactions between nanoparticles and proteins which lead to nanoparticle toxicology through entry into living cells. Stable monomolecular protein films have been made at the air-water interface and, with a specially designed vessel, the substrate changed from that which the air-water interfacial film was deposited. This procedure allows interactions, both chemical and physical, between introduced species and the monomolecular film to be studied by reflectometry. The method is briefly illustrated here with some new results on protein-protein interaction between β-casein and κ-casein at the air-water interface using X-rays. These two proteins are an essential component of the structure of milk. In the experiments reported, specific and directional interactions appear to cause different interfacial structures if first, a β-casein monolayer is attacked by a κ-casein solution compared to the reverse. The additional contrast associated with neutrons will be an advantage here. We then show the first results of experiments on the interaction of a β-casein monolayer with a nanoparticle titanium oxide sol, foreshadowing the study of the nanoparticle 'corona' thought to be important for nanoparticle-cell wall penetration.

  10. Protein interfacial structure and nanotoxicology

    Energy Technology Data Exchange (ETDEWEB)

    White, John W. [Research School of Chemistry, Australian National University, Canberra (Australia)], E-mail: jww@rsc.anu.edu.au; Perriman, Adam W.; McGillivray, Duncan J.; Lin, J.-M. [Research School of Chemistry, Australian National University, Canberra (Australia)

    2009-02-21

    Here we briefly recapitulate the use of X-ray and neutron reflectometry at the air-water interface to find protein structures and thermodynamics at interfaces and test a possibility for understanding those interactions between nanoparticles and proteins which lead to nanoparticle toxicology through entry into living cells. Stable monomolecular protein films have been made at the air-water interface and, with a specially designed vessel, the substrate changed from that which the air-water interfacial film was deposited. This procedure allows interactions, both chemical and physical, between introduced species and the monomolecular film to be studied by reflectometry. The method is briefly illustrated here with some new results on protein-protein interaction between {beta}-casein and {kappa}-casein at the air-water interface using X-rays. These two proteins are an essential component of the structure of milk. In the experiments reported, specific and directional interactions appear to cause different interfacial structures if first, a {beta}-casein monolayer is attacked by a {kappa}-casein solution compared to the reverse. The additional contrast associated with neutrons will be an advantage here. We then show the first results of experiments on the interaction of a {beta}-casein monolayer with a nanoparticle titanium oxide sol, foreshadowing the study of the nanoparticle 'corona' thought to be important for nanoparticle-cell wall penetration.

  11. Chemical crosslinking and mass spectrometry studies of the structure and dynamics of membrane proteins and receptors.

    Energy Technology Data Exchange (ETDEWEB)

    Haskins, William E.; Leavell, Michael D.; Lane, Pamela; Jacobsen, Richard B.; Hong, Joohee; Ayson, Marites J.; Wood, Nichole L.; Schoeniger, Joseph S.; Kruppa, Gary Hermann; Sale, Kenneth L.; Young, Malin M.; Novak, Petr

    2005-03-01

    Membrane proteins make up a diverse and important subset of proteins for which structural information is limited. In this study, chemical cross-linking and mass spectrometry were used to explore the structure of the G-protein-coupled photoreceptor bovine rhodopsin in the dark-state conformation. All experiments were performed in rod outer segment membranes using amino acid 'handles' in the native protein sequence and thus minimizing perturbations to the native protein structure. Cysteine and lysine residues were covalently cross-linked using commercially available reagents with a range of linker arm lengths. Following chemical digestion of cross-linked protein, cross-linked peptides were identified by accurate mass measurement using liquid chromatography-fourier transform mass spectrometry and an automated data analysis pipeline. Assignments were confirmed and, if necessary, resolved, by tandem MS. The relative reactivity of lysine residues participating in cross-links was evaluated by labeling with NHS-esters. A distinct pattern of cross-link formation within the C-terminal domain, and between loop I and the C-terminal domain, emerged. Theoretical distances based on cross-linking were compared to inter-atomic distances determined from the energy-minimized X-ray crystal structure and Monte Carlo conformational search procedures. In general, the observed cross-links can be explained by re-positioning participating side-chains without significantly altering backbone structure. One exception, between C3 16 and K325, requires backbone motion to bring the reactive atoms into sufficient proximity for cross-linking. Evidence from other studies suggests that residues around K325 for a region of high backbone mobility. These findings show that cross-linking studies can provide insight into the structural dynamics of membrane proteins in their native environment.

  12. Temperature-dependent spectral density analysis applied to monitoring backbone dynamics of major urinary protein-I complexed with the pheromone 2-sec-butyl-4,5-dihydrothiazole

    International Nuclear Information System (INIS)

    Krizova, Hana; Zidek, Lukas; Stone, Martin J.; Novotny, Milos V.; Sklenar, Vladimir

    2004-01-01

    Backbone dynamics of mouse major urinary protein I (MUP-I) was studied by 15 N NMR relaxation. Data were collected at multiple temperatures for a complex of MUP-I with its natural pheromonal ligand, 2-sec-4,5-dihydrothiazole, and for the free protein. The measured relaxation rates were analyzed using the reduced spectral density mapping. Graphical analysis of the spectral density values provided an unbiased qualitative picture of the internal motions. Varying temperature greatly increased the range of analyzed spectral density values and therefore improved reliability of the analysis. Quantitative parameters describing the dynamics on picosecond to nanosecond time scale were obtained using a novel method of simultaneous data fitting at multiple temperatures. Both methods showed that the backbone flexibility on the fast time scale is slightly increased upon pheromone binding, in accordance with the previously reported results. Zero-frequency spectral density values revealed conformational changes on the microsecond to millisecond time scale. Measurements at different temperatures allowed to monitor temperature depencence of the motional parameters

  13. Structural entanglements in protein complexes

    Science.gov (United States)

    Zhao, Yani; Chwastyk, Mateusz; Cieplak, Marek

    2017-06-01

    We consider multi-chain protein native structures and propose a criterion that determines whether two chains in the system are entangled or not. The criterion is based on the behavior observed by pulling at both termini of each chain simultaneously in the two chains. We have identified about 900 entangled systems in the Protein Data Bank and provided a more detailed analysis for several of them. We argue that entanglement enhances the thermodynamic stability of the system but it may have other functions: burying the hydrophobic residues at the interface and increasing the DNA or RNA binding area. We also study the folding and stretching properties of the knotted dimeric proteins MJ0366, YibK, and bacteriophytochrome. These proteins have been studied theoretically in their monomeric versions so far. The dimers are seen to separate on stretching through the tensile mechanism and the characteristic unraveling force depends on the pulling direction.

  14. An approach for high-throughput structure determination of proteins by NMR spectroscopy

    Energy Technology Data Exchange (ETDEWEB)

    Medek, Ales; Olejniczak, Edward T.; Meadows, Robert P.; Fesik, Stephen W. [Abbott Laboratories, Pharmaceutical Discovery Division (United States)

    2000-11-15

    An approach is described for rapidly determining protein structures by NMR that utilizes proteins containing {sup 13}C-methyl labeled Val, Leu, and Ile ({delta}1) and protonated Phe and Tyr in a deuterated background. Using this strategy, the key NOEs that define the hydrophobic core and overall fold of the protein are easily obtained. NMR data are acquired using cryogenic probe technology which markedly reduces the spectrometer time needed for data acquisition. The approach is demonstrated by determining the overall fold of the antiapoptotic protein, Bcl-xL, from data collected in only 4 days. Refinement of the Bcl-xL structure to a backbone rmsd of 0.95 A was accomplished with data collected in an additional 3 days. A distance analysis of 180 different proteins and structure calculations using simulated data suggests that our method will allow the global folds of a wide variety of proteins to be determined.

  15. AbDesign: An algorithm for combinatorial backbone design guided by natural conformations and sequences.

    Science.gov (United States)

    Lapidoth, Gideon D; Baran, Dror; Pszolla, Gabriele M; Norn, Christoffer; Alon, Assaf; Tyka, Michael D; Fleishman, Sarel J

    2015-08-01

    Computational design of protein function has made substantial progress, generating new enzymes, binders, inhibitors, and nanomaterials not previously seen in nature. However, the ability to design new protein backbones for function--essential to exert control over all polypeptide degrees of freedom--remains a critical challenge. Most previous attempts to design new backbones computed the mainchain from scratch. Here, instead, we describe a combinatorial backbone and sequence optimization algorithm called AbDesign, which leverages the large number of sequences and experimentally determined molecular structures of antibodies to construct new antibody models, dock them against target surfaces and optimize their sequence and backbone conformation for high stability and binding affinity. We used the algorithm to produce antibody designs that target the same molecular surfaces as nine natural, high-affinity antibodies; in five cases interface sequence identity is above 30%, and in four of those the backbone conformation at the core of the antibody binding surface is within 1 Å root-mean square deviation from the natural antibodies. Designs recapitulate polar interaction networks observed in natural complexes, and amino acid sidechain rigidity at the designed binding surface, which is likely important for affinity and specificity, is high compared to previous design studies. In designed anti-lysozyme antibodies, complementarity-determining regions (CDRs) at the periphery of the interface, such as L1 and H2, show greater backbone conformation diversity than the CDRs at the core of the interface, and increase the binding surface area compared to the natural antibody, potentially enhancing affinity and specificity. © 2015 Wiley Periodicals, Inc.

  16. Predicting protein structures with a multiplayer online game.

    Science.gov (United States)

    Cooper, Seth; Khatib, Firas; Treuille, Adrien; Barbero, Janos; Lee, Jeehyung; Beenen, Michael; Leaver-Fay, Andrew; Baker, David; Popović, Zoran; Players, Foldit

    2010-08-05

    People exert large amounts of problem-solving effort playing computer games. Simple image- and text-recognition tasks have been successfully 'crowd-sourced' through games, but it is not clear if more complex scientific problems can be solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search space. Here we describe Foldit, a multiplayer online game that engages non-scientists in solving hard prediction problems. Foldit players interact with protein structures using direct manipulation tools and user-friendly versions of algorithms from the Rosetta structure prediction methodology, while they compete and collaborate to optimize the computed energy. We show that top-ranked Foldit players excel at solving challenging structure refinement problems in which substantial backbone rearrangements are necessary to achieve the burial of hydrophobic residues. Players working collaboratively develop a rich assortment of new strategies and algorithms; unlike computational approaches, they explore not only the conformational space but also the space of possible search strategies. The integration of human visual problem-solving and strategy development capabilities with traditional computational algorithms through interactive multiplayer games is a powerful new approach to solving computationally-limited scientific problems.

  17. Soliton concepts and protein structure

    Science.gov (United States)

    Krokhotin, Andrei; Niemi, Antti J.; Peng, Xubiao

    2012-03-01

    Structural classification shows that the number of different protein folds is surprisingly small. It also appears that proteins are built in a modular fashion from a relatively small number of components. Here we propose that the modular building blocks are made of the dark soliton solution of a generalized discrete nonlinear Schrödinger equation. We find that practically all protein loops can be obtained simply by scaling the size and by joining together a number of copies of the soliton, one after another. The soliton has only two loop-specific parameters, and we compute their statistical distribution in the Protein Data Bank (PDB). We explicitly construct a collection of 200 sets of parameters, each determining a soliton profile that describes a different short loop. The ensuing profiles cover practically all those proteins in PDB that have a resolution which is better than 2.0 Å, with a precision such that the average root-mean-square distance between the loop and its soliton is less than the experimental B-factor fluctuation distance. We also present two examples that describe how the loop library can be employed both to model and to analyze folded proteins.

  18. Protein structure refinement using a quantum mechanics-based chemical shielding predictor

    DEFF Research Database (Denmark)

    Bratholm, Lars Andersen; Jensen, Jan Halborg

    2017-01-01

    The accurate prediction of protein chemical shifts using a quantum mechanics (QM)-based method has been the subject of intense research for more than 20 years but so far empirical methods for chemical shift prediction have proven more accurate. In this paper we show that a QM-based predictor...... of a protein backbone and CB chemical shifts (ProCS15, PeerJ, 2016, 3, e1344) is of comparable accuracy to empirical chemical shift predictors after chemical shift-based structural refinement that removes small structural errors. We present a method by which quantum chemistry based predictions of isotropic...

  19. High Performance Infiltrated Backbones for Cathode-Supported SOFC's

    DEFF Research Database (Denmark)

    Gil, Vanesa; Kammer Hansen, Kent

    2014-01-01

    A four-step infiltration method has been developed to infiltrate La0.75Sr0.25MnO3+δ (LSM25) nanoparticles into porous structures (YSZ or LSM-YSZ backbones). The pore size distribution in the backbones is obtained either by using PMMA and/or graphites as pore formers or by leaching treatment of sa...... of samples with Ni remained in the YSZ structure at high temperatures. All impregnated backbones, presented Rs comparable to a standard screen printed cathode, which proves that LSM nanoparticles forms a pathway for electron conduction....

  20. The DNA and RNA sugar-phosphate backbone emerges as the key player. An overview of quantum-chemical, structural biology and simulation studies

    Czech Academy of Sciences Publication Activity Database

    Šponer, Jiří; Mládek, Arnošt; Šponer, Judit E.; Svozil, Daniel; Zgarbová, M.; Banáš, Pavel; Jurečka, P.; Otyepka, M.

    2012-01-01

    Roč. 14, č. 44 (2012), s. 15257-15277 ISSN 1463-9076 R&D Projects: GA ČR(CZ) GD203/09/H046; GA ČR(CZ) GAP208/10/2302; GA ČR(CZ) GAP208/11/1822; GA ČR(CZ) GAP208/12/1878; GA ČR(CZ) GA203/09/1476; GA ČR(CZ) GBP305/12/G034 Institutional research plan: CEZ:AV0Z50040702 Keywords : DNA * RNA * sugar-phosphate backbone Subject RIV: BO - Biophysics Impact factor: 3.829, year: 2012

  1. Protein Structure Refinement by Optimization

    DEFF Research Database (Denmark)

    Carlsen, Martin

    on whether the three-dimensional structure of a homologous sequence is known. Whether or not a protein model can be used for industrial purposes depends on the quality of the predicted structure. A model can be used to design a drug when the quality is high. The overall goal of this project is to assess...... that correlates maximally to a native-decoy distance. The main contribution of this thesis is methods developed for analyzing the performance of metrically trained knowledge-based potentials and for optimizing their performance while making them less dependent on the decoy set used to define them. We focus...... being at-least a local minimum of the potential. To address how far the current functional form of the potential is from an ideal potential we present two methods for finding the optimal metrically trained potential that simultaneous has a number of native structures as a local minimum. Our results...

  2. Structure refinement of flexible proteins using dipolar couplings: Application to the protein p8MTCP1

    International Nuclear Information System (INIS)

    Demene, Helene; Ducat, Thierry; Barthe, Philippe; Delsuc, Marc-Andre; Roumestand, Christian

    2002-01-01

    The present study deals with the relevance of using mobility-averaged dipolar couplings for the structure refinement of flexible proteins. The 68-residue protein p8 MTCP1 has been chosen as model for this study. Its solution state consists mainly of three α-helices. The two N-terminal helices are strapped in a well-determined α-hairpin, whereas, due to an intrinsic mobility, the position of the third helix is less well defined in the NMR structure. To further characterize the degrees of freedom of this helix, we have measured the dipolar coupling constants in the backbone of p8 MTCP1 in a bicellar medium. We show here that including D HN dip dipolar couplings in the structure calculation protocol improves the structure of the α-hairpin but not the positioning of the third helix. This is due to the motional averaging of the dipolar couplings measured in the last helix. Performing two calculations with different force constants for the dipolar restraints highlights the inconstancy of these mobility-averaged dipolar couplings. Alternatively, prior to any structure calculations, comparing the values of the dipolar couplings measured in helix III to values back-calculated from an ideal helix demonstrates that they are atypical for a helix. This can be partly attributed to mobility effects since the inclusion of the 15 N relaxation derived order parameter allows for a better fit

  3. Backbone and sidechain 1H, 13C and 15N resonance assignments of the human brain-type fatty acid binding protein (FABP7) in its apo form and the holo forms binding to DHA, oleic acid, linoleic acid and elaidic acid

    DEFF Research Database (Denmark)

    Oeemig, Jesper S; Jørgensen, Mathilde L; Hansen, Mikka S

    2009-01-01

    In this manuscript, we present the backbone and side chain assignments of human brain-type fatty acid binding protein, also known as FABP7, in its apo form and in four different holo forms, bound to DHA, oleic acid, linoleic acid and elaidic acid.......In this manuscript, we present the backbone and side chain assignments of human brain-type fatty acid binding protein, also known as FABP7, in its apo form and in four different holo forms, bound to DHA, oleic acid, linoleic acid and elaidic acid....

  4. Formation of 1D hierarchical structures composed of Ni{sub 3}S{sub 2} nanosheets on CNTs backbone for supercapacitors and photocatalytic H{sub 2} production

    Energy Technology Data Exchange (ETDEWEB)

    Zhu, Ting; Wu, Hao Bin; Wang, Yabo; Xu, Rong; Lou, Xiong Wen [David] [School of Chemical and Biomedical Engineering, Nanyang Technological University, 70 Nanyang Drive, Singapore 637457 (Singapore)

    2012-12-15

    One-dimensional (1D) hierarchical structures composed of Ni{sub 3}S{sub 2} nanosheets grown on carbon nanotube (CNT) backbone (denoted as CNT rate at Ni{sub 3}S{sub 2}) are fabricated by a rational multi-step transformation route. The first step involves coating the CNT backbone with a layer of silica to form CNT rate at SiO{sub 2}, which serves as the substrate for the growth of nickel silicate (NiSilicate) nanosheets in the second step to form CNT rate at SiO{sub 2} rate at NiSilicate core-double shell 1D structures. Finally the as-formed CNT rate at SiO{sub 2} rate at NiSilicate 1D structures are converted into CNT-supported Ni{sub 3}S{sub 2} nanosheets via hydrothermal treatment in the presence of Na{sub 2}S. Simultaneously the intermediate silica layer is eliminated during the hydrothermal treatment, leading to the formation of CNT rate at Ni{sub 3}S{sub 2} nanostructures. Because of the unique hybrid nano-architecture, the as-prepared 1D hierarchical structure is shown to exhibit excellent performance in both supercapacitors and photocatalytic H{sub 2} production. (Copyright copyright 2012 WILEY-VCH Verlag GmbH and Co. KGaA, Weinheim)

  5. SDSL-ESR-based protein structure characterization.

    Science.gov (United States)

    Strancar, Janez; Kavalenka, Aleh; Urbancic, Iztok; Ljubetic, Ajasja; Hemminga, Marcus A

    2010-03-01

    As proteins are key molecules in living cells, knowledge about their structure can provide important insights and applications in science, biotechnology, and medicine. However, many protein structures are still a big challenge for existing high-resolution structure-determination methods, as can be seen in the number of protein structures published in the Protein Data Bank. This is especially the case for less-ordered, more hydrophobic and more flexible protein systems. The lack of efficient methods for structure determination calls for urgent development of a new class of biophysical techniques. This work attempts to address this problem with a novel combination of site-directed spin labelling electron spin resonance spectroscopy (SDSL-ESR) and protein structure modelling, which is coupled by restriction of the conformational spaces of the amino acid side chains. Comparison of the application to four different protein systems enables us to generalize the new method and to establish a general procedure for determination of protein structure.

  6. Comparison of NMR and crystal structures for the proteins TM1112 and TM1367

    International Nuclear Information System (INIS)

    Mohanty, Biswaranjan; Serrano, Pedro; Pedrini, Bill; Jaudzems, Kristaps; Geralt, Michael; Horst, Reto; Herrmann, Torsten; Elsliger, Marc-André; Wilson, Ian A.; Wüthrich, Kurt

    2010-01-01

    NMR structures of the proteins TM1112 and TM1367 solved by the JCSG in solution at 298 K could be superimposed with the corresponding crystal structures at 100 K with r.m.s.d. values of <1.0 Å for the backbone heavy atoms. For both proteins the structural differences between multiple molecules in the asymmetric unit of the crystals correlated with structural variations within the bundles of conformers used to represent the NMR solution structures. A recently introduced JCSG NMR structure-determination protocol, which makes use of the software package UNIO for extensive automation, was further evaluated by comparison of the TM1112 structure obtained using these automated methods with another NMR structure that was independently solved in another PSI center, where a largely interactive approach was applied. The NMR structures of the TM1112 and TM1367 proteins from Thermotoga maritima in solution at 298 K were determined following a new protocol which uses the software package UNIO for extensive automation. The results obtained with this novel procedure were evaluated by comparison with the crystal structures solved by the JCSG at 100 K to 1.83 and 1.90 Å resolution, respectively. In addition, the TM1112 solution structure was compared with an NMR structure solved by the NESG using a conventional largely interactive methodology. For both proteins, the newly determined NMR structure could be superimposed with the crystal structure with r.m.s.d. values of <1.0 Å for the backbone heavy atoms, which provided a starting platform to investigate local structure variations, which may arise from either the methods used or from the different chemical environments in solution and in the crystal. Thereby, these comparative studies were further explored with the use of reference NMR and crystal structures, which were computed using the NMR software with input of upper-limit distance constraints derived from the molecular models that represent the results of structure

  7. Measurement of backbone hydrogen-deuterium exchange in the type III secretion system needle protein PrgI by solid-state NMR

    Science.gov (United States)

    Chevelkov, Veniamin; Giller, Karin; Becker, Stefan; Lange, Adam

    2017-10-01

    In this report we present site-specific measurements of amide hydrogen-deuterium exchange rates in a protein in the solid state phase by MAS NMR. Employing perdeuteration, proton detection and a high external magnetic field we could adopt the highly efficient Relax-EXSY protocol previously developed for liquid state NMR. According to this method, we measured the contribution of hydrogen exchange on apparent 15N longitudinal relaxation rates in samples with differing D2O buffer content. Differences in the apparent T1 times allowed us to derive exchange rates for multiple residues in the type III secretion system needle protein.

  8. Modularity in protein structures: study on all-alpha proteins.

    Science.gov (United States)

    Khan, Taushif; Ghosh, Indira

    2015-01-01

    Modularity is known as one of the most important features of protein's robust and efficient design. The architecture and topology of proteins play a vital role by providing necessary robust scaffolds to support organism's growth and survival in constant evolutionary pressure. These complex biomolecules can be represented by several layers of modular architecture, but it is pivotal to understand and explore the smallest biologically relevant structural component. In the present study, we have developed a component-based method, using protein's secondary structures and their arrangements (i.e. patterns) in order to investigate its structural space. Our result on all-alpha protein shows that the known structural space is highly populated with limited set of structural patterns. We have also noticed that these frequently observed structural patterns are present as modules or "building blocks" in large proteins (i.e. higher secondary structure content). From structural descriptor analysis, observed patterns are found to be within similar deviation; however, frequent patterns are found to be distinctly occurring in diverse functions e.g. in enzymatic classes and reactions. In this study, we are introducing a simple approach to explore protein structural space using combinatorial- and graph-based geometry methods, which can be used to describe modularity in protein structures. Moreover, analysis indicates that protein function seems to be the driving force that shapes the known structure space.

  9. Protein enriched pasta: structure and digestibility of its protein network.

    Science.gov (United States)

    Laleg, Karima; Barron, Cécile; Santé-Lhoutellier, Véronique; Walrand, Stéphane; Micard, Valérie

    2016-02-01

    Wheat (W) pasta was enriched in 6% gluten (G), 35% faba (F) or 5% egg (E) to increase its protein content (13% to 17%). The impact of the enrichment on the multiscale structure of the pasta and on in vitro protein digestibility was studied. Increasing the protein content (W- vs. G-pasta) strengthened pasta structure at molecular and macroscopic scales but reduced its protein digestibility by 3% by forming a higher covalently linked protein network. Greater changes in the macroscopic and molecular structure of the pasta were obtained by varying the nature of protein used for enrichment. Proteins in G- and E-pasta were highly covalently linked (28-32%) resulting in a strong pasta structure. Conversely, F-protein (98% SDS-soluble) altered the pasta structure by diluting gluten and formed a weak protein network (18% covalent link). As a result, protein digestibility in F-pasta was significantly higher (46%) than in E- (44%) and G-pasta (39%). The effect of low (55 °C, LT) vs. very high temperature (90 °C, VHT) drying on the protein network structure and digestibility was shown to cause greater molecular changes than pasta formulation. Whatever the pasta, a general strengthening of its structure, a 33% to 47% increase in covalently linked proteins and a higher β-sheet structure were observed. However, these structural differences were evened out after the pasta was cooked, resulting in identical protein digestibility in LT and VHT pasta. Even after VHT drying, F-pasta had the best amino acid profile with the highest protein digestibility, proof of its nutritional interest.

  10. NMR structure of the N-terminal domain of the replication initiator protein DnaA

    Energy Technology Data Exchange (ETDEWEB)

    Wemmer, David E.; Lowery, Thomas J.; Pelton, Jeffrey G.; Chandonia, John-Marc; Kim, Rosalind; Yokota, Hisao; Wemmer, David E.

    2007-08-07

    DnaA is an essential component in the initiation of bacterial chromosomal replication. DnaA binds to a series of 9 base pair repeats leading to oligomerization, recruitment of the DnaBC helicase, and the assembly of the replication fork machinery. The structure of the N-terminal domain (residues 1-100) of DnaA from Mycoplasma genitalium was determined by NMR spectroscopy. The backbone r.m.s.d. for the first 86 residues was 0.6 +/- 0.2 Angstrom based on 742 NOE, 50 hydrogen bond, 46 backbone angle, and 88 residual dipolar coupling restraints. Ultracentrifugation studies revealed that the domain is monomeric in solution. Features on the protein surface include a hydrophobic cleft flanked by several negative residues on one side, and positive residues on the other. A negatively charged ridge is present on the opposite face of the protein. These surfaces may be important sites of interaction with other proteins involved in the replication process. Together, the structure and NMR assignments should facilitate the design of new experiments to probe the protein-protein interactions essential for the initiation of DNA replication.

  11. Inferential backbone assignment for sparse data

    International Nuclear Information System (INIS)

    Vitek, Olga; Bailey-Kellogg, Chris; Craig, Bruce; Vitek, Jan

    2006-01-01

    This paper develops an approach to protein backbone NMR assignment that effectively assigns large proteins while using limited sets of triple-resonance experiments. Our approach handles proteins with large fractions of missing data and many ambiguous pairs of pseudoresidues, and provides a statistical assessment of confidence in global and position-specific assignments. The approach is tested on an extensive set of experimental and synthetic data of up to 723 residues, with match tolerances of up to 0.5 ppm for C α and C β resonance types. The tests show that the approach is particularly helpful when data contain experimental noise and require large match tolerances. The keys to the approach are an empirical Bayesian probability model that rigorously accounts for uncertainty in the data at all stages in the analysis, and a hybrid stochastic tree-based search algorithm that effectively explores the large space of possible assignments

  12. p15PAF is an intrinsically disordered protein with nonrandom structural preferences at sites of interaction with other proteins.

    Science.gov (United States)

    De Biasio, Alfredo; Ibáñez de Opakua, Alain; Cordeiro, Tiago N; Villate, Maider; Merino, Nekane; Sibille, Nathalie; Lelli, Moreno; Diercks, Tammo; Bernadó, Pau; Blanco, Francisco J

    2014-02-18

    We present to our knowledge the first structural characterization of the proliferating-cell-nuclear-antigen-associated factor p15(PAF), showing that it is monomeric and intrinsically disordered in solution but has nonrandom conformational preferences at sites of protein-protein interactions. p15(PAF) is a 12 kDa nuclear protein that acts as a regulator of DNA repair during DNA replication. The p15(PAF) gene is overexpressed in several types of human cancer. The nearly complete NMR backbone assignment of p15(PAF) allowed us to measure 86 N-H(N) residual dipolar couplings. Our residual dipolar coupling analysis reveals nonrandom conformational preferences in distinct regions, including the proliferating-cell-nuclear-antigen-interacting protein motif (PIP-box) and the KEN-box (recognized by the ubiquitin ligase that targets p15(PAF) for degradation). In accordance with these findings, analysis of the (15)N R2 relaxation rates shows a relatively reduced mobility for the residues in these regions. The agreement between the experimental small angle x-ray scattering curve of p15(PAF) and that computed from a statistical coil ensemble corrected for the presence of local secondary structural elements further validates our structural model for p15(PAF). The coincidence of these transiently structured regions with protein-protein interaction and posttranslational modification sites suggests a possible role for these structures as molecular recognition elements for p15(PAF). Copyright © 2014 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  13. Backbone Brackets and Arginine Tweezers delineate Class I and Class II aminoacyl tRNA synthetases

    Science.gov (United States)

    Haupt, V. Joachim; Schroeder, Michael; Labudde, Dirk

    2018-01-01

    The origin of the machinery that realizes protein biosynthesis in all organisms is still unclear. One key component of this machinery are aminoacyl tRNA synthetases (aaRS), which ligate tRNAs to amino acids while consuming ATP. Sequence analyses revealed that these enzymes can be divided into two complementary classes. Both classes differ significantly on a sequence and structural level, feature different reaction mechanisms, and occur in diverse oligomerization states. The one unifying aspect of both classes is their function of binding ATP. We identified Backbone Brackets and Arginine Tweezers as most compact ATP binding motifs characteristic for each Class. Geometric analysis shows a structural rearrangement of the Backbone Brackets upon ATP binding, indicating a general mechanism of all Class I structures. Regarding the origin of aaRS, the Rodin-Ohno hypothesis states that the peculiar nature of the two aaRS classes is the result of their primordial forms, called Protozymes, being encoded on opposite strands of the same gene. Backbone Brackets and Arginine Tweezers were traced back to the proposed Protozymes and their more efficient successors, the Urzymes. Both structural motifs can be observed as pairs of residues in contemporary structures and it seems that the time of their addition, indicated by their placement in the ancient aaRS, coincides with the evolutionary trace of Proto- and Urzymes. PMID:29659563

  14. Structure-based barcoding of proteins.

    Science.gov (United States)

    Metri, Rahul; Jerath, Gaurav; Kailas, Govind; Gacche, Nitin; Pal, Adityabarna; Ramakrishnan, Vibin

    2014-01-01

    A reduced representation in the format of a barcode has been developed to provide an overview of the topological nature of a given protein structure from 3D coordinate file. The molecular structure of a protein coordinate file from Protein Data Bank is first expressed in terms of an alpha-numero code and further converted to a barcode image. The barcode representation can be used to compare and contrast different proteins based on their structure. The utility of this method has been exemplified by comparing structural barcodes of proteins that belong to same fold family, and across different folds. In addition to this, we have attempted to provide an illustration to (i) the structural changes often seen in a given protein molecule upon interaction with ligands and (ii) Modifications in overall topology of a given protein during evolution. The program is fully downloadable from the website http://www.iitg.ac.in/probar/. © 2013 The Protein Society.

  15. Peptoid-Peptide hybrid backbone architectures

    DEFF Research Database (Denmark)

    Olsen, Christian Adam

    2010-01-01

    Peptidomimetic oligomers and foldamers have received considerable attention for over a decade, with beta-peptides and the so-called peptoids (N-alkylglycine oligomers) representing prominent examples of such architectures. Lately, hybrid or mixed backbones consisting of both alpha- and beta......-amino acids (alpha/beta-peptides) have been investigated in some detail as well. The present Minireview is a survey of the literature concerning hybrid structures of alpha-amino acids and peptoids, including beta-peptoids (N-alkyl-beta-alanine oligomers), and is intended to give an overview of this area...

  16. Backbone upgrades and DEC equipment replacement

    Science.gov (United States)

    Vancamp, Warren

    1991-01-01

    The NASA Science Internet (NSI) dual protocol backbone is outlined. It includes DECnet link upgrades to match TCP/IP link performance. It also includes the integration of backbone resources and central management. The phase 1 transition process is outlined.

  17. The interface of protein structure, protein biophysics, and molecular evolution

    Science.gov (United States)

    Liberles, David A; Teichmann, Sarah A; Bahar, Ivet; Bastolla, Ugo; Bloom, Jesse; Bornberg-Bauer, Erich; Colwell, Lucy J; de Koning, A P Jason; Dokholyan, Nikolay V; Echave, Julian; Elofsson, Arne; Gerloff, Dietlind L; Goldstein, Richard A; Grahnen, Johan A; Holder, Mark T; Lakner, Clemens; Lartillot, Nicholas; Lovell, Simon C; Naylor, Gavin; Perica, Tina; Pollock, David D; Pupko, Tal; Regan, Lynne; Roger, Andrew; Rubinstein, Nimrod; Shakhnovich, Eugene; Sjölander, Kimmen; Sunyaev, Shamil; Teufel, Ashley I; Thorne, Jeffrey L; Thornton, Joseph W; Weinreich, Daniel M; Whelan, Simon

    2012-01-01

    Abstract The interface of protein structural biology, protein biophysics, molecular evolution, and molecular population genetics forms the foundations for a mechanistic understanding of many aspects of protein biochemistry. Current efforts in interdisciplinary protein modeling are in their infancy and the state-of-the art of such models is described. Beyond the relationship between amino acid substitution and static protein structure, protein function, and corresponding organismal fitness, other considerations are also discussed. More complex mutational processes such as insertion and deletion and domain rearrangements and even circular permutations should be evaluated. The role of intrinsically disordered proteins is still controversial, but may be increasingly important to consider. Protein geometry and protein dynamics as a deviation from static considerations of protein structure are also important. Protein expression level is known to be a major determinant of evolutionary rate and several considerations including selection at the mRNA level and the role of interaction specificity are discussed. Lastly, the relationship between modeling and needed high-throughput experimental data as well as experimental examination of protein evolution using ancestral sequence resurrection and in vitro biochemistry are presented, towards an aim of ultimately generating better models for biological inference and prediction. PMID:22528593

  18. SDSL-ESR-based protein structure characterization

    NARCIS (Netherlands)

    Strancar, J.; Kavalenka, A.A.; Urbancic, I.; Ljubetic, A.; Hemminga, M.A.

    2010-01-01

    As proteins are key molecules in living cells, knowledge about their structure can provide important insights and applications in science, biotechnology, and medicine. However, many protein structures are still a big challenge for existing high-resolution structure-determination methods, as can be

  19. Overcoming barriers to membrane protein structure determination.

    Science.gov (United States)

    Bill, Roslyn M; Henderson, Peter J F; Iwata, So; Kunji, Edmund R S; Michel, Hartmut; Neutze, Richard; Newstead, Simon; Poolman, Bert; Tate, Christopher G; Vogel, Horst

    2011-04-01

    After decades of slow progress, the pace of research on membrane protein structures is beginning to quicken thanks to various improvements in technology, including protein engineering and microfocus X-ray diffraction. Here we review these developments and, where possible, highlight generic new approaches to solving membrane protein structures based on recent technological advances. Rational approaches to overcoming the bottlenecks in the field are urgently required as membrane proteins, which typically comprise ~30% of the proteomes of organisms, are dramatically under-represented in the structural database of the Protein Data Bank.

  20. Backbone dynamics of a model membrane protein: measurement of individual amide hydrogen-exchange rates in detergent-solubilized M13 coat protein using 13C NMR hydrogen/deuterium isotope shifts

    International Nuclear Information System (INIS)

    Henry, G.D.; Weiner, J.H.; Sykes, B.D.

    1987-01-01

    Hydrogen-exchange rates have been measured for individual assigned amide protons in M13 coat protein, a 50-residue integral membrane protein, using a 13 C nuclear magnetic resonance (NMR) equilibrium isotope shift technique. The locations of the more rapidly exchanging amides have been determined. In D 2 O solutions, a peptide carbonyl resonance undergoes a small upfield isotope shift (0.08-0.09 ppm) from its position in H 2 O solutions; in 1:1 H 2 O/D 2 O mixtures, the carbonyl line shape is determined by the exchange rate at the adjacent nitrogen atom. M13 coat protein was labeled biosynthetically with 13 C at the peptide carbonyls of alanine, glycine, phenylalanine, proline, and lysine, and the exchange rates of 12 assigned amide protons in the hydrophilic regions were measured as a function of pH by using the isotope shift method. This equilibrium technique is sensitive to the more rapidly exchanging protons which are difficult to measure by classical exchange-out experiments. In proteins, structural factors, notably H bonding, can decrease the exchange rate of an amide proton by many orders of magnitude from that observed in the freely exposed amides of model peptides such as poly(DL-alanine). With corrections for sequence-related inductive effects, the retardation of amide exchange in sodium dodecyl sulfate solubilized coat protein has been calculated with respect to poly(DL-alanine). The most rapidly exchanging protons, which are retarded very little or not at all, are shown to occur at the N- and C-termini of the molecule. A model of the detergent-solubilized coat protein is constructed from these H-exchange data which is consistent with circular dichroism and other NMR results

  1. CASD-NMR 2: robust and accurate unsupervised analysis of raw NOESY spectra and protein structure determination with UNIO

    International Nuclear Information System (INIS)

    Guerry, Paul; Duong, Viet Dung; Herrmann, Torsten

    2015-01-01

    UNIO is a comprehensive software suite for protein NMR structure determination that enables full automation of all NMR data analysis steps involved—including signal identification in NMR spectra, sequence-specific backbone and side-chain resonance assignment, NOE assignment and structure calculation. Within the framework of the second round of the community-wide stringent blind NMR structure determination challenge (CASD-NMR 2), we participated in two categories of CASD-NMR 2, namely using either raw NMR spectra or unrefined NOE peak lists as input. A total of 15 resulting NMR structure bundles were submitted for 9 out of 10 blind protein targets. All submitted UNIO structures accurately coincided with the corresponding blind targets as documented by an average backbone root mean-square deviation to the reference proteins of only 1.2 Å. Also, the precision of the UNIO structure bundles was virtually identical to the ensemble of reference structures. By assessing the quality of all UNIO structures submitted to the two categories, we find throughout that only the UNIO–ATNOS/CANDID approach using raw NMR spectra consistently yielded structure bundles of high quality for direct deposition in the Protein Data Bank. In conclusion, the results obtained in CASD-NMR 2 are another vital proof for robust, accurate and unsupervised NMR data analysis by UNIO for real-world applications

  2. Mapping monomeric threading to protein-protein structure prediction.

    Science.gov (United States)

    Guerler, Aysam; Govindarajoo, Brandon; Zhang, Yang

    2013-03-25

    The key step of template-based protein-protein structure prediction is the recognition of complexes from experimental structure libraries that have similar quaternary fold. Maintaining two monomer and dimer structure libraries is however laborious, and inappropriate library construction can degrade template recognition coverage. We propose a novel strategy SPRING to identify complexes by mapping monomeric threading alignments to protein-protein interactions based on the original oligomer entries in the PDB, which does not rely on library construction and increases the efficiency and quality of complex template recognitions. SPRING is tested on 1838 nonhomologous protein complexes which can recognize correct quaternary template structures with a TM score >0.5 in 1115 cases after excluding homologous proteins. The average TM score of the first model is 60% and 17% higher than that by HHsearch and COTH, respectively, while the number of targets with an interface RMSD benchmark proteins. Although the relative performance of SPRING and ZDOCK depends on the level of homology filters, a combination of the two methods can result in a significantly higher model quality than ZDOCK at all homology thresholds. These data demonstrate a new efficient approach to quaternary structure recognition that is ready to use for genome-scale modeling of protein-protein interactions due to the high speed and accuracy.

  3. DNA-to-protein crosslinks and backbone breaks caused by far- and near-ultraviolet, and visible light radiations in mammalian cells

    International Nuclear Information System (INIS)

    Peak, M.J.; Peak, J.G.

    1986-01-01

    Spectral responses for DNA damages caused by far-uv, near-uv, and visible light radiations have been studied. The near congruence of the spectra for far-uv damages and the spectrum of DNA is good evidence that the mechanism is the same for the induction of breaks, crosslinks, and pyrimidine dimers. For near-uv, the different spectra imply that at least several nonDNA sensitizer molecules act as primary chromophores, but that DNA damage eventually results. With the understanding that near-uv and visible radiations produce a variety of chemically potent reactive oxygen species within the cell, we recognize the possibility for many types of DNA damage. If we assume that SSBs and DNA-to-protein crosslinks are random single events along the genome, it is possible to compute the number of events per cell genome per lethal event caused by the different energies used. In the near-uv and visible region, many more breaks and crosslinks are formed per lethal event than by far-uv. About 20 times more SSBs per lethal event are caused by 365-nm radiation than by x-rays, strong evidence that these breaks are effectively repaired. It is therefore likely that SSBs are not a serious event with regard to cellular lethality. The role of crosslinks and their repair in lethal events is less clear. The lack of any correlation at all between the action spectra for SSBs, or crosslinks, and lethality and mutagenesis in the same cells is evidence that another lesion or lesions are involved in these events. The multitude of chemical events that can be caused in cellular metabolites by the reactive species generated by these long wavelengths of radiation means that death is attributable to the total spectrum of changed chemicals delivered by a lethal dose, only some of which are DNA changes leading to SSBs and crosslinks. 43 refs., 3 figs., 2 tabs

  4. PSAIA – Protein Structure and Interaction Analyzer

    Directory of Open Access Journals (Sweden)

    Vlahoviček Kristian

    2008-04-01

    Full Text Available Abstract Background PSAIA (Protein Structure and Interaction Analyzer was developed to compute geometric parameters for large sets of protein structures in order to predict and investigate protein-protein interaction sites. Results In addition to most relevant established algorithms, PSAIA offers a new method PIADA (Protein Interaction Atom Distance Algorithm for the determination of residue interaction pairs. We found that PIADA produced more satisfactory results than comparable algorithms implemented in PSAIA. Particular advantages of PSAIA include its capacity to combine different methods to detect the locations and types of interactions between residues and its ability, without any further automation steps, to handle large numbers of protein structures and complexes. Generally, the integration of a variety of methods enables PSAIA to offer easier automation of analysis and greater reliability of results. PSAIA can be used either via a graphical user interface or from the command-line. Results are generated in either tabular or XML format. Conclusion In a straightforward fashion and for large sets of protein structures, PSAIA enables the calculation of protein geometric parameters and the determination of location and type for protein-protein interaction sites. XML formatted output enables easy conversion of results to various formats suitable for statistic analysis. Results from smaller data sets demonstrated the influence of geometry on protein interaction sites. Comprehensive analysis of properties of large data sets lead to new information useful in the prediction of protein-protein interaction sites.

  5. NAPS: Network Analysis of Protein Structures

    Science.gov (United States)

    Chakrabarty, Broto; Parekh, Nita

    2016-01-01

    Traditionally, protein structures have been analysed by the secondary structure architecture and fold arrangement. An alternative approach that has shown promise is modelling proteins as a network of non-covalent interactions between amino acid residues. The network representation of proteins provide a systems approach to topological analysis of complex three-dimensional structures irrespective of secondary structure and fold type and provide insights into structure-function relationship. We have developed a web server for network based analysis of protein structures, NAPS, that facilitates quantitative and qualitative (visual) analysis of residue–residue interactions in: single chains, protein complex, modelled protein structures and trajectories (e.g. from molecular dynamics simulations). The user can specify atom type for network construction, distance range (in Å) and minimal amino acid separation along the sequence. NAPS provides users selection of node(s) and its neighbourhood based on centrality measures, physicochemical properties of amino acids or cluster of well-connected residues (k-cliques) for further analysis. Visual analysis of interacting domains and protein chains, and shortest path lengths between pair of residues are additional features that aid in functional analysis. NAPS support various analyses and visualization views for identifying functional residues, provide insight into mechanisms of protein folding, domain-domain and protein–protein interactions for understanding communication within and between proteins. URL:http://bioinf.iiit.ac.in/NAPS/. PMID:27151201

  6. PPM-One: a static protein structure based chemical shift predictor

    International Nuclear Information System (INIS)

    Li, Dawei; Brüschweiler, Rafael

    2015-01-01

    We mined the most recent editions of the BioMagResDataBank and the protein data bank to parametrize a new empirical knowledge-based chemical shift predictor of protein backbone atoms using either a linear or an artificial neural network model. The resulting chemical shift predictor PPM-One accepts a single static 3D structure as input and emulates the effect of local protein dynamics via interatomic steric contacts. Furthermore, the chemical shift prediction was extended to most side-chain protons and it is found that the prediction accuracy is at a level allowing an independent assessment of stereospecific assignments. For a previously established set of test proteins some overall improvement was achieved over current top-performing chemical shift prediction programs

  7. Nonribosomal biosynthesis of backbone-modified peptides

    Science.gov (United States)

    Niquille, David L.; Hansen, Douglas A.; Mori, Takahiro; Fercher, David; Kries, Hajo; Hilvert, Donald

    2018-03-01

    Biosynthetic modification of nonribosomal peptide backbones represents a potentially powerful strategy to modulate the structure and properties of an important class of therapeutics. Using a high-throughput assay for catalytic activity, we show here that an L-Phe-specific module of an archetypal nonribosomal peptide synthetase can be reprogrammed to accept and process the backbone-modified amino acid (S)-β-Phe with near-native specificity and efficiency. A co-crystal structure with a non-hydrolysable aminoacyl-AMP analogue reveals the origins of the 40,000-fold α/β-specificity switch, illuminating subtle but precise remodelling of the active site. When the engineered catalyst was paired with downstream module(s), (S)-β-Phe-containing peptides were produced at preparative scale in vitro (~1 mmol) and high titres in vivo (~100 mg l-1), highlighting the potential of biosynthetic pathway engineering for the construction of novel nonribosomal β-frameworks.

  8. Solution NMR structure determination of proteins revisited

    International Nuclear Information System (INIS)

    Billeter, Martin; Wagner, Gerhard; Wuethrich, Kurt

    2008-01-01

    This 'Perspective' bears on the present state of protein structure determination by NMR in solution. The focus is on a comparison of the infrastructure available for NMR structure determination when compared to protein crystal structure determination by X-ray diffraction. The main conclusion emerges that the unique potential of NMR to generate high resolution data also on dynamics, interactions and conformational equilibria has contributed to a lack of standard procedures for structure determination which would be readily amenable to improved efficiency by automation. To spark renewed discussion on the topic of NMR structure determination of proteins, procedural steps with high potential for improvement are identified

  9. Extracting knowledge from protein structure geometry

    DEFF Research Database (Denmark)

    Røgen, Peter; Koehl, Patrice

    2013-01-01

    potential from geometric knowledge extracted from native and misfolded conformers of protein structures. This new potential, Metric Protein Potential (MPP), has two main features that are key to its success. Firstly, it is composite in that it includes local and nonlocal geometric information on proteins...

  10. Top-Down Hydrogen-Deuterium Exchange Analysis of Protein Structures Using Ultraviolet Photodissociation.

    Science.gov (United States)

    Brodie, Nicholas I; Huguet, Romain; Zhang, Terry; Viner, Rosa; Zabrouskov, Vlad; Pan, Jingxi; Petrotchenko, Evgeniy V; Borchers, Christoph H

    2018-03-06

    Top-down hydrogen-deuterium exchange (HDX) analysis using electron capture or transfer dissociation Fourier transform mass spectrometry (FTMS) is a powerful method for the analysis of secondary structure of proteins in solution. The resolution of the method is a function of the degree of fragmentation of backbone bonds in the proteins. While fragmentation is usually extensive near the N- and C-termini, electron capture (ECD) or electron transfer dissociation (ETD) fragmentation methods sometimes lack good coverage of certain regions of the protein, most often in the middle of the sequence. Ultraviolet photodissociation (UVPD) is a recently developed fast-fragmentation technique, which provides extensive backbone fragmentation that can be complementary in sequence coverage to the aforementioned electron-based fragmentation techniques. Here, we explore the application of electrospray ionization (ESI)-UVPD FTMS on an Orbitrap Fusion Lumos Tribrid mass spectrometer to top-down HDX analysis of proteins. We have incorporated UVPD-specific fragment-ion types and fragment-ion mixtures into our isotopic envelope fitting software (HDX Match) for the top-down HDX analysis. We have shown that UVPD data is complementary to ETD, thus improving the overall resolution when used as a combined approach.

  11. Validation-driven protein-structure improvement

    NARCIS (Netherlands)

    Touw, W.G.

    2016-01-01

    High-quality protein structure models are essential for many Life Science applications, such as protein engineering, molecular dynamics, drug design, and homology modelling. The WHAT_CHECK model validation project and the PDB_REDO model optimisation project have shown that many structure models in

  12. Development of techniques in magnetic resonance and structural studies of the prion protein

    Energy Technology Data Exchange (ETDEWEB)

    Bitter, Hans-Marcus L. [Univ. of California, Berkeley, CA (United States)

    2000-07-01

    Magnetic resonance is the most powerful analytical tool used by chemists today. Its applications range from determining structures of large biomolecules to imaging of human brains. Nevertheless, magnetic resonance remains a relatively young field, in which many techniques are currently being developed that have broad applications. In this dissertation, two new techniques are presented, one that enables the determination of torsion angles in solid-state peptides and proteins, and another that involves imaging of heterogenous materials at ultra-low magnetic fields. In addition, structural studies of the prion protein via solid-state NMR are described. More specifically, work is presented in which the dependence of chemical shifts on local molecular structure is used to predict chemical shift tensors in solid-state peptides with theoretical ab initio surfaces. These predictions are then used to determine the backbone dihedral angles in peptides. This method utilizes the theoretical chemicalshift tensors and experimentally determined chemical-shift anisotropies (CSAs) to predict the backbone and side chain torsion angles in alanine, leucine, and valine residues. Additionally, structural studies of prion protein fragments are described in which conformationally-dependent chemical-shift measurements were made to gain insight into the structural differences between the various conformational states of the prion protein. These studies are of biological and pathological interest since conformational changes in the prion protein are believed to cause prion diseases. Finally, an ultra-low field magnetic resonance imaging technique is described that enables imaging and characterization of heterogeneous and porous media. The notion of imaging gases at ultra-low fields would appear to be very difficult due to the prohibitively low polarization and spin densities as well as the low sensitivities of conventional Faraday coil detectors. However, Chapter 5 describes how gas imaging

  13. On the relationship between residue structural environment and sequence conservation in proteins.

    Science.gov (United States)

    Liu, Jen-Wei; Lin, Jau-Ji; Cheng, Chih-Wen; Lin, Yu-Feng; Hwang, Jenn-Kang; Huang, Tsun-Tsao

    2017-09-01

    Residues that are crucial to protein function or structure are usually evolutionarily conserved. To identify the important residues in protein, sequence conservation is estimated, and current methods rely upon the unbiased collection of homologous sequences. Surprisingly, our previous studies have shown that the sequence conservation is closely correlated with the weighted contact number (WCN), a measure of packing density for residue's structural environment, calculated only based on the C α positions of a protein structure. Moreover, studies have shown that sequence conservation is correlated with environment-related structural properties calculated based on different protein substructures, such as a protein's all atoms, backbone atoms, side-chain atoms, or side-chain centroid. To know whether the C α atomic positions are adequate to show the relationship between residue environment and sequence conservation or not, here we compared C α atoms with other substructures in their contributions to the sequence conservation. Our results show that C α positions are substantially equivalent to the other substructures in calculations of various measures of residue environment. As a result, the overlapping contributions between C α atoms and the other substructures are high, yielding similar structure-conservation relationship. Take the WCN as an example, the average overlapping contribution to sequence conservation is 87% between C α and all-atom substructures. These results indicate that only C α atoms of a protein structure could reflect sequence conservation at the residue level. © 2017 Wiley Periodicals, Inc.

  14. Heterochiral Knottin Protein: Folding and Solution Structure.

    Science.gov (United States)

    Mong, Surin K; Cochran, Frank V; Yu, Hongtao; Graziano, Zachary; Lin, Yu-Shan; Cochran, Jennifer R; Pentelute, Bradley L

    2017-10-31

    Homochirality is a general feature of biological macromolecules, and Nature includes few examples of heterochiral proteins. Herein, we report on the design, chemical synthesis, and structural characterization of heterochiral proteins possessing loops of amino acids of chirality opposite to that of the rest of a protein scaffold. Using the protein Ecballium elaterium trypsin inhibitor II, we discover that selective β-alanine substitution favors the efficient folding of our heterochiral constructs. Solution nuclear magnetic resonance spectroscopy of one such heterochiral protein reveals a homogeneous global fold. Additionally, steered molecular dynamics simulation indicate β-alanine reduces the free energy required to fold the protein. We also find these heterochiral proteins to be more resistant to proteolysis than homochiral l-proteins. This work informs the design of heterochiral protein architectures containing stretches of both d- and l-amino acids.

  15. Amino acid code of protein secondary structure.

    Science.gov (United States)

    Shestopalov, B V

    2003-01-01

    The calculation of protein three-dimensional structure from the amino acid sequence is a fundamental problem to be solved. This paper presents principles of the code theory of protein secondary structure, and their consequence--the amino acid code of protein secondary structure. The doublet code model of protein secondary structure, developed earlier by the author (Shestopalov, 1990), is part of this theory. The theory basis are: 1) the name secondary structure is assigned to the conformation, stabilized only by the nearest (intraresidual) and middle-range (at a distance no more than that between residues i and i + 5) interactions; 2) the secondary structure consists of regular (alpha-helical and beta-structural) and irregular (coil) segments; 3) the alpha-helices, beta-strands and coil segments are encoded, respectively, by residue pairs (i, i + 4), (i, i + 2), (i, i = 1), according to the numbers of residues per period, 3.6, 2, 1; 4) all such pairs in the amino acid sequence are codons for elementary structural elements, or structurons; 5) the codons are divided into 21 types depending on their strength, i.e. their encoding capability; 6) overlappings of structurons of one and the same structure generate the longer segments of this structure; 7) overlapping of structurons of different structures is forbidden, and therefore selection of codons is required, the codon selection is hierarchic; 8) the code theory of protein secondary structure generates six variants of the amino acid code of protein secondary structure. There are two possible kinds of model construction based on the theory: the physical one using physical properties of amino acid residues, and the statistical one using results of statistical analysis of a great body of structural data. Some evident consequences of the theory are: a) the theory can be used for calculating the secondary structure from the amino acid sequence as a partial solution of the problem of calculation of protein three

  16. A Self-Assisting Protein Folding Model for Teaching Structural Molecular Biology.

    Science.gov (United States)

    Davenport, Jodi; Pique, Michael; Getzoff, Elizabeth; Huntoon, Jon; Gardner, Adam; Olson, Arthur

    2017-04-04

    Structural molecular biology is now becoming part of high school science curriculum thus posing a challenge for teachers who need to convey three-dimensional (3D) structures with conventional text and pictures. In many cases even interactive computer graphics does not go far enough to address these challenges. We have developed a flexible model of the polypeptide backbone using 3D printing technology. With this model we have produced a polypeptide assembly kit to create an idealized model of the Triosephosphate isomerase mutase enzyme (TIM), which forms a structure known as TIM barrel. This kit has been used in a laboratory practical where students perform a step-by-step investigation into the nature of protein folding, starting with the handedness of amino acids to the formation of secondary and tertiary structure. Based on the classroom evidence we collected, we conclude that these models are valuable and inexpensive resource for teaching structural molecular biology. Copyright © 2017 Elsevier Ltd. All rights reserved.

  17. K-nearest uphill clustering in the protein structure space

    KAUST Repository

    Cui, Xuefeng; Gao, Xin

    2016-01-01

    The protein structure classification problem, which is to assign a protein structure to a cluster of similar proteins, is one of the most fundamental problems in the construction and application of the protein structure space. Early manually curated

  18. Automated protein structure calculation from NMR data

    International Nuclear Information System (INIS)

    Williamson, Mike P.; Craven, C. Jeremy

    2009-01-01

    Current software is almost at the stage to permit completely automatic structure determination of small proteins of <15 kDa, from NMR spectra to structure validation with minimal user interaction. This goal is welcome, as it makes structure calculation more objective and therefore more easily validated, without any loss in the quality of the structures generated. Moreover, it releases expert spectroscopists to carry out research that cannot be automated. It should not take much further effort to extend automation to ca 20 kDa. However, there are technological barriers to further automation, of which the biggest are identified as: routines for peak picking; adoption and sharing of a common framework for structure calculation, including the assembly of an automated and trusted package for structure validation; and sample preparation, particularly for larger proteins. These barriers should be the main target for development of methodology for protein structure determination, particularly by structural genomics consortia

  19. Structural anatomy of telomere OB proteins.

    Science.gov (United States)

    Horvath, Martin P

    2011-10-01

    Telomere DNA-binding proteins protect the ends of chromosomes in eukaryotes. A subset of these proteins are constructed with one or more OB folds and bind with G+T-rich single-stranded DNA found at the extreme termini. The resulting DNA-OB protein complex interacts with other telomere components to coordinate critical telomere functions of DNA protection and DNA synthesis. While the first crystal and NMR structures readily explained protection of telomere ends, the picture of how single-stranded DNA becomes available to serve as primer and template for synthesis of new telomere DNA is only recently coming into focus. New structures of telomere OB fold proteins alongside insights from genetic and biochemical experiments have made significant contributions towards understanding how protein-binding OB proteins collaborate with DNA-binding OB proteins to recruit telomerase and DNA polymerase for telomere homeostasis. This review surveys telomere OB protein structures alongside highly comparable structures derived from replication protein A (RPA) components, with the goal of providing a molecular context for understanding telomere OB protein evolution and mechanism of action in protection and synthesis of telomere DNA.

  20. Toward structural dynamics: protein motions viewed by chemical shift modulations and direct detection of C'N multiple-quantum relaxation.

    Science.gov (United States)

    Mori, Mirko; Kateb, Fatiha; Bodenhausen, Geoffrey; Piccioli, Mario; Abergel, Daniel

    2010-03-17

    Multiple quantum relaxation in proteins reveals unexpected relationships between correlated or anti-correlated conformational backbone dynamics in alpha-helices or beta-sheets. The contributions of conformational exchange to the relaxation rates of C'N coherences (i.e., double- and zero-quantum coherences involving backbone carbonyl (13)C' and neighboring amide (15)N nuclei) depend on the kinetics of slow exchange processes, as well as on the populations of the conformations and chemical shift differences of (13)C' and (15)N nuclei. The relaxation rates of C'N coherences, which reflect concerted fluctuations due to slow chemical shift modulations (CSMs), were determined by direct (13)C detection in diamagnetic and paramagnetic proteins. In well-folded proteins such as lanthanide-substituted calbindin (CaLnCb), copper,zinc superoxide dismutase (Cu,Zn SOD), and matrix metalloproteinase (MMP12), slow conformational exchange occurs along the entire backbone. Our observations demonstrate that relaxation rates of C'N coherences arising from slow backbone dynamics have positive signs (characteristic of correlated fluctuations) in beta-sheets and negative signs (characteristic of anti-correlated fluctuations) in alpha-helices. This extends the prospects of structure-dynamics relationships to slow time scales that are relevant for protein function and enzymatic activity.

  1. Understanding Protein-Protein Interactions Using Local Structural Features

    DEFF Research Database (Denmark)

    Planas-Iglesias, Joan; Bonet, Jaume; García-García, Javier

    2013-01-01

    Protein-protein interactions (PPIs) play a relevant role among the different functions of a cell. Identifying the PPI network of a given organism (interactome) is useful to shed light on the key molecular mechanisms within a biological system. In this work, we show the role of structural features...... interacting and non-interacting protein pairs to classify the structural features that sustain the binding (or non-binding) behavior. Our study indicates that not only the interacting region but also the rest of the protein surface are important for the interaction fate. The interpretation...... to score the likelihood of the interaction between two proteins and to develop a method for the prediction of PPIs. We have tested our method on several sets with unbalanced ratios of interactions and non-interactions to simulate real conditions, obtaining accuracies higher than 25% in the most unfavorable...

  2. Algorithms for Protein Structure Prediction

    DEFF Research Database (Denmark)

    Paluszewski, Martin

    -trace. Here we present three different approaches for reconstruction of C-traces from predictable measures. In our first approach [63, 62], the C-trace is positioned on a lattice and a tabu-search algorithm is applied to find minimum energy structures. The energy function is based on half-sphere-exposure (HSE......) is more robust than standard Monte Carlo search. In the second approach for reconstruction of C-traces, an exact branch and bound algorithm has been developed [67, 65]. The model is discrete and makes use of secondary structure predictions, HSE, CN and radius of gyration. We show how to compute good lower...... bounds for partial structures very fast. Using these lower bounds, we are able to find global minimum structures in a huge conformational space in reasonable time. We show that many of these global minimum structures are of good quality compared to the native structure. Our branch and bound algorithm...

  3. Structural symmetry and protein function.

    Science.gov (United States)

    Goodsell, D S; Olson, A J

    2000-01-01

    The majority of soluble and membrane-bound proteins in modern cells are symmetrical oligomeric complexes with two or more subunits. The evolutionary selection of symmetrical oligomeric complexes is driven by functional, genetic, and physicochemical needs. Large proteins are selected for specific morphological functions, such as formation of rings, containers, and filaments, and for cooperative functions, such as allosteric regulation and multivalent binding. Large proteins are also more stable against denaturation and have a reduced surface area exposed to solvent when compared with many individual, smaller proteins. Large proteins are constructed as oligomers for reasons of error control in synthesis, coding efficiency, and regulation of assembly. Symmetrical oligomers are favored because of stability and finite control of assembly. Several functions limit symmetry, such as interaction with DNA or membranes, and directional motion. Symmetry is broken or modified in many forms: quasisymmetry, in which identical subunits adopt similar but different conformations; pleomorphism, in which identical subunits form different complexes; pseudosymmetry, in which different molecules form approximately symmetrical complexes; and symmetry mismatch, in which oligomers of different symmetries interact along their respective symmetry axes. Asymmetry is also observed at several levels. Nearly all complexes show local asymmetry at the level of side chain conformation. Several complexes have reciprocating mechanisms in which the complex is asymmetric, but, over time, all subunits cycle through the same set of conformations. Global asymmetry is only rarely observed. Evolution of oligomeric complexes may favor the formation of dimers over complexes with higher cyclic symmetry, through a mechanism of prepositioned pairs of interacting residues. However, examples have been found for all of the crystallographic point groups, demonstrating that functional need can drive the evolution of

  4. Protein folding and wring resonances

    DEFF Research Database (Denmark)

    Bohr, Jakob; Bohr, Henrik; Brunak, Søren

    1997-01-01

    The polypeptide chain of a protein is shown to obey topological contraints which enable long range excitations in the form of wring modes of the protein backbone. Wring modes of proteins of specific lengths can therefore resonate with molecular modes present in the cell. It is suggested that prot......The polypeptide chain of a protein is shown to obey topological contraints which enable long range excitations in the form of wring modes of the protein backbone. Wring modes of proteins of specific lengths can therefore resonate with molecular modes present in the cell. It is suggested...... that protein folding takes place when the amplitude of a wring excitation becomes so large that it is energetically favorable to bend the protein backbone. The condition under which such structural transformations can occur is found, and it is shown that both cold and hot denaturation (the unfolding...

  5. Efficient protein structure search using indexing methods.

    Science.gov (United States)

    Kim, Sungchul; Sael, Lee; Yu, Hwanjo

    2013-01-01

    Understanding functions of proteins is one of the most important challenges in many studies of biological processes. The function of a protein can be predicted by analyzing the functions of structurally similar proteins, thus finding structurally similar proteins accurately and efficiently from a large set of proteins is crucial. A protein structure can be represented as a vector by 3D-Zernike Descriptor (3DZD) which compactly represents the surface shape of the protein tertiary structure. This simplified representation accelerates the searching process. However, computing the similarity of two protein structures is still computationally expensive, thus it is hard to efficiently process many simultaneous requests of structurally similar protein search. This paper proposes indexing techniques which substantially reduce the search time to find structurally similar proteins. In particular, we first exploit two indexing techniques, i.e., iDistance and iKernel, on the 3DZDs. After that, we extend the techniques to further improve the search speed for protein structures. The extended indexing techniques build and utilize an reduced index constructed from the first few attributes of 3DZDs of protein structures. To retrieve top-k similar structures, top-10 × k similar structures are first found using the reduced index, and top-k structures are selected among them. We also modify the indexing techniques to support θ-based nearest neighbor search, which returns data points less than θ to the query point. The results show that both iDistance and iKernel significantly enhance the searching speed. In top-k nearest neighbor search, the searching time is reduced 69.6%, 77%, 77.4% and 87.9%, respectively using iDistance, iKernel, the extended iDistance, and the extended iKernel. In θ-based nearest neighbor serach, the searching time is reduced 80%, 81%, 95.6% and 95.6% using iDistance, iKernel, the extended iDistance, and the extended iKernel, respectively.

  6. Protein structure: geometry, topology and classification

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, William R.; May, Alex C.W.; Brown, Nigel P.; Aszodi, Andras [Division of Mathematical Biology, National Institute for Medical Research, London (United Kingdom)

    2001-04-01

    The structural principals of proteins are reviewed and analysed from a geometric perspective with a view to revealing the underlying regularities in their construction. Computer methods for the automatic comparison and classification of these structures are then reviewed with an analysis of the statistical significance of comparing different shapes. Following an analysis of the current state of the classification of proteins, more abstract geometric and topological representations are explored, including the occurrence of knotted topologies. The review concludes with a consideration of the origin of higher-level symmetries in protein structure. (author)

  7. Taking advantage of local structure descriptors to analyze interresidue contacts in protein structures and protein complexes.

    Science.gov (United States)

    Martin, Juliette; Regad, Leslie; Etchebest, Catherine; Camproux, Anne-Claude

    2008-11-15

    Interresidue protein contacts in proteins structures and at protein-protein interface are classically described by the amino acid types of interacting residues and the local structural context of the contact, if any, is described using secondary structures. In this study, we present an alternate analysis of interresidue contact using local structures defined by the structural alphabet introduced by Camproux et al. This structural alphabet allows to describe a 3D structure as a sequence of prototype fragments called structural letters, of 27 different types. Each residue can then be assigned to a particular local structure, even in loop regions. The analysis of interresidue contacts within protein structures defined using Voronoï tessellations reveals that pairwise contact specificity is greater in terms of structural letters than amino acids. Using a simple heuristic based on specificity score comparison, we find that 74% of the long-range contacts within protein structures are better described using structural letters than amino acid types. The investigation is extended to a set of protein-protein complexes, showing that the similar global rules apply as for intraprotein contacts, with 64% of the interprotein contacts best described by local structures. We then present an evaluation of pairing functions integrating structural letters to decoy scoring and show that some complexes could benefit from the use of structural letter-based pairing functions.

  8. Future High Capacity Backbone Networks

    DEFF Research Database (Denmark)

    Wang, Jiayuan

    are proposed. The work focuses on energy efficient routing algorithms in a dynamic optical core network environment, with Generalized MultiProtocol Label Switching (GMPLS) as the control plane. Energy ef- ficient routing algorithms for energy savings and CO2 savings are proposed, and their performance...... aiming for reducing the dynamic part of the energy consumption of the network may increase the fixed part of the energy consumption meanwhile. In the second half of the thesis, the conflict between energy efficiency and Quality of Service (QoS) is addressed by introducing a novel software defined......This thesis - Future High Capacity Backbone Networks - deals with the energy efficiency problems associated with the development of future optical networks. In the first half of the thesis, novel approaches for using multiple/single alternative energy sources for improving energy efficiency...

  9. Fast loop modeling for protein structures

    Science.gov (United States)

    Zhang, Jiong; Nguyen, Son; Shang, Yi; Xu, Dong; Kosztin, Ioan

    2015-03-01

    X-ray crystallography is the main method for determining 3D protein structures. In many cases, however, flexible loop regions of proteins cannot be resolved by this approach. This leads to incomplete structures in the protein data bank, preventing further computational study and analysis of these proteins. For instance, all-atom molecular dynamics (MD) simulation studies of structure-function relationship require complete protein structures. To address this shortcoming, we have developed and implemented an efficient computational method for building missing protein loops. The method is database driven and uses deep learning and multi-dimensional scaling algorithms. We have implemented the method as a simple stand-alone program, which can also be used as a plugin in existing molecular modeling software, e.g., VMD. The quality and stability of the generated structures are assessed and tested via energy scoring functions and by equilibrium MD simulations. The proposed method can also be used in template-based protein structure prediction. Work supported by the National Institutes of Health [R01 GM100701]. Computer time was provided by the University of Missouri Bioinformatics Consortium.

  10. Interplay between Peptide Bond Geometrical Parameters in Nonglobular Structural Contexts

    OpenAIRE

    Esposito, Luciana; Balasco, Nicole; De Simone, Alfonso; Berisio, Rita; Vitagliano, Luigi

    2013-01-01

    Several investigations performed in the last two decades have unveiled that geometrical parameters of protein backbone show a remarkable variability. Although these studies have provided interesting insights into one of the basic aspects of protein structure, they have been conducted on globular and water-soluble proteins. We report here a detailed analysis of backbone geometrical parameters in nonglobular proteins/peptides. We considered membrane proteins and two distinct fibrous systems (am...

  11. Simultaneous determination of protein structure and dynamics

    DEFF Research Database (Denmark)

    Lindorff-Larsen, Kresten; Best, Robert B.; DePristo, M. A.

    2005-01-01

    at the atomic level about the structural and dynamical features of proteins-with the ability of molecular dynamics simulations to explore a wide range of protein conformations. We illustrate the method for human ubiquitin in solution and find that there is considerable conformational heterogeneity throughout......We present a protocol for the experimental determination of ensembles of protein conformations that represent simultaneously the native structure and its associated dynamics. The procedure combines the strengths of nuclear magnetic resonance spectroscopy-for obtaining experimental information...... the protein structure. The interior atoms of the protein are tightly packed in each individual conformation that contributes to the ensemble but their overall behaviour can be described as having a significant degree of liquid-like character. The protocol is completely general and should lead to significant...

  12. Protein Molecular Structures, Protein SubFractions, and Protein Availability Affected by Heat Processing: A Review

    International Nuclear Information System (INIS)

    Yu, P.

    2007-01-01

    The utilization and availability of protein depended on the types of protein and their specific susceptibility to enzymatic hydrolysis (inhibitory activities) in the gastrointestine and was highly associated with protein molecular structures. Studying internal protein structure and protein subfraction profiles leaded to an understanding of the components that make up a whole protein. An understanding of the molecular structure of the whole protein was often vital to understanding its digestive behavior and nutritive value in animals. In this review, recently obtained information on protein molecular structural effects of heat processing was reviewed, in relation to protein characteristics affecting digestive behavior and nutrient utilization and availability. The emphasis of this review was on (1) using the newly advanced synchrotron technology (S-FTIR) as a novel approach to reveal protein molecular chemistry affected by heat processing within intact plant tissues; (2) revealing the effects of heat processing on the profile changes of protein subfractions associated with digestive behaviors and kinetics manipulated by heat processing; (3) prediction of the changes of protein availability and supply after heat processing, using the advanced DVE/OEB and NRC-2001 models, and (4) obtaining information on optimal processing conditions of protein as intestinal protein source to achieve target values for potential high net absorbable protein in the small intestine. The information described in this article may give better insight in the mechanisms involved and the intrinsic protein molecular structural changes occurring upon processing.

  13. Fragger: a protein fragment picker for structural queries [version 2; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Francois Berenger

    2018-04-01

    Full Text Available Protein modeling and design activities often require querying the Protein Data Bank (PDB with a structural fragment, possibly containing gaps. For some applications, it is preferable to work on a specific subset of the PDB or with unpublished structures. These requirements, along with specific user needs, motivated the creation of a new software to manage and query 3D protein fragments. Fragger is a protein fragment picker that allows protein fragment databases to be created and queried. All fragment lengths are supported and any set of PDB files can be used to create a database. Fragger can efficiently search a fragment database with a query fragment and a distance threshold. Matching fragments are ranked by distance to the query. The query fragment can have structural gaps and the allowed amino acid sequences matching a query can be constrained via a regular expression of one-letter amino acid codes. Fragger also incorporates a tool to compute the backbone RMSD of one versus many fragments in high throughput. Fragger should be useful for protein design, loop grafting and related structural bioinformatics tasks.

  14. Structure Prediction of Outer Membrane Protease Protein of Salmonella typhimurium Using Computational Techniques

    Directory of Open Access Journals (Sweden)

    Rozina Tabassum

    2016-03-01

    Full Text Available Salmonella typhimurium, a facultative gram-negative intracellular pathogen belonging to family Enterobacteriaceae, is the most frequent cause of human gastroenteritis worldwide. PgtE gene product, outer membrane protease emerges important in the intracellular phases of salmonellosis. The pgtE gene product of S. typhimurium was predicted to be capable of proteolyzing T7 RNA polymerase and localize in the outer membrane of these gram negative bacteria. PgtE product of S. enterica and OmpT of E. coli, having high sequence similarity have been revealed to degrade macrophages, causing salmonellosis and other diseases. The three-dimensional structure of the protein was not available through Protein Data Bank (PDB creating lack of structural information about E protein. In our study, by performing Comparative model building, the three dimensional structure of outer membrane protease protein was generated using the backbone of the crystal structure of Pla of Yersinia pestis, retrieved from PDB, with MODELLER (9v8. Quality of the model was assessed by validation tool PROCHECK, web servers like ERRAT and ProSA are used to certify the reliability of the predicted model. This information might offer clues for better understanding of E protein and consequently for developmet of better therapeutic treatment against pathogenic role of this protein in salmonellosis and other diseases.

  15. Human cancer protein-protein interaction network: a structural perspective.

    Directory of Open Access Journals (Sweden)

    Gozde Kar

    2009-12-01

    Full Text Available Protein-protein interaction networks provide a global picture of cellular function and biological processes. Some proteins act as hub proteins, highly connected to others, whereas some others have few interactions. The dysfunction of some interactions causes many diseases, including cancer. Proteins interact through their interfaces. Therefore, studying the interface properties of cancer-related proteins will help explain their role in the interaction networks. Similar or overlapping binding sites should be used repeatedly in single interface hub proteins, making them promiscuous. Alternatively, multi-interface hub proteins make use of several distinct binding sites to bind to different partners. We propose a methodology to integrate protein interfaces into cancer interaction networks (ciSPIN, cancer structural protein interface network. The interactions in the human protein interaction network are replaced by interfaces, coming from either known or predicted complexes. We provide a detailed analysis of cancer related human protein-protein interfaces and the topological properties of the cancer network. The results reveal that cancer-related proteins have smaller, more planar, more charged and less hydrophobic binding sites than non-cancer proteins, which may indicate low affinity and high specificity of the cancer-related interactions. We also classified the genes in ciSPIN according to phenotypes. Within phenotypes, for breast cancer, colorectal cancer and leukemia, interface properties were found to be discriminating from non-cancer interfaces with an accuracy of 71%, 67%, 61%, respectively. In addition, cancer-related proteins tend to interact with their partners through distinct interfaces, corresponding mostly to multi-interface hubs, which comprise 56% of cancer-related proteins, and constituting the nodes with higher essentiality in the network (76%. We illustrate the interface related affinity properties of two cancer-related hub

  16. Protein Structure and the Sequential Structure of mRNA

    DEFF Research Database (Denmark)

    Brunak, Søren; Engelbrecht, Jacob

    1996-01-01

    entries in the Brookhaven Protein Data Bank produced 719 protein chains with matching mRNA sequence, amino acid sequence, and secondary structure assignment, By neural network analysis, we found strong signals in mRNA sequence regions surrounding helices and sheets, These signals do not originate from......A direct comparison of experimentally determined protein structures and their corresponding protein coding mRNA sequences has been performed, We examine whether real world data support the hypothesis that clusters of rare codons correlate with the location of structural units in the resulting...... protein, The degeneracy of the genetic code allows for a biased selection of codons which may control the translational rate of the ribosome, and may thus in vivo have a catalyzing effect on the folding of the polypeptide chain, A complete search for GenBank nucleotide sequences coding for structural...

  17. Structural and dynamic characterization of eukaryotic gene regulatory protein domains in solution

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Andrew Loyd [Univ. of California, Berkeley, CA (United States). Dept. of Chemistry

    1996-05-01

    Solution NMR was primarily used to characterize structure and dynamics in two different eukaryotic protein systems: the δ-Al-ε activation domain from c-jun and the Drosophila RNA-binding protein Sex-lethal. The second system is the Drosophila Sex-lethal (Sxl) protein, an RNA-binding protein which is the ``master switch`` in sex determination. Sxl contains two adjacent RNA-binding domains (RBDs) of the RNP consensus-type. The NMR spectrum of the second RBD (Sxl-RBD2) was assigned using multidimensional heteronuclear NMR, and an intermediate-resolution family of structures was calculated from primarily NOE distance restraints. The overall fold was determined to be similar to other RBDs: a βαβ-βαβ pattern of secondary structure, with the two helices packed against a 4-stranded anti-parallel β-sheet. In addition 15N T1, T2, and 15N/1H NOE relaxation measurements were carried out to characterize the backbone dynamics of Sxl-RBD2 in solution. RNA corresponding to the polypyrimidine tract of transformer pre-mRNA was generated and titrated into 3 different Sxl-RBD protein constructs. Combining Sxl-RBD1+2 (bht RBDs) with this RNA formed a specific, high affinity protein/RNA complex that is amenable to further NMR characterization. The backbone 1H, 13C, and 15N resonances of Sxl-RBD1+2 were assigned using a triple-resonance approach, and 15N relaxation experiments were carried out to characterize the backbone dynamics of this complex. The changes in chemical shift in Sxl-RBD1+2 upon binding RNA are observed using Sxl-RBD2 as a substitute for unbound Sxl-RBD1+2. This allowed the binding interface to be qualitatively mapped for the second domain.

  18. Modeling protein structures: construction and their applications.

    Science.gov (United States)

    Ring, C S; Cohen, F E

    1993-06-01

    Although no general solution to the protein folding problem exists, the three-dimensional structures of proteins are being successfully predicted when experimentally derived constraints are used in conjunction with heuristic methods. In the case of interleukin-4, mutagenesis data and CD spectroscopy were instrumental in the accurate assignment of secondary structure. In addition, the tertiary structure was highly constrained by six cysteines separated by many residues that formed three disulfide bridges. Although the correct structure was a member of a short list of plausible structures, the "best" structure was the topological enantiomer of the experimentally determined conformation. For many proteases, other experimentally derived structures can be used as templates to identify the secondary structure elements. In a procedure called modeling by homology, the structure of a known protein is used as a scaffold to predict the structure of another related protein. This method has been used to model a serine and a cysteine protease that are important in the schistosome and malarial life cycles, respectively. The model structures were then used to identify putative small molecule enzyme inhibitors computationally. Experiments confirm that some of these nonpeptidic compounds are active at concentrations of less than 10 microM.

  19. Three-dimensional structure of the human immunodeficiency virus type 1 matrix protein.

    Science.gov (United States)

    Massiah, M A; Starich, M R; Paschall, C; Summers, M F; Christensen, A M; Sundquist, W I

    1994-11-25

    The HIV-1 matrix protein forms an icosahedral shell associated with the inner membrane of the mature virus. Genetic analyses have indicated that the protein performs important functions throughout the viral life-cycle, including anchoring the transmembrane envelope protein on the surface of the virus, assisting in viral penetration, transporting the proviral integration complex across the nuclear envelope, and localizing the assembling virion to the cell membrane. We now report the three-dimensional structure of recombinant HIV-1 matrix protein, determined at high resolution by nuclear magnetic resonance (NMR) methods. The HIV-1 matrix protein is the first retroviral matrix protein to be characterized structurally and only the fourth HIV-1 protein of known structure. NMR signal assignments required recently developed triple-resonance (1H, 13C, 15N) NMR methodologies because signals for 91% of 132 assigned H alpha protons and 74% of the 129 assignable backbone amide protons resonate within chemical shift ranges of 0.8 p.p.m. and 1 p.p.m., respectively. A total of 636 nuclear Overhauser effect-derived distance restraints were employed for distance geometry-based structure calculations, affording an average of 13.0 NMR-derived distance restraints per residue for the experimentally constrained amino acids. An ensemble of 25 refined distance geometry structures with penalties (sum of the squares of the distance violations) of 0.32 A2 or less and individual distance violations under 0.06 A was generated; best-fit superposition of ordered backbone heavy atoms relative to mean atom positions afforded root-mean-square deviations of 0.50 (+/- 0.08) A. The folded HIV-1 matrix protein structure is composed of five alpha-helices, a short 3(10) helical stretch, and a three-strand mixed beta-sheet. Helices I to III and the 3(10) helix pack about a central helix (IV) to form a compact globular domain that is capped by the beta-sheet. The C-terminal helix (helix V) projects away

  20. Proteins with Novel Structure, Function and Dynamics

    Science.gov (United States)

    Pohorille, Andrew

    2014-01-01

    Recently, a small enzyme that ligates two RNA fragments with the rate of 10(exp 6) above background was evolved in vitro (Seelig and Szostak, Nature 448:828-831, 2007). This enzyme does not resemble any contemporary protein (Chao et al., Nature Chem. Biol. 9:81-83, 2013). It consists of a dynamic, catalytic loop, a small, rigid core containing two zinc ions coordinated by neighboring amino acids, and two highly flexible tails that might be unimportant for protein function. In contrast to other proteins, this enzyme does not contain ordered secondary structure elements, such as alpha-helix or beta-sheet. The loop is kept together by just two interactions of a charged residue and a histidine with a zinc ion, which they coordinate on the opposite side of the loop. Such structure appears to be very fragile. Surprisingly, computer simulations indicate otherwise. As the coordinating, charged residue is mutated to alanine, another, nearby charged residue takes its place, thus keeping the structure nearly intact. If this residue is also substituted by alanine a salt bridge involving two other, charged residues on the opposite sides of the loop keeps the loop in place. These adjustments are facilitated by high flexibility of the protein. Computational predictions have been confirmed experimentally, as both mutants retain full activity and overall structure. These results challenge our notions about what is required for protein activity and about the relationship between protein dynamics, stability and robustness. We hypothesize that small, highly dynamic proteins could be both active and fault tolerant in ways that many other proteins are not, i.e. they can adjust to retain their structure and activity even if subjected to mutations in structurally critical regions. This opens the doors for designing proteins with novel functions, structures and dynamics that have not been yet considered.

  1. Overcoming barriers to membrane protein structure determination

    NARCIS (Netherlands)

    Bill, Roslyn M.; Henderson, Peter J. F.; Iwata, So; Kunji, Edmund R. S.; Michel, Hartmut; Neutze, Richard; Newstead, Simon; Poolman, Bert; Tate, Christopher G.; Vogel, Horst

    After decades of slow progress, the pace of research on membrane protein structures is beginning to quicken thanks to various improvements in technology, including protein engineering and microfocus X-ray diffraction. Here we review these developments and, where possible, highlight generic new

  2. Protein structural similarity search by Ramachandran codes

    Directory of Open Access Journals (Sweden)

    Chang Chih-Hung

    2007-08-01

    Full Text Available Abstract Background Protein structural data has increased exponentially, such that fast and accurate tools are necessary to access structure similarity search. To improve the search speed, several methods have been designed to reduce three-dimensional protein structures to one-dimensional text strings that are then analyzed by traditional sequence alignment methods; however, the accuracy is usually sacrificed and the speed is still unable to match sequence similarity search tools. Here, we aimed to improve the linear encoding methodology and develop efficient search tools that can rapidly retrieve structural homologs from large protein databases. Results We propose a new linear encoding method, SARST (Structural similarity search Aided by Ramachandran Sequential Transformation. SARST transforms protein structures into text strings through a Ramachandran map organized by nearest-neighbor clustering and uses a regenerative approach to produce substitution matrices. Then, classical sequence similarity search methods can be applied to the structural similarity search. Its accuracy is similar to Combinatorial Extension (CE and works over 243,000 times faster, searching 34,000 proteins in 0.34 sec with a 3.2-GHz CPU. SARST provides statistically meaningful expectation values to assess the retrieved information. It has been implemented into a web service and a stand-alone Java program that is able to run on many different platforms. Conclusion As a database search method, SARST can rapidly distinguish high from low similarities and efficiently retrieve homologous structures. It demonstrates that the easily accessible linear encoding methodology has the potential to serve as a foundation for efficient protein structural similarity search tools. These search tools are supposed applicable to automated and high-throughput functional annotations or predictions for the ever increasing number of published protein structures in this post-genomic era.

  3. A 'periodic table' for protein structures.

    Science.gov (United States)

    Taylor, William R

    2002-04-11

    Current structural genomics programs aim systematically to determine the structures of all proteins coded in both human and other genomes, providing a complete picture of the number and variety of protein structures that exist. In the past, estimates have been made on the basis of the incomplete sample of structures currently known. These estimates have varied greatly (between 1,000 and 10,000; see for example refs 1 and 2), partly because of limited sample size but also owing to the difficulties of distinguishing one structure from another. This distinction is usually topological, based on the fold of the protein; however, in strict topological terms (neglecting to consider intra-chain cross-links), protein chains are open strings and hence are all identical. To avoid this trivial result, topologies are determined by considering secondary links in the form of intra-chain hydrogen bonds (secondary structure) and tertiary links formed by the packing of secondary structures. However, small additions to or loss of structure can make large changes to these perceived topologies and such subjective solutions are neither robust nor amenable to automation. Here I formalize both secondary and tertiary links to allow the rigorous and automatic definition of protein topology.

  4. Marburg virus VP35 can both fully coat the backbone and cap the ends of dsRNA for interferon antagonism.

    Directory of Open Access Journals (Sweden)

    Shridhar Bale

    2012-09-01

    Full Text Available Filoviruses, including Marburg virus (MARV and Ebola virus (EBOV, cause fatal hemorrhagic fever in humans and non-human primates. All filoviruses encode a unique multi-functional protein termed VP35. The C-terminal double-stranded (dsRNA-binding domain (RBD of VP35 has been implicated in interferon antagonism and immune evasion. Crystal structures of the VP35 RBD from two ebolaviruses have previously demonstrated that the viral protein caps the ends of dsRNA. However, it is not yet understood how the expanses of dsRNA backbone, between the ends, are masked from immune surveillance during filovirus infection. Here, we report the crystal structure of MARV VP35 RBD bound to dsRNA. In the crystal structure, molecules of dsRNA stack end-to-end to form a pseudo-continuous oligonucleotide. This oligonucleotide is continuously and completely coated along its sugar-phosphate backbone by the MARV VP35 RBD. Analysis of dsRNA binding by dot-blot and isothermal titration calorimetry reveals that multiple copies of MARV VP35 RBD can indeed bind the dsRNA sugar-phosphate backbone in a cooperative manner in solution. Further, MARV VP35 RBD can also cap the ends of the dsRNA in solution, although this arrangement was not captured in crystals. Together, these studies suggest that MARV VP35 can both coat the backbone and cap the ends, and that for MARV, coating of the dsRNA backbone may be an essential mechanism by which dsRNA is masked from backbone-sensing immune surveillance molecules.

  5. Structural analysis of recombinant human protein QM

    International Nuclear Information System (INIS)

    Gualberto, D.C.H.; Fernandes, J.L.; Silva, F.S.; Saraiva, K.W.; Affonso, R.; Pereira, L.M.; Silva, I.D.C.G.

    2012-01-01

    Full text: The ribosomal protein QM belongs to a family of ribosomal proteins, which is highly conserved from yeast to humans. The presence of the QM protein is necessary for joining the 60S and 40S subunits in a late step of the initiation of mRNA translation. Although the exact extra-ribosomal functions of QM are not yet fully understood, it has been identified as a putative tumor suppressor. This protein was reported to interact with the transcription factor c-Jun and thereby prevent c-Jun actives genes of the cellular growth. In this study, the human QM protein was expressed in bacterial system, in the soluble form and this structure was analyzed by Circular Dichroism and Fluorescence. The results of Circular Dichroism showed that this protein has less alpha helix than beta sheet, as described in the literature. QM protein does not contain a leucine zipper region; however the ion zinc is necessary for binding of QM to c-Jun. Then we analyzed the relationship between the removal of zinc ions and folding of protein. Preliminary results obtained by the technique Fluorescence showed a gradual increase in fluorescence with the addition of increasing concentration of EDTA. This suggests that the zinc is important in the tertiary structure of the protein. More studies are being made for better understand these results. (author)

  6. Protein Structure Determination Using Chemical Shifts

    DEFF Research Database (Denmark)

    Christensen, Anders Steen

    is determined using only chemical shifts recorded and assigned through automated processes. The CARMSD to the experimental X-ray for this structure is 1.1. Å. Additionally, the method is combined with very sparse NOE-restraints and evolutionary distance restraints and tested on several protein structures >100...

  7. On characterization of anisotropic plant protein structures

    NARCIS (Netherlands)

    Krintiras, G.A.; Göbel, J.; Bouwman, W.G.; Goot, van der A.J.; Stefanidis, G.D.

    2014-01-01

    In this paper, a set of complementary techniques was used to characterize surface and bulk structures of an anisotropic Soy Protein Isolate (SPI)–vital wheat gluten blend after it was subjected to heat and simple shear flow in a Couette Cell. The structured biopolymer blend can form a basis for a

  8. Hidden Structural Codes in Protein Intrinsic Disorder.

    Science.gov (United States)

    Borkosky, Silvia S; Camporeale, Gabriela; Chemes, Lucía B; Risso, Marikena; Noval, María Gabriela; Sánchez, Ignacio E; Alonso, Leonardo G; de Prat Gay, Gonzalo

    2017-10-17

    Intrinsic disorder is a major structural category in biology, accounting for more than 30% of coding regions across the domains of life, yet consists of conformational ensembles in equilibrium, a major challenge in protein chemistry. Anciently evolved papillomavirus genomes constitute an unparalleled case for sequence to structure-function correlation in cases in which there are no folded structures. E7, the major transforming oncoprotein of human papillomaviruses, is a paradigmatic example among the intrinsically disordered proteins. Analysis of a large number of sequences of the same viral protein allowed for the identification of a handful of residues with absolute conservation, scattered along the sequence of its N-terminal intrinsically disordered domain, which intriguingly are mostly leucine residues. Mutation of these led to a pronounced increase in both α-helix and β-sheet structural content, reflected by drastic effects on equilibrium propensities and oligomerization kinetics, and uncovers the existence of local structural elements that oppose canonical folding. These folding relays suggest the existence of yet undefined hidden structural codes behind intrinsic disorder in this model protein. Thus, evolution pinpoints conformational hot spots that could have not been identified by direct experimental methods for analyzing or perturbing the equilibrium of an intrinsically disordered protein ensemble.

  9. Data Acquisition Backbone Core DABC

    International Nuclear Information System (INIS)

    Adamczewski, J; Essel, H G; Kurz, N; Linev, S

    2008-01-01

    For the new experiments at FAIR new concepts of data acquisition systems have to be developed like the distribution of self-triggered, time stamped data streams over high performance networks for event building. The Data Acquisition Backbone Core (DABC) is a software package currently under development for FAIR detector tests, readout components test, and data flow investigations. All kinds of data channels (front-end systems) are connected by program plug-ins into functional components of DABC like data input, combiner, scheduler, event builder, analysis and storage components. After detailed simulations real tests of event building over a switched network (InfiniBand clusters with up to 110 nodes) have been performed. With the DABC software more than 900 MByte/s input and output per node can be achieved meeting the most demanding requirements. The software is ready for the implementation of various test beds needed for the final design of data acquisition systems at FAIR. The development of key components is supported by the FutureDAQ project of the European Union (FP6 I3HP JRA1)

  10. Protein Structure Recognition: From Eigenvector Analysis to Structural Threading Method

    Energy Technology Data Exchange (ETDEWEB)

    Cao, Haibo [Iowa State Univ., Ames, IA (United States)

    2003-01-01

    In this work, they try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. They found a strong correlation between amino acid sequences and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, they give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part includes discussions of interactions among amino acids residues, lattice HP model, and the design ability principle. In the second part, they try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in the eigenvector study of protein contact matrix. They believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, they discuss a threading method based on the correlation between amino acid sequences and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, they list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches.

  11. Protein structure recognition: From eigenvector analysis to structural threading method

    Science.gov (United States)

    Cao, Haibo

    In this work, we try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. We found a strong correlation between amino acid sequence and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, we give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part include discussions of interactions among amino acids residues, lattice HP model, and the designablity principle. In the second part, we try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in our eigenvector study of protein contact matrix. We believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, we discuss a threading method based on the correlation between amino acid sequence and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, we list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches.

  12. Protein Structure Recognition: From Eigenvector Analysis to Structural Threading Method

    International Nuclear Information System (INIS)

    Haibo Cao

    2003-01-01

    In this work, they try to understand the protein folding problem using pair-wise hydrophobic interaction as the dominant interaction for the protein folding process. They found a strong correlation between amino acid sequences and the corresponding native structure of the protein. Some applications of this correlation were discussed in this dissertation include the domain partition and a new structural threading method as well as the performance of this method in the CASP5 competition. In the first part, they give a brief introduction to the protein folding problem. Some essential knowledge and progress from other research groups was discussed. This part includes discussions of interactions among amino acids residues, lattice HP model, and the design ability principle. In the second part, they try to establish the correlation between amino acid sequence and the corresponding native structure of the protein. This correlation was observed in the eigenvector study of protein contact matrix. They believe the correlation is universal, thus it can be used in automatic partition of protein structures into folding domains. In the third part, they discuss a threading method based on the correlation between amino acid sequences and ominant eigenvector of the structure contact-matrix. A mathematically straightforward iteration scheme provides a self-consistent optimum global sequence-structure alignment. The computational efficiency of this method makes it possible to search whole protein structure databases for structural homology without relying on sequence similarity. The sensitivity and specificity of this method is discussed, along with a case of blind test prediction. In the appendix, they list the overall performance of this threading method in CASP5 blind test in comparison with other existing approaches

  13. Structure and non-structure of centrosomal proteins.

    Science.gov (United States)

    Dos Santos, Helena G; Abia, David; Janowski, Robert; Mortuza, Gulnahar; Bertero, Michela G; Boutin, Maïlys; Guarín, Nayibe; Méndez-Giraldez, Raúl; Nuñez, Alfonso; Pedrero, Juan G; Redondo, Pilar; Sanz, María; Speroni, Silvia; Teichert, Florian; Bruix, Marta; Carazo, José M; Gonzalez, Cayetano; Reina, José; Valpuesta, José M; Vernos, Isabelle; Zabala, Juan C; Montoya, Guillermo; Coll, Miquel; Bastolla, Ugo; Serrano, Luis

    2013-01-01

    Here we perform a large-scale study of the structural properties and the expression of proteins that constitute the human Centrosome. Centrosomal proteins tend to be larger than generic human proteins (control set), since their genes contain in average more exons (20.3 versus 14.6). They are rich in predicted disordered regions, which cover 57% of their length, compared to 39% in the general human proteome. They also contain several regions that are dually predicted to be disordered and coiled-coil at the same time: 55 proteins (15%) contain disordered and coiled-coil fragments that cover more than 20% of their length. Helices prevail over strands in regions homologous to known structures (47% predicted helical residues against 17% predicted as strands), and even more in the whole centrosomal proteome (52% against 7%), while for control human proteins 34.5% of the residues are predicted as helical and 12.8% are predicted as strands. This difference is mainly due to residues predicted as disordered and helical (30% in centrosomal and 9.4% in control proteins), which may correspond to alpha-helix forming molecular recognition features (α-MoRFs). We performed expression assays for 120 full-length centrosomal proteins and 72 domain constructs that we have predicted to be globular. These full-length proteins are often insoluble: Only 39 out of 120 expressed proteins (32%) and 19 out of 72 domains (26%) were soluble. We built or retrieved structural models for 277 out of 361 human proteins whose centrosomal localization has been experimentally verified. We could not find any suitable structural template with more than 20% sequence identity for 84 centrosomal proteins (23%), for which around 74% of the residues are predicted to be disordered or coiled-coils. The three-dimensional models that we built are available at http://ub.cbm.uam.es/centrosome/models/index.php.

  14. PONDEROSA, an automated 3D-NOESY peak picking program, enables automated protein structure determination.

    Science.gov (United States)

    Lee, Woonghee; Kim, Jin Hae; Westler, William M; Markley, John L

    2011-06-15

    PONDEROSA (Peak-picking Of Noe Data Enabled by Restriction of Shift Assignments) accepts input information consisting of a protein sequence, backbone and sidechain NMR resonance assignments, and 3D-NOESY ((13)C-edited and/or (15)N-edited) spectra, and returns assignments of NOESY crosspeaks, distance and angle constraints, and a reliable NMR structure represented by a family of conformers. PONDEROSA incorporates and integrates external software packages (TALOS+, STRIDE and CYANA) to carry out different steps in the structure determination. PONDEROSA implements internal functions that identify and validate NOESY peak assignments and assess the quality of the calculated three-dimensional structure of the protein. The robustness of the analysis results from PONDEROSA's hierarchical processing steps that involve iterative interaction among the internal and external modules. PONDEROSA supports a variety of input formats: SPARKY assignment table (.shifts) and spectrum file formats (.ucsf), XEASY proton file format (.prot), and NMR-STAR format (.star). To demonstrate the utility of PONDEROSA, we used the package to determine 3D structures of two proteins: human ubiquitin and Escherichia coli iron-sulfur scaffold protein variant IscU(D39A). The automatically generated structural constraints and ensembles of conformers were as good as or better than those determined previously by much less automated means. The program, in the form of binary code along with tutorials and reference manuals, is available at http://ponderosa.nmrfam.wisc.edu/.

  15. Crystal structure and conformational flexibility of the unligated FK506-binding protein FKBP12.6

    Energy Technology Data Exchange (ETDEWEB)

    Chen, Hui; Mustafi, Sourajit M. [New York State Department of Health, Empire State Plaza, Albany, NY 12201 (United States); LeMaster, David M. [New York State Department of Health, Empire State Plaza, Albany, NY 12201 (United States); University at Albany – SUNY, Empire State Plaza, Albany, NY 12201 (United States); Li, Zhong [New York State Department of Health, Empire State Plaza, Albany, NY 12201 (United States); Héroux, Annie [Brookhaven National Laboratory, Upton, NY 11973 (United States); Li, Hongmin; Hernández, Griselda, E-mail: griselda@wadsworth.org [New York State Department of Health, Empire State Plaza, Albany, NY 12201 (United States); University at Albany – SUNY, Empire State Plaza, Albany, NY 12201 (United States)

    2014-03-01

    Two crystal forms of unligated FKBP12.6 exhibit multiple conformations in the active site and in the 80s loop, the primary site for known protein-recognition interactions. The previously unreported NMR backbone assignment of FKBP12.6 revealed extensive doubling of amide resonances, which reflects a slow conformational transition centered in the 80s loop. The primary known physiological function of FKBP12.6 involves its role in regulating the RyR2 isoform of ryanodine receptor Ca{sup 2+} channels in cardiac muscle, pancreatic β islets and the central nervous system. With only a single previously reported X-ray structure of FKBP12.6, bound to the immunosuppressant rapamycin, structural inferences for this protein have been drawn from the more extensive studies of the homologous FKBP12. X-ray structures at 1.70 and 1.90 Å resolution from P2{sub 1} and P3{sub 1}21 crystal forms are reported for an unligated cysteine-free variant of FKBP12.6 which exhibit a notable diversity of conformations. In one monomer from the P3{sub 1}21 crystal form, the aromatic ring of Phe59 at the base of the active site is rotated perpendicular to its typical orientation, generating a steric conflict for the immunosuppressant-binding mode. The peptide unit linking Gly89 and Val90 at the tip of the protein-recognition ‘80s loop’ is flipped in the P2{sub 1} crystal form. Unlike the >30 reported FKBP12 structures, the backbone conformation of this loop closely follows that of the first FKBP domain of FKBP51. The NMR resonances for 21 backbone amides of FKBP12.6 are doubled, corresponding to a slow conformational transition centered near the tip of the 80s loop, as recently reported for 31 amides of FKBP12. The comparative absence of doubling for residues along the opposite face of the active-site pocket in FKBP12.6 may in part reflect attenuated structural coupling owing to increased conformational plasticity around the Phe59 ring.

  16. High-resolution crystal structures of protein helices reconciled with three-centered hydrogen bonds and multipole electrostatics.

    Science.gov (United States)

    Kuster, Daniel J; Liu, Chengyu; Fang, Zheng; Ponder, Jay W; Marshall, Garland R

    2015-01-01

    Theoretical and experimental evidence for non-linear hydrogen bonds in protein helices is ubiquitous. In particular, amide three-centered hydrogen bonds are common features of helices in high-resolution crystal structures of proteins. These high-resolution structures (1.0 to 1.5 Å nominal crystallographic resolution) position backbone atoms without significant bias from modeling constraints and identify Φ = -62°, ψ = -43 as the consensus backbone torsional angles of protein helices. These torsional angles preserve the atomic positions of α-β carbons of the classic Pauling α-helix while allowing the amide carbonyls to form bifurcated hydrogen bonds as first suggested by Némethy et al. in 1967. Molecular dynamics simulations of a capped 12-residue oligoalanine in water with AMOEBA (Atomic Multipole Optimized Energetics for Biomolecular Applications), a second-generation force field that includes multipole electrostatics and polarizability, reproduces the experimentally observed high-resolution helical conformation and correctly reorients the amide-bond carbonyls into bifurcated hydrogen bonds. This simple modification of backbone torsional angles reconciles experimental and theoretical views to provide a unified view of amide three-centered hydrogen bonds as crucial components of protein helices. The reason why they have been overlooked by structural biologists depends on the small crankshaft-like changes in orientation of the amide bond that allows maintenance of the overall helical parameters (helix pitch (p) and residues per turn (n)). The Pauling 3.6(13) α-helix fits the high-resolution experimental data with the minor exception of the amide-carbonyl electron density, but the previously associated backbone torsional angles (Φ, Ψ) needed slight modification to be reconciled with three-atom centered H-bonds and multipole electrostatics. Thus, a new standard helix, the 3.6(13/10)-, Némethy- or N-helix, is proposed. Due to the use of constraints from

  17. Mechanical reliability of porous low-k dielectrics for advanced interconnect: Study of the instability mechanisms in porous low-k dielectrics and their mediation through inert plasma induced re-polymerization of the backbone structure

    Science.gov (United States)

    Sa, Yoonki

    catalysts for reactions that break the cross-linked backbone. Clearly, both changes in PLK chemistry and bond structure must be addressed in order for any repair method to be favorable. For this reason, Ar plasma treatment with low energy ions is employed to repair the plasma induced damage by creating the desired changes in the film matrix without a significant loss of other properties. Our approach of using inert plasma as a way for damage recovery is motivated by the realization that there is no possibility of chemical reaction with any organic species, driving the energy transfer only from the plasma species towards the respective film matrix. As results, after applying Ar plasma beam treatment followed by annealing on damaged PLK films, the resistance against thermal instability and viscoplastic deformation is found to be improved. Ball indentation depth of the films with Ar plasma process is drastically reduced at the identical condition. More noticeable is the fact that such alternation is converted towards a dehydration reaction under hydrostatic thermal pressure, which causes dielectric constant to decrease and films shrinkage to restore during reconstruction of polymer chains. It is suggested that the immediate event of an Ar plasma beam radiation is to deposit energy from the plasma species (ions, electrons) and this energy input produces the excited state species because Ar cannot chemically react with the film matrix. As a consequence, the radical sites are generated at the less stable area such as colony boundary or pore surface with the decay of the excited species, leading to the production of free radicals by an energy transfer to the bonds which are to be broken. Then, the activated sites experience chemical bond rearrangement by chain-scission, branching, or cross-linking. In our case, crosslink with C is involved with silylmethylene (Si-(CH 2)x-Si) groups and it is turned out that some of these groups are converted to methyl groups terminally bonded to

  18. Beta-structures in fibrous proteins.

    Science.gov (United States)

    Kajava, Andrey V; Squire, John M; Parry, David A D

    2006-01-01

    The beta-form of protein folding, one of the earliest protein structures to be defined, was originally observed in studies of silks. It was then seen in early studies of synthetic polypeptides and, of course, is now known to be present in a variety of guises as an essential component of globular protein structures. However, in the last decade or so it has become clear that the beta-conformation of chains is present not only in many of the amyloid structures associated with, for example, Alzheimer's Disease, but also in the prion structures associated with the spongiform encephalopathies. Furthermore, X-ray crystallography studies have revealed the high incidence of the beta-fibrous proteins among virulence factors of pathogenic bacteria and viruses. Here we describe the basic forms of the beta-fold, summarize the many different new forms of beta-structural fibrous arrangements that have been discovered, and review advances in structural studies of amyloid and prion fibrils. These and other issues are described in detail in later chapters.

  19. Fibrous Protein Structures: Hierarchy, History and Heroes.

    Science.gov (United States)

    Squire, John M; Parry, David A D

    2017-01-01

    During the 1930s and 1940s the technique of X-ray diffraction was applied widely by William Astbury and his colleagues to a number of naturally-occurring fibrous materials. On the basis of the diffraction patterns obtained, he observed that the structure of each of the fibres was dominated by one of a small number of different types of molecular conformation. One group of fibres, known as the k-m-e-f group of proteins (keratin - myosin - epidermin - fibrinogen), gave rise to diffraction characteristics that became known as the α-pattern. Others, such as those from a number of silks, gave rise to a different pattern - the β-pattern, while connective tissues yielded a third unique set of diffraction characteristics. At the time of Astbury's work, the structures of these materials were unknown, though the spacings of the main X-ray reflections gave an idea of the axial repeats and the lateral packing distances. In a breakthrough in the early 1950s, the basic structures of all of these fibrous proteins were determined. It was found that the long protein chains, composed of strings of amino acids, could be folded up in a systematic manner to generate a limited number of structures that were consistent with the X-ray data. The most important of these were known as the α-helix, the β-sheet, and the collagen triple helix. These studies provided information about the basic building blocks of all proteins, both fibrous and globular. They did not, however, provide detailed information about how these molecules packed together in three-dimensions to generate the fibres found in vivo. A number of possible packing arrangements were subsequently deduced from the X-ray diffraction and other data, but it is only in the last few years, through the continued improvements of electron microscopy, that the packing details within some fibrous proteins can now be seen directly. Here we outline briefly some of the milestones in fibrous protein structure determination, the role of the

  20. pi-Turns: types, systematics and the context of their occurrence in protein structures.

    Science.gov (United States)

    Dasgupta, Bhaskar; Chakrabarti, Pinak

    2008-09-22

    For a proper understanding of protein structure and folding it is important to know if a polypeptide segment adopts a conformation inherent in the sequence or it depends on the context of its flanking secondary structures. Turns of various lengths have been studied and characterized starting from three-residue gamma-turn to six-residue pi-turn. The Schellman motif occurring at the C-terminal end of alpha-helices is a classical example of hydrogen bonded pi-turn involving residues at (i) and (i+5) positions. Hydrogen bonded and non-hydrogen bonded beta- and alpha-turns have been identified previously; likewise, a systematic characterization of pi-turns would provide valuable insight into turn structures. An analysis of protein structures indicates that at least 20% of pi-turns occur independent of the Schellman motif. The two categories of pi-turns, designated as pi-HB and SCH, have been further classified on the basis of backbone conformation and both have AAAa as the major class. They differ in the residue usage at position (i+1), the former having a large preference for Pro that is absent in the latter. As in the case of shorter length beta- and alpha-turns, pi-turns have also been identified not only on the basis of the existence of hydrogen bond, but also using the distance between terminal C alpha-atoms, and this resulted in a comparable number of non-hydrogen-bonded pi-turns (pi-NHB). The presence of shorter beta- and alpha-turns within all categories of pi-turns, the subtle variations in backbone torsion angles along the turn residues, the location of the turns in the context of tertiary structures have been studied. pi-turns have been characterized, first using hydrogen bond and the distance between C alpha atoms of the terminal residues, and then using backbone torsion angles. While the Schellman motif has a structural role in helix termination, many of the pi-HB turns, being located on surface cavities, have functional role and there is also sequence

  1. Optical burst switching based satellite backbone network

    Science.gov (United States)

    Li, Tingting; Guo, Hongxiang; Wang, Cen; Wu, Jian

    2018-02-01

    We propose a novel time slot based optical burst switching (OBS) architecture for GEO/LEO based satellite backbone network. This architecture can provide high speed data transmission rate and high switching capacity . Furthermore, we design the control plane of this optical satellite backbone network. The software defined network (SDN) and network slice (NS) technologies are introduced. Under the properly designed control mechanism, this backbone network is flexible to support various services with diverse transmission requirements. Additionally, the LEO access and handoff management in this network is also discussed.

  2. Structure of the complex between teicoplanin and a bacterial cell-wall peptide: use of a carrier-protein approach

    International Nuclear Information System (INIS)

    Economou, Nicoleta J.; Zentner, Isaac J.; Lazo, Edwin; Jakoncic, Jean; Stojanoff, Vivian; Weeks, Stephen D.; Grasty, Kimberly C.; Cocklin, Simon; Loll, Patrick J.

    2013-01-01

    Using a carrier-protein strategy, the structure of teicoplanin bound to its bacterial cell-wall target has been determined. The structure reveals the molecular determinants of target recognition, flexibility in the antibiotic backbone and intrinsic radiation sensitivity of teicoplanin. Multidrug-resistant bacterial infections are commonly treated with glycopeptide antibiotics such as teicoplanin. This drug inhibits bacterial cell-wall biosynthesis by binding and sequestering a cell-wall precursor: a d-alanine-containing peptide. A carrier-protein strategy was used to crystallize the complex of teicoplanin and its target peptide by fusing the cell-wall peptide to either MBP or ubiquitin via native chemical ligation and subsequently crystallizing the protein–peptide–antibiotic complex. The 2.05 Å resolution MBP–peptide–teicoplanin structure shows that teicoplanin recognizes its ligand through a combination of five hydrogen bonds and multiple van der Waals interactions. Comparison of this teicoplanin structure with that of unliganded teicoplanin reveals a flexibility in the antibiotic peptide backbone that has significant implications for ligand recognition. Diffraction experiments revealed an X-ray-induced dechlorination of the sixth amino acid of the antibiotic; it is shown that teicoplanin is significantly more radiation-sensitive than other similar antibiotics and that ligand binding increases radiosensitivity. Insights derived from this new teicoplanin structure may contribute to the development of next-generation antibacterials designed to overcome bacterial resistance

  3. A Kernel for Protein Secondary Structure Prediction

    OpenAIRE

    Guermeur , Yann; Lifchitz , Alain; Vert , Régis

    2004-01-01

    http://mitpress.mit.edu/catalog/item/default.asp?ttype=2&tid=10338&mode=toc; International audience; Multi-class support vector machines have already proved efficient in protein secondary structure prediction as ensemble methods, to combine the outputs of sets of classifiers based on different principles. In this chapter, their implementation as basic prediction methods, processing the primary structure or the profile of multiple alignments, is investigated. A kernel devoted to the task is in...

  4. 3D bioprinting of structural proteins.

    Science.gov (United States)

    Włodarczyk-Biegun, Małgorzata K; Del Campo, Aránzazu

    2017-07-01

    3D bioprinting is a booming method to obtain scaffolds of different materials with predesigned and customized morphologies and geometries. In this review we focus on the experimental strategies and recent achievements in the bioprinting of major structural proteins (collagen, silk, fibrin), as a particularly interesting technology to reconstruct the biochemical and biophysical composition and hierarchical morphology of natural scaffolds. The flexibility in molecular design offered by structural proteins, combined with the flexibility in mixing, deposition, and mechanical processing inherent to bioprinting technologies, enables the fabrication of highly functional scaffolds and tissue mimics with a degree of complexity and organization which has only just started to be explored. Here we describe the printing parameters and physical (mechanical) properties of bioinks based on structural proteins, including the biological function of the printed scaffolds. We describe applied printing techniques and cross-linking methods, highlighting the modifications implemented to improve scaffold properties. The used cell types, cell viability, and possible construct applications are also reported. We envision that the application of printing technologies to structural proteins will enable unprecedented control over their supramolecular organization, conferring printed scaffolds biological properties and functions close to natural systems. Copyright © 2017 Elsevier Ltd. All rights reserved.

  5. Functions and structures of eukaryotic recombination proteins

    International Nuclear Information System (INIS)

    Ogawa, Tomoko

    1994-01-01

    We have found that Rad51 and RecA Proteins form strikingly similar structures together with dsDNA and ATP. Their right handed helical nucleoprotein filaments extend the B-form DNA double helixes to 1.5 times in length and wind the helix. The similarity and uniqueness of their structures must reflect functional homologies between these proteins. Therefore, it is highly probable that similar recombination proteins are present in various organisms of different evolutional states. We have succeeded to clone RAD51 genes from human, mouse, chicken and fission yeast genes, and found that the homologues are widely distributed in eukaryotes. The HsRad51 and MmRad51 or ChRad51 proteins consist of 339 amino acids differing only by 4 or 12 amino acids, respectively, and highly homologous to both yeast proteins, but less so to Dmcl. All of these proteins are homologous to the region from residues 33 to 240 of RecA which was named ''homologous core. The homologous core is likely to be responsible for functions common for all of them, such as the formation of helical nucleoprotein filament that is considered to be involved in homologous pairing in the recombination reaction. The mouse gene is transcribed at a high level in thymus, spleen, testis, and ovary, at lower level in brain and at a further lower level in some other tissues. It is transcribed efficiently in recombination active tissues. A clear functional difference of Rad51 homologues from RecA was suggested by the failure of heterologous genes to complement the deficiency of Scrad51 mutants. This failure seems to reflect the absence of a compatible partner, such as ScRad52 protein in the case of ScRad51 protein, between different species. Thus, these discoveries play a role of the starting point to understand the fundamental gene targeting in mammalian cells and in gene therapy. (J.P.N.)

  6. Protein structure refinement using a quantum mechanics-based chemical shielding predictor.

    Science.gov (United States)

    Bratholm, Lars A; Jensen, Jan H

    2017-03-01

    The accurate prediction of protein chemical shifts using a quantum mechanics (QM)-based method has been the subject of intense research for more than 20 years but so far empirical methods for chemical shift prediction have proven more accurate. In this paper we show that a QM-based predictor of a protein backbone and CB chemical shifts (ProCS15, PeerJ , 2016, 3, e1344) is of comparable accuracy to empirical chemical shift predictors after chemical shift-based structural refinement that removes small structural errors. We present a method by which quantum chemistry based predictions of isotropic chemical shielding values (ProCS15) can be used to refine protein structures using Markov Chain Monte Carlo (MCMC) simulations, relating the chemical shielding values to the experimental chemical shifts probabilistically. Two kinds of MCMC structural refinement simulations were performed using force field geometry optimized X-ray structures as starting points: simulated annealing of the starting structure and constant temperature MCMC simulation followed by simulated annealing of a representative ensemble structure. Annealing of the CHARMM structure changes the CA-RMSD by an average of 0.4 Å but lowers the chemical shift RMSD by 1.0 and 0.7 ppm for CA and N. Conformational averaging has a relatively small effect (0.1-0.2 ppm) on the overall agreement with carbon chemical shifts but lowers the error for nitrogen chemical shifts by 0.4 ppm. If an amino acid specific offset is included the ProCS15 predicted chemical shifts have RMSD values relative to experiments that are comparable to popular empirical chemical shift predictors. The annealed representative ensemble structures differ in CA-RMSD relative to the initial structures by an average of 2.0 Å, with >2.0 Å difference for six proteins. In four of the cases, the largest structural differences arise in structurally flexible regions of the protein as determined by NMR, and in the remaining two cases, the large structural

  7. Induced helical backbone conformations of self-organizable dendronized polymers.

    Science.gov (United States)

    Rudick, Jonathan G; Percec, Virgil

    2008-12-01

    Control of function through the primary structure of a molecule presents a significant challenge with valuable rewards for nanoscience. Dendritic building blocks encoded with information that defines their three-dimensional shape (e.g., flat-tapered or conical) and how they associate with each other are referred to as self-assembling dendrons. Self-organizable dendronized polymers possess a flat-tapered or conical self-assembling dendritic side chain on each repeat unit of a linear polymer backbone. When appended to a covalent polymer, the self-assembling dendrons direct a folding process (i.e., intramolecular self-assembly). Alternatively, intermolecular self-assembly of dendrons mediated by noncovalent interactions between apex groups can generate a supramolecular polymer backbone. Self-organization, as we refer to it, is the spontaneous formation of periodic and quasiperiodic arrays from supramolecular elements. Covalent and supramolecular polymers jacketed with self-assembling dendrons self-organize. The arrays are most often comprised of cylindrical or spherical objects. The shape of the object is determined by the primary structure of the dendronized polymer: the structure of the self-assembling dendron and the length of the polymer backbone. It is therefore possible to predictably generate building blocks for single-molecule nanotechnologies or arrays of supramolecules for bottom-up self-assembly. We exploit the self-organization of polymers jacketed with self-assembling dendrons to elucidate how primary structure determines the adopted conformation and fold (i.e., secondary and tertiary structure), how the supramolecules associate (i.e., quaternary structure), and their resulting functions. A combination of experimental techniques is employed to interrogate the primary, secondary, tertiary, and quaternary structure of the self-organizable dendronized polymers. We refer to the process by which we interpolate between the various levels of structural

  8. Protein structure based prediction of catalytic residues.

    Science.gov (United States)

    Fajardo, J Eduardo; Fiser, Andras

    2013-02-22

    Worldwide structural genomics projects continue to release new protein structures at an unprecedented pace, so far nearly 6000, but only about 60% of these proteins have any sort of functional annotation. We explored a range of features that can be used for the prediction of functional residues given a known three-dimensional structure. These features include various centrality measures of nodes in graphs of interacting residues: closeness, betweenness and page-rank centrality. We also analyzed the distance of functional amino acids to the general center of mass (GCM) of the structure, relative solvent accessibility (RSA), and the use of relative entropy as a measure of sequence conservation. From the selected features, neural networks were trained to identify catalytic residues. We found that using distance to the GCM together with amino acid type provide a good discriminant function, when combined independently with sequence conservation. Using an independent test set of 29 annotated protein structures, the method returned 411 of the initial 9262 residues as the most likely to be involved in function. The output 411 residues contain 70 of the annotated 111 catalytic residues. This represents an approximately 14-fold enrichment of catalytic residues on the entire input set (corresponding to a sensitivity of 63% and a precision of 17%), a performance competitive with that of other state-of-the-art methods. We found that several of the graph based measures utilize the same underlying feature of protein structures, which can be simply and more effectively captured with the distance to GCM definition. This also has the added the advantage of simplicity and easy implementation. Meanwhile sequence conservation remains by far the most influential feature in identifying functional residues. We also found that due the rapid changes in size and composition of sequence databases, conservation calculations must be recalibrated for specific reference databases.

  9. NMR backbone resonance assignments of the prodomain variants of BDNF in the urea denatured state.

    Science.gov (United States)

    Wang, Jing; Bains, Henrietta; Anastasia, Agustin; Bracken, Clay

    2018-04-01

    Brain derived neurotrophic factor (BDNF) is a member of the neurotrophin family of proteins which plays a central role in neuronal survival, growth, plasticity and memory. A single Val66Met variant has been identified in the prodomain of human BDNF that is associated with anxiety, depression and memory disorders. The structural differences within the full-length prodomain Val66 and Met66 isoforms could shed light on the mechanism of action of the Met66 and its impact on the development of neuropsychiatric-associated disorders. In the present study, we report the backbone 1 H, 13 C, and 15 N NMR assignments of both full-length Val66 and Met66 prodomains in the presence of 2 M urea. These conditions were utilized to suppress residual structure and aid subsequent native state structural investigations aimed at mapping and identifying variant-dependent conformational differences under native-state conditions.

  10. A protein-dependent side-chain rotamer library.

    KAUST Repository

    Bhuyan, M.S.

    2011-12-14

    Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries. In this paper, we propose a new type of backbone-dependent rotamer library, which encodes structural information of all the spatially neighboring residues. We call it protein-dependent rotamer libraries. Given any rotamer library and a protein backbone structure, we first model the protein structure as a Markov random field. Then the marginal distributions are estimated by the inference algorithms, without doing global optimization or search. The rotamers from the given library are then re-ranked and associated with the updated probabilities. Experimental results demonstrate that the proposed protein-dependent libraries significantly outperform the widely used backbone-dependent libraries in terms of the side-chain prediction accuracy and the rotamer ranking ability. Furthermore, without global optimization/search, the side-chain prediction power of the protein-dependent library is still comparable to the global-search-based side-chain prediction methods.

  11. A protein-dependent side-chain rotamer library.

    KAUST Repository

    Bhuyan, M.S.; Gao, Xin

    2011-01-01

    Protein side-chain packing problem has remained one of the key open problems in bioinformatics. The three main components of protein side-chain prediction methods are a rotamer library, an energy function and a search algorithm. Rotamer libraries summarize the existing knowledge of the experimentally determined structures quantitatively. Depending on how much contextual information is encoded, there are backbone-independent rotamer libraries and backbone-dependent rotamer libraries. Backbone-independent libraries only encode sequential information, whereas backbone-dependent libraries encode both sequential and locally structural information. However, side-chain conformations are determined by spatially local information, rather than sequentially local information. Since in the side-chain prediction problem, the backbone structure is given, spatially local information should ideally be encoded into the rotamer libraries. In this paper, we propose a new type of backbone-dependent rotamer library, which encodes structural information of all the spatially neighboring residues. We call it protein-dependent rotamer libraries. Given any rotamer library and a protein backbone structure, we first model the protein structure as a Markov random field. Then the marginal distributions are estimated by the inference algorithms, without doing global optimization or search. The rotamers from the given library are then re-ranked and associated with the updated probabilities. Experimental results demonstrate that the proposed protein-dependent libraries significantly outperform the widely used backbone-dependent libraries in terms of the side-chain prediction accuracy and the rotamer ranking ability. Furthermore, without global optimization/search, the side-chain prediction power of the protein-dependent library is still comparable to the global-search-based side-chain prediction methods.

  12. Approach to characterization of the higher order structure of disulfide-containing proteins using hydrogen/deuterium exchange and top-down mass spectrometry.

    Science.gov (United States)

    Wang, Guanbo; Kaltashov, Igor A

    2014-08-05

    Top-down hydrogen/deuterium exchange (HDX) with mass spectrometric (MS) detection has recently matured to become a potent biophysical tool capable of providing valuable information on higher order structure and conformational dynamics of proteins at an unprecedented level of structural detail. However, the scope of the proteins amenable to the analysis by top-down HDX MS still remains limited, with the protein size and the presence of disulfide bonds being the two most important limiting factors. While the limitations imposed by the physical size of the proteins gradually become more relaxed as the sensitivity, resolution and dynamic range of modern MS instrumentation continue to improve at an ever accelerating pace, the presence of the disulfide linkages remains a much less forgiving limitation even for the proteins of relatively modest size. To circumvent this problem, we introduce an online chemical reduction step following completion and quenching of the HDX reactions and prior to the top-down MS measurements of deuterium occupancy of individual backbone amides. Application of the new methodology to the top-down HDX MS characterization of a small (99 residue long) disulfide-containing protein β2-microglobulin allowed the backbone amide protection to be probed with nearly a single-residue resolution across the entire sequence. The high-resolution backbone protection pattern deduced from the top-down HDX MS measurements carried out under native conditions is in excellent agreement with the crystal structure of the protein and high-resolution NMR data, suggesting that introduction of the chemical reduction step to the top-down routine does not trigger hydrogen scrambling either during the electrospray ionization process or in the gas phase prior to the protein ion dissociation.

  13. Recognition of functional sites in protein structures.

    Science.gov (United States)

    Shulman-Peleg, Alexandra; Nussinov, Ruth; Wolfson, Haim J

    2004-06-04

    Recognition of regions on the surface of one protein, that are similar to a binding site of another is crucial for the prediction of molecular interactions and for functional classifications. We first describe a novel method, SiteEngine, that assumes no sequence or fold similarities and is able to recognize proteins that have similar binding sites and may perform similar functions. We achieve high efficiency and speed by introducing a low-resolution surface representation via chemically important surface points, by hashing triangles of physico-chemical properties and by application of hierarchical scoring schemes for a thorough exploration of global and local similarities. We proceed to rigorously apply this method to functional site recognition in three possible ways: first, we search a given functional site on a large set of complete protein structures. Second, a potential functional site on a protein of interest is compared with known binding sites, to recognize similar features. Third, a complete protein structure is searched for the presence of an a priori unknown functional site, similar to known sites. Our method is robust and efficient enough to allow computationally demanding applications such as the first and the third. From the biological standpoint, the first application may identify secondary binding sites of drugs that may lead to side-effects. The third application finds new potential sites on the protein that may provide targets for drug design. Each of the three applications may aid in assigning a function and in classification of binding patterns. We highlight the advantages and disadvantages of each type of search, provide examples of large-scale searches of the entire Protein Data Base and make functional predictions.

  14. Structural basis of antifreeze activity of a bacterial multi-domain antifreeze protein.

    Directory of Open Access Journals (Sweden)

    Chen Wang

    Full Text Available Antifreeze proteins (AFPs enhance the survival of organisms inhabiting cold environments by affecting the formation and/or structure of ice. We report the crystal structure of the first multi-domain AFP that has been characterized. The two ice binding domains are structurally similar. Each consists of an irregular β-helix with a triangular cross-section and a long α-helix that runs parallel on one side of the β-helix. Both domains are stabilized by hydrophobic interactions. A flat plane on the same face of each domain's β-helix was identified as the ice binding site. Mutating any of the smaller residues on the ice binding site to bulkier ones decreased the antifreeze activity. The bulky side chain of Leu174 in domain A sterically hinders the binding of water molecules to the protein backbone, partially explaining why antifreeze activity by domain A is inferior to that of domain B. Our data provide a molecular basis for understanding differences in antifreeze activity between the two domains of this protein and general insight on how structural differences in the ice-binding sites affect the activity of AFPs.

  15. Automated Protein Structure Modeling with SWISS-MODEL Workspace and the Protein Model Portal

    OpenAIRE

    Bordoli, Lorenza; Schwede, Torsten

    2012-01-01

    Comparative protein structure modeling is a computational approach to build three-dimensional structural models for proteins using experimental structures of related protein family members as templates. Regular blind assessments of modeling accuracy have demonstrated that comparative protein structure modeling is currently the most reliable technique to model protein structures. Homology models are often sufficiently accurate to substitute for experimental structures in a wide variety of appl...

  16. Investigation of non-corrin cobalt(II)-containing sites in protein structures of the Protein Data Bank.

    Science.gov (United States)

    Abriata, Luciano Andres

    2013-04-01

    Protein X-ray structures with non-corrin cobalt(II)-containing sites, either natural or substituting another native ion, were downloaded from the Protein Data Bank and explored to (i) describe which amino acids are involved in their first ligand shells and (ii) analyze cobalt(II)-donor bond lengths in comparison with previously reported target distances, CSD data and EXAFS data. The set of amino acids involved in Co(II) binding is similar to that observed for catalytic Zn(II) sites, i.e. with a large fraction of carboxylate O atoms from aspartate and glutamate and aromatic N atoms from histidine. The computed Co(II)-donor bond lengths were found to depend strongly on structure resolution, an artifact previously detected for other metal-donor distances. Small corrections are suggested for the target bond lengths to the aromatic N atoms of histidines and the O atoms of water and hydroxide. The available target distance for cysteine (Scys) is confirmed; those for backbone O and other donors remain uncertain and should be handled with caution in refinement and modeling protocols. Finally, a relationship between both Co(II)-O bond lengths in bidentate carboxylates is quantified.

  17. Accurate determination of interfacial protein secondary structure by combining interfacial-sensitive amide I and amide III spectral signals.

    Science.gov (United States)

    Ye, Shuji; Li, Hongchun; Yang, Weilai; Luo, Yi

    2014-01-29

    Accurate determination of protein structures at the interface is essential to understand the nature of interfacial protein interactions, but it can only be done with a few, very limited experimental methods. Here, we demonstrate for the first time that sum frequency generation vibrational spectroscopy can unambiguously differentiate the interfacial protein secondary structures by combining surface-sensitive amide I and amide III spectral signals. This combination offers a powerful tool to directly distinguish random-coil (disordered) and α-helical structures in proteins. From a systematic study on the interactions between several antimicrobial peptides (including LKα14, mastoparan X, cecropin P1, melittin, and pardaxin) and lipid bilayers, it is found that the spectral profiles of the random-coil and α-helical structures are well separated in the amide III spectra, appearing below and above 1260 cm(-1), respectively. For the peptides with a straight backbone chain, the strength ratio for the peaks of the random-coil and α-helical structures shows a distinct linear relationship with the fraction of the disordered structure deduced from independent NMR experiments reported in the literature. It is revealed that increasing the fraction of negatively charged lipids can induce a conformational change of pardaxin from random-coil to α-helical structures. This experimental protocol can be employed for determining the interfacial protein secondary structures and dynamics in situ and in real time without extraneous labels.

  18. PCNA Structure and Interactions with Partner Proteins

    KAUST Repository

    Oke, Muse; Zaher, Manal S.; Hamdan, Samir

    2018-01-01

    Proliferating cell nuclear antigen (PCNA) consists of three identical monomers that topologically encircle double-stranded DNA. PCNA stimulates the processivity of DNA polymerase δ and, to a less extent, the intrinsically highly processive DNA polymerase ε. It also functions as a platform that recruits and coordinates the activities of a large number of DNA processing proteins. Emerging structural and biochemical studies suggest that the nature of PCNA-partner proteins interactions is complex. A hydrophobic groove at the front side of PCNA serves as a primary docking site for the consensus PIP box motifs present in many PCNA-binding partners. Sequences that immediately flank the PIP box motif or regions that are distant from it could also interact with the hydrophobic groove and other regions of PCNA. Posttranslational modifications on the backside of PCNA could add another dimension to its interaction with partner proteins. An encounter of PCNA with different DNA structures might also be involved in coordinating its interactions. Finally, the ability of PCNA to bind up to three proteins while topologically linked to DNA suggests that it would be a versatile toolbox in many different DNA processing reactions.

  19. PCNA Structure and Interactions with Partner Proteins

    KAUST Repository

    Oke, Muse

    2018-01-29

    Proliferating cell nuclear antigen (PCNA) consists of three identical monomers that topologically encircle double-stranded DNA. PCNA stimulates the processivity of DNA polymerase δ and, to a less extent, the intrinsically highly processive DNA polymerase ε. It also functions as a platform that recruits and coordinates the activities of a large number of DNA processing proteins. Emerging structural and biochemical studies suggest that the nature of PCNA-partner proteins interactions is complex. A hydrophobic groove at the front side of PCNA serves as a primary docking site for the consensus PIP box motifs present in many PCNA-binding partners. Sequences that immediately flank the PIP box motif or regions that are distant from it could also interact with the hydrophobic groove and other regions of PCNA. Posttranslational modifications on the backside of PCNA could add another dimension to its interaction with partner proteins. An encounter of PCNA with different DNA structures might also be involved in coordinating its interactions. Finally, the ability of PCNA to bind up to three proteins while topologically linked to DNA suggests that it would be a versatile toolbox in many different DNA processing reactions.

  20. Protein secondary structure: category assignment and predictability

    DEFF Research Database (Denmark)

    Andersen, Claus A.; Bohr, Henrik; Brunak, Søren

    2001-01-01

    In the last decade, the prediction of protein secondary structure has been optimized using essentially one and the same assignment scheme known as DSSP. We present here a different scheme, which is more predictable. This scheme predicts directly the hydrogen bonds, which stabilize the secondary......-forward neural network with one hidden layer on a data set identical to the one used in earlier work....

  1. Protein-mediated surface structuring in biomembranes

    Directory of Open Access Journals (Sweden)

    Maggio B.

    2005-01-01

    Full Text Available The lipids and proteins of biomembranes exhibit highly dissimilar conformations, geometrical shapes, amphipathicity, and thermodynamic properties which constrain their two-dimensional molecular packing, electrostatics, and interaction preferences. This causes inevitable development of large local tensions that frequently relax into phase or compositional immiscibility along lateral and transverse planes of the membrane. On the other hand, these effects constitute the very codes that mediate molecular and structural changes determining and controlling the possibilities for enzymatic activity, apposition and recombination in biomembranes. The presence of proteins constitutes a major perturbing factor for the membrane sculpturing both in terms of its surface topography and dynamics. We will focus on some results from our group within this context and summarize some recent evidence for the active involvement of extrinsic (myelin basic protein, integral (Folch-Lees proteolipid protein and amphitropic (c-Fos and c-Jun proteins, as well as a membrane-active amphitropic phosphohydrolytic enzyme (neutral sphingomyelinase, in the process of lateral segregation and dynamics of phase domains, sculpturing of the surface topography, and the bi-directional modulation of the membrane biochemical reactivity.

  2. TRX-LOGOS - a graphical tool to demonstrate DNA information content dependent upon backbone dynamics in addition to base sequence.

    Science.gov (United States)

    Fortin, Connor H; Schulze, Katharina V; Babbitt, Gregory A

    2015-01-01

    It is now widely-accepted that DNA sequences defining DNA-protein interactions functionally depend upon local biophysical features of DNA backbone that are important in defining sites of binding interaction in the genome (e.g. DNA shape, charge and intrinsic dynamics). However, these physical features of DNA polymer are not directly apparent when analyzing and viewing Shannon information content calculated at single nucleobases in a traditional sequence logo plot. Thus, sequence logos plots are severely limited in that they convey no explicit information regarding the structural dynamics of DNA backbone, a feature often critical to binding specificity. We present TRX-LOGOS, an R software package and Perl wrapper code that interfaces the JASPAR database for computational regulatory genomics. TRX-LOGOS extends the traditional sequence logo plot to include Shannon information content calculated with regard to the dinucleotide-based BI-BII conformation shifts in phosphate linkages on the DNA backbone, thereby adding a visual measure of intrinsic DNA flexibility that can be critical for many DNA-protein interactions. TRX-LOGOS is available as an R graphics module offered at both SourceForge and as a download supplement at this journal. To demonstrate the general utility of TRX logo plots, we first calculated the information content for 416 Saccharomyces cerevisiae transcription factor binding sites functionally confirmed in the Yeastract database and matched to previously published yeast genomic alignments. We discovered that flanking regions contain significantly elevated information content at phosphate linkages than can be observed at nucleobases. We also examined broader transcription factor classifications defined by the JASPAR database, and discovered that many general signatures of transcription factor binding are locally more information rich at the level of DNA backbone dynamics than nucleobase sequence. We used TRX-logos in combination with MEGA 6.0 software

  3. The Three-Dimensional Solution Structure of the Src Homology Domain-2 of the Growth Factor Receptor-Bound Protein-2

    International Nuclear Information System (INIS)

    Senior, Mary M.; Frederick, Anne F.; Black, Stuart; Murgolo, Nicholas J.; Perkins, Louise M.; Wilson, Oswald; Snow, Mark E.; Wang Yusen

    1998-01-01

    A set of high-resolution three-dimensional solution structures of the Src homology region-2 (SH2) domain of the growth factor receptor-bound protein-2 was determined using heteronuclear NMR spectroscopy. The NMR data used in this study were collected on a stable monomeric protein solution that was free of protein aggregates and proteolysis. The solution structure was determined based upon a total of 1439 constraints, which included 1326 nuclear Overhauser effect distance constraints, 70 hydrogen bond constraints, and 43 dihedral angle constraints. Distance geometry-simulated annealing calculations followed by energy minimization yielded a family of 18 structures that converged to a root-mean-square deviation of 1.09 A for all backbone atoms and 0.40 A for the backbone atoms of the central β-sheet. The core structure of the SH2 domain contains an antiparallel β-sheet flanked by two parallel α-helices displaying an overall architecture that is similar to other known SH2 domain structures. This family of NMR structures is compared to the X-ray structure and to another family of NMR solution structures determined under different solution conditions

  4. Distill: a suite of web servers for the prediction of one-, two- and three-dimensional structural features of proteins

    Directory of Open Access Journals (Sweden)

    Walsh Ian

    2006-09-01

    Full Text Available Abstract Background We describe Distill, a suite of servers for the prediction of protein structural features: secondary structure; relative solvent accessibility; contact density; backbone structural motifs; residue contact maps at 6, 8 and 12 Angstrom; coarse protein topology. The servers are based on large-scale ensembles of recursive neural networks and trained on large, up-to-date, non-redundant subsets of the Protein Data Bank. Together with structural feature predictions, Distill includes a server for prediction of Cα traces for short proteins (up to 200 amino acids. Results The servers are state-of-the-art, with secondary structure predicted correctly for nearly 80% of residues (currently the top performance on EVA, 2-class solvent accessibility nearly 80% correct, and contact maps exceeding 50% precision on the top non-diagonal contacts. A preliminary implementation of the predictor of protein Cα traces featured among the top 20 Novel Fold predictors at the last CASP6 experiment as group Distill (ID 0348. The majority of the servers, including the Cα trace predictor, now take into account homology information from the PDB, when available, resulting in greatly improved reliability. Conclusion All predictions are freely available through a simple joint web interface and the results are returned by email. In a single submission the user can send protein sequences for a total of up to 32k residues to all or a selection of the servers. Distill is accessible at the address: http://distill.ucd.ie/distill/.

  5. Mapping the backbone of science.

    Energy Technology Data Exchange (ETDEWEB)

    Klavans, Richard (Indiana University, Bloomington, IN); BÞorner, Katy (Strategies for Science & Technology, Incorporation, Berwyn, PA); Boyack, Kevin W.

    2004-11-01

    This paper presents a new map representing the structure of all of science, based on journal articles, including both the natural and social sciences. Similar to cartographic maps of our world, the map of science provides a bird's eye view of today's scientific landscape. It can be used to visually identify major areas of science, their size, similarity, and interconnectedness. In order to be useful, the map needs to be accurate on a local and on a global scale. While our recent work has focused on the former aspect, this paper summarizes results on how to achieve structural accuracy. Eight alternative measures of journal similarity were applied to a data set of 7,121 journals covering over 1 million documents in the combined Science Citation and Social Science Citation Indexes. For each journal similarity measure we generated two-dimensional spatial layouts using the force-directed graph layout tool, VxOrd. Next, mutual information values were calculated for each graph at different clustering levels to give a measure of structural accuracy for each map. The best co-citation and inter-citation maps according to local and structural accuracy were selected and are presented and characterized. These two maps are compared to establish robustness. The inter-citation map is then used to examine linkages between disciplines. Biochemistry appears as the most interdisciplinary discipline in science.

  6. Protein structure modeling for CASP10 by multiple layers of global optimization.

    Science.gov (United States)

    Joo, Keehyoung; Lee, Juyong; Sim, Sangjin; Lee, Sun Young; Lee, Kiho; Heo, Seungryong; Lee, In-Ho; Lee, Sung Jong; Lee, Jooyoung

    2014-02-01

    In the template-based modeling (TBM) category of CASP10 experiment, we introduced a new protocol called protein modeling system (PMS) to generate accurate protein structures in terms of side-chains as well as backbone trace. In the new protocol, a global optimization algorithm, called conformational space annealing (CSA), is applied to the three layers of TBM procedure: multiple sequence-structure alignment, 3D chain building, and side-chain re-modeling. For 3D chain building, we developed a new energy function which includes new distance restraint terms of Lorentzian type (derived from multiple templates), and new energy terms that combine (physical) energy terms such as dynamic fragment assembly (DFA) energy, DFIRE statistical potential energy, hydrogen bonding term, etc. These physical energy terms are expected to guide the structure modeling especially for loop regions where no template structures are available. In addition, we developed a new quality assessment method based on random forest machine learning algorithm to screen templates, multiple alignments, and final models. For TBM targets of CASP10, we find that, due to the combination of three stages of CSA global optimizations and quality assessment, the modeling accuracy of PMS improves at each additional stage of the protocol. It is especially noteworthy that the side-chains of the final PMS models are far more accurate than the models in the intermediate steps. Copyright © 2013 Wiley Periodicals, Inc.

  7. Green Network Planning Model for Optical Backbones

    DEFF Research Database (Denmark)

    Gutierrez Lopez, Jose Manuel; Riaz, M. Tahir; Jensen, Michael

    2010-01-01

    on the environment in general. In network planning there are existing planning models focused on QoS provisioning, investment minimization or combinations of both and other parameters. But there is a lack of a model for designing green optical backbones. This paper presents novel ideas to be able to define......Communication networks are becoming more essential for our daily lives and critically important for industry and governments. The intense growth in the backbone traffic implies an increment of the power demands of the transmission systems. This power usage might have a significant negative effect...

  8. SCit: web tools for protein side chain conformation analysis

    OpenAIRE

    Gautier, R.; Camproux, A.-C.; Tufféry, P.

    2004-01-01

    SCit is a web server providing services for protein side chain conformation analysis and side chain positioning. Specific services use the dependence of the side chain conformations on the local backbone conformation, which is described using a structural alphabet that describes the conformation of fragments of four-residue length in a limited library of structural prototypes. Based on this concept, SCit uses sets of rotameric conformations dependent on the local backbone conformation of each...

  9. Bulk Heterojunction Solar Cells: Impact of Minor Structural Modifications to the Polymer Backbone on the Polymer-Fullerene Mixing and Packing and on the Fullerene-Fullerene Connecting Network

    KAUST Repository

    Wang, Tonghui

    2018-01-25

    The morphology of the active layer of a bulk heterojunction solar cell, made of a blend of an electron-donating polymer and an electron-accepting fullerene derivative, is known to play a determining role in device performance. Here, a combination of molecular dynamics simulations and long-range corrected density functional theory calculations is used to elucidate the molecular-scale effects that even minor structural changes to the polymer backbone can have on the “local” morphology; this study focuses on the extent of polymer–fullerene mixing, on their packing, and on the characteristics of the fullerene–fullerene connecting network in the mixed regions, aspects that are difficult to access experimentally. Three representative polymer donors are investigated: (i) poly[(5,6-difluoro-2,1,3-benzothiadiazol-4,7-diyl)-alt-(3,3′″-di(2-octyldodecyl)-2,2′;5′,2″;5″,2′″-quaterthiophen-5,5′″-diyl)] (PffBT4T-2OD); (ii) poly[(2,1,3-benzothiadiazol-4,7-diyl)-alt-(3,3′″-di(2-octyldodecyl)-2,2′;5′,2″;5″,2′″-quaterthiophen-5,5′″-diyl)] (PBT4T-2OD), where the fluorine atoms in the benzothiadiazole moieties of PffBT4T-2OD are replaced with hydrogen atoms; and (iii) poly[(2,2′-bithiophene)-alt-(4,7-bis((2-decyltetradecyl)thiophen-2-yl)-5,6-difluoro-2-propyl-2H-benzo[d][1,2,3]triazole)] (PT2-FTAZ), where the sulfur atoms in the benzothiadiazole moieties of PffBT4T-2OD are replaced with nitrogen atoms carrying a linear C3H7 side-chain; these polymers are mixed with the phenyl-C71-butyric acid methyl ester (PC71BM) acceptor. This study also discusses the nature of the charge-transfer electronic states appearing at the donor–acceptor interfaces, the electronic couplings relevant for the charge-recombination process, and the electron-transfer features between neighboring PC71BM molecules.

  10. Localization of binding sites of Ulex europaeus I, Helix pomatia and Griffonia simplicifolia I-B4 lectins and analysis of their backbone structures by several glycosidases and poly-N-acetyllactosamine-specific lectins in human breast carcinomas.

    Science.gov (United States)

    Ito, N; Imai, S; Haga, S; Nagaike, C; Morimura, Y; Hatake, K

    1996-09-01

    Several studies have shown the deletion of blood group A or B antigens and the accumulation of H antigens in human breast carcinomas. Other studies have independently demonstrated that the binding sites of lectins such as Helix pomatia agglutinin (HPA) and Griffonia simplicifolia agglutinin I-B4 (GSAI-B4) are highly expressed in these cells. In order to clarify the molecular mechanisms of malignant transformation and metastasis of carcinoma cells, it is important to understand the relationship between such phenotypically distinct events. For this purpose, we examined whether the binding sites of these lectins and Ulex europaeus agglutinin I (UEA-I) are expressed concomitantly in the same carcinoma cells and analyzed their backbone structures. The expression of the binding sites of these lectins was observed independently of the blood group (ABO) of the patients and was not affected by the histological type of the carcinomas. Observation of serial sections stained with these lectins revealed that the distribution of HPA binding sites was almost identical to that of GSAI-B4 in most cases. Furthermore, in some cases, UEA-I binding patterns were similar to those of HPA and GSAI-B4 but in other cases, mosaic staining patterns with these lectins were also observed, i.e., some cell clusters were stained with both HPA and GSAI-B4 but not with UEA-I and adjacent cell clusters were stained only with UEA-I. Digestion with endo-beta-galactosidase or N-glycosidase F markedly reduced the staining intensity of these lectins. Together with the reduction of staining by these lectins, reactivity with Griffonia simplicifolia agglutinin II appeared in carcinoma cells following endo-beta-galactosidase digestion. Among the lectins specific to poly-N-acetyllactosamine, Lycopersicon esculentum agglutinin (LEA) most vividly and consistently stained the cancer cells. Next to LEA, pokeweed mitogen agglutinin was also effective in staining these cells. Carcinoma cells reactive with these

  11. Bulk Heterojunction Solar Cells: Impact of Minor Structural Modifications to the Polymer Backbone on the Polymer-Fullerene Mixing and Packing and on the Fullerene-Fullerene Connecting Network

    KAUST Repository

    Wang, Tonghui; Chen, Xiankai; Ashokan, Ajith; Zheng, Zilong; Ravva, Mahesh Kumar; Bré das, Jean-Luc

    2018-01-01

    The morphology of the active layer of a bulk heterojunction solar cell, made of a blend of an electron-donating polymer and an electron-accepting fullerene derivative, is known to play a determining role in device performance. Here, a combination of molecular dynamics simulations and long-range corrected density functional theory calculations is used to elucidate the molecular-scale effects that even minor structural changes to the polymer backbone can have on the “local” morphology; this study focuses on the extent of polymer–fullerene mixing, on their packing, and on the characteristics of the fullerene–fullerene connecting network in the mixed regions, aspects that are difficult to access experimentally. Three representative polymer donors are investigated: (i) poly[(5,6-difluoro-2,1,3-benzothiadiazol-4,7-diyl)-alt-(3,3′″-di(2-octyldodecyl)-2,2′;5′,2″;5″,2′″-quaterthiophen-5,5′″-diyl)] (PffBT4T-2OD); (ii) poly[(2,1,3-benzothiadiazol-4,7-diyl)-alt-(3,3′″-di(2-octyldodecyl)-2,2′;5′,2″;5″,2′″-quaterthiophen-5,5′″-diyl)] (PBT4T-2OD), where the fluorine atoms in the benzothiadiazole moieties of PffBT4T-2OD are replaced with hydrogen atoms; and (iii) poly[(2,2′-bithiophene)-alt-(4,7-bis((2-decyltetradecyl)thiophen-2-yl)-5,6-difluoro-2-propyl-2H-benzo[d][1,2,3]triazole)] (PT2-FTAZ), where the sulfur atoms in the benzothiadiazole moieties of PffBT4T-2OD are replaced with nitrogen atoms carrying a linear C3H7 side-chain; these polymers are mixed with the phenyl-C71-butyric acid methyl ester (PC71BM) acceptor. This study also discusses the nature of the charge-transfer electronic states appearing at the donor–acceptor interfaces, the electronic couplings relevant for the charge-recombination process, and the electron-transfer features between neighboring PC71BM molecules.

  12. Protein loop modeling using a new hybrid energy function and its application to modeling in inaccurate structural environments.

    Directory of Open Access Journals (Sweden)

    Hahnbeom Park

    Full Text Available Protein loop modeling is a tool for predicting protein local structures of particular interest, providing opportunities for applications involving protein structure prediction and de novo protein design. Until recently, the majority of loop modeling methods have been developed and tested by reconstructing loops in frameworks of experimentally resolved structures. In many practical applications, however, the protein loops to be modeled are located in inaccurate structural environments. These include loops in model structures, low-resolution experimental structures, or experimental structures of different functional forms. Accordingly, discrepancies in the accuracy of the structural environment assumed in development of the method and that in practical applications present additional challenges to modern loop modeling methods. This study demonstrates a new strategy for employing a hybrid energy function combining physics-based and knowledge-based components to help tackle this challenge. The hybrid energy function is designed to combine the strengths of each energy component, simultaneously maintaining accurate loop structure prediction in a high-resolution framework structure and tolerating minor environmental errors in low-resolution structures. A loop modeling method based on global optimization of this new energy function is tested on loop targets situated in different levels of environmental errors, ranging from experimental structures to structures perturbed in backbone as well as side chains and template-based model structures. The new method performs comparably to force field-based approaches in loop reconstruction in crystal structures and better in loop prediction in inaccurate framework structures. This result suggests that higher-accuracy predictions would be possible for a broader range of applications. The web server for this method is available at http://galaxy.seoklab.org/loop with the PS2 option for the scoring function.

  13. Reduced dimensionality (3,2)D NMR experiments and their automated analysis: implications to high-throughput structural studies on proteins.

    Science.gov (United States)

    Reddy, Jithender G; Kumar, Dinesh; Hosur, Ramakrishna V

    2015-02-01

    Protein NMR spectroscopy has expanded dramatically over the last decade into a powerful tool for the study of their structure, dynamics, and interactions. The primary requirement for all such investigations is sequence-specific resonance assignment. The demand now is to obtain this information as rapidly as possible and in all types of protein systems, stable/unstable, soluble/insoluble, small/big, structured/unstructured, and so on. In this context, we introduce here two reduced dimensionality experiments – (3,2)D-hNCOcanH and (3,2)D-hNcoCAnH – which enhance the previously described 2D NMR-based assignment methods quite significantly. Both the experiments can be recorded in just about 2-3 h each and hence would be of immense value for high-throughput structural proteomics and drug discovery research. The applicability of the method has been demonstrated using alpha-helical bovine apo calbindin-D9k P43M mutant (75 aa) protein. Automated assignment of this data using AUTOBA has been presented, which enhances the utility of these experiments. The backbone resonance assignments so derived are utilized to estimate secondary structures and the backbone fold using Web-based algorithms. Taken together, we believe that the method and the protocol proposed here can be used for routine high-throughput structural studies of proteins. Copyright © 2014 John Wiley & Sons, Ltd.

  14. Classification of proteins: available structural space for molecular modeling.

    Science.gov (United States)

    Andreeva, Antonina

    2012-01-01

    The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.

  15. Protein crystal structure analysis using synchrotron radiation at atomic resolution

    International Nuclear Information System (INIS)

    Nonaka, Takamasa

    1999-01-01

    We can now obtain a detailed picture of protein, allowing the identification of individual atoms, by interpreting the diffraction of X-rays from a protein crystal at atomic resolution, 1.2 A or better. As of this writing, about 45 unique protein structures beyond 1.2 A resolution have been deposited in the Protein Data Bank. This review provides a simplified overview of how protein crystallographers use such diffraction data to solve, refine, and validate protein structures. (author)

  16. MCBT: Multi-Hop Cluster Based Stable Backbone Trees for Data Collection and Dissemination in WSNs

    Directory of Open Access Journals (Sweden)

    Tae-Jin Lee

    2009-07-01

    Full Text Available We propose a stable backbone tree construction algorithm using multi-hop clusters for wireless sensor networks (WSNs. The hierarchical cluster structure has advantages in data fusion and aggregation. Energy consumption can be decreased by managing nodes with cluster heads. Backbone nodes, which are responsible for performing and managing multi-hop communication, can reduce the communication overhead such as control traffic and minimize the number of active nodes. Previous backbone construction algorithms, such as Hierarchical Cluster-based Data Dissemination (HCDD and Multicluster, Mobile, Multimedia radio network (MMM, consume energy quickly. They are designed without regard to appropriate factors such as residual energy and degree (the number of connections or edges to other nodes of a node for WSNs. Thus, the network is quickly disconnected or has to reconstruct a backbone. We propose a distributed algorithm to create a stable backbone by selecting the nodes with higher energy or degree as the cluster heads. This increases the overall network lifetime. Moreover, the proposed method balances energy consumption by distributing the traffic load among nodes around the cluster head. In the simulation, the proposed scheme outperforms previous clustering schemes in terms of the average and the standard deviation of residual energy or degree of backbone nodes, the average residual energy of backbone nodes after disseminating the sensed data, and the network lifetime.

  17. Segmental isotope labeling of proteins for NMR structural study using a protein S tag for higher expression and solubility

    International Nuclear Information System (INIS)

    Kobayashi, Hiroshi; Swapna, G. V. T.; Wu, Kuen-Phon; Afinogenova, Yuliya; Conover, Kenith; Mao, Binchen; Montelione, Gaetano T.; Inouye, Masayori

    2012-01-01

    A common obstacle to NMR studies of proteins is sample preparation. In many cases, proteins targeted for NMR studies are poorly expressed and/or expressed in insoluble forms. Here, we describe a novel approach to overcome these problems. In the protein S tag-intein (PSTI) technology, two tandem 92-residue N-terminal domains of protein S (PrS 2 ) from Myxococcus xanthus is fused at the N-terminal end of a protein to enhance its expression and solubility. Using intein technology, the isotope-labeled PrS 2 -tag is replaced with non-isotope labeled PrS 2 -tag, silencing the NMR signals from PrS 2 -tag in isotope-filtered 1 H-detected NMR experiments. This method was applied to the E. coli ribosome binding factor A (RbfA), which aggregates and precipitates in the absence of a solubilization tag unless the C-terminal 25-residue segment is deleted (RbfAΔ25). Using the PrS 2 -tag, full-length well-behaved RbfA samples could be successfully prepared for NMR studies. PrS 2 (non-labeled)-tagged RbfA (isotope-labeled) was produced with the use of the intein approach. The well-resolved TROSY-HSQC spectrum of full-length PrS 2 -tagged RbfA superimposes with the TROSY-HSQC spectrum of RbfAΔ25, indicating that PrS 2 -tag does not affect the structure of the protein to which it is fused. Using a smaller PrS-tag, consisting of a single N-terminal domain of protein S, triple resonance experiments were performed, and most of the backbone 1 H, 15 N and 13 C resonance assignments for full-length E. coli RbfA were determined. Analysis of these chemical shift data with the Chemical Shift Index and heteronuclear 1 H– 15 N NOE measurements reveal the dynamic nature of the C-terminal segment of the full-length RbfA protein, which could not be inferred using the truncated RbfAΔ25 construct. CS-Rosetta calculations also demonstrate that the core structure of full-length RbfA is similar to that of the RbfAΔ25 construct.

  18. Predicting Protein Secondary Structure with Markov Models

    DEFF Research Database (Denmark)

    Fischer, Paul; Larsen, Simon; Thomsen, Claus

    2004-01-01

    we are considering here, is to predict the secondary structure from the primary one. To this end we train a Markov model on training data and then use it to classify parts of unknown protein sequences as sheets, helices or coils. We show how to exploit the directional information contained...... in the Markov model for this task. Classifications that are purely based on statistical models might not always be biologically meaningful. We present combinatorial methods to incorporate biological background knowledge to enhance the prediction performance....

  19. GIS: a comprehensive source for protein structure similarities.

    Science.gov (United States)

    Guerler, Aysam; Knapp, Ernst-Walter

    2010-07-01

    A web service for analysis of protein structures that are sequentially or non-sequentially similar was generated. Recently, the non-sequential structure alignment algorithm GANGSTA+ was introduced. GANGSTA+ can detect non-sequential structural analogs for proteins stated to possess novel folds. Since GANGSTA+ ignores the polypeptide chain connectivity of secondary structure elements (i.e. alpha-helices and beta-strands), it is able to detect structural similarities also between proteins whose sequences were reshuffled during evolution. GANGSTA+ was applied in an all-against-all comparison on the ASTRAL40 database (SCOP version 1.75), which consists of >10,000 protein domains yielding about 55 x 10(6) possible protein structure alignments. Here, we provide the resulting protein structure alignments as a public web-based service, named GANGSTA+ Internet Services (GIS). We also allow to browse the ASTRAL40 database of protein structures with GANGSTA+ relative to an externally given protein structure using different constraints to select specific results. GIS allows us to analyze protein structure families according to the SCOP classification scheme. Additionally, users can upload their own protein structures for pairwise protein structure comparison, alignment against all protein structures of the ASTRAL40 database (SCOP version 1.75) or symmetry analysis. GIS is publicly available at http://agknapp.chemie.fu-berlin.de/gplus.

  20. Structural protein descriptors in 1-dimension and their sequence-based predictions.

    Science.gov (United States)

    Kurgan, Lukasz; Disfani, Fatemeh Miri

    2011-09-01

    The last few decades observed an increasing interest in development and application of 1-dimensional (1D) descriptors of protein structure. These descriptors project 3D structural features onto 1D strings of residue-wise structural assignments. They cover a wide-range of structural aspects including conformation of the backbone, burying depth/solvent exposure and flexibility of residues, and inter-chain residue-residue contacts. We perform first-of-its-kind comprehensive comparative review of the existing 1D structural descriptors. We define, review and categorize ten structural descriptors and we also describe, summarize and contrast over eighty computational models that are used to predict these descriptors from the protein sequences. We show that the majority of the recent sequence-based predictors utilize machine learning models, with the most popular being neural networks, support vector machines, hidden Markov models, and support vector and linear regressions. These methods provide high-throughput predictions and most of them are accessible to a non-expert user via web servers and/or stand-alone software packages. We empirically evaluate several recent sequence-based predictors of secondary structure, disorder, and solvent accessibility descriptors using a benchmark set based on CASP8 targets. Our analysis shows that the secondary structure can be predicted with over 80% accuracy and segment overlap (SOV), disorder with over 0.9 AUC, 0.6 Matthews Correlation Coefficient (MCC), and 75% SOV, and relative solvent accessibility with PCC of 0.7 and MCC of 0.6 (0.86 when homology is used). We demonstrate that the secondary structure predicted from sequence without the use of homology modeling is as good as the structure extracted from the 3D folds predicted by top-performing template-based methods.

  1. Automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal.

    Science.gov (United States)

    Bordoli, Lorenza; Schwede, Torsten

    2012-01-01

    Comparative protein structure modeling is a computational approach to build three-dimensional structural models for proteins using experimental structures of related protein family members as templates. Regular blind assessments of modeling accuracy have demonstrated that comparative protein structure modeling is currently the most reliable technique to model protein structures. Homology models are often sufficiently accurate to substitute for experimental structures in a wide variety of applications. Since the usefulness of a model for specific application is determined by its accuracy, model quality estimation is an essential component of protein structure prediction. Comparative protein modeling has become a routine approach in many areas of life science research since fully automated modeling systems allow also nonexperts to build reliable models. In this chapter, we describe practical approaches for automated protein structure modeling with SWISS-MODEL Workspace and the Protein Model Portal.

  2. Structural model for the interaction of a designed Ankyrin Repeat Protein with the human epidermal growth factor receptor 2.

    Directory of Open Access Journals (Sweden)

    V Chandana Epa

    Full Text Available Designed Ankyrin Repeat Proteins are a class of novel binding proteins that can be selected and evolved to bind to targets with high affinity and specificity. We are interested in the DARPin H10-2-G3, which has been evolved to bind with very high affinity to the human epidermal growth factor receptor 2 (HER2. HER2 is found to be over-expressed in 30% of breast cancers, and is the target for the FDA-approved therapeutic monoclonal antibodies trastuzumab and pertuzumab and small molecule tyrosine kinase inhibitors. Here, we use computational macromolecular docking, coupled with several interface metrics such as shape complementarity, interaction energy, and electrostatic complementarity, to model the structure of the complex between the DARPin H10-2-G3 and HER2. We analyzed the interface between the two proteins and then validated the structural model by showing that selected HER2 point mutations at the putative interface with H10-2-G3 reduce the affinity of binding up to 100-fold without affecting the binding of trastuzumab. Comparisons made with a subsequently solved X-ray crystal structure of the complex yielded a backbone atom root mean square deviation of 0.84-1.14 Ångstroms. The study presented here demonstrates the capability of the computational techniques of structural bioinformatics in generating useful structural models of protein-protein interactions.

  3. “Pinning strategy”: a novel approach for predicting the backbone ...

    Indian Academy of Sciences (India)

    Prakash

    To assess the quality of the strategy, we define two measures. The first one ...... modular framework of the protein backbone; Protein Eng. 12. 1063–1073 .... Richardson J S, Getzoff E D and Richardson D C 1978 The beta bulge: a common ...

  4. Backbone Diversity Analysis in Catalyst Design

    NARCIS (Netherlands)

    Maldonado, A.G.; Hageman, J.A.; Mastroianni, S.; Rothenberg, G.

    2009-01-01

    We present a computer-based heuristic framework for designing libraries of homogeneous catalysts. In this approach, a set of given bidentate ligand-metal complexes is disassembled into key substructures (building blocks). These include metal atoms, ligating groups, backbone groups, and residue

  5. ExScal Backbone Network Architecture

    Science.gov (United States)

    2005-01-01

    802.11 battery powered nodes was laid over the sensor network. We adopted the Stargate platform for the backbone tier to serve as the basis for...its head. XSS Hardware and Network: XSS stands for eXtreme Scaling Stargate . A stargate is a linux-based single board computer. It has a 400 MHz

  6. Versatile phosphite ligands based on silsesquioxane backbones

    NARCIS (Netherlands)

    van der Vlugt, JI; Ackerstaff, J; Dijkstra, TW; Mills, AM; Kooijman, H; Spek, AL; Meetsma, A; Abbenhuis, HCL; Vogt, D

    Silsesquioxanes are employed as ligand backbones for the synthesis of novel phosphite compounds with 3,3'-5,5'-tetrakis(tert-butyl)-2,2'-di-oxa-1,1'-biphenyl substituents. Both mono- and bidentate phosphites are prepared in good yields. Two types of silsesquioxanes are employed as starting

  7. Structure based alignment and clustering of proteins (STRALCP)

    Science.gov (United States)

    Zemla, Adam T.; Zhou, Carol E.; Smith, Jason R.; Lam, Marisa W.

    2013-06-18

    Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a "structural footprint" to the cluster.

  8. Alpha complexes in protein structure prediction

    DEFF Research Database (Denmark)

    Winter, Pawel; Fonseca, Rasmus

    2015-01-01

    Reducing the computational effort and increasing the accuracy of potential energy functions is of utmost importance in modeling biological systems, for instance in protein structure prediction, docking or design. Evaluating interactions between nonbonded atoms is the bottleneck of such computations......-complexes from scratch for every configuration encountered during the search for the native structure would make this approach hopelessly slow. However, it is argued that kinetic a-complexes can be used to reduce the computational effort of determining the potential energy when "moving" from one configuration...... to a neighboring one. As a consequence, relatively expensive (initial) construction of an a-complex is expected to be compensated by subsequent fast kinetic updates during the search process. Computational results presented in this paper are limited. However, they suggest that the applicability of a...

  9. Course 12: Proteins: Structural, Thermodynamic and Kinetic Aspects

    Science.gov (United States)

    Finkelstein, A. V.

    1 Introduction 2 Overview of protein architectures and discussion of physical background of their natural selection 2.1 Protein structures 2.2 Physical selection of protein structures 3 Thermodynamic aspects of protein folding 3.1 Reversible denaturation of protein structures 3.2 What do denatured proteins look like? 3.3 Why denaturation of a globular protein is the first-order phase transition 3.4 "Gap" in energy spectrum: The main characteristic that distinguishes protein chains from random polymers 4 Kinetic aspects of protein folding 4.1 Protein folding in vivo 4.2 Protein folding in vitro (in the test-tube) 4.3 Theory of protein folding rates and solution of the Levinthal paradox

  10. Structural Refinement of Proteins by Restrained Molecular Dynamics Simulations with Non-interacting Molecular Fragments.

    Directory of Open Access Journals (Sweden)

    Rong Shen

    2015-10-01

    Full Text Available The knowledge of multiple conformational states is a prerequisite to understand the function of membrane transport proteins. Unfortunately, the determination of detailed atomic structures for all these functionally important conformational states with conventional high-resolution approaches is often difficult and unsuccessful. In some cases, biophysical and biochemical approaches can provide important complementary structural information that can be exploited with the help of advanced computational methods to derive structural models of specific conformational states. In particular, functional and spectroscopic measurements in combination with site-directed mutations constitute one important source of information to obtain these mixed-resolution structural models. A very common problem with this strategy, however, is the difficulty to simultaneously integrate all the information from multiple independent experiments involving different mutations or chemical labels to derive a unique structural model consistent with the data. To resolve this issue, a novel restrained molecular dynamics structural refinement method is developed to simultaneously incorporate multiple experimentally determined constraints (e.g., engineered metal bridges or spin-labels, each treated as an individual molecular fragment with all atomic details. The internal structure of each of the molecular fragments is treated realistically, while there is no interaction between different molecular fragments to avoid unphysical steric clashes. The information from all the molecular fragments is exploited simultaneously to constrain the backbone to refine a three-dimensional model of the conformational state of the protein. The method is illustrated by refining the structure of the voltage-sensing domain (VSD of the Kv1.2 potassium channel in the resting state and by exploring the distance histograms between spin-labels attached to T4 lysozyme. The resulting VSD structures are in good

  11. Sequential backbone assignment based on dipolar amide-to-amide correlation experiments

    Energy Technology Data Exchange (ETDEWEB)

    Xiang, ShengQi; Grohe, Kristof; Rovó, Petra; Vasa, Suresh Kumar; Giller, Karin; Becker, Stefan; Linser, Rasmus, E-mail: rali@nmr.mpibpc.mpg.de [Max Planck Institute for Biophysical Chemistry, Department for NMR-Based Structural Biology (Germany)

    2015-07-15

    Proton detection in solid-state NMR has seen a tremendous increase in popularity in the last years. New experimental techniques allow to exploit protons as an additional source of information on structure, dynamics, and protein interactions with their surroundings. In addition, sensitivity is mostly improved and ambiguity in assignment experiments reduced. We show here that, in the solid state, sequential amide-to-amide correlations turn out to be an excellent, complementary way to exploit amide shifts for unambiguous backbone assignment. For a general assessment, we compare amide-to-amide experiments with the more common {sup 13}C-shift-based methods. Exploiting efficient CP magnetization transfers rather than less efficient INEPT periods, our results suggest that the approach is very feasible for solid-state NMR.

  12. Side chain and backbone contributions of Phe508 to CFTR folding

    Energy Technology Data Exchange (ETDEWEB)

    Thibodeau, Patrick H.; Brautigam, Chad A.; Machius, Mischa; Thomas, Philip J. (U. of Texas-SMED)

    2010-12-07

    Mutations in the cystic fibrosis transmembrane conductance regulator (CFTR), an integral membrane protein, cause cystic fibrosis (CF). The most common CF-causing mutant, deletion of Phe508, fails to properly fold. To elucidate the role Phe508 plays in the folding of CFTR, missense mutations at this position were generated. Only one missense mutation had a pronounced effect on the stability and folding of the isolated domain in vitro. In contrast, many substitutions, including those of charged and bulky residues, disrupted folding of full-length CFTR in cells. Structures of two mutant nucleotide-binding domains (NBDs) reveal only local alterations of the surface near position 508. These results suggest that the peptide backbone plays a role in the proper folding of the domain, whereas the side chain plays a role in defining a surface of NBD1 that potentially interacts with other domains during the maturation of intact CFTR.

  13. Sequential backbone assignment based on dipolar amide-to-amide correlation experiments

    International Nuclear Information System (INIS)

    Xiang, ShengQi; Grohe, Kristof; Rovó, Petra; Vasa, Suresh Kumar; Giller, Karin; Becker, Stefan; Linser, Rasmus

    2015-01-01

    Proton detection in solid-state NMR has seen a tremendous increase in popularity in the last years. New experimental techniques allow to exploit protons as an additional source of information on structure, dynamics, and protein interactions with their surroundings. In addition, sensitivity is mostly improved and ambiguity in assignment experiments reduced. We show here that, in the solid state, sequential amide-to-amide correlations turn out to be an excellent, complementary way to exploit amide shifts for unambiguous backbone assignment. For a general assessment, we compare amide-to-amide experiments with the more common 13 C-shift-based methods. Exploiting efficient CP magnetization transfers rather than less efficient INEPT periods, our results suggest that the approach is very feasible for solid-state NMR

  14. Structural determination of intact proteins using mass spectrometry

    Science.gov (United States)

    Kruppa, Gary [San Francisco, CA; Schoeniger, Joseph S [Oakland, CA; Young, Malin M [Livermore, CA

    2008-05-06

    The present invention relates to novel methods of determining the sequence and structure of proteins. Specifically, the present invention allows for the analysis of intact proteins within a mass spectrometer. Therefore, preparatory separations need not be performed prior to introducing a protein sample into the mass spectrometer. Also disclosed herein are new instrumental developments for enhancing the signal from the desired modified proteins, methods for producing controlled protein fragments in the mass spectrometer, eliminating complex microseparations, and protein preparatory chemical steps necessary for cross-linking based protein structure determination.Additionally, the preferred method of the present invention involves the determination of protein structures utilizing a top-down analysis of protein structures to search for covalent modifications. In the preferred method, intact proteins are ionized and fragmented within the mass spectrometer.

  15. Molecular characterization of a Penicillium chrysogenum exo-rhamnogalacturonan lyase that is structurally distinct from other polysaccharide lyase family proteins.

    Science.gov (United States)

    Iwai, Marin; Kawakami, Takuya; Ikemoto, Takeshi; Fujiwara, Daisuke; Takenaka, Shigeo; Nakazawa, Masami; Ueda, Mitsuhiro; Sakamoto, Tatsuji

    2015-10-01

    We previously described an endo-acting rhamnogalacturonan (RG) lyase, termed PcRGL4A, of Penicillium chrysogenum 31B. Here, we describe a second RG lyase, called PcRGLX. We determined the cDNA sequence of the Pcrglx gene, which encodes PcRGLX. Based on analyses using a BLAST search and a conserved domain search, PcRGLX was found to be structurally distinct from known RG lyases and might belong to a new polysaccharide lyase family together with uncharacterized fungal proteins of Nectria haematococca, Aspergillus oryzae, and Fusarium oxysporum. The Pcrglx cDNA gene product (rPcRGLX) expressed in Escherichia coli demonstrated specific activity against RG but not against homogalacturonan. Divalent cations were not essential for the enzymatic activity of rPcRGLX. rPcRGLX mainly released unsaturated galacturonosyl rhamnose (ΔGR) from RG backbones used as the substrate from the initial stage of the reaction, indicating that the enzyme can be classified as an exo-acting RG lyase (EC 4.2.2.24). This is the first report of an RG lyase with this mode of action in Eukaryota. rPcRGLX acted synergistically with PcRGL4A to degrade soybean RG and released ΔGR. This ΔGR was partially decorated with galactose (Gal) residues, indicating that rPcRGLX preferred oligomeric RGs to polymeric RGs, that the enzyme did not require Gal decoration of RG backbones for degradation, and that the enzyme bypassed the Gal side chains of RG backbones. These characteristics of rPcRGLX might be useful in the determination of complex structures of pectins.

  16. J-UNIO protocol used for NMR structure determination of the 206-residue protein NP-346487.1 from Streptococcus pneumoniae TIGR4

    Energy Technology Data Exchange (ETDEWEB)

    Jaudzems, Kristaps [Latvian Institute of Organic Synthesis (Latvia); Pedrini, Bill [Paul Scherrer Institute (PSI), SwissFEL Project (Switzerland); Geralt, Michael; Serrano, Pedro; Wüthrich, Kurt, E-mail: wuthrich@scripps.edu [The Scripps Research Institute, Department of Integrative Structural and Computational Biology (United States)

    2015-01-15

    The NMR structure of the 206-residue protein NP-346487.1 was determined with the J-UNIO protocol, which includes extensive automation of the structure determination. With input from three APSY-NMR experiments, UNIO-MATCH automatically yielded 77 % of the backbone assignments, which were interactively validated and extended to 97 %. With an input of the near-complete backbone assignments and three 3D heteronuclear-resolved [{sup 1}H,{sup 1}H]-NOESY spectra, automated side chain assignment with UNIO-ATNOS/ASCAN resulted in 77 % of the expected assignments, which was extended interactively to about 90 %. Automated NOE assignment and structure calculation with UNIO-ATNOS/CANDID in combination with CYANA was used for the structure determination of this two-domain protein. The individual domains in the NMR structure coincide closely with the crystal structure, and the NMR studies further imply that the two domains undergo restricted hinge motions relative to each other in solution. NP-346487.1 is so far the largest polypeptide chain to which the J-UNIO structure determination protocol has successfully been applied.

  17. Protein structure similarity from principle component correlation analysis

    Directory of Open Access Journals (Sweden)

    Chou James

    2006-01-01

    Full Text Available Abstract Background Owing to rapid expansion of protein structure databases in recent years, methods of structure comparison are becoming increasingly effective and important in revealing novel information on functional properties of proteins and their roles in the grand scheme of evolutionary biology. Currently, the structural similarity between two proteins is measured by the root-mean-square-deviation (RMSD in their best-superimposed atomic coordinates. RMSD is the golden rule of measuring structural similarity when the structures are nearly identical; it, however, fails to detect the higher order topological similarities in proteins evolved into different shapes. We propose new algorithms for extracting geometrical invariants of proteins that can be effectively used to identify homologous protein structures or topologies in order to quantify both close and remote structural similarities. Results We measure structural similarity between proteins by correlating the principle components of their secondary structure interaction matrix. In our approach, the Principle Component Correlation (PCC analysis, a symmetric interaction matrix for a protein structure is constructed with relationship parameters between secondary elements that can take the form of distance, orientation, or other relevant structural invariants. When using a distance-based construction in the presence or absence of encoded N to C terminal sense, there are strong correlations between the principle components of interaction matrices of structurally or topologically similar proteins. Conclusion The PCC method is extensively tested for protein structures that belong to the same topological class but are significantly different by RMSD measure. The PCC analysis can also differentiate proteins having similar shapes but different topological arrangements. Additionally, we demonstrate that when using two independently defined interaction matrices, comparison of their maximum

  18. Assessing the structural conservation of protein pockets to study functional and allosteric sites: implications for drug discovery

    Directory of Open Access Journals (Sweden)

    Daura Xavier

    2010-03-01

    Full Text Available Abstract Background With the classical, active-site oriented drug-development approach reaching its limits, protein ligand-binding sites in general and allosteric sites in particular are increasingly attracting the interest of medicinal chemists in the search for new types of targets and strategies to drug development. Given that allostery represents one of the most common and powerful means to regulate protein function, the traditional drug discovery approach of targeting active sites can be extended by targeting allosteric or regulatory protein pockets that may allow the discovery of not only novel drug-like inhibitors, but activators as well. The wealth of available protein structural data can be exploited to further increase our understanding of allosterism, which in turn may have therapeutic applications. A first step in this direction is to identify and characterize putative effector sites that may be present in already available structural data. Results We performed a large-scale study of protein cavities as potential allosteric and functional sites, by integrating publicly available information on protein sequences, structures and active sites for more than a thousand protein families. By identifying common pockets across different structures of the same protein family we developed a method to measure the pocket's structural conservation. The method was first parameterized using known active sites. We characterized the predicted pockets in terms of sequence and structural conservation, backbone flexibility and electrostatic potential. Although these different measures do not tend to correlate, their combination is useful in selecting functional and regulatory sites, as a detailed analysis of a handful of protein families shows. We finally estimated the numbers of potential allosteric or regulatory pockets that may be present in the data set, finding that pockets with putative functional and effector characteristics are widespread across

  19. Nonlinear deterministic structures and the randomness of protein sequences

    CERN Document Server

    Huang Yan Zhao

    2003-01-01

    To clarify the randomness of protein sequences, we make a detailed analysis of a set of typical protein sequences representing each structural classes by using nonlinear prediction method. No deterministic structures are found in these protein sequences and this implies that they behave as random sequences. We also give an explanation to the controversial results obtained in previous investigations.

  20. The structure of a cholesterol-trapping protein

    Science.gov (United States)

    cholesterol-trapping protein Contact: Dan Krotz, dakrotz@lbl.gov Berkeley Lab Science Beat Lab website index Institute researchers determined the three-dimensional structure of a protein that controls cholesterol level in the bloodstream. Knowing the structure of the protein, a cellular receptor that ensnares

  1. STRUCTURAL FEATURES OF PLANT CHITINASES AND CHITIN-BINDING PROTEINS

    NARCIS (Netherlands)

    BEINTEMA, JJ

    1994-01-01

    Structural features of plant chitinases and chitin-binding proteins are discussed. Many of these proteins consist of multiple domains,of which the chitin-binding hevein domain is a predominant one. X-ray and NMR structures of representatives of the major classes of these proteins are available now,

  2. Conformation-specific spectroscopy of capped glutamine-containing peptides: role of a single glutamine residue on peptide backbone preferences.

    Science.gov (United States)

    Walsh, Patrick S; Dean, Jacob C; McBurney, Carl; Kang, Hyuk; Gellman, Samuel H; Zwier, Timothy S

    2016-04-28

    The conformational preferences of a series of short, aromatic-capped, glutamine-containing peptides have been studied under jet-cooled conditions in the gas phase. This work seeks a bottom-up understanding of the role played by glutamine residues in directing peptide structures that lead to neurodegenerative diseases. Resonant ion-dip infrared (RIDIR) spectroscopy is used to record single-conformation infrared spectra in the NH stretch, amide I and amide II regions. Comparison of the experimental spectra with the predictions of calculations carried out at the DFT M05-2X/6-31+G(d) level of theory lead to firm assignments for the H-bonding architectures of a total of eight conformers of four molecules, including three in Z-Gln-OH, one in Z-Gln-NHMe, three in Ac-Gln-NHBn, and one in Ac-Ala-Gln-NHBn. The Gln side chain engages actively in forming H-bonds with nearest-neighbor amide groups, forming C8 H-bonds to the C-terminal side, C9 H-bonds to the N-terminal side, and an amide-stacked geometry, all with an extended (C5) peptide backbone about the Gln residue. The Gln side chain also stabilizes an inverse γ-turn in the peptide backbone by forming a pair of H-bonds that bridge the γ-turn and stabilize it. Finally, the entire conformer population of Ac-Ala-Gln-NHBn is funneled into a single structure that incorporates the peptide backbone in a type I β-turn, stabilized by the Gln side chain forming a C7 H-bond to the central amide group in the β-turn not otherwise involved in a hydrogen bond. This β-turn backbone structure is nearly identical to that observed in a series of X-(AQ)-Y β-turns in the protein data bank, demonstrating that the gas-phase structure is robust to perturbations imposed by the crystalline protein environment.

  3. Synonymous codon bias and functional constraint on GC3-related DNA backbone dynamics in the prokaryotic nucleoid.

    Science.gov (United States)

    Babbitt, Gregory A; Alawad, Mohammed A; Schulze, Katharina V; Hudson, André O

    2014-01-01

    While mRNA stability has been demonstrated to control rates of translation, generating both global and local synonymous codon biases in many unicellular organisms, this explanation cannot adequately explain why codon bias strongly tracks neighboring intergene GC content; suggesting that structural dynamics of DNA might also influence codon choice. Because minor groove width is highly governed by 3-base periodicity in GC, the existence of triplet-based codons might imply a functional role for the optimization of local DNA molecular dynamics via GC content at synonymous sites (≈GC3). We confirm a strong association between GC3-related intrinsic DNA flexibility and codon bias across 24 different prokaryotic multiple whole-genome alignments. We develop a novel test of natural selection targeting synonymous sites and demonstrate that GC3-related DNA backbone dynamics have been subject to moderate selective pressure, perhaps contributing to our observation that many genes possess extreme DNA backbone dynamics for their given protein space. This dual function of codons may impose universal functional constraints affecting the evolution of synonymous and non-synonymous sites. We propose that synonymous sites may have evolved as an 'accessory' during an early expansion of a primordial genetic code, allowing for multiplexed protein coding and structural dynamic information within the same molecular context. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Towards fully automated structure-based NMR resonance assignment of 15N-labeled proteins from automatically picked peaks

    KAUST Repository

    Jang, Richard; Gao, Xin; Li, Ming

    2011-01-01

    In NMR resonance assignment, an indispensable step in NMR protein studies, manually processed peaks from both N-labeled and C-labeled spectra are typically used as inputs. However, the use of homologous structures can allow one to use only N-labeled NMR data and avoid the added expense of using C-labeled data. We propose a novel integer programming framework for structure-based backbone resonance assignment using N-labeled data. The core consists of a pair of integer programming models: one for spin system forming and amino acid typing, and the other for backbone resonance assignment. The goal is to perform the assignment directly from spectra without any manual intervention via automatically picked peaks, which are much noisier than manually picked peaks, so methods must be error-tolerant. In the case of semi-automated/manually processed peak data, we compare our system with the Xiong-Pandurangan-Bailey- Kellogg's contact replacement (CR) method, which is the most error-tolerant method for structure-based resonance assignment. Our system, on average, reduces the error rate of the CR method by five folds on their data set. In addition, by using an iterative algorithm, our system has the added capability of using the NOESY data to correct assignment errors due to errors in predicting the amino acid and secondary structure type of each spin system. On a publicly available data set for human ubiquitin, where the typing accuracy is 83%, we achieve 91% accuracy, compared to the 59% accuracy obtained without correcting for such errors. In the case of automatically picked peaks, using assignment information from yeast ubiquitin, we achieve a fully automatic assignment with 97% accuracy. To our knowledge, this is the first system that can achieve fully automatic structure-based assignment directly from spectra. This has implications in NMR protein mutant studies, where the assignment step is repeated for each mutant. © Copyright 2011, Mary Ann Liebert, Inc.

  5. Towards fully automated structure-based NMR resonance assignment of 15N-labeled proteins from automatically picked peaks

    KAUST Repository

    Jang, Richard

    2011-03-01

    In NMR resonance assignment, an indispensable step in NMR protein studies, manually processed peaks from both N-labeled and C-labeled spectra are typically used as inputs. However, the use of homologous structures can allow one to use only N-labeled NMR data and avoid the added expense of using C-labeled data. We propose a novel integer programming framework for structure-based backbone resonance assignment using N-labeled data. The core consists of a pair of integer programming models: one for spin system forming and amino acid typing, and the other for backbone resonance assignment. The goal is to perform the assignment directly from spectra without any manual intervention via automatically picked peaks, which are much noisier than manually picked peaks, so methods must be error-tolerant. In the case of semi-automated/manually processed peak data, we compare our system with the Xiong-Pandurangan-Bailey- Kellogg\\'s contact replacement (CR) method, which is the most error-tolerant method for structure-based resonance assignment. Our system, on average, reduces the error rate of the CR method by five folds on their data set. In addition, by using an iterative algorithm, our system has the added capability of using the NOESY data to correct assignment errors due to errors in predicting the amino acid and secondary structure type of each spin system. On a publicly available data set for human ubiquitin, where the typing accuracy is 83%, we achieve 91% accuracy, compared to the 59% accuracy obtained without correcting for such errors. In the case of automatically picked peaks, using assignment information from yeast ubiquitin, we achieve a fully automatic assignment with 97% accuracy. To our knowledge, this is the first system that can achieve fully automatic structure-based assignment directly from spectra. This has implications in NMR protein mutant studies, where the assignment step is repeated for each mutant. © Copyright 2011, Mary Ann Liebert, Inc.

  6. Functional structural motifs for protein-ligand, protein-protein, and protein-nucleic acid interactions and their connection to supersecondary structures.

    Science.gov (United States)

    Kinjo, Akira R; Nakamura, Haruki

    2013-01-01

    Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has particularly high propensity to be found interaction interfaces in general, indicating any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.

  7. Solution structure of human intestinal fatty acid binding protein: Implications for ligand entry and exit

    International Nuclear Information System (INIS)

    Zhang Fengli; Luecke, Christian; Baier, Leslie J.; Sacchettini, James C.; Hamilton, James A.

    1997-01-01

    The human intestinal fatty acid binding protein (I-FABP) is a small (131 amino acids) protein which binds dietary long-chain fatty acids in the cytosol of enterocytes. Recently, an alanine to threonine substitution at position 54 in I-FABP has been identified which affects fatty acid binding and transport, and is associated with the development of insulin resistance in several populations including Mexican-Americans and Pima Indians. To investigate the molecular basis of the binding properties of I-FABP, the 3D solution structure of the more common form of human I-FABP (Ala54) was studied by multidimensional NMR spectroscopy.Recombinant I-FABP was expressed from E. coli in the presence and absence of 15N-enriched media. The sequential assignments for non-delipidated I-FABP were completed by using 2D homonuclear spectra (COSY, TOCSY and NOESY) and 3D heteronuclear spectra(NOESY-HMQC and TOCSY-HMQC). The tertiary structure of human I-FABP was calculated by using the distance geometry program DIANA based on 2519 distance constraints obtained from the NMR data. Subsequent energy minimization was carried out by using the program SYBYL in the presence of distance constraints. The conformation of human I-FABP consists of 10 antiparallel β-strands which form two nearly orthogonal β-sheets of five strands each, and two short α-helices that connect the β-strands A and B. The interior of the protein consists of a water-filled cavity between the two β-sheets. The NMR solution structure of human I-FABP is similar to the crystal structure of rat I-FABP.The NMR results show significant conformational variability of certain backbone segments around the postulated portal region for the entry and exit of fatty acid ligand

  8. Solution structure of human intestinal fatty acid binding protein: Implications for ligand entry and exit

    Energy Technology Data Exchange (ETDEWEB)

    Zhang Fengli [Boston University School of Medicine, Department of Biophysics (United States); Luecke, Christian [Johann Wolfgang Goethe-Universitaet (Germany); Baier, Leslie J. [NIDDK, NIH, Phoenix Epidemiology and Clinical Research Branch (United States); Sacchettini, James C. [Texas A and M University, Department of Biochemistry and Biophysics (United States); Hamilton, James A. [Boston University School of Medicine, Department of Biophysics (United States)

    1997-04-15

    The human intestinal fatty acid binding protein (I-FABP) is a small (131 amino acids) protein which binds dietary long-chain fatty acids in the cytosol of enterocytes. Recently, an alanine to threonine substitution at position 54 in I-FABP has been identified which affects fatty acid binding and transport, and is associated with the development of insulin resistance in several populations including Mexican-Americans and Pima Indians. To investigate the molecular basis of the binding properties of I-FABP, the 3D solution structure of the more common form of human I-FABP (Ala54) was studied by multidimensional NMR spectroscopy.Recombinant I-FABP was expressed from E. coli in the presence and absence of 15N-enriched media. The sequential assignments for non-delipidated I-FABP were completed by using 2D homonuclear spectra (COSY, TOCSY and NOESY) and 3D heteronuclear spectra(NOESY-HMQC and TOCSY-HMQC). The tertiary structure of human I-FABP was calculated by using the distance geometry program DIANA based on 2519 distance constraints obtained from the NMR data. Subsequent energy minimization was carried out by using the program SYBYL in the presence of distance constraints. The conformation of human I-FABP consists of 10 antiparallel {beta}-strands which form two nearly orthogonal {beta}-sheets of five strands each, and two short {alpha}-helices that connect the {beta}-strands A and B. The interior of the protein consists of a water-filled cavity between the two {beta}-sheets. The NMR solution structure of human I-FABP is similar to the crystal structure of rat I-FABP.The NMR results show significant conformational variability of certain backbone segments around the postulated portal region for the entry and exit of fatty acid ligand.

  9. Structural basis for recognition of synaptic vesicle protein 2C by botulinum neurotoxin A

    Science.gov (United States)

    Benoit, Roger M.; Frey, Daniel; Hilbert, Manuel; Kevenaar, Josta T.; Wieser, Mara M.; Stirnimann, Christian U.; McMillan, David; Ceska, Tom; Lebon, Florence; Jaussi, Rolf; Steinmetz, Michel O.; Schertler, Gebhard F. X.; Hoogenraad, Casper C.; Capitani, Guido; Kammerer, Richard A.

    2014-01-01

    Botulinum neurotoxin A (BoNT/A) belongs to the most dangerous class of bioweapons. Despite this, BoNT/A is used to treat a wide range of common medical conditions such as migraines and a variety of ocular motility and movement disorders. BoNT/A is probably best known for its use as an antiwrinkle agent in cosmetic applications (including Botox and Dysport). BoNT/A application causes long-lasting flaccid paralysis of muscles through inhibiting the release of the neurotransmitter acetylcholine by cleaving synaptosomal-associated protein 25 (SNAP-25) within presynaptic nerve terminals. Two types of BoNT/A receptor have been identified, both of which are required for BoNT/A toxicity and are therefore likely to cooperate with each other: gangliosides and members of the synaptic vesicle glycoprotein 2 (SV2) family, which are putative transporter proteins that are predicted to have 12 transmembrane domains, associate with the receptor-binding domain of the toxin. Recently, fibroblast growth factor receptor 3 (FGFR3) has also been reported to be a potential BoNT/A receptor. In SV2 proteins, the BoNT/A-binding site has been mapped to the luminal domain, but the molecular details of the interaction between BoNT/A and SV2 are unknown. Here we determined the high-resolution crystal structure of the BoNT/A receptor-binding domain (BoNT/A-RBD) in complex with the SV2C luminal domain (SV2C-LD). SV2C-LD consists of a right-handed, quadrilateral β-helix that associates with BoNT/A-RBD mainly through backbone-to-backbone interactions at open β-strand edges, in a manner that resembles the inter-strand interactions in amyloid structures. Competition experiments identified a peptide that inhibits the formation of the complex. Our findings provide a strong platform for the development of novel antitoxin agents and for the rational design of BoNT/A variants with improved therapeutic properties.

  10. CMsearch: simultaneous exploration of protein sequence space and structure space improves not only protein homology detection but also protein structure prediction

    KAUST Repository

    Cui, Xuefeng; Lu, Zhiwu; Wang, Sheng; Jing-Yan Wang, Jim; Gao, Xin

    2016-01-01

    Motivation: Protein homology detection, a fundamental problem in computational biology, is an indispensable step toward predicting protein structures and understanding protein functions. Despite the advances in recent decades on sequence alignment

  11. Characterization of Bifunctional Spin Labels for Investigating the Structural and Dynamic Properties of Membrane Proteins Using EPR Spectroscopy.

    Science.gov (United States)

    Sahu, Indra D; Craig, Andrew F; Dunagum, Megan M; McCarrick, Robert M; Lorigan, Gary A

    2017-10-05

    Site-directed spin labeling (SDSL) coupled with electron paramagnetic resonance (EPR) spectroscopy is a very powerful technique to study structural and dynamic properties of membrane proteins. The most widely used spin label is methanthiosulfonate (MTSL). However, the flexibility of this spin label introduces greater uncertainties in EPR measurements obtained for determining structures, side-chain dynamics, and backbone motion of membrane protein systems. Recently, a newer bifunctional spin label (BSL), 3,4-bis(methanethiosulfonylmethyl)-2,2,5,5-tetramethyl-2,5-dihydro-1H-pyrrol-1-yloxy, has been introduced to overcome the dynamic limitations associated with the MTSL spin label and has been invaluable in determining protein backbone dynamics and inter-residue distances due to its restricted internal motion and fewer size restrictions. While BSL has been successful in providing more accurate information about the structure and dynamics of several proteins, a detailed characterization of the spin label is still lacking. In this study, we characterized BSLs by performing CW-EPR spectral line shape analysis as a function of temperature on spin-labeled sites inside and outside of the membrane for the integral membrane protein KCNE1 in POPC/POPG lipid bilayers and POPC/POPG lipodisq nanoparticles. The experimental data revealed a powder pattern spectral line shape for all of the KCNE1-BSL samples at 296 K, suggesting the motion of BSLs approaches the rigid limit regime for these series of samples. BSLs were further utilized to report for the first time the distance measurement between two BSLs attached on an integral membrane protein KCNE1 in POPC/POPG lipid bilayers at room temperature using dipolar line broadening CW-EPR spectroscopy. The CW dipolar line broadening EPR data revealed a 15 ± 2 Å distance between doubly attached BSLs on KCNE1 (53/57-63/67) which is consistent with molecular dynamics modeling and the solution NMR structure of KCNE1 which yielded a

  12. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-01-01

    operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching

  13. K-nearest uphill clustering in the protein structure space

    KAUST Repository

    Cui, Xuefeng

    2016-08-26

    The protein structure classification problem, which is to assign a protein structure to a cluster of similar proteins, is one of the most fundamental problems in the construction and application of the protein structure space. Early manually curated protein structure classifications (e.g., SCOP and CATH) are very successful, but recently suffer the slow updating problem because of the increased throughput of newly solved protein structures. Thus, fully automatic methods to cluster proteins in the protein structure space have been designed and developed. In this study, we observed that the SCOP superfamilies are highly consistent with clustering trees representing hierarchical clustering procedures, but the tree cutting is very challenging and becomes the bottleneck of clustering accuracy. To overcome this challenge, we proposed a novel density-based K-nearest uphill clustering method that effectively eliminates noisy pairwise protein structure similarities and identifies density peaks as cluster centers. Specifically, the density peaks are identified based on K-nearest uphills (i.e., proteins with higher densities) and K-nearest neighbors. To our knowledge, this is the first attempt to apply and develop density-based clustering methods in the protein structure space. Our results show that our density-based clustering method outperforms the state-of-the-art clustering methods previously applied to the problem. Moreover, we observed that computational methods and human experts could produce highly similar clusters at high precision values, while computational methods also suggest to split some large superfamilies into smaller clusters. © 2016 Elsevier B.V.

  14. Use of designed sequences in protein structure recognition.

    Science.gov (United States)

    Kumar, Gayatri; Mudgal, Richa; Srinivasan, Narayanaswamy; Sandhya, Sankaran

    2018-05-09

    Knowledge of the protein structure is a pre-requisite for improved understanding of molecular function. The gap in the sequence-structure space has increased in the post-genomic era. Grouping related protein sequences into families can aid in narrowing the gap. In the Pfam database, structure description is provided for part or full-length proteins of 7726 families. For the remaining 52% of the families, information on 3-D structure is not yet available. We use the computationally designed sequences that are intermediately related to two protein domain families, which are already known to share the same fold. These strategically designed sequences enable detection of distant relationships and here, we have employed them for the purpose of structure recognition of protein families of yet unknown structure. We first measured the success rate of our approach using a dataset of protein families of known fold and achieved a success rate of 88%. Next, for 1392 families of yet unknown structure, we made structural assignments for part/full length of the proteins. Fold association for 423 domains of unknown function (DUFs) are provided as a step towards functional annotation. The results indicate that knowledge-based filling of gaps in protein sequence space is a lucrative approach for structure recognition. Such sequences assist in traversal through protein sequence space and effectively function as 'linkers', where natural linkers between distant proteins are unavailable. This article was reviewed by Oliviero Carugo, Christine Orengo and Srikrishna Subramanian.

  15. Using an alignment of fragment strings for comparing protein structures

    DEFF Research Database (Denmark)

    Friedberg, Iddo; Harder, Tim; Kolodny, Rachel

    2007-01-01

    . RESULTS: Here we describe the use of a particular structure fragment library, denoted here as KL-strings, for the 1D representation of protein structure. Using KL-strings, we develop an infrastructure for comparing protein structures with a 1D representation. This study focuses on the added value gained...

  16. Rheology and structure of milk protein gels

    NARCIS (Netherlands)

    Vliet, van T.; Lakemond, C.M.M.; Visschers, R.W.

    2004-01-01

    Recent studies on gel formation and rheology of milk gels are reviewed. A distinction is made between gels formed by aggregated casein, gels of `pure` whey proteins and gels in which both casein and whey proteins contribute to their properties. For casein' whey protein mixtures, it has been shown

  17. Implementation of a Parallel Protein Structure Alignment Service on Cloud

    Directory of Open Access Journals (Sweden)

    Che-Lun Hung

    2013-01-01

    Full Text Available Protein structure alignment has become an important strategy by which to identify evolutionary relationships between protein sequences. Several alignment tools are currently available for online comparison of protein structures. In this paper, we propose a parallel protein structure alignment service based on the Hadoop distribution framework. This service includes a protein structure alignment algorithm, a refinement algorithm, and a MapReduce programming model. The refinement algorithm refines the result of alignment. To process vast numbers of protein structures in parallel, the alignment and refinement algorithms are implemented using MapReduce. We analyzed and compared the structure alignments produced by different methods using a dataset randomly selected from the PDB database. The experimental results verify that the proposed algorithm refines the resulting alignments more accurately than existing algorithms. Meanwhile, the computational performance of the proposed service is proportional to the number of processors used in our cloud platform.

  18. Structural Insights into RNA Recognition by the Alternate-Splicing Regulator CUG-Binding Protein 1

    Energy Technology Data Exchange (ETDEWEB)

    M Teplova; J Song; H Gaw; A Teplov; D Patel

    2011-12-31

    CUG-binding protein 1 (CUGBP1) regulates multiple aspects of nuclear and cytoplasmic mRNA processing, with implications for onset of myotonic dystrophy. CUGBP1 harbors three RRM domains and preferentially targets UGU-rich mRNA elements. We describe crystal structures of CUGBP1 RRM1 and tandem RRM1/2 domains bound to RNAs containing tandem UGU(U/G) elements. Both RRM1 in RRM1-RNA and RRM2 in RRM1/2-RNA complexes use similar principles to target UGU(U/G) elements, with recognition mediated by face-to-edge stacking and water-mediated hydrogen-bonding networks. The UG step adopts a left-handed Z-RNA conformation, with the syn guanine recognized through Hoogsteen edge-protein backbone hydrogen-bonding interactions. NMR studies on the RRM1/2-RNA complex establish that both RRM domains target tandem UGUU motifs in solution, whereas filter-binding assays identify a preference for recognition of GU over AU or GC steps. We discuss the implications of CUGBP1-mediated targeting and sequestration of UGU(U/G) elements on pre-mRNA alternative-splicing regulation, translational regulation, and mRNA decay.

  19. Porous solid backbone impregnation for electrochemical energy conversion systems

    KAUST Repository

    Boulfrad, Samir

    2013-09-19

    An apparatus and method for impregnating a porous solid backbone. The apparatus may include a platform for holding a porous solid backbone, an ink jet nozzle configured to dispense a liquid solution onto the porous solid backbone, a positioning mechanism configured to position the ink jet nozzle proximate to a plurality of locations of the porous solid backbone, and a control unit configured to control the positioning mechanism to position the ink jet nozzle proximate to the plurality of locations and cause the ink jet nozzle to dispense the liquid solution onto the porous solid backbone.

  20. Porous solid backbone impregnation for electrochemical energy conversion systems

    KAUST Repository

    Boulfrad, Samir; Jabbour, Ghassan

    2013-01-01

    An apparatus and method for impregnating a porous solid backbone. The apparatus may include a platform for holding a porous solid backbone, an ink jet nozzle configured to dispense a liquid solution onto the porous solid backbone, a positioning mechanism configured to position the ink jet nozzle proximate to a plurality of locations of the porous solid backbone, and a control unit configured to control the positioning mechanism to position the ink jet nozzle proximate to the plurality of locations and cause the ink jet nozzle to dispense the liquid solution onto the porous solid backbone.

  1. Compare local pocket and global protein structure models by small structure patterns

    KAUST Repository

    Cui, Xuefeng; Kuwahara, Hiroyuki; Li, Shuai Cheng; Gao, Xin

    2015-01-01

    Researchers proposed several criteria to assess the quality of predicted protein structures because it is one of the essential tasks in the Critical Assessment of Techniques for Protein Structure Prediction (CASP) competitions. Popular criteria

  2. PDB2CD visualises dynamics within protein structures.

    Science.gov (United States)

    Janes, Robert W

    2017-10-01

    Proteins tend to have defined conformations, a key factor in enabling their function. Atomic resolution structures of proteins are predominantly obtained by either solution nuclear magnetic resonance (NMR) or crystal structure methods. However, when considering a protein whose structure has been determined by both these approaches, on many occasions, the resultant conformations are subtly different, as illustrated by the examples in this study. The solution NMR approach invariably results in a cluster of structures whose conformations satisfy the distance boundaries imposed by the data collected; it might be argued that this is evidence of the dynamics of proteins when in solution. In crystal structures, the proteins are often in an energy minimum state which can result in an increase in the extent of regular secondary structure present relative to the solution state depicted by NMR, because the more dynamic ends of alpha helices and beta strands can become ordered at the lower temperatures. This study examines a novel way to display the differences in conformations within an NMR ensemble and between these and a crystal structure of a protein. Circular dichroism (CD) spectroscopy can be used to characterise protein structures in solution. Using the new bioinformatics tool, PDB2CD, which generates CD spectra from atomic resolution protein structures, the differences between, and possible dynamic range of, conformations adopted by a protein can be visualised.

  3. DNA mimic proteins: functions, structures, and bioinformatic analysis.

    Science.gov (United States)

    Wang, Hao-Ching; Ho, Chun-Han; Hsu, Kai-Cheng; Yang, Jinn-Moon; Wang, Andrew H-J

    2014-05-13

    DNA mimic proteins have DNA-like negative surface charge distributions, and they function by occupying the DNA binding sites of DNA binding proteins to prevent these sites from being accessed by DNA. DNA mimic proteins control the activities of a variety of DNA binding proteins and are involved in a wide range of cellular mechanisms such as chromatin assembly, DNA repair, transcription regulation, and gene recombination. However, the sequences and structures of DNA mimic proteins are diverse, making them difficult to predict by bioinformatic search. To date, only a few DNA mimic proteins have been reported. These DNA mimics were not found by searching for functional motifs in their sequences but were revealed only by structural analysis of their charge distribution. This review highlights the biological roles and structures of 16 reported DNA mimic proteins. We also discuss approaches that might be used to discover new DNA mimic proteins.

  4. The crystal structure of the Dachshund domain of human SnoN reveals flexibility in the putative protein interaction surface.

    Directory of Open Access Journals (Sweden)

    Tomas Nyman

    2010-09-01

    Full Text Available The human SnoN is an oncoprotein that interacts with several transcription-regulatory proteins such as the histone-deacetylase, N-CoR containing co-repressor complex and Smad proteins. This study presents the crystal structure of the Dachshund homology domain of human SnoN. The structure reveals a groove composed of conserved residues with characteristic properties of a protein-interaction surface. A comparison of the 12 monomers in the asymmetric unit reveals the presence of two major conformations: an open conformation with a well accessible groove and a tight conformation with a less accessible groove. The variability in the backbone between the open and the tight conformations matches the differences seen in previously determined structures of individual Dachshund homology domains, suggesting a general plasticity within this fold family. The flexibility observed in the putative protein binding groove may enable SnoN to recognize multiple interaction partners.This article can also be viewed as an enhanced version in which the text of the article is integrated with interactive 3D representations and animated transitions. Please note that a web plugin is required to access this enhanced functionality. Instructions for the installation and use of the web plugin are available in Text S1.

  5. Instant Backbone.js application development

    CERN Document Server

    Hunter, Thomas

    2013-01-01

    Get to grips with a new technology, understand what it is and what it can do for you, and then get to work with the most important features and tasks. This book is a practical, step-by-step tutorial that will teach you to build Backbone.js applications quickly and efficiently.This book is targeted towards developers. It is assumed that you have at least a basic understanding of JavaScript and jQuery selectors. If you are interested in building dynamic Single Page Applications that interact heavily with a backend server, then this is the book for you.

  6. Relation between native ensembles and experimental structures of proteins

    DEFF Research Database (Denmark)

    Best, R. B.; Lindorff-Larsen, Kresten; DePristo, M. A.

    2006-01-01

    Different experimental structures of the same protein or of proteins with high sequence similarity contain many small variations. Here we construct ensembles of "high-sequence similarity Protein Data Bank" (HSP) structures and consider the extent to which such ensembles represent the structural...... Data Bank ensembles; moreover, we show that the effects of uncertainties in structure determination are insufficient to explain the results. These results highlight the importance of accounting for native-state protein dynamics in making comparisons with ensemble-averaged experimental data and suggest...... heterogeneity of the native state in solution. We find that different NMR measurements probing structure and dynamics of given proteins in solution, including order parameters, scalar couplings, and residual dipolar couplings, are remarkably well reproduced by their respective high-sequence similarity Protein...

  7. Crystal structures of barley thioredoxin h isoforms HvTrxh1 and HvTrxh2 reveal features involved in protein recognition and possibly in discriminating the isoform specificity

    DEFF Research Database (Denmark)

    Maeda, Kenji; Hägglund, Per; Finnie, Christine

    2008-01-01

    segment of one HvTrxh1 molecule is positioned along a shallow hydrophobic groove at the primary nucleophile Cys40 of another HvTrxh1 molecule. The association mode can serve as a model for the target protein recognition by Trx, as it brings the Met82 C gamma atom (gamma position as a disulfide sulfur......) of the bound loop segment in the proximity of the Cys40 thiol. The interaction involves three characteristic backbone-backbone hydrogen bonds in an antiparallel beta-sheet-like arrangement, similar to the arrangement observed in the structure of an engineered, covalently bound complex between Trx...... and a substrate protein, as reported by Maeda et al. in an earlier paper. The occurrence of an intermolecular salt bridge between Glu80 of the bound loop segment and Arg101 near the hydrophobic groove suggests that charge complementarity plays a role in the specificity of Trx. In HvTrxh2, isoleucine corresponds...

  8. Current strategies for protein production and purification enabling membrane protein structural biology.

    Science.gov (United States)

    Pandey, Aditya; Shin, Kyungsoo; Patterson, Robin E; Liu, Xiang-Qin; Rainey, Jan K

    2016-12-01

    Membrane proteins are still heavily under-represented in the protein data bank (PDB), owing to multiple bottlenecks. The typical low abundance of membrane proteins in their natural hosts makes it necessary to overexpress these proteins either in heterologous systems or through in vitro translation/cell-free expression. Heterologous expression of proteins, in turn, leads to multiple obstacles, owing to the unpredictability of compatibility of the target protein for expression in a given host. The highly hydrophobic and (or) amphipathic nature of membrane proteins also leads to challenges in producing a homogeneous, stable, and pure sample for structural studies. Circumventing these hurdles has become possible through the introduction of novel protein production protocols; efficient protein isolation and sample preparation methods; and, improvement in hardware and software for structural characterization. Combined, these advances have made the past 10-15 years very exciting and eventful for the field of membrane protein structural biology, with an exponential growth in the number of solved membrane protein structures. In this review, we focus on both the advances and diversity of protein production and purification methods that have allowed this growth in structural knowledge of membrane proteins through X-ray crystallography, nuclear magnetic resonance (NMR) spectroscopy, and cryo-electron microscopy (cryo-EM).

  9. Integrating NOE and RDC using sum-of-squares relaxation for protein structure determination.

    Science.gov (United States)

    Khoo, Y; Singer, A; Cowburn, D

    2017-07-01

    We revisit the problem of protein structure determination from geometrical restraints from NMR, using convex optimization. It is well-known that the NP-hard distance geometry problem of determining atomic positions from pairwise distance restraints can be relaxed into a convex semidefinite program (SDP). However, often the NOE distance restraints are too imprecise and sparse for accurate structure determination. Residual dipolar coupling (RDC) measurements provide additional geometric information on the angles between atom-pair directions and axes of the principal-axis-frame. The optimization problem involving RDC is highly non-convex and requires a good initialization even within the simulated annealing framework. In this paper, we model the protein backbone as an articulated structure composed of rigid units. Determining the rotation of each rigid unit gives the full protein structure. We propose solving the non-convex optimization problems using the sum-of-squares (SOS) hierarchy, a hierarchy of convex relaxations with increasing complexity and approximation power. Unlike classical global optimization approaches, SOS optimization returns a certificate of optimality if the global optimum is found. Based on the SOS method, we proposed two algorithms-RDC-SOS and RDC-NOE-SOS, that have polynomial time complexity in the number of amino-acid residues and run efficiently on a standard desktop. In many instances, the proposed methods exactly recover the solution to the original non-convex optimization problem. To the best of our knowledge this is the first time SOS relaxation is introduced to solve non-convex optimization problems in structural biology. We further introduce a statistical tool, the Cramér-Rao bound (CRB), to provide an information theoretic bound on the highest resolution one can hope to achieve when determining protein structure from noisy measurements using any unbiased estimator. Our simulation results show that when the RDC measurements are

  10. Predicting nucleic acid binding interfaces from structural models of proteins.

    Science.gov (United States)

    Dror, Iris; Shazman, Shula; Mukherjee, Srayanta; Zhang, Yang; Glaser, Fabian; Mandel-Gutfreund, Yael

    2012-02-01

    The function of DNA- and RNA-binding proteins can be inferred from the characterization and accurate prediction of their binding interfaces. However, the main pitfall of various structure-based methods for predicting nucleic acid binding function is that they are all limited to a relatively small number of proteins for which high-resolution three-dimensional structures are available. In this study, we developed a pipeline for extracting functional electrostatic patches from surfaces of protein structural models, obtained using the I-TASSER protein structure predictor. The largest positive patches are extracted from the protein surface using the patchfinder algorithm. We show that functional electrostatic patches extracted from an ensemble of structural models highly overlap the patches extracted from high-resolution structures. Furthermore, by testing our pipeline on a set of 55 known nucleic acid binding proteins for which I-TASSER produces high-quality models, we show that the method accurately identifies the nucleic acids binding interface on structural models of proteins. Employing a combined patch approach we show that patches extracted from an ensemble of models better predicts the real nucleic acid binding interfaces compared with patches extracted from independent models. Overall, these results suggest that combining information from a collection of low-resolution structural models could be a valuable approach for functional annotation. We suggest that our method will be further applicable for predicting other functional surfaces of proteins with unknown structure. Copyright © 2011 Wiley Periodicals, Inc.

  11. Ion pairs in non-redundant protein structures

    Indian Academy of Sciences (India)

    Ion pairs contribute to several functions including the activity of catalytic triads, fusion of viral membranes, stability in thermophilic proteins and solvent–protein interactions. Furthermore, they have the ability to affect the stability of protein structures and are also a part of the forces that act to hold monomers together.

  12. The structure and function of endophilin proteins

    DEFF Research Database (Denmark)

    Kjaerulff, Ole; Brodin, Lennart; Jung, Anita

    2011-01-01

    Members of the BAR domain protein superfamily are essential elements of cellular traffic. Endophilins are among the best studied BAR domain proteins. They have a prominent function in synaptic vesicle endocytosis (SVE), receptor trafficking and apoptosis, and in other processes that require...

  13. AUTOBA: automation of backbone assignment from HN(C)N suite of experiments.

    Science.gov (United States)

    Borkar, Aditi; Kumar, Dinesh; Hosur, Ramakrishna V

    2011-07-01

    Development of efficient strategies and automation represent important milestones of progress in rapid structure determination efforts in proteomics research. In this context, we present here an efficient algorithm named as AUTOBA (Automatic Backbone Assignment) designed to automate the assignment protocol based on HN(C)N suite of experiments. Depending upon the spectral dispersion, the user can record 2D or 3D versions of the experiments for assignment. The algorithm uses as inputs: (i) protein primary sequence and (ii) peak-lists from user defined HN(C)N suite of experiments. In the end, one gets H(N), (15)N, C(α) and C' assignments (in common BMRB format) for the individual residues along the polypeptide chain. The success of the algorithm has been demonstrated, not only with experimental spectra recorded on two small globular proteins: ubiquitin (76 aa) and M-crystallin (85 aa), but also with simulated spectra of 27 other proteins using assignment data from the BMRB.

  14. Combining NMR ensembles and molecular dynamics simulations provides more realistic models of protein structures in solution and leads to better chemical shift prediction

    International Nuclear Information System (INIS)

    Lehtivarjo, Juuso; Tuppurainen, Kari; Hassinen, Tommi; Laatikainen, Reino; Peräkylä, Mikael

    2012-01-01

    While chemical shifts are invaluable for obtaining structural information from proteins, they also offer one of the rare ways to obtain information about protein dynamics. A necessary tool in transforming chemical shifts into structural and dynamic information is chemical shift prediction. In our previous work we developed a method for 4D prediction of protein 1 H chemical shifts in which molecular motions, the 4th dimension, were modeled using molecular dynamics (MD) simulations. Although the approach clearly improved the prediction, the X-ray structures and single NMR conformers used in the model cannot be considered fully realistic models of protein in solution. In this work, NMR ensembles (NMRE) were used to expand the conformational space of proteins (e.g. side chains, flexible loops, termini), followed by MD simulations for each conformer to map the local fluctuations. Compared with the non-dynamic model, the NMRE+MD model gave 6–17% lower root-mean-square (RMS) errors for different backbone nuclei. The improved prediction indicates that NMR ensembles with MD simulations can be used to obtain a more realistic picture of protein structures in solutions and moreover underlines the importance of short and long time-scale dynamics for the prediction. The RMS errors of the NMRE+MD model were 0.24, 0.43, 0.98, 1.03, 1.16 and 2.39 ppm for 1 Hα, 1 HN, 13 Cα, 13 Cβ, 13 CO and backbone 15 N chemical shifts, respectively. The model is implemented in the prediction program 4DSPOT, available at http://www.uef.fi/4dspothttp://www.uef.fi/4dspot.

  15. Combining NMR ensembles and molecular dynamics simulations provides more realistic models of protein structures in solution and leads to better chemical shift prediction

    Energy Technology Data Exchange (ETDEWEB)

    Lehtivarjo, Juuso, E-mail: juuso.lehtivarjo@uef.fi; Tuppurainen, Kari; Hassinen, Tommi; Laatikainen, Reino [University of Eastern Finland, School of Pharmacy (Finland); Peraekylae, Mikael [University of Eastern Finland, Institute of Biomedicine (Finland)

    2012-03-15

    While chemical shifts are invaluable for obtaining structural information from proteins, they also offer one of the rare ways to obtain information about protein dynamics. A necessary tool in transforming chemical shifts into structural and dynamic information is chemical shift prediction. In our previous work we developed a method for 4D prediction of protein {sup 1}H chemical shifts in which molecular motions, the 4th dimension, were modeled using molecular dynamics (MD) simulations. Although the approach clearly improved the prediction, the X-ray structures and single NMR conformers used in the model cannot be considered fully realistic models of protein in solution. In this work, NMR ensembles (NMRE) were used to expand the conformational space of proteins (e.g. side chains, flexible loops, termini), followed by MD simulations for each conformer to map the local fluctuations. Compared with the non-dynamic model, the NMRE+MD model gave 6-17% lower root-mean-square (RMS) errors for different backbone nuclei. The improved prediction indicates that NMR ensembles with MD simulations can be used to obtain a more realistic picture of protein structures in solutions and moreover underlines the importance of short and long time-scale dynamics for the prediction. The RMS errors of the NMRE+MD model were 0.24, 0.43, 0.98, 1.03, 1.16 and 2.39 ppm for {sup 1}H{alpha}, {sup 1}HN, {sup 13}C{alpha}, {sup 13}C{beta}, {sup 13}CO and backbone {sup 15}N chemical shifts, respectively. The model is implemented in the prediction program 4DSPOT, available at http://www.uef.fi/4dspothttp://www.uef.fi/4dspot.

  16. Improving predicted protein loop structure ranking using a Pareto-optimality consensus method.

    Science.gov (United States)

    Li, Yaohang; Rata, Ionel; Chiu, See-wing; Jakobsson, Eric

    2010-07-20

    Accurate protein loop structure models are important to understand functions of many proteins. Identifying the native or near-native models by distinguishing them from the misfolded ones is a critical step in protein loop structure prediction. We have developed a Pareto Optimal Consensus (POC) method, which is a consensus model ranking approach to integrate multiple knowledge- or physics-based scoring functions. The procedure of identifying the models of best quality in a model set includes: 1) identifying the models at the Pareto optimal front with respect to a set of scoring functions, and 2) ranking them based on the fuzzy dominance relationship to the rest of the models. We apply the POC method to a large number of decoy sets for loops of 4- to 12-residue in length using a functional space composed of several carefully-selected scoring functions: Rosetta, DOPE, DDFIRE, OPLS-AA, and a triplet backbone dihedral potential developed in our lab. Our computational results show that the sets of Pareto-optimal decoys, which are typically composed of approximately 20% or less of the overall decoys in a set, have a good coverage of the best or near-best decoys in more than 99% of the loop targets. Compared to the individual scoring function yielding best selection accuracy in the decoy sets, the POC method yields 23%, 37%, and 64% less false positives in distinguishing the native conformation, indentifying a near-native model (RMSD Pareto optimality and fuzzy dominance, the POC method is effective in distinguishing the best loop models from the other ones within a loop model set.

  17. BLAST-based structural annotation of protein residues using Protein Data Bank.

    Science.gov (United States)

    Singh, Harinder; Raghava, Gajendra P S

    2016-01-25

    In the era of next-generation sequencing where thousands of genomes have been already sequenced; size of protein databases is growing with exponential rate. Structural annotation of these proteins is one of the biggest challenges for the computational biologist. Although, it is easy to perform BLAST search against Protein Data Bank (PDB) but it is difficult for a biologist to annotate protein residues from BLAST search. A web-server StarPDB has been developed for structural annotation of a protein based on its similarity with known protein structures. It uses standard BLAST software for performing similarity search of a query protein against protein structures in PDB. This server integrates wide range modules for assigning different types of annotation that includes, Secondary-structure, Accessible surface area, Tight-turns, DNA-RNA and Ligand modules. Secondary structure module allows users to predict regular secondary structure states to each residue in a protein. Accessible surface area predict the exposed or buried residues in a protein. Tight-turns module is designed to predict tight turns like beta-turns in a protein. DNA-RNA module developed for predicting DNA and RNA interacting residues in a protein. Similarly, Ligand module of server allows one to predicted ligands, metal and nucleotides ligand interacting residues in a protein. In summary, this manuscript presents a web server for comprehensive annotation of a protein based on similarity search. It integrates number of visualization tools that facilitate users to understand structure and function of protein residues. This web server is available freely for scientific community from URL http://crdd.osdd.net/raghava/starpdb .

  18. Constructing Battery-Aware Virtual Backbones in Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Yang Yuanyuan

    2007-01-01

    Full Text Available A critical issue in battery-powered sensor networks is to construct energy efficient virtual backbones for network routing. Recent study in battery technology reveals that batteries tend to discharge more power than needed and reimburse the over-discharged power if they are recovered. In this paper we first provide a mathematical battery model suitable for implementation in sensor networks. We then introduce the concept of battery-aware connected dominating set (BACDS and show that in general the minimum BACDS (MBACDS can achieve longer lifetime than the previous backbone structures. Then we show that finding a MBACDS is NP-hard and give a distributed approximation algorithm to construct the BACDS. The resulting BACDS constructed by our algorithm is at most opt size, where is the maximum node degree and opt is the size of an optimal BACDS. Simulation results show that the BACDS can save a significant amount of energy and achieve up to longer network lifetime than previous schemes.

  19. Constructing Battery-Aware Virtual Backbones in Wireless Sensor Networks

    Directory of Open Access Journals (Sweden)

    Chi Ma

    2007-05-01

    Full Text Available A critical issue in battery-powered sensor networks is to construct energy efficient virtual backbones for network routing. Recent study in battery technology reveals that batteries tend to discharge more power than needed and reimburse the over-discharged power if they are recovered. In this paper we first provide a mathematical battery model suitable for implementation in sensor networks. We then introduce the concept of battery-aware connected dominating set (BACDS and show that in general the minimum BACDS (MBACDS can achieve longer lifetime than the previous backbone structures. Then we show that finding a MBACDS is NP-hard and give a distributed approximation algorithm to construct the BACDS. The resulting BACDS constructed by our algorithm is at most (8+Δopt size, where Δ is the maximum node degree and opt is the size of an optimal BACDS. Simulation results show that the BACDS can save a significant amount of energy and achieve up to 30% longer network lifetime than previous schemes.

  20. The contact activation proteins: a structure/function overview

    NARCIS (Netherlands)

    Meijers, J. C.; McMullen, B. A.; Bouma, B. N.

    1992-01-01

    In recent years, extensive knowledge has been obtained on the structure/function relationships of blood coagulation proteins. In this overview, we present recent developments on the structure/function relationships of the contact activation proteins: factor XII, high molecular weight kininogen,

  1. De novo protein structure determination using sparse NMR data

    International Nuclear Information System (INIS)

    Bowers, Peter M.; Strauss, Charlie E.M.; Baker, David

    2000-01-01

    We describe a method for generating moderate to high-resolution protein structures using limited NMR data combined with the ab initio protein structure prediction method Rosetta. Peptide fragments are selected from proteins of known structure based on sequence similarity and consistency with chemical shift and NOE data. Models are built from these fragments by minimizing an energy function that favors hydrophobic burial, strand pairing, and satisfaction of NOE constraints. Models generated using this procedure with ∼1 NOE constraint per residue are in some cases closer to the corresponding X-ray structures than the published NMR solution structures. The method requires only the sparse constraints available during initial stages of NMR structure determination, and thus holds promise for increasing the speed with which protein solution structures can be determined

  2. CMsearch: simultaneous exploration of protein sequence space and structure space improves not only protein homology detection but also protein structure prediction

    KAUST Repository

    Cui, Xuefeng

    2016-06-15

    Motivation: Protein homology detection, a fundamental problem in computational biology, is an indispensable step toward predicting protein structures and understanding protein functions. Despite the advances in recent decades on sequence alignment, threading and alignment-free methods, protein homology detection remains a challenging open problem. Recently, network methods that try to find transitive paths in the protein structure space demonstrate the importance of incorporating network information of the structure space. Yet, current methods merge the sequence space and the structure space into a single space, and thus introduce inconsistency in combining different sources of information. Method: We present a novel network-based protein homology detection method, CMsearch, based on cross-modal learning. Instead of exploring a single network built from the mixture of sequence and structure space information, CMsearch builds two separate networks to represent the sequence space and the structure space. It then learns sequence–structure correlation by simultaneously taking sequence information, structure information, sequence space information and structure space information into consideration. Results: We tested CMsearch on two challenging tasks, protein homology detection and protein structure prediction, by querying all 8332 PDB40 proteins. Our results demonstrate that CMsearch is insensitive to the similarity metrics used to define the sequence and the structure spaces. By using HMM–HMM alignment as the sequence similarity metric, CMsearch clearly outperforms state-of-the-art homology detection methods and the CASP-winning template-based protein structure prediction methods.

  3. Prediction of protein–protein interactions: unifying evolution and structure at protein interfaces

    International Nuclear Information System (INIS)

    Tuncbag, Nurcan; Gursoy, Attila; Keskin, Ozlem

    2011-01-01

    The vast majority of the chores in the living cell involve protein–protein interactions. Providing details of protein interactions at the residue level and incorporating them into protein interaction networks are crucial toward the elucidation of a dynamic picture of cells. Despite the rapid increase in the number of structurally known protein complexes, we are still far away from a complete network. Given experimental limitations, computational modeling of protein interactions is a prerequisite to proceed on the way to complete structural networks. In this work, we focus on the question 'how do proteins interact?' rather than 'which proteins interact?' and we review structure-based protein–protein interaction prediction approaches. As a sample approach for modeling protein interactions, PRISM is detailed which combines structural similarity and evolutionary conservation in protein interfaces to infer structures of complexes in the protein interaction network. This will ultimately help us to understand the role of protein interfaces in predicting bound conformations

  4. Structure of synaptophysin: a hexameric MARVEL-domain channel protein.

    Science.gov (United States)

    Arthur, Christopher P; Stowell, Michael H B

    2007-06-01

    Synaptophysin I (SypI) is an archetypal member of the MARVEL-domain family of integral membrane proteins and one of the first synaptic vesicle proteins to be identified and cloned. Most all MARVEL-domain proteins are involved in membrane apposition and vesicle-trafficking events, but their precise role in these processes is unclear. We have purified mammalian SypI and determined its three-dimensional (3D) structure by using electron microscopy and single-particle 3D reconstruction. The hexameric structure resembles an open basket with a large pore and tenuous interactions within the cytosolic domain. The structure suggests a model for Synaptophysin's role in fusion and recycling that is regulated by known interactions with the SNARE machinery. This 3D structure of a MARVEL-domain protein provides a structural foundation for understanding the role of these important proteins in a variety of biological processes.

  5. Sampling Realistic Protein Conformations Using Local Structural Bias

    DEFF Research Database (Denmark)

    Hamelryck, Thomas Wim; Kent, John T.; Krogh, A.

    2006-01-01

    The prediction of protein structure from sequence remains a major unsolved problem in biology. The most successful protein structure prediction methods make use of a divide-and-conquer strategy to attack the problem: a conformational sampling method generates plausible candidate structures, which...... are subsequently accepted or rejected using an energy function. Conceptually, this often corresponds to separating local structural bias from the long-range interactions that stabilize the compact, native state. However, sampling protein conformations that are compatible with the local structural bias encoded...... in a given protein sequence is a long-standing open problem, especially in continuous space. We describe an elegant and mathematically rigorous method to do this, and show that it readily generates native-like protein conformations simply by enforcing compactness. Our results have far-reaching implications...

  6. Structure of human Rad51 protein filament from molecular modeling and site-specific linear dichroism spectroscopy

    KAUST Repository

    Reymer, A.

    2009-07-08

    To get mechanistic insight into the DNA strand-exchange reaction of homologous recombination, we solved a filament structure of a human Rad51 protein, combining molecular modeling with experimental data. We build our structure on reported structures for central and N-terminal parts of pure (uncomplexed) Rad51 protein by aid of linear dichroism spectroscopy, providing angular orientations of substituted tyrosine residues of Rad51-dsDNA filaments in solution. The structure, validated by comparison with an electron microscopy density map and results from mutation analysis, is proposed to represent an active solution structure of the nucleo-protein complex. An inhomogeneously stretched double-stranded DNA fitted into the filament emphasizes the strategic positioning of 2 putative DNA-binding loops in a way that allows us speculate about their possibly distinct roles in nucleo-protein filament assembly and DNA strand-exchange reaction. The model suggests that the extension of a single-stranded DNA molecule upon binding of Rad51 is ensured by intercalation of Tyr-232 of the L1 loop, which might act as a docking tool, aligning protein monomers along the DNA strand upon filament assembly. Arg-235, also sitting on L1, is in the right position to make electrostatic contact with the phosphate backbone of the other DNA strand. The L2 loop position and its more ordered compact conformation makes us propose that this loop has another role, as a binding site for an incoming double-stranded DNA. Our filament structure and spectroscopic approach open the possibility of analyzing details along the multistep path of the strand-exchange reaction.

  7. Rapid and reliable protein structure determination via chemical shift threading.

    Science.gov (United States)

    Hafsa, Noor E; Berjanskii, Mark V; Arndt, David; Wishart, David S

    2018-01-01

    Protein structure determination using nuclear magnetic resonance (NMR) spectroscopy can be both time-consuming and labor intensive. Here we demonstrate how chemical shift threading can permit rapid, robust, and accurate protein structure determination using only chemical shift data. Threading is a relatively old bioinformatics technique that uses a combination of sequence information and predicted (or experimentally acquired) low-resolution structural data to generate high-resolution 3D protein structures. The key motivations behind using NMR chemical shifts for protein threading lie in the fact that they are easy to measure, they are available prior to 3D structure determination, and they contain vital structural information. The method we have developed uses not only sequence and chemical shift similarity but also chemical shift-derived secondary structure, shift-derived super-secondary structure, and shift-derived accessible surface area to generate a high quality protein structure regardless of the sequence similarity (or lack thereof) to a known structure already in the PDB. The method (called E-Thrifty) was found to be very fast (often chemical shift refinement, these results suggest that protein structure determination, using only NMR chemical shifts, is becoming increasingly practical and reliable. E-Thrifty is available as a web server at http://ethrifty.ca .

  8. Is protein structure prediction still an enigma?

    African Journals Online (AJOL)

    STORAGESEVER

    2008-12-29

    Dec 29, 2008 ... Computer methods for protein analysis address this problem since they study the .... neighbor methods, molecular dynamic simulation, and approaches .... fuzzy clustering, neural net works, logistic regression, decision tree ...

  9. At least 10% shorter C–H bonds in cryogenic protein crystal structures than in current AMBER forcefields

    Energy Technology Data Exchange (ETDEWEB)

    Pang, Yuan-Ping, E-mail: pang@mayo.edu

    2015-03-06

    High resolution protein crystal structures resolved with X-ray diffraction data at cryogenic temperature are commonly used as experimental data to refine forcefields and evaluate protein folding simulations. However, it has been unclear hitherto whether the C–H bond lengths in cryogenic protein structures are significantly different from those defined in forcefields to affect protein folding simulations. This article reports the finding that the C–H bonds in high resolution cryogenic protein structures are 10–14% shorter than those defined in current AMBER forcefields, according to 3709 C–H bonds in the cryogenic protein structures with resolutions of 0.62–0.79 Å. Also, 20 all-atom, isothermal–isobaric, 0.5-μs molecular dynamics simulations showed that chignolin folded from a fully-extended backbone formation to the native β-hairpin conformation in the simulations using AMBER forcefield FF12SB at 300 K with an aggregated native state population including standard error of 10 ± 4%. However, the aggregated native state population with standard error reduced to 3 ± 2% in the same simulations except that C–H bonds were shortened by 10–14%. Furthermore, the aggregated native state populations with standard errors increased to 35 ± 3% and 26 ± 3% when using FF12MC, which is based on AMBER forcefield FF99, with and without the shortened C–H bonds, respectively. These results show that the 10–14% bond length differences can significantly affect protein folding simulations and suggest that re-parameterization of C–H bonds according to the cryogenic structures could improve the ability of a forcefield to fold proteins in molecular dynamics simulations. - Highlights: • Cryogenic crystal structures are commonly used in computational studies of proteins. • C–H bonds in the cryogenic structures are shorter than those defined in forcefields. • A survey of 3709 C–H bonds shows that the cryogenic bonds are 10–14% shorter. • The

  10. Spectral fitting for signal assignment and structural analysis of uniformly {sup 13}C-labeled solid proteins by simulated annealing based on chemical shifts and spin dynamics

    Energy Technology Data Exchange (ETDEWEB)

    Matsuki, Yoh; Akutsu, Hideo; Fujiwara, Toshimichi [Osaka University, Institute for Protein Research (Japan)], E-mail: tfjwr@protein.osaka-u.ac.jp

    2007-08-15

    We describe an approach for the signal assignment and structural analysis with a suite of two-dimensional {sup 13}C-{sup 13}C magic-angle-spinning solid-state NMR spectra of uniformly {sup 13}C-labeled peptides and proteins. We directly fit the calculated spectra to experimental ones by simulated annealing in restrained molecular dynamics program CNS as a function of atomic coordinates. The spectra are calculated from the conformation dependent chemical shift obtained with SHIFTX and the cross-peak intensities computed for recoupled dipolar interactions. This method was applied to a membrane-bound 14-residue peptide, mastoparan-X. The obtained C', C{sup {alpha}} and C{sup {beta}} chemical shifts agreed with those reported previously at the precisions of 0.2, 0.7 and 0.4 ppm, respectively. This spectral fitting program also provides backbone dihedral angles with a precision of about 50 deg. from the spectra even with resonance overlaps. The restraints on the angles were improved by applying protein database program TALOS to the obtained chemical shifts. The peptide structure provided by these restraints was consistent with the reported structure at the backbone RMSD of about 1 A.

  11. Study of muscular skeletal apparatus’s functional state of junior sportsmen-power lifters, who have backbone verterbral abnormalities

    Directory of Open Access Journals (Sweden)

    V.R. Ilmatov

    2015-10-01

    Full Text Available Purpose: determination of abnormalities and disorders of muscular skeletal apparatuses’ status of power lifters, who have vertebral abnormalities of backbone. Material: 58 junior sportsmen participated in the research. 36 sportsmen were the main group of the research and had vertebral disorders in backbone. For posture testing visual examination was used. Backbone mobility was tested with goniometry method. Flat feet were registered with plantography method. Results: we determined posture abnormalities in sagittal and frontal planes; feet flat, limited maximal movements in thoracic and lumbar spines. It was determined that the most limited were rotational movements and backbone unbending. The next were side bents. These limitations were accompanied by pain syndrome. These observations indirectly confirmed theory of direct interaction of backbone structures with nervous structures. It is also a confirmation of vertebral abnormalities’ presence in junior sportsmen. Conclusions: it was found that in junior sportsmen - power lifters with backbone pathologies in 100% of cases symptoms are determined by local limitations of backbone mobility with pain syndrome. In 35% of cases they are accompanied by posture’s disorders and feet flat. Orientation and methodic of rehabilitation of such sportsmen have been determined.

  12. The Prediction of Botulinum Toxin Structure Based on in Silico and in Vitro Analysis

    Science.gov (United States)

    Suzuki, Tomonori; Miyazaki, Satoru

    2011-01-01

    Many of biological system mediated through protein-protein interactions. Knowledge of protein-protein complex structure is required for understanding the function. The determination of huge size and flexible protein-protein complex structure by experimental studies remains difficult, costly and five-consuming, therefore computational prediction of protein structures by homolog modeling and docking studies is valuable method. In addition, MD simulation is also one of the most powerful methods allowing to see the real dynamics of proteins. Here, we predict protein-protein complex structure of botulinum toxin to analyze its property. These bioinformatics methods are useful to report the relation between the flexibility of backbone structure and the activity.

  13. Crystal Structures of SlyA Protein, a Master Virulence Regulator of Salmonella, in Free and DNA-bound States

    Energy Technology Data Exchange (ETDEWEB)

    Dolan, Kyle T.; Duguid, Erica M.; He, Chuan (UC)

    2011-11-17

    SlyA is a master virulence regulator that controls the transcription of numerous genes in Salmonella enterica. We present here crystal structures of SlyA by itself and bound to a high-affinity DNA operator sequence in the slyA gene. SlyA interacts with DNA through direct recognition of a guanine base by Arg-65, as well as interactions between conserved Arg-86 and the minor groove and a large network of non-base-specific contacts with the sugar phosphate backbone. Our structures, together with an unpublished structure of SlyA bound to the small molecule effector salicylate (Protein Data Bank code 3DEU), reveal that, unlike many other MarR family proteins, SlyA dissociates from DNA without large conformational changes when bound to this effector. We propose that SlyA and other MarR global regulators rely more on indirect readout of DNA sequence to exert control over many genes, in contrast to proteins (such as OhrR) that recognize a single operator.

  14. Function and structure of GFP-like proteins in the protein data bank.

    Science.gov (United States)

    Ong, Wayne J-H; Alvarez, Samuel; Leroux, Ivan E; Shahid, Ramza S; Samma, Alex A; Peshkepija, Paola; Morgan, Alicia L; Mulcahy, Shawn; Zimmer, Marc

    2011-04-01

    The RCSB protein databank contains 266 crystal structures of green fluorescent proteins (GFP) and GFP-like proteins. This is the first systematic analysis of all the GFP-like structures in the pdb. We have used the pdb to examine the function of fluorescent proteins (FP) in nature, aspects of excited state proton transfer (ESPT) in FPs, deformation from planarity of the chromophore and chromophore maturation. The conclusions reached in this review are that (1) The lid residues are highly conserved, particularly those on the "top" of the β-barrel. They are important to the function of GFP-like proteins, perhaps in protecting the chromophore or in β-barrel formation. (2) The primary/ancestral function of GFP-like proteins may well be to aid in light induced electron transfer. (3) The structural prerequisites for light activated proton pumps exist in many structures and it's possible that like bioluminescence, proton pumps are secondary functions of GFP-like proteins. (4) In most GFP-like proteins the protein matrix exerts a significant strain on planar chromophores forcing most GFP-like proteins to adopt non-planar chromophores. These chromophoric deviations from planarity play an important role in determining the fluorescence quantum yield. (5) The chemospatial characteristics of the chromophore cavity determine the isomerization state of the chromophore. The cavities of highlighter proteins that can undergo cis/trans isomerization have chemospatial properties that are common to both cis and trans GFP-like proteins.

  15. A protein relational database and protein family knowledge bases to facilitate structure-based design analyses.

    Science.gov (United States)

    Mobilio, Dominick; Walker, Gary; Brooijmans, Natasja; Nilakantan, Ramaswamy; Denny, R Aldrin; Dejoannis, Jason; Feyfant, Eric; Kowticwar, Rupesh K; Mankala, Jyoti; Palli, Satish; Punyamantula, Sairam; Tatipally, Maneesh; John, Reji K; Humblet, Christine

    2010-08-01

    The Protein Data Bank is the most comprehensive source of experimental macromolecular structures. It can, however, be difficult at times to locate relevant structures with the Protein Data Bank search interface. This is particularly true when searching for complexes containing specific interactions between protein and ligand atoms. Moreover, searching within a family of proteins can be tedious. For example, one cannot search for some conserved residue as residue numbers vary across structures. We describe herein three databases, Protein Relational Database, Kinase Knowledge Base, and Matrix Metalloproteinase Knowledge Base, containing protein structures from the Protein Data Bank. In Protein Relational Database, atom-atom distances between protein and ligand have been precalculated allowing for millisecond retrieval based on atom identity and distance constraints. Ring centroids, centroid-centroid and centroid-atom distances and angles have also been included permitting queries for pi-stacking interactions and other structural motifs involving rings. Other geometric features can be searched through the inclusion of residue pair and triplet distances. In Kinase Knowledge Base and Matrix Metalloproteinase Knowledge Base, the catalytic domains have been aligned into common residue numbering schemes. Thus, by searching across Protein Relational Database and Kinase Knowledge Base, one can easily retrieve structures wherein, for example, a ligand of interest is making contact with the gatekeeper residue.

  16. Tuning structure of oppositely charged nanoparticle and protein complexes

    Energy Technology Data Exchange (ETDEWEB)

    Kumar, Sugam, E-mail: sugam@barc.gov.in; Aswal, V. K., E-mail: sugam@barc.gov.in [Solid State Physics Division, Bhabha Atomic Research Centre, Mumbai-400085 (India); Callow, P. [Institut Laue Langevin, DS/LSS, 6 rue Jules Horowitz, 38042 Grenoble Cedex 9 (France)

    2014-04-24

    Small-angle neutron scattering (SANS) has been used to probe the structures of anionic silica nanoparticles (LS30) and cationic lyszyme protein (M.W. 14.7kD, I.P. ∼ 11.4) by tuning their interaction through the pH variation. The protein adsorption on nanoparticles is found to be increasing with pH and determined by the electrostatic attraction between two components as well as repulsion between protein molecules. We show the strong electrostatic attraction between nanoparticles and protein molecules leads to protein-mediated aggregation of nanoparticles which are characterized by fractal structures. At pH 5, the protein adsorption gives rise to nanoparticle aggregation having surface fractal morphology with close packing of nanoparticles. The surface fractals transform to open structures of mass fractal morphology at higher pH (7 and 9) on approaching isoelectric point (I.P.)

  17. Studying Membrane Protein Structure and Function Using Nanodiscs

    DEFF Research Database (Denmark)

    Huda, Pie

    The structure and dynamic of membrane proteins can provide valuable information about general functions, diseases and effects of various drugs. Studying membrane proteins are a challenge as an amphiphilic environment is necessary to stabilise the protein in a functionally and structurally relevant...... form. This is most typically achieved through the use of detergent based reconstitution systems. However, time and again such systems fail to provide a suitable environment causing aggregation and inactivation. Nanodiscs are self-assembled lipoproteins containing two membrane scaffold proteins...... and a lipid bilayer in defined nanometer size, which can act as a stabiliser for membrane proteins. This enables both functional and structural investigation of membrane proteins in a detergent free environment which is closer to the native situation. Understanding the self-assembly of nanodiscs is important...

  18. Exploring structural variability in X-ray crystallographic models using protein local optimization by torsion-angle sampling

    International Nuclear Information System (INIS)

    Knight, Jennifer L.; Zhou, Zhiyong; Gallicchio, Emilio; Himmel, Daniel M.; Friesner, Richard A.; Arnold, Eddy; Levy, Ronald M.

    2008-01-01

    Torsion-angle sampling, as implemented in the Protein Local Optimization Program (PLOP), is used to generate multiple structurally variable single-conformer models which are in good agreement with X-ray data. An ensemble-refinement approach to differentiate between positional uncertainty and conformational heterogeneity is proposed. Modeling structural variability is critical for understanding protein function and for modeling reliable targets for in silico docking experiments. Because of the time-intensive nature of manual X-ray crystallographic refinement, automated refinement methods that thoroughly explore conformational space are essential for the systematic construction of structurally variable models. Using five proteins spanning resolutions of 1.0–2.8 Å, it is demonstrated how torsion-angle sampling of backbone and side-chain libraries with filtering against both the chemical energy, using a modern effective potential, and the electron density, coupled with minimization of a reciprocal-space X-ray target function, can generate multiple structurally variable models which fit the X-ray data well. Torsion-angle sampling as implemented in the Protein Local Optimization Program (PLOP) has been used in this work. Models with the lowest R free values are obtained when electrostatic and implicit solvation terms are included in the effective potential. HIV-1 protease, calmodulin and SUMO-conjugating enzyme illustrate how variability in the ensemble of structures captures structural variability that is observed across multiple crystal structures and is linked to functional flexibility at hinge regions and binding interfaces. An ensemble-refinement procedure is proposed to differentiate between variability that is a consequence of physical conformational heterogeneity and that which reflects uncertainty in the atomic coordinates

  19. Exploring protein dynamics space: the dynasome as the missing link between protein structure and function.

    Directory of Open Access Journals (Sweden)

    Ulf Hensen

    Full Text Available Proteins are usually described and classified according to amino acid sequence, structure or function. Here, we develop a minimally biased scheme to compare and classify proteins according to their internal mobility patterns. This approach is based on the notion that proteins not only fold into recurring structural motifs but might also be carrying out only a limited set of recurring mobility motifs. The complete set of these patterns, which we tentatively call the dynasome, spans a multi-dimensional space with axes, the dynasome descriptors, characterizing different aspects of protein dynamics. The unique dynamic fingerprint of each protein is represented as a vector in the dynasome space. The difference between any two vectors, consequently, gives a reliable measure of the difference between the corresponding protein dynamics. We characterize the properties of the dynasome by comparing the dynamics fingerprints obtained from molecular dynamics simulations of 112 proteins but our approach is, in principle, not restricted to any specific source of data of protein dynamics. We conclude that: 1. the dynasome consists of a continuum of proteins, rather than well separated classes. 2. For the majority of proteins we observe strong correlations between structure and dynamics. 3. Proteins with similar function carry out similar dynamics, which suggests a new method to improve protein function annotation based on protein dynamics.

  20. Host Proteins Determine MRSA Biofilm Structure and Integrity

    DEFF Research Database (Denmark)

    Dreier, Cindy; Nielsen, Astrid; Jørgensen, Nis Pedersen

    Human extracellular matrix (hECM) proteins aids the initial attachment and initiation of an infection, by specific binding to bacterial cell surface proteins. However, the importance of hECM proteins in structure, integrity and antibiotic resilience of a biofilm is unknown. This study aims...... to determine how specific hECM proteins affect S. aureus USA300 JE2 biofilms. Biofilms were grown in the presence of synovial fluid from rheumatoid arteritis patients to mimic in vivo conditions, where bacteria incorporate hECM proteins into the biofilm matrix. Difference in biofilm structure, with and without...... addition of hECM to growth media, was visualized by confocal laser scanning microscopy. Two enzymatic degradation experiments were used to study biofilm matrix composition and importance of hECM proteins: enzymatic removal of specific hECM proteins from growth media, before biofilm formation, and enzymatic...

  1. Extracting the information backbone in online system.

    Science.gov (United States)

    Zhang, Qian-Ming; Zeng, An; Shang, Ming-Sheng

    2013-01-01

    Information overload is a serious problem in modern society and many solutions such as recommender system have been proposed to filter out irrelevant information. In the literature, researchers have been mainly dedicated to improving the recommendation performance (accuracy and diversity) of the algorithms while they have overlooked the influence of topology of the online user-object bipartite networks. In this paper, we find that some information provided by the bipartite networks is not only redundant but also misleading. With such "less can be more" feature, we design some algorithms to improve the recommendation performance by eliminating some links from the original networks. Moreover, we propose a hybrid method combining the time-aware and topology-aware link removal algorithms to extract the backbone which contains the essential information for the recommender systems. From the practical point of view, our method can improve the performance and reduce the computational time of the recommendation system, thus improving both of their effectiveness and efficiency.

  2. Using linear algebra for protein structural comparison and classification.

    Science.gov (United States)

    Gomide, Janaína; Melo-Minardi, Raquel; Dos Santos, Marcos Augusto; Neshich, Goran; Meira, Wagner; Lopes, Júlio César; Santoro, Marcelo

    2009-07-01

    In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.

  3. Using linear algebra for protein structural comparison and classification

    Directory of Open Access Journals (Sweden)

    Janaína Gomide

    2009-01-01

    Full Text Available In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD and Latent Semantic Indexing (LSI techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.

  4. Structural Mass Spectrometry of Proteins Using Hydroxyl Radical Based Protein Footprinting

    OpenAIRE

    Wang, Liwen; Chance, Mark R.

    2011-01-01

    Structural MS is a rapidly growing field with many applications in basic research and pharmaceutical drug development. In this feature article the overall technology is described and several examples of how hydroxyl radical based footprinting MS can be used to map interfaces, evaluate protein structure, and identify ligand dependent conformational changes in proteins are described.

  5. Selectively dispersed isotope labeling for protein structure determination by magic angle spinning NMR

    Energy Technology Data Exchange (ETDEWEB)

    Eddy, Matthew T. [Massachusetts Institute of Technology, Department of Chemistry (United States); Belenky, Marina [Brandeis University, Department of Chemistry (United States); Sivertsen, Astrid C. [Massachusetts Institute of Technology, Francis Bitter Magnet Laboratory (United States); Griffin, Robert G. [Massachusetts Institute of Technology, Department of Chemistry (United States); Herzfeld, Judith, E-mail: herzfeld@brandeis.edu [Brandeis University, Department of Chemistry (United States)

    2013-10-15

    The power of nuclear magnetic resonance spectroscopy derives from its site-specific access to chemical, structural and dynamic information. However, the corresponding multiplicity of interactions can be difficult to tease apart. Complimentary approaches involve spectral editing on the one hand and selective isotope substitution on the other. Here we present a new 'redox' approach to the latter: acetate is chosen as the sole carbon source for the extreme oxidation numbers of its two carbons. Consistent with conventional anabolic pathways for the amino acids, [1-{sup 13}C] acetate does not label {alpha} carbons, labels other aliphatic carbons and the aromatic carbons very selectively, and labels the carboxyl carbons heavily. The benefits of this labeling scheme are exemplified by magic angle spinning spectra of microcrystalline immunoglobulin binding protein G (GB1): the elimination of most J-couplings and one- and two-bond dipolar couplings provides narrow signals and long-range, intra- and inter-residue, recoupling essential for distance constraints. Inverse redox labeling, from [2-{sup 13}C] acetate, is also expected to be useful: although it retains one-bond couplings in the sidechains, the removal of CA-CO coupling in the backbone should improve the resolution of NCACX spectra.

  6. PSPP: a protein structure prediction pipeline for computing clusters.

    Directory of Open Access Journals (Sweden)

    Michael S Lee

    2009-07-01

    Full Text Available Protein structures are critical for understanding the mechanisms of biological systems and, subsequently, for drug and vaccine design. Unfortunately, protein sequence data exceed structural data by a factor of more than 200 to 1. This gap can be partially filled by using computational protein structure prediction. While structure prediction Web servers are a notable option, they often restrict the number of sequence queries and/or provide a limited set of prediction methodologies. Therefore, we present a standalone protein structure prediction software package suitable for high-throughput structural genomic applications that performs all three classes of prediction methodologies: comparative modeling, fold recognition, and ab initio. This software can be deployed on a user's own high-performance computing cluster.The pipeline consists of a Perl core that integrates more than 20 individual software packages and databases, most of which are freely available from other research laboratories. The query protein sequences are first divided into domains either by domain boundary recognition or Bayesian statistics. The structures of the individual domains are then predicted using template-based modeling or ab initio modeling. The predicted models are scored with a statistical potential and an all-atom force field. The top-scoring ab initio models are annotated by structural comparison against the Structural Classification of Proteins (SCOP fold database. Furthermore, secondary structure, solvent accessibility, transmembrane helices, and structural disorder are predicted. The results are generated in text, tab-delimited, and hypertext markup language (HTML formats. So far, the pipeline has been used to study viral and bacterial proteomes.The standalone pipeline that we introduce here, unlike protein structure prediction Web servers, allows users to devote their own computing assets to process a potentially unlimited number of queries as well as perform

  7. Global Transcriptional Regulation of Backbone Genes in Broad-Host-Range Plasmid RA3 from the IncU Group Involves Segregation Protein KorB (ParB Family).

    Science.gov (United States)

    Kulinska, Anna; Godziszewska, Jolanta; Wojciechowska, Anna; Ludwiczak, Marta; Jagura-Burdzy, Grazyna

    2016-04-01

    The KorB protein of the broad-host-range conjugative plasmid RA3 from the IncU group belongs to the ParB family of plasmid and chromosomal segregation proteins. As a partitioning DNA-binding factor, KorB specifically recognizes a 16-bp palindrome which is an essential motif in the centromere-like sequence parSRA3, forms a segrosome, and together with its partner IncC (ParA family) participates in active DNA segregation ensuring stable plasmid maintenance. Here we show that by binding to this palindromic sequence, KorB also acts as a repressor for the adjacent mobC promoter driving expression of the mobC-nicoperon, which is involved in DNA processing during conjugation. Three other promoters, one buried in the conjugative transfer module and two divergent promoters located at the border between the replication and stability regions, are regulated by KorB binding to additional KorB operators (OBs). KorB acts as a repressor at a distance, binding to OBs separated from their cognate promoters by between 46 and 1,317 nucleotides. This repressor activity is facilitated by KorB spreading along DNA, since a polymerization-deficient KorB variant with its dimerization and DNA-binding abilities intact is inactive in transcriptional repression. KorB may act as a global regulator of RA3 plasmid functions in Escherichia coli, since its overexpression in transnegatively interferes with mini-RA3 replication and stable maintenance of RA3. Copyright © 2016, American Society for Microbiology. All Rights Reserved.

  8. High Performance Infiltrated Backbones for Cathode-Supported SOFC's

    DEFF Research Database (Denmark)

    Gil, Vanesa; Kammer Hansen, Kent

    2014-01-01

    The concept of using highly ionic conducting backbones with subsequent infiltration of electronically conducting particles has widely been used to develop alternative anode-supported SOFC's. In this work, the idea was to develop infiltrated backbones as an alternative design based on cathode......, microstructural characterization and electrochemical testing are discussed. Data on polarization resistance, Rp, are obtained from impedance spectra recorded on quasi-symmetrical cells (YSZ backbones/YSZ/LSM-YSZ (screen printed)). The backbones are infiltrated with LSM and compared to a standard LSM-YSZ screen...

  9. Impact of Backbone Fluorination on π-Conjugated Polymers in Organic Photovoltaic Devices: A Review

    Directory of Open Access Journals (Sweden)

    Nicolas Leclerc

    2016-01-01

    Full Text Available Solution-processed bulk heterojunction solar cells have experienced a remarkable acceleration in performances in the last two decades, reaching power conversion efficiencies above 10%. This impressive progress is the outcome of a simultaneous development of more advanced device architectures and of optimized semiconducting polymers. Several chemical approaches have been developed to fine-tune the optoelectronics and structural polymer parameters required to reach high efficiencies. Fluorination of the conjugated polymer backbone has appeared recently to be an especially promising approach for the development of efficient semiconducting polymers. As a matter of fact, most currently best-performing semiconducting polymers are using fluorine atoms in their conjugated backbone. In this review, we attempt to give an up-to-date overview of the latest results achieved on fluorinated polymers for solar cells and to highlight general polymer properties’ evolution trends related to the fluorination of their conjugated backbone.

  10. ADAR RNA editing below the backbone.

    Science.gov (United States)

    Keegan, Liam; Khan, Anzer; Vukic, Dragana; O'Connell, Mary

    2017-09-01

    ADAR RNA editing enzymes ( a denosine d e a minases acting on R NA) that convert adenosine bases to inosines were first identified biochemically 30 years ago. Since then, studies on ADARs in genetic model organisms, and evolutionary comparisons between them, continue to reveal a surprising range of pleiotropic biological effects of ADARs. This review focuses on Drosophila melanogaster , which has a single Adar gene encoding a homolog of vertebrate ADAR2 that site-specifically edits hundreds of transcripts to change individual codons in ion channel subunits and membrane and cytoskeletal proteins. Drosophila ADAR is involved in the control of neuronal excitability and neurodegeneration and, intriguingly, in the control of neuronal plasticity and sleep. Drosophila ADAR also interacts strongly with RNA interference, a key antiviral defense mechanism in invertebrates. Recent crystal structures of human ADAR2 deaminase domain-RNA complexes help to interpret available information on Drosophila ADAR isoforms and on the evolution of ADARs from tRNA deaminase ADAT proteins. ADAR RNA editing is a paradigm for the now rapidly expanding range of RNA modifications in mRNAs and ncRNAs. Even with recent progress, much remains to be understood about these groundbreaking ADAR RNA modification systems. © 2017 Keegan et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  11. Structure of the Yersinia pestis tip protein LcrV refined to 1.65 Å resolution

    International Nuclear Information System (INIS)

    Chaudhury, Sukanya; Battaile, Kevin P.; Lovell, Scott; Plano, Gregory V.; De Guzman, Roberto N.

    2013-01-01

    Here, the crystal structure of Yersinia pestis tip protein LcrV is reported at a resolution of 1.65 Å. The human pathogen Yersinia pestis requires the assembly of the type III secretion system (T3SS) for virulence. The structural component of the T3SS contains an external needle and a tip complex, which is formed by LcrV in Y. pestis. The structure of an LcrV triple mutant (K40A/D41A/K42A) in a C273S background has previously been reported to 2.2 Å resolution. Here, the crystal structure of LcrV without the triple mutation in a C273S background is reported at a higher resolution of 1.65 Å. Overall the two structures are similar, but there are also notable differences, particularly near the site of the triple mutation. The refined structure revealed a slight shift in the backbone positions of residues Gly28–Asn43 and displayed electron density in the loop region consisting of residues Ile46–Val63, which was disordered in the original structure. In addition, the helical turn region spanning residues Tyr77–Gln95 adopts a different orientation

  12. Bayesian comparison of protein structures using partial Procrustes distance.

    Science.gov (United States)

    Ejlali, Nasim; Faghihi, Mohammad Reza; Sadeghi, Mehdi

    2017-09-26

    An important topic in bioinformatics is the protein structure alignment. Some statistical methods have been proposed for this problem, but most of them align two protein structures based on the global geometric information without considering the effect of neighbourhood in the structures. In this paper, we provide a Bayesian model to align protein structures, by considering the effect of both local and global geometric information of protein structures. Local geometric information is incorporated to the model through the partial Procrustes distance of small substructures. These substructures are composed of β-carbon atoms from the side chains. Parameters are estimated using a Markov chain Monte Carlo (MCMC) approach. We evaluate the performance of our model through some simulation studies. Furthermore, we apply our model to a real dataset and assess the accuracy and convergence rate. Results show that our model is much more efficient than previous approaches.

  13. SCit: web tools for protein side chain conformation analysis.

    Science.gov (United States)

    Gautier, R; Camproux, A-C; Tufféry, P

    2004-07-01

    SCit is a web server providing services for protein side chain conformation analysis and side chain positioning. Specific services use the dependence of the side chain conformations on the local backbone conformation, which is described using a structural alphabet that describes the conformation of fragments of four-residue length in a limited library of structural prototypes. Based on this concept, SCit uses sets of rotameric conformations dependent on the local backbone conformation of each protein for side chain positioning and the identification of side chains with unlikely conformations. The SCit web server is accessible at http://bioserv.rpbs.jussieu.fr/SCit.

  14. Structural and Functional Annotation of Hypothetical Proteins of O139

    Directory of Open Access Journals (Sweden)

    Md. Saiful Islam

    2015-06-01

    Full Text Available In developing countries threat of cholera is a significant health concern whenever water purification and sewage disposal systems are inadequate. Vibrio cholerae is one of the responsible bacteria involved in cholera disease. The complete genome sequence of V. cholerae deciphers the presence of various genes and hypothetical proteins whose function are not yet understood. Hence analyzing and annotating the structure and function of hypothetical proteins is important for understanding the V. cholerae. V. cholerae O139 is the most common and pathogenic bacterial strain among various V. cholerae strains. In this study sequence of six hypothetical proteins of V. cholerae O139 has been annotated from NCBI. Various computational tools and databases have been used to determine domain family, protein-protein interaction, solubility of protein, ligand binding sites etc. The three dimensional structure of two proteins were modeled and their ligand binding sites were identified. We have found domains and families of only one protein. The analysis revealed that these proteins might have antibiotic resistance activity, DNA breaking-rejoining activity, integrase enzyme activity, restriction endonuclease, etc. Structural prediction of these proteins and detection of binding sites from this study would indicate a potential target aiding docking studies for therapeutic designing against cholera.

  15. Structural study of surfactant-dependent interaction with protein

    Energy Technology Data Exchange (ETDEWEB)

    Mehan, Sumit; Aswal, Vinod K., E-mail: vkaswal@barc.gov.in [Solid State Physics Division, Bhabha Atomic Research Centre, Mumbai 400 085 (India); Kohlbrecher, Joachim [Laboratory for Neutron Scattering, Paul Scherrer Institut, CH-5232 PSI Villigen (Switzerland)

    2015-06-24

    Small-angle neutron scattering (SANS) has been used to study the complex structure of anionic BSA protein with three different (cationic DTAB, anionic SDS and non-ionic C12E10) surfactants. These systems form very different surfactant-dependent complexes. We show that the structure of protein-surfactant complex is initiated by the site-specific electrostatic interaction between the components, followed by the hydrophobic interaction at high surfactant concentrations. It is also found that hydrophobic interaction is preferred over the electrostatic interaction in deciding the resultant structure of protein-surfactant complexes.

  16. Analisa Perbandingan Quality Of Service (QoS) pada Jaringan Backbone Non-MPLS dengan Jaringan Backbone MPLS Menggunakan Routing Protocol OSPF di PT. Telekomunikasi Indonesia, Tbk. Witel Ridar Riau

    OpenAIRE

    Silaban, Nestor Hasudungan; Sari, Linna Oktaviana; Anhar, Anhar

    2015-01-01

    The development of telecommunications technology based on Internet Protocol (IP) is now growing with the competitiveness of the telecommunications company to improve the quality of service to consumers. It can be obtained by increasing the quality backbone network using Multi Protocol Label Switching (MPLS). MPLS is a new technology to forward the packet to the backbone network without changing the existing network structure. The main idea is to construct a replacement MPLS paths using label ...

  17. Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins

    Directory of Open Access Journals (Sweden)

    Bradley Michael E

    2006-02-01

    Full Text Available Abstract Background When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. Results The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include 1 multiple sequence alignments, 2 mapping of alignment sites to crystal structure sites, 3 phylogenetic trees, 4 inferred ancestral sequences at internal tree nodes, and 5 amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. Conclusion We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural

  18. 3D complex: a structural classification of protein complexes.

    Directory of Open Access Journals (Sweden)

    Emmanuel D Levy

    2006-11-01

    Full Text Available Most of the proteins in a cell assemble into complexes to carry out their function. It is therefore crucial to understand the physicochemical properties as well as the evolution of interactions between proteins. The Protein Data Bank represents an important source of information for such studies, because more than half of the structures are homo- or heteromeric protein complexes. Here we propose the first hierarchical classification of whole protein complexes of known 3-D structure, based on representing their fundamental structural features as a graph. This classification provides the first overview of all the complexes in the Protein Data Bank and allows nonredundant sets to be derived at different levels of detail. This reveals that between one-half and two-thirds of known structures are multimeric, depending on the level of redundancy accepted. We also analyse the structures in terms of the topological arrangement of their subunits and find that they form a small number of arrangements compared with all theoretically possible ones. This is because most complexes contain four subunits or less, and the large majority are homomeric. In addition, there is a strong tendency for symmetry in complexes, even for heteromeric complexes. Finally, through comparison of Biological Units in the Protein Data Bank with the Protein Quaternary Structure database, we identified many possible errors in quaternary structure assignments. Our classification, available as a database and Web server at http://www.3Dcomplex.org, will be a starting point for future work aimed at understanding the structure and evolution of protein complexes.

  19. Protein structure determination by exhaustive search of Protein Data Bank derived databases.

    Science.gov (United States)

    Stokes-Rees, Ian; Sliz, Piotr

    2010-12-14

    Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aide of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique due to the lack of known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.

  20. Topological properties of complex networks in protein structures

    Science.gov (United States)

    Kim, Kyungsik; Jung, Jae-Won; Min, Seungsik

    2014-03-01

    We study topological properties of networks in structural classification of proteins. We model the native-state protein structure as a network made of its constituent amino-acids and their interactions. We treat four structural classes of proteins composed predominantly of α helices and β sheets and consider several proteins from each of these classes whose sizes range from amino acids of the Protein Data Bank. Particularly, we simulate and analyze the network metrics such as the mean degree, the probability distribution of degree, the clustering coefficient, the characteristic path length, the local efficiency, and the cost. This work was supported by the KMAR and DP under Grant WISE project (153-3100-3133-302-350).

  1. Relationship between Molecular Structure Characteristics of Feed Proteins and Protein In vitro Digestibility and Solubility.

    Science.gov (United States)

    Bai, Mingmei; Qin, Guixin; Sun, Zewei; Long, Guohui

    2016-08-01

    The nutritional value of feed proteins and their utilization by livestock are related not only to the chemical composition but also to the structure of feed proteins, but few studies thus far have investigated the relationship between the structure of feed proteins and their solubility as well as digestibility in monogastric animals. To address this question we analyzed soybean meal, fish meal, corn distiller's dried grains with solubles, corn gluten meal, and feather meal by Fourier transform infrared (FTIR) spectroscopy to determine the protein molecular spectral band characteristics for amides I and II as well as α-helices and β-sheets and their ratios. Protein solubility and in vitro digestibility were measured with the Kjeldahl method using 0.2% KOH solution and the pepsin-pancreatin two-step enzymatic method, respectively. We found that all measured spectral band intensities (height and area) of feed proteins were correlated with their the in vitro digestibility and solubility (p≤0.003); moreover, the relatively quantitative amounts of α-helices, random coils, and α-helix to β-sheet ratio in protein secondary structures were positively correlated with protein in vitro digestibility and solubility (p≤0.004). On the other hand, the percentage of β-sheet structures was negatively correlated with protein in vitro digestibility (pdigestibility at 28 h and solubility. Furthermore, the α-helix-to-β-sheet ratio can be used to predict the nutritional value of feed proteins.

  2. Improvement of hydrogen bond geometry in protein NMR structures by residual dipolar couplings - an assessment of the interrelation of NMR restraints

    Energy Technology Data Exchange (ETDEWEB)

    Jensen, Pernille Rose; Axelsen, Jacob Bock [University of Copenhagen, Institute of Molecular Biology (Denmark); Lerche, Mathilde Hauge [Amersham Health (Sweden); Poulsen, Flemming M. [University of Copenhagen, Institute of Molecular Biology (Denmark)], E-mail: fmp@apk.molbio.ku.dk

    2004-01-15

    We have examined how the hydrogen bond geometry in three different proteins is affected when structural restraints based on measurements of residual dipolar couplings are included in the structure calculations. The study shows, that including restraints based solely on {sup 1}H{sup N}-{sup 15}N residual dipolar couplings has pronounced impact on the backbone rmsd and Ramachandran plot but does not improve the hydrogen bond geometry. In the case of chymotrypsin inhibitor 2 the addition of {sup 13}CO-{sup 13}C{sup {alpha}} and {sup 15}N-{sup 13}CO one bond dipolar couplings as restraints in the structure calculations improved the hydrogen bond geometry to a quality comparable to that obtained in the 1.8 A resolution X-ray structure of this protein. A systematic restraint study was performed, in which four types of restraints, residual dipolar couplings, hydrogen bonds, TALOS angles and NOEs, were allowed in two states. This study revealed the importance of using several types of residual dipolar couplings to get good hydrogen bond geometry. The study also showed that using a small set of NOEs derived only from the amide protons, together with a full set of residual dipolar couplings resulted in structures of very high quality. When reducing the NOE set, it is mainly the side-chain to side-chain NOEs that are removed. Despite of this the effect on the side-chain packing is very small when a reduced NOE set is used, which implies that the over all fold of a protein structure is mainly determined by correct folding of the backbone.

  3. Assignment by Negative-Ion Electrospray Tandem Mass Spectrometry of the Tetrasaccharide Backbones of Monosialylated Glycans Released from Bovine Brain Gangliosides

    Science.gov (United States)

    Chai, Wengang; Zhang, Yibing; Mauri, Laura; Ciampa, Maria G.; Mulloy, Barbara; Sonnino, Sandro; Feizi, Ten

    2018-05-01

    Gangliosides, as plasma membrane-associated sialylated glycolipids, are antigenic structures and they serve as ligands for adhesion proteins of pathogens, for toxins of bacteria, and for endogenous proteins of the host. The detectability by carbohydrate-binding proteins of glycan antigens and ligands on glycolipids can be influenced by the differing lipid moieties. To investigate glycan sequences of gangliosides as recognition structures, we have underway a program of work to develop a "gangliome" microarray consisting of isolated natural gangliosides and neoglycolipids (NGLs) derived from glycans released from them, and each linked to the same lipid molecule for arraying and comparative microarray binding analyses. Here, in the first phase of our studies, we describe a strategy for high-sensitivity assignment of the tetrasaccharide backbones and application to identification of eight of monosialylated glycans released from bovine brain gangliosides. This approach is based on negative-ion electrospray mass spectrometry with collision-induced dissociation (ESI-CID-MS/MS) of the desialylated glycans. Using this strategy, we have the data on backbone regions of four minor components among the monosialo-ganglioside-derived glycans; these are of the ganglio-, lacto-, and neolacto-series.

  4. Effects of NMR spectral resolution on protein structure calculation.

    Directory of Open Access Journals (Sweden)

    Suhas Tikole

    Full Text Available Adequate digital resolution and signal sensitivity are two critical factors for protein structure determinations by solution NMR spectroscopy. The prime objective for obtaining high digital resolution is to resolve peak overlap, especially in NOESY spectra with thousands of signals where the signal analysis needs to be performed on a large scale. Achieving maximum digital resolution is usually limited by the practically available measurement time. We developed a method utilizing non-uniform sampling for balancing digital resolution and signal sensitivity, and performed a large-scale analysis of the effect of the digital resolution on the accuracy of the resulting protein structures. Structure calculations were performed as a function of digital resolution for about 400 proteins with molecular sizes ranging between 5 and 33 kDa. The structural accuracy was assessed by atomic coordinate RMSD values from the reference structures of the proteins. In addition, we monitored also the number of assigned NOESY cross peaks, the average signal sensitivity, and the chemical shift spectral overlap. We show that high resolution is equally important for proteins of every molecular size. The chemical shift spectral overlap depends strongly on the corresponding spectral digital resolution. Thus, knowing the extent of overlap can be a predictor of the resulting structural accuracy. Our results show that for every molecular size a minimal digital resolution, corresponding to the natural linewidth, needs to be achieved for obtaining the highest accuracy possible for the given protein size using state-of-the-art automated NOESY assignment and structure calculation methods.

  5. Structural and Function Prediction of Musa acuminata subsp. Malaccensis Protein

    Directory of Open Access Journals (Sweden)

    Anum Munir

    2016-03-01

    Full Text Available Hypothetical proteins (HPs are the proteins whose presence has been anticipated, yet in vivo function has not been built up. Illustrating the structural and functional privileged insights of these HPs might likewise prompt a superior comprehension of the protein-protein associations or networks in diverse types of life. Bananas (Musa acuminata spp., including sweet and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister grouped to the all-around considered Poales, which incorporate oats. Bananas are crucial for nourishment security in numerous tropical and subtropical nations and the most prominent organic product in industrialized nations. In the present study, the hypothetical protein of M. acuminata (Banana was chosen for analysis and modeling by distinctive bioinformatics apparatuses and databases. As indicated by primary and secondary structure analysis, XP_009393594.1 is a stable hydrophobic protein containing a noteworthy extent of α-helices; Homology modeling was done utilizing SWISS-MODEL server where the templates identity with XP_009393594.1 protein was less which demonstrated novelty of our protein. Ab initio strategy was conducted to produce its 3D structure. A few evaluations of quality assessment and validation parameters determined the generated protein model as stable with genuinely great quality. Functional analysis was completed by ProtFun 2.2, and KEGG (KAAS, recommended that the hypothetical protein is a transcription factor with cytoplasmic domain as zinc finger. The protein was observed to be vital for translation process, involved in metabolism, signaling and cellular processes, genetic information processing and Zinc ion binding. It is suggested that further test approval would help to anticipate the structures and functions of other uncharacterized proteins of different plants and living being.

  6. Structural and metabolic studies of O-linked fucose-containing proteins of normal and virally-transformed rat fibroblasts

    International Nuclear Information System (INIS)

    Morton, P.A.

    1985-01-01

    Previous studies in this laboratory have demonstrated that cultured human and rodent cells contain a series of low molecular weight glycosylated amino acids of unusual structure, designated amino acid fucosides. The incorporation of radiolabelled-fucose into one of these components, designated FL4a (glucosylfucosylthreonine), is markedly-reduced in transformed epithelial and fibroblastic cells. The authors have examined fucose-labelled normal and virally-transformed rat fibroblast cell lines for glycoproteins which might be precursors to amino acid fucosides. Using milk alkaline/borohydride treatment (the beta-elimination reaction) to release O-linked oligosaccharides from proteins, they have isolated and partially characterized two low M/sub r/ reaction products (designated DS-ol and TS-ol) released from macromolecular cell material. The identity of one of these components (DS-ol, glucosylfucitol) suggested the existence in these cells of a direct protein precursor to FL4a. They examined fucose-labelled macromolecular cell material for proteins which release DS-ol (DS-proteins.). Using gel filtration chromatography and sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE) with subsequent autoradiography, they have observed DS-proteins which appear to exhibit a broad molecular weight size range, and are also present in culture medium from normal and transformed cells. The findings suggest that mammalian cells contain DS-proteins and TS-proteins with a novel carbohydrate-peptide linkage wherein L-fucose is O-linked to a polypeptide backbone. Metabolic studies were undertaken to examine both the relationship between DS-protein and FL4a and the biochemical basis for the decreased level of FL4a and the biochemical basis for the decreased level of FL4a observed in transformed cells

  7. Structural studies of human glioma pathogenesis-related protein 1

    Energy Technology Data Exchange (ETDEWEB)

    Asojo, Oluwatoyin A., E-mail: oasojo@unmc.edu [College of Medicine, Nebraska Medical Center, Omaha, NE 68198-6495 (United States); Koski, Raymond A.; Bonafé, Nathalie [L2 Diagnostics LLC, 300 George Street, New Haven, CT 06511 (United States); College of Medicine, Nebraska Medical Center, Omaha, NE 68198-6495 (United States)

    2011-10-01

    Structural analysis of a truncated soluble domain of human glioma pathogenesis-related protein 1, a membrane protein implicated in the proliferation of aggressive brain cancer, is presented. Human glioma pathogenesis-related protein 1 (GLIPR1) is a membrane protein that is highly upregulated in brain cancers but is barely detectable in normal brain tissue. GLIPR1 is composed of a signal peptide that directs its secretion, a conserved cysteine-rich CAP (cysteine-rich secretory proteins, antigen 5 and pathogenesis-related 1 proteins) domain and a transmembrane domain. GLIPR1 is currently being investigated as a candidate for prostate cancer gene therapy and for glioblastoma targeted therapy. Crystal structures of a truncated soluble domain of the human GLIPR1 protein (sGLIPR1) solved by molecular replacement using a truncated polyalanine search model of the CAP domain of stecrisp, a snake-venom cysteine-rich secretory protein (CRISP), are presented. The correct molecular-replacement solution could only be obtained by removing all loops from the search model. The native structure was refined to 1.85 Å resolution and that of a Zn{sup 2+} complex was refined to 2.2 Å resolution. The latter structure revealed that the putative binding cavity coordinates Zn{sup 2+} similarly to snake-venom CRISPs, which are involved in Zn{sup 2+}-dependent mechanisms of inflammatory modulation. Both sGLIPR1 structures have extensive flexible loop/turn regions and unique charge distributions that were not observed in any of the previously reported CAP protein structures. A model is also proposed for the structure of full-length membrane-bound GLIPR1.

  8. Structure and function of nanoparticle-protein conjugates

    International Nuclear Information System (INIS)

    Aubin-Tam, M-E; Hamad-Schifferli, K

    2008-01-01

    Conjugation of proteins to nanoparticles has numerous applications in sensing, imaging, delivery, catalysis, therapy and control of protein structure and activity. Therefore, characterizing the nanoparticle-protein interface is of great importance. A variety of covalent and non-covalent linking chemistries have been reported for nanoparticle attachment. Site-specific labeling is desirable in order to control the protein orientation on the nanoparticle, which is crucial in many applications such as fluorescence resonance energy transfer. We evaluate methods for successful site-specific attachment. Typically, a specific protein residue is linked directly to the nanoparticle core or to the ligand. As conjugation often affects the protein structure and function, techniques to probe structure and activity are assessed. We also examine how molecular dynamics simulations of conjugates would complete those experimental techniques in order to provide atomistic details on the effect of nanoparticle attachment. Characterization studies of nanoparticle-protein complexes show that the structure and function are influenced by the chemistry of the nanoparticle ligand, the nanoparticle size, the nanoparticle material, the stoichiometry of the conjugates, the labeling site on the protein and the nature of the linkage (covalent versus non-covalent)

  9. Pushing the frontiers of atomic models for protein tertiary structure ...

    Indian Academy of Sciences (India)

    as an NP complete or NP hard problem.4,5 This notwith- standing, the dire need for tertiary structures of proteins in drug discovery and other areas6–8 has propelled the development of a multitude of computational recipes. In this article, we focus on ab initio/de novo strategies,. Bhageerath in particular, for protein tertiary ...

  10. Mnn10 Maintains Pathogenicity in Candida albicans by Extending α-1,6-Mannose Backbone to Evade Host Dectin-1 Mediated Antifungal Immunity.

    Directory of Open Access Journals (Sweden)

    Shi Qun Zhang

    2016-05-01

    Full Text Available The cell wall is a dynamic structure that is important for the pathogenicity of Candida albicans. Mannan, which is located in the outermost layer of the cell wall, has been shown to contribute to the pathogenesis of C. albicans, however, the molecular mechanism by which this occurs remains unclear. Here we identified a novel α-1,6-mannosyltransferase encoded by MNN10 in C. albicans. We found that Mnn10 is required for cell wall α-1,6-mannose backbone biosynthesis and polysaccharides organization. Deletion of MNN10 resulted in significant attenuation of the pathogenesis of C. albicans in a murine systemic candidiasis model. Inhibition of α-1,6-mannose backbone extension did not, however, impact the invasive ability of C. albicans in vitro. Notably, mnn10 mutant restored the invasive capacity in athymic nude mice, which further supports the notion of an enhanced host antifungal defense related to this backbone change. Mnn10 mutant induced enhanced Th1 and Th17 cell mediated antifungal immunity, and resulted in enhanced recruitment of neutrophils and monocytes for pathogen clearance in vivo. We also demonstrated that MNN10 could unmask the surface β-(1,3-glucan, a crucial pathogen-associated molecular pattern (PAMP of C. albicans recognized by host Dectin-1. Our results demonstrate that mnn10 mutant could stimulate an enhanced Dectin-1 dependent immune response of macrophages in vitro, including the activation of nuclear factor-κB, mitogen-activated protein kinase pathways, and secretion of specific cytokines such as TNF-α, IL-6, IL-1β and IL-12p40. In summary, our study indicated that α-1,6-mannose backbone is critical for the pathogenesis of C. albicans via shielding β-glucan from recognition by host Dectin-1 mediated immune recognition. Moreover, our work suggests that inhibition of α-1,6-mannose extension by Mnn10 may represent a novel modality to reduce the pathogenicity of C. albicans.

  11. Simulation of Protein Structure, Dynamics and Function in Organic Media

    National Research Council Canada - National Science Library

    Daggett, Valerie

    1998-01-01

    The overall goal of our ONR-sponsored research is to pursue realistic molecular modeling strudies pertinnent to the related properties of protein stability, dynamics, structure, function, and folding in aqueous solution...

  12. Protein structure estimation from NMR data by matrix completion.

    Science.gov (United States)

    Li, Zhicheng; Li, Yang; Lei, Qiang; Zhao, Qing

    2017-09-01

    Knowledge of protein structures is very important to understand their corresponding physical and chemical properties. Nuclear Magnetic Resonance (NMR) spectroscopy is one of the main methods to measure protein structure. In this paper, we propose a two-stage approach to calculate the structure of a protein from a highly incomplete distance matrix, where most data are obtained from NMR. We first randomly "guess" a small part of unobservable distances by utilizing the triangle inequality, which is crucial for the second stage. Then we use matrix completion to calculate the protein structure from the obtained incomplete distance matrix. We apply the accelerated proximal gradient algorithm to solve the corresponding optimization problem. Furthermore, the recovery error of our method is analyzed, and its efficiency is demonstrated by several practical examples.

  13. Modeling membrane protein structure through site-directed ESR spectroscopy

    NARCIS (Netherlands)

    Kavalenka, A.A.

    2009-01-01

    Site-directed spin labeling (SDSL) electron spin resonance (ESR) spectroscopy is a
    relatively new biophysical tool for obtaining structural information about proteins. This
    thesis presents a novel approach, based on powerful spectral analysis techniques (multicomponent
    spectral

  14. Computational design of proteins with novel structure and functions

    International Nuclear Information System (INIS)

    Yang Wei; Lai Lu-Hua

    2016-01-01

    Computational design of proteins is a relatively new field, where scientists search the enormous sequence space for sequences that can fold into desired structure and perform desired functions. With the computational approach, proteins can be designed, for example, as regulators of biological processes, novel enzymes, or as biotherapeutics. These approaches not only provide valuable information for understanding of sequence–structure–function relations in proteins, but also hold promise for applications to protein engineering and biomedical research. In this review, we briefly introduce the rationale for computational protein design, then summarize the recent progress in this field, including de novo protein design, enzyme design, and design of protein–protein interactions. Challenges and future prospects of this field are also discussed. (topical review)

  15. Potato leafroll virus structural proteins manipulate overlapping, yet distinct protein interaction networks during infection.

    Science.gov (United States)

    DeBlasio, Stacy L; Johnson, Richard; Sweeney, Michelle M; Karasev, Alexander; Gray, Stewart M; MacCoss, Michael J; Cilia, Michelle

    2015-06-01

    Potato leafroll virus (PLRV) produces a readthrough protein (RTP) via translational readthrough of the coat protein amber stop codon. The RTP functions as a structural component of the virion and as a nonincorporated protein in concert with numerous insect and plant proteins to regulate virus movement/transmission and tissue tropism. Affinity purification coupled to quantitative MS was used to generate protein interaction networks for a PLRV mutant that is unable to produce the read through domain (RTD) and compared to the known wild-type PLRV protein interaction network. By quantifying differences in the protein interaction networks, we identified four distinct classes of PLRV-plant interactions: those plant and nonstructural viral proteins interacting with assembled coat protein (category I); plant proteins in complex with both coat protein and RTD (category II); plant proteins in complex with the RTD (category III); and plant proteins that had higher affinity for virions lacking the RTD (category IV). Proteins identified as interacting with the RTD are potential candidates for regulating viral processes that are mediated by the RTP such as phloem retention and systemic movement and can potentially be useful targets for the development of strategies to prevent infection and/or viral transmission of Luteoviridae species that infect important crop species. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  16. Binding free energy analysis of protein-protein docking model structures by evERdock.

    Science.gov (United States)

    Takemura, Kazuhiro; Matubayasi, Nobuyuki; Kitao, Akio

    2018-03-14

    To aid the evaluation of protein-protein complex model structures generated by protein docking prediction (decoys), we previously developed a method to calculate the binding free energies for complexes. The method combines a short (2 ns) all-atom molecular dynamics simulation with explicit solvent and solution theory in the energy representation (ER). We showed that this method successfully selected structures similar to the native complex structure (near-native decoys) as the lowest binding free energy structures. In our current work, we applied this method (evERdock) to 100 or 300 model structures of four protein-protein complexes. The crystal structures and the near-native decoys showed the lowest binding free energy of all the examined structures, indicating that evERdock can successfully evaluate decoys. Several decoys that show low interface root-mean-square distance but relatively high binding free energy were also identified. Analysis of the fraction of native contacts, hydrogen bonds, and salt bridges at the protein-protein interface indicated that these decoys were insufficiently optimized at the interface. After optimizing the interactions around the interface by including interfacial water molecules, the binding free energies of these decoys were improved. We also investigated the effect of solute entropy on binding free energy and found that consideration of the entropy term does not necessarily improve the evaluations of decoys using the normal model analysis for entropy calculation.

  17. Constraining cyclic peptides to mimic protein structure motifs

    DEFF Research Database (Denmark)

    Hill, Timothy A.; Shepherd, Nicholas E.; Diness, Frederik

    2014-01-01

    peptides can have protein-like biological activities and potencies, enabling their uses as biological probes and leads to therapeutics, diagnostics and vaccines. This Review highlights examples of cyclic peptides that mimic three-dimensional structures of strand, turn or helical segments of peptides...... and proteins, and identifies some additional restraints incorporated into natural product cyclic peptides and synthetic macrocyclic pepti-domimetics that refine peptide structure and confer biological properties....

  18. Overcoming bottlenecks in the membrane protein structural biology pipeline.

    Science.gov (United States)

    Hardy, David; Bill, Roslyn M; Jawhari, Anass; Rothnie, Alice J

    2016-06-15

    Membrane proteins account for a third of the eukaryotic proteome, but are greatly under-represented in the Protein Data Bank. Unfortunately, recent technological advances in X-ray crystallography and EM cannot account for the poor solubility and stability of membrane protein samples. A limitation of conventional detergent-based methods is that detergent molecules destabilize membrane proteins, leading to their aggregation. The use of orthologues, mutants and fusion tags has helped improve protein stability, but at the expense of not working with the sequence of interest. Novel detergents such as glucose neopentyl glycol (GNG), maltose neopentyl glycol (MNG) and calixarene-based detergents can improve protein stability without compromising their solubilizing properties. Styrene maleic acid lipid particles (SMALPs) focus on retaining the native lipid bilayer of a membrane protein during purification and biophysical analysis. Overcoming bottlenecks in the membrane protein structural biology pipeline, primarily by maintaining protein stability, will facilitate the elucidation of many more membrane protein structures in the near future. © 2016 The Author(s). published by Portland Press Limited on behalf of the Biochemical Society.

  19. Illuminating structural proteins in viral "dark matter" with metaproteomics.

    Science.gov (United States)

    Brum, Jennifer R; Ignacio-Espinoza, J Cesar; Kim, Eun-Hae; Trubl, Gareth; Jones, Robert M; Roux, Simon; VerBerkmoes, Nathan C; Rich, Virginia I; Sullivan, Matthew B

    2016-03-01

    Viruses are ecologically important, yet environmental virology is limited by dominance of unannotated genomic sequences representing taxonomic and functional "viral dark matter." Although recent analytical advances are rapidly improving taxonomic annotations, identifying functional dark matter remains problematic. Here, we apply paired metaproteomics and dsDNA-targeted metagenomics to identify 1,875 virion-associated proteins from the ocean. Over one-half of these proteins were newly functionally annotated and represent abundant and widespread viral metagenome-derived protein clusters (PCs). One primarily unannotated PC dominated the dataset, but structural modeling and genomic context identified this PC as a previously unidentified capsid protein from multiple uncultivated tailed virus families. Furthermore, four of the five most abundant PCs in the metaproteome represent capsid proteins containing the HK97-like protein fold previously found in many viruses that infect all three domains of life. The dominance of these proteins within our dataset, as well as their global distribution throughout the world's oceans and seas, supports prior hypotheses that this HK97-like protein fold is the most abundant biological structure on Earth. Together, these culture-independent analyses improve virion-associated protein annotations, facilitate the investigation of proteins within natural viral communities, and offer a high-throughput means of illuminating functional viral dark matter.

  20. Functional diversification of structurally alike NLR proteins in plants.

    Science.gov (United States)

    Chakraborty, Joydeep; Jain, Akansha; Mukherjee, Dibya; Ghosh, Suchismita; Das, Sampa

    2018-04-01

    In due course of evolution many pathogens alter their effector molecules to modulate the host plants' metabolism and immune responses triggered upon proper recognition by the intracellular nucleotide-binding oligomerization domain containing leucine-rich repeat (NLR) proteins. Likewise, host plants have also evolved with diversified NLR proteins as a survival strategy to win the battle against pathogen invasion. NLR protein indeed detects pathogen derived effector proteins leading to the activation of defense responses associated with programmed cell death (PCD). In this interactive process, genome structure and plasticity play pivotal role in the development of innate immunity. Despite being quite conserved with similar biological functions in all eukaryotes, the intracellular NLR immune receptor proteins happen to be structurally distinct. Recent studies have made progress in identifying transcriptional regulatory complexes activated by NLR proteins. In this review, we attempt to decipher the intracellular NLR proteins mediated surveillance across the evolutionarily diverse taxa, highlighting some of the recent updates on NLR protein compartmentalization, molecular interactions before and after activation along with insights into the finer role of these receptor proteins to combat invading pathogens upon their recognition. Latest information on NLR sensors, helpers and NLR proteins with integrated domains in the context of plant pathogen interactions are also discussed. Copyright © 2018 Elsevier B.V. All rights reserved.

  1. Combining neural networks for protein secondary structure prediction

    DEFF Research Database (Denmark)

    Riis, Søren Kamaric

    1995-01-01

    In this paper structured neural networks are applied to the problem of predicting the secondary structure of proteins. A hierarchical approach is used where specialized neural networks are designed for each structural class and then combined using another neural network. The submodels are designed...... by using a priori knowledge of the mapping between protein building blocks and the secondary structure and by using weight sharing. Since none of the individual networks have more than 600 adjustable weights over-fitting is avoided. When ensembles of specialized experts are combined the performance...

  2. A generative, probabilistic model of local protein structure

    DEFF Research Database (Denmark)

    Boomsma, Wouter; Mardia, Kanti V.; Taylor, Charles C.

    2008-01-01

    Despite significant progress in recent years, protein structure prediction maintains its status as one of the prime unsolved problems in computational biology. One of the key remaining challenges is an efficient probabilistic exploration of the structural space that correctly reflects the relative...... conformational stabilities. Here, we present a fully probabilistic, continuous model of local protein structure in atomic detail. The generative model makes efficient conformational sampling possible and provides a framework for the rigorous analysis of local sequence-structure correlations in the native state...

  3. SCOWLP classification: Structural comparison and analysis of protein binding regions

    Directory of Open Access Journals (Sweden)

    Anders Gerd

    2008-01-01

    Full Text Available Abstract Background Detailed information about protein interactions is critical for our understanding of the principles governing protein recognition mechanisms. The structures of many proteins have been experimentally determined in complex with different ligands bound either in the same or different binding regions. Thus, the structural interactome requires the development of tools to classify protein binding regions. A proper classification may provide a general view of the regions that a protein uses to bind others and also facilitate a detailed comparative analysis of the interacting information for specific protein binding regions at atomic level. Such classification might be of potential use for deciphering protein interaction networks, understanding protein function, rational engineering and design. Description Protein binding regions (PBRs might be ideally described as well-defined separated regions that share no interacting residues one another. However, PBRs are often irregular, discontinuous and can share a wide range of interacting residues among them. The criteria to define an individual binding region can be often arbitrary and may differ from other binding regions within a protein family. Therefore, the rational behind protein interface classification should aim to fulfil the requirements of the analysis to be performed. We extract detailed interaction information of protein domains, peptides and interfacial solvent from the SCOWLP database and we classify the PBRs of each domain family. For this purpose, we define a similarity index based on the overlapping of interacting residues mapped in pair-wise structural alignments. We perform our classification with agglomerative hierarchical clustering using the complete-linkage method. Our classification is calculated at different similarity cut-offs to allow flexibility in the analysis of PBRs, feature especially interesting for those protein families with conflictive binding regions

  4. Sequential Release of Proteins from Structured Multishell Microcapsules.

    Science.gov (United States)

    Shimanovich, Ulyana; Michaels, Thomas C T; De Genst, Erwin; Matak-Vinkovic, Dijana; Dobson, Christopher M; Knowles, Tuomas P J

    2017-10-09

    In nature, a wide range of functional materials is based on proteins. Increasing attention is also turning to the use of proteins as artificial biomaterials in the form of films, gels, particles, and fibrils that offer great potential for applications in areas ranging from molecular medicine to materials science. To date, however, most such applications have been limited to single component materials despite the fact that their natural analogues are composed of multiple types of proteins with a variety of functionalities that are coassembled in a highly organized manner on the micrometer scale, a process that is currently challenging to achieve in the laboratory. Here, we demonstrate the fabrication of multicomponent protein microcapsules where the different components are positioned in a controlled manner. We use molecular self-assembly to generate multicomponent structures on the nanometer scale and droplet microfluidics to bring together the different components on the micrometer scale. Using this approach, we synthesize a wide range of multiprotein microcapsules containing three well-characterized proteins: glucagon, insulin, and lysozyme. The localization of each protein component in multishell microcapsules has been detected by labeling protein molecules with different fluorophores, and the final three-dimensional microcapsule structure has been resolved by using confocal microscopy together with image analysis techniques. In addition, we show that these structures can be used to tailor the release of such functional proteins in a sequential manner. Moreover, our observations demonstrate that the protein release mechanism from multishell capsules is driven by the kinetic control of mass transport of the cargo and by the dissolution of the shells. The ability to generate artificial materials that incorporate a variety of different proteins with distinct functionalities increases the breadth of the potential applications of artificial protein-based materials

  5. Extracting the information backbone in online system.

    Directory of Open Access Journals (Sweden)

    Qian-Ming Zhang

    Full Text Available Information overload is a serious problem in modern society and many solutions such as recommender system have been proposed to filter out irrelevant information. In the literature, researchers have been mainly dedicated to improving the recommendation performance (accuracy and diversity of the algorithms while they have overlooked the influence of topology of the online user-object bipartite networks. In this paper, we find that some information provided by the bipartite networks is not only redundant but also misleading. With such "less can be more" feature, we design some algorithms to improve the recommendation performance by eliminating some links from the original networks. Moreover, we propose a hybrid method combining the time-aware and topology-aware link removal algorithms to extract the backbone which contains the essential information for the recommender systems. From the practical point of view, our method can improve the performance and reduce the computational time of the recommendation system, thus improving both of their effectiveness and efficiency.

  6. Extracting the Information Backbone in Online System

    Science.gov (United States)

    Zhang, Qian-Ming; Zeng, An; Shang, Ming-Sheng

    2013-01-01

    Information overload is a serious problem in modern society and many solutions such as recommender system have been proposed to filter out irrelevant information. In the literature, researchers have been mainly dedicated to improving the recommendation performance (accuracy and diversity) of the algorithms while they have overlooked the influence of topology of the online user-object bipartite networks. In this paper, we find that some information provided by the bipartite networks is not only redundant but also misleading. With such “less can be more” feature, we design some algorithms to improve the recommendation performance by eliminating some links from the original networks. Moreover, we propose a hybrid method combining the time-aware and topology-aware link removal algorithms to extract the backbone which contains the essential information for the recommender systems. From the practical point of view, our method can improve the performance and reduce the computational time of the recommendation system, thus improving both of their effectiveness and efficiency. PMID:23690946

  7. Backbone dynamics of oxidized and reduced D. vulgaris flavodoxin in solution

    International Nuclear Information System (INIS)

    Hrovat, Andrea; Bluemel, Markus; Loehr, Frank; Mayhew, Stephen G.; Rueterjans, Heinz

    1997-01-01

    Recombinant Desulfovibrio vulgaris flavodoxin was produced in Escherichia coli. A complete backbone NMR assignment for the two-electron reduced protein revealed significant changes of chemical shift values compared to the oxidized protein, in particular for the flavine mononucleotide (FMN)-binding site. A comparison of homo- and heteronuclear NOESY spectra for the two redox states led to the assumption that reduction is not accompanied by significant changes of the global fold of the protein.The backbone dynamics of both the oxidized and reduced forms of D. vulgaris flavodoxin were investigated using two-dimensional 15 N- 1 H correlation NMR spectroscopy.T 1 , T 2 and NOE data are obtained for 95% of the backbone amide groups in both redox states. These values were analysed in terms of the 'model-free' approach introduced by Lipari and Szabo [(1982) J. Am. Chem. Soc., 104, 4546-;4559, 4559-;4570]. A comparison of the two redox states indicates that in the reduced species significantly more flexibility occurs in the two loop regions enclosing FMN.Also, a higher amplitude of local motion could be found for the N(3)H group of FMN bound to the reduced protein compared to the oxidized state

  8. Structuring oil by protein building blocks

    NARCIS (Netherlands)

    Vries, de Auke

    2017-01-01

    Over the recent years, structuring of oil into ‘organogels’ or ‘oleogels’ has gained much attention amongst colloid-, material,- and food scientists. Potentially, these oleogels could be used as an alternative for saturated- and trans fats in food products. To develop oleogels as a

  9. Mass Spectrometry Coupled Experiments and Protein Structure Modeling Methods

    Directory of Open Access Journals (Sweden)

    Lee Sael

    2013-10-01

    Full Text Available With the accumulation of next generation sequencing data, there is increasing interest in the study of intra-species difference in molecular biology, especially in relation to disease analysis. Furthermore, the dynamics of the protein is being identified as a critical factor in its function. Although accuracy of protein structure prediction methods is high, provided there are structural templates, most methods are still insensitive to amino-acid differences at critical points that may change the overall structure. Also, predicted structures are inherently static and do not provide information about structural change over time. It is challenging to address the sensitivity and the dynamics by computational structure predictions alone. However, with the fast development of diverse mass spectrometry coupled experiments, low-resolution but fast and sensitive structural information can be obtained. This information can then be integrated into the structure prediction process to further improve the sensitivity and address the dynamics of the protein structures. For this purpose, this article focuses on reviewing two aspects: the types of mass spectrometry coupled experiments and structural data that are obtainable through those experiments; and the structure prediction methods that can utilize these data as constraints. Also, short review of current efforts in integrating experimental data in the structural modeling is provided.

  10. Chaperonin Structure - The Large Multi-Subunit Protein Complex

    Directory of Open Access Journals (Sweden)

    Irena Roterman

    2009-03-01

    Full Text Available The multi sub-unit protein structure representing the chaperonins group is analyzed with respect to its hydrophobicity distribution. The proteins of this group assist protein folding supported by ATP. The specific axial symmetry GroEL structure (two rings of seven units stacked back to back - 524 aa each and the GroES (single ring of seven units - 97 aa each polypeptide chains are analyzed using the hydrophobicity distribution expressed as excess/deficiency all over the molecule to search for structure-to-function relationships. The empirically observed distribution of hydrophobic residues is confronted with the theoretical one representing the idealized hydrophobic core with hydrophilic residues exposure on the surface. The observed discrepancy between these two distributions seems to be aim-oriented, determining the structure-to-function relation. The hydrophobic force field structure generated by the chaperonin capsule is presented. Its possible influence on substrate folding is suggested.

  11. NMR structural studies of peptides and proteins in membranes

    Energy Technology Data Exchange (ETDEWEB)

    Opella, S J [Pennsylvania Univ., Philadelphia, PA (United States). Dept. of Chemistry

    1994-12-31

    The use of NMR methodology in structural studies is described as applicable to larger proteins, considering that the majority of membrane proteins is constructed from a limited repertoire of structural and dynamic elements. The membrane associated domains of these proteins are made up of long hydrophobic membrane spanning helices, shorter amphipathic bridging helices in the plane of the bilayer, connecting loops with varying degrees of mobility, and mobile N- and C- terminal sections. NMR studies have been successful in identifying all of these elements and their orientations relative to each other and the membrane bilayer 19 refs., 9 figs.

  12. High throughput platforms for structural genomics of integral membrane proteins.

    Science.gov (United States)

    Mancia, Filippo; Love, James

    2011-08-01

    Structural genomics approaches on integral membrane proteins have been postulated for over a decade, yet specific efforts are lagging years behind their soluble counterparts. Indeed, high throughput methodologies for production and characterization of prokaryotic integral membrane proteins are only now emerging, while large-scale efforts for eukaryotic ones are still in their infancy. Presented here is a review of recent literature on actively ongoing structural genomics of membrane protein initiatives, with a focus on those aimed at implementing interesting techniques aimed at increasing our rate of success for this class of macromolecules. Copyright © 2011 Elsevier Ltd. All rights reserved.

  13. Mining protein loops using a structural alphabet and statistical exceptionality

    Directory of Open Access Journals (Sweden)

    Martin Juliette

    2010-02-01

    Full Text Available Abstract Background Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding site, catalytic pocket... However, the description of protein loops with conventional tools is an uneasy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. Results We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times. Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words out of 28274 observed words. These structural words have low structural variability (mean RMSd of 0.85 Å. As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues and long loops. Moreover, half of

  14. Mining protein loops using a structural alphabet and statistical exceptionality.

    Science.gov (United States)

    Regad, Leslie; Martin, Juliette; Nuel, Gregory; Camproux, Anne-Claude

    2010-02-04

    Protein loops encompass 50% of protein residues in available three-dimensional structures. These regions are often involved in protein functions, e.g. binding site, catalytic pocket... However, the description of protein loops with conventional tools is an uneasy task. Regular secondary structures, helices and strands, have been widely studied whereas loops, because they are highly variable in terms of sequence and structure, are difficult to analyze. Due to data sparsity, long loops have rarely been systematically studied. We developed a simple and accurate method that allows the description and analysis of the structures of short and long loops using structural motifs without restriction on loop length. This method is based on the structural alphabet HMM-SA. HMM-SA allows the simplification of a three-dimensional protein structure into a one-dimensional string of states, where each state is a four-residue prototype fragment, called structural letter. The difficult task of the structural grouping of huge data sets is thus easily accomplished by handling structural letter strings as in conventional protein sequence analysis. We systematically extracted all seven-residue fragments in a bank of 93000 protein loops and grouped them according to the structural-letter sequence, named structural word. This approach permits a systematic analysis of loops of all sizes since we consider the structural motifs of seven residues rather than complete loops. We focused the analysis on highly recurrent words of loops (observed more than 30 times). Our study reveals that 73% of loop-lengths are covered by only 3310 highly recurrent structural words out of 28274 observed words). These structural words have low structural variability (mean RMSd of 0.85 A). As expected, half of these motifs display a flanking-region preference but interestingly, two thirds are shared by short (less than 12 residues) and long loops. Moreover, half of recurrent motifs exhibit a significant level of

  15. Structural and kinetic mapping of side-chain exposure onto the protein energy landscape.

    Science.gov (United States)

    Bernstein, Rachel; Schmidt, Kierstin L; Harbury, Pehr B; Marqusee, Susan

    2011-06-28

    Identification and characterization of structural fluctuations that occur under native conditions is crucial for understanding protein folding and function, but such fluctuations are often rare and transient, making them difficult to study. Native-state hydrogen exchange (NSHX) has been a powerful tool for identifying such rarely populated conformations, but it generally reveals no information about the placement of these species along the folding reaction coordinate or the barriers separating them from the folded state and provides little insight into side-chain packing. To complement such studies, we have performed native-state alkyl-proton exchange, a method analogous to NSHX that monitors cysteine modification rather than backbone amide exchange, to examine the folding landscape of Escherichia coli ribonuclease H, a protein well characterized by hydrogen exchange. We have chosen experimental conditions such that the rate-limiting barrier acts as a kinetic partition: residues that become exposed only upon crossing the unfolding barrier are modified in the EX1 regime (alkylation rates report on the rate of unfolding), while those exposed on the native side of the barrier are modified predominantly in the EX2 regime (alkylation rates report on equilibrium populations). This kinetic partitioning allows for identification and placement of partially unfolded forms along the reaction coordinate. Using this approach we detect previously unidentified, rarely populated conformations residing on the native side of the barrier and identify side chains that are modified only upon crossing the unfolding barrier. Thus, in a single experiment under native conditions, both sides of the rate-limiting barrier are investigated.

  16. Structural protein relationships among eastern equine encephalitis viruses.

    Science.gov (United States)

    Strizki, J M; Repik, P M

    1994-11-01

    We have re-evaluated the relationships among the polypeptides of eastern equine encephalitis (EEE) viruses using SDS-PAGE and peptide mapping of individual virion proteins. Four to five distinct polypeptide bands were detected upon SDS-PAGE analysis of viruses: the E1, E2 and C proteins normally associated with alphavirus virions, as well as an additional more rapidly-migrating E2-associated protein and a high M(r) (HMW) protein. In contrast with previous findings by others, the electrophoretic profiles of the virion proteins of EEE viruses displayed a marked correlation with serotype. The protein profiles of the 33 North American (NA)-serotype viruses examined were remarkably homogeneous, with variation detected only in the E1 protein of two isolates. In contrast, considerable heterogeneity was observed in the migration profiles of both the E1 and E2 glycoproteins of the 13 South American (SA)-type viruses examined. Peptide mapping of individual virion proteins using limited proteolysis with Staphylococcus aureus V8 protease confirmed that, in addition to the homogeneity evident among NA-type viruses and relative heterogeneity among SA-type viruses, the E1 and E2 proteins of NA- and SA-serotype viruses exhibited serotype-specific structural variation. The C protein was highly conserved among isolates of both virus serotypes. Endoglycosidase analyses of intact virions did not reveal substantial glycosylation differences between the glycoproteins of NA- and SA-serotype viruses. Both the HMW protein and the E2 protein (doublet) of EEE virus appeared to contain, at least in part, high-mannose type N-linked oligosaccharides. No evidence of O-linked glycans was found on either the E1 or the E2 glycoprotein. Despite the observed structural differences between proteins of NA- and SA-type viruses, Western blot analyses utilizing polyclonal antibodies indicated that immunoreactive epitopes appeared to be conserved.

  17. Supramolecular Architectures and Mimics of Complex Natural Folds Derived from Rationally Designed alpha-Helical Protein Structures

    Science.gov (United States)

    Tavenor, Nathan Albert

    Protein-based supramolecular polymers (SMPs) are a class of biomaterials which draw inspiration from and expand upon the many examples of complex protein quaternary structures observed in nature: collagen, microtubules, viral capsids, etc. Designing synthetic supramolecular protein scaffolds both increases our understanding of natural superstructures and allows for the creation of novel materials. Similar to small-molecule SMPs, protein-based SMPs form due to self-assembly driven by intermolecular interactions between monomers, and monomer structure determines the properties of the overall material. Using protein-based monomers takes advantage of the self-assembly and highly specific molecular recognition properties encodable in polypeptide sequences to rationally design SMP architectures. The central hypothesis underlying our work is that alpha-helical coiled coils, a well-studied protein quaternary folding motif, are well-suited to SMP design through the addition of synthetic linkers at solvent-exposed sites. Through small changes in the structures of the cross-links and/or peptide sequence, we have been able to control both the nanoscale organization and the macroscopic properties of the SMPs. Changes to the linker and hydrophobic core of the peptide can be used to control polymer rigidity, stability, and dimensionality. The gaps in knowledge that this thesis sought to fill on this project were 1) the relationship between the molecular structure of the cross-linked polypeptides and the macroscopic properties of the SMPs and 2) a means of creating materials exhibiting multi-dimensional net or framework topologies. Separate from the above efforts on supramolecular architectures was work on improving backbone modification strategies for an alpha-helix in the context of a complex protein tertiary fold. Earlier work in our lab had successfully incorporated unnatural building blocks into every major secondary structure (beta-sheet, alpha-helix, loops and beta

  18. Linking structural features of protein complexes and biological function.

    Science.gov (United States)

    Sowmya, Gopichandran; Breen, Edmond J; Ranganathan, Shoba

    2015-09-01

    Protein-protein interaction (PPI) establishes the central basis for complex cellular networks in a biological cell. Association of proteins with other proteins occurs at varying affinities, yet with a high degree of specificity. PPIs lead to diverse functionality such as catalysis, regulation, signaling, immunity, and inhibition, playing a crucial role in functional genomics. The molecular principle of such interactions is often elusive in nature. Therefore, a comprehensive analysis of known protein complexes from the Protein Data Bank (PDB) is essential for the characterization of structural interface features to determine structure-function relationship. Thus, we analyzed a nonredundant dataset of 278 heterodimer protein complexes, categorized into major functional classes, for distinguishing features. Interestingly, our analysis has identified five key features (interface area, interface polar residue abundance, hydrogen bonds, solvation free energy gain from interface formation, and binding energy) that are discriminatory among the functional classes using Kruskal-Wallis rank sum test. Significant correlations between these PPI interface features amongst functional categories are also documented. Salt bridges correlate with interface area in regulator-inhibitors (r = 0.75). These representative features have implications for the prediction of potential function of novel protein complexes. The results provide molecular insights for better understanding of PPIs and their relation to biological functions. © 2015 The Protein Society.

  19. A computer graphics program system for protein structure representation.

    Science.gov (United States)

    Ross, A M; Golub, E E

    1988-01-01

    We have developed a computer graphics program system for the schematic representation of several protein secondary structure analysis algorithms. The programs calculate the probability of occurrence of alpha-helix, beta-sheet and beta-turns by the method of Chou and Fasman and assign unique predicted structure to each residue using a novel conflict resolution algorithm based on maximum likelihood. A detailed structure map containing secondary structure, hydrophobicity, sequence identity, sequence numbering and the location of putative N-linked glycosylation sites is then produced. In addition, helical wheel diagrams and hydrophobic moment calculations can be performed to further analyze the properties of selected regions of the sequence. As they require only structure specification as input, the graphics programs can easily be adapted for use with other secondary structure prediction schemes. The use of these programs to analyze protein structure-function relationships is described and evaluated. PMID:2832829

  20. Crystal structure of Homo sapiens protein LOC79017

    Energy Technology Data Exchange (ETDEWEB)

    Bae, Euiyoung; Bingman, Craig A.; Aceti, David J.; Phillips, Jr., George N. (UW)

    2010-02-08

    LOC79017 (MW 21.0 kDa, residues 1-188) was annotated as a hypothetical protein encoded by Homo sapiens chromosome 7 open reading frame 24. It was selected as a target by the Center for Eukaryotic Structural Genomics (CESG) because it did not share more than 30% sequence identity with any protein for which the three-dimensional structure is known. The biological function of the protein has not been established yet. Parts of LOC79017 were identified as members of uncharacterized Pfam families (residues 1-95 as PB006073 and residues 104-180 as PB031696). BLAST searches revealed homologues of LOC79017 in many eukaryotes, but none of them have been functionally characterized. Here, we report the crystal structure of H. sapiens protein LOC79017 (UniGene code Hs.530024, UniProt code O75223, CESG target number go.35223).

  1. Deprotonated imidodiphosphate in AMPPNP-containing protein structures

    International Nuclear Information System (INIS)

    Dauter, Miroslawa; Dauter, Zbigniew

    2011-01-01

    In certain AMPPNP-containing protein structures, the nitrogen bridging the two terminal phosphate groups can be deprotonated. Many different proteins utilize the chemical energy provided by the cofactor adenosine triphosphate (ATP) for their proper function. A number of structures in the Protein Data Bank (PDB) contain adenosine 5′-(β,γ-imido)triphosphate (AMPPNP), a nonhydrolysable analog of ATP in which the bridging O atom between the two terminal phosphate groups is substituted by the imido function. Under mild conditions imides do not have acidic properties and thus the imide nitrogen should be protonated. However, an analysis of protein structures containing AMPPNP reveals that the imide group is deprotonated in certain complexes if the negative charges of the phosphate moieties in AMPPNP are in part neutralized by coordinating divalent metals or a guanidinium group of an arginine

  2. EVA: continuous automatic evaluation of protein structure prediction servers.

    Science.gov (United States)

    Eyrich, V A; Martí-Renom, M A; Przybylski, D; Madhusudhan, M S; Fiser, A; Pazos, F; Valencia, A; Sali, A; Rost, B

    2001-12-01

    Evaluation of protein structure prediction methods is difficult and time-consuming. Here, we describe EVA, a web server for assessing protein structure prediction methods, in an automated, continuous and large-scale fashion. Currently, EVA evaluates the performance of a variety of prediction methods available through the internet. Every week, the sequences of the latest experimentally determined protein structures are sent to prediction servers, results are collected, performance is evaluated, and a summary is published on the web. EVA has so far collected data for more than 3000 protein chains. These results may provide valuable insight to both developers and users of prediction methods. http://cubic.bioc.columbia.edu/eva. eva@cubic.bioc.columbia.edu

  3. De novo protein structure generation from incomplete chemical shift assignments

    Energy Technology Data Exchange (ETDEWEB)

    Shen Yang [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States); Vernon, Robert; Baker, David [University of Washington, Department of Biochemistry and Howard Hughes Medical Institute (United States); Bax, Ad [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)], E-mail: bax@nih.gov

    2009-02-15

    NMR chemical shifts provide important local structural information for proteins. Consistent structure generation from NMR chemical shift data has recently become feasible for proteins with sizes of up to 130 residues, and such structures are of a quality comparable to those obtained with the standard NMR protocol. This study investigates the influence of the completeness of chemical shift assignments on structures generated from chemical shifts. The Chemical-Shift-Rosetta (CS-Rosetta) protocol was used for de novo protein structure generation with various degrees of completeness of the chemical shift assignment, simulated by omission of entries in the experimental chemical shift data previously used for the initial demonstration of the CS-Rosetta approach. In addition, a new CS-Rosetta protocol is described that improves robustness of the method for proteins with missing or erroneous NMR chemical shift input data. This strategy, which uses traditional Rosetta for pre-filtering of the fragment selection process, is demonstrated for two paramagnetic proteins and also for two proteins with solid-state NMR chemical shift assignments.

  4. Blind Test of Physics-Based Prediction of Protein Structures

    Science.gov (United States)

    Shell, M. Scott; Ozkan, S. Banu; Voelz, Vincent; Wu, Guohong Albert; Dill, Ken A.

    2009-01-01

    We report here a multiprotein blind test of a computer method to predict native protein structures based solely on an all-atom physics-based force field. We use the AMBER 96 potential function with an implicit (GB/SA) model of solvation, combined with replica-exchange molecular-dynamics simulations. Coarse conformational sampling is performed using the zipping and assembly method (ZAM), an approach that is designed to mimic the putative physical routes of protein folding. ZAM was applied to the folding of six proteins, from 76 to 112 monomers in length, in CASP7, a community-wide blind test of protein structure prediction. Because these predictions have about the same level of accuracy as typical bioinformatics methods, and do not utilize information from databases of known native structures, this work opens up the possibility of predicting the structures of membrane proteins, synthetic peptides, or other foldable polymers, for which there is little prior knowledge of native structures. This approach may also be useful for predicting physical protein folding routes, non-native conformations, and other physical properties from amino acid sequences. PMID:19186130

  5. Relationship between Molecular Structure Characteristics of Feed Proteins and Protein Digestibility and Solubility

    Directory of Open Access Journals (Sweden)

    Mingmei Bai

    2016-08-01

    Full Text Available The nutritional value of feed proteins and their utilization by livestock are related not only to the chemical composition but also to the structure of feed proteins, but few studies thus far have investigated the relationship between the structure of feed proteins and their solubility as well as digestibility in monogastric animals. To address this question we analyzed soybean meal, fish meal, corn distiller’s dried grains with solubles, corn gluten meal, and feather meal by Fourier transform infrared (FTIR spectroscopy to determine the protein molecular spectral band characteristics for amides I and II as well as α-helices and β-sheets and their ratios. Protein solubility and in vitro digestibility were measured with the Kjeldahl method using 0.2% KOH solution and the pepsin-pancreatin two-step enzymatic method, respectively. We found that all measured spectral band intensities (height and area of feed proteins were correlated with their the in vitro digestibility and solubility (p≤0.003; moreover, the relatively quantitative amounts of α-helices, random coils, and α-helix to β-sheet ratio in protein secondary structures were positively correlated with protein in vitro digestibility and solubility (p≤0.004. On the other hand, the percentage of β-sheet structures was negatively correlated with protein in vitro digestibility (p<0.001 and solubility (p = 0.002. These results demonstrate that the molecular structure characteristics of feed proteins are closely related to their in vitro digestibility at 28 h and solubility. Furthermore, the α-helix-to-β-sheet ratio can be used to predict the nutritional value of feed proteins.

  6. Characterization of structural proteins of hirame rhabdovirus, HRV

    Science.gov (United States)

    Nishizawa, Toyohiko; Yoshimizu, Mamoru; Winton, James; Ahne, Winfried; Kimura, Takahisa

    1991-01-01

    Structural proteins of hirame rhabdovirus (HRV) were analyzed by SDS-polyacrylarnide gel electrophoresis, western blotting, 2-dimensional gel electrophoresis, and Triton X-100 treatment. Purified HRV virions were composed of: polymerase (L), glycoprotein (G), nucleoprotein (N), and 2 matrix proteins (M1 and M2). Based upon their relative mobilities, the estimated molecular weights of the proteins were: L, 156 KDa; G, 68 KDa; N, 46.4 KDa; M1, 26.4 KDa; and M2, 19.9 KDa. The electrophorehc pattern formed by the structural proteins of HRV was clearly different from that formed by pike fry rhabdovirus, spring viremia of carp virus, eel virus of America, and eel virus European X which belong to the Vesiculovirus genus; however, it resembled the pattern formed by structural proteins of viral hemorrhagic septicemia virus (VHSV) and infectious hematopoietic necrosis virus (IHNV) which are members of the Lyssavirus genus. Among HRV, IHNV, and VHSV, differences were observed in the relative mobilities of the G, N, M1, and M2 proteins. Western blot analysis revealed that the G. N, and M2 proteins of HRV shared antigenic determinants with IHNV and VHSV, but not with any of the 4 fish vesiculoviruses tested. Cross-reactions between the M1 proteins of HRV, IHNV, or VHSV were not detected in this assay. Two-dimensional gel electrophoresis was used to show that HRV differed from IHNV or VHSV in the isoelectric point (PI) of the M1 and M2 proteins. In this system, 2 forms of the M1 protein of HRV and IHNV were observed.These subspecies of M1 had the same relative mobility but different p1 values. Treatment of purified virions with 2% Triton X-100 in Tris buffer containing NaCl removed the G, M1, and M2 proteins of IHNV, but HRV virions were more stable under these conditions.

  7. Cold-set globular protein gels: Interactions, structure and rheology as a function of protein concentration.

    NARCIS (Netherlands)

    Alting, A.C.; Hamer, R.J.; Kruif, de C.G.

    2003-01-01

    We identified the contribution of covalent and noncovalent interactions to the scaling behavior of the structural and rheological properties in a cold gelling protein system. The system we studied consisted of two types of whey protein aggregates, equal in size but different in the amount of

  8. Identification of structural domains in proteins by a graph heuristic

    NARCIS (Netherlands)

    Wernisch, Lorenz; Hunting, M.M.G.; Wodak, Shoshana J.

    1999-01-01

    A novel automatic procedure for identifying domains from protein atomic coordinates is presented. The procedure, termed STRUDL (STRUctural Domain Limits), does not take into account information on secondary structures and handles any number of domains made up of contiguous or non-contiguous chain

  9. Connecting Protein Structure to Intermolecular Interactions: A Computer Modeling Laboratory

    Science.gov (United States)

    Abualia, Mohammed; Schroeder, Lianne; Garcia, Megan; Daubenmire, Patrick L.; Wink, Donald J.; Clark, Ginevra A.

    2016-01-01

    An understanding of protein folding relies on a solid foundation of a number of critical chemical concepts, such as molecular structure, intra-/intermolecular interactions, and relating structure to function. Recent reports show that students struggle on all levels to achieve these understandings and use them in meaningful ways. Further, several…

  10. Backbone resonance assignments of the outer membrane lipoprotein FrpD from Neisseria meningitidis

    Czech Academy of Sciences Publication Activity Database

    Bumba, Ladislav; Sviridova, E.; Kutá-Smatanová, Ivana; Řezáčová, Pavlína; Veverka, Václav

    2014-01-01

    Roč. 8, č. 1 (2014), s. 53-55 ISSN 1874-2718 R&D Projects: GA ČR(CZ) GAP207/11/0717; GA MŠk(CZ) LK11205 Institutional support: RVO:61388963 ; RVO:61388971 ; RVO:67179843 Keywords : Neisseria meningitidis * FrpC * FrpD * backbone assignments * NMR * iron-regulated protein Subject RIV: CE - Biochemistry Impact factor: 0.760, year: 2014

  11. Live Zika virus chimeric vaccine candidate based on a yellow fever 17-D attenuated backbone

    OpenAIRE

    Nougairede, Antoine; Klitting, Raphaelle; Aubry, Fabien; Gilles, Magali; Touret, Franck; De Lamballerie, Xavier

    2018-01-01

    Zika virus (ZIKV) recently dispersed throughout the tropics and sub-tropics causing epidemics associated with congenital disease and neurological complications. There is currently no commercial vaccine for ZIKV. Here we describe the initial development of a chimeric virus containing the prM/E proteins of a ZIKV epidemic strain incorporated into a yellow fever 17-D attenuated backbone. Using the versatile and rapid ISA (Infectious Subgenomic Amplicons) reverse genetics method, we compared diff...

  12. The Protein Model Portal--a comprehensive resource for protein structure and model information.

    Science.gov (United States)

    Haas, Juergen; Roth, Steven; Arnold, Konstantin; Kiefer, Florian; Schmidt, Tobias; Bordoli, Lorenza; Schwede, Torsten

    2013-01-01

    The Protein Model Portal (PMP) has been developed to foster effective use of 3D molecular models in biomedical research by providing convenient and comprehensive access to structural information for proteins. Both experimental structures and theoretical models for a given protein can be searched simultaneously and analyzed for structural variability. By providing a comprehensive view on structural information, PMP offers the opportunity to apply consistent assessment and validation criteria to the complete set of structural models available for proteins. PMP is an open project so that new methods developed by the community can contribute to PMP, for example, new modeling servers for creating homology models and model quality estimation servers for model validation. The accuracy of participating modeling servers is continuously evaluated by the Continuous Automated Model EvaluatiOn (CAMEO) project. The PMP offers a unique interface to visualize structural coverage of a protein combining both theoretical models and experimental structures, allowing straightforward assessment of the model quality and hence their utility. The portal is updated regularly and actively developed to include latest methods in the field of computational structural biology. Database URL: http://www.proteinmodelportal.org.

  13. The Protein Model Portal—a comprehensive resource for protein structure and model information

    Science.gov (United States)

    Haas, Juergen; Roth, Steven; Arnold, Konstantin; Kiefer, Florian; Schmidt, Tobias; Bordoli, Lorenza; Schwede, Torsten

    2013-01-01

    The Protein Model Portal (PMP) has been developed to foster effective use of 3D molecular models in biomedical research by providing convenient and comprehensive access to structural information for proteins. Both experimental structures and theoretical models for a given protein can be searched simultaneously and analyzed for structural variability. By providing a comprehensive view on structural information, PMP offers the opportunity to apply consistent assessment and validation criteria to the complete set of structural models available for proteins. PMP is an open project so that new methods developed by the community can contribute to PMP, for example, new modeling servers for creating homology models and model quality estimation servers for model validation. The accuracy of participating modeling servers is continuously evaluated by the Continuous Automated Model EvaluatiOn (CAMEO) project. The PMP offers a unique interface to visualize structural coverage of a protein combining both theoretical models and experimental structures, allowing straightforward assessment of the model quality and hence their utility. The portal is updated regularly and actively developed to include latest methods in the field of computational structural biology. Database URL: http://www.proteinmodelportal.org PMID:23624946

  14. Protein Structural Change Data - PSCDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us PSCDB Protein Structural Change Data Data detail Data name Protein Structural Change Data DO...History of This Database Site Policy | Contact Us Protein Structural Change Data - PSCDB | LSDB Archive ...

  15. Protein 3D structure computed from evolutionary sequence variation.

    Directory of Open Access Journals (Sweden)

    Debora S Marks

    Full Text Available The evolutionary trajectory of a protein through sequence space is constrained by its function. Collections of sequence homologs record the outcomes of millions of evolutionary experiments in which the protein evolves according to these constraints. Deciphering the evolutionary record held in these sequences and exploiting it for predictive and engineering purposes presents a formidable challenge. The potential benefit of solving this challenge is amplified by the advent of inexpensive high-throughput genomic sequencing.In this paper we ask whether we can infer evolutionary constraints from a set of sequence homologs of a protein. The challenge is to distinguish true co-evolution couplings from the noisy set of observed correlations. We address this challenge using a maximum entropy model of the protein sequence, constrained by the statistics of the multiple sequence alignment, to infer residue pair couplings. Surprisingly, we find that the strength of these inferred couplings is an excellent predictor of residue-residue proximity in folded structures. Indeed, the top-scoring residue couplings are sufficiently accurate and well-distributed to define the 3D protein fold with remarkable accuracy.We quantify this observation by computing, from sequence alone, all-atom 3D structures of fifteen test proteins from different fold classes, ranging in size from 50 to 260 residues, including a G-protein coupled receptor. These blinded inferences are de novo, i.e., they do not use homology modeling or sequence-similar fragments from known structures. The co-evolution signals provide sufficient information to determine accurate 3D protein structure to 2.7-4.8 Å C(α-RMSD error relative to the observed structure, over at least two-thirds of the protein (method called EVfold, details at http://EVfold.org. This discovery provides insight into essential interactions constraining protein evolution and will facilitate a comprehensive survey of the universe of

  16. Structure and Dynamic Properties of Membrane Proteins using NMR

    DEFF Research Database (Denmark)

    Rösner, Heike; Kragelund, Birthe

    2012-01-01

    conformational changes. Their structural and functional decoding is challenging and has imposed demanding experimental development. Solution nuclear magnetic resonance (NMR) spectroscopy is one of the techniques providing the capacity to make a significant difference in the deciphering of the membrane protein...... structure-function paradigm. The method has evolved dramatically during the last decade resulting in a plethora of new experiments leading to a significant increase in the scientific repertoire for studying membrane proteins. Besides solving the three-dimensional structures using state-of-the-art approaches......-populated states, this review seeks to introduce the vast possibilities solution NMR can offer to the study of membrane protein structure-function analyses with special focus on applicability. © 2012 American Physiological Society. Compr Physiol 2:1491-1539, 2012....

  17. Protein structure prediction using bee colony optimization metaheuristic

    DEFF Research Database (Denmark)

    Fonseca, Rasmus; Paluszewski, Martin; Winter, Pawel

    2010-01-01

    of the proteins structure, an energy potential and some optimization algorithm that ¿nds the structure with minimal energy. Bee Colony Optimization (BCO) is a relatively new approach to solving opti- mization problems based on the foraging behaviour of bees. Several variants of BCO have been suggested......Predicting the native structure of proteins is one of the most challenging problems in molecular biology. The goal is to determine the three-dimensional struc- ture from the one-dimensional amino acid sequence. De novo prediction algorithms seek to do this by developing a representation...... our BCO method to generate good solutions to the protein structure prediction problem. The results show that BCO generally ¿nds better solutions than simulated annealing which so far has been the metaheuristic of choice for this problem....

  18. Crystal structure of secretory protein Hcp3 from Pseudomonas aeruginosa.

    Science.gov (United States)

    Osipiuk, Jerzy; Xu, Xiaohui; Cui, Hong; Savchenko, Alexei; Edwards, Aled; Joachimiak, Andrzej

    2011-03-01

    The Type VI secretion pathway transports proteins across the cell envelope of Gram-negative bacteria. Pseudomonas aeruginosa, an opportunistic Gram-negative bacterial pathogen infecting humans, uses the type VI secretion pathway to export specific effector proteins crucial for its pathogenesis. The HSI-I virulence locus encodes for several proteins that has been proposed to participate in protein transport including the Hcp1 protein, which forms hexameric rings that assemble into nanotubes in vitro. Two Hcp1 paralogues have been identified in the P. aeruginosa genome, Hsp2 and Hcp3. Here, we present the structure of the Hcp3 protein from P. aeruginosa. The overall structure of the monomer resembles Hcp1 despite the lack of amino-acid sequence similarity between the two proteins. The monomers assemble into hexamers similar to Hcp1. However, instead of forming nanotubes in head-to-tail mode like Hcp1, Hcp3 stacks its rings in head-to-head mode forming double-ring structures.

  19. Structural Elements Regulating AAA+ Protein Quality Control Machines.

    Science.gov (United States)

    Chang, Chiung-Wen; Lee, Sukyeong; Tsai, Francis T F

    2017-01-01

    Members of the ATPases Associated with various cellular Activities (AAA+) superfamily participate in essential and diverse cellular pathways in all kingdoms of life by harnessing the energy of ATP binding and hydrolysis to drive their biological functions. Although most AAA+ proteins share a ring-shaped architecture, AAA+ proteins have evolved distinct structural elements that are fine-tuned to their specific functions. A central question in the field is how ATP binding and hydrolysis are coupled to substrate translocation through the central channel of ring-forming AAA+ proteins. In this mini-review, we will discuss structural elements present in AAA+ proteins involved in protein quality control, drawing similarities to their known role in substrate interaction by AAA+ proteins involved in DNA translocation. Elements to be discussed include the pore loop-1, the Inter-Subunit Signaling (ISS) motif, and the Pre-Sensor I insert (PS-I) motif. Lastly, we will summarize our current understanding on the inter-relationship of those structural elements and propose a model how ATP binding and hydrolysis might be coupled to polypeptide translocation in protein quality control machines.

  20. Models of protein-ligand crystal structures: trust, but verify.

    Science.gov (United States)

    Deller, Marc C; Rupp, Bernhard

    2015-09-01

    X-ray crystallography provides the most accurate models of protein-ligand structures. These models serve as the foundation of many computational methods including structure prediction, molecular modelling, and structure-based drug design. The success of these computational methods ultimately depends on the quality of the underlying protein-ligand models. X-ray crystallography offers the unparalleled advantage of a clear mathematical formalism relating the experimental data to the protein-ligand model. In the case of X-ray crystallography, the primary experimental evidence is the electron density of the molecules forming the crystal. The first step in the generation of an accurate and precise crystallographic model is the interpretation of the electron density of the crystal, typically carried out by construction of an atomic model. The atomic model must then be validated for fit to the experimental electron density and also for agreement with prior expectations of stereochemistry. Stringent validation of protein-ligand models has become possible as a result of the mandatory deposition of primary diffraction data, and many computational tools are now available to aid in the validation process. Validation of protein-ligand complexes has revealed some instances of overenthusiastic interpretation of ligand density. Fundamental concepts and metrics of protein-ligand quality validation are discussed and we highlight software tools to assist in this process. It is essential that end users select high quality protein-ligand models for their computational and biological studies, and we provide an overview of how this can be achieved.

  1. Rotational order–disorder structure of fluorescent protein FP480

    International Nuclear Information System (INIS)

    Pletnev, Sergei; Morozova, Kateryna S.; Verkhusha, Vladislav V.; Dauter, Zbigniew

    2009-01-01

    An analysis of the rotational order–disorder structure of fluorescent protein FP480 is presented. In the last decade, advances in instrumentation and software development have made crystallography a powerful tool in structural biology. Using this method, structural information can now be acquired from pathological crystals that would have been abandoned in earlier times. In this paper, the order–disorder (OD) structure of fluorescent protein FP480 is discussed. The structure is composed of tetramers with 222 symmetry incorporated into the lattice in two different ways, namely rotated 90° with respect to each other around the crystal c axis, with tetramer axes coincident with crystallographic twofold axes. The random distribution of alternatively oriented tetramers in the crystal creates a rotational OD structure with statistically averaged I422 symmetry, although the presence of very weak and diffuse additional reflections suggests that the randomness is only approximate

  2. Improved Energy Bound Accuracy Enhances the Efficiency of Continuous Protein Design

    OpenAIRE

    Roberts, Kyle E.; Donald, Bruce R.

    2015-01-01

    Flexibility and dynamics are important for protein function and a protein’s ability to accommodate amino acid substitutions. However, when computational protein design algorithms search over protein structures, the allowed flexibility is often reduced to a relatively small set of discrete side-chain and backbone conformations. While simplifications in scoring functions and protein flexibility are currently necessary to computationally search the vast protein sequence and conformational space,...

  3. DNA nanotubes for NMR structure determination of membrane proteins.

    Science.gov (United States)

    Bellot, Gaëtan; McClintock, Mark A; Chou, James J; Shih, William M

    2013-04-01

    Finding a way to determine the structures of integral membrane proteins using solution nuclear magnetic resonance (NMR) spectroscopy has proved to be challenging. A residual-dipolar-coupling-based refinement approach can be used to resolve the structure of membrane proteins up to 40 kDa in size, but to do this you need a weak-alignment medium that is detergent-resistant and it has thus far been difficult to obtain such a medium suitable for weak alignment of membrane proteins. We describe here a protocol for robust, large-scale synthesis of detergent-resistant DNA nanotubes that can be assembled into dilute liquid crystals for application as weak-alignment media in solution NMR structure determination of membrane proteins in detergent micelles. The DNA nanotubes are heterodimers of 400-nm-long six-helix bundles, each self-assembled from a M13-based p7308 scaffold strand and >170 short oligonucleotide staple strands. Compatibility with proteins bearing considerable positive charge as well as modulation of molecular alignment, toward collection of linearly independent restraints, can be introduced by reducing the negative charge of DNA nanotubes using counter ions and small DNA-binding molecules. This detergent-resistant liquid-crystal medium offers a number of properties conducive for membrane protein alignment, including high-yield production, thermal stability, buffer compatibility and structural programmability. Production of sufficient nanotubes for four or five NMR experiments can be completed in 1 week by a single individual.

  4. The structure of pyogenecin immunity protein, a novel bacteriocin-like immunity protein from streptococcus pyogenes.

    Energy Technology Data Exchange (ETDEWEB)

    Chang, C.; Coggill, P.; Bateman, A.; Finn, R.; Cymborowski, M.; Otwinowski, Z.; Minor, W.; Volkart, L.; Joachimiak, A.; Wellcome Trust Sanger Inst.; Univ. of Virginia; UT Southwestern Medical Center

    2009-12-17

    Many Gram-positive lactic acid bacteria (LAB) produce anti-bacterial peptides and small proteins called bacteriocins, which enable them to compete against other bacteria in the environment. These peptides fall structurally into three different classes, I, II, III, with class IIa being pediocin-like single entities and class IIb being two-peptide bacteriocins. Self-protective cognate immunity proteins are usually co-transcribed with these toxins. Several examples of cognates for IIa have already been solved structurally. Streptococcus pyogenes, closely related to LAB, is one of the most common human pathogens, so knowledge of how it competes against other LAB species is likely to prove invaluable. We have solved the crystal structure of the gene-product of locus Spy-2152 from S. pyogenes, (PDB: 2fu2), and found it to comprise an anti-parallel four-helix bundle that is structurally similar to other bacteriocin immunity proteins. Sequence analyses indicate this protein to be a possible immunity protein protective against class IIa or IIb bacteriocins. However, given that S. pyogenes appears to lack any IIa pediocin-like proteins but does possess class IIb bacteriocins, we suggest this protein confers immunity to IIb-like peptides. Combined structural, genomic and proteomic analyses have allowed the identification and in silico characterization of a new putative immunity protein from S. pyogenes, possibly the first structure of an immunity protein protective against potential class IIb two-peptide bacteriocins. We have named the two pairs of putative bacteriocins found in S. pyogenes pyogenecin 1, 2, 3 and 4.

  5. Influence of secondary structure on in-source decay of protein in matrix-assisted laser desorption/ionization mass spectrometry.

    Science.gov (United States)

    Takayama, Mitsuo; Osaka, Issey; Sakakura, Motoshi

    2012-01-01

    The susceptibility of the N-Cα bond of the peptide backbone to specific cleavage by in-source decay (ISD) in matrix-assisted laser desorption/ionization mass spectrometry (MALDI MS) was studied from the standpoint of the secondary structure of three proteins. A naphthalene derivative, 5-amino-1-naphtol (5,1-ANL), was used as the matrix. The resulting c'-ions, which originate from the cleavage at N-Cα bonds in flexible secondary structures such as turn and bend, and are free from intra-molecular hydrogen-bonded α-helix structure, gave relatively intense peaks. Furthermore, ISD spectra of the proteins showed that the N-Cα bonds of specific amino acid residues, namely Gly-Xxx, Xxx-Asp, and Xxx-Asn, were more susceptible to MALDI-ISD than other amino acid residues. This is in agreement with the observation that Gly, Asp and Asn residues usually located in turns, rather than α-helix. The results obtained indicate that protein molecules embedded into the matrix crystal in the MALDI experiments maintain their secondary structures as determined by X-ray crystallography, and that MALDI-ISD has the capability for providing information concerning the secondary structure of protein.

  6. Radiation safety system (RSS) backbones: Design, engineering, fabrication and installation

    International Nuclear Information System (INIS)

    Wilmarth, J.E.; Sturrock, J.C.; Gallegos, F.R.

    1998-01-01

    The Radiation Safety System (RSS) Backbones are part of an electrical/electronic/mechanical system insuring safe access and exclusion of personnel to areas at the Los Alamos Neutron Science Center (LANSCE) accelerator. The RSS Backbones control the safety fusible beam plugs which terminate transmission of accelerated ion beams in response to predefined conditions. Any beam or access fault of the backbone inputs will cause insertion of the beam plugs in the low energy beam transport. The Backbones serve the function of tying the beam plugs to the access control systems, beam spill monitoring systems and current-level limiting systems. In some ways the Backbones may be thought of as a spinal column with beam plugs at the head and nerve centers along the spinal column. The two Linac Backbone segments and experimental area segments form a continuous cable plant over 3,500 feet from beam plugs to the tip on the longest tail. The Backbones were installed in compliance with current safety standards, such as installation of the two segments in separate conduits or tray. Monitoring for ground-faults and input wiring verification was an added enhancement to the system. The system has the capability to be tested remotely

  7. Constraint Logic Programming approach to protein structure prediction

    Directory of Open Access Journals (Sweden)

    Fogolari Federico

    2004-11-01

    Full Text Available Abstract Background The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems. Results Constraint Logic Programming techniques have been applied to the protein structure prediction problem on the face-centered cube lattice model. Molecular dynamics techniques, endowed with the notion of constraint, have been also exploited. Even using a very simplified model, Constraint Logic Programming on the face-centered cube lattice model allowed us to obtain acceptable results for a few small proteins. As a test implementation their (known secondary structure and the presence of disulfide bridges are used as constraints. Simplified structures obtained in this way have been converted to all atom models with plausible structure. Results have been compared with a similar approach using a well-established technique as molecular dynamics. Conclusions The results obtained on small proteins show that Constraint Logic Programming techniques can be employed for studying protein simplified models, which can be converted into realistic all atom models. The advantage of Constraint Logic Programming over other, much more explored, methodologies, resides in the rapid software prototyping, in the easy way of encoding heuristics, and in exploiting all the advances made in this research area, e.g. in constraint propagation and its use for pruning the huge search space.

  8. Constraint Logic Programming approach to protein structure prediction.

    Science.gov (United States)

    Dal Palù, Alessandro; Dovier, Agostino; Fogolari, Federico

    2004-11-30

    The protein structure prediction problem is one of the most challenging problems in biological sciences. Many approaches have been proposed using database information and/or simplified protein models. The protein structure prediction problem can be cast in the form of an optimization problem. Notwithstanding its importance, the problem has very seldom been tackled by Constraint Logic Programming, a declarative programming paradigm suitable for solving combinatorial optimization problems. Constraint Logic Programming techniques have been applied to the protein structure prediction problem on the face-centered cube lattice model. Molecular dynamics techniques, endowed with the notion of constraint, have been also exploited. Even using a very simplified model, Constraint Logic Programming on the face-centered cube lattice model allowed us to obtain acceptable results for a few small proteins. As a test implementation their (known) secondary structure and the presence of disulfide bridges are used as constraints. Simplified structures obtained in this way have been converted to all atom models with plausible structure. Results have been compared with a similar approach using a well-established technique as molecular dynamics. The results obtained on small proteins show that Constraint Logic Programming techniques can be employed for studying protein simplified models, which can be converted into realistic all atom models. The advantage of Constraint Logic Programming over other, much more explored, methodologies, resides in the rapid software prototyping, in the easy way of encoding heuristics, and in exploiting all the advances made in this research area, e.g. in constraint propagation and its use for pruning the huge search space.

  9. Protein folding and the organization of the protein topology universe

    DEFF Research Database (Denmark)

    Lindorff-Larsen,, Kresten; Røgen, Peter; Paci, Emanuele

    2005-01-01

    residues and, in addition, that the topology of the transition state is closer to that of the native state than to that of any other fold in the protein universe. Here, we review the evidence for these conclusions and suggest a molecular mechanism that rationalizes these findings by presenting a view...... of protein folds that is based on the topological features of the polypeptide backbone, rather than the conventional view that depends on the arrangement of different types of secondary-structure elements. By linking the folding process to the organization of the protein structure universe, we propose...

  10. Polystyrene Backbone Polymers Consisting of Alkyl-Substituted Triazine Side Groups for Phosphorescent OLEDs

    OpenAIRE

    Salert, Beatrice Ch. D.; Wedel, Armin; Grubert, Lutz; Eberle, Thomas; Anémian, Rémi; Krueger, Hartmut

    2012-01-01

    This paper describes the synthesis of new electron-transporting styrene monomers and their corresponding polystyrenes all with a 2,4,6-triphenyl-1,3,5-triazine basic structure in the side group. The monomers differ in the alkyl substitution and in the meta-/paralinkage of the triazine to the polymer backbone. The thermal and spectroscopic properties of the new electron-transporting polymers are discussed in regard to their chemical structures. Phosphorescent OLEDs were prepared using the obta...

  11. Structural Basis for Target Protein Regcognition by Thiredoxin

    DEFF Research Database (Denmark)

    Maeda, Kenji

    2007-01-01

    Ser) and a mutant of an in vitro substrate alpha-amylase/subtilisin inhibitor (BASI) (Cys144Ser), as a reaction intermediate-mimic of Trx-catalyzed disulfide reduction. The resultant structure showed a sequence of BASI residues along a conserved hydrophobic groove constituted of three loop segments...... of Trx-fold proteins glutaredoxin and glutathione transferase. This study suggests that the features of main chain conformation as well as charge property around disulfide bonds in protein substrates are important factors for interaction with Trx. Moreover, this study describes a detailed structural......Thioredoxin (Trx) is an ubiquitous protein disulfide reductase that possesses two redox active cysteines in the conserved active site sequence motif, Trp-CysN-Gly/Pro-Pro-CysC situated in the so called Trx-fold. The lack of insight into the protein substrate recognition mechanism of Trx has to date...

  12. Fundamental Characteristics of AAA+ Protein Family Structure and Function.

    Science.gov (United States)

    Miller, Justin M; Enemark, Eric J

    2016-01-01

    Many complex cellular events depend on multiprotein complexes known as molecular machines to efficiently couple the energy derived from adenosine triphosphate hydrolysis to the generation of mechanical force. Members of the AAA+ ATPase superfamily (ATPases Associated with various cellular Activities) are critical components of many molecular machines. AAA+ proteins are defined by conserved modules that precisely position the active site elements of two adjacent subunits to catalyze ATP hydrolysis. In many cases, AAA+ proteins form a ring structure that translocates a polymeric substrate through the central channel using specialized loops that project into the central channel. We discuss the major features of AAA+ protein structure and function with an emphasis on pivotal aspects elucidated with archaeal proteins.

  13. A resource for benchmarking the usefulness of protein structure models.

    KAUST Repository

    Carbajo, Daniel

    2012-08-02

    BACKGROUND: Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. RESULTS: This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. CONCLUSIONS: The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. IMPLEMENTATION, AVAILABILITY AND REQUIREMENTS: Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php.Operating system(s): Platform independent. Programming language: Perl-BioPerl (program); mySQL, Perl DBI and DBD modules (database); php, JavaScript, Jmol scripting (web server). Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet) and PSAIA. License: Free. Any restrictions to use by

  14. A resource for benchmarking the usefulness of protein structure models.

    Science.gov (United States)

    Carbajo, Daniel; Tramontano, Anna

    2012-08-02

    Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. IMPLEMENTATION, AVAILABILITY AND REQUIREMENTS: Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php.Operating system(s): Platform independent. Programming language: Perl-BioPerl (program); mySQL, Perl DBI and DBD modules (database); php, JavaScript, Jmol scripting (web server). Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet) and PSAIA. License: Free. Any restrictions to use by non-academics: No.

  15. A resource for benchmarking the usefulness of protein structure models.

    KAUST Repository

    Carbajo, Daniel; Tramontano, Anna

    2012-01-01

    BACKGROUND: Increasingly, biologists and biochemists use computational tools to design experiments to probe the function of proteins and/or to engineer them for a variety of different purposes. The most effective strategies rely on the knowledge of the three-dimensional structure of the protein of interest. However it is often the case that an experimental structure is not available and that models of different quality are used instead. On the other hand, the relationship between the quality of a model and its appropriate use is not easy to derive in general, and so far it has been analyzed in detail only for specific application. RESULTS: This paper describes a database and related software tools that allow testing of a given structure based method on models of a protein representing different levels of accuracy. The comparison of the results of a computational experiment on the experimental structure and on a set of its decoy models will allow developers and users to assess which is the specific threshold of accuracy required to perform the task effectively. CONCLUSIONS: The ModelDB server automatically builds decoy models of different accuracy for a given protein of known structure and provides a set of useful tools for their analysis. Pre-computed data for a non-redundant set of deposited protein structures are available for analysis and download in the ModelDB database. IMPLEMENTATION, AVAILABILITY AND REQUIREMENTS: Project name: A resource for benchmarking the usefulness of protein structure models. Project home page: http://bl210.caspur.it/MODEL-DB/MODEL-DB_web/MODindex.php.Operating system(s): Platform independent. Programming language: Perl-BioPerl (program); mySQL, Perl DBI and DBD modules (database); php, JavaScript, Jmol scripting (web server). Other requirements: Java Runtime Environment v1.4 or later, Perl, BioPerl, CPAN modules, HHsearch, Modeller, LGA, NCBI Blast package, DSSP, Speedfill (Surfnet) and PSAIA. License: Free. Any restrictions to use by

  16. Lipid nanotechnologies for structural studies of membrane-associated proteins.

    Science.gov (United States)

    Stoilova-McPhie, Svetla; Grushin, Kirill; Dalm, Daniela; Miller, Jaimy

    2014-11-01

    We present a methodology of lipid nanotubes (LNT) and nanodisks technologies optimized in our laboratory for structural studies of membrane-associated proteins at close to physiological conditions. The application of these lipid nanotechnologies for structure determination by cryo-electron microscopy (cryo-EM) is fundamental for understanding and modulating their function. The LNTs in our studies are single bilayer galactosylceramide based nanotubes of ∼20 nm inner diameter and a few microns in length, that self-assemble in aqueous solutions. The lipid nanodisks (NDs) are self-assembled discoid lipid bilayers of ∼10 nm diameter, which are stabilized in aqueous solutions by a belt of amphipathic helical scaffold proteins. By combining LNT and ND technologies, we can examine structurally how the membrane curvature and lipid composition modulates the function of the membrane-associated proteins. As proof of principle, we have engineered these lipid nanotechnologies to mimic the activated platelet's phosphtaidylserine rich membrane and have successfully assembled functional membrane-bound coagulation factor VIII in vitro for structure determination by cryo-EM. The macromolecular organization of the proteins bound to ND and LNT are further defined by fitting the known atomic structures within the calculated three-dimensional maps. The combination of LNT and ND technologies offers a means to control the design and assembly of a wide range of functional membrane-associated proteins and complexes for structural studies by cryo-EM. The presented results confirm the suitability of the developed methodology for studying the functional structure of membrane-associated proteins, such as the coagulation factors, at a close to physiological environment. © 2014 Wiley Periodicals, Inc.

  17. Distance matrix-based approach to protein structure prediction.

    Science.gov (United States)

    Kloczkowski, Andrzej; Jernigan, Robert L; Wu, Zhijun; Song, Guang; Yang, Lei; Kolinski, Andrzej; Pokarowski, Piotr

    2009-03-01

    Much structural information is encoded in the internal distances; a distance matrix-based approach can be used to predict protein structure and dynamics, and for structural refinement. Our approach is based on the square distance matrix D = [r(ij)(2)] containing all square distances between residues in proteins. This distance matrix contains more information than the contact matrix C, that has elements of either 0 or 1 depending on whether the distance r (ij) is greater or less than a cutoff value r (cutoff). We have performed spectral decomposition of the distance matrices D = sigma lambda(k)V(k)V(kT), in terms of eigenvalues lambda kappa and the corresponding eigenvectors v kappa and found that it contains at most five nonzero terms. A dominant eigenvector is proportional to r (2)--the square distance of points from the center of mass, with the next three being the principal components of the system of points. By predicting r (2) from the sequence we can approximate a distance matrix of a protein with an expected RMSD value of about 7.3 A, and by combining it with the prediction of the first principal component we can improve this approximation to 4.0 A. We can also explain the role of hydrophobic interactions for the protein structure, because r is highly correlated with the hydrophobic profile of the sequence. Moreover, r is highly correlated with several sequence profiles which are useful in protein structure prediction, such as contact number, the residue-wise contact order (RWCO) or mean square fluctuations (i.e. crystallographic temperature factors). We have also shown that the next three components are related to spatial directionality of the secondary structure elements, and they may be also predicted from the sequence, improving overall structure prediction. We have also shown that the large number of available HIV-1 protease structures provides a remarkable sampling of conformations, which can be viewed as direct structural information about the

  18. Phylogenetic and structural analysis of centromeric DNA and kinetochore proteins

    OpenAIRE

    Meraldi, Patrick; McAinsh, Andrew D; Rheinbay, Esther; Sorger, Peter K

    2006-01-01

    Background: Kinetochores are large multi-protein structures that assemble on centromeric DNA (CEN DNA) and mediate the binding of chromosomes to microtubules. Comprising 125 base-pairs of CEN DNA and 70 or more protein components, Saccharomyces cerevisiae kinetochores are among the best understood. In contrast, most fungal, plant and animal cells assemble kinetochores on CENs that are longer and more complex, raising the question of whether kinetochore architecture has been conserved through ...

  19. Predicting protein structures with a multiplayer online game

    OpenAIRE

    Cooper, Seth; Khatib, Firas; Treuille, Adrien; Barbero, Janos; Lee, Jeehyung; Beenen, Michael; Leaver-Fay, Andrew; Baker, David; Popović, Zoran

    2010-01-01

    People exert significant amounts of problem solving effort playing computer games. Simple image- and text-recognition tasks have been successfully crowd-sourced through gamesi, ii, iii, but it is not clear if more complex scientific problems can be similarly solved with human-directed computing. Protein structure prediction is one such problem: locating the biologically relevant native conformation of a protein is a formidable computational challenge given the very large size of the search sp...

  20. Structures and Interactions of Proteins in the Brain

    DEFF Research Database (Denmark)

    Nielsen, Lau Dalby

    The protein low density lipoprotein receptor related protein 1 (LRP1) plays multiple roles in the biology of amyloid β peptide (Aβ) and Alzheimer’s disease. LRP1 is very important for clearance of Aβ both in the brain and by facilitating Aβ export over the blood brain barrier. In spite of the app......The protein low density lipoprotein receptor related protein 1 (LRP1) plays multiple roles in the biology of amyloid β peptide (Aβ) and Alzheimer’s disease. LRP1 is very important for clearance of Aβ both in the brain and by facilitating Aβ export over the blood brain barrier. In spite...... coding for Arc protein has been domesticated from the same branch of genes that has given rise to retroviruses. We show that even despite the large evolutional distance between Arc and retroviruses. Despite large evolutionary distance Arc still self-assemble into higher order structures that resembles...

  1. Structure and assembly of a paramyxovirus matrix protein.

    Science.gov (United States)

    Battisti, Anthony J; Meng, Geng; Winkler, Dennis C; McGinnes, Lori W; Plevka, Pavel; Steven, Alasdair C; Morrison, Trudy G; Rossmann, Michael G

    2012-08-28

    Many pleomorphic, lipid-enveloped viruses encode matrix proteins that direct their assembly and budding, but the mechanism of this process is unclear. We have combined X-ray crystallography and cryoelectron tomography to show that the matrix protein of Newcastle disease virus, a paramyxovirus and relative of measles virus, forms dimers that assemble into pseudotetrameric arrays that generate the membrane curvature necessary for virus budding. We show that the glycoproteins are anchored in the gaps between the matrix proteins and that the helical nucleocapsids are associated in register with the matrix arrays. About 90% of virions lack matrix arrays, suggesting that, in agreement with previous biological observations, the matrix protein needs to dissociate from the viral membrane during maturation, as is required for fusion and release of the nucleocapsid into the host's cytoplasm. Structure and sequence conservation imply that other paramyxovirus matrix proteins function similarly.

  2. Structure and Modification of Electrode Materials for Protein Electrochemistry.

    Science.gov (United States)

    Jeuken, Lars J C

    The interactions between proteins and electrode surfaces are of fundamental importance in bioelectrochemistry, including photobioelectrochemistry. In order to optimise the interaction between electrode and redox protein, either the electrode or the protein can be engineered, with the former being the most adopted approach. This tutorial review provides a basic description of the most commonly used electrode materials in bioelectrochemistry and discusses approaches to modify these surfaces. Carbon, gold and transparent electrodes (e.g. indium tin oxide) are covered, while approaches to form meso- and macroporous structured electrodes are also described. Electrode modifications include the chemical modification with (self-assembled) monolayers and the use of conducting polymers in which the protein is imbedded. The proteins themselves can either be in solution, electrostatically adsorbed on the surface or covalently bound to the electrode. Drawbacks and benefits of each material and its modifications are discussed. Where examples exist of applications in photobioelectrochemistry, these are highlighted.

  3. (PS)2: protein structure prediction server version 3.0.

    Science.gov (United States)

    Huang, Tsun-Tsao; Hwang, Jenn-Kang; Chen, Chu-Huang; Chu, Chih-Sheng; Lee, Chi-Wen; Chen, Chih-Chieh

    2015-07-01

    Protein complexes are involved in many biological processes. Examining coupling between subunits of a complex would be useful to understand the molecular basis of protein function. Here, our updated (PS)(2) web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, this server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure could be indicated and visualized by Java-based 3D graphics viewers and the structural and evolutionary profiles are shown and compared chain-by-chain. For each subunit, considerations with or without the packing contribution of other subunits cause the differences in similarities between structural and evolutionary profiles, and these differences imply which form, complex or monomeric, is preferred in the biological condition for the subunit. We believe that the (PS)(2) server would be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. The (PS)(2) is freely available at http://ps2v3.life.nctu.edu.tw/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Structural Conservation of the Myoviridae Phage Tail Sheath Protein Fold

    Energy Technology Data Exchange (ETDEWEB)

    Aksyuk, Anastasia A.; Kurochkina, Lidia P.; Fokine, Andrei; Forouhar, Farhad; Mesyanzhinov, Vadim V.; Tong, Liang; Rossmann, Michael G. (SOIBC); (Purdue); (Columbia)

    2012-02-21

    Bacteriophage phiKZ is a giant phage that infects Pseudomonas aeruginosa, a human pathogen. The phiKZ virion consists of a 1450 {angstrom} diameter icosahedral head and a 2000 {angstrom}-long contractile tail. The structure of the whole virus was previously reported, showing that its tail organization in the extended state is similar to the well-studied Myovirus bacteriophage T4 tail. The crystal structure of a tail sheath protein fragment of phiKZ was determined to 2.4 {angstrom} resolution. Furthermore, crystal structures of two prophage tail sheath proteins were determined to 1.9 and 3.3 {angstrom} resolution. Despite low sequence identity between these proteins, all of these structures have a similar fold. The crystal structure of the phiKZ tail sheath protein has been fitted into cryo-electron-microscopy reconstructions of the extended tail sheath and of a polysheath. The structural rearrangement of the phiKZ tail sheath contraction was found to be similar to that of phage T4.

  5. Structural History of Human SRGAP2 Proteins.

    Science.gov (United States)

    Sporny, Michael; Guez-Haddad, Julia; Kreusch, Annett; Shakartzi, Sivan; Neznansky, Avi; Cross, Alice; Isupov, Michail N; Qualmann, Britta; Kessels, Michael M; Opatowsky, Yarden

    2017-06-01

    In the development of the human brain, human-specific genes are considered to play key roles, conferring its unique advantages and vulnerabilities. At the time of Homo lineage divergence from Australopithecus, SRGAP2C gradually emerged through a process of serial duplications and mutagenesis from ancestral SRGAP2A (3.4-2.4 Ma). Remarkably, ectopic expression of SRGAP2C endows cultured mouse brain cells, with human-like characteristics, specifically, increased dendritic spine length and density. To understand the molecular mechanisms underlying this change in neuronal morphology, we determined the structure of SRGAP2A and studied the interplay between SRGAP2A and SRGAP2C. We found that: 1) SRGAP2A homo-dimerizes through a large interface that includes an F-BAR domain, a newly identified F-BAR extension (Fx), and RhoGAP-SH3 domains. 2) SRGAP2A has an unusual inverse geometry, enabling associations with lamellipodia and dendritic spine heads in vivo, and scaffolding of membrane protrusions in cell culture. 3) As a result of the initial partial duplication event (∼3.4 Ma), SRGAP2C carries a defective Fx-domain that severely compromises its solubility and membrane-scaffolding ability. Consistently, SRGAP2A:SRAGP2C hetero-dimers form, but are insoluble, inhibiting SRGAP2A activity. 4) Inactivation of SRGAP2A is sensitive to the level of hetero-dimerization with SRGAP2C. 5) The primal form of SRGAP2C (P-SRGAP2C, existing between ∼3.4 and 2.4 Ma) is less effective in hetero-dimerizing with SRGAP2A than the modern SRGAP2C, which carries several substitutions (from ∼2.4 Ma). Thus, the genetic mutagenesis phase contributed to modulation of SRGAP2A's inhibition of neuronal expansion, by introducing and improving the formation of inactive SRGAP2A:SRGAP2C hetero-dimers, indicating a stepwise involvement of SRGAP2C in human evolutionary history. © The Author 2017. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. Structural basis of protein oxidation resistance: a lysozyme study.

    Directory of Open Access Journals (Sweden)

    Marion Girod

    Full Text Available Accumulation of oxidative damage in proteins correlates with aging since it can cause irreversible and progressive degeneration of almost all cellular functions. Apparently, native protein structures have evolved intrinsic resistance to oxidation since perfectly folded proteins are, by large most robust. Here we explore the structural basis of protein resistance to radiation-induced oxidation using chicken egg white lysozyme in the native and misfolded form. We study the differential resistance to oxidative damage of six different parts of native and misfolded lysozyme by a targeted tandem/mass spectrometry approach of its tryptic fragments. The decay of the amount of each lysozyme fragment with increasing radiation dose is found to be a two steps process, characterized by a double exponential evolution of their amounts: the first one can be largely attributed to oxidation of specific amino acids, while the second one corresponds to further degradation of the protein. By correlating these results to the structural parameters computed from molecular dynamics (MD simulations, we find the protein parts with increased root-mean-square deviation (RMSD to be more susceptible to modifications. In addition, involvement of amino acid side-chains in hydrogen bonds has a protective effect against oxidation Increased exposure to solvent of individual amino acid side chains correlates with high susceptibility to oxidative and other modifications like side chain fragmentation. Generally, while none of the structural parameters alone can account for the fate of peptides during radiation, together they provide an insight into the relationship between protein structure and susceptibility to oxidation.

  7. Contingency Table Browser - prediction of early stage protein structure.

    Science.gov (United States)

    Kalinowska, Barbara; Krzykalski, Artur; Roterman, Irena

    2015-01-01

    The Early Stage (ES) intermediate represents the starting structure in protein folding simulations based on the Fuzzy Oil Drop (FOD) model. The accuracy of FOD predictions is greatly dependent on the accuracy of the chosen intermediate. A suitable intermediate can be constructed using the sequence-structure relationship information contained in the so-called contingency table - this table expresses the likelihood of encountering various structural motifs for each tetrapeptide fragment in the amino acid sequence. The limited accuracy with which such structures could previously be predicted provided the motivation for a more indepth study of the contingency table itself. The Contingency Table Browser is a tool which can visualize, search and analyze the table. Our work presents possible applications of Contingency Table Browser, among them - analysis of specific protein sequences from the point of view of their structural ambiguity.

  8. Electron transfer reactions in structural units of copper proteins

    International Nuclear Information System (INIS)

    Faraggi, M.

    1975-01-01

    In previous pulse radiolysis studies it was suggested that the reduction of the Cu(II) ions in copper proteins by the hydrated electron is a multi-step electron migration process. The technique has been extended to investigate the reduction of some structural units of these proteins. These studies include: the reaction of the hydrated electron with peptides, the reaction of the disulphide bridge with formate radical ion and radicals produced by the reduction of peptides, and the reaction of Cu(II)-peptide complex with esub(aq)sup(-) and CO 2 - . Using these results the reduction mechanism of copper and other proteins will be discussed. (author)

  9. Three-dimensional protein structure prediction: Methods and computational strategies.

    Science.gov (United States)

    Dorn, Márcio; E Silva, Mariel Barbachan; Buriol, Luciana S; Lamb, Luis C

    2014-10-12

    A long standing problem in structural bioinformatics is to determine the three-dimensional (3-D) structure of a protein when only a sequence of amino acid residues is given. Many computational methodologies and algorithms have been proposed as a solution to the 3-D Protein Structure Prediction (3-D-PSP) problem. These methods can be divided in four main classes: (a) first principle methods without database information; (b) first principle methods with database information; (c) fold recognition and threading methods; and (d) comparative modeling methods and sequence alignment strategies. Deterministic computational techniques, optimization techniques, data mining and machine learning approaches are typically used in the construction of computational solutions for the PSP problem. Our main goal with this work is to review the methods and computational strategies that are currently used in 3-D protein prediction. Copyright © 2014 Elsevier Ltd. All rights reserved.

  10. PROGRAM SYSTEM AND INFORMATION METADATA BANK OF TERTIARY PROTEIN STRUCTURES

    Directory of Open Access Journals (Sweden)

    T. A. Nikitin

    2013-01-01

    Full Text Available The article deals with the architecture of metadata storage model for check results of three-dimensional protein structures. Concept database model was built. The service and procedure of database update as well as data transformation algorithms for protein structures and their quality were presented. Most important information about entries and their submission forms to store, access, and delivery to users were highlighted. Software suite was developed for the implementation of functional tasks using Java programming language in the NetBeans v.7.0 environment and JQL to query and interact with the database JavaDB. The service was tested and results have shown system effectiveness while protein structures filtration.

  11. SA-Search: a web tool for protein structure mining based on a Structural Alphabet

    OpenAIRE

    Guyon, Frédéric; Camproux, Anne-Claude; Hochez, Joëlle; Tufféry, Pierre

    2004-01-01

    SA-Search is a web tool that can be used to mine for protein structures and extract structural similarities. It is based on a hidden Markov model derived Structural Alphabet (SA) that allows the compression of three-dimensional (3D) protein conformations into a one-dimensional (1D) representation using a limited number of prototype conformations. Using such a representation, classical methods developed for amino acid sequences can be employed. Currently, SA-Search permits the performance of f...

  12. RACK1, A Multifaceted Scaffolding Protein: Structure and Function

    LENUS (Irish Health Repository)

    Adams, David R

    2011-10-06

    Abstract The Receptor for Activated C Kinase 1 (RACK1) is a member of the tryptophan-aspartate repeat (WD-repeat) family of proteins and shares significant homology to the β subunit of G-proteins (Gβ). RACK1 adopts a seven-bladed β-propeller structure which facilitates protein binding. RACK1 has a significant role to play in shuttling proteins around the cell, anchoring proteins at particular locations and in stabilising protein activity. It interacts with the ribosomal machinery, with several cell surface receptors and with proteins in the nucleus. As a result, RACK1 is a key mediator of various pathways and contributes to numerous aspects of cellular function. Here, we discuss RACK1 gene and structure and its role in specific signaling pathways, and address how posttranslational modifications facilitate subcellular location and translocation of RACK1. This review condenses several recent studies suggesting a role for RACK1 in physiological processes such as development, cell migration, central nervous system (CN) function and circadian rhythm as well as reviewing the role of RACK1 in disease.

  13. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-05-25

    The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.

  14. Functional classification of protein structures by local structure matching in graph representation.

    Science.gov (United States)

    Mills, Caitlyn L; Garg, Rohan; Lee, Joslynn S; Tian, Liang; Suciu, Alexandru; Cooperman, Gene; Beuning, Penny J; Ondrechen, Mary Jo

    2018-03-31

    As a result of high-throughput protein structure initiatives, over 14,400 protein structures have been solved by structural genomics (SG) centers and participating research groups. While the totality of SG data represents a tremendous contribution to genomics and structural biology, reliable functional information for these proteins is generally lacking. Better functional predictions for SG proteins will add substantial value to the structural information already obtained. Our method described herein, Graph Representation of Active Sites for Prediction of Function (GRASP-Func), predicts quickly and accurately the biochemical function of proteins by representing residues at the predicted local active site as graphs rather than in Cartesian coordinates. We compare the GRASP-Func method to our previously reported method, structurally aligned local sites of activity (SALSA), using the ribulose phosphate binding barrel (RPBB), 6-hairpin glycosidase (6-HG), and Concanavalin A-like Lectins/Glucanase (CAL/G) superfamilies as test cases. In each of the superfamilies, SALSA and the much faster method GRASP-Func yield similar correct classification of previously characterized proteins, providing a validated benchmark for the new method. In addition, we analyzed SG proteins using our SALSA and GRASP-Func methods to predict function. Forty-one SG proteins in the RPBB superfamily, nine SG proteins in the 6-HG superfamily, and one SG protein in the CAL/G superfamily were successfully classified into one of the functional families in their respective superfamily by both methods. This improved, faster, validated computational method can yield more reliable predictions of function that can be used for a wide variety of applications by the community. © 2018 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.

  15. Improved protein structure reconstruction using secondary structures, contacts at higher distance thresholds, and non-contacts.

    Science.gov (United States)

    Adhikari, Badri; Cheng, Jianlin

    2017-08-29

    Residue-residue contacts are key features for accurate de novo protein structure prediction. For the optimal utilization of these predicted contacts in folding proteins accurately, it is important to study the challenges of reconstructing protein structures using true contacts. Because contact-guided protein modeling approach is valuable for predicting the folds of proteins that do not have structural templates, it is necessary for reconstruction studies to focus on hard-to-predict protein structures. Using a data set consisting of 496 structural domains released in recent CASP experiments and a dataset of 150 representative protein structures, in this work, we discuss three techniques to improve the reconstruction accuracy using true contacts - adding secondary structures, increasing contact distance thresholds, and adding non-contacts. We find that reconstruction using secondary structures and contacts can deliver accuracy higher than using full contact maps. Similarly, we demonstrate that non-contacts can improve reconstruction accuracy not only when the used non-contacts are true but also when they are predicted. On the dataset consisting of 150 proteins, we find that by simply using low ranked predicted contacts as non-contacts and adding them as additional restraints, can increase the reconstruction accuracy by 5% when the reconstructed models are evaluated using TM-score. Our findings suggest that secondary structures are invaluable companions of contacts for accurate reconstruction. Confirming some earlier findings, we also find that larger distance thresholds are useful for folding many protein structures which cannot be folded using the standard definition of contacts. Our findings also suggest that for more accurate reconstruction using predicted contacts it is useful to predict contacts at higher distance thresholds (beyond 8 Å) and predict non-contacts.

  16. Protein Secondary Structure Prediction Using Deep Convolutional Neural Fields.

    Science.gov (United States)

    Wang, Sheng; Peng, Jian; Ma, Jianzhu; Xu, Jinbo

    2016-01-11

    Protein secondary structure (SS) prediction is important for studying protein structure and function. When only the sequence (profile) information is used as input feature, currently the best predictors can obtain ~80% Q3 accuracy, which has not been improved in the past decade. Here we present DeepCNF (Deep Convolutional Neural Fields) for protein SS prediction. DeepCNF is a Deep Learning extension of Conditional Neural Fields (CNF), which is an integration of Conditional Random Fields (CRF) and shallow neural networks. DeepCNF can model not only complex sequence-structure relationship by a deep hierarchical architecture, but also interdependency between adjacent SS labels, so it is much more powerful than CNF. Experimental results show that DeepCNF can obtain ~84% Q3 accuracy, ~85% SOV score, and ~72% Q8 accuracy, respectively, on the CASP and CAMEO test proteins, greatly outperforming currently popular predictors. As a general framework, DeepCNF can be used to predict other protein structure properties such as contact number, disorder regions, and solvent accessibility.

  17. A tensegrity model for hydrogen bond networks in proteins

    Directory of Open Access Journals (Sweden)

    Robert P. Bywater

    2017-05-01

    Full Text Available Hydrogen-bonding networks in proteins considered as structural tensile elements are in balance separately from any other stabilising interactions that may be in operation. The hydrogen bond arrangement in the network is reminiscent of tensegrity structures in architecture and sculpture. Tensegrity has been discussed before in cells and tissues and in proteins. In contrast to previous work only hydrogen bonds are studied here. The other interactions within proteins are either much stronger − covalent bonds connecting the atoms in the molecular skeleton or weaker forces like the so-called hydrophobic interactions. It has been demonstrated that the latter operate independently from hydrogen bonds. Each category of interaction must, if the protein is to have a stable structure, balance out. The hypothesis here is that the entire hydrogen bond network is in balance without any compensating contributions from other types of interaction. For sidechain-sidechain, sidechain-backbone and backbone-backbone hydrogen bonds in proteins, tensegrity balance (“closure” is required over the entire length of the polypeptide chain that defines individually folding units in globular proteins (“domains” as well as within the repeating elements in fibrous proteins that consist of extended chain structures. There is no closure to be found in extended structures that do not have repeating elements. This suggests an explanation as to why globular domains, as well as the repeat units in fibrous proteins, have to have a defined number of residues. Apart from networks of sidechain-sidechain hydrogen bonds there are certain key points at which this closure is achieved in the sidechain-backbone hydrogen bonds and these are associated with demarcation points at the start or end of stretches of secondary structure. Together, these three categories of hydrogen bond achieve the closure that is necessary for the stability of globular protein domains as well as repeating

  18. Taking MAD to the extreme: ultrafast protein structure determination

    International Nuclear Information System (INIS)

    Walsh, M.A.; Dementieva, I.; Evans, G.; Sanishvili, R.; Joachimiak, A.

    1999-01-01

    Multiwavelength anomalous diffraction data were measured in 23 min from a 16 kDa selenomethionyl-substituted protein, producing experimental phases to 2.25 (angstrom) resolution. The data were collected on a mosaic 3 x 3 charge-coupled device using undulator radiation from the Structural Biology Center 19ID beamline at the Argonne National Laboratory's Advanced Photon Source. The phases were independently obtained semiautomatically by two crystallographic program suites, CCP4 and CNS. The quality and speed of this data acquisition exemplify the opportunities at third-generation synchrotron sources for high-throughput protein crystal structure determination

  19. Automatic protein structure solution from weak X-ray data

    Science.gov (United States)

    Skubák, Pavol; Pannu, Navraj S.

    2013-11-01

    Determining new protein structures from X-ray diffraction data at low resolution or with a weak anomalous signal is a difficult and often an impossible task. Here we propose a multivariate algorithm that simultaneously combines the structure determination steps. In tests on over 140 real data sets from the protein data bank, we show that this combined approach can automatically build models where current algorithms fail, including an anisotropically diffracting 3.88 Å RNA polymerase II data set. The method seamlessly automates the process, is ideal for non-specialists and provides a mathematical framework for successfully combining various sources of information in image processing.

  20. Prediction of protein-protein interactions in dengue virus coat proteins guided by low resolution cryoEM structures

    Directory of Open Access Journals (Sweden)

    Srinivasan Narayanaswamy

    2010-06-01

    Full Text Available Abstract Background Dengue virus along with the other members of the flaviviridae family has reemerged as deadly human pathogens. Understanding the mechanistic details of these infections can be highly rewarding in developing effective antivirals. During maturation of the virus inside the host cell, the coat proteins E and M undergo conformational changes, altering the morphology of the viral coat. However, due to low resolution nature of the available 3-D structures of viral assemblies, the atomic details of these changes are still elusive. Results In the present analysis, starting from Cα positions of low resolution cryo electron microscopic structures the residue level details of protein-protein interaction interfaces of dengue virus coat proteins have been predicted. By comparing the preexisting structures of virus in different phases of life cycle, the changes taking place in these predicted protein-protein interaction interfaces were followed as a function of maturation process of the virus. Besides changing the current notion about the presence of only homodimers in the mature viral coat, the present analysis indicated presence of a proline-rich motif at the protein-protein interaction interface of the coat protein. Investigating the conservation status of these seemingly functionally crucial residues across other members of flaviviridae family enabled dissecting common mechanisms used for infections by these viruses. Conclusions Thus, using computational approach the present analysis has provided better insights into the preexisting low resolution structures of virus assemblies, the findings of which can be made use of in designing effective antivirals against these deadly human pathogens.

  1. Critical Features of Fragment Libraries for Protein Structure Prediction.

    Science.gov (United States)

    Trevizani, Raphael; Custódio, Fábio Lima; Dos Santos, Karina Baptista; Dardenne, Laurent Emmanuel

    2017-01-01

    The use of fragment libraries is a popular approach among protein structure prediction methods and has proven to substantially improve the quality of predicted structures. However, some vital aspects of a fragment library that influence the accuracy of modeling a native structure remain to be determined. This study investigates some of these features. Particularly, we analyze the effect of using secondary structure prediction guiding fragments selection, different fragments sizes and the effect of structural clustering of fragments within libraries. To have a clearer view of how these factors affect protein structure prediction, we isolated the process of model building by fragment assembly from some common limitations associated with prediction methods, e.g., imprecise energy functions and optimization algorithms, by employing an exact structure-based objective function under a greedy algorithm. Our results indicate that shorter fragments reproduce the native structure more accurately than the longer. Libraries composed of multiple fragment lengths generate even better structures, where longer fragments show to be more useful at the beginning of the simulations. The use of many different fragment sizes shows little improvement when compared to predictions carried out with libraries that comprise only three different fragment sizes. Models obtained from libraries built using only sequence similarity are, on average, better than those built with a secondary structure prediction bias. However, we found that the use of secondary structure prediction allows greater reduction of the search space, which is invaluable for prediction methods. The results of this study can be critical guidelines for the use of fragment libraries in protein structure prediction.

  2. Predicting and validating protein interactions using network structure.

    Directory of Open Access Journals (Sweden)

    Pao-Yang Chen

    2008-07-01

    Full Text Available Protein interactions play a vital part in the function of a cell. As experimental techniques for detection and validation of protein interactions are time consuming, there is a need for computational methods for this task. Protein interactions appear to form a network with a relatively high degree of local clustering. In this paper we exploit this clustering by suggesting a score based on triplets of observed protein interactions. The score utilises both protein characteristics and network properties. Our score based on triplets is shown to complement existing techniques for predicting protein interactions, outperforming them on data sets which display a high degree of clustering. The predicted interactions score highly against test measures for accuracy. Compared to a similar score derived from pairwise interactions only, the triplet score displays higher sensitivity and specificity. By looking at specific examples, we show how an experimental set of interactions can be enriched and validated. As part of this work we also examine the effect of different prior databases upon the accuracy of prediction and find that the interact