WorldWideScience

Sample records for accurate protein identification

  1. Mass spectrometry based protein identification with accurate statistical significance assignment

    OpenAIRE

    Alves, Gelio; Yu, Yi-Kuo

    2014-01-01

    Motivation: Assigning statistical significance accurately has become increasingly important as meta data of many types, often assembled in hierarchies, are constructed and combined for further biological analyses. Statistical inaccuracy of meta data at any level may propagate to downstream analyses, undermining the validity of scientific conclusions thus drawn. From the perspective of mass spectrometry based proteomics, even though accurate statistics for peptide identification can now be ach...

  2. Rapid identification of sequences for orphan enzymes to power accurate protein annotation.

    Directory of Open Access Journals (Sweden)

    Kevin R Ramkissoon

    The power of genome sequencing depends on the ability to understand what those genes and their proteins products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the "back catalog" of enzymology--"orphan enzymes," those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme "back catalog" is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology's "back catalog" another powerful tool to drive accurate genome annotation.

  3. Rapid identification of sequences for orphan enzymes to power accurate protein annotation.

    Science.gov (United States)

    Ramkissoon, Kevin R; Miller, Jennifer K; Ojha, Sunil; Watson, Douglas S; Bomar, Martha G; Galande, Amit K; Shearer, Alexander G

    2013-01-01

    The power of genome sequencing depends on the ability to understand what those genes and their proteins products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the "back catalog" of enzymology--"orphan enzymes," those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme "back catalog" is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology's "back catalog" another powerful tool to drive accurate genome annotation.

  4. Rapid Identification of Sequences for Orphan Enzymes to Power Accurate Protein Annotation

    Science.gov (United States)

    Ojha, Sunil; Watson, Douglas S.; Bomar, Martha G.; Galande, Amit K.; Shearer, Alexander G.

    2013-01-01

    The power of genome sequencing depends on the ability to understand what those genes and their proteins products actually do. The automated methods used to assign functions to putative proteins in newly sequenced organisms are limited by the size of our library of proteins with both known function and sequence. Unfortunately this library grows slowly, lagging well behind the rapid increase in novel protein sequences produced by modern genome sequencing methods. One potential source for rapidly expanding this functional library is the “back catalog” of enzymology – “orphan enzymes,” those enzymes that have been characterized and yet lack any associated sequence. There are hundreds of orphan enzymes in the Enzyme Commission (EC) database alone. In this study, we demonstrate how this orphan enzyme “back catalog” is a fertile source for rapidly advancing the state of protein annotation. Starting from three orphan enzyme samples, we applied mass-spectrometry based analysis and computational methods (including sequence similarity networks, sequence and structural alignments, and operon context analysis) to rapidly identify the specific sequence for each orphan while avoiding the most time- and labor-intensive aspects of typical sequence identifications. We then used these three new sequences to more accurately predict the catalytic function of 385 previously uncharacterized or misannotated proteins. We expect that this kind of rapid sequence identification could be efficiently applied on a larger scale to make enzymology’s “back catalog” another powerful tool to drive accurate genome annotation. PMID:24386392

  5. Towards an accurate bioimpedance identification

    Science.gov (United States)

    Sanchez, B.; Louarroudi, E.; Bragos, R.; Pintelon, R.

    2013-04-01

    This paper describes the local polynomial method (LPM) for estimating the time-invariant bioimpedance frequency response function (FRF) considering both the output-error (OE) and the errors-in-variables (EIV) identification framework and compares it with the traditional cross- and autocorrelation spectral analysis techniques. The bioimpedance FRF is measured with the multisine electrical impedance spectroscopy (EIS) technique. To show the overwhelming accuracy of the LPM approach, both the LPM and the classical cross- and autocorrelation spectral analysis technique are evaluated through the same experimental data coming from a nonsteady-state measurement of time-varying in vivo myocardial tissue. The estimated error sources at the measurement frequencies due to noise, σnZ, and the stochastic nonlinear distortions, σZNL, have been converted to Ω and plotted over the bioimpedance spectrum for each framework. Ultimately, the impedance spectra have been fitted to a Cole impedance model using both an unweighted and a weighted complex nonlinear least squares (CNLS) algorithm. A table is provided with the relative standard errors on the estimated parameters to reveal the importance of which system identification frameworks should be used.
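
    As an illustration of the final fitting step mentioned in this abstract, the sketch below evaluates the standard Cole impedance model, Z(ω) = R∞ + (R0 − R∞) / (1 + (jωτ)^α), and a weighted complex least-squares cost. The function names, the parameter names, and the choice of weighting each residual by the inverse noise variance are assumptions made for this example; the paper's own implementation is not shown in the record.

```python
import math


def cole_impedance(freq_hz, r0, r_inf, tau, alpha):
    """Cole model: Z = R_inf + (R0 - R_inf) / (1 + (j*omega*tau)**alpha)."""
    omega = 2.0 * math.pi * freq_hz
    return r_inf + (r0 - r_inf) / (1.0 + (1j * omega * tau) ** alpha)


def weighted_cnls_cost(freqs, measured, sigmas, params):
    """Sum of squared complex residuals, each divided by its noise variance.

    Assumption: weights are 1/sigma**2; pass sigmas of 1.0 for the
    unweighted case.
    """
    r0, r_inf, tau, alpha = params
    cost = 0.0
    for f, z_meas, sigma in zip(freqs, measured, sigmas):
        residual = z_meas - cole_impedance(f, r0, r_inf, tau, alpha)
        cost += abs(residual) ** 2 / sigma ** 2
    return cost
```

    A general-purpose optimizer could then minimize `weighted_cnls_cost` over the four parameters; which optimizer the authors used is not stated here.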

  6. Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features

    Science.gov (United States)

    Li, Yuan; Wang, Mingjun; Wang, Huilin; Tan, Hao; Zhang, Ziding; Webb, Geoffrey I.; Song, Jiangning

    2014-07-01

    Lysine acetylation is a reversible post-translational modification, playing an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identification of substrates and specific acetylation sites is crucial. Experimental identification is often time-consuming and expensive. Alternative bioinformatics methods are cost-effective and can be used in a high-throughput manner to generate relatively precise predictions. Here we develop a method termed as SSPKA for species-specific lysine acetylation prediction, using random forest classifiers that combine sequence-derived and functional features with two-step feature selection. Feature importance analysis indicates functional features, applied for lysine acetylation site prediction for the first time, significantly improve the predictive performance. We apply the SSPKA model to screen the entire human proteome and identify many high-confidence putative substrates that are not previously identified. The results along with the implemented Java tool, serve as useful resources to elucidate the mechanism of lysine acetylation and facilitate hypothesis-driven experimental design and validation.

  7. Accurate pose estimation for forensic identification

    Science.gov (United States)

    Merckx, Gert; Hermans, Jeroen; Vandermeulen, Dirk

    2010-04-01

    In forensic authentication, one aims to identify the perpetrator among a series of suspects or distractors. A fundamental problem in any recognition system that aims for identification of subjects in a natural scene is the lack of constraints on viewing and imaging conditions. In forensic applications, identification proves even more challenging, since most surveillance footage is of abysmal quality. In this context, robust methods for pose estimation are paramount. In this paper we will therefore present a new pose estimation strategy for very low quality footage. Our approach uses 3D-2D registration of a textured 3D face model with the surveillance image to obtain accurate far field pose alignment. Starting from an inaccurate initial estimate, the technique uses novel similarity measures based on the monogenic signal to guide a pose optimization process. We will illustrate the descriptive strength of the introduced similarity measures by using them directly as a recognition metric. Through validation, using both real and synthetic surveillance footage, our pose estimation method is shown to be accurate, and robust to lighting changes and image degradation.

  8. Fast and Accurate Identification of Cross-Linked Peptides for the Structural Analysis of Large Protein Complexes and Elucidation of Interaction Networks. / Tahir, Salman; Bukowski-Wills, Jimi-Carlo; Rasmussen, Morten; Rappsilber, Juri

    DEFF Research Database (Denmark)

    Rasmussen, Morten

    Fast and Accurate Identification of Cross-Linked Peptides for the structural analysis of large protein complexes and to elucidate interaction networks. Salman Tahir; Jimi-Carlo Bukowski-Wills; Morten Rasmussen; Juri Rappsilber. Wellcome Trust Centre for Cell Biology, Edinburgh, United Kingdom. Novel...

  9. Fast and Accurate Identification of Cross-Linked Peptides for the Structural Analysis of Large Protein Complexes and Elucidation of Interaction Networks. / Tahir, Salman; Bukowski-Wills, Jimi-Carlo; Rasmussen, Morten; Rappsilber, Juri

    DEFF Research Database (Denmark)

    Rasmussen, Morten

    Fast and Accurate Identification of Cross-Linked Peptides for the structural analysis of large protein complexes and to elucidate interaction networks. Salman Tahir; Jimi-Carlo Bukowski-Wills; Morten Rasmussen; Juri Rappsilber. Wellcome Trust Centre for Cell Biology, Edinburgh, United Kingdom. Novel...... to investigate protein structure and protein-protein interactions. When applied to single proteins or small purified protein complexes, this methodology works well. However certain challenges arise when applied to more complex samples. One of the main problems is the combinatorial increase in the search space...... Aspect: Our software efficiently and correctly identifies cross-links within large protein complexes, facilitating the construction of low-resolution 3D-models and interaction networks. Introduction: Chemical cross-linking of peptides coupled with mass spectrometry emerges as a powerful method...

  10. HIPPI: highly accurate protein family classification with ensembles of HMMs

    Directory of Open Access Journals (Sweden)

    Nam-phuong Nguyen

    2016-11-01

    Background: Given a new biological sequence, detecting membership in a known family is a basic step in many bioinformatics analyses, with applications to protein structure and function prediction and metagenomic taxon identification and abundance profiling, among others. Yet family identification of sequences that are distantly related to sequences in public databases or that are fragmentary remains one of the more difficult analytical problems in bioinformatics. Results: We present a new technique for family identification called HIPPI (Hierarchical Profile Hidden Markov Models for Protein family Identification). HIPPI uses a novel technique to represent a multiple sequence alignment for a given protein family or superfamily by an ensemble of profile hidden Markov models computed using HMMER. An evaluation of HIPPI on the Pfam database shows that HIPPI has better overall precision and recall than blastp, HMMER, and pipelines based on HHsearch, and maintains good accuracy even for fragmentary query sequences and for protein families with low average pairwise sequence identity, both conditions where other methods degrade in accuracy. Conclusion: HIPPI provides accurate protein family identification and is robust to difficult model conditions. Our results, combined with observations from previous studies, show that ensembles of profile Hidden Markov models can better represent multiple sequence alignments than a single profile Hidden Markov model, and thus can improve downstream analyses for various bioinformatic tasks. Further research is needed to determine the best practices for building the ensemble of profile Hidden Markov models. HIPPI is available on GitHub at https://github.com/smirarab/sepp .

  11. Accurate Identification of Cancerlectins through Hybrid Machine Learning Technology

    Directory of Open Access Journals (Sweden)

    Jieru Zhang

    2016-01-01

    Cancerlectins are cancer-related proteins that function as lectins. They have been identified through computational identification techniques, but these techniques have sometimes failed to identify proteins because of sequence diversity among the cancerlectins. Advanced machine learning identification methods, such as support vector machine and basic sequence features (n-gram), have also been used to identify cancerlectins. In this study, various protein fingerprint features and advanced classifiers, including ensemble learning techniques, were utilized to identify this group of proteins. We improved the prediction accuracy of the original feature extraction methods and classification algorithms by more than 10% on average. Our work provides a basis for the computational identification of cancerlectins and reveals the power of hybrid machine learning techniques in computational proteomics.
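
    The "basic sequence features (n-gram)" mentioned in this abstract can be pictured as the relative frequency of each length-n residue window in a protein sequence. The sketch below is a generic illustration of that idea, not the feature extraction used in the paper; the function name `ngram_features` and the restriction to the 20 standard amino acids are assumptions of this example.

```python
from collections import Counter
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def ngram_features(sequence, n=2):
    """Return the relative frequency of every possible n-gram of residues.

    Windows containing non-standard letters count toward the total but get
    no feature of their own (an assumption of this sketch).
    """
    seq = sequence.upper()
    total = max(len(seq) - n + 1, 0)
    counts = Counter(seq[i:i + n] for i in range(total))
    keys = ["".join(p) for p in product(AMINO_ACIDS, repeat=n)]
    if total == 0:
        return {key: 0.0 for key in keys}
    return {key: counts.get(key, 0) / total for key in keys}
```

    The resulting fixed-length vector (400 values for n=2) is the kind of input a classifier such as a support vector machine could be trained on.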

  12. [A accurate identification method for Chinese materia medica--systematic identification of Chinese materia medica].

    Science.gov (United States)

    Wang, Xue-Yong; Liao, Cai-Li; Liu, Si-Qi; Liu, Chun-Sheng; Shao, Ai-Juan; Huang, Lu-Qi

    2013-05-01

    This paper puts forward a more accurate method for identifying Chinese materia medica (CMM), the systematic identification of Chinese materia medica (SICMM), which might resolve difficulties that arise when CMM is identified by ordinary traditional methods. The concepts, mechanisms and methods of SICMM are systematically introduced, and its feasibility is demonstrated by experiments. Establishing SICMM should address problems in CMM identification not only at the level of phenotypic characters such as morphology, microstructure and chemical constituents, but also in the further discovery of the evolution and classification of species, subspecies and populations of medicinal plants. It should also advance the development of CMM identification and open a broader field of study.

  13. PILER-CR: Fast and accurate identification of CRISPR repeats

    Directory of Open Access Journals (Sweden)

    Edgar Robert C

    2007-01-01

    Background: Sequencing of prokaryotic genomes has recently revealed the presence of CRISPR elements: short, highly conserved repeats separated by unique sequences of similar length. The distinctive sequence signature of CRISPR repeats can be found using general-purpose repeat- or pattern-finding software tools. However, the output of such tools is not always ideal for studying these repeats, and significant effort is sometimes needed to build additional tools and perform manual analysis of the output. Results: We present PILER-CR, a program specifically designed for the identification and analysis of CRISPR repeats. The program executes rapidly, completing a 5 Mb genome in around 5 seconds on a current desktop computer. We validate the algorithm by manual curation and by comparison with published surveys of these repeats, finding that PILER-CR has both high sensitivity and high specificity. We also present a catalogue of putative CRISPR repeats identified in a comprehensive analysis of 346 prokaryotic genomes. Conclusion: PILER-CR is a useful tool for rapid identification and classification of CRISPR repeats. The software is donated to the public domain. Source code and a Linux binary are freely available at http://www.drive5.com/pilercr.
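
    The sequence signature described in this abstract (short conserved repeats separated by unique spacers of similar length) can be illustrated with a naive k-mer scan. This is not the PILER-CR algorithm, which the record does not describe; it is a toy sketch, and the function name and all default parameter values are assumptions chosen for a small example rather than realistic CRISPR dimensions.

```python
from collections import defaultdict


def find_repeat_arrays(genome, k=8, min_copies=3, min_gap=4, max_gap=12):
    """Report exact k-mers that recur with similarly sized gaps between copies.

    Returns a list of (kmer, [start positions]) for every run of at least
    `min_copies` copies whose spacers are between min_gap and max_gap long.
    """
    positions = defaultdict(list)
    for i in range(len(genome) - k + 1):
        positions[genome[i:i + k]].append(i)

    arrays = []
    for kmer, starts in positions.items():
        run = [starts[0]]
        for start in starts[1:]:
            spacer_len = start - (run[-1] + k)
            if min_gap <= spacer_len <= max_gap:
                run.append(start)
            else:
                if len(run) >= min_copies:
                    arrays.append((kmer, run))
                run = [start]
        if len(run) >= min_copies:
            arrays.append((kmer, run))
    return arrays
```

    Real CRISPR repeats are longer and may differ slightly between copies, so a practical tool would need approximate matching rather than the exact k-mer lookup used here.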

  14. An Overview of Practical Applications of Protein Disorder Prediction and Drive for Faster, More Accurate Predictions

    Directory of Open Access Journals (Sweden)

    Xin Deng

    2015-07-01

    Protein disordered regions are segments of a protein chain that do not adopt a stable structure. Thus far, a variety of protein disorder prediction methods have been developed and have been widely used, not only in traditional bioinformatics domains, including protein structure prediction, protein structure determination and function annotation, but also in many other biomedical fields. The relationship between intrinsically-disordered proteins and some human diseases has played a significant role in disorder prediction in disease identification and epidemiological investigations. Disordered proteins can also serve as potential targets for drug discovery with an emphasis on the disordered-to-ordered transition in the disordered binding regions, and this has led to substantial research in drug discovery or design based on protein disordered region prediction. Furthermore, protein disorder prediction has also been applied to healthcare by predicting the disease risk of mutations in patients and studying the mechanistic basis of diseases. As the applications of disorder prediction increase, so too does the need to make quick and accurate predictions. To fill this need, we also present a new approach to predict protein residue disorder using wide sequence windows that is applicable on the genomic scale.

  15. An Overview of Practical Applications of Protein Disorder Prediction and Drive for Faster, More Accurate Predictions.

    Science.gov (United States)

    Deng, Xin; Gumm, Jordan; Karki, Suman; Eickholt, Jesse; Cheng, Jianlin

    2015-07-07

    Protein disordered regions are segments of a protein chain that do not adopt a stable structure. Thus far, a variety of protein disorder prediction methods have been developed and have been widely used, not only in traditional bioinformatics domains, including protein structure prediction, protein structure determination and function annotation, but also in many other biomedical fields. The relationship between intrinsically-disordered proteins and some human diseases has played a significant role in disorder prediction in disease identification and epidemiological investigations. Disordered proteins can also serve as potential targets for drug discovery with an emphasis on the disordered-to-ordered transition in the disordered binding regions, and this has led to substantial research in drug discovery or design based on protein disordered region prediction. Furthermore, protein disorder prediction has also been applied to healthcare by predicting the disease risk of mutations in patients and studying the mechanistic basis of diseases. As the applications of disorder prediction increase, so too does the need to make quick and accurate predictions. To fill this need, we also present a new approach to predict protein residue disorder using wide sequence windows that is applicable on the genomic scale.

  16. A statistical method for assessing peptide identification confidence in accurate mass and time tag proteomics.

    Science.gov (United States)

    Stanley, Jeffrey R; Adkins, Joshua N; Slysz, Gordon W; Monroe, Matthew E; Purvine, Samuel O; Karpievitch, Yuliya V; Anderson, Gordon A; Smith, Richard D; Dabney, Alan R

    2011-08-15

    Current algorithms for quantifying peptide identification confidence in the accurate mass and time (AMT) tag approach assume that the AMT tags themselves have been correctly identified. However, there is uncertainty in the identification of AMT tags, because this is based on matching LC-MS/MS fragmentation spectra to peptide sequences. In this paper, we incorporate confidence measures for the AMT tag identifications into the calculation of probabilities for correct matches to an AMT tag database, resulting in a more accurate overall measure of identification confidence for the AMT tag approach. The method is referenced as Statistical Tools for AMT Tag Confidence (STAC). STAC additionally provides a uniqueness probability (UP) to help distinguish between multiple matches to an AMT tag and a method to calculate an overall false discovery rate (FDR). STAC is freely available for download, as both a command line and a Windows graphical application.
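
    One common way to turn per-match probabilities into an overall false discovery rate is to average the probability of being wrong over the accepted matches. The sketch below shows that generic estimate; it is an assumption that this resembles what STAC computes, since the abstract does not give the formula, and the function and parameter names are hypothetical.

```python
def estimated_fdr(match_probabilities, threshold):
    """Estimate FDR among matches whose probability of being correct
    is at least `threshold`, as the mean of (1 - p) over accepted matches."""
    accepted = [p for p in match_probabilities if p >= threshold]
    if not accepted:
        return 0.0  # nothing accepted, so no false discoveries to count
    return sum(1.0 - p for p in accepted) / len(accepted)
```

    Lowering the threshold accepts more matches but raises the estimated FDR, which is the trade-off a user would tune.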

  17. Identification of Microorganisms by High Resolution Tandem Mass Spectrometry with Accurate Statistical Significance

    Science.gov (United States)

    Alves, Gelio; Wang, Guanghui; Ogurtsov, Aleksey Y.; Drake, Steven K.; Gucek, Marjan; Suffredini, Anthony F.; Sacks, David B.; Yu, Yi-Kuo

    2016-02-01

    Correct and rapid identification of microorganisms is the key to the success of many important applications in health and safety, including, but not limited to, infection treatment, food safety, and biodefense. With the advance of mass spectrometry (MS) technology, the speed of identification can be greatly improved. However, the increasing number of microbes sequenced is challenging correct microbial identification because of the large number of choices present. To properly disentangle candidate microbes, one needs to go beyond apparent morphology or simple "fingerprinting"; to correctly prioritize the candidate microbes, one needs to have accurate statistical significance in microbial identification. We meet these challenges by using peptidome profiles of microbes to better separate them and by designing an analysis method that yields accurate statistical significance. Here, we present an analysis pipeline that uses tandem MS (MS/MS) spectra for microbial identification or classification. We have demonstrated, using MS/MS data of 81 samples, each composed of a single known microorganism, that the proposed pipeline can correctly identify microorganisms at least at the genus and species levels. We have also shown that the proposed pipeline computes accurate statistical significances, i.e., E-values for identified peptides and unified E-values for identified microorganisms. The proposed analysis pipeline has been implemented in MiCId, a freely available software for Microorganism Classification and Identification. MiCId is available for download at http://www.ncbi.nlm.nih.gov/CBBresearch/Yu/downloads.html.

  18. The SPECIES and ORGANISMS Resources for Fast and Accurate Identification of Taxonomic Names in Text

    DEFF Research Database (Denmark)

    Pafilis, Evangelos; Pletscher-Frankild, Sune; Fanini, Lucia

    2013-01-01

    The exponential growth of the biomedical literature is making the need for efficient, accurate text-mining tools increasingly clear. The identification of named biological entities in text is a central and difficult task. We have developed an efficient algorithm and implementation of a dictionary......-based approach to named entity recognition, which we here use to identify names of species and other taxa in text. The tool, SPECIES, is more than an order of magnitude faster and as accurate as existing tools. The precision and recall was assessed both on an existing gold-standard corpus and on a new corpus...
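
    A dictionary-based approach to named entity recognition, as named in this abstract, can be sketched as a longest-match lookup of word windows against a table of known names. The code below is a minimal illustration and not the SPECIES implementation; the function name, the `max_words` limit, and the tiny example dictionary are assumptions of this sketch.

```python
import re


def tag_taxa(text, dictionary, max_words=3):
    """Find dictionary names in text, preferring the longest match.

    `dictionary` maps lower-cased names to identifiers. Returns a list of
    (start, end, matched_text, identifier) tuples in order of appearance.
    """
    tokens = [(m.start(), m.end()) for m in re.finditer(r"\w+", text)]
    hits = []
    i = 0
    while i < len(tokens):
        # Try the longest window first so multi-word names win.
        for n in range(min(max_words, len(tokens) - i), 0, -1):
            start, end = tokens[i][0], tokens[i + n - 1][1]
            key = " ".join(text[s:e].lower() for s, e in tokens[i:i + n])
            if key in dictionary:
                hits.append((start, end, text[start:end], dictionary[key]))
                i += n
                break
        else:
            i += 1  # no name starts at this token
    return hits


# Illustrative dictionary only; identifiers here are NCBI Taxonomy IDs.
EXAMPLE_DICTIONARY = {"escherichia coli": 562, "homo sapiens": 9606}
```

    Because every lookup is a hash-table probe, run time grows roughly linearly with text length, which is one reason dictionary methods can be fast.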

  19. A Statistical Method for Assessing Peptide Identification Confidence in Accurate Mass and Time Tag Proteomics

    Energy Technology Data Exchange (ETDEWEB)

    Stanley, Jeffrey R.; Adkins, Joshua N.; Slysz, Gordon W.; Monroe, Matthew E.; Purvine, Samuel O.; Karpievitch, Yuliya V.; Anderson, Gordon A.; Smith, Richard D.; Dabney, Alan R.

    2011-07-15

    High-throughput proteomics is rapidly evolving to require high mass measurement accuracy for a variety of different applications. Increased mass measurement accuracy in bottom-up proteomics specifically allows for an improved ability to distinguish and characterize detected MS features, which may in turn be identified by, e.g., matching to entries in a database for both precursor and fragmentation mass identification methods. Many tools exist with which to score the identification of peptides from LC-MS/MS measurements or to assess matches to an accurate mass and time (AMT) tag database, but these two calculations remain distinctly unrelated. Here we present a statistical method, Statistical Tools for AMT tag Confidence (STAC), which extends our previous work incorporating prior probabilities of correct sequence identification from LC-MS/MS, as well as the quality with which LC-MS features match AMT tags, to evaluate peptide identification confidence. Compared to existing tools, we are able to obtain significantly more high-confidence peptide identifications at a given false discovery rate and additionally assign confidence estimates to individual peptide identifications. Freely available software implementations of STAC are available in both command line and as a Windows graphical application.

  20. Identification and Quantification of Protein Glycosylation

    Directory of Open Access Journals (Sweden)

    Ziv Roth

    2012-01-01

    Glycosylation is one of the most abundant post-translational modifications of proteins, and accumulating evidence indicates that the vast majority of proteins in eukaryotes are glycosylated. Glycosylation plays a role in protein folding, interaction, stability, and mobility, as well as in signal transduction. Thus, by regulating protein activity, glycosylation is involved in the normal functioning of the cell and in the development of diseases. Indeed, in the past few decades there has been a growing realization of the importance of protein glycosylation, as aberrant glycosylation has been implicated in metabolic, neurodegenerative, and neoplastic diseases. Thus, the identification and quantification of protein-borne oligosaccharides have become increasingly important both in the basic sciences of biochemistry and glycobiology and in the applicative sciences, particularly biomedicine and biotechnology. Here, we review the state-of-the-art methodologies for the identification and quantification of oligosaccharides, specifically N- and O-glycosylated proteins.

  1. What's in a Name? The Impact of Accurate Staphylococcus pseudintermedius Identification on Appropriate Antimicrobial Susceptibility Testing.

    Science.gov (United States)

    Limbago, Brandi M

    2016-03-01

    Bacteria in the Staphylococcus intermedius group, including Staphylococcus pseudintermedius, often encode mecA-mediated methicillin resistance. Reliable detection of this phenotype for proper treatment and infection control decisions requires that these coagulase-positive staphylococci are accurately identified and specifically that they are not misidentified as S. aureus. As correct species level bacterial identification becomes more commonplace in clinical laboratories, one can expect to see changes in guidance for antimicrobial susceptibility testing and interpretation. The study by Wu et al. in this issue (M. T. Wu, C.-A. D. Burnham, L. F. Westblade, J. Dien Bard, S. D. Lawhon, M. A. Wallace, T. Stanley, E. Burd, J. Hindler, R. M. Humphries, J Clin Microbiol 54:535-542, 2016, http://dx.doi.org/10.1128/JCM.02864-15) highlights the impact of robust identification of S. intermedius group organisms on the selection of appropriate antimicrobial susceptibility testing methods and interpretation.

  2. A fluorescence-based quantitative real-time PCR assay for accurate Pocillopora damicornis species identification

    Science.gov (United States)

    Thomas, Luke; Stat, Michael; Evans, Richard D.; Kennington, W. Jason

    2016-09-01

    Pocillopora damicornis is one of the most extensively studied coral species globally, but high levels of phenotypic plasticity within the genus make species identification based on morphology alone unreliable. As a result, there is a compelling need to develop cheap and time-effective molecular techniques capable of accurately distinguishing P. damicornis from other congeneric species. Here, we develop a fluorescence-based quantitative real-time PCR (qPCR) assay to genotype a single nucleotide polymorphism that accurately distinguishes P. damicornis from other morphologically similar Pocillopora species. We trial the assay across colonies representing multiple Pocillopora species and then apply the assay to screen samples of Pocillopora spp. collected at regional scales along the coastline of Western Australia. This assay offers a cheap and time-effective alternative to Sanger sequencing and has broad applications including studies on gene flow, dispersal, recruitment and physiological thresholds of P. damicornis.

  3. Post-Electrophoretic Identification of Oxidized Proteins

    Directory of Open Access Journals (Sweden)

    Conrad Craig

    2000-01-01

    The oxidative modification of proteins has been shown to play a major role in a number of human diseases. However, the ability to identify specific proteins that are most susceptible to oxidative modifications is difficult. Separation of proteins using polyacrylamide gel electrophoresis (PAGE) offers the analytical potential for the recovery, amino acid sequencing, and identification of thousands of individual proteins from cells and tissues. We have developed a method to allow underivatized proteins to be electroblotted onto PVDF membranes before derivatization and staining. Since both the protein and oxidation proteins are quantifiable, the specific oxidation index of each protein can be determined. The optimal sequence and conditions for the staining process are (a) electrophoresis, (b) electroblotting onto PVDF membranes, (c) derivatization of carbonyls with 2,4-DNP, (d) immunostaining with anti-DNP antibody, and (e) protein staining with colloidal gold.

  4. A novel PCR-based approach for accurate identification of Vibrio parahaemolyticus

    Directory of Open Access Journals (Sweden)

    Ruichao Li

    2016-01-01

    A PCR-based assay was developed for more accurate identification of Vibrio parahaemolyticus through targeting the blaCARB-17 like element, an intrinsic β-lactamase gene that may also be regarded as a novel species-specific genetic marker of this organism. Phylogenetic analysis showed that blaCARB-17 like genes were more conservative than the tlh, toxR and atpA genes, the genetic markers commonly used as detection targets in identification of V. parahaemolyticus. Our data showed that this blaCARB-17-specific PCR-based detection approach consistently achieved 100% specificity, whereas PCR targeting the tlh, toxR and atpA genes occasionally produced false positive results. Furthermore, a positive result of this test is consistently associated with an intrinsic ampicillin resistance phenotype of the test organism, presumably conferred by the products of blaCARB-17 like genes. We envision that combined analysis of the unique genetic and phenotypic characteristics conferred by blaCARB-17 shall further enhance the detection specificity of this novel yet easy-to-use detection approach to a level superior to the conventional methods used in V. parahaemolyticus detection and identification.

  5. A Novel PCR-Based Approach for Accurate Identification of Vibrio parahaemolyticus.

    Science.gov (United States)

    Li, Ruichao; Chiou, Jiachi; Chan, Edward Wai-Chi; Chen, Sheng

    2016-01-01

    A PCR-based assay was developed for more accurate identification of Vibrio parahaemolyticus through targeting the bla CARB-17 like element, an intrinsic β-lactamase gene that may also be regarded as a novel species-specific genetic marker of this organism. Homologous analysis showed that bla CARB-17 like genes were more conservative than the tlh, toxR and atpA genes, the genetic markers commonly used as detection targets in identification of V. parahaemolyticus. Our data showed that this bla CARB-17-specific PCR-based detection approach consistently achieved 100% specificity, whereas PCR targeting the tlh and atpA genes occasionally produced false positive results. Furthermore, a positive result of this test is consistently associated with an intrinsic ampicillin resistance phenotype of the test organism, presumably conferred by the products of bla CARB-17 like genes. We envision that combined analysis of the unique genetic and phenotypic characteristics conferred by bla CARB-17 shall further enhance the detection specificity of this novel yet easy-to-use detection approach to a level superior to the conventional methods used in V. parahaemolyticus detection and identification.

  6. Considerations for accurate identification of adult Culex restuans (Diptera: Culicidae) in field studies.

    Science.gov (United States)

    Harrington, Laura C; Poulson, Rebecca L

    2008-01-01

    Understanding the ecology and behavior of different mosquito species (Diptera: Culicidae) is essential for identifying their role in disease transmission cycles and public health risk. Two species of Culex mosquitoes in the northeastern United States, Culex pipiens L. and Culex restuans Theobald, have been implicated in enzootic transmission of West Nile virus (family Flaviviridae, genus Flavivirus, WNV). Despite the difficulty of differentiating these two species as adults, many public health workers and vector biologists collecting adults in the field separate these species based on external morphology. This approach is often used rather than examination of dissected male genitalia or polymerase chain reaction (PCR)-based diagnostics due to time or cost constraints. We evaluated the reliability of seven published morphological characters to differentiate adults of these species by comparing blindly scored morphology with PCR-based confirmations. Our study demonstrates that morphological identification of Cx. pipiens is marginal and often not reliable for Cx. restuans. We also examined error rates with molecular-based approaches. DNA samples were contaminated with as little as one leg from another species. We conclude that to fully understand the respective roles of Culex species in the epidemiology of WNV and other pathogens, more attention should be paid to these considerations for accurate species identification.

  7. T2Candida Provides Rapid and Accurate Species Identification in Pediatric Cases of Candidemia.

    Science.gov (United States)

    Hamula, Camille L; Hughes, Kenneth; Fisher, Brian T; Zaoutis, Theoklis E; Singh, Ila R; Velegraki, Aristea

    2016-06-01

    The goal of this study is to assess the ability of the T2Candida platform (T2 Biosystems, Lexington, MA) to accurately identify Candida species from pediatric blood specimens with low volumes. Whole blood from 15 children with candidemia was collected immediately following blood culture draw. The amount of blood required by the system was reduced by pipetting whole blood directly onto the T2Candida cartridge. Specimens were subsequently run on the T2Dx Instrument (T2 Biosystems). The T2Candida panel provided the appropriate result for each specimen compared with blood culture-based species identification and correctly identified 15 positive and nine negative results in 3 to 5 hours. While the time to species identification for blood culture was not reported, the T2Candida results include species data. T2Candida can be used to efficiently diagnose or rule out candidemia using low-volume blood specimens from pediatric patients. This could result in improved time to appropriate antifungal therapy or reduction in unnecessary empirical antifungal therapy.

  8. Accurate identification of Culicidae at aquatic developmental stages by MALDI-TOF MS profiling.

    Science.gov (United States)

    Dieme, Constentin; Yssouf, Amina; Vega-Rúa, Anubis; Berenger, Jean-Michel; Failloux, Anna-Bella; Raoult, Didier; Parola, Philippe; Almeras, Lionel

    2014-12-02

    The identification of mosquito vectors is generally based on morphological criteria, but for aquatic stages, morphological characteristics may be missing, leading to incomplete or incorrect identification. The high cost of molecular biology techniques requires the development of an alternative strategy. In the last decade, matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) profiling has proved to be efficient for arthropod identification at the species level. To investigate the usefulness of MALDI-TOF MS for the identification of mosquitoes at aquatic stages, optimizations of sample preparation, diet, body parts and storage conditions were tested. Protein extracts of whole specimens from second larval stage to pupae were selected for the creation of a reference spectra database. The database included a total of 95 laboratory-reared specimens of 6 mosquito species, including Anopheles gambiae (S form), Anopheles coluzzi (M form), Culex pipiens pipiens, Culex pipiens molestus, Aedes aegypti and 2 colonies of Aedes albopictus. The present study revealed that whole specimens at aquatic stages produced reproducible and singular spectra according to the mosquito species. Moreover, MS protein profiles appeared weakly affected by the diet provided. Despite the low diversity of some MS profiles, notably for cryptic species, clustering analyses correctly classified all specimens tested at the species level followed by the clustering of early vs. late aquatic developmental stages. Discriminant mass peaks were recorded for the 6 mosquito species analyzed at larval stage 3 and the pupal stage. Querying against the reference spectra database of 149 new specimens at different aquatic stages from the 6 mosquito species revealed that 147 specimens were correctly identified at the species level and that early and late developmental stages were also distinguished. The present work highlights that MALDI-TOF MS profiling may be useful for the

  9. CASD-NMR 2: robust and accurate unsupervised analysis of raw NOESY spectra and protein structure determination with UNIO

    Energy Technology Data Exchange (ETDEWEB)

    Guerry, Paul; Duong, Viet Dung; Herrmann, Torsten, E-mail: torsten.herrmann@ens-lyon.fr [Université de Lyon (UMR 5280 CNRS, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1), Institut des Sciences Analytiques, Centre de RMN à très Hauts Champs (France)

    2015-08-15

    UNIO is a comprehensive software suite for protein NMR structure determination that enables full automation of all NMR data analysis steps involved—including signal identification in NMR spectra, sequence-specific backbone and side-chain resonance assignment, NOE assignment and structure calculation. Within the framework of the second round of the community-wide stringent blind NMR structure determination challenge (CASD-NMR 2), we participated in two categories of CASD-NMR 2, namely using either raw NMR spectra or unrefined NOE peak lists as input. A total of 15 resulting NMR structure bundles were submitted for 9 out of 10 blind protein targets. All submitted UNIO structures accurately coincided with the corresponding blind targets as documented by an average backbone root mean-square deviation to the reference proteins of only 1.2 Å. Also, the precision of the UNIO structure bundles was virtually identical to the ensemble of reference structures. By assessing the quality of all UNIO structures submitted to the two categories, we find throughout that only the UNIO–ATNOS/CANDID approach using raw NMR spectra consistently yielded structure bundles of high quality for direct deposition in the Protein Data Bank. In conclusion, the results obtained in CASD-NMR 2 are another vital proof for robust, accurate and unsupervised NMR data analysis by UNIO for real-world applications.

  10. An accurate and efficient identification of children with psychosocial problems by means of computerized adaptive testing

    Directory of Open Access Journals (Sweden)

    Reijneveld Symen A

    2011-08-01

    Full Text Available Abstract Background: Questionnaires used by health services to identify children with psychosocial problems are often rather short. The psychometric properties of such short questionnaires are mostly less than needed for an accurate distinction between children with and without problems. We aimed to assess whether a short Computerized Adaptive Test (CAT) can overcome the weaknesses of short written questionnaires when identifying children with psychosocial problems. Method: We used a Dutch national data set obtained from parents of children invited for a routine health examination by Preventive Child Healthcare with 205 items on behavioral and emotional problems (n = 2,041, response 84%). In a random subsample we determined which items met the requirements of an Item Response Theory (IRT) model to a sufficient degree. Using those items, item parameters necessary for a CAT were calculated and a cut-off point was defined. In the remaining subsample we determined the validity and efficiency of a Computerized Adaptive Test using simulation techniques, with current treatment status and a clinical score on the Total Problem Scale (TPS) of the Child Behavior Checklist as criteria. Results: Out of 205 items available 190 sufficiently met the criteria of the underlying IRT model. For 90% of the children a score above or below cut-off point could be determined with 95% accuracy. The mean number of items needed to achieve this was 12. Sensitivity and specificity with the TPS as a criterion were 0.89 and 0.91, respectively. Conclusion: An IRT-based CAT is a very promising option for the identification of psychosocial problems in children, as it can lead to an efficient, yet high-quality identification. The results of our simulation study need to be replicated in a real-life administration of this CAT.

  11. Identification of "Known Unknowns" Utilizing Accurate Mass Data and ChemSpider

    Science.gov (United States)

    Little, James L.; Williams, Antony J.; Pshenichnov, Alexey; Tkachenko, Valery

    2012-01-01

    In many cases, an unknown to an investigator is actually known in the chemical literature, a reference database, or an internet resource. We refer to these types of compounds as "known unknowns." ChemSpider is a very valuable internet database of known compounds useful in the identification of these types of compounds in commercial, environmental, forensic, and natural product samples. The database contains over 26 million entries from hundreds of data sources and is provided as a free resource to the community. Accurate mass mass spectrometry data is used to query the database by either elemental composition or a monoisotopic mass. Searching by elemental composition is the preferred approach. However, it is often difficult to determine a unique elemental composition for compounds with molecular weights greater than 600 Da. In these cases, searching by the monoisotopic mass is advantageous. In either case, the search results are refined by sorting the number of references associated with each compound in descending order. This raises the most useful candidates to the top of the list for further evaluation. These approaches were shown to be successful in identifying "known unknowns" noted in our laboratory and for compounds of interest to others.
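
    The monoisotopic-mass search and reference-count ranking described in this abstract can be sketched roughly as follows. This is only an illustration under stated assumptions: it does not call ChemSpider, and the `Candidate` record, the `rank_candidates` helper, the 5 ppm tolerance, and the demo entries are all hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Candidate:
    name: str
    monoisotopic_mass: float  # Da
    reference_count: int      # number of associated references


def rank_candidates(candidates, measured_mass, tolerance_ppm=5.0):
    """Keep candidates within a ppm window of the measured mass,
    then sort by reference count in descending order."""
    window = measured_mass * tolerance_ppm / 1e6
    hits = [c for c in candidates
            if abs(c.monoisotopic_mass - measured_mass) <= window]
    return sorted(hits, key=lambda c: c.reference_count, reverse=True)


# Illustrative records only; these are not real database entries.
demo = [
    Candidate("compound A", 300.1000, 12),
    Candidate("compound B", 300.1005, 450),
    Candidate("compound C", 300.2000, 9000),
]
ranked = rank_candidates(demo, measured_mass=300.1002)
print([c.name for c in ranked])
```

    Compound C has the most references but falls outside the mass window, so it is filtered out before the ranking is applied.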

  12. Identification of "Known Unknowns" Utilizing Accurate Mass Data and Chemical Abstracts Service Databases

    Science.gov (United States)

    Little, James L.; Cleven, Curtis D.; Brown, Stacy D.

    2011-02-01

    In many cases, an unknown to an investigator is actually known in the chemical literature. We refer to these types of compounds as "known unknowns." Chemical Abstracts Service (CAS) Registry is a particularly good source of these substances as it contains over 54 million entries. Accurate mass measurements can be used to query the CAS Registry by either molecular formulae or average molecular weights. Searching the database by the web-based version of SciFinder is the preferred approach when molecular formulae are available. However, if a definitive molecular formula cannot be ascertained, searching the database with STN Express by average molecular weights is a viable alternative. The results from either approach are refined by employing the number of associated references or minimal sample history as orthogonal filters. These approaches were shown to be successful in identifying "known unknowns" noted in LC-MS and even GC-MS analyses in our laboratory. In addition, they were demonstrated in the identification of a variety of compounds of interest to others.

  13. Identification of "known unknowns" utilizing accurate mass data and chemical abstracts service databases.

    Science.gov (United States)

    Little, James L; Cleven, Curtis D; Brown, Stacy D

    2011-02-01

    In many cases, an unknown to an investigator is actually known in the chemical literature. We refer to these types of compounds as "known unknowns." Chemical Abstracts Service (CAS) Registry is a particularly good source of these substances as it contains over 54 million entries. Accurate mass measurements can be used to query the CAS Registry by either molecular formulae or average molecular weights. Searching the database by the web-based version of SciFinder is the preferred approach when molecular formulae are available. However, if a definitive molecular formula cannot be ascertained, searching the database with STN Express by average molecular weights is a viable alternative. The results from either approach are refined by employing the number of associated references or minimal sample history as orthogonal filters. These approaches were shown to be successful in identifying "known unknowns" noted in LC-MS and even GC-MS analyses in our laboratory. In addition, they were demonstrated in the identification of a variety of compounds of interest to others. © American Society for Mass Spectrometry, 2011

  14. A scalable and accurate method for classifying protein-ligand binding geometries using a MapReduce approach.

    Science.gov (United States)

    Estrada, T; Zhang, B; Cicotti, P; Armen, R S; Taufer, M

    2012-07-01

    We present a scalable and accurate method for classifying protein-ligand binding geometries in molecular docking. Our method is a three-step process: the first step encodes the geometry of a three-dimensional (3D) ligand conformation into a single 3D point in the space; the second step builds an octree by assigning an octant identifier to every single point in the space under consideration; and the third step performs an octree-based clustering on the reduced conformation space and identifies the most dense octant. We adapt our method for MapReduce and implement it in Hadoop. The load-balancing, fault-tolerance, and scalability in MapReduce allow screening of very large conformation spaces not approachable with traditional clustering methods. We analyze results for docking trials for 23 protein-ligand complexes for HIV protease, 21 protein-ligand complexes for Trypsin, and 12 protein-ligand complexes for P38alpha kinase. We also analyze cross docking trials for 24 ligands, each docking into 24 protein conformations of the HIV protease, and receptor ensemble docking trials for 24 ligands, each docking in a pool of HIV protease receptors. Our method demonstrates significant improvement over energy-only scoring for the accurate identification of native ligand geometries in all these docking assessments. The advantages of our clustering approach make it attractive for complex applications in real-world drug design efforts. We demonstrate that our method is particularly useful for clustering docking results using a minimal ensemble of representative protein conformational states (receptor ensemble docking), which is now a common strategy to address protein flexibility in molecular docking.
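
    The second and third steps of the method (assigning an octant identifier to each point, then finding the most dense octant) might look like the sketch below. It assumes each ligand conformation has already been reduced to a single 3D point, since the abstract does not say how that encoding is done. The function names, the bounding box, and the fixed depth are hypothetical, and the counting step only mimics the map and reduce phases in memory rather than running on Hadoop.

```python
from collections import Counter


def octant_key(point, lo, hi, depth):
    """Return the octant path of `point` as a string of digits 0-7,
    one digit per octree level, inside the box spanned by lo and hi."""
    lo = list(lo)
    hi = list(hi)
    digits = []
    for _ in range(depth):
        index = 0
        for axis in range(3):
            mid = (lo[axis] + hi[axis]) / 2.0
            if point[axis] >= mid:
                index |= 1 << axis   # upper half along this axis
                lo[axis] = mid
            else:
                hi[axis] = mid
        digits.append(str(index))
    return "".join(digits)


def densest_octant(points, lo, hi, depth):
    """Map each point to its octant key, then reduce by counting keys."""
    counts = Counter(octant_key(p, lo, hi, depth) for p in points)
    return counts.most_common(1)[0]


# Made-up points standing in for encoded conformations.
points = [(0.1, 0.1, 0.1), (0.2, 0.2, 0.2), (0.15, 0.1, 0.2), (0.9, 0.9, 0.9)]
print(densest_octant(points, (0, 0, 0), (1, 1, 1), depth=2))
```

    Because each key depends only on its own point, the key computation parallelizes naturally, which is presumably what makes the approach a good fit for MapReduce.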

  15. PlantLoc: an accurate web server for predicting plant protein subcellular localization by substantiality motif

    OpenAIRE

    Tang, Shengnan; Li, Tonghua; Cong, Peisheng; Xiong, Wenwei; Wang, Zhiheng; Sun, Jiangming

    2013-01-01

    Knowledge of subcellular localizations (SCLs) of plant proteins relates to their functions and aids in understanding the regulation of biological processes at the cellular level. We present PlantLoc, a highly accurate and fast webserver for predicting the multi-label SCLs of plant proteins. The PlantLoc server has two innovative characters: building localization motif libraries by a recursive method without alignment and Gene Ontology information; and establishing simple architecture for rapi...

  16. Analysis of inteins in the Candida parapsilosis complex for simple and accurate species identification.

    Science.gov (United States)

    Prandini, Tâmara Heloísa Rocha; Theodoro, Raquel Cordeiro; Bruder-Nascimento, Ariane C M O; Scheel, Christina M; Bagagli, Eduardo

    2013-09-01

    Inteins are coding sequences that are transcribed and translated with flanking sequences and then are excised by an autocatalytic process. There are two types of inteins in fungi, mini-inteins and full-length inteins, both of which present a splicing domain containing well-conserved amino acid sequences. Full-length inteins also present a homing endonuclease domain that makes the intein a mobile genetic element. These parasitic genetic elements are located in highly conserved genes and may allow for the differentiation of closely related species of the Candida parapsilosis (psilosis) complex. The correct identification of the three psilosis complex species C. parapsilosis, Candida metapsilosis, and Candida orthopsilosis is very important in the clinical setting for improving antifungal therapy and patient care. In this work, we analyzed inteins that are present in the vacuolar ATPase gene VMA and in the threonyl-tRNA synthetase gene ThrRS in 85 strains of the Candida psilosis complex (46 C. parapsilosis, 17 C. metapsilosis, and 22 C. orthopsilosis). Here, we describe an accessible and accurate technique based on a single PCR that is able to differentiate the psilosis complex based on the VMA intein. Although the ThrRS intein does not distinguish the three species of the psilosis complex by PCR product size, it can differentiate them by sequencing and phylogenetic analysis. Furthermore, this intein is unusually present as both mini- and full-length forms in C. orthopsilosis. Additional population studies should be performed to address whether this represents a common intraspecific variability or the presence of subspecies within C. orthopsilosis.

  17. Calculation of accurate small angle X-ray scattering curves from coarse-grained protein models

    Directory of Open Access Journals (Sweden)

    Stovgaard Kasper

    2010-08-01

    Full Text Available Abstract Background: Genome sequencing projects have expanded the gap between the amount of known protein sequences and structures. The limitations of current high resolution structure determination methods make it unlikely that this gap will disappear in the near future. Small angle X-ray scattering (SAXS) is an established low resolution method for routinely determining the structure of proteins in solution. The purpose of this study is to develop a method for the efficient calculation of accurate SAXS curves from coarse-grained protein models. Such a method can for example be used to construct a likelihood function, which is paramount for structure determination based on statistical inference. Results: We present a method for the efficient calculation of accurate SAXS curves based on the Debye formula and a set of scattering form factors for dummy atom representations of amino acids. Such a method avoids the computationally costly iteration over all atoms. We estimated the form factors using generated data from a set of high quality protein structures. No ad hoc scaling or correction factors are applied in the calculation of the curves. Two coarse-grained representations of protein structure were investigated; two scattering bodies per amino acid led to significantly better results than a single scattering body. Conclusion: We show that the obtained point estimates allow the calculation of accurate SAXS curves from coarse-grained protein models. The resulting curves are on par with the current state-of-the-art program CRYSOL, which requires full atomic detail. Our method was also comparable to CRYSOL in recognizing native structures among native-like decoys. As a proof-of-concept, we combined the coarse-grained Debye calculation with a previously described probabilistic model of protein structure, TorusDBN. This resulted in a significant improvement in the decoy recognition performance. In conclusion, the presented method shows great promise for
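
    For orientation, the Debye formula that this abstract builds on, I(q) = sum over i and j of F_i F_j sin(q r_ij) / (q r_ij), can be written as a short sketch. The simplification here is an assumption and not the authors' method: each scattering body gets a constant form factor, whereas the paper estimates q-dependent form factors for its dummy atoms. The function name and the two-body example are hypothetical.

```python
import math


def debye_intensity(q, positions, form_factors):
    """Debye formula: I(q) = sum_i sum_j F_i F_j sin(q r_ij) / (q r_ij).

    positions    : list of (x, y, z) coordinates, one per scattering body
    form_factors : list of constant form factors (a simplifying assumption)
    """
    total = 0.0
    n = len(positions)
    for i in range(n):
        for j in range(n):
            r = math.dist(positions[i], positions[j])
            x = q * r
            # sin(x)/x tends to 1 as x tends to 0 (i == j, or q == 0).
            sinc = 1.0 if x == 0.0 else math.sin(x) / x
            total += form_factors[i] * form_factors[j] * sinc
    return total


# Two bodies 3.8 units apart; the spacing is an arbitrary example value.
bodies = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0)]
print(debye_intensity(0.0, bodies, [1.0, 2.0]))
print(debye_intensity(0.5, bodies, [1.0, 2.0]))
```

    The double loop scales with the square of the number of scattering bodies, which is why replacing all atoms with one or two bodies per amino acid reduces the cost so sharply.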

  18. Accurate Target Identification Using Multi-look Fusion of Low Quality Target Signatures

    Science.gov (United States)

    2008-12-01

    quality, which could have important consequences for practical applications. On the one hand, the emergence of sensor technologies and...identification performance and this is not adequate for many target identification applications. Furthermore, in order for the single-look procedure to...obtained with only a single sensor. However, it must be noted that the performance of correct target identification by the approach

  19. A random protein-creatinine ratio accurately predicts baseline proteinuria in early pregnancy.

    Science.gov (United States)

    Hirshberg, Adi; Draper, Jennifer; Curley, Cara; Sammel, Mary D; Schwartz, Nadav

    2014-12-01

    Data surrounding the use of a random urine protein:creatinine ratio (PCR) in the diagnosis of preeclampsia is conflicting. We sought to determine whether PCR in early pregnancy can replace the 24-hour urine collection as the primary screening test in patients at risk for baseline proteinuria. Women requiring a baseline evaluation for proteinuria supplied a urine sample the morning after their 24-hour collection. The PCR was analyzed as a predictor of significant proteinuria (≥150 mg). A regression equation to estimate the 24-hour protein value from the PCR was then developed. Sixty of 135 subjects enrolled completed the study. The median 24-hour urine protein and PCR were 90 mg (IQR: 50-145) and 0.063 (IQR: 0.039-0.083), respectively. Fifteen patients (25%) had significant proteinuria. PCR was strongly correlated with the 24-hour protein value (r = 0.99, p proteinuria (AUC = 0.86). A PCR cut-point of 0.079 yielded a sensitivity of 93.3% and a specificity of 57.8%. The resulting regression equation [total protein = 46.5 + 904.2*PCR] accurately estimates the actual 24-hour protein (95% CI: ±88 mg). A random urine PCR accurately estimates the 24-hour protein excretion in the first half of pregnancy and can be used as the primary screening test for baseline proteinuria in at-risk patients.
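
    The regression equation quoted in this abstract is simple enough to apply directly. The sketch below just wraps the reported coefficients and the reported 95% CI half-width of 88 mg; the function names are hypothetical, and this is not a clinical tool.

```python
def estimate_24h_protein(pcr):
    """Estimated 24-hour urine protein (mg) from a random urine PCR,
    using the regression reported in the abstract:
    total protein = 46.5 + 904.2 * PCR."""
    return 46.5 + 904.2 * pcr


def estimate_with_interval(pcr, half_width=88.0):
    """Return (low, estimate, high) using the reported +/-88 mg 95% CI."""
    estimate = estimate_24h_protein(pcr)
    return estimate - half_width, estimate, estimate + half_width


# Evaluate at the PCR cut-point of 0.079 mentioned in the abstract.
low, estimate, high = estimate_with_interval(0.079)
print(round(low, 1), round(estimate, 1), round(high, 1))
```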

  20. Automated protein subfamily identification and classification.

    Directory of Open Access Journals (Sweden)

    Duncan P Brown

    2007-08-01

    Full Text Available Function prediction by homology is widely used to provide preliminary functional annotations for genes for which experimental evidence of function is unavailable or limited. This approach has been shown to be prone to systematic error, including percolation of annotation errors through sequence databases. Phylogenomic analysis avoids these errors in function prediction but has been difficult to automate for high-throughput application. To address this limitation, we present a computationally efficient pipeline for phylogenomic classification of proteins. This pipeline uses the SCI-PHY (Subfamily Classification in Phylogenomics) algorithm for automatic subfamily identification, followed by subfamily hidden Markov model (HMM) construction. A simple and computationally efficient scoring scheme using family and subfamily HMMs enables classification of novel sequences to protein families and subfamilies. Sequences representing entirely novel subfamilies are differentiated from those that can be classified to subfamilies in the input training set using logistic regression. Subfamily HMM parameters are estimated using an information-sharing protocol, enabling subfamilies containing even a single sequence to benefit from conservation patterns defining the family as a whole or in related subfamilies. SCI-PHY subfamilies correspond closely to functional subtypes defined by experts and to conserved clades found by phylogenetic analysis. Extensive comparisons of subfamily and family HMM performances show that subfamily HMMs dramatically improve the separation between homologous and non-homologous proteins in sequence database searches. Subfamily HMMs also provide extremely high specificity of classification and can be used to predict entirely novel subtypes. The SCI-PHY Web server at http://phylogenomics.berkeley.edu/SCI-PHY/ allows users to upload a multiple sequence alignment for subfamily identification and subfamily HMM construction. Biologists wishing to

  1. Identification of NAD interacting residues in proteins

    Directory of Open Access Journals (Sweden)

    Raghava Gajendra PS

    2010-03-01

    Full Text Available Abstract Background: Small molecular cofactors or ligands play a crucial role in the proper functioning of cells. Accurate annotation of their target proteins and binding sites is required for the complete understanding of reaction mechanisms. Nicotinamide adenine dinucleotide (NAD+ or NAD) is one of the most commonly used organic cofactors in living cells, which plays a critical role in cellular metabolism, storage and regulatory processes. In the past, several NAD binding proteins (NADBP) have been reported in the literature, which are responsible for a wide range of activities in the cell. Attempts have been made to derive a rule for the binding of NAD+ to its target proteins. However, so far an efficient model could not be derived due to the time-consuming process of structure determination, and limitations of similarity based approaches. Thus a sequence and non-similarity based method is needed to characterize the NAD binding sites to help in the annotation. In this study attempts have been made to predict NAD binding proteins and their interacting residues (NIRs) from amino acid sequence using bioinformatics tools. Results: We extracted 1556 protein chains from 555 NAD binding proteins whose structure is available in Protein Data Bank. Then we removed all redundant protein chains and finally obtained 195 non-redundant NAD binding protein chains, where no two chains have more than 40% sequence identity. In this study all models were developed and evaluated using five-fold cross validation technique on the above dataset of 195 NAD binding proteins. While certain types of residues are preferred (e.g. Gly, Tyr, Thr, His) in NAD interaction, residues like Ala, Glu, Leu, Lys are not preferred. A support vector machine (SVM) based method has been developed using various window lengths of amino acid sequence for predicting NAD interacting residues and obtained maximum Matthews correlation coefficient (MCC) 0.47 with accuracy 74.13% at window length 17
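
    Two pieces of this abstract lend themselves to a small sketch: cutting a sequence into fixed-length windows centred on each residue, and scoring predictions with the Matthews correlation coefficient. The helper names, the `X` padding character, and the toy sequence are assumptions made for illustration; the abstract does not describe how the authors padded sequence ends or encoded windows for the SVM.

```python
import math


def residue_windows(sequence, window=17, pad="X"):
    """Return one window per residue, centred on that residue.
    `window` is assumed odd; sequence ends are padded with `pad`."""
    half = window // 2
    padded = pad * half + sequence + pad * half
    return [padded[i:i + window] for i in range(len(sequence))]


def matthews_cc(tp, tn, fp, fn):
    """MCC = (TP*TN - FP*FN) / sqrt((TP+FP)(TP+FN)(TN+FP)(TN+FN))."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return 0.0 if denom == 0 else (tp * tn - fp * fn) / denom


# Toy four-residue sequence with a short window so the output is readable.
print(residue_windows("ACDE", window=3))
print(matthews_cc(tp=5, tn=5, fp=0, fn=0))
```

    Each window would then be turned into a feature vector for the classifier, with the label taken from the central residue.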

  2. Accurate Quantitation of Dystrophin Protein in Human Skeletal Muscle Using Mass Spectrometry

    OpenAIRE

    Brown, Kristy J; Marathi, Ramya; Fiorillo, Alyson A; Ciccimaro, Eugene F.; Sharma, Seema; Rowlands, David S.; Rayavarapu, Sree; Nagaraju, Kanneboyina; Hoffman, Eric P.; Hathout, Yetrib

    2012-01-01

    Quantitation of human dystrophin protein in muscle biopsies is a clinically relevant endpoint for both diagnosis and response to dystrophin-replacement therapies for dystrophinopathies. A robust and accurate assay would enable the use of dystrophin as a surrogate biomarker, particularly in exploratory Phase 2 trials. Currently available methods to quantitate dystrophin rely on immunoblot or immunohistochemistry methods that are not considered robust. Here we present a mass spectrometry based ...

  3. Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning

    Directory of Open Access Journals (Sweden)

    Tanel Pärnamaa

    2017-05-01

    Full Text Available High-throughput microscopy of many single cells generates high-dimensional data that are far from straightforward to analyze. One important problem is automatically detecting the cellular compartment where a fluorescently-tagged protein resides, a task relatively simple for an experienced human, but difficult to automate on a computer. Here, we train an 11-layer neural network on data from mapping thousands of yeast proteins, achieving per cell localization classification accuracy of 91%, and per protein accuracy of 99% on held-out images. We confirm that low-level network features correspond to basic image characteristics, while deeper layers separate localization classes. Using this network as a feature calculator, we train standard classifiers that assign proteins to previously unseen compartments after observing only a small number of training examples. Our results are the most accurate subcellular localization classifications to date, and demonstrate the usefulness of deep learning for high-throughput microscopy.

  4. Fast and Accurate Calculation of Protein Depth by Euclidean Distance Transform

    Science.gov (United States)

    Xu, Dong; Li, Hua; Zhang, Yang

    2014-01-01

    The depth of each atom/residue in a protein structure is a key attribute that has been widely used in protein structure modeling and function annotation. However, the accurate calculation of depth is time consuming. Here, we propose to use the Euclidean distance transform (EDT) to calculate the depth, which conveniently converts the protein structure to a 3D gray-scale image with each pixel labeling the minimum distance of the pixel to the surface of the molecule (i.e. the depth). We tested the proposed EDT method on a set of 261 non-redundant protein structures. The data show that the EDT method is 2.6 times faster than the widely used method by Chakravarty and Varadarajan. The depth values from the EDT method are also highly accurate, almost identical to the depth calculated by exhaustive search (Pearson’s correlation coefficient ≈ 1). We believe the EDT-based depth calculation program can be used as an efficient tool to assist the studies of protein fold recognition and structure-based function annotation. PMID:25035865
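
    To make the quantity concrete, the sketch below computes the depth map by brute force on a tiny 2D grid: for every cell inside the shape, it takes the Euclidean distance to the nearest cell outside it. This is the exhaustive-search definition, not a linear-time distance transform, and it is not the authors' program. The grid, the function name, and the use of the nearest outside cell as a stand-in for the molecular surface are all assumptions.

```python
import math


def depth_map(inside):
    """For every cell marked True, return its Euclidean distance to the
    nearest cell marked False (brute force, for illustration only).
    Assumes the grid contains at least one False cell."""
    cells = [(r, c) for r, row in enumerate(inside) for c, _ in enumerate(row)]
    outside = [(r, c) for r, c in cells if not inside[r][c]]
    return {
        (r, c): min(math.dist((r, c), o) for o in outside)
        for r, c in cells if inside[r][c]
    }


# A 3x3 block of "molecule" cells surrounded by one layer of empty cells.
grid = [[1 <= r <= 3 and 1 <= c <= 3 for c in range(5)] for r in range(5)]
depths = depth_map(grid)
print(depths[(2, 2)], depths[(1, 1)])
```

    A proper EDT produces the same map without comparing every inside cell against every outside cell, which is where a speed advantage over exhaustive search would come from.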

  5. Neural network and SVM classifiers accurately predict lipid binding proteins, irrespective of sequence homology.

    Science.gov (United States)

    Bakhtiarizadeh, Mohammad Reza; Moradi-Shahrbabak, Mohammad; Ebrahimi, Mansour; Ebrahimie, Esmaeil

    2014-09-07

    Due to the central roles of lipid binding proteins (LBPs) in many biological processes, sequence based identification of LBPs is of great interest. The major challenge is that LBPs are diverse in sequence, structure, and function which results in low accuracy of sequence homology based methods. Therefore, there is a need for developing alternative functional prediction methods irrespective of sequence similarity. To identify LBPs from non-LBPs, the performances of support vector machine (SVM) and neural network were compared in this study. Comprehensive protein features and various techniques were employed to create datasets. Five-fold cross-validation (CV) and independent evaluation (IE) tests were used to assess the validity of the two methods. The results indicated that SVM outperforms neural network. SVM achieved 89.28% (CV) and 89.55% (IE) overall accuracy in identification of LBPs from non-LBPs and 92.06% (CV) and 92.90% (IE) (in average) for classification of different LBPs classes. Increasing the number and the range of extracted protein features as well as optimization of the SVM parameters significantly increased the efficiency of LBPs class prediction in comparison to the only previous report in this field. Altogether, the results showed that the SVM algorithm can be run on broad, computationally calculated protein features and offers a promising tool in detection of LBPs classes. The proposed approach has the potential to integrate and improve the common sequence alignment based methods.

  6. Seroprofiling at the Candida albicans protein species level unveils an accurate molecular discriminator for candidemia.

    Science.gov (United States)

    Pitarch, Aida; Nombela, César; Gil, Concha

    2016-02-16

    Serum antibodies to specific Candida proteins have been reported as potential diagnostic biomarkers for candidemia. However, their diagnostic usefulness at the protein species level has hardly been examined. Using serological proteome analysis, we explored the IgG-antibody responses to Candida albicans protein species in candidemia and control patients. We found that 87 discrete protein species derived from 34 unique proteins were IgG-targets, although only 43 of them were differentially recognized by candidemia and control sera. An increase in the speciation of the immunome, connectivity and modularity of antigenic species co-recognition networks, and heterogeneity of antigenic species recognition patterns was associated with candidemia. IgG antibodies to certain discrete protein species were better predictors of candidemia than those to their corresponding proteins. A molecular discriminator delineated from the combined fingerprints of IgG antibodies to two distinct species of phosphoglycerate kinase and enolase accurately classified candidemia and control patients. These results provide new insight into the anti-Candida IgG-antibody response development in candidemia, and demonstrate that an immunoproteomic signature at the molecular level may be useful for its diagnosis. Our study further highlights the importance of defining pathogen-specific antigens at the chemical and molecular level for their potential application as immunodiagnostic reagents or even vaccine candidates.

  7. Fast and accurate protein substructure searching with simulated annealing and GPUs

    Directory of Open Access Journals (Sweden)

    Stivala Alex D

    2010-09-01

    Full Text Available Abstract Background Searching a database of protein structures for matches to a query structure, or occurrences of a structural motif, is an important task in structural biology and bioinformatics. While there are many existing methods for structural similarity searching, faster and more accurate approaches are still required, and few current methods are capable of substructure (motif) searching. Results We developed an improved heuristic for tableau-based protein structure and substructure searching using simulated annealing that is as fast as or faster than, and comparable in accuracy to, some widely used existing methods. Furthermore, we created a parallel implementation on a modern graphics processing unit (GPU). Conclusions The GPU implementation achieves up to 34 times speedup over the CPU implementation of tableau-based structure search with simulated annealing, making it one of the fastest available methods. To the best of our knowledge, this is the first application of a GPU to the protein structural search problem.

  8. Fast and accurate multivariate Gaussian modeling of protein families: predicting residue contacts and protein-interaction partners.

    Directory of Open Access Journals (Sweden)

    Carlo Baldassi

    Full Text Available In the course of evolution, proteins show a remarkable conservation of their three-dimensional structure and their biological function, leading to strong evolutionary constraints on the sequence variability between homologous proteins. Our method aims at extracting such constraints from rapidly accumulating sequence data, and thereby at inferring protein structure and function from sequence information alone. Recently, global statistical inference methods (e.g. direct-coupling analysis, sparse inverse covariance estimation) have achieved a breakthrough towards this aim, and their predictions have been successfully implemented into tertiary and quaternary protein structure prediction methods. However, due to the discrete nature of the underlying variable (amino-acids), exact inference requires exponential time in the protein length, and efficient approximations are needed for practical applicability. Here we propose a very efficient multivariate Gaussian modeling approach as a variant of direct-coupling analysis: the discrete amino-acid variables are replaced by continuous Gaussian random variables. The resulting statistical inference problem is efficiently and exactly solvable. We show that the quality of inference is comparable or superior to the one achieved by mean-field approximations to inference with discrete variables, as done by direct-coupling analysis. This is true for (i) the prediction of residue-residue contacts in proteins, and (ii) the identification of protein-protein interaction partners in bacterial signal transduction. An implementation of our multivariate Gaussian approach is available at the website http://areeweb.polito.it/ricerca/cmp/code.

  9. Accurate refinement of docked protein complexes using evolutionary information and deep learning.

    Science.gov (United States)

    Akbal-Delibas, Bahar; Farhoodi, Roshanak; Pomplun, Marc; Haspel, Nurit

    2016-06-01

    One of the major challenges for protein docking methods is to accurately discriminate native-like structures from false positives. Docking methods are often inaccurate and the results have to be refined and re-ranked to obtain native-like complexes and remove outliers. In a previous work, we introduced AccuRefiner, a machine learning based tool for refining protein-protein complexes. Given a docked complex, the refinement tool produces a small set of refined versions of the input complex, with lower root-mean-square-deviation (RMSD) of atomic positions with respect to the native structure. The method employs a unique ranking tool that accurately predicts the RMSD of docked complexes with respect to the native structure. In this work, we use a deep learning network with a similar set of features and five layers. We show that a properly trained deep learning network can accurately predict the RMSD of a docked complex with 1.40 Å error margin on average, by approximating the complex relationship between a wide set of scoring function terms and the RMSD of a docked structure. The network was trained on 35000 unbound docking complexes generated by RosettaDock. We tested our method on 25 different putative docked complexes produced also by RosettaDock for five proteins that were not included in the training data. The results demonstrate that the high accuracy of the ranking tool enables AccuRefiner to consistently choose the refinement candidates with lower RMSD values compared to the coarsely docked input structures.
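
    The RMSD that the ranking network is trained to predict can be written out as a short function. This is a minimal sketch under assumptions the abstract does not confirm: the two structures are taken to be already superimposed with atoms listed in the same order, and the name `rmsd` is illustrative rather than part of AccuRefiner.

```python
import math


def rmsd(coords_a, coords_b):
    """Root-mean-square deviation between two equally long coordinate lists.

    Each list holds (x, y, z) tuples in the same atom order. No optimal
    superposition is performed here (assumption: inputs are pre-aligned).
    """
    if not coords_a or len(coords_a) != len(coords_b):
        raise ValueError("coordinate lists must be non-empty and equally long")
    squared = sum(
        (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
        for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b)
    )
    return math.sqrt(squared / len(coords_a))


if __name__ == "__main__":
    native = [(0.0, 0.0, 0.0), (1.0, 0.0, 0.0)]
    docked = [(0.0, 0.0, 0.0), (1.0, 0.0, 2.0)]
    print(f"RMSD = {rmsd(native, docked):.3f} Å")
```

    Against this scale, the reported average error margin of 1.40 Å is the typical gap between the network's predicted RMSD and the value such a calculation would give against the native structure.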

  10. Mass spectrometry allows direct identification of proteins in large genomes

    DEFF Research Database (Denmark)

    Küster, B; Mortensen, Peter V.; Andersen, Jens S.

    2001-01-01

    Proteome projects seek to provide systematic functional analysis of the genes uncovered by genome sequencing initiatives. Mass spectrometric protein identification is a key requirement in these studies but to date, database searching tools rely on the availability of protein sequences derived fro...... genome and allows identification, mapping, cloning and assistance in gene prediction of any protein for which minimal mass spectrometric information can be obtained. Several novel proteins from Arabidopsis thaliana and human have been discovered in this way....

  11. Using context to improve protein domain identification

    Directory of Open Access Journals (Sweden)

    Llinás Manuel

    2011-03-01

    Full Text Available Abstract Background Identifying domains in protein sequences is an important step in protein structural and functional annotation. Existing domain recognition methods typically evaluate each domain prediction independently of the rest. However, the majority of proteins are multidomain, and pairwise domain co-occurrences are highly specific and non-transitive. Results Here, we demonstrate how to exploit domain co-occurrence to boost weak domain predictions that appear in previously observed combinations, while penalizing higher confidence domains if such combinations have never been observed. Our framework, Domain Prediction Using Context (dPUC), incorporates pairwise "context" scores between domains, along with traditional domain scores and thresholds, and improves domain prediction across a variety of organisms from bacteria to protozoa and metazoa. Among the genomes we tested, dPUC is most successful at improving predictions for the poorly-annotated malaria parasite Plasmodium falciparum, for which over 38% of the genome is currently unannotated. Our approach enables high-confidence annotations in this organism and the identification of orthologs to many core machinery proteins conserved in all eukaryotes, including those involved in ribosomal assembly and other RNA processing events, which surprisingly had not been previously known. Conclusions Overall, our results demonstrate that this new context-based approach will provide significant improvements in domain and function prediction, especially for poorly understood genomes for which the need for additional annotations is greatest. Source code for the algorithm is available under a GPL open source license at http://compbio.cs.princeton.edu/dpuc/. Pre-computed results for our test organisms and a web server are also available at that location.

  12. Accurate Prediction of One-Dimensional Protein Structure Features Using SPINE-X.

    Science.gov (United States)

    Faraggi, Eshel; Kloczkowski, Andrzej

    2017-01-01

    Accurate prediction of protein secondary structure and other one-dimensional structure features is essential for accurate sequence alignment, three-dimensional structure modeling, and function prediction. SPINE-X is a software package to predict secondary structure as well as accessible surface area and dihedral angles ϕ and ψ. For secondary structure SPINE-X achieves an accuracy of between 81% and 84% depending on the dataset and choice of tests. The Pearson correlation coefficient for accessible surface area prediction is 0.75, and the mean absolute errors for the ϕ and ψ dihedral angles are 20° and 33°, respectively. The source code and Linux executables for SPINE-X are available from Research and Information Systems at http://mamiris.com.

  13. Identification of SUMO target proteins by quantitative proteomics

    DEFF Research Database (Denmark)

    Andersen, Jens S; Matic, Ivan; Vertegaal, Alfred C O

    2009-01-01

    The identification of target proteins for small ubiquitin-like modifiers (SUMOs) is a critical step towards a detailed understanding of the cellular functions of SUMOs. Substrate protein identification for SUMOs is hampered by the low abundance of SUMO targets, the finding that only a small fract...

  14. Identification of low molecular weight proteins isolated by 2-D liquid separations.

    Science.gov (United States)

    Zhu, Kan; Miller, Fred R; Barder, Timothy J; Lubman, David M

    2004-07-01

    Proteins with molecular mass (M(r)) <20 kDa are often poorly separated in 2-D sodium dodecyl sulfate polyacrylamide gel electrophoresis. In addition, low-M(r) proteins may not be readily identified using peptide mass fingerprinting (PMF) owing to the small number of peptides generated in tryptic digestion. In this work, we used a 2-D liquid separation method based on chromatofocusing and non-porous silica reversed-phase high-performance liquid chromatography to purify proteins for matrix-assisted laser desorption/ionization time-of-flight mass spectrometric (MALDI-TOFMS) analysis and protein identification. Several proteins were identified using the PMF method where the result was supported using an accurate M(r) value obtained from electrospray ionization TOFMS. However, many proteins were not identified owing to an insufficient number of peptides observed in the MALDI-TOF experiments. The small number of peptides detected in MALDI-TOFMS can result from internal fragmentation, the few arginines in its sequence and incomplete tryptic digestion. MALDI-QTOFMS/MS can be used to identify many of these proteins. The accurate experimental M(r) and pI confirm identification and aid in identifying post-translational modifications such as truncations and acetylations. In some cases, high-quality MS/MS data obtained from the MALDI-QTOF spectrometer overcome preferential cleavages and result in protein identification.
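
    The point that small proteins yield too few tryptic peptides for peptide mass fingerprinting can be made concrete with an in silico digest. The sketch below uses the commonly cited trypsin rule (cleave after K or R unless the next residue is P); the example sequence, the `min_length` cutoff, and the function name are hypothetical, and missed cleavages are ignored.

```python
def tryptic_peptides(sequence, min_length=1):
    """Split a protein sequence at trypsin sites and return the peptides.

    Cleavage is placed C-terminal to K or R, except when followed by P.
    Peptides shorter than min_length are dropped, mimicking fragments
    too small to be informative in a mass fingerprint (an assumption).
    """
    peptides = []
    start = 0
    for i, residue in enumerate(sequence):
        next_residue = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and next_residue != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])  # C-terminal remainder
    return [p for p in peptides if len(p) >= min_length]


if __name__ == "__main__":
    toy_sequence = "MKRPAGKLR"  # hypothetical sequence, not from the study
    print(tryptic_peptides(toy_sequence))
    print(tryptic_peptides(toy_sequence, min_length=5))
```

    With a length cutoff, the toy sequence leaves a single usable peptide, which is the situation in which the abstract turns to MS/MS data and accurate intact-mass measurements.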

  15. Unifying protein inference and peptide identification with feedback to update consistency between peptides.

    Science.gov (United States)

    Shi, Jinhong; Chen, Bolin; Wu, Fang-Xiang

    2013-01-01

    We first propose a new method to process peptide identification reports from database search engines. Building on it, we then develop a method for unifying protein inference and peptide identification by adding feedback from protein inference to peptide identification. The feedback information is a list of high-confidence proteins, which is used to update an adjacency matrix between peptides. The adjacency matrix is used in the regularization of peptide scores. Logistic regression (LR) is used to compute the probability of peptide identification with the regularized scores. Protein scores are then calculated with the LR probability of peptides. Instead of selecting the best peptide match for each MS/MS spectrum, we select multiple peptides. Tests on two datasets show that the proposed method can robustly assign accurate probabilities to peptides and has higher discrimination power than PeptideProphet in distinguishing correctly from incorrectly identified peptides. Additionally, our method not only infers more true positive proteins but also fewer false positive proteins than ProteinProphet at the same false positive rate. The coverage of inferred proteins is also significantly increased due to the selection of multiple peptides for each MS/MS spectrum and the improvement of their scores by the feedback from the inferred proteins.

  16. Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model.

    Science.gov (United States)

    Wang, Sheng; Sun, Siqi; Li, Zhen; Zhang, Renyu; Xu, Jinbo

    2017-01-01

    Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction. This paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. By using very deep residual networks, we can accurately model contact occurrence patterns and complex sequence-structure relationship and thus, obtain higher-quality contact prediction regardless of how many sequence homologs are available for proteins in question. Our method greatly outperforms existing methods and leads to much more accurate contact-assisted folding. Tested on 105 CASP11 targets, 76 past CAMEO hard targets, and 398 membrane proteins, the average top L long-range prediction accuracy obtained by our method, one representative EC method CCMpred and the CASP11 winner MetaPSICOV is 0.47, 0.21 and 0.30, respectively; the average top L/10 long-range accuracy of our method, CCMpred and MetaPSICOV is 0.77, 0.47 and 0.59, respectively. Ab initio folding using our predicted contacts as restraints but without any force fields can yield correct folds (i.e., TMscore>0.6) for 203 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. 
Our contact-assisted models also have
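
    The top L and top L/10 long-range accuracies quoted in this abstract can be computed as sketched below. The abstract does not spell out the definition, so two conventions are assumed here: a pair counts as long-range when the residues are at least 24 positions apart in sequence, and accuracy is the fraction of the selected top-scoring pairs that are true contacts. The function name and input format are illustrative.

```python
def top_accuracy(scores, true_contacts, length, k=1, min_sep=24):
    """Accuracy of the top L/k predicted long-range contacts.

    scores        -- {(i, j): predicted contact probability}, with i < j
    true_contacts -- set of (i, j) pairs that are contacts in the native structure
    length        -- protein length L
    k             -- 1 for top L, 10 for top L/10, and so on
    min_sep       -- minimum sequence separation for "long-range" (assumed 24)
    """
    long_range = [
        (score, pair) for pair, score in scores.items()
        if pair[1] - pair[0] >= min_sep
    ]
    long_range.sort(reverse=True)  # highest-scoring pairs first
    top = long_range[:max(1, length // k)]
    if not top:
        return 0.0
    hits = sum(1 for _, pair in top if pair in true_contacts)
    return hits / len(top)


if __name__ == "__main__":
    predicted = {
        (0, 30): 0.9, (1, 40): 0.8, (2, 45): 0.7, (3, 30): 0.6,
        (4, 49): 0.5, (5, 35): 0.4, (10, 12): 0.99,  # last pair is short-range
    }
    native = {(0, 30), (2, 45), (5, 35), (10, 12)}
    print("top L/10:", top_accuracy(predicted, native, length=50, k=10))
    print("top L:   ", top_accuracy(predicted, native, length=50, k=1))
```

    The short-range pair is ignored despite its high score, which is why long-range accuracy is a harder and more structure-relevant measure than overall contact accuracy.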

  17. Accurate De Novo Prediction of Protein Contact Map by Ultra-Deep Learning Model

    Science.gov (United States)

    Li, Zhen; Zhang, Renyu

    2017-01-01

    Motivation Protein contacts contain key information for the understanding of protein structure and function and thus, contact prediction from sequence is an important problem. Recently exciting progress has been made on this problem, but the predicted contacts for proteins without many sequence homologs is still of low quality and not very useful for de novo structure prediction. Method This paper presents a new deep learning method that predicts contacts by integrating both evolutionary coupling (EC) and sequence conservation information through an ultra-deep neural network formed by two deep residual neural networks. The first residual network conducts a series of 1-dimensional convolutional transformation of sequential features; the second residual network conducts a series of 2-dimensional convolutional transformation of pairwise information including output of the first residual network, EC information and pairwise potential. By using very deep residual networks, we can accurately model contact occurrence patterns and complex sequence-structure relationship and thus, obtain higher-quality contact prediction regardless of how many sequence homologs are available for proteins in question. Results Our method greatly outperforms existing methods and leads to much more accurate contact-assisted folding. Tested on 105 CASP11 targets, 76 past CAMEO hard targets, and 398 membrane proteins, the average top L long-range prediction accuracy obtained by our method, one representative EC method CCMpred and the CASP11 winner MetaPSICOV is 0.47, 0.21 and 0.30, respectively; the average top L/10 long-range accuracy of our method, CCMpred and MetaPSICOV is 0.77, 0.47 and 0.59, respectively. Ab initio folding using our predicted contacts as restraints but without any force fields can yield correct folds (i.e., TMscore>0.6) for 203 of the 579 test proteins, while that using MetaPSICOV- and CCMpred-predicted contacts can do so for only 79 and 62 of them, respectively. 
Our contact

  18. Phage display library screening for identification of interacting protein partners.

    Science.gov (United States)

    Addepalli, Balasubrahmanyam; Rao, Suryadevara; Hunt, Arthur G

    2015-01-01

    Phage display is a versatile high-throughput screening method employed to understand and improve the chemical biology, be it production of human monoclonal antibodies or identification of interacting protein partners. A majority of cell proteins operate in a concerted fashion either by stable or transient interactions. Such interactions can be mediated by recognition of small amino acid sequence motifs on the protein surface. Phage display can play a crucial role in identification of such motifs. This report describes the use of phage display for the identification of high affinity sequence motifs that could be responsible for interactions with a target (bait) protein.

  19. Rapid and accurate prediction and scoring of water molecules in protein binding sites.

    Directory of Open Access Journals (Sweden)

    Gregory A Ross

    Full Text Available Water plays a critical role in ligand-protein interactions. However, it is still challenging to predict accurately not only where water molecules prefer to bind, but also which of those water molecules might be displaceable. The latter is often seen as a route to optimizing affinity of potential drug candidates. Using a protocol we call WaterDock, we show that the freely available AutoDock Vina tool can be used to predict accurately the binding sites of water molecules. WaterDock was validated using data from X-ray crystallography, neutron diffraction and molecular dynamics simulations and correctly predicted 97% of the water molecules in the test set. In addition, we combined data-mining, heuristic and machine learning techniques to develop probabilistic water molecule classifiers. When applied to WaterDock predictions in the Astex Diverse Set of protein ligand complexes, we could identify whether a water molecule was conserved or displaced to an accuracy of 75%. A second model predicted whether water molecules were displaced by polar groups or by non-polar groups to an accuracy of 80%. These results should prove useful for anyone wishing to undertake rational design of new compounds where the displacement of water molecules is being considered as a route to improved affinity.

  20. Rapid identification of DNA-binding proteins by mass spectrometry

    DEFF Research Database (Denmark)

    Nordhoff, E; Krogsdam, A M; Jorgensen, H F

    1999-01-01

    We report a protocol for the rapid identification of DNA-binding proteins. Immobilized DNA probes harboring a specific sequence motif are incubated with cell or nuclear extract. Proteins are analyzed directly off the solid support by matrix-assisted laser desorption/ionization time-of-flight mass...... was validated by the identification of known prokaryotic and eukaryotic DNA-binding proteins, and its use provided evidence that poly(ADP-ribose) polymerase exhibits DNA sequence-specific binding to DNA....

  1. HMM-FRAME: accurate protein domain classification for metagenomic sequences containing frameshift errors

    Directory of Open Access Journals (Sweden)

    Sun Yanni

    2011-05-01

    Full Text Available Abstract Background Protein domain classification is an important step in metagenomic annotation. The state-of-the-art method for protein domain classification is profile HMM-based alignment. However, the relatively high rates of insertions and deletions in homopolymer regions of pyrosequencing reads create frameshifts, causing conventional profile HMM alignment tools to generate alignments with marginal scores. This makes error-containing gene fragments unclassifiable with conventional tools. Thus, there is a need for an accurate domain classification tool that can detect and correct sequencing errors. Results We introduce HMM-FRAME, a protein domain classification tool based on an augmented Viterbi algorithm that can incorporate error models from different sequencing platforms. HMM-FRAME corrects sequencing errors and classifies putative gene fragments into domain families. It achieved high error detection sensitivity and specificity in a data set with annotated errors. We applied HMM-FRAME in Targeted Metagenomics and a published metagenomic data set. The results showed that our tool can correct frameshifts in error-containing sequences, generate much longer alignments with significantly smaller E-values, and classify more sequences into their native families. Conclusions HMM-FRAME provides a complementary protein domain classification tool to conventional profile HMM-based methods for data sets containing frameshifts. Its current implementation is best used for small-scale metagenomic data sets. The source code of HMM-FRAME can be downloaded at http://www.cse.msu.edu/~zhangy72/hmmframe/ and at https://sourceforge.net/projects/hmm-frame/.

  2. Serum protein profile at remission can accurately assess therapeutic outcomes and survival for serous ovarian cancer.

    Directory of Open Access Journals (Sweden)

    Jinhua Wang

    Full Text Available BACKGROUND: Biomarkers play critical roles in early detection, diagnosis and monitoring of therapeutic outcome and recurrence of cancer. Previous biomarker research on ovarian cancer (OC) has mostly focused on the discovery and validation of diagnostic biomarkers. The primary purpose of this study is to identify serum biomarkers for prognosis and therapeutic outcomes of ovarian cancer. EXPERIMENTAL DESIGN: Forty serum proteins were analyzed in 70 serum samples from healthy controls (HC) and 101 serum samples from serous OC patients at three different disease phases: post diagnosis (PD), remission (RM) and recurrence (RC). The utility of serum proteins as OC biomarkers was evaluated using a variety of statistical methods including survival analysis. RESULTS: Ten serum proteins (PDGF-AB/BB, PDGF-AA, CRP, sFas, CA125, SAA, sTNFRII, sIL-6R, IGFBP6 and MDC) have individually good area-under-the-curve (AUC) values (AUC = 0.69-0.86) and more than 10 three-marker combinations have excellent AUC values (0.91-0.93) in distinguishing active cancer samples (PD & RC) from HC. The mean serum protein levels for RM samples are usually intermediate between HC and OC patients with active cancer (PD & RC). Most importantly, five proteins (sICAM1, RANTES, sgp130, sTNFR-II and sVCAM1) measured at remission can classify, individually and in combination, serous OC patients into two subsets with significantly different overall survival (best HR = 17, p < 10^-3). CONCLUSION: We identified five serum proteins which, when measured at remission, can accurately predict the overall survival of serous OC patients, suggesting that they may be useful for monitoring the therapeutic outcomes for ovarian cancer.
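
    The AUC values reported for individual serum proteins can be read as the probability that a randomly chosen cancer sample has a higher marker level than a randomly chosen healthy control. The sketch below computes AUC in exactly that pairwise form; the abstract does not state how the authors calculated it, and the marker levels shown are made up for illustration.

```python
def auc(positive_scores, negative_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the fraction of (positive, negative) pairs in which the
    positive sample scores higher; ties contribute one half.
    """
    if not positive_scores or not negative_scores:
        raise ValueError("both groups need at least one sample")
    wins = 0.0
    for p in positive_scores:
        for n in negative_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positive_scores) * len(negative_scores))


if __name__ == "__main__":
    cancer_levels = [3.0, 4.0, 5.0]   # hypothetical marker levels
    control_levels = [1.0, 2.0, 3.0]  # hypothetical marker levels
    print(f"AUC = {auc(cancer_levels, control_levels):.3f}")
```

    An AUC of 0.5 corresponds to a marker with no discriminating power, so the reported single-marker range of 0.69-0.86 and the three-marker range of 0.91-0.93 indicate moderate and strong separation respectively.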

  3. Application of the antibiotic batumin for accurate and rapid identification of staphylococcal small colony variants

    Directory of Open Access Journals (Sweden)

    Churkina Larisa N

    2012-07-01

    Full Text Available Abstract Background Staphylococcus aureus is a major human pathogen causing significant morbidity and mortality. The S. aureus colonies in osteomyelitis, in patients with cystic fibrosis and patients with endoprosthesis rejection frequently have an atypical morphology, i.e. staphylococcal small-colony variants, which form a naturally occurring subpopulation of clinically important staphylococci. Identification of these small colony variants is difficult, because of the loss of typical phenotypic characteristics of these variants. We wanted to improve and simplify the diagnosis of staphylococcal infection using a diagnostic preparation, consisting of 5 μg batumin paper disks. Batumin possesses a unique selective activity against all studied Staphylococcus spp., whereas all other species tested thus far are batumin resistant. We assessed the efficacy of the batumin diagnostic preparation to identify staphylococcal small colony variants, isolated from osteomyelitis patients. Findings With the batumin diagnostic preparation, all 30 tested staphylococcal small-colony variants had a growth inhibition zone around the disk of minimum 25 mm, accordant with the inhibition zones of the parent strains, isolated from the same patients. Conclusions The batumin diagnostic preparation correctly identified the small-colony variants of S. aureus, S. haemolyticus and S. epidermidis as belonging to the genus Staphylococcus, which differ profoundly from parental strains and are difficult to identify with standard methods. Identification of staphylococcal small-colony variants with the batumin diagnostic preparation is technically simple and can facilitate practical laboratory work.

  4. Retrieval experience as an accurate indicator of person identification in line-ups

    Directory of Open Access Journals (Sweden)

    María José Contreras

    2011-07-01

    Full Text Available Responses in eyewitness identification of a person in a line-up may be based on two types of retrieval experience: remember and know experiences. Remember responses treat eyewitness identification of the target person as an episodic memory task, because they involve retrieving information about the target person in the place and at the time of the event. Know responses, in contrast, engage recognition based on familiarity or perceptual facilitation, that is, as a semantic memory task. To explore the relation between retrieval experiences and recognition accuracy, 86 participants took part in a recognition task with two conditions: one with an interpolated target-absent line-up and the other with only the target-present line-up. Accuracy of recognition and retrieval experience were measured. The results showed that having previously participated in a target-absent line-up increased omissions, while the number of hits decreased. Furthermore, participants' know responses were associated with false recognition, whilst remember responses were associated with hits in recognition. Thus, asking eyewitnesses to report the kind of retrieval experience on which they based their recognition responses may serve as a reliable indicator of recognition accuracy. Future studies are needed to investigate whether this is also the case in natural settings.

  5. Poisonous or non-poisonous plants? DNA-based tools and applications for accurate identification.

    Science.gov (United States)

    Mezzasalma, Valerio; Ganopoulos, Ioannis; Galimberti, Andrea; Cornara, Laura; Ferri, Emanuele; Labra, Massimo

    2017-01-01

    Plant exposures are among the most frequently reported cases to poison control centres worldwide. This is a growing condition due to recent societal trends oriented towards the consumption of wild plants as food, cosmetics, or medicine. At least three general causes of plant poisoning can be identified: plant misidentification, introduction of new plant-based supplements and medicines with no controls about their safety, and the lack of regulation for the trading of herbal and phytochemical products. Moreover, an efficient screening for the occurrence of plants poisonous to humans is also desirable at the different stages of the food supply chain: from the raw material to the final transformed product. A rapid diagnosis of intoxication cases is necessary in order to provide the most reliable treatment. However, a precise taxonomic characterization of the ingested species is often challenging. In this review, we provide an overview of the emerging DNA-based tools and technologies to address the issue of poisonous plant identification. Specifically, classic DNA barcoding and its applications using High Resolution Melting (Bar-HRM) ensure high universality and rapid response respectively, whereas High Throughput Sequencing techniques (HTS) provide a complete characterization of plant residues in complex matrices. The pros and cons of each approach have been evaluated with the final aim of proposing a general user's guide to molecular identification directed to different stakeholder categories interested in the diagnostics of poisonous plants.

  6. DeepBound: accurate identification of transcript boundaries via deep convolutional neural fields

    KAUST Repository

    Shao, Mingfu

    2017-04-20

    Motivation: Reconstructing the full-length expressed transcripts (a.k.a. the transcript assembly problem) from the short sequencing reads produced by the RNA-seq protocol plays a central role in identifying novel genes and transcripts as well as in studying gene expression and gene function. A crucial step in transcript assembly is to accurately determine the splicing junctions and boundaries of the expressed transcripts from the read alignments. In contrast to the splicing junctions, which can be efficiently detected from spliced reads, the problem of identifying boundaries remains open and challenging, because the signal related to boundaries is noisy and weak.

  7. Modified AutoDock for accurate docking of protein kinase inhibitors.

    Science.gov (United States)

    Buzko, Oleksandr V; Bishop, Anthony C; Shokat, Kevan M

    2002-02-01

    Protein kinases are an important class of enzymes controlling virtually all cellular signaling pathways. Consequently, selective inhibitors of protein kinases have attracted significant interest as potential new drugs for many diseases. Computational methods, including molecular docking, have increasingly been used in the inhibitor design process [1]. We have considered several docking packages in order to strengthen our kinase inhibitor work with computational capabilities. In our experience, AutoDock offered a reasonable combination of accuracy and speed, as opposed to methods that specialize either in fast database searches or detailed and computationally intensive calculations. However, AutoDock did not perform well in cases where extensive hydrophobic contacts were involved, such as docking of SB203580 to its target protein kinase p38. Another shortcoming was a hydrogen bonding energy function, which underestimated the attraction component and, thus, did not allow for sufficiently accurate modeling of the key hydrogen bonds in the kinase-inhibitor complexes. We have modified the parameter set used to model hydrogen bonds, which increased the accuracy of AutoDock and appeared to be generally applicable to many kinase-inhibitor pairs without customization. Binding to largely hydrophobic sites, such as the active site of p38, was significantly improved by introducing a correction factor selectively affecting only carbon and hydrogen energy grids, thus, providing an effective, although approximate, treatment of solvation.

  8. Innovative Flow Cytometry Allows Accurate Identification of Rare Circulating Cells Involved in Endothelial Dysfunction

    Science.gov (United States)

    Boraldi, Federica; Bartolomeo, Angelica; De Biasi, Sara; Orlando, Stefania; Costa, Sonia; Cossarizza, Andrea; Quaglino, Daniela

    2016-01-01

    Introduction Although rare, circulating endothelial and progenitor cells could be considered as markers of endothelial damage and repair potential, possibly predicting the severity of cardiovascular manifestations. A number of studies highlighted the role of these cells in age-related diseases, including those characterized by ectopic calcification. Nevertheless, their use in clinical practice is still controversial, mainly due to difficulties in finding reproducible and accurate methods for their determination. Methods Circulating mature cells (CMC, CD45-, CD34+, CD133-) and circulating progenitor cells (CPC, CD45dim, CD34bright, CD133+) were investigated by polychromatic high-speed flow cytometry to detect the expression of endothelial (CD309+) or osteogenic (BAP+) differentiation markers in healthy subjects and in patients affected by peripheral vascular manifestations associated with ectopic calcification. Results This study shows that: 1) polychromatic flow cytometry represents a valuable tool to accurately identify rare cells; 2) the balance of CD309+ on CMC/CD309+ on CPC is altered in patients affected by peripheral vascular manifestations, suggesting the occurrence of vascular damage and low repair potential; 3) the increase of circulating cells exhibiting a shift towards an osteoblast-like phenotype (BAP+) is observed in the presence of ectopic calcification. Conclusion Differences between healthy subjects and patients with ectopic calcification indicate that this approach may be useful to better evaluate endothelial dysfunction in a clinical context. PMID:27560136

  9. Mitotic Protein CSPP1 Interacts with CENP-H Protein to Coordinate Accurate Chromosome Oscillation in Mitosis.

    Science.gov (United States)

    Zhu, Lijuan; Wang, Zhikai; Wang, Wenwen; Wang, Chunli; Hua, Shasha; Su, Zeqi; Brako, Larry; Garcia-Barrio, Minerva; Ye, Mingliang; Wei, Xuan; Zou, Hanfa; Ding, Xia; Liu, Lifang; Liu, Xing; Yao, Xuebiao

    2015-11-06

    Mitotic chromosome segregation is orchestrated by the dynamic interaction of spindle microtubules with the kinetochores. During chromosome alignment, kinetochore-bound microtubules undergo dynamic cycles between growth and shrinkage, leading to an oscillatory movement of chromosomes along the spindle axis. Although kinetochore protein CENP-H serves as a molecular control of kinetochore-microtubule dynamics, the mechanistic link between CENP-H and kinetochore microtubules (kMT) has remained less characterized. Here, we show that CSPP1 is a kinetochore protein essential for accurate chromosome movements in mitosis. CSPP1 binds to CENP-H in vitro and in vivo. Suppression of CSPP1 perturbs proper mitotic progression and compromises the satisfaction of spindle assembly checkpoint. In addition, chromosome oscillation is greatly attenuated in CSPP1-depleted cells, similar to what was observed in the CENP-H-depleted cells. Importantly, CSPP1 depletion enhances velocity of kinetochore movement, and overexpression of CSPP1 decreases the speed, suggesting that CSPP1 promotes kMT stability during cell division. Specific perturbation of CENP-H/CSPP1 interaction using a membrane-permeable competing peptide resulted in a transient mitotic arrest and chromosome segregation defect. Based on these findings, we propose that CSPP1 cooperates with CENP-H on kinetochores to serve as a novel regulator of kMT dynamics for accurate chromosome segregation.

  10. A transition radiation detector for RHIC featuring accurate tracking and dE/dx particle identification

    Energy Technology Data Exchange (ETDEWEB)

    O'Brien, E.; Lissauer, D.; McCorkle, S.; Polychronakos, V.; Takai, H. [Brookhaven National Lab., Upton, NY (United States); Chi, C.Y.; Nagamiya, S.; Sippach, W.; Toy, M.; Wang, D.; Wang, Y.F.; Wiggins, C.; Willis, W. [Columbia Univ., New York, NY (United States); Cherniatin, V.; Dolgoshein, B. [Moscow Institute of Physics and Engineering, (Russian Federation); Bennett, M.; Chikanian, A.; Kumar, S.; Mitchell, J.T.; Pope, K. [Yale Univ., New Haven, CT (United States)

    1991-12-31

    We describe the results of a test run involving a Transition Radiation Detector that can both distinguish electrons from pions with momenta greater than 0.7 GeV/c and simultaneously track particles passing through the detector. The particle identification is accomplished through a combination of the detection of Transition Radiation from the electron and the differences in electron and pion energy loss (dE/dx) in the detector. The dE/dx particle separation is most efficient below 2 GeV/c while particle ID utilizing Transition Radiation is effective above 1.5 GeV/c. Combined, the electron-pion separation is better than 5 × 10^2. The single-wire, track-position resolution for the TRD is approximately 230 μm.

  11. A transition radiation detector which features accurate tracking and dE/dx particle identification

    Energy Technology Data Exchange (ETDEWEB)

    O'Brien, E.; Lissauer, D.; McCorkle, S.; Polychronakos, V.; Takai, H. [Brookhaven National Lab., Upton, NY (United States); Chi, C.Y.; Nagamiya, S.; Sippach, W.; Toy, M.; Wang, D.; Wang, Y.F.; Wiggins, C.; Willis, W. [Columbia Univ., New York, NY (United States); Cherniatin, V.; Dolgoshein, B. [Moscow Inst. of Physics and Engineering, Moscow (Russian Federation); Bennett, M.; Chikanian, A.; Kumar, S.; Mitchell, J.T.; Pope, K. [Yale Univ., New Haven, CT (United States)

    1991-12-31

    We describe the results of a test run involving a Transition Radiation Detector that can both distinguish electrons from pions with momenta greater than 0.7 GeV/c and simultaneously track particles passing through the detector. The particle identification is accomplished through a combination of the detection of Transition Radiation from the electron and the differences in electron and pion energy loss (dE/dx) in the detector. The dE/dx particle separation is most efficient below 2 GeV/c while particle ID utilizing Transition Radiation is effective above 1.5 GeV/c. Combined, the electron-pion separation is better than 5 × 10^2. The single-wire, track-position resolution for the TRD is approximately 230 μm.

  12. A transition radiation detector which features accurate tracking and dE/dx particle identification

    Energy Technology Data Exchange (ETDEWEB)

    O'Brien, E.; Lissauer, D.; McCorkle, S.; Polychronakos, V.; Takai, H. (Brookhaven National Lab., Upton, NY (United States)); Chi, C.Y.; Nagamiya, S.; Sippach, W.; Toy, M.; Wang, D.; Wang, Y.F.; Wiggins, C.; Willis, W. (Columbia Univ., New York, NY (United States)); Cherniatin, V.; Dolgoshein, B. (Moscow Inst. of Physics and Engineering (Russian Federation)); Bennett, M.; Chikanian, A.; Kumar, S.; Mitchell, J.T.; Pope, K. (Yale Univ., New Haven, CT (United States))

    1993-04-01

    The authors describe the results of a test run involving a Transition Radiation Detector that can both distinguish electrons from pions with momenta greater than 0.7 GeV/c and simultaneously track particles passing through the detector. The particle identification is accomplished through a combination of the detection of Transition Radiation from the electron and the differences in electron and pion energy loss (dE/dx) in the detector. The dE/dx particle separation is most efficient below 2 GeV/c while particle ID utilizing Transition Radiation is effective above 1.5 GeV/c. Combined, the electron-pion separation is better than 5 × 10^2. The single-wire, track-position resolution for the TRD is approximately [...] μm.

  13. Automated selected reaction monitoring software for accurate label-free protein quantification.

    Science.gov (United States)

    Teleman, Johan; Karlsson, Christofer; Waldemarson, Sofia; Hansson, Karin; James, Peter; Malmström, Johan; Levander, Fredrik

    2012-07-06

    Selected reaction monitoring (SRM) is a mass spectrometry method with documented ability to quantify proteins accurately and reproducibly using labeled reference peptides. However, the use of labeled reference peptides becomes impractical if large numbers of peptides are targeted and when high flexibility is desired when selecting peptides. We have developed a label-free quantitative SRM workflow that relies on a new automated algorithm, Anubis, for accurate peak detection. Anubis efficiently removes interfering signals from contaminating peptides to estimate the true signal of the targeted peptides. We evaluated the algorithm on a published multisite data set and achieved results in line with manual data analysis. In complex peptide mixtures from whole proteome digests of Streptococcus pyogenes we achieved a technical variability across the entire proteome abundance range of 6.5-19.2%, which was considerably below the total variation across biological samples. Our results show that the label-free SRM workflow with automated data analysis is feasible for large-scale biological studies, opening up new possibilities for quantitative proteomics and systems biology.

  14. Electrostatics of proteins in dielectric solvent continua. I. An accurate and efficient reaction field description.

    Science.gov (United States)

    Bauer, Sebastian; Mathias, Gerald; Tavan, Paul

    2014-03-14

    We present a reaction field (RF) method which accurately solves the Poisson equation for proteins embedded in dielectric solvent continua at a computational effort comparable to that of an electrostatics calculation with polarizable molecular mechanics (MM) force fields. The method combines an approach originally suggested by Egwolf and Tavan [J. Chem. Phys. 118, 2039 (2003)] with concepts generalizing the Born solution [Z. Phys. 1, 45 (1920)] for a solvated ion. First, we derive an exact representation according to which the sources of the RF potential and energy are inducible atomic anti-polarization densities and atomic shielding charge distributions. Modeling these atomic densities by Gaussians leads to an approximate representation. Here, the strengths of the Gaussian shielding charge distributions are directly given in terms of the static partial charges as defined, e.g., by standard MM force fields for the various atom types, whereas the strengths of the Gaussian anti-polarization densities are calculated by a self-consistency iteration. The atomic volumes are also described by Gaussians. To account for covalently overlapping atoms, their effective volumes are calculated by another self-consistency procedure, which guarantees that the dielectric function ε(r) is close to one everywhere inside the protein. The Gaussian widths σ(i) of the atoms i are parameters of the RF approximation. The remarkable accuracy of the method is demonstrated by comparison with Kirkwood's analytical solution for a spherical protein [J. Chem. Phys. 2, 351 (1934)] and with computationally expensive grid-based numerical solutions for simple model systems in dielectric continua including a di-peptide (Ac-Ala-NHMe) as modeled by a standard MM force field. The latter example shows how weakly the RF conformational free energy landscape depends on the parameters σ(i). A summarizing discussion highlights the achievements of the new theory and of its approximate solution particularly by
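    For reference, the Born solution cited in this abstract gives the electrostatic solvation free energy of a single spherical ion in a dielectric continuum. In SI units it is commonly written as below. The symbols are chosen here for illustration and are not the paper's notation: q is the ion charge, a its radius, ε_0 the vacuum permittivity, and ε_s the relative permittivity of the solvent.

```latex
\Delta G_{\mathrm{Born}}
  = -\,\frac{q^{2}}{8\pi\varepsilon_{0}\,a}
    \left(1 - \frac{1}{\varepsilon_{s}}\right)
```

    The expression vanishes for ε_s = 1 (no solvent polarization) and approaches -q²/(8πε_0 a) as ε_s becomes large.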

  15. Accurate Identification of Fatty Liver Disease in Data Warehouse Utilizing Natural Language Processing.

    Science.gov (United States)

    Redman, Joseph S; Natarajan, Yamini; Hou, Jason K; Wang, Jingqi; Hanif, Muzammil; Feng, Hua; Kramer, Jennifer R; Desiderio, Roxanne; Xu, Hua; El-Serag, Hashem B; Kanwal, Fasiha

    2017-08-31

    Natural language processing is a powerful technique of machine learning capable of maximizing data extraction from complex electronic medical records. We utilized this technique to develop algorithms capable of "reading" full-text radiology reports to accurately identify the presence of fatty liver disease. Abdominal ultrasound, computerized tomography, and magnetic resonance imaging reports were retrieved from the Veterans Affairs Corporate Data Warehouse from a random national sample of 652 patients. Radiographic fatty liver disease was determined by manual review by two physicians and verified with an expert radiologist. A split validation method was utilized for algorithm development. For all three imaging modalities, the algorithms could identify fatty liver disease with >90% recall and precision, with F-measures >90%. These algorithms could be used to rapidly screen patient records to establish a large cohort to facilitate epidemiological and clinical studies and examine the clinical course and outcomes of patients with radiographic hepatic steatosis.
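    The recall, precision, and F-measure figures quoted above follow from a confusion matrix of algorithm output against manual review. A minimal sketch of the arithmetic, assuming the balanced F1 form of the F-measure; the function name and the counts in the usage note are hypothetical and are not taken from the study:

```python
def precision_recall_f1(tp, fp, fn):
    """Return (precision, recall, F1) from true-positive, false-positive
    and false-negative counts. Empty denominators yield 0.0."""
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    # F1 is the harmonic mean of precision and recall.
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

    For example, with made-up counts of 90 true positives, 10 false positives and 10 false negatives, all three measures come out at 0.9.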

  16. Seed Storage Proteins as a System for Teaching Protein Identification by Mass Spectrometry in Biochemistry Laboratory

    Science.gov (United States)

    Wilson, Karl A.; Tan-Wilson, Anna

    2013-01-01

    Mass spectrometry (MS) has become an important tool in studying biological systems. One application is the identification of proteins and peptides by the matching of peptide and peptide fragment masses to the sequences of proteins in protein sequence databases. Often prior protein separation of complex protein mixtures by 2D-PAGE is needed,…

  17. Seed Storage Proteins as a System for Teaching Protein Identification by Mass Spectrometry in Biochemistry Laboratory

    Science.gov (United States)

    Wilson, Karl A.; Tan-Wilson, Anna

    2013-01-01

    Mass spectrometry (MS) has become an important tool in studying biological systems. One application is the identification of proteins and peptides by the matching of peptide and peptide fragment masses to the sequences of proteins in protein sequence databases. Often prior protein separation of complex protein mixtures by 2D-PAGE is needed,…

  18. Fast and Accurate Discovery of Degenerate Linear Motifs in Protein Sequences

    Science.gov (United States)

    Levy, Emmanuel D.; Michnick, Stephen W.

    2014-01-01

    Linear motifs mediate a wide variety of cellular functions, which makes their characterization in protein sequences crucial to understanding cellular systems. However, the short length and degenerate nature of linear motifs make their discovery a difficult problem. Here, we introduce MotifHound, an algorithm particularly suited for the discovery of small and degenerate linear motifs. MotifHound performs an exact and exhaustive enumeration of all motifs present in proteins of interest, including all of their degenerate forms, and scores the overrepresentation of each motif based on its occurrence in proteins of interest relative to a background (e.g., proteome) using the hypergeometric distribution. To assess MotifHound, we benchmarked it together with state-of-the-art algorithms. The benchmark consists of 11,880 sets of proteins from S. cerevisiae; in each set, we artificially spiked-in one motif varying in terms of three key parameters, (i) number of occurrences, (ii) length and (iii) the number of degenerate or “wildcard” positions. The benchmark enabled the evaluation of the impact of these three properties on the performance of the different algorithms. The results showed that MotifHound and SLiMFinder were the most accurate in detecting degenerate linear motifs. Interestingly, MotifHound was 15 to 20 times faster at comparable accuracy and performed best in the discovery of highly degenerate motifs. We complemented the benchmark by an analysis of proteins experimentally shown to bind the FUS1 SH3 domain from S. cerevisiae. Using the full-length protein partners as sole information, MotifHound recapitulated most experimentally determined motifs binding to the FUS1 SH3 domain. Moreover, these motifs exhibited properties typical of SH3 binding peptides, e.g., high intrinsic disorder and evolutionary conservation, despite the fact that none of these properties were used as prior information. MotifHound is available (http://michnick.bcm.umontreal.ca or http
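    The overrepresentation score described above rests on the hypergeometric distribution: given a background of N proteins of which K contain a motif, how likely is it that at least k of the n proteins of interest contain it by chance? The sketch below computes that upper-tail probability with the standard library only. It illustrates the statistic, not MotifHound's implementation; the function and parameter names are assumptions.

```python
from math import comb


def hypergeom_upper_tail(k, N, K, n):
    """P(X >= k) for X ~ Hypergeometric(N, K, n).

    N: background size, K: background proteins containing the motif,
    n: size of the set of interest, k: proteins in that set with the motif.
    """
    total = comb(N, n)
    upper = min(K, n)
    # math.comb returns 0 when the second argument exceeds the first,
    # so impossible configurations contribute nothing to the sum.
    favourable = sum(comb(K, i) * comb(N - K, n - i) for i in range(k, upper + 1))
    return favourable / total
```

    A small tail probability indicates that the motif occurs in the set of interest more often than the background frequency would predict.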

  19. Identification of mitochondrial proteins of malaria parasite using analysis of variance.

    Science.gov (United States)

    Ding, Hui; Li, Dongmei

    2015-02-01

    As a parasitic protozoan, Plasmodium falciparum (P. falciparum) can cause malaria. The mitochondrial proteins of the malaria parasite play important roles in the discovery of anti-malarial drug targets. Thus, accurate identification of mitochondrial proteins of the malaria parasite is a key step for understanding their functions and finding potential drug targets. In this work, we developed a sequence-based method to identify the mitochondrial proteins of the malaria parasite. First, we extended adjoining dipeptide composition to g-gap dipeptide composition for discretely formulating the protein sequences. Subsequently, the analysis of variance (ANOVA) combined with incremental feature selection (IFS) was used to pick out the optimal features. Finally, jackknife cross-validation was used to evaluate the performance of the proposed model. Evaluation results showed that a maximum accuracy of 97.1% could be achieved by using 101 optimal 5-gap dipeptides. The comparison with previous methods demonstrated that our method was accurate and efficient.
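    The g-gap dipeptide composition mentioned above generalizes the ordinary (adjoining) dipeptide composition by pairing residues separated by g intervening positions, which gives a 400-dimensional feature vector for each value of g. A minimal sketch follows, assuming frequencies are normalized by the number of counted pairs and that pairs containing non-standard residues are skipped; both choices, and all names, are assumptions rather than details taken from the paper.

```python
from itertools import product

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def g_gap_dipeptide_composition(seq, g):
    """Return a dict mapping each of the 400 residue pairs to its frequency
    among pairs (seq[i], seq[i + g + 1]). g = 0 gives adjoining dipeptides."""
    counts = {"".join(p): 0 for p in product(AMINO_ACIDS, repeat=2)}
    total = 0
    for i in range(len(seq) - g - 1):
        pair = seq[i] + seq[i + g + 1]
        if pair in counts:  # skip pairs with non-standard letters
            counts[pair] += 1
            total += 1
    if total == 0:
        return {pair: 0.0 for pair in counts}
    return {pair: c / total for pair, c in counts.items()}
```

    Calling the function with g = 5 would produce the kind of 5-gap features from which the abstract's 101 optimal features were selected; the selection step itself (ANOVA with IFS) is not shown here.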

  20. A tri-stage cluster identification model for accurate analysis of seismic catalogs

    Directory of Open Access Journals (Sweden)

    S. J. Nanda

    2013-02-01

    In this paper we propose a tri-stage cluster identification model that is a combination of a simple single iteration distance algorithm and an iterative K-means algorithm. In this study of earthquake seismicity, the model considers event location, time and magnitude information from earthquake catalog data to efficiently classify events as either background or mainshock and aftershock sequences. Tests on a synthetic seismicity catalog demonstrate the efficiency of the proposed model in terms of accuracy percentage (94.81% for background and 89.46% for aftershocks). The close agreement between lambda and cumulative plots for the ideal synthetic catalog and that generated by the proposed model also supports the accuracy of the proposed technique. There is flexibility in the model design to allow for proper selection of location and magnitude ranges, depending upon the nature of the mainshocks present in the catalog. The effectiveness of the proposed model is also evaluated by the classification of events in three historic catalogs: California, Japan and Indonesia. As expected, for both synthetic and historic catalog analysis it is observed that the density of events classified as background is almost uniform throughout the region, whereas the density of aftershock events is higher near the mainshocks.

  1. FAMSA: Fast and accurate multiple sequence alignment of huge protein families

    Science.gov (United States)

    Deorowicz, Sebastian; Debudaj-Grabysz, Agnieszka; Gudyś, Adam

    2016-01-01

    Rapid development of modern sequencing platforms has contributed to the unprecedented growth of protein family databases. The abundance of sets containing hundreds of thousands of sequences is a formidable challenge for multiple sequence alignment algorithms. The article introduces FAMSA, a new progressive algorithm designed for fast and accurate alignment of thousands of protein sequences. Its features include the utilization of the longest common subsequence measure for determining pairwise similarities, a novel method of evaluating gap costs, and a new iterative refinement scheme. Importantly, its implementation is highly optimized and parallelized to make the most of modern computer platforms. Thanks to the above, quality indicators, i.e. sum-of-pairs and total-column scores, show FAMSA to be superior to competing algorithms, such as Clustal Omega or MAFFT, for datasets exceeding a few thousand sequences. Quality does not compromise on time or memory requirements, which are an order of magnitude lower than those in the existing solutions. For example, a family of 415519 sequences was analyzed in less than two hours and required no more than 8 GB of RAM. FAMSA is available for free at http://sun.aei.polsl.pl/REFRESH/famsa. PMID:27670777
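    The longest common subsequence (LCS) measure named above can be illustrated with the textbook dynamic-programming recurrence. This is a plain sketch of the measure only: it does not reproduce FAMSA's optimized implementation or the way FAMSA turns LCS lengths into pairwise similarities, and the function name is hypothetical.

```python
def lcs_length(a, b):
    """Length of the longest common subsequence of strings a and b,
    computed row by row in O(len(a) * len(b)) time and O(len(b)) memory."""
    prev = [0] * (len(b) + 1)
    for ch in a:
        curr = [0]
        for j, bj in enumerate(b, start=1):
            if ch == bj:
                # Extend the best subsequence ending before both characters.
                curr.append(prev[j - 1] + 1)
            else:
                # Drop one character from either sequence.
                curr.append(max(prev[j], curr[j - 1]))
        prev = curr
    return prev[-1]
```

    A longer LCS relative to the sequence lengths indicates a more similar pair, which is the kind of signal a progressive aligner needs when deciding the order in which to merge sequences.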

  2. Conformational energy range of ligands in protein crystal structures: The difficult quest for accurate understanding.

    Science.gov (United States)

    Peach, Megan L; Cachau, Raul E; Nicklaus, Marc C

    2017-02-24

    In this review, we address a fundamental question: What is the range of conformational energies seen in ligands in protein-ligand crystal structures? This value is important biophysically, for better understanding the protein-ligand binding process; and practically, for providing a parameter to be used in many computational drug design methods such as docking and pharmacophore searches. We synthesize a selection of previously reported conflicting results from computational studies of this issue and conclude that high ligand conformational energies really are present in some crystal structures. The main source of disagreement between different analyses appears to be due to divergent treatments of electrostatics and solvation. At the same time, however, for many ligands, a high conformational energy is in error, due to either crystal structure inaccuracies or incorrect determination of the reference state. Aside from simple chemistry mistakes, we argue that crystal structure error may mainly be because of the heuristic weighting of ligand stereochemical restraints relative to the fit of the structure to the electron density. This problem cannot be fixed with improvements to electron density fitting or with simple ligand geometry checks, though better metrics are needed for evaluating ligand and binding site chemistry in addition to geometry during structure refinement. The ultimate solution for accurately determining ligand conformational energies lies in ultrahigh-resolution crystal structures that can be refined without restraints.

  3. Use of Fourier transform infrared (FTIR) spectroscopy for rapid and accurate identification of yeasts isolated from humans and animals

    Directory of Open Access Journals (Sweden)

    M. Taha

    2013-06-01

    Rapid and accurate identification of yeast is increasingly important for selecting the appropriate therapy, thus reducing morbidity and mortality related to yeast infections. Vibrational spectroscopic techniques (infrared (IR) and Raman) could provide potential alternatives to conventional typing methods, because they provide a rapid, inexpensive and highly specific spectroscopic fingerprint through which microorganisms can be identified. The present study evaluates FTIR spectroscopy as a sensitive and effective assay for the identification of the most frequent yeast species isolated from humans and animals. One hundred and twenty-eight yeasts isolated from infected human mouths/vaginas, chronically diseased cows, crop mycosis in chickens and soil contaminated with pigeon droppings were phenotypically identified. Using universal primers, ITS1/ITS4, we amplified ITS1-5.8S-ITS2 rDNA regions for 39 yeast isolates as representative samples. The PCR products were digested with restriction enzyme MspI and examined by PCR-RFLP, which was an efficient technique for identification of Candida spp., Cryptococcus neoformans and Trichosporon asahii. Further, identification of the same 39 isolates was performed by FTIR spectroscopy, and these isolates served as references for other strains by comparison of their FTIR spectra. The current study has clearly demonstrated the significant spectral differences between the various examined species of Candida, Cryptococcus, Trichosporon, Rhodotorula and Geotrichum isolated from different sources. Our research has confirmed that FTIR spectroscopy is a promising diagnostic tool, because of its sensitivity, rapidity, high differentiation capacity and simplicity compared to conventional/molecular techniques.

  4. Evaluating de novo sequencing in proteomics: already an accurate alternative to database-driven peptide identification?

    Science.gov (United States)

    Muth, Thilo; Renard, Bernhard Y

    2017-03-21

    While peptide identifications in mass spectrometry (MS)-based shotgun proteomics are mostly obtained using database search methods, high-resolution spectrum data from modern MS instruments nowadays offer the prospect of improving the performance of computational de novo peptide sequencing. The major benefit of de novo sequencing is that it does not require a reference database to deduce full-length or partial tag-based peptide sequences directly from experimental tandem mass spectrometry spectra. Although various algorithms have been developed for automated de novo sequencing, the prediction accuracy of proposed solutions has been rarely evaluated in independent benchmarking studies. The main objective of this work is to provide a detailed evaluation on the performance of de novo sequencing algorithms on high-resolution data. For this purpose, we processed four experimental data sets acquired from different instrument types from collision-induced dissociation and higher energy collisional dissociation (HCD) fragmentation mode using the software packages Novor, PEAKS and PepNovo. Moreover, the accuracy of these algorithms is also tested on ground truth data based on simulated spectra generated from peak intensity prediction software. We found that Novor shows the overall best performance compared with PEAKS and PepNovo with respect to the accuracy of correct full peptide, tag-based and single-residue predictions. In addition, the same tool outpaced the commercial competitor PEAKS in terms of running time speedup by factors of around 12-17. Despite around 35% prediction accuracy for complete peptide sequences on HCD data sets, taken as a whole, the evaluated algorithms perform moderately on experimental data but show a significantly better performance on simulated data (up to 84% accuracy). Further, we describe the most frequently occurring de novo sequencing errors and evaluate the influence of missing fragment ion peaks and spectral noise on the accuracy. 
Finally

  5. An improved Bathocuproine assay for accurate valence identification and quantification of copper bound by biomolecules.

    Science.gov (United States)

    Chen, Dinglong; Darabedian, Narek; Li, Zhiqiang; Kai, Tianhan; Jiang, Dianlu; Zhou, Feimeng

    2016-03-15

    Copper is an essential metal in all organisms. Reliably quantifying and identifying the copper content and oxidation state is crucial, since the information is essential to understanding protein structure and function. Chromophoric ligands, such as Bathocuproine (BC) and its water-soluble analog, Bathocuproinedisulfonic acid (BCS), preferentially bind Cu(I) over Cu(II), and therefore have been widely used as optical probes to determine the oxidation state of copper bound by biomolecules. However, the BCS assay is commonly misused, leading to erroneous conclusions regarding the role of copper in biological processes. By measuring the redox potential of Cu(II)-BCS2 and conducting UV-vis absorption measurements in the presence of oxidizable amino acids, the thermodynamic origin of the potential artifacts becomes evident. The BCS assay was improved by introducing a strong Cu(II) chelator, EDTA, prior to the addition of BCS to prevent interference that might arise from Cu(II) present in the sample. The strong Cu(II) chelator eliminates the potential errors inherent in the conventional BCS assay. Applications of the improved assay to peptides and proteins containing oxidizable amino acid residues confirm that free Cu(II) no longer leads to artifacts, thereby resolving issues related to this persistently misused colorimetric assay of Cu(I) in biological systems.

  6. Rapid and Accurate Identification of Animal Species in Natural Leather Goods by Liquid Chromatography/Mass Spectrometry.

    Science.gov (United States)

    Izuchi, Yukari; Takashima, Tsuneo; Hatano, Naoya

    2016-01-01

    The demand for leather goods has grown globally in recent years. Industry revenue is forecast to reach $91.2 billion by 2018. There is an ongoing labelling problem in the leather items market, in that it is currently impossible to identify the species that a given piece of leather is derived from. To address this issue, we developed a rapid and simple method for the specific identification of leather derived from cattle, horses, pigs, sheep, goats, and deer by analysing peptides produced by the trypsin-digestion of proteins contained in leather goods using liquid chromatography/mass spectrometry. We determined species-specific amino acid sequences by liquid chromatography/tandem mass spectrometry analysis using the Mascot software program and demonstrated that collagen α-1(I), collagen α-2(I), and collagen α-1(III) from the dermal layer of the skin are particularly useful in species identification.

  7. Evaluating Peptide Mass Fingerprinting-based Protein Identification

    Institute of Scientific and Technical Information of China (English)

    Senthilkumar Damodaran; Troy D. Wood; Priyadharsini Nagarajan; Richard A. Rabin

    2007-01-01

    Identification of proteins by mass spectrometry (MS) is an essential step in proteomic studies and is typically accomplished by either peptide mass fingerprinting (PMF) or amino acid sequencing of the peptide. Although sequence information from MS/MS analysis can be used to validate PMF-based protein identification, it may not be practical when analyzing a large number of proteins and when high-throughput MS/MS instrumentation is not readily available. At present, a vast majority of proteomic studies employ PMF. However, there are huge disparities in criteria used to identify proteins using PMF. Therefore, to reduce incorrect protein identification using PMF, and also to increase confidence in PMF-based protein identification without accompanying MS/MS analysis, definitive guiding principles are essential. To this end, we propose a value-based scoring system that provides guidance on evaluating when PMF-based protein identification can be deemed sufficient without accompanying amino acid sequence data from MS/MS analysis.
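    Peptide mass fingerprinting compares measured peptide masses with the masses predicted from an in-silico digest of each candidate protein. The sketch below shows that prediction step under simplifying assumptions: trypsin cleaves after K or R except before P, no missed cleavages or modifications are considered, and masses are neutral monoisotopic values from the standard residue table. It does not implement the scoring system proposed in the abstract, and all names are hypothetical.

```python
# Monoisotopic residue masses in daltons (standard values).
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # one water is added per free peptide


def trypsin_digest(seq):
    """Split seq after every K or R that is not followed by P."""
    peptides, start = [], 0
    for i, aa in enumerate(seq):
        nxt = seq[i + 1] if i + 1 < len(seq) else ""
        if aa in "KR" and nxt != "P":
            peptides.append(seq[start:i + 1])
            start = i + 1
    if start < len(seq):
        peptides.append(seq[start:])
    return peptides


def peptide_mass(peptide):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER


def count_matches(observed, seq, tol=0.2):
    """Count observed masses within tol Da of any predicted tryptic peptide."""
    predicted = [peptide_mass(p) for p in trypsin_digest(seq)]
    return sum(any(abs(m - p) <= tol for p in predicted) for m in observed)
```

    In practice the number of matched masses is only one input to an identification decision; sequence coverage, mass accuracy and database size also matter, which is the gap the proposed scoring system is meant to address.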

  8. More comprehensive forensic genetic marker analyses for accurate human remains identification using massively parallel DNA sequencing.

    Science.gov (United States)

    Ambers, Angie D; Churchill, Jennifer D; King, Jonathan L; Stoljarova, Monika; Gill-King, Harrell; Assidi, Mourad; Abu-Elmagd, Muhammad; Buhmeida, Abdelbaset; Al-Qahtani, Mohammed; Budowle, Bruce

    2016-10-17

    Although the primary objective of forensic DNA analyses of unidentified human remains is positive identification, cases involving historical or archaeological skeletal remains often lack reference samples for comparison. Massively parallel sequencing (MPS) offers an opportunity to provide biometric data in such cases, and these cases provide valuable data on the feasibility of applying MPS for characterization of modern forensic casework samples. In this study, MPS was used to characterize 140-year-old human skeletal remains discovered at a historical site in Deadwood, South Dakota, United States. The remains were in an unmarked grave and there were no records or other metadata available regarding the identity of the individual. Due to the high throughput of MPS, a variety of biometric markers could be typed using a single sample. Using MPS and suitable forensic genetic markers, more relevant information could be obtained from a limited quantity and quality sample. Results were obtained for 25/26 Y-STRs, 34/34 Y SNPs, 166/166 ancestry-informative SNPs, 24/24 phenotype-informative SNPs, 102/102 human identity SNPs, 27/29 autosomal STRs (plus amelogenin), and 4/8 X-STRs (as well as ten regions of mtDNA). The Y-chromosome (Y-STR, Y-SNP) and mtDNA profiles of the unidentified skeletal remains are consistent with the R1b and H1 haplogroups, respectively. Both of these haplogroups are the most common haplogroups in Western Europe. Ancestry-informative SNP analysis also supported European ancestry. The genetic results are consistent with anthropological findings that the remains belong to a male of European ancestry (Caucasian). Phenotype-informative SNP data provided strong support that the individual had light red hair and brown eyes. This study is among the first to genetically characterize historical human remains with forensic genetic marker kits specifically designed for MPS. 
The outcome demonstrates that substantially more genetic information can be obtained from

  9. Accurate microRNA target prediction correlates with protein repression levels

    Directory of Open Access Journals (Sweden)

    Simossis Victor A

    2009-09-01

    Background: MicroRNAs are small endogenously expressed non-coding RNA molecules that regulate target gene expression through translation repression or messenger RNA degradation. MicroRNA regulation is performed through pairing of the microRNA to sites in the messenger RNA of protein coding genes. Since experimental identification of miRNA target genes poses difficulties, computational microRNA target prediction is one of the key means in deciphering the role of microRNAs in development and disease. Results: DIANA-microT 3.0 is an algorithm for microRNA target prediction which is based on several parameters calculated individually for each microRNA and combines conserved and non-conserved microRNA recognition elements into a final prediction score, which correlates with protein production fold change. Specifically, for each predicted interaction the program reports a signal to noise ratio and a precision score which can be used as an indication of the false positive rate of the prediction. Conclusion: Recently, several computational target prediction programs were benchmarked based on a set of microRNA target genes identified by the pSILAC method. In this assessment DIANA-microT 3.0 was found to achieve the highest precision among the most widely used microRNA target prediction programs reaching approximately 66%. The DIANA-microT 3.0 prediction results are available online in a user friendly web server at http://www.microrna.gr/microT

  10. Fitmunk: improving protein structures by accurate, automatic modeling of side-chain conformations.

    Science.gov (United States)

    Porebski, Przemyslaw Jerzy; Cymborowski, Marcin; Pasenkiewicz-Gierula, Marta; Minor, Wladek

    2016-02-01

    Improvements in crystallographic hardware and software have allowed automated structure-solution pipelines to approach a near-`one-click' experience for the initial determination of macromolecular structures. However, in many cases the resulting initial model requires a laborious, iterative process of refinement and validation. A new method has been developed for the automatic modeling of side-chain conformations that takes advantage of rotamer-prediction methods in a crystallographic context. The algorithm, which is based on deterministic dead-end elimination (DEE) theory, uses new dense conformer libraries and a hybrid energy function derived from experimental data and prior information about rotamer frequencies to find the optimal conformation of each side chain. In contrast to existing methods, which incorporate the electron-density term into protein-modeling frameworks, the proposed algorithm is designed to take advantage of the highly discriminatory nature of electron-density maps. This method has been implemented in the program Fitmunk, which uses extensive conformational sampling. This improves the accuracy of the modeling and makes it a versatile tool for crystallographic model building, refinement and validation. Fitmunk was extensively tested on over 115 new structures, as well as a subset of 1100 structures from the PDB. It is demonstrated that the ability of Fitmunk to model more than 95% of side chains accurately is beneficial for improving the quality of crystallographic protein models, especially at medium and low resolutions. Fitmunk can be used for model validation of existing structures and as a tool to assess whether side chains are modeled optimally or could be better fitted into electron density. Fitmunk is available as a web service at http://kniahini.med.virginia.edu/fitmunk/server/ or at http://fitmunk.bitbucket.org/.
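
    The dead-end elimination step named in this abstract can be illustrated with a minimal sketch. It applies the classic DEE criterion to a toy energy table: rotamer r at a residue is discarded when its best possible energy is still worse than the worst possible energy of some alternative rotamer t at the same residue. The function name dee_prune, the data layout, and the numbers are assumptions made for illustration. Fitmunk's hybrid energy function, its electron-density term, and its conformer libraries are not reproduced here.

```python
def dee_prune(self_e, pair_e):
    """Prune rotamers with the classic dead-end elimination criterion.

    self_e[i][r]         : self energy of rotamer r at residue i (toy values)
    pair_e[(i, j)][r][s] : pair energy of rotamer r at i and s at j, with i < j
    Returns one set of surviving rotamer indices per residue.
    """
    n = len(self_e)
    alive = [set(range(len(e))) for e in self_e]

    def pair(i, r, j, s):
        # Each residue pair is stored once, keyed with the smaller index first.
        return pair_e[(i, j)][r][s] if i < j else pair_e[(j, i)][s][r]

    changed = True
    while changed:  # repeat until a full pass removes nothing
        changed = False
        for i in range(n):
            for r in sorted(alive[i]):
                # Best case for r: most favourable partner at every other residue.
                best_r = self_e[i][r] + sum(
                    min(pair(i, r, j, s) for s in alive[j])
                    for j in range(n) if j != i)
                for t in alive[i] - {r}:
                    # Worst case for t: least favourable partner everywhere.
                    worst_t = self_e[i][t] + sum(
                        max(pair(i, t, j, s) for s in alive[j])
                        for j in range(n) if j != i)
                    if best_r > worst_t:
                        alive[i].discard(r)  # r cannot be in the optimum
                        changed = True
                        break
    return alive


# Toy system: two residues with two candidate rotamers each.
toy_self = [[0.0, 5.0], [0.0, 1.0]]
toy_pair = {(0, 1): [[0.0, 0.5], [0.5, 0.0]]}
survivors = dee_prune(toy_self, toy_pair)
```

    A rotamer is only removed when a surviving alternative beats it in every context, so each residue always keeps at least one candidate. In this toy system a single rotamer survives at each residue, and it matches the lowest-energy combination found by enumerating all four pairs by hand.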

  11. Identification of outer membrane proteins of Yersinia pestis through biotinylation

    NARCIS (Netherlands)

    Smither, S.J.; Hill, J.; Baar, B.L.M. van; Hulst, A.G.; Jong, A.L. de; Titball, R.W.

    2007-01-01

    The outer membrane of Gram-negative bacteria contains proteins that might be good targets for vaccines, antimicrobials or detection systems. The identification of surface located proteins using traditional methods is often difficult. Yersinia pestis, the causative agent of plague, was labelled with

  13. IFPTarget: A Customized Virtual Target Identification Method Based on Protein-Ligand Interaction Fingerprinting Analyses.

    Science.gov (United States)

    Li, Guo-Bo; Yu, Zhu-Jun; Liu, Sha; Huang, Lu-Yi; Yang, Ling-Ling; Lohans, Christopher T; Yang, Sheng-Yong

    2017-07-24

    Small-molecule target identification is an important and challenging task for chemical biology and drug discovery. Structure-based virtual target identification has been widely used, which infers and prioritizes potential protein targets for the molecule of interest (MOI) principally via a scoring function. However, current "universal" scoring functions may not always accurately identify targets to which the MOI binds from the retrieved target database, in part due to a lack of consideration of the important binding features for an individual target. Here, we present IFPTarget, a customized virtual target identification method, which uses an interaction fingerprinting (IFP) method for target-specific interaction analyses and a comprehensive index (Cvalue) for target ranking. Evaluation results indicate that the IFP method enables substantially improved binding pose prediction, and Cvalue has an excellent performance in target ranking for the test set. When applied to screen against our established target library that contains 11,863 protein structures covering 2842 unique targets, IFPTarget could retrieve known targets within the top-ranked list and identified new potential targets for chemically diverse drugs. IFPTarget prediction led to the identification of the metallo-β-lactamase VIM-2 as a target for quercetin as validated by enzymatic inhibition assays. This study provides a new in silico target identification tool and will aid future efforts to develop new target-customized methods for target identification.

  14. High Specificity in Circulating Tumor Cell Identification Is Required for Accurate Evaluation of Programmed Death-Ligand 1

    Science.gov (United States)

    Schultz, Zachery D.; Warrick, Jay W.; Guckenberger, David J.; Pezzi, Hannah M.; Sperger, Jamie M.; Heninger, Erika; Saeed, Anwaar; Leal, Ticiana; Mattox, Kara; Traynor, Anne M.; Campbell, Toby C.; Berry, Scott M.; Beebe, David J.; Lang, Joshua M.

    2016-01-01

    Background Expression of programmed-death ligand 1 (PD-L1) in non-small cell lung cancer (NSCLC) is typically evaluated through invasive biopsies; however, recent advances in the identification of circulating tumor cells (CTCs) may be a less invasive method to assay tumor cells for these purposes. These liquid biopsies rely on accurate identification of CTCs from the diverse populations in the blood, where some tumor cells share characteristics with normal blood cells. While many blood cells can be excluded by their high expression of CD45, neutrophils and other immature myeloid subsets have low to absent expression of CD45 and also express PD-L1. Furthermore, cytokeratin is typically used to identify CTCs, but neutrophils may stain non-specifically for intracellular antibodies, including cytokeratin, thus preventing accurate evaluation of PD-L1 expression on tumor cells. This holds even greater significance when evaluating PD-L1 in epithelial cell adhesion molecule (EpCAM) positive and EpCAM negative CTCs (as in epithelial-mesenchymal transition (EMT)). Methods To evaluate the impact of CTC misidentification on PD-L1 evaluation, we utilized CD11b to identify myeloid cells. CTCs were isolated from patients with metastatic NSCLC using EpCAM, MUC1 or Vimentin capture antibodies and exclusion-based sample preparation (ESP) technology. Results Large populations of CD11b+CD45lo cells were identified in buffy coats and stained non-specifically for intracellular antibodies including cytokeratin. The amount of CD11b+ cells misidentified as CTCs varied among patients; accounting for 33–100% of traditionally identified CTCs. Cells captured with vimentin had a higher frequency of CD11b+ cells at 41%, compared to 20% and 18% with MUC1 or EpCAM, respectively. Cells misidentified as CTCs ultimately skewed PD-L1 expression to varying degrees across patient samples. 
Conclusions Interfering myeloid populations can be differentiated from true CTCs with additional staining criteria

  15. High Specificity in Circulating Tumor Cell Identification Is Required for Accurate Evaluation of Programmed Death-Ligand 1.

    Directory of Open Access Journals (Sweden)

    Jennifer L Schehr

    Expression of programmed-death ligand 1 (PD-L1) in non-small cell lung cancer (NSCLC) is typically evaluated through invasive biopsies; however, recent advances in the identification of circulating tumor cells (CTCs) may be a less invasive method to assay tumor cells for these purposes. These liquid biopsies rely on accurate identification of CTCs from the diverse populations in the blood, where some tumor cells share characteristics with normal blood cells. While many blood cells can be excluded by their high expression of CD45, neutrophils and other immature myeloid subsets have low to absent expression of CD45 and also express PD-L1. Furthermore, cytokeratin is typically used to identify CTCs, but neutrophils may stain non-specifically for intracellular antibodies, including cytokeratin, thus preventing accurate evaluation of PD-L1 expression on tumor cells. This holds even greater significance when evaluating PD-L1 in epithelial cell adhesion molecule (EpCAM) positive and EpCAM negative CTCs (as in epithelial-mesenchymal transition (EMT)). To evaluate the impact of CTC misidentification on PD-L1 evaluation, we utilized CD11b to identify myeloid cells. CTCs were isolated from patients with metastatic NSCLC using EpCAM, MUC1 or Vimentin capture antibodies and exclusion-based sample preparation (ESP) technology. Large populations of CD11b+CD45lo cells were identified in buffy coats and stained non-specifically for intracellular antibodies including cytokeratin. The amount of CD11b+ cells misidentified as CTCs varied among patients; accounting for 33-100% of traditionally identified CTCs. Cells captured with vimentin had a higher frequency of CD11b+ cells at 41%, compared to 20% and 18% with MUC1 or EpCAM, respectively. Cells misidentified as CTCs ultimately skewed PD-L1 expression to varying degrees across patient samples. Interfering myeloid populations can be differentiated from true CTCs with additional staining criteria, thus improving the

  16. Proteomics: Protein Identification Using Online Databases

    Science.gov (United States)

    Eurich, Chris; Fields, Peter A.; Rice, Elizabeth

    2012-01-01

    Proteomics is an emerging area of systems biology that allows simultaneous study of thousands of proteins expressed in cells, tissues, or whole organisms. We have developed this activity to enable high school or college students to explore proteomic databases using mass spectrometry data files generated from yeast proteins in a college laboratory…

  19. Protein identification by peptide mass fingerprinting

    DEFF Research Database (Denmark)

    Hjernø, Karin

    2007-01-01

      Peptide mass fingerprinting is an effective way of identifying, e.g., gel-separated proteins, by matching experimentally obtained peptide mass data against large databases. However, several factors are known to influence the quality of the resulting matches, such as proteins contaminating the s...
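
    The matching step described in this record can be sketched as follows. This is a minimal illustration, not the author's method: each database sequence is digested in silico with a simple trypsin rule (cleave after K or R unless the next residue is P, no missed cleavages), theoretical monoisotopic peptide masses are computed, and observed masses are counted as hits within a tolerance. The function names, the two toy sequences, and the 0.02 Da tolerance are assumptions chosen for the example.

```python
# Standard monoisotopic residue masses in daltons.
RESIDUE_MASS = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056  # added once per peptide for the free termini


def tryptic_peptides(sequence):
    """Cleave after K or R unless the next residue is P (no missed cleavages)."""
    peptides, start = [], 0
    for i, aa in enumerate(sequence):
        nxt = sequence[i + 1] if i + 1 < len(sequence) else ""
        if aa in "KR" and nxt != "P":
            peptides.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        peptides.append(sequence[start:])
    return peptides


def peptide_mass(peptide):
    """Neutral monoisotopic mass of an unmodified peptide."""
    return sum(RESIDUE_MASS[aa] for aa in peptide) + WATER


def count_matches(observed, sequence, tol=0.02):
    """Number of observed masses within tol (Da) of any theoretical peptide."""
    theoretical = [peptide_mass(p) for p in tryptic_peptides(sequence)]
    return sum(any(abs(m - t) <= tol for t in theoretical) for m in observed)


def rank_database(observed, database, tol=0.02):
    """Rank database entries by the number of matched peptide masses."""
    scores = {name: count_matches(observed, seq, tol)
              for name, seq in database.items()}
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Hypothetical two-entry database and a peak list derived from protA.
toy_db = {"protA": "GASKPVTRLLNDK", "protB": "MKWYR"}
peaks = [peptide_mass("GASKPVTR"), peptide_mass("LLNDK")]
ranking = rank_database(peaks, toy_db)
```

    A bare match count like this is easily distorted by peaks from a contaminating protein, which is one of the quality factors the abstract mentions. Real search engines score matches statistically rather than counting them.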

  20. Direct Maximization of Protein Identifications from Tandem Mass Spectra

    Science.gov (United States)

    Spivak, Marina; Weston, Jason; Tomazela, Daniela; MacCoss, Michael J.; Noble, William Stafford

    2012-01-01

    The goal of many shotgun proteomics experiments is to determine the protein complement of a complex biological mixture. For many mixtures, most methodological approaches fall significantly short of this goal. Existing solutions to this problem typically subdivide the task into two stages: first identifying a collection of peptides with a low false discovery rate and then inferring from the peptides a corresponding set of proteins. In contrast, we formulate the protein identification problem as a single optimization problem, which we solve using machine learning methods. This approach is motivated by the observation that the peptide and protein level tasks are cooperative, and the solution to each can be improved by using information about the solution to the other. The resulting algorithm directly controls the relevant error rate, can incorporate a wide variety of evidence and, for complex samples, provides 18–34% more protein identifications than the current state of the art approaches. PMID:22052992

  1. Experimental Identification of Downhill Protein Folding

    Science.gov (United States)

    Garcia-Mira, Maria M.; Sadqi, Mourad; Fischer, Niels; Sanchez-Ruiz, Jose M.; Muñoz, Victor

    2002-12-01

    Theory predicts the existence of barrierless protein folding. Without barriers, folding should be noncooperative and the degree of native structure should be coupled to overall protein stability. We investigated the thermal unfolding of the peripheral subunit binding domain from Escherichia coli's 2-oxoglutarate dehydrogenase multienzyme complex (termed BBL) with a combination of spectroscopic techniques and calorimetry. Each technique probed a different feature of protein structure. BBL has a defined three-dimensional structure at low temperatures. However, each technique showed a distinct unfolding transition. Global analysis with a statistical mechanical model identified BBL as a downhill-folding protein. Because of BBL's biological function, we propose that downhill folders may be molecular rheostats, in which effects could be modulated by altering the distribution of an ensemble of structures.

  2. Identification of Ina proteins from Fusarium acuminatum

    Science.gov (United States)

    Scheel, Jan Frederik; Kunert, Anna Theresa; Pöschl, Ulrich; Fröhlich-Nowoisky, Janine

    2015-04-01

    Freezing of water above -36 °C is based on ice nucleation activity (INA) mediated by ice nucleators (IN) which can be of various origins. Besides mineral IN, biological particles are a potentially important source of atmospheric IN. The best-known biological IN are common plant-associated bacteria. The IN activity of these bacteria is induced by a surface protein on the outer cell membrane, which is fully characterized. In contrast, much less is known about the nature of fungal IN. The fungal genus Fusarium is widely spread throughout the earth. It belongs to the Ascomycota and is one of the most severe fungal pathogens. It can affect a variety of organisms from plants to animals including humans. INA of Fusarium was already described about 30 years ago and INA of Fusarium as well as other fungal genera is assumed to be mediated by proteins or at least to contain a proteinaceous compound. Although many efforts have been made, the precise INA machinery of Fusarium and other fungal species, including the proteins and their corresponding genes, remains unidentified. In this study preparations from living fungal samples of F. acuminatum were fractionated by liquid chromatography and IN active fractions were identified by freezing assays. SDS-PAGE and de novo sequencing by mass spectrometry were used to identify the primary structure of the protein. Preliminary results show that the INA protein of F. acuminatum is contained in the early size exclusion chromatography fractions, indicating a high molecular size. Moreover, we could identify a single protein band from IN active fractions at 130-145 kDa, corresponding to sizes of IN proteins from bacterial species. To our knowledge, this is the first isolation from in vivo Fusarium samples of a single protein that can be assigned as IN active.

  3. Identification of N(6)-methyladenosine reader proteins.

    Science.gov (United States)

    Zhou, Katherine I; Liu, Nian; Pan, Tao

    2017-08-15

    The reversible N(6)-methyladenosine (m(6)A) modification of eukaryotic messenger RNAs (mRNAs) is a widespread regulatory mechanism that impacts every step in the mRNA life cycle. The effect of m(6)A on mRNA fate depends on the binding of "m(6)A reader" proteins - RNA binding proteins that specifically bind to RNAs containing m(6)A. Here, we describe an RNA pull-down method that can be used to identify novel m(6)A reader proteins starting from a known m(6)A-modified site in cellular or viral RNA. We further describe how a combination of immunoprecipitation-based sequencing methods can be used to identify m(6)A-modified sites bound by an m(6)A reader protein on a transcriptome-wide level. The discovery of new m(6)A reader proteins and their m(6)A-modified targets would provide further insight into the mechanisms and functions of m(6)A in the cell.

  4. Identification of immunogenic proteins of Waddlia chondrophila.

    Directory of Open Access Journals (Sweden)

    Carole Kebbi-Beghdadi

    Evidence is growing for a role of Waddlia chondrophila as an agent of adverse pregnancy outcomes in both humans and ruminants. This emerging pathogen, a member of the order Chlamydiales, is also implicated in bronchiolitis and lower respiratory tract infections. Until now, the serological diagnosis of W. chondrophila infection has mainly relied on manually intensive tests including micro-immunofluorescence and Western blotting. Thus, there is an urgent need to establish reliable high throughput serological assays. Using a combined genomic and proteomic approach, we detected 57 immunogenic proteins of W. chondrophila, of which 17 were analysed by mass spectrometry. Two novel hypothetical proteins, Wim3 and Wim4, were expressed as recombinant proteins in Escherichia coli, purified and used as antigens in an ELISA test. Both proteins were recognized by sera of rabbits immunized with W. chondrophila as well as by human W. chondrophila positive sera, but not by rabbit pre-immune sera or human W. chondrophila negative sera. These results demonstrated that the approach chosen is suitable to identify immunogenic proteins that can be used to develop a serological test. Such a test will be a valuable tool to further clarify the pathogenic potential of W. chondrophila.

  5. BioID Identification of Lamin-Associated Proteins.

    Science.gov (United States)

    Mehus, Aaron A; Anderson, Ruthellen H; Roux, Kyle J

    2016-01-01

    A- and B-type lamins support the nuclear envelope, contribute to heterochromatin organization, and regulate a myriad of nuclear processes. The mechanisms by which lamins function in different cell types and the mechanisms by which lamin mutations cause over a dozen human diseases (laminopathies) remain unclear. The identification of proteins associated with lamins is likely to provide fundamental insight into these mechanisms. BioID (proximity-dependent biotin identification) is a unique and powerful method for identifying protein-protein and proximity-based interactions in living cells. BioID utilizes a mutant biotin ligase from bacteria that is fused to a protein of interest (bait). When expressed in living cells and stimulated with excess biotin, this BioID-fusion protein promiscuously biotinylates directly interacting and vicinal endogenous proteins. Following biotin-affinity capture, the biotinylated proteins can be identified using mass spectrometry. BioID thus enables screening for physiologically relevant protein associations that occur over time in living cells. BioID is applicable to insoluble proteins such as lamins that are often refractory to study by other methods and can identify weak and/or transient interactions. We discuss the use of BioID to elucidate novel lamin-interacting proteins and its applications in a broad range of biological systems, and provide detailed protocols to guide new applications.

  6. Large-Scale Off-Target Identification Using Fast and Accurate Dual Regularized One-Class Collaborative Filtering and Its Application to Drug Repurposing

    Science.gov (United States)

    Poleksic, Aleksandar; Yao, Yuan; Tong, Hanghang; Meng, Patrick; Xie, Lei

    2016-01-01

    Target-based screening is one of the major approaches in drug discovery. Besides the intended target, unexpected drug off-target interactions often occur, and many of them have not been recognized and characterized. The off-target interactions can be responsible for either therapeutic or side effects. Thus, identifying the genome-wide off-targets of lead compounds or existing drugs will be critical for designing effective and safe drugs, and providing new opportunities for drug repurposing. Although many computational methods have been developed to predict drug-target interactions, they are either less accurate than the one that we are proposing here or computationally too intensive, thereby limiting their capability for large-scale off-target identification. In addition, the performances of most machine learning based algorithms have been mainly evaluated to predict off-target interactions in the same gene family for hundreds of chemicals. It is not clear how these algorithms perform in terms of detecting off-targets across gene families on a proteome scale. Here, we are presenting a fast and accurate off-target prediction method, REMAP, which is based on a dual regularized one-class collaborative filtering algorithm, to explore continuous chemical space, protein space, and their interactome on a large scale. When tested in a reliable, extensive, and cross-gene family benchmark, REMAP outperforms the state-of-the-art methods. Furthermore, REMAP is highly scalable. It can screen a dataset of 200 thousand chemicals against 20 thousand proteins within 2 hours. Using the reconstructed genome-wide target profile as the fingerprint of a chemical compound, we predicted that seven FDA-approved drugs can be repurposed as novel anti-cancer therapies. The anti-cancer activity of six of them is supported by experimental evidence. 
Thus, REMAP is a valuable addition to the existing in silico toolbox for drug target identification, drug repurposing, phenotypic screening, and

  7. Accurate and rapid identification of the Burkholderia pseudomallei near-neighbour, Burkholderia ubonensis, using real-time PCR.

    Directory of Open Access Journals (Sweden)

    Erin P Price

    Burkholderia ubonensis is an environmental bacterium belonging to the Burkholderia cepacia complex (Bcc), a group of genetically related organisms that are associated with opportunistic but generally nonfatal infections in healthy individuals. In contrast, the near-neighbour species Burkholderia pseudomallei causes melioidosis, a disease that can be fatal in up to 95% of cases if left untreated. B. ubonensis is frequently misidentified as B. pseudomallei from soil samples using selective culturing on Ashdown's medium, reflecting both the shared environmental niche and morphological similarities of these species. Additionally, B. ubonensis shows potential as an important biocontrol agent in B. pseudomallei-endemic regions as certain strains possess antagonistic properties towards B. pseudomallei. Current methods for characterising B. ubonensis are laborious, time-consuming and costly, and as such this bacterium remains poorly studied. The aim of our study was to develop a rapid and inexpensive real-time PCR-based assay specific for B. ubonensis. We demonstrate that a novel B. ubonensis-specific assay, Bu550, accurately differentiates B. ubonensis from B. pseudomallei and other species that grow on selective Ashdown's agar. We anticipate that Bu550 will catalyse research on B. ubonensis by enabling rapid identification of this organism from Ashdown's-positive colonies that are not B. pseudomallei.

  8. Identification and quantitation of signal molecule-dependent protein phosphorylation

    KAUST Repository

    Groen, Arnoud J.

    2013-09-03

    Phosphoproteomics is a fast-growing field that aims at characterizing phosphorylated proteins in a cell or a tissue at a given time. Phosphorylation of proteins is an important regulatory mechanism in many cellular processes. Gel-free phosphoproteomic techniques that couple phosphopeptide enrichment with mass spectrometry have proven invaluable for detecting and characterizing phosphorylated proteins. In this chapter, a gel-free quantitative approach is described that combines 15N metabolic labelling with phosphopeptide enrichment by titanium dioxide (TiO2) and identification of the enriched phosphopeptides by MS. This workflow can be used to gain insights into the role of signalling molecules such as cyclic nucleotides on regulatory networks through the identification and quantification of responsive (phospho)proteins.

  9. Protein C/S ratio, an accurate and simple tool to identify carriers of a protein C gene mutation

    NARCIS (Netherlands)

    Libourel, EJ; Meinardi; de Kam, PJ; Ruiters, MHJ; van der Meer, J; van der Schaaf, W; Veenstra, R.

    Hereditary protein C deficiency is demonstrated by lowered protein C plasma levels in a patient and at least one first-degree relative. This approach is insufficient in some cases owing to overlapping protein C levels in carriers and non-carriers of a protein C gene mutation. The protein C/S ratio

  10. Combining computer algorithms with experimental approaches permits the rapid and accurate identification of T cell epitopes from defined antigens.

    Science.gov (United States)

    Schirle, M; Weinschenk, T; Stevanović, S

    2001-11-01

    The identification of T cell epitopes from immunologically relevant antigens remains a critical step in the development of vaccines and methods for monitoring of T cell responses. This review presents an overview of strategies that employ computer algorithms for the selection of candidate peptides from defined proteins and subsequent verification of their in vivo relevance by experimental approaches. Several computer algorithms are currently being used for epitope prediction of various major histocompatibility complex (MHC) class I and II molecules, based either on the analysis of natural MHC ligands or on the binding properties of synthetic peptides. Moreover, the analysis of proteasomal digests of peptides and whole proteins has led to the development of algorithms for the prediction of proteasomal cleavages. In order to verify the generation of the predicted peptides during antigen processing in vivo as well as their immunogenic potential, several experimental approaches have been pursued in the recent past. Mass spectrometry-based bioanalytical approaches have been used specifically to detect predicted peptides among isolated natural ligands. Other strategies employ various methods for the stimulation of primary T cell responses against the predicted peptides and subsequent testing of the recognition pattern towards target cells that express the antigen.

  11. FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues.

    Directory of Open Access Journals (Sweden)

    Yasser El-Manzalawy

    A wide range of biological processes, including regulation of gene expression, protein synthesis, and replication and assembly of many viruses, are mediated by RNA-protein interactions. However, experimental determination of the structures of protein-RNA complexes is expensive and technically challenging. Hence, a number of computational tools have been developed for predicting protein-RNA interfaces. Some of the state-of-the-art protein-RNA interface predictors rely on position-specific scoring matrix (PSSM)-based encoding of the protein sequences. The computational effort needed for generating PSSMs severely limits the practical utility of protein-RNA interface prediction servers. In this work, we experiment with two approaches, random sampling and sequence similarity reduction, for extracting a representative reference database of protein sequences from more than 50 million protein sequences in UniRef100. Our results suggest that randomly sampled databases produce better PSSM profiles (in terms of the number of hits used to generate the profile and the distance of the generated profile to the corresponding profile generated using the entire UniRef100 data as well as the accuracy of the machine learning classifier trained using these profiles). Based on our results, we developed FastRNABindR, an improved version of RNABindR for predicting protein-RNA interface residues using PSSM profiles generated using 1% of the UniRef100 sequences sampled uniformly at random. To the best of our knowledge, FastRNABindR is the only protein-RNA interface residue prediction online server that requires generation of PSSM profiles for query sequences and accepts hundreds of protein sequences per submission. 
Our approach for determining the optimal BLAST database for a protein-RNA interface residue classification task has the potential of substantially speeding up, and hence increasing the practical utility of, other amino acid sequence based predictors of protein-protein
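
    The random-sampling idea in this abstract (a reference database built from a uniform random subset of a much larger sequence collection) can be sketched as below. This is an illustrative assumption, not the authors' pipeline: the FASTA reader and the reservoir sampler (Algorithm R) are generic, and the function names, the seed, and the toy input are hypothetical. Reservoir sampling is used because it draws a fixed-size uniform sample in one pass without holding the whole collection in memory.

```python
import random


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA-formatted lines."""
    header, chunks = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(chunks)
            header, chunks = line[1:], []
        else:
            chunks.append(line)
    if header is not None:
        yield header, "".join(chunks)


def reservoir_sample(records, k, seed=0):
    """Return k records chosen uniformly at random from a stream (Algorithm R)."""
    rng = random.Random(seed)
    reservoir = []
    for n, record in enumerate(records):
        if n < k:
            reservoir.append(record)   # fill the reservoir first
        else:
            j = rng.randint(0, n)      # inclusive bounds: uniform over 0..n
            if j < k:
                reservoir[j] = record  # replace with probability k / (n + 1)
    return reservoir


# Toy input: 200 hypothetical records; a 1% sample keeps 2 of them.
toy_lines = []
for idx in range(200):
    toy_lines.append(">seq%d" % idx)
    toy_lines.append("ACDEFGHIK")
subset = reservoir_sample(read_fasta(toy_lines), k=2, seed=1)
```

    With a real file, the lines would come from an open file handle and k would be set to roughly 1% of the known record count. The sampled records could then be written back out as FASTA and formatted as a search database.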

  12. Methods and Approaches to Mass Spectroscopy Based Protein Identification

    Science.gov (United States)

    This book chapter is a review of current mass spectrometers and the role in the field of proteomics. Various instruments are discussed and their strengths and weaknesses are highlighted. In addition, the methods of protein identification using a mass spectrometer are explained as well as data vali...

  13. Identification & Characterization of Fungal Ice Nucleation Proteins

    Science.gov (United States)

    Scheel, Jan Frederik; Kunert, Anna Theresa; Kampf, Christopher Johannes; Mauri, Sergio; Weidner, Tobias; Pöschl, Ulrich; Fröhlich-Nowoisky, Janine

    2016-04-01

    Freezing of water at relatively warm subfreezing temperatures is dependent on ice nucleation catalysis facilitated by ice nuclei (IN). These IN can be of various origins, and although extensive research has been done and progress has been achieved, the nature and mechanisms leading to an effective IN are to date still poorly understood. Some of the most important processes of our geosphere, like the water cycle, are highly dependent on effective ice nucleation at temperatures between -2 °C and -8 °C, a temperature range which is almost exclusively covered by biological IN (BioIN). BioIN are usually macromolecular structures of biological polymers. Sugars as well as proteins have been reported to serve as IN, and the best characterized BioIN are ice nucleation proteins (IN-P) from Gram-negative bacteria. Fungal strains from Fusarium spp. were described as effective IN at subfreezing temperatures up to -2 °C already 25 years ago, and more and more fungal species are described to serve as efficient IN. Fungal IN are also thought to be proteins or at least to contain a proteinaceous compound, but to date the primary structure of fungal IN-P and the coding genetic elements of all IN active fungi are unknown. The aim of this study is (a) to identify the proteins and their coding genetic elements from IN active fungi (F. acuminatum, F. avenaceum, M. alpina) and (b) to characterize the mechanisms by which fungal IN serve as effective IN. We designed an interdisciplinary approach using biological, analytical and physical methods to identify fungal IN-P and describe their biological, chemical, and physical properties.

  14. Accurate prediction of interfacial residues in two-domain proteins using evolutionary information: implications for three-dimensional modeling.

    Science.gov (United States)

    Bhaskara, Ramachandra M; Padhi, Amrita; Srinivasan, Narayanaswamy

    2014-07-01

    With the preponderance of multidomain proteins in eukaryotic genomes, it is essential to recognize the constituent domains and their functions. Often function involves communications across the domain interfaces, and the knowledge of the interacting sites is essential to our understanding of the structure-function relationship. Using evolutionary information extracted from homologous domains in at least two diverse domain architectures (single and multidomain), we predict the interface residues corresponding to domains from the two-domain proteins. We also use information from the three-dimensional structures of individual domains of two-domain proteins to train a naïve Bayes classifier model to predict the interfacial residues. Our predictions are highly accurate (∼85%) and specific (∼95%) to the domain-domain interfaces. This method is specific to multidomain proteins which contain domains in more than one protein architectural context. Using predicted residues to constrain domain-domain interaction, rigid-body docking was able to provide us with accurate full-length protein structures with correct orientation of domains. We believe that these results can be of considerable interest toward rational protein and interaction design, apart from providing us with valuable information on the nature of interactions.

  15. Toward more accurate ancestral protein genotype-phenotype reconstructions with the use of species tree-aware gene trees.

    Science.gov (United States)

    Groussin, Mathieu; Hobbs, Joanne K; Szöllősi, Gergely J; Gribaldo, Simonetta; Arcus, Vickery L; Gouy, Manolo

    2015-01-01

    The resurrection of ancestral proteins provides direct insight into how natural selection has shaped proteins found in nature. By tracing substitutions along a gene phylogeny, ancestral proteins can be reconstructed in silico and subsequently synthesized in vitro. This elegant strategy reveals the complex mechanisms responsible for the evolution of protein functions and structures. However, to date, all protein resurrection studies have used simplistic approaches for ancestral sequence reconstruction (ASR), including the assumption that a single sequence alignment alone is sufficient to accurately reconstruct the history of the gene family. The impact of such shortcuts on conclusions about ancestral functions has not been investigated. Here, we show with simulations that utilizing information on species history using a model that accounts for the duplication, horizontal transfer, and loss (DTL) of genes statistically increases ASR accuracy. This underscores the importance of the tree topology in the inference of putative ancestors. We validate our in silico predictions using in vitro resurrection of the LeuB enzyme for the ancestor of the Firmicutes, a major and ancient bacterial phylum. With this particular protein, our experimental results demonstrate that information on the species phylogeny results in a biochemically more realistic and kinetically more stable ancestral protein. Additional resurrection experiments with different proteins are necessary to statistically quantify the impact of using species tree-aware gene trees on ancestral protein phenotypes. Nonetheless, our results suggest the need for incorporating both sequence and DTL information in future studies of protein resurrections to accurately define the genotype-phenotype space in which proteins diversify.

  16. Accurate identification of the six human Plasmodium spp. causing imported malaria, including Plasmodium ovale wallikeri and Plasmodium knowlesi.

    Science.gov (United States)

    Calderaro, Adriana; Piccolo, Giovanna; Gorrini, Chiara; Rossi, Sabina; Montecchini, Sara; Dell'Anna, Maria Loretana; De Conto, Flora; Medici, Maria Cristina; Chezzi, Carlo; Arcangeletti, Maria Cristina

    2013-09-13

    Accurate identification of Plasmodium infections in non-endemic countries is of critical importance with regard to the administration of a targeted therapy having a positive impact on patient health and management and allowing the prevention of the risk of re-introduction of endemic malaria in such countries. Malaria is no longer endemic in Italy where it is the most commonly imported disease, with one of the highest rates of imported malaria among European non-endemic countries including France, the UK and Germany, and with a prevalence of 24.3% at the University Hospital of Parma. Molecular methods showed high sensitivity and specificity and changed the epidemiology of imported malaria in several non-endemic countries, highlighted a higher prevalence of Plasmodium ovale, Plasmodium vivax and Plasmodium malariae underestimated by microscopy and, not least, brought to light both the existence of two species of P. ovale (Plasmodium ovale curtisi and Plasmodium ovale wallikeri) and the infection in humans by Plasmodium knowlesi, otherwise not detectable by microscopy. In this retrospective study an evaluation of two real-time PCR assays able to identify P. ovale wallikeri, distinguishing it from P. ovale curtisi, and to detect P. knowlesi, respectively, was performed applying them on a subset of 398 blood samples belonging to patients with the clinical suspicion of malaria. These assays revealed an excellent analytical sensitivity and no cross-reactivity versus other Plasmodium spp. infecting humans, suggesting their usefulness for an accurate and complete diagnosis of imported malaria. Among the 128 patients with malaria, eight P. ovale curtisi and four P. ovale wallikeri infections were detected, while no cases of P. knowlesi infection were observed. Real-time PCR assays specific for P. ovale wallikeri and P. knowlesi were included in the panel currently used in the University Hospital of Parma for the diagnosis of imported malaria, accomplishing the goal of

  17. Accurate design of co-assembling multi-component protein nanomaterials.

    Science.gov (United States)

    King, Neil P; Bale, Jacob B; Sheffler, William; McNamara, Dan E; Gonen, Shane; Gonen, Tamir; Yeates, Todd O; Baker, David

    2014-06-05

    The self-assembly of proteins into highly ordered nanoscale architectures is a hallmark of biological systems. The sophisticated functions of these molecular machines have inspired the development of methods to engineer self-assembling protein nanostructures; however, the design of multi-component protein nanomaterials with high accuracy remains an outstanding challenge. Here we report a computational method for designing protein nanomaterials in which multiple copies of two distinct subunits co-assemble into a specific architecture. We use the method to design five 24-subunit cage-like protein nanomaterials in two distinct symmetric architectures and experimentally demonstrate that their structures are in close agreement with the computational design models. The accuracy of the method and the number and variety of two-component materials that it makes accessible suggest a route to the construction of functional protein nanomaterials tailored to specific applications.

  18. Identification of chikungunya virus interacting proteins in mammalian cells

    Indian Academy of Sciences (India)

    Mandar S Paingankar; Vidya A Arankalle

    2014-06-01

    Identification and characterization of virus-host interactions is an essential step for the development of novel antiviral strategies. Very few studies have been targeted towards identification of chikungunya virus (CHIKV) interacting host proteins. In the current study, virus overlay protein binding assay (VOPBA) and matrix-assisted laser desorption/ionization time of flight analysis (MALDI TOF/TOF) were employed for the identification of CHIKV binding proteins in mammalian cells. HSP70 and actin were identified as virus binding proteins in HEK-293T and Vero-E6 cells, whereas STAT-2 was identified as an additional protein in Vero-E6 cells. Pre-incubation with anti-HSP70 antibody and miRNA silencing of HSP70 significantly reduced CHIKV production in HEK-293T and Vero-E6 cells at early time points. These results suggest that CHIKV exploits housekeeping molecules such as actin, HSP70 and STAT-2 to establish infection in mammalian cells.

  19. Protein identification using nano liquid chromatography-tandem mass spectrometry.

    Science.gov (United States)

    Negroni, Luc

    2007-01-01

    Tandem mass spectrometry is an efficient technique for the identification of peptides on the basis of their fragmentation pattern (MS/MS scan). It can generate individual spectra for each peptide, thereby creating a powerful tool for protein identification on the basis of peptide characterization. This important advance in automatic data acquisition has allowed an efficient association between liquid chromatography and tandem mass spectrometry, and the use of nanocolumns and nanoelectrospray ionization has dramatically increased the efficiency of this method. Now large sets of peptides can be identified at a femtomole level. At the end of the process, batch processing of the MS/MS spectra produces peptide lists that identify purified proteins or protein mixtures with high confidence.

  20. Bioinformatics pipeline for functional identification and characterization of proteins

    Science.gov (United States)

    Skarzyńska, Agnieszka; Pawełkowicz, Magdalena; Krzywkowski, Tomasz; Świerkula, Katarzyna; Pląder, Wojciech; Przybecki, Zbigniew

    2015-09-01

    The new sequencing methods, called Next Generation Sequencing, make it possible to obtain a vast amount of data in a short time. These data require structural and functional annotation. Functional identification and characterization of predicted proteins can be done by in silico approaches, thanks to the numerous computational tools available nowadays. However, the results of protein function prediction need to be confirmed, either by running different programs and comparing their results or by experiment. Here we present a bioinformatics pipeline for structural and functional annotation of proteins.

  1. Protein corona composition does not accurately predict hematocompatibility of colloidal gold nanoparticles.

    Science.gov (United States)

    Dobrovolskaia, Marina A; Neun, Barry W; Man, Sonny; Ye, Xiaoying; Hansen, Matthew; Patri, Anil K; Crist, Rachael M; McNeil, Scott E

    2014-10-01

    Proteins bound to nanoparticle surfaces are known to affect particle clearance by influencing immune cell uptake and distribution to the organs of the mononuclear phagocytic system. The composition of the protein corona has been described for several types of nanomaterials, but the role of the corona in nanoparticle biocompatibility is not well established. In this study we investigate the role of nanoparticle surface properties (PEGylation) and incubation times on the protein coronas of colloidal gold nanoparticles. While neither incubation time nor PEG molecular weight affected the specific proteins in the protein corona, the total amount of protein binding was governed by the molecular weight of PEG coating. Furthermore, the composition of the protein corona did not correlate with nanoparticle hematocompatibility. Specialized hematological tests should be used to deduce nanoparticle hematotoxicity. From the clinical editor: It is overall unclear how the protein corona associated with colloidal gold nanoparticles may influence hematotoxicity. This study warns that PEGylation itself may be insufficient, because composition of the protein corona does not directly correlate with nanoparticle hematocompatibility. The authors suggest that specialized hematological tests must be used to deduce nanoparticle hematotoxicity.

  2. CYP450 phenotyping and accurate mass identification of metabolites of the 8-aminoquinoline, anti-malarial drug primaquine

    Directory of Open Access Journals (Sweden)

    Pybus Brandon S

    2012-08-01

    Background: The 8-aminoquinoline (8AQ) drug primaquine (PQ) is currently the only approved drug effective against the persistent liver stage of the hypnozoite-forming strains Plasmodium vivax and Plasmodium ovale as well as Stage V gametocytes of Plasmodium falciparum. To date, several groups have investigated the toxicity observed in the 8AQ class; however, the exact mechanisms and/or metabolic species responsible for PQ's haemotoxic and anti-malarial properties are not fully understood. Methods: In the present study, the metabolism of PQ was evaluated using in vitro recombinant metabolic enzymes from the cytochrome P450 (CYP) and mono-amine oxidase (MAO) families. Based on this information, metabolite identification experiments were performed using nominal and accurate mass measurements. Results: Relative activity factor (RAF)-weighted intrinsic clearance values show the relative role of each enzyme to be MAO-A, 2C19, 3A4, and 2D6, with 76.1, 17.0, 5.2, and 1.7% contributions to PQ metabolism, respectively. CYP 2D6 was shown to produce at least six different oxidative metabolites along with demethylations, while MAO-A products derived from the PQ aldehyde, a precursor to carboxy PQ. CYPs 2C19 and 3A4 produced only trace levels of hydroxylated species. Conclusions: As a result of this work, CYP 2D6 and MAO-A have been implicated as the key enzymes associated with PQ metabolism, and metabolites previously identified as potentially playing a role in efficacy and haemolytic toxicity have been attributed to production via CYP 2D6 mediated pathways.

  3. Identification and analysis of multi-protein complexes in placenta.

    Directory of Open Access Journals (Sweden)

    Fuqiang Wang

    Placental malfunction induces pregnancy disorders which contribute to life-threatening complications for both the mother and the fetus. Identification and characterization of placental multi-protein complexes is an important step toward an integrated understanding of the protein-protein interaction networks in placenta which determine placental function. In this study, blue native/sodium dodecyl sulfate polyacrylamide gel electrophoresis (BN/SDS-PAGE) and liquid chromatography-tandem mass spectrometry (LC-MS/MS) were used to screen the multi-protein complexes in placenta. 733 unique proteins and 34 known and novel heterooligomeric multi-protein complexes including mitochondrial respiratory chain complexes, integrin complexes, proteasome complexes, histone complex, and heat shock protein complexes were identified. A novel protein complex, which involves clathrin and small conductance calcium-activated potassium (SK) channel protein 2, was identified and validated by antibody-based gel shift assay, co-immunoprecipitation and immunofluorescence staining. These results suggest that BN/SDS-PAGE, when integrated with LC-MS/MS, is a very powerful and versatile tool for the investigation of placental protein complexes. This work paves the way for deeper functional characterization of the placental protein complexes associated with pregnancy disorders.

  4. VORFFIP-Driven Dock: V-D2OCK, a Fast and Accurate Protein Docking Strategy

    Science.gov (United States)

    Segura, Joan; Marín-López, Manuel Alejandro; Jones, Pamela F.; Oliva, Baldo; Fernandez-Fuentes, Narcis

    2015-01-01

    The experimental determination of the structure of protein complexes cannot keep pace with the generation of interactomic data, hence resulting in an ever-expanding gap. As the structural details of protein complexes are central to a full understanding of the function and dynamics of the cell machinery, alternative strategies are needed to circumvent the bottleneck in structure determination. Computational protein docking is a valid and valuable approach to model the structure of protein complexes. In this work, we describe a novel computational strategy to predict the structure of protein complexes based on data-driven docking: VORFFIP-driven dock (V-D2OCK). This new approach makes use of our newly described method to predict functional sites in protein structures, VORFFIP, to define the region to be sampled during docking, and of structural clustering to reduce the number of models to be examined by users. V-D2OCK has been benchmarked using a validated and diverse set of protein complexes and compared to a state-of-the-art docking method. The speed and accuracy compared to contemporary tools justify the potential use of V-D2OCK for high-throughput, genome-wide protein docking. Finally, we have developed a web interface that allows users to browse and visualize V-D2OCK predictions from the convenience of their web browsers. PMID:25763838

  5. VORFFIP-driven dock: V-D2OCK, a fast and accurate protein docking strategy.

    Directory of Open Access Journals (Sweden)

    Joan Segura

    The experimental determination of the structure of protein complexes cannot keep pace with the generation of interactomic data, hence resulting in an ever-expanding gap. As the structural details of protein complexes are central to a full understanding of the function and dynamics of the cell machinery, alternative strategies are needed to circumvent the bottleneck in structure determination. Computational protein docking is a valid and valuable approach to model the structure of protein complexes. In this work, we describe a novel computational strategy to predict the structure of protein complexes based on data-driven docking: VORFFIP-driven dock (V-D2OCK). This new approach makes use of our newly described method to predict functional sites in protein structures, VORFFIP, to define the region to be sampled during docking, and of structural clustering to reduce the number of models to be examined by users. V-D2OCK has been benchmarked using a validated and diverse set of protein complexes and compared to a state-of-the-art docking method. The speed and accuracy compared to contemporary tools justify the potential use of V-D2OCK for high-throughput, genome-wide protein docking. Finally, we have developed a web interface that allows users to browse and visualize V-D2OCK predictions from the convenience of their web browsers.

  6. VORFFIP-driven dock: V-D2OCK, a fast and accurate protein docking strategy.

    Science.gov (United States)

    Segura, Joan; Marín-López, Manuel Alejandro; Jones, Pamela F; Oliva, Baldo; Fernandez-Fuentes, Narcis

    2015-01-01

    The experimental determination of the structure of protein complexes cannot keep pace with the generation of interactomic data, hence resulting in an ever-expanding gap. As the structural details of protein complexes are central to a full understanding of the function and dynamics of the cell machinery, alternative strategies are needed to circumvent the bottleneck in structure determination. Computational protein docking is a valid and valuable approach to model the structure of protein complexes. In this work, we describe a novel computational strategy to predict the structure of protein complexes based on data-driven docking: VORFFIP-driven dock (V-D2OCK). This new approach makes use of our newly described method to predict functional sites in protein structures, VORFFIP, to define the region to be sampled during docking, and of structural clustering to reduce the number of models to be examined by users. V-D2OCK has been benchmarked using a validated and diverse set of protein complexes and compared to a state-of-the-art docking method. The speed and accuracy compared to contemporary tools justify the potential use of V-D2OCK for high-throughput, genome-wide protein docking. Finally, we have developed a web interface that allows users to browse and visualize V-D2OCK predictions from the convenience of their web browsers.

  7. Identification of surface proteins in Enterococcus faecalis V583

    Directory of Open Access Journals (Sweden)

    Eijsink Vincent GH

    2011-03-01

    Background: Surface proteins are a key to a deeper understanding of the behaviour of Gram-positive bacteria interacting with the human gastro-intestinal tract. Such proteins contribute to cell wall synthesis and maintenance and are important for interactions between the bacterial cell and the human host. Since they are exposed and may play roles in pathogenicity, surface proteins are interesting targets for drug design. Results: Using methods based on proteolytic "shaving" of bacterial cells and subsequent mass spectrometry-based protein identification, we have identified surface-located proteins in Enterococcus faecalis V583. In total 69 unique proteins were identified, few of which have been identified and characterized previously. 33 of these proteins are predicted to be cytoplasmic, whereas the other 36 are predicted to have surface locations (31) or to be secreted (5). Lipid-anchored proteins were the most dominant among the identified surface proteins. The seemingly most abundant surface proteins included a membrane protein with a potentially shed extracellular sulfatase domain that could act on the sulfate groups in mucin and a lipid-anchored fumarate reductase that could contribute to generation of reactive oxygen species. Conclusions: The present proteome analysis gives an experimental impression of the protein landscape on the cell surface of the pathogenic bacterium E. faecalis. The 36 identified secreted (5) and surface (31) proteins included several proteins involved in cell wall synthesis, pheromone-regulated processes, and transport of solutes, as well as proteins with unknown function. These proteins stand out as interesting targets for further investigation of the interaction between E. faecalis and its environment.

  8. CMASA: an accurate algorithm for detecting local protein structural similarity and its application to enzyme catalytic site annotation

    Directory of Open Access Journals (Sweden)

    Li Gong-Hua

    2010-08-01

    Background: The rapid development of structural genomics has resulted in many "unknown function" proteins being deposited in the Protein Data Bank (PDB); thus, the functional prediction of these proteins has become a challenge for structural bioinformatics. Several sequence-based and structure-based methods have been developed to predict protein function, but these methods need to be improved further, for example in accuracy, sensitivity, and computational speed. Here, an accurate algorithm, CMASA (Contact MAtrix based local Structural Alignment algorithm), has been developed to predict unknown functions of proteins based on local protein structural similarity. This algorithm has been evaluated by building a test set including 164 enzyme families, and has also been compared to other methods. Results: The evaluation shows that CMASA is highly accurate (0.96), sensitive (0.86), and fast enough to be used in large-scale functional annotation. Compared to both sequence-based and global structure-based methods, CMASA can find not only remote homologous proteins but also active site convergence. Compared to other local structure comparison-based methods, CMASA performs better than both FFF (a method using geometry to predict protein function) and SPASM (a local structure alignment method); it is also more sensitive than PINTS and more accurate than JESS (both are local structure alignment methods). CMASA was applied to annotate the enzyme catalytic sites of the non-redundant PDB, and at least 166 putative catalytic sites have been suggested; these sites cannot be observed by the Catalytic Site Atlas (CSA). Conclusions: CMASA is an accurate algorithm for detecting local protein structural similarity, and it holds several advantages in predicting enzyme active sites. CMASA can be used in large-scale enzyme active site annotation.
The CMASA can be available by the

  9. Are current atomistic force fields accurate enough to study proteins in crowded environments?

    Directory of Open Access Journals (Sweden)

    Drazen Petrov

    2014-05-01

    The high concentration of macromolecules in the crowded cellular interior influences different thermodynamic and kinetic properties of proteins, including their structural stabilities, intermolecular binding affinities and enzymatic rates. Moreover, various structural biology methods, such as NMR or different spectroscopies, typically involve samples with relatively high protein concentration. Due to large sampling requirements, however, the accuracy of classical molecular dynamics (MD) simulations in capturing protein behavior at high concentration still remains largely untested. Here, we use explicit-solvent MD simulations and a total of 6.4 µs of simulated time to study wild-type (folded) and oxidatively damaged (unfolded) forms of villin headpiece at 6 mM and 9.2 mM protein concentration. We first perform an exhaustive set of simulations with multiple protein molecules in the simulation box using GROMOS 45a3 and 54a7 force fields together with different types of electrostatics treatment and solution ionic strengths. Surprisingly, the two villin headpiece variants exhibit similar aggregation behavior, despite the fact that their estimated aggregation propensities markedly differ. Importantly, regardless of the simulation protocol applied, wild-type villin headpiece consistently aggregates even under conditions at which it is experimentally known to be soluble. We demonstrate that aggregation is accompanied by a large decrease in the total potential energy, with not only hydrophobic, but also polar residues and backbone contributing substantially. The same effect is directly observed for two other major atomistic force fields (AMBER99SB-ILDN and CHARMM22-CMAP) as well as indirectly shown for two additional ones (AMBER94, OPLS-AAL), and is possibly due to a general overestimation of the potential energy of protein-protein interactions at the expense of water-water and water-protein interactions.
Overall, our results suggest that current MD force fields

  10. Calculation of accurate small angle X-ray scattering curves from coarse-grained protein models

    DEFF Research Database (Denmark)

    Stovgaard, Kasper; Andreetta, Christian; Ferkinghoff-Borg, Jesper

    2010-01-01

    the computationally costly iteration over all atoms. We estimated the form factors using generated data from a set of high quality protein structures. No ad hoc scaling or correction factors are applied in the calculation of the curves. Two coarse-grained representations of protein structure were investigated; two...... CRYSOL, which requires full atomic detail. Our method was also comparable to CRYSOL in recognizing native structures among native-like decoys. As a proof-of-concept, we combined the coarse-grained Debye calculation with a previously described probabilistic model of protein structure, Torus...
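    The coarse-grained Debye calculation mentioned in this record can be illustrated with a minimal sketch. It assumes each coarse-grained bead is a point scatterer with a known form factor at the chosen momentum transfer q; the function name, the bead representation, and the two-bead example are illustrative assumptions and are not taken from the authors' software.

```python
import math

def debye_intensity(coords, form_factors, q):
    """Debye formula: I(q) = sum_i sum_j F_i F_j sin(q r_ij) / (q r_ij).

    coords       -- list of (x, y, z) bead positions (hypothetical input format)
    form_factors -- list of bead form factors already evaluated at this q
    q            -- momentum transfer, in inverse units of the coordinates
    """
    n = len(coords)
    total = 0.0
    for i in range(n):
        # Self term: sin(x)/x tends to 1 as the distance goes to 0.
        total += form_factors[i] ** 2
        for j in range(i + 1, n):
            x = q * math.dist(coords[i], coords[j])
            sinc = 1.0 if x == 0.0 else math.sin(x) / x
            # Each unordered pair (i, j) appears twice in the double sum.
            total += 2.0 * form_factors[i] * form_factors[j] * sinc
    return total

# Two unit-strength beads separated by 2 length units.
beads = [(0.0, 0.0, 0.0), (0.0, 0.0, 2.0)]
curve = [debye_intensity(beads, [1.0, 1.0], q) for q in (0.0, 0.5, 1.0)]
```

    The double loop is quadratic in the number of scatterers, which is why replacing atoms with a smaller number of beads reduces the cost of the calculation.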

  11. Identification of local variations within secondary structures of proteins.

    Science.gov (United States)

    Kumar, Prasun; Bansal, Manju

    2015-05-01

    Secondary-structure elements (SSEs) play an important role in the folding of proteins. Identification of SSEs in proteins is a common problem in structural biology. A new method, ASSP (Assignment of Secondary Structure in Proteins), using only the path traversed by the Cα atoms has been developed. The algorithm is based on the premise that the protein structure can be divided into continuous or uniform stretches, which can be defined in terms of helical parameters, and depending on their values the stretches can be classified into different SSEs, namely α-helices, 3₁₀-helices, π-helices, extended β-strands and polyproline II (PPII) and other left-handed helices. The methodology was validated using an unbiased clustering of these parameters for a protein data set consisting of 1008 protein chains, which suggested that there are seven well defined clusters associated with different SSEs. Apart from α-helices and extended β-strands, 3₁₀-helices and π-helices were also found to occur in substantial numbers. ASSP was able to discriminate non-α-helical segments from flanking α-helices, which were often identified as part of α-helices by other algorithms. ASSP can also lead to the identification of novel SSEs. It is believed that ASSP could provide a better understanding of the finer nuances of protein secondary structure and could make an important contribution to the better understanding of comparatively less frequently occurring structural motifs. At the same time, it can contribute to the identification of novel SSEs. A standalone version of the program for the Linux as well as the Windows operating systems is freely downloadable and a web-server version is also available at http://nucleix.mbu.iisc.ernet.in/assp/index.php.

  12. GalaxyDock BP2 score: a hybrid scoring function for accurate protein-ligand docking

    Science.gov (United States)

    Baek, Minkyung; Shin, Woong-Hee; Chung, Hwan Won; Seok, Chaok

    2017-07-01

    Protein-ligand docking is a useful tool for providing atomic-level understanding of protein functions in nature and design principles for artificial ligands or proteins with desired properties. The ability to identify the true binding pose of a ligand to a target protein among numerous possible candidate poses is an essential requirement for successful protein-ligand docking. Many previously developed docking scoring functions were trained to reproduce experimental binding affinities and were also used for scoring binding poses. However, in this study, we developed a new docking scoring function, called GalaxyDock BP2 Score, by directly training the scoring power of binding poses. This function is a hybrid of physics-based, empirical, and knowledge-based score terms that are balanced to strengthen the advantages of each component. The performance of the new scoring function exhibits significant improvement over existing scoring functions in decoy pose discrimination tests. In addition, when the score is used with the GalaxyDock2 protein-ligand docking program, it outperformed other state-of-the-art docking programs in docking tests on the Astex diverse set, the Cross2009 benchmark set, and the Astex non-native set. GalaxyDock BP2 Score and GalaxyDock2 with this score are freely available at http://galaxy.seoklab.org/softwares/galaxydock.html.

  13. Accurate proteome-wide protein quantification from high-resolution 15N mass spectra.

    Science.gov (United States)

    Khan, Zia; Amini, Sasan; Bloom, Joshua S; Ruse, Cristian; Caudy, Amy A; Kruglyak, Leonid; Singh, Mona; Perlman, David H; Tavazoie, Saeed

    2011-12-19

    In quantitative mass spectrometry-based proteomics, the metabolic incorporation of a single source of 15N-labeled nitrogen has many advantages over using stable isotope-labeled amino acids. However, the lack of a robust computational framework for analyzing the resulting spectra has impeded wide use of this approach. We have addressed this challenge by introducing a new computational methodology for analyzing 15N spectra in which quantification is integrated with identification. Application of this method to an Escherichia coli growth transition reveals significant improvement in quantification accuracy over previous methods.
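    One reason 15N spectra need dedicated analysis is that the heavy/light mass difference depends on how many nitrogen atoms each peptide contains, rather than being a fixed offset. The sketch below shows this for unmodified peptides under the assumption of complete labeling; the function names are illustrative and are not part of the method described in this record.

```python
# Nitrogen atoms per amino acid residue (one backbone N plus side-chain N).
N_PER_RESIDUE = {
    "A": 1, "R": 4, "N": 2, "D": 1, "C": 1, "E": 1, "Q": 2, "G": 1,
    "H": 3, "I": 1, "L": 1, "K": 2, "M": 1, "F": 1, "P": 1, "S": 1,
    "T": 1, "W": 2, "Y": 1, "V": 1,
}

# Monoisotopic mass difference between 15N and 14N, in daltons.
DELTA_15N = 15.0001088982 - 14.0030740048

def nitrogen_count(peptide):
    """Number of nitrogen atoms in an unmodified peptide sequence."""
    return sum(N_PER_RESIDUE[aa] for aa in peptide)

def n15_mass_shift(peptide):
    """Mass shift (Da) of a fully 15N-labeled peptide relative to its light form."""
    return nitrogen_count(peptide) * DELTA_15N

for seq in ("GR", "PEPTIDEK"):
    print(seq, nitrogen_count(seq), round(n15_mass_shift(seq), 4))
```

    Because the shift varies with sequence, the heavy partner of a peak can only be located once the peptide's composition is known or hypothesized, which is consistent with the abstract's point about integrating quantification with identification.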

  14. Computational identification of strain-, species- and genus-specific proteins

    Directory of Open Access Journals (Sweden)

    Thiagarajan Rathi

    2005-11-01

    Background: The identification of unique proteins at different taxonomic levels has both scientific and practical value. Strain-, species- and genus-specific proteins can provide insight into the criteria that define an organism and its relationship with close relatives. Such proteins can also serve as taxon-specific diagnostic targets. Description: A pipeline using a combination of computational and manual analyses of BLAST results was developed to identify strain-, species-, and genus-specific proteins and to catalog the closest sequenced relative for each protein in a proteome. Proteins encoded by a given strain are preliminarily considered to be unique if BLAST, using a comprehensive protein database, fails to retrieve (with an e-value better than 0.001) any protein not encoded by the query strain, species or genus (for strain-, species- and genus-specific proteins respectively), or if BLAST, using the best hit as the query (reverse BLAST), does not retrieve the initial query protein. Results are manually inspected for homology if the initial query is retrieved in the reverse BLAST but is not the best hit. Sequences unlikely to retrieve homologs using the default BLOSUM62 matrix (usually short sequences) are re-tested using the PAM30 matrix, thereby increasing the number of retrieved homologs and increasing the stringency of the search for unique proteins. The above protocol was used to examine several food- and water-borne pathogens. We find that the reverse BLAST step filters out about 22% of proteins with homologs that would otherwise be considered unique at the genus and species levels. Analysis of the annotations of unique proteins reveals that many are remnants of prophage proteins, or may be involved in virulence. The data generated from this study can be accessed and further evaluated from the CUPID (Core and Unique Protein Identification) system web site (updated semi-annually) at http://pir.georgetown.edu/cupid.
Conclusion CUPID
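
    The forward/reverse BLAST rule described in this record can be sketched in a few lines of Python. This is a minimal illustration and not the CUPID code: the `Hit` record, the `is_preliminarily_unique` function and the in-memory hit lists are hypothetical stand-ins for parsed BLAST output, and taking the "best hit" to mean the best significant hit outside the query's own taxon is an assumption. Only the 0.001 e-value cutoff and the two-step logic come from the abstract. The manual inspection step and the PAM30 re-test are not modeled.

```python
from dataclasses import dataclass
from typing import Dict, List

E_VALUE_CUTOFF = 0.001  # threshold quoted in the abstract


@dataclass(frozen=True)
class Hit:
    """One BLAST hit (hypothetical container for parsed BLAST output)."""
    subject_id: str
    taxon: str      # strain, species or genus label of the subject protein
    e_value: float


def is_preliminarily_unique(
    query_id: str,
    query_taxon: str,
    forward_hits: List[Hit],
    reverse_hits: Dict[str, List[Hit]],
) -> bool:
    """Apply the forward/reverse BLAST rule to one query protein.

    forward_hits: hits for the query against the full database, best first.
    reverse_hits: maps a subject id to the hits obtained when that subject
                  is itself used as the query (the reverse BLAST).
    """
    # Keep only significant hits that fall outside the query's own taxon.
    foreign = [
        h for h in forward_hits
        if h.taxon != query_taxon and h.e_value < E_VALUE_CUTOFF
    ]
    if not foreign:
        return True  # nothing outside the taxon was retrieved

    # Assumption: the reverse BLAST starts from the best foreign hit.
    best = foreign[0]
    retrieved = {
        h.subject_id
        for h in reverse_hits.get(best.subject_id, [])
        if h.e_value < E_VALUE_CUTOFF
    }
    # Unique if the reverse search does not find the original query again.
    return query_id not in retrieved
```

    Running the same function with the taxon label set to a strain, a species or a genus name would give the three levels of specificity the record mentions.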

  15. Identification of Essential Proteins Based on a New Combination of Local Interaction Density and Protein Complexes.

    Directory of Open Access Journals (Sweden)

    Jiawei Luo

    Full Text Available Computational approaches aided by computer science have been used to predict essential proteins and are faster than expensive, time-consuming, laborious experimental approaches. However, the performance of such approaches is still poor, making practical applications of computational approaches difficult in some fields. Hence, the development of more suitable and efficient computing methods is necessary for identification of essential proteins. In this paper, we propose a new method for predicting essential proteins in a protein interaction network, local interaction density combined with protein complexes (LIDC), based on statistical analyses of essential proteins and protein complexes. First, we introduce a new local topological centrality, local interaction density (LID), of the yeast PPI network; second, we discuss a new integration strategy for multiple bioinformatics. The LIDC method was then developed through a combination of LID and protein complex information based on our new integration strategy. The purpose of LIDC is discovery of important features of essential proteins with their neighbors in real protein complexes, thereby improving the efficiency of identification. Experimental results based on three different PPI (protein-protein interaction) networks of Saccharomyces cerevisiae and Escherichia coli showed that LIDC outperformed classical topological centrality measures and some recent combinational methods. Moreover, when predicting MIPS datasets, LIDC improved performance over all nine reference methods (i.e., DC, BC, NC, LID, PeC, CoEWC, WDC, ION, and UC). LIDC is more effective for the prediction of essential proteins than other recently developed methods.

  16. Identification of Essential Proteins Based on a New Combination of Local Interaction Density and Protein Complexes

    Science.gov (United States)

    Luo, Jiawei; Qi, Yi

    2015-01-01

    Background Computational approaches aided by computer science have been used to predict essential proteins and are faster than expensive, time-consuming, laborious experimental approaches. However, the performance of such approaches is still poor, making practical applications of computational approaches difficult in some fields. Hence, the development of more suitable and efficient computing methods is necessary for identification of essential proteins. Method In this paper, we propose a new method for predicting essential proteins in a protein interaction network, local interaction density combined with protein complexes (LIDC), based on statistical analyses of essential proteins and protein complexes. First, we introduce a new local topological centrality, local interaction density (LID), of the yeast PPI network; second, we discuss a new integration strategy for multiple bioinformatics. The LIDC method was then developed through a combination of LID and protein complex information based on our new integration strategy. The purpose of LIDC is discovery of important features of essential proteins with their neighbors in real protein complexes, thereby improving the efficiency of identification. Results Experimental results based on three different PPI (protein-protein interaction) networks of Saccharomyces cerevisiae and Escherichia coli showed that LIDC outperformed classical topological centrality measures and some recent combinational methods. Moreover, when predicting MIPS datasets, LIDC improved performance over all nine reference methods (i.e., DC, BC, NC, LID, PeC, CoEWC, WDC, ION, and UC). Conclusions LIDC is more effective for the prediction of essential proteins than other recently developed methods. PMID:26125187

  17. Identification of bacteriophage virion proteins by the ANOVA feature selection and analysis.

    Science.gov (United States)

    Ding, Hui; Feng, Peng-Mian; Chen, Wei; Lin, Hao

    2014-08-01

    The bacteriophage virion proteins play extremely important roles in the fate of host bacterial cells. Accurate identification of bacteriophage virion proteins is very important for understanding their functions and clarifying the lysis mechanism of bacterial cells. In this study, a new sequence-based method was developed to identify phage virion proteins. In the new method, the protein sequences were initially formulated by the g-gap dipeptide compositions. Subsequently, the analysis of variance (ANOVA) with incremental feature selection (IFS) was used to search for the optimal feature set. It was observed that, in jackknife cross-validation, the optimal feature set including 160 optimized features can produce the maximum accuracy of 85.02%. By performing feature analysis, we found that the correlation between two amino acids with one gap was more important than other correlations for phage virion protein prediction and that some of the 1-gap dipeptides were important and mainly contributed to the virion protein prediction. This analysis will provide novel insights into the function of phage virion proteins. On the basis of the proposed method, an online web-server, PVPred, was established and can be freely accessed from the website (http://lin.uestc.edu.cn/server/PVPred). We believe that the PVPred will become a powerful tool to study phage virion proteins and to guide the related experimental validations.
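
    The g-gap dipeptide composition used as the feature encoding in this record can be illustrated with a short sketch. The abstract does not give the formula, so the code assumes the commonly used definition: count every pair of residues separated by g intervening positions and divide by the number of such pairs. The function name and the handling of non-standard residues are choices made for this example. The ANOVA ranking and incremental feature selection steps are not shown.

```python
from itertools import product
from typing import Dict

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def g_gap_dipeptide_composition(sequence: str, g: int) -> Dict[str, float]:
    """Return 400 normalized frequencies of residue pairs separated by g positions.

    g = 0 gives the ordinary (adjacent) dipeptide composition;
    g = 1 pairs residue i with residue i + 2, and so on.
    """
    counts = {a + b: 0 for a, b in product(AMINO_ACIDS, repeat=2)}
    total = 0
    for i in range(len(sequence) - g - 1):
        pair = sequence[i] + sequence[i + g + 1]
        if pair in counts:  # skip pairs containing non-standard residues
            counts[pair] += 1
            total += 1
    if total == 0:
        # Sequence too short for this gap: return an all-zero vector.
        return {pair: 0.0 for pair in counts}
    return {pair: n / total for pair, n in counts.items()}
```

    For the toy sequence "ACAC" with g = 1, the only pairs are A..A and C..C, so the entries "AA" and "CC" are each 0.5 and the other 398 features are zero.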

  18. HMMCAS: a web tool for the identification and domain annotations of Cas proteins.

    Science.gov (United States)

    Chai, Guoshi; Yu, Min; Jiang, Lixu; Duan, Yaocong; Huang, Jian

    2017-02-07

    The CRISPR-Cas (clustered regularly interspaced short palindromic repeats-CRISPR-associated proteins) adaptive immune systems are found in many bacteria and most archaea. These systems are encoded by cas (CRISPR-associated) operons that have an extremely diverse architecture. The most crucial step in describing the composition of cas operons is the identification of cas genes or Cas proteins. With the continuous increase in newly sequenced archaeal and bacterial genomes, the recognition of new Cas proteins is becoming possible, which not only provides candidates for novel genome editing tools but also helps to understand the prokaryotic immune system better. Here we describe HMMCAS, a web service for the detection of CRISPR-associated structural and functional domains in protein sequences. HMMCAS uses the hmmscan similarity search algorithm in HMMER3.1 to provide a fast, interactive service based on a comprehensive collection of hidden Markov models of Cas protein families. It can accurately identify Cas proteins, including fusion proteins such as the Cas1-Cas4 fusion protein in Candidatus Chloracidobacterium thermophilum B (Cab. thermophilum B). HMMCAS can also find putative cas operons and determine which type they belong to. HMMCAS is freely available at http://i.uestc.edu.cn/hmmcas.

  19. Protein Identification Pipeline for the Homology Driven Proteomics

    Science.gov (United States)

    Junqueira, Magno; Spirin, Victor; Balbuena, Tiago Santana; Thomas, Henrik; Adzhubei, Ivan; Sunyaev, Shamil; Shevchenko, Andrej

    2008-01-01

    Homology-driven proteomics is a major tool to characterize proteomes of organisms with unsequenced genomes. This paper addresses practical aspects of automated homology-driven protein identifications by LC-MS/MS on a hybrid LTQ Orbitrap mass spectrometer. All essential software elements supporting the presented pipeline are either hosted on a publicly accessible web server or available for free download. PMID:18639657

  20. Using protein markers of embryo and seed storage proteins in identification of four pistachio cultivars

    Directory of Open Access Journals (Sweden)

    Ali Akbar Ehsanpour

    2010-12-01

    Full Text Available Identifying protein markers for pistachio cultivars is important because pistachio is a valuable food source. In this study, the protein patterns of embryos from four pistachio cultivars (Akbari, Ahmad Aghaei, Fandoghi and Kaleghouchi) were analyzed using SDS-PAGE. The presence of protein bands of about 90 and 45 kilodaltons (kDa) in the protein pattern of embryonic axes in cultivars Kaleghouchi and Akbari, respectively, and the absence of protein bands with approximate molecular weights of 30 and 20 kDa in the protein pattern of cotyledons in cultivars Kaleghouchi and Akbari, respectively, can be used as protein markers for these pistachio cultivars. In addition, the maximum expression level of the 45 kDa band in the protein pattern of cotyledons could indicate a protein marker for cultivar Ahmad Aghaei.

  1. Accurate retention time determination of co-eluting proteins in analytical chromatography by means of spectral data.

    Science.gov (United States)

    Dismer, Florian; Hansen, Sigrid; Oelmeier, Stefan Alexander; Hubbuch, Jürgen

    2013-03-01

    Chromatography is the method of choice for the separation of proteins, at both analytical and preparative scale. Orthogonal purification strategies for industrial use can easily be implemented by combining different modes of adsorption. Nevertheless, with flexibility comes the freedom of choice, and optimal conditions for consecutive steps need to be identified in a robust and reproducible fashion. One way to address this issue is the use of mathematical models that allow for an in silico process optimization. Although this has been shown to work, model parameter estimation for complex feedstocks becomes the bottleneck in process development. An integral part of parameter assessment is the accurate measurement of retention times in a series of isocratic or gradient elution experiments. As high-resolution analytics that can differentiate between proteins are often not readily available, pure protein is mandatory for parameter determination. In this work, we present an approach that has the potential to solve this problem. Based on the uniqueness of UV absorption spectra of proteins, we were able to accurately measure retention times in systems of up to four co-eluting compounds. The presented approach is calibration-free, meaning that prior knowledge of pure component absorption spectra is not required. In fact, pure protein spectra can be determined from co-eluting proteins as part of the methodology. The approach was tested for size-exclusion chromatograms of 38 mixtures of co-eluting proteins. Retention times were determined with an average error of 0.6 s (1.6% of average peak width); approximated and measured pure component spectra showed an average coefficient of correlation of 0.992.

  2. Identification of protein superfamily from structure-based sequence motif

    Institute of Scientific and Technical Information of China (English)

    2002-01-01

    Using the protein tyrosine phosphatase (PTP) Ⅰ and Ⅱ superfamilies, which are evolutionarily distant, as an example, a structure-based sequence motif has been defined by structural comparison, structure-based sequence alignment and analysis of residue substitution patterns in the commonly conserved sequence regions. With this structure-based PTP sequence motif, phosphatases Ⅰ and Ⅱ can be correctly identified together in the SWISS-PROT and TrEMBL databases. The results show that the rate of correct identification is over 98%. This is the first time that PTP Ⅰ and Ⅱ have been identified together by this motif.

  3. Physicochemical property distributions for accurate and rapid pairwise protein homology detection

    Directory of Open Access Journals (Sweden)

    Oehmen Christopher S

    2010-03-01

    Full Text Available Abstract Background The challenge of remote homology detection is that many evolutionarily related sequences have very little similarity at the amino acid level. Kernel-based discriminative methods, such as support vector machines (SVMs), that use vector representations of sequences derived from sequence properties have been shown to have superior accuracy when compared to traditional approaches for the task of remote homology detection. Results We introduce a new method for feature vector representation based on the physicochemical properties of the primary protein sequence. A distribution of physicochemical property scores is assembled from 4-mers of the sequence and normalized based on the null distribution of the property over all possible 4-mers. With this approach there is little computational cost associated with the transformation of the protein into feature space, and overall performance in terms of remote homology detection is comparable with current state-of-the-art methods. We demonstrate that the features can be used for the task of pairwise remote homology detection with improved accuracy versus sequence-based methods such as BLAST and other feature-based methods of similar computational cost. Conclusions A protein feature method based on physicochemical properties is a viable approach for extracting features in a computationally inexpensive manner while retaining the sensitivity of SVM protein homology detection. Furthermore, identifying features that can be used for generic pairwise homology detection in lieu of family-based homology detection is important for applications such as large database searches and comparative genomics.
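
    A rough sketch of the feature construction described in this record is given below. Several details are assumptions made for illustration and are not taken from the paper: the property is Kyte-Doolittle hydropathy, the score of a 4-mer is the mean property value of its residues, normalization is a z-score against the mean and standard deviation of that score over all possible 4-mers, and the distribution is summarized as a six-bin histogram. The function name and bin settings are hypothetical. The SVM stage is not shown.

```python
import math
from statistics import mean, pstdev
from typing import List

# Kyte-Doolittle hydropathy values, used here only as an example property.
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5,
    "Q": -3.5, "E": -3.5, "G": -0.4, "H": -3.2, "I": 4.5,
    "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8, "P": -1.6,
    "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}


def kmer_property_features(
    sequence: str, k: int = 4, n_bins: int = 6, z_range: float = 3.0
) -> List[float]:
    """Histogram of normalized k-mer property scores for one sequence."""
    values = list(HYDROPATHY.values())
    # Null distribution over all 20**k k-mers: the mean of k independent,
    # uniformly chosen residues has this exact mean and standard deviation.
    null_mean = mean(values)
    null_sd = pstdev(values) / math.sqrt(k)

    hist = [0] * n_bins
    total = 0
    for i in range(len(sequence) - k + 1):
        kmer = sequence[i:i + k]
        if any(res not in HYDROPATHY for res in kmer):
            continue  # skip windows with non-standard residues
        score = sum(HYDROPATHY[res] for res in kmer) / k
        z = (score - null_mean) / null_sd
        z = max(-z_range, min(z_range, z))  # clip extreme values
        bin_index = min(int((z + z_range) / (2 * z_range) * n_bins), n_bins - 1)
        hist[bin_index] += 1
        total += 1
    if total == 0:
        return [0.0] * n_bins
    return [count / total for count in hist]
```

    Each sequence maps to a fixed-length vector regardless of its length, which is what allows two proteins to be compared pairwise at low computational cost.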

  4. Machine Learning Identification of Protein Properties Useful for Specific Applications

    KAUST Repository

    Khamis, Abdullah

    2016-03-31

    Proteins play critical roles in cellular processes of living organisms. It is therefore important to identify and characterize their key properties associated with their functions. Correlating protein’s structural, sequence and physicochemical properties of its amino acids (aa) with protein functions could identify some of the critical factors governing the specific functionality. We point out that not all functions of even well studied proteins are known. This, complemented by the huge increase in the number of newly discovered and predicted proteins, makes challenging the experimental characterization of the whole spectrum of possible protein functions for all proteins of interest. Consequently, the use of computational methods has become more attractive. Here we address two questions. The first one is how to use protein aa sequence and physicochemical properties to characterize a family of proteins. The second one focuses on how to use transcription factor (TF) protein’s domains to enhance accuracy of predicting TF DNA binding sites (TFBSs). To address the first question, we developed a novel method using computational representation of proteins based on characteristics of different protein regions (N-terminal, M-region and C-terminal) and combined these with the properties of protein aa sequences. We show that this description provides important biological insight about characterization of the protein functional groups. Using feature selection techniques, we identified key properties of proteins that allow for very accurate characterization of different protein families. We demonstrated efficiency of our method in application to a number of antimicrobial peptide families. To address the second question we developed another novel method that uses a combination of aa properties of DNA binding domains of TFs and their TFBS properties to develop machine learning models for predicting TFBSs. Feature selection is used to identify the most relevant characteristics

  5. Identification of AOSC-binding proteins in neurons

    Institute of Scientific and Technical Information of China (English)

    LIU Ming; NIE Qin; XIN Xianliang; GENG Meiyu

    2008-01-01

    Acidic oligosaccharide sugar chain (AOSC), a D-mannuronic acid oligosaccharide derived from brown algae polysaccharide, has completed a Phase I clinical trial in China as an anti-Alzheimer's disease (AD) drug candidate. The identification of AOSC-binding protein(s) in neurons is very important for understanding its mechanism of action. To determine the binding protein(s) of AOSC in neurons mediating its anti-AD activities, confocal microscopy, affinity chromatography, and liquid chromatography-tandem mass spectrometry (LC-MS/MS) analysis were used. Confocal microscopy analysis showed that AOSC binds to SH-SY5Y cells in a concentration-, time-, and temperature-dependent fashion. The AOSC-binding proteins were purified by affinity chromatography and identified by LC-MS/MS analysis. The results showed that 349 proteins bind AOSC, including clathrin, adaptor protein-2 (AP-2) and amyloid precursor protein (APP). These results suggest that the binding/entrance of AOSC to neurons is probably responsible for its anti-AD activities.

  6. Identification of Tobacco Topping Responsive Proteins in Roots

    Directory of Open Access Journals (Sweden)

    Hongxiang Guo

    2016-04-01

    Full Text Available The tobacco plant responds to topping in many ways, such as an increased capacity for nicotine synthesis and secondary growth of roots. Some topping-responsive miRNAs and genes were identified in our previous work, but they are not enough to explain the mechanism of the tobacco response to topping. Here, topping-responsive proteins were screened from tobacco roots with two-dimensional electrophoresis. Of these proteins, calreticulin (CRT) and auxin-responsive protein IAA9 were related to the secondary growth of roots; LRR disease resistance, heat shock protein 70 and farnesyl pyrophosphate synthase 1 (FPPS) were involved in the wounding stress response; and F-box protein played an important role in promoting nicotine synthesis after topping. In addition, there were five tobacco bHLH proteins (NtbHLH, NtMYC1a, NtMYC1b, NtMYC2a and NtMYC2b) related to nicotine synthesis. It was suggested that NtMYC2 might be the main positive transcription factor and that NtbHLH protein is a negative regulator in the JA-mediated activation of nicotine synthesis after topping. Tobacco topping activates comprehensive biological processes involving the IAA and JA signaling pathways, and the identification of these proteins will help in understanding the topping response.

  7. Combining Evolutionary Information and an Iterative Sampling Strategy for Accurate Protein Structure Prediction.

    Directory of Open Access Journals (Sweden)

    Tatjana Braun

    2015-12-01

    Full Text Available Recent work has shown that the accuracy of ab initio structure prediction can be significantly improved by integrating evolutionary information in the form of intra-protein residue-residue contacts. Following this seminal result, much effort has been put into the improvement of contact predictions. However, there is also a substantial need to develop structure prediction protocols tailored to the type of restraints gained by contact predictions. Here, we present a structure prediction protocol that combines evolutionary information with the resolution-adapted structural recombination approach of Rosetta, called RASREC. Compared to the classic Rosetta ab initio protocol, RASREC achieves improved sampling, better convergence and higher robustness against incorrect distance restraints, making it the ideal sampling strategy for the stated problem. To demonstrate the accuracy of our protocol, we tested the approach on a diverse set of 28 globular proteins. Our method is able to converge for 26 out of the 28 targets and improves the average TM-score of the entire benchmark set from 0.55 to 0.72 when compared to the top ranked models obtained by the EVFold web server using identical contact predictions. Using a smaller benchmark, we furthermore show that the prediction accuracy of our method is only slightly reduced when the contact prediction accuracy is comparatively low. This observation is of special interest for protein sequences that only have a limited number of homologs.

  8. Identification of protein interacting partners using tandem affinity purification.

    Science.gov (United States)

    Bailey, Dalan; Urena, Luis; Thorne, Lucy; Goodfellow, Ian

    2012-02-25

    A critical and often limiting step in understanding the function of host and viral proteins is the identification of interacting cellular or viral protein partners. There are many approaches that allow the identification of interacting partners, including the yeast two hybrid system, as well as pull down assays using recombinant proteins and immunoprecipitation of endogenous proteins followed by mass spectrometry identification(1). Recent studies have highlighted the utility of double-affinity tag mediated purification, coupled with two specific elution steps in the identification of interacting proteins. This approach, termed Tandem Affinity Purification (TAP), was initially used in yeast(2,3) but more recently has been adapted to use in mammalian cells(4-8). As proof-of-concept we have established a tandem affinity purification (TAP) method using the well-characterized eukaryotic translation initiation factor eIF4E(9,10). The cellular translation factor eIF4E is a critical component of the cellular eIF4F complex involved in cap-dependent translation initiation(10). The TAP tag used in the current study is composed of two Protein G units and a streptavidin binding peptide separated by a Tobacco Etch Virus (TEV) protease cleavage sequence(8). To forgo the need for the generation of clonal cell lines, we developed a rapid system that relies on the expression of the TAP-tagged bait protein from an episomally maintained plasmid based on pMEP4 (Invitrogen). Expression of tagged murine eIF4E from this plasmid was controlled using the cadmium chloride inducible metallothionein promoter. Lysis of the expressing cells and subsequent affinity purification via binding to rabbit IgG agarose, TEV protease cleavage, binding to streptavidin linked agarose and subsequent biotin elution identified numerous

  9. Comprehensive Identification of Immunodominant Proteins of Brucella abortus and Brucella melitensis Using Antibodies in the Sera from Naturally Infected Hosts

    OpenAIRE

    Gamal Wareth; Murat Eravci; Christoph Weise; Uwe Roesler; Falk Melzer; Sprague, Lisa D.; Heinrich Neubauer; Jayaseelan Murugaiyan

    2016-01-01

    Brucellosis is a debilitating zoonotic disease that affects humans and animals. The diagnosis of brucellosis is challenging, as accurate species level identification is not possible with any of the currently available serology-based diagnostic methods. The present study aimed at identifying Brucella (B.) species-specific proteins from the closely related species B. abortus and B. melitensis using sera collected from naturally infected host species. Unlike earlier reported investigations with ...

  10. HEASARC Astronomical Archive: GLIESE2MAS - Gliese Catalog Stars with Accurate Coordinates and 2MASS Cross-Identifications

    Data.gov (United States)

    National Aeronautics and Space Administration — This table contains precise epoch 2000 coordinates and cross-identifications to sources in the 2MASS Point Source Catalog for nearly all stars in the Gliese,...

  11. PETs: A Stable and Accurate Predictor of Protein-Protein Interacting Sites Based on Extremely-Randomized Trees.

    Science.gov (United States)

    Xia, Bin; Zhang, Hong; Li, Qianmu; Li, Tao

    2015-12-01

    Protein-protein interaction (PPI) plays crucial roles in the performance of various biological processes. A variety of methods are dedicated to identifying whether proteins have interaction residues, but it is often more crucial to recognize each amino acid. In practical applications, the stability of a prediction model is as important as its accuracy. However, random sampling, which is widely used in previous prediction models, often produces large differences between trained models. In this paper, a Predictor of protein-protein interaction sites based on Extremely-randomized Trees (PETs) is proposed to improve the prediction accuracy while maintaining the prediction stability. In PETs, a cluster-based sampling strategy is proposed to ensure the model stability: first, the training dataset is divided into subsets using specific features; second, the subsets are clustered using K-means; and finally the samples are selected from each cluster. Using the proposed sampling strategy, samples which have different types of significant features could be selected independently from different clusters. The evaluation shows that PETs is able to achieve better accuracy while maintaining a good stability. The source code and toolkit are available at https://github.com/BinXia/PETs.
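
    The cluster-based sampling idea in this record (cluster the training samples with K-means, then draw from every cluster) can be sketched with the standard library alone. This is not the PETs implementation, which is available at the URL above. The function names, the plain Lloyd iteration, the fixed seed and the equal number of draws per cluster are assumptions made for this example, and the initial split of the training set by specific features is omitted.

```python
import random
from typing import List, Sequence

Point = Sequence[float]


def _dist2(a: Point, b: Point) -> float:
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_labels(points: List[Point], k: int, n_iter: int = 20, seed: int = 0) -> List[int]:
    """Plain Lloyd's algorithm; returns a cluster label for every point."""
    rng = random.Random(seed)
    centers = [list(p) for p in rng.sample(list(points), k)]
    labels = [0] * len(points)
    for _ in range(n_iter):
        # Assignment step: attach each point to its nearest center.
        labels = [min(range(k), key=lambda c: _dist2(p, centers[c])) for p in points]
        # Update step: move each center to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:
                centers[c] = [sum(col) / len(members) for col in zip(*members)]
    return labels


def cluster_sample(points: List[Point], k: int, per_cluster: int, seed: int = 0) -> List[int]:
    """Return sorted indices of samples drawn separately from each cluster."""
    labels = kmeans_labels(points, k, seed=seed)
    rng = random.Random(seed)
    chosen: List[int] = []
    for c in range(k):
        members = [i for i, lab in enumerate(labels) if lab == c]
        chosen.extend(rng.sample(members, min(per_cluster, len(members))))
    return sorted(chosen)
```

    Because every cluster contributes samples, repeated draws cover the same regions of feature space, which is the intuition behind the stability claim in the abstract.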

  12. Accurate determination of the diffusion coefficient of proteins by Fourier analysis with whole column imaging detection.

    Science.gov (United States)

    Zarabadi, Atefeh S; Pawliszyn, Janusz

    2015-02-17

    Analysis in the frequency domain is considered a powerful tool to elicit precise information from spectroscopic signals. In this study, the Fourier transformation technique is employed to determine the diffusion coefficient (D) of a number of proteins in the frequency domain. Analytical approaches are investigated for determination of D from both experimental and data treatment viewpoints. The diffusion process is modeled to calculate diffusion coefficients based on the Fourier transformation solution to Fick's law equation, and its results are compared to time domain results. The simulations characterize optimum spatial and temporal conditions and demonstrate the noise tolerance of the method. The proposed model is validated by its application for the electropherograms from the diffusion path of a set of proteins. Real-time dynamic scanning is conducted to monitor dispersion by employing whole column imaging detection technology in combination with capillary isoelectric focusing (CIEF) and the imaging plug flow (iPF) experiment. These experimental techniques provide different peak shapes, which are utilized to demonstrate the Fourier transformation ability in extracting diffusion coefficients out of irregular shape signals. Experimental results confirmed that the Fourier transformation procedure substantially enhanced the accuracy of the determined values compared to those obtained in the time domain.
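
    The abstract does not reproduce the equations. As a sketch, assuming one-dimensional diffusion with no flow and no boundary effects (assumptions of this note, not statements from the paper), the standard frequency-domain relation behind this kind of analysis is:

```latex
\frac{\partial c(x,t)}{\partial t} = D\,\frac{\partial^{2} c(x,t)}{\partial x^{2}},
\qquad
\hat{c}(k,t) = \int_{-\infty}^{\infty} c(x,t)\,e^{-ikx}\,dx
\;\;\Longrightarrow\;\;
\frac{\partial \hat{c}(k,t)}{\partial t} = -D k^{2}\,\hat{c}(k,t),
\qquad
\hat{c}(k,t) = \hat{c}(k,0)\,e^{-D k^{2} t}.

% Comparing two scans taken at times t_1 < t_2 at the same spatial frequency k:
D = -\frac{1}{k^{2}\,(t_{2}-t_{1})}\,
    \ln\frac{\lvert \hat{c}(k,t_{2}) \rvert}{\lvert \hat{c}(k,t_{1}) \rvert}.
```

    Each Fourier component decays at a rate set only by D and k, whatever the initial peak shape, which is consistent with the record's remark that diffusion coefficients can be extracted from irregularly shaped signals.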

  13. Identification of Uropathogenic Escherichia coli Surface Proteins by Shotgun Proteomics

    Science.gov (United States)

    Walters, Matthew S.; Mobley, Harry L.T.

    2009-01-01

    Uropathogenic Escherichia coli (UPEC) cause the majority of uncomplicated urinary tract infections in humans. In the process of identifying candidate antigens for a vaccine, two methods for the identification of the UPEC surface proteome during growth in human urine were investigated. The first approach utilized a protease to ‘shave’ surface-exposed peptides from the bacterial cell surface and identify them by mass spectrometry. Although this approach has been successfully applied to a Gram-positive pathogen, the adaptation to Gram-negative UPEC resulted in cytoplasmic protein contamination. In a more direct approach, whole-cell bacteria were labeled with a biotin tag to indicate surface-exposed peptides and two-dimensional liquid chromatography-tandem mass spectrometry (2-DLC-MS/MS) was used to identify proteins isolated from the outer membrane. This method discovered 25 predicted outer membrane proteins expressed by UPEC while growing in human urine. Nine of the 25 predicted outer membrane proteins were part of iron transport systems or putative iron-regulated virulence proteins, indicating the importance of iron acquisition during growth in urine. One of the iron transport proteins identified, Hma, appears to be a promising vaccine candidate and is being further investigated. The method described here presents a system to rapidly identify the outer membrane proteome of bacteria, which may prove valuable in vaccine development. PMID:19426766

  14. Identification of 24h Ixodes scapularis immunogenic tick saliva proteins.

    Science.gov (United States)

    Lewis, Lauren A; Radulović, Željko M; Kim, Tae K; Porter, Lindsay M; Mulenga, Albert

    2015-04-01

    Ixodes scapularis is arguably the most medically important tick species in the United States. This tick transmits 5 of the 14 human tick-borne disease (TBD) agents in the USA: Borrelia burgdorferi, Anaplasma phagocytophilum, B. miyamotoi, Babesia microti, and Powassan virus disease. Except for the Powassan virus disease, I. scapularis-vectored TBD agents require more than 24h post attachment to be transmitted. This study describes identification of 24h immunogenic I. scapularis tick saliva proteins, which could provide opportunities to develop strategies to stop tick feeding before transmission of the majority of pathogens. A 24h fed female I. scapularis phage display cDNA expression library was biopanned using rabbit antibodies to 24h fed I. scapularis female tick saliva proteins, subjected to next generation sequencing, de novo assembly, and bioinformatic analyses. A total of 182 contigs were assembled, of which ∼19% (35/182) are novel and did not show identity to any known proteins in GenBank. The remaining ∼81% (147/182) of contigs were provisionally identified based on matches in GenBank including ∼18% (27/147) that matched protein sequences previously annotated as hypothetical and putative tick saliva proteins. Others include proteases and protease inhibitors (∼3%, 5/147), transporters and/or ligand binding proteins (∼6%, 9/147), immunogenic tick saliva housekeeping enzyme-like (17%, 25/147), ribosomal protein-like (∼31%, 46/147), and those classified as miscellaneous (∼24%, 35/147). Notable in the miscellaneous class are antimicrobial peptides (microplusin and ricinusin), myosin-like proteins that have been previously found in tick saliva, and heat shock tick saliva protein. Data in this study provide the foundation for in-depth analysis of I. scapularis feeding during the first 24h, before the majority of TBD agents can be transmitted.

  15. Amplified protein detection and identification through DNA-conjugated M13 bacteriophage.

    Science.gov (United States)

    Lee, Ju Hun; Domaille, Dylan W; Cha, Jennifer N

    2012-06-26

    Sensitive protein detection and accurate identification continue to be in great demand for disease screening in clinical and laboratory settings. For these diagnostics to be of clinical value, it is necessary to develop sensors that have high sensitivity but favorable cost-to-benefit ratios. However, many of these sensing platforms are thermally unstable or require significant materials synthesis, engineering, or fabrication. Recently, we demonstrated that naturally occurring M13 bacteriophage can serve as biological scaffolds for engineering protein diagnostics. These viruses have five copies of the pIII protein, which can bind specifically to target antigens, and thousands of pVIII coat proteins, which can be genetically or chemically modified to react with signal-producing materials, such as plasmon-shifting gold nanoparticles (Au NPs). In this report, we show that DNA-conjugated M13 bacteriophage can act as inexpensive protein sensors that can rapidly induce a color change in the presence of a target protein yet also offer the ability to identify the detected antigen in a separate step. Many copies of a specific DNA oligonucleotide were appended to each virus to create phage-DNA conjugates that can hybridize with DNA-conjugated gold nanoparticles. In the case of a colorimetric positive result, the identity of the antigen can also be easily determined by using a DNA microarray. This saves precious resources by establishing a rapid, quantitative method to first screen for the presence of antigen followed by a highly specific typing assay if necessary.

  16. Fit3D: a web application for highly accurate screening of spatial residue patterns in protein structure data.

    Science.gov (United States)

    Kaiser, Florian; Eisold, Alexander; Bittrich, Sebastian; Labudde, Dirk

    2016-03-01

    The clarification of linkage between protein structure and function is still a demanding process and can be supported by comparison of spatial residue patterns, so-called structural motifs. However, versatile up-to-date resources to search for local structure similarities are rare. We present Fit3D, an easily accessible web application for highly accurate screening of structural motifs in 3D protein data. The web application is accessible at https://biosciences.hs-mittweida.de/fit3d and program sources of the command line version were released under the terms of GNU GPLv3. Platform-independent binaries and documentation for offline usage are available at https://bitbucket.org/fkaiser/fit3d. Contact: florian.kaiser@hs-mittweida.de. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  17. Identification and phylogenetic analysis of Dictyostelium discoideum kinesin proteins

    Directory of Open Access Journals (Sweden)

    Glöckner Gernot

    2003-11-01

    Background: Kinesins constitute a large superfamily of motor proteins in eukaryotic cells. They perform diverse tasks such as vesicle and organelle transport and chromosomal segregation in a microtubule- and ATP-dependent manner. In recent years, the genomes of a number of eukaryotic organisms have been completely sequenced. Subsequent studies revealed and classified the full set of members of the kinesin superfamily expressed by these organisms. For Dictyostelium discoideum, only five kinesin superfamily proteins (Kifs) have been reported so far. Results: Here, we report the identification of thirteen kinesin genes by exploiting the information from the raw shotgun reads of the Dictyostelium discoideum genome project. A phylogenetic tree of 390 kinesin motor domain sequences was built, grouping the Dictyostelium kinesins into nine subfamilies. According to known cellular functions or strong homologies to kinesins of other organisms, four of the Dictyostelium kinesins are involved in organelle transport, six are implicated in cell division processes, two are predicted to perform multiple functions, and one kinesin may be the founder of a new subclass. Conclusion: This analysis of the Dictyostelium genome led to the identification of eight new kinesin motor proteins. According to an exhaustive phylogenetic comparison, Dictyostelium contains the same subset of kinesins that higher eukaryotes need to perform mitosis. Some of the kinesins are implicated in intracellular traffic and a small number have unpredictable functions.

  18. DisoMCS: Accurately Predicting Protein Intrinsically Disordered Regions Using a Multi-Class Conservative Score Approach.

    Directory of Open Access Journals (Sweden)

    Zhiheng Wang

    The precise prediction of protein intrinsically disordered regions, which play a crucial role in biological processes, is a necessary prerequisite to further the understanding of the principles and mechanisms of protein function. Here, we propose a novel predictor, DisoMCS, which is a more accurate predictor of protein intrinsically disordered regions. DisoMCS is based on an original multi-class conservative score (MCS) obtained by sequence-order/disorder alignment. Initially, near-disorder regions are defined on fragments located at either terminus of an ordered region that connects to a disordered region. Then the multi-class conservative score is generated by sequence alignment against a known structure database and represented as order, near-disorder and disorder conservative scores. The MCS of each amino acid has three elements: order, near-disorder and disorder profiles. Finally, the MCS is exploited as features to identify disordered regions in sequences. DisoMCS utilizes a non-redundant data set as the training set, MCS and predicted secondary structure as features, and a conditional random field as the classification algorithm. In predicted near-disorder regions, a residue is classified as ordered or disordered according to the optimized decision threshold. DisoMCS was evaluated by cross-validation, large-scale prediction, independent tests and CASP (Critical Assessment of Techniques for Protein Structure Prediction) tests. All results confirmed that DisoMCS was very competitive in terms of prediction accuracy when compared with well-established publicly available disordered region predictors. The results also indicated that our approach was more accurate when a query has higher homology with the knowledge database. DisoMCS is available at http://cal.tongji.edu.cn/disorder/.

  19. CRNPRED: highly accurate prediction of one-dimensional protein structures by large-scale critical random networks

    Directory of Open Access Journals (Sweden)

    Kinjo Akira R

    2006-09-01

    Background: One-dimensional protein structures such as secondary structures or contact numbers are useful for three-dimensional structure prediction and helpful for intuitive understanding of the sequence-structure relationship. Accurate prediction methods will serve as a basis for these and other purposes. Results: We implemented a program CRNPRED which predicts secondary structures, contact numbers and residue-wise contact orders. This program is based on a novel machine learning scheme called critical random networks. Unlike most conventional one-dimensional structure prediction methods which are based on local windows of an amino acid sequence, CRNPRED takes into account the whole sequence. CRNPRED achieves, on average per chain, Q3 = 81% for secondary structure prediction, and correlation coefficients of 0.75 and 0.61 for contact number and residue-wise contact order predictions, respectively. Conclusion: CRNPRED will be a useful tool for computational as well as experimental biologists who need accurate one-dimensional protein structure predictions.
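    The two figures of merit quoted in this abstract, Q3 averaged per chain and a correlation coefficient, can be illustrated with a minimal sketch. It assumes the standard three-state labels H, E and C and the Pearson correlation; the function names and the toy inputs are hypothetical and are not taken from CRNPRED itself.

```python
from math import sqrt


def q3(predicted: str, observed: str) -> float:
    """Fraction of residues whose three-state label (H, E or C) matches."""
    if len(predicted) != len(observed) or not observed:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(p == o for p, o in zip(predicted, observed))
    return matches / len(observed)


def mean_q3(chains) -> float:
    """Average Q3 over (predicted, observed) pairs, i.e. 'on average per chain'."""
    return sum(q3(p, o) for p, o in chains) / len(chains)


def pearson(xs, ys) -> float:
    """Pearson correlation between predicted and observed per-residue values."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / sqrt(var_x * var_y)


# Toy data (hypothetical): one mismatch in six residues.
example_q3 = q3("HHHEEC", "HHHECC")
```

    Averaging per chain, as the abstract states, weights short and long chains equally; pooling all residues before dividing would give a different number.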

  20. Rapid and accurate identification of Mycobacterium tuberculosis complex and common non-tuberculous mycobacteria by multiplex real-time PCR targeting different housekeeping genes.

    Science.gov (United States)

    Nasr Esfahani, Bahram; Rezaei Yazdi, Hadi; Moghim, Sharareh; Ghasemian Safaei, Hajieh; Zarkesh Esfahani, Hamid

    2012-11-01

    Rapid and accurate identification of mycobacteria isolates from primary culture is important for timely and appropriate antibiotic therapy. Conventional methods for identification of Mycobacterium species based on biochemical tests need several weeks and may remain inconclusive. In this study, a novel multiplex real-time PCR was developed for rapid identification of Mycobacterium genus, Mycobacterium tuberculosis complex (MTC) and the most common non-tuberculous mycobacteria species including M. abscessus, M. fortuitum, M. avium complex, M. kansasii, and M. gordonae in three reaction tubes but under the same PCR conditions. Genetic targets for primer design included the 16S rDNA gene, the dnaJ gene, the gyrB gene and the internal transcribed spacer (ITS). Multiplex real-time PCR was set up with reference Mycobacterium strains and was subsequently tested with 66 clinical isolates. Results of multiplex real-time PCR were analyzed with melting curves, and the melting temperatures (Tm) of Mycobacterium genus, MTC, and each of the non-tuberculous Mycobacterium species were determined. Multiplex real-time PCR results were compared with amplification and sequencing of 16S-23S rDNA ITS for identification of Mycobacterium species. Sensitivity and specificity of the designed primers were each 100 % for MTC, M. abscessus, M. fortuitum, M. avium complex, M. kansasii, and M. gordonae. Sensitivity and specificity of the designed primers for genus Mycobacterium were 96 and 100 %, respectively. According to the obtained results, we conclude that this multiplex real-time PCR with melting curve analysis and these novel primers can be used for rapid and accurate identification of genus Mycobacterium, MTC, and the most common non-tuberculous Mycobacterium species.
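    Sensitivity and specificity, as reported above, follow from the standard confusion-matrix definitions. The sketch below shows the arithmetic only; the abstract does not report the underlying counts, so the numbers used here are hypothetical and chosen merely to reproduce a 96 % / 100 % pattern.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Share of truly positive isolates that the assay detects."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg: int, false_pos: int) -> float:
    """Share of truly negative isolates that the assay correctly rejects."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts, NOT from the study: 48 of 50 positives detected,
# 16 of 16 negatives correctly rejected.
genus_sensitivity = sensitivity(true_pos=48, false_neg=2)
genus_specificity = specificity(true_neg=16, false_pos=0)
```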

  1. Identification of anabolic steroids and derivatives using bioassay-guided fractionation, UHPLC/TOFMS analysis and accurate mass database searching

    NARCIS (Netherlands)

    Peters, R.J.B.; Rijk, J.C.W.; Bovee, T.F.H.; Nijrolder, A.W.J.M.; Lommen, A.; Nielen, M.W.F.

    2010-01-01

    Biological tests can be used to screen samples for large groups of compounds having a particular effect, but it is often difficult to identify a specific compound when a positive effect is observed. The identification of an unknown compound is a challenge for analytical chemistry in environmental an

  2. Identification of differentially expressed serum proteins in gastric adenocarcinoma

    Science.gov (United States)

    Subbannayya, Yashwanth; Mir, Sartaj Ahmad; Renuse, Santosh; Manda, Srikanth S.; Pinto, Sneha M.; Puttamallesh, Vinuth N.; Solanki, Hitendra Singh; Manju, H.C.; Syed, Nazia; Sharma, Rakesh; Christopher, Rita; Vijayakumar, M.; Kumar, K.V. Veerendra; Prasad, T.S. Keshava; Ramaswamy, Girija; Kumar, Rekha V.; Chatterjee, Aditi; Pandey, Akhilesh; Gowda, Harsha

    2015-01-01

    Gastric adenocarcinoma is an aggressive cancer with poor prognosis. Blood based biomarkers of gastric cancer have the potential to improve diagnosis and monitoring of these tumors. Proteins that show altered levels in the circulation of gastric cancer patients could prove useful as putative biomarkers. We used an iTRAQ-based quantitative proteomic approach to identify proteins that show altered levels in the sera of patients with gastric cancer. Our study resulted in identification of 643 proteins, of which 48 proteins showed increased levels and 11 proteins showed decreased levels in serum from gastric cancer patients compared to age and sex matched healthy controls. Proteins that showed increased expression in gastric cancer included inter-alpha-trypsin inhibitor heavy chain H4 (ITIH4), Mannose-binding protein C (MBL2), sex hormone-binding globulin (SHBG), insulin-like growth factor-binding protein 2 (IGFBP2), serum amyloid A protein (SAA1), Orosomucoid 1 (ORM1) and extracellular superoxide dismutase [Cu–Zn] (SOD3). We used multiple reaction monitoring assays and validated elevated levels of ITIH4 and SAA1 proteins in serum from gastric cancer patients. Biological significance: Gastric cancer is a highly aggressive cancer associated with high mortality. Serum-based biomarkers are of considerable interest in diagnosis and monitoring of various diseases including cancers. Gastric cancer is often diagnosed at advanced stages resulting in poor prognosis and high mortality. Pathological diagnosis using biopsy specimens remains the gold standard for diagnosis of gastric cancer. Serum-based biomarkers are of considerable importance as they are minimally invasive. In this study, we carried out quantitative proteomic profiling of serum from gastric cancer patients to identify proteins that show altered levels in gastric cancer patients. We identified more than 50 proteins that showed altered levels in gastric cancer patient sera. Validation in a large cohort of well

  3. Identification of Proteins with Potential Osteogenic Activity Present in the Water-Soluble Matrix Proteins from Crassostrea gigas Nacre Using a Proteomic Approach

    Directory of Open Access Journals (Sweden)

    Daniel V. Oliveira

    2012-01-01

    Nacre, when implanted in vivo in bones of dogs, sheep, mice, and humans, induces a biological response that includes integration and osteogenic activity on the host tissue that seems to be activated by a set of proteins present in the nacre water-soluble matrix (WSM). We describe here an experimental approach that can accurately identify the proteins present in the WSM of shell mollusk nacre. Four proteins (three gigasin-2 isoforms and a cystatin A2) were identified for the first time in the WSM of Crassostrea gigas nacre using 2DE and LC-MS/MS for protein identification. These proteins are thought to be involved in bone remodeling processes and could be responsible for the biocompatibility shown between bone and nacre grafts. These results represent a contribution to the study of the shell biomineralization process and open new perspectives for the development of new nacre biomaterials for orthopedic applications.

  4. Rapid and accurate identification of Streptococcus equi subspecies by MALDI-TOF MS

    DEFF Research Database (Denmark)

    Kudirkiene, Egle; Welker, Martin; Knudsen, Nanna Reumert

    2015-01-01

    phenotypic and sequence similarity between three subspecies their discrimination remains difficult. In this study, we aimed to design and validate a novel, Superspectra based, MALDI-TOF MS approach for reliable, rapid and cost-effective identification of SEE and SEZ, the most frequent S. equi subspecies.......3±7.5%). This result may be attributed to the highly clonal population structure of SEE, as opposed to the diversity of SEZ seen in horses. Importantly strains with atypical colony appearance both within SEE and SEZ did not affect correct identification of the strains by MALDI-TOF MS. Atypical colony variants...... with spectra analyses using the SARAMIS database. Additionally, first results on subtyping of SEZ indicated that a more refined discrimination, for example for epidemiological surveys, may be possible...

  5. SCPRED: Accurate prediction of protein structural class for sequences of twilight-zone similarity with predicting sequences

    Directory of Open Access Journals (Sweden)

    Chen Ke

    2008-05-01

    Background: Protein structure prediction methods provide accurate results when a homologous protein is predicted, while poorer predictions are obtained in the absence of homologous templates. However, some protein chains that share twilight-zone pairwise identity can form similar folds, and thus determining structural similarity without sequence similarity would be desirable for structure prediction. The folding type of a protein or its domain is defined as the structural class. Current structural class prediction methods that predict the four structural classes defined in SCOP provide up to 63% accuracy for datasets in which the sequence identity of any pair of sequences belongs to the twilight zone. We propose the SCPRED method, which improves prediction accuracy for sequences that share twilight-zone pairwise similarity with sequences used for the prediction. Results: SCPRED uses a support vector machine classifier that takes several custom-designed features as its input to predict the structural classes. Based on extensive design that considers over 2300 index-, composition- and physicochemical properties-based features along with features based on the predicted secondary structure and content, the classifier's input includes 8 features based on information extracted from the secondary structure predicted with PSI-PRED and one feature computed from the sequence. Tests performed with datasets of 1673 protein chains, in which any pair of sequences shares twilight-zone similarity, show that SCPRED obtains 80.3% accuracy when predicting the four SCOP-defined structural classes, which is superior when compared with over a dozen recent competing methods that are based on support vector machine, logistic regression, and ensemble of classifiers predictors. Conclusion: SCPRED can accurately find similar structures for sequences that share low identity with the sequences used for the prediction. The high predictive accuracy achieved by SCPRED is

  6. Rapid and accurate identification by real-time PCR of biotoxin-producing dinoflagellates from the family gymnodiniaceae.

    Science.gov (United States)

    Smith, Kirsty F; de Salas, Miguel; Adamson, Janet; Rhodes, Lesley L

    2014-03-07

    The identification of toxin-producing dinoflagellates for monitoring programmes and bio-compound discovery requires considerable taxonomic expertise. It can also be difficult to morphologically differentiate toxic and non-toxic species or strains. Various molecular methods have been used for dinoflagellate identification and detection, and this study describes the development of eight real-time polymerase chain reaction (PCR) assays targeting the large subunit ribosomal RNA (LSU rRNA) gene of species from the genera Gymnodinium, Karenia, Karlodinium, and Takayama. Assays proved to be highly specific and sensitive, and the assay for G. catenatum was further developed for quantification in response to a bloom in Manukau Harbour, New Zealand. The assay estimated cell densities from environmental samples as low as 0.07 cells per PCR reaction, which equated to three cells per litre. This assay not only enabled conclusive species identification but also detected the presence of cells below the limit of detection for light microscopy. This study demonstrates the usefulness of real-time PCR as a sensitive and rapid molecular technique for the detection and quantification of micro-algae from environmental samples.

  7. Identification of a Chitinase-modifying Protein from Fusarium verticillioides

    Science.gov (United States)

    Naumann, Todd A.; Wicklow, Donald T.; Price, Neil P. J.

    2011-01-01

    Chitinase-modifying proteins (cmps) are proteases secreted by fungal pathogens that truncate the plant class IV chitinases ChitA and ChitB during maize ear rot. cmp activity has been characterized for Bipolaris zeicola and Stenocarpella maydis, but the identities of the proteases are not known. Here, we report that cmps are secreted by multiple species from the genus Fusarium, that cmp from Fusarium verticillioides (Fv-cmp) is a fungalysin metalloprotease, and that it cleaves within a sequence that is conserved in class IV chitinases. Protein extracts from Fusarium cultures were found to truncate ChitA and ChitB in vitro. Based on this activity, Fv-cmp was purified from F. verticillioides. N-terminal sequencing of truncated ChitA and MALDI-TOF-MS analysis of reaction products showed that Fv-cmp is an endoprotease that cleaves a peptide bond on the C-terminal side of the lectin domain. The N-terminal sequence of purified Fv-cmp was determined and compared with a set of predicted proteins, resulting in its identification as a zinc metalloprotease of the fungalysin family. Recombinant Fv-cmp also truncated ChitA, confirming its identity, but had reduced activity, suggesting that the recombinant protease did not mature efficiently from its propeptide-containing precursor. This is the first report of a fungalysin that targets a nonstructural host protein and the first to implicate this class of virulence-related proteases in plant disease. PMID:21878653

  8. Identification of proteins binding coding and non-coding human RNAs using protein microarrays

    Directory of Open Access Journals (Sweden)

    Siprashvili Zurab

    2012-11-01

    Background: The regulation and function of mammalian RNAs have been increasingly appreciated to operate via RNA-protein interactions. With the recent discovery of thousands of novel human RNA molecules by high-throughput RNA sequencing, efficient methods to uncover RNA-protein interactions are urgently required. Existing methods to study proteins associated with a given RNA are laborious and require substantial amounts of cell-derived starting material. To overcome these limitations, we have developed a rapid and large-scale approach to characterize binding of in vitro transcribed labeled RNA to ~9,400 human recombinant proteins spotted on protein microarrays. Results: We have optimized methodology to probe human protein microarrays with full-length RNA molecules and have identified 137 RNA-protein interactions specific for 10 coding and non-coding RNAs. Those proteins showed strong enrichment for common human RNA binding domains such as RRM, RBD, as well as K homology and CCCH type zinc finger motifs. Previously unknown RNA-protein interactions were discovered using this technique, and these interactions were biochemically verified between TP53 mRNA and Staufen1 protein as well as between HRAS mRNA and CNBP protein. Functional characterization of the interaction between Staufen1 protein and TP53 mRNA revealed a novel role for Staufen1 in preserving TP53 RNA stability. Conclusions: Our approach demonstrates a scalable methodology, allowing rapid and efficient identification of novel human RNA-protein interactions using RNA hybridization to human protein microarrays. Biochemical validation of newly identified interactions between TP53-Stau1 and HRAS-CNBP using reciprocal pull-down experiments, both in vitro and in vivo, demonstrates the utility of this approach to study uncharacterized RNA-protein interactions.

  9. Identification of 4th intercostal space using sternal notch to xiphoid length for accurate electrocardiogram lead placement.

    Science.gov (United States)

    Day, Kevin; Oliva, Isabel; Krupinski, Elizabeth; Marcus, Frank

    2015-01-01

    Precordial ECG lead placement is difficult in obese patients with increased chest wall soft tissues due to inaccurate palpation of the intercostal spaces. We investigated whether the length of the sternum (distance between the sternal notch and xiphoid process) can accurately predict the location of the 4th intercostal space, which is the traditional location for V1 lead position. Fifty-five consecutive adult chest computed tomography examinations were reviewed for measurements. The sternal notch to right 4th intercostal space distance was 67% of the sternal notch to xiphoid process length with an overall correlation of r=0.600 (p [...] intercostal space for accurate placement of the precordial electrodes in adults in whom the 4th intercostal space cannot be found by physical exam. Copyright © 2015 Elsevier Inc. All rights reserved.
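    The proportional relationship reported above can be written as a one-line estimate. This is only an arithmetic illustration of the 67% figure from the abstract; the function name and the example sternal length are hypothetical, and with a correlation of r=0.600 the estimate should be read as approximate.

```python
# Ratio reported in the abstract: notch-to-4th-intercostal-space distance
# as a fraction of notch-to-xiphoid length.
FOURTH_ICS_FRACTION = 0.67


def estimate_fourth_ics_distance(notch_to_xiphoid_cm: float) -> float:
    """Estimated distance (cm) from the sternal notch to the 4th intercostal space."""
    if notch_to_xiphoid_cm <= 0:
        raise ValueError("sternal length must be positive")
    return FOURTH_ICS_FRACTION * notch_to_xiphoid_cm


# Hypothetical sternal length of 18 cm, not a value from the study.
example_distance_cm = estimate_fourth_ics_distance(18.0)
```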

  10. Identification of divergent protein domains by combining HMM-HMM comparisons and co-occurrence detection.

    Directory of Open Access Journals (Sweden)

    Amel Ghouila

    Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence--the general domain tendency to preferentially appear along with some favorite domains in the proteins--to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced.

  11. Identification of divergent protein domains by combining HMM-HMM comparisons and co-occurrence detection.

    Science.gov (United States)

    Ghouila, Amel; Florent, Isabelle; Guerfali, Fatma Zahra; Terrapon, Nicolas; Laouini, Dhafer; Yahia, Sadok Ben; Gascuel, Olivier; Bréhélin, Laurent

    2014-01-01

    Identification of protein domains is a key step for understanding protein function. Hidden Markov Models (HMMs) have proved to be a powerful tool for this task. The Pfam database notably provides a large collection of HMMs which are widely used for the annotation of proteins in sequenced organisms. This is done via sequence/HMM comparisons. However, this approach may lack sensitivity when searching for domains in divergent species. Recently, methods for HMM/HMM comparisons have been proposed and proved to be more sensitive than sequence/HMM approaches in certain cases. However, these approaches are usually not used for protein domain discovery at a genome scale, and the benefit that could be expected from their utilization for this problem has not been investigated. Using proteins of P. falciparum and L. major as examples, we investigate the extent to which HMM/HMM comparisons can identify new domain occurrences not already identified by sequence/HMM approaches. We show that although HMM/HMM comparisons are much more sensitive than sequence/HMM comparisons, they are not sufficiently accurate to be used as a standalone complement of sequence/HMM approaches at the genome scale. Hence, we propose to use domain co-occurrence--the general domain tendency to preferentially appear along with some favorite domains in the proteins--to improve the accuracy of the approach. We show that the combination of HMM/HMM comparisons and co-occurrence domain detection boosts protein annotations. At an estimated False Discovery Rate of 5%, it revealed 901 and 1098 new domains in Plasmodium and Leishmania proteins, respectively. Manual inspection of part of these predictions shows that it contains several domain families that were missing in the two organisms. All new domain occurrences have been integrated in the EuPathDomains database, along with the GO annotations that can be deduced.

  12. Accurate spectroscopic characterization of oxirane: A valuable route to its identification in Titan's atmosphere and the assignment of unidentified infrared bands

    Energy Technology Data Exchange (ETDEWEB)

    Puzzarini, Cristina [Dipartimento di Chimica "Giacomo Ciamician", Università di Bologna, Via Selmi 2, I-40126 Bologna (Italy)]; Biczysko, Malgorzata; Bloino, Julien; Barone, Vincenzo, E-mail: cristina.puzzarini@unibo.it [Scuola Normale Superiore, Piazza dei Cavalieri 7, I-56126 Pisa (Italy)]

    2014-04-20

    In an effort to provide an accurate spectroscopic characterization of oxirane, state-of-the-art computational methods and approaches have been employed to determine highly accurate fundamental vibrational frequencies and rotational parameters. Available experimental data were used to assess the reliability of our computations, and an average accuracy of 10 cm⁻¹ was found for fundamental transitions as well as overtones and combination bands. Moving to rotational spectroscopy, relative discrepancies of 0.1%, 2%-3%, and 3%-4% were observed for rotational, quartic, and sextic centrifugal-distortion constants, respectively. We are therefore confident that the highly accurate spectroscopic data provided herein can be useful for identification of oxirane in Titan's atmosphere and the assignment of unidentified infrared bands. Since oxirane was already observed in the interstellar medium and some astronomical objects are characterized by very high D/H ratios, we also considered the accurate determination of the spectroscopic parameters for the mono-deuterated species, oxirane-d1. For the latter, an empirical scaling procedure allowed us to improve our computed data and to provide predictions for rotational transitions with a relative accuracy of about 0.02% (i.e., an uncertainty of about 40 MHz for a transition lying at 200 GHz).

  13. Streptococcus dysgalactiae subsp. equisimilis Isolated From Infections in Dogs and Humans: Are Current Subspecies Identification Criteria accurate?

    Science.gov (United States)

    Ciszewski, Marcin; Zegarski, Kamil; Szewczyk, Eligia M

    2016-11-01

    Streptococcus dysgalactiae is a pyogenic species pathogenic both for humans and animals. Until recently, it has been considered an exclusive animal pathogen causing infections in wild as well as domestic animals. Currently, human infections are being reported with increasing frequency, and their clinical picture is often similar to the ones caused by Streptococcus pyogenes. Because S. dysgalactiae is a heterogeneous species, it was divided into two subspecies: S. dysgalactiae subsp. equisimilis (SDSE) and S. dysgalactiae subsp. dysgalactiae (SDSD). The first differentiation criterion, described in 1996, was based on strain isolation source. Currently applied criteria, published in 1998, are based on hemolysis type and Lancefield group classification. In this study, we compared subspecies identification results for 36 strains isolated from clinical cases both in humans and animals. Species differentiation was based on two previously described criteria as well as MALDI-TOF and genetic analyses: RISA and 16S rRNA gene sequencing. Antimicrobial susceptibility profiles were also determined according to CLSI guidelines. The results presented in our study suggest that the subspecies differentiation criteria described in the two publications mentioned above appear inaccurate for the analyzed group of strains; hemolysis type on blood agar and Lancefield classification should no longer be considered criteria in subspecies identification. The antimicrobial susceptibility tests indicate the emergence of multiresistant human SDSE strains that are also resistant to vancomycin, linezolid and tigecycline, which might pose a substantial problem in treatment.

  14. Simple and accurate determination of global tau(R) in proteins using (13)C or (15)N relaxation data.

    Science.gov (United States)

    Mispelter, J; Izadi-Pruneyre, N; Quiniou, E; Adjadj, E

    2000-03-01

    In the study of protein dynamics by (13)C or (15)N relaxation measurements, different models from the Lipari-Szabo formalism are used in order to determine the motion parameters. The global rotational correlation time tau(R) of the molecule must be estimated prior to the analysis. In this Communication, the authors propose a new approach to determining an accurate value for tau(R) in order to realize the best fit of R(2) for the whole sequence of the protein, regardless of the different types of motion the atoms may experience. The method first determines the highly structured regions of the sequence. For each corresponding site, the Lipari-Szabo parameters are calculated for R(1) and NOE, using an arbitrary value for tau(R). The chi(2) for R(2), summed over the selected sites, shows a clear minimum as a function of tau(R). This minimum is used to better estimate a proper value for tau(R).
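    The procedure described above can be summarised in two formulas. The first is the standard Lipari-Szabo (model-free) spectral density; the second is one plausible form of the chi(2) criterion, written here as an assumption, since the abstract does not give its exact definition. The symbols S_i^2 (order parameter), tau_e (internal correlation time) and sigma_i (experimental uncertainty of R(2) at site i) are introduced for illustration.

```latex
% Lipari-Szabo spectral density for a site with order parameter S^2
J(\omega) = \frac{2}{5}\left[
    \frac{S^{2}\,\tau_R}{1+\omega^{2}\tau_R^{2}}
  + \frac{(1-S^{2})\,\tau}{1+\omega^{2}\tau^{2}}
\right],
\qquad \frac{1}{\tau} = \frac{1}{\tau_R} + \frac{1}{\tau_e}

% Assumed form of the criterion: for each trial \tau_R, S_i^2 and \tau_{e,i}
% are fitted to R_1 and NOE, then R_2 is back-calculated and compared.
\chi^{2}(\tau_R) = \sum_{i \,\in\, \text{structured sites}}
  \left( \frac{R_{2,i}^{\mathrm{obs}} - R_{2,i}^{\mathrm{calc}}(\tau_R)}{\sigma_i} \right)^{2},
\qquad
\hat{\tau}_R = \arg\min_{\tau_R}\ \chi^{2}(\tau_R)
```

    Because R(2) is not used when fitting the site parameters, it serves as an independent check, which is why its chi(2) can show a minimum in tau(R).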

  15. Aptamer-conjugated live human immune cell based biosensors for the accurate detection of C-reactive protein

    Science.gov (United States)

    Hwang, Jangsun; Seo, Youngmin; Jo, Yeonho; Son, Jaewoo; Choi, Jonghoon

    2016-10-01

    C-reactive protein (CRP) is a pentameric protein that is present in the bloodstream during inflammatory events, e.g., liver failure, leukemia, and/or bacterial infection. The level of CRP indicates the progress and prognosis of certain diseases; it is therefore necessary to measure CRP levels in the blood accurately. The normal concentration of CRP is reported to be 1-3 mg/L. Inflammatory events increase the level of CRP by up to 500 times; accordingly, CRP is a biomarker of acute inflammatory disease. In this study, we demonstrated the preparation of DNA aptamer-conjugated peripheral blood mononuclear cells (Apt-PBMCs) that specifically capture human CRP. Live PBMCs functionalized with aptamers could detect different levels of human CRP by producing immune complexes with reporter antibody. The binding behavior of Apt-PBMCs toward highly concentrated CRP sites was also investigated. The immune responses of Apt-PBMCs were evaluated by measuring TNF-alpha secretion after stimulating the PBMCs with lipopolysaccharides. In summary, engineered Apt-PBMCs have potential applications as live cell based biosensors and for in vitro tracing of CRP secretion sites.

  16. Automatic Identification of Antibodies in the Protein Data Bank

    Institute of Scientific and Technical Information of China (English)

    LI Xun; WANG Renxiao

    2009-01-01

    An automatic method has been developed for identifying antibody entries in the Protein Data Bank (PDB). Our method, called KIAb (Keyword-based Identification of Antibodies), parses PDB-format files to search for particular keywords relevant to antibodies and classifies each entry accordingly. Our method identified 780 entries as antibodies in the entire PDB. Among them, 767 entries were confirmed by manual inspection, indicating a high success rate of 98.3%. Our method recovered essentially all of the entries compiled in the Summary of Antibody Crystal Structures (SACS) database. It also identified a number of entries missed by SACS. Our method thus provides a more complete mining of antibody entries in the PDB with a very low false positive rate.
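
    The keyword-based screening described in this record can be sketched roughly as follows. This is only an illustration, not the KIAb implementation: the keyword list and the choice of annotation records are assumptions, since the abstract does not state which keywords or records the authors actually use.

```python
from typing import Iterable

# Hypothetical keyword list; the abstract does not give KIAb's actual keywords.
ANTIBODY_KEYWORDS = ("ANTIBODY", "IMMUNOGLOBULIN", "FAB FRAGMENT")

# PDB-format record types that carry free-text annotation (assumed search scope).
ANNOTATION_RECORDS = ("HEADER", "TITLE", "COMPND", "KEYWDS")


def looks_like_antibody(pdb_lines: Iterable[str]) -> bool:
    """Return True if any annotation record mentions an antibody keyword."""
    for line in pdb_lines:
        # Only inspect annotation records, not coordinate records such as ATOM.
        if line.startswith(ANNOTATION_RECORDS):
            text = line.upper()
            if any(keyword in text for keyword in ANTIBODY_KEYWORDS):
                return True
    return False


def success_rate(confirmed: int, identified: int) -> float:
    """Fraction of automatically identified entries confirmed by inspection."""
    return confirmed / identified
```

    With the counts quoted in the abstract, success_rate(767, 780) evaluates to about 0.983, which agrees with the reported 98.3%.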

  17. ETHNOPRED: a novel machine learning method for accurate continental and sub-continental ancestry identification and population stratification correction

    Science.gov (United States)

    2013-01-01

    Background Population stratification is a systematic difference in allele frequencies between subpopulations. This can lead to spurious association findings in the case–control genome wide association studies (GWASs) used to identify single nucleotide polymorphisms (SNPs) associated with disease-linked phenotypes. Methods such as self-declared ancestry, ancestry informative markers, genomic control, structured association, and principal component analysis are used to assess and correct population stratification but each has limitations. We provide an alternative technique to address population stratification. Results We propose a novel machine learning method, ETHNOPRED, which uses the genotype and ethnicity data from the HapMap project to learn ensembles of disjoint decision trees, capable of accurately predicting an individual’s continental and sub-continental ancestry. To predict an individual’s continental ancestry, ETHNOPRED produced an ensemble of 3 decision trees involving a total of 10 SNPs, with 10-fold cross validation accuracy of 100% using HapMap II dataset. We extended this model to involve 29 disjoint decision trees over 149 SNPs, and showed that this ensemble has an accuracy of ≥ 99.9%, even if some of those 149 SNP values were missing. On an independent dataset, predominantly of Caucasian origin, our continental classifier showed 96.8% accuracy and improved genomic control’s λ from 1.22 to 1.11. We next used the HapMap III dataset to learn classifiers to distinguish European subpopulations (North-Western vs. Southern), East Asian subpopulations (Chinese vs. Japanese), African subpopulations (Eastern vs. Western), North American subpopulations (European vs. Chinese vs. African vs. Mexican vs. Indian), and Kenyan subpopulations (Luhya vs. Maasai). In these cases, ETHNOPRED produced ensembles of 3, 39, 21, 11, and 25 disjoint decision trees, respectively involving 31, 502, 526, 242 and 271 SNPs, with 10-fold cross validation accuracy of

  18. Accurate prediction of secreted substrates and identification of a conserved putative secretion signal for type III secretion systems.

    Directory of Open Access Journals (Sweden)

    Ram Samudrala

    2009-04-01

    The type III secretion system is an essential component for virulence in many Gram-negative bacteria. Though components of the secretion system apparatus are conserved, its substrates, effector proteins, are not. We have used a novel computational approach to confidently identify new secreted effectors by integrating protein sequence-based features, including evolutionary measures such as the pattern of homologs in a range of other organisms, G+C content, amino acid composition, and the N-terminal 30 residues of the protein sequence. The method was trained on known effectors from the plant pathogen Pseudomonas syringae and validated on a set of effectors from the animal pathogen Salmonella enterica serovar Typhimurium (S. Typhimurium) after eliminating effectors with detectable sequence similarity. We show that this approach can predict known secreted effectors with high specificity and sensitivity. Furthermore, by considering a large set of effectors from multiple organisms, we computationally identify a common putative secretion signal in the N-terminal 20 residues of secreted effectors. This signal can be used to discriminate 46 out of 68 total known effectors from both organisms, suggesting that it is a real, shared signal applicable to many type III secreted effectors. We use the method to make novel predictions of secreted effectors in S. Typhimurium, some of which have been experimentally validated. We also apply the method to predict secreted effectors in the genetically intractable human pathogen Chlamydia trachomatis, identifying the majority of known secreted proteins in addition to providing a number of novel predictions. This approach provides a new way to identify secreted effectors in a broad range of pathogenic bacteria for further experimental characterization and provides insight into the nature of the type III secretion signal.

  19. Identification of hot-spot residues in protein-protein interactions by computational docking

    Directory of Open Access Journals (Sweden)

    Fernández-Recio Juan

    2008-10-01

    Background: The study of protein-protein interactions is becoming increasingly important for biotechnological and therapeutic reasons. We can define two major areas therein: the structural prediction of protein-protein binding mode, and the identification of the relevant residues for the interaction (so-called 'hot-spots'). These hot-spot residues are of high interest since they are considered one of the possible ways of disrupting a protein-protein interaction. Unfortunately, large-scale experimental measurement of residue contribution to the binding energy, based on alanine-scanning experiments, is costly and thus data is fairly limited. Recent computational approaches for hot-spot prediction have been reported, but they usually require the structure of the complex. Results: We have applied here normalized interface propensity (NIP) values derived from rigid-body docking with electrostatics and desolvation scoring for the prediction of interaction hot-spots. This parameter identifies hot-spot residues on interacting proteins with predictive rates that are comparable to other existing methods (up to 80% positive predictive value), with the advantage of not requiring any prior structural knowledge of the complex. Conclusion: The NIP values derived from rigid-body docking can reliably identify a number of hot-spot residues whose contribution to the interaction arises from electrostatics and desolvation effects. Our method can propose residues to guide experiments in complexes of biological or therapeutic interest, even in cases with no available 3D structure of the complex.

  20. DNA binding protein identification by combining pseudo amino acid composition and profile-based protein representation

    Science.gov (United States)

    Liu, Bin; Wang, Shanyi; Wang, Xiaolong

    2015-10-01

    DNA-binding proteins play an important role in most cellular processes. Therefore, it is necessary to develop an efficient predictor for identifying DNA-binding proteins based only on the sequence information of proteins. The bottleneck in constructing a useful predictor is finding suitable features that capture the characteristics of DNA-binding proteins. We applied PseAAC to DNA-binding protein identification, and PseAAC was further improved by incorporating evolutionary information through a profile-based protein representation. Finally, combined with Support Vector Machines (SVMs), a predictor called iDNAPro-PseAAC was proposed. Experimental results on an updated benchmark dataset showed that iDNAPro-PseAAC outperformed some state-of-the-art approaches, and it can achieve stable performance on an independent dataset. By using an ensemble learning approach to incorporate more negative samples (non-DNA-binding proteins) in the training process, the performance of iDNAPro-PseAAC was further improved. The web server of iDNAPro-PseAAC is available at http://bioinformatics.hitsz.edu.cn/iDNAPro-PseAAC/.
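
    As a rough illustration of sequence-only features, the sketch below computes plain amino acid composition, i.e. the 20 normalized residue frequencies that conventionally form the first components of a PseAAC vector. It is a simplification under stated assumptions: the sequence-order terms of PseAAC, the profile-based representation, and the SVM used by iDNAPro-PseAAC are not reproduced, and the function name is hypothetical.

```python
from collections import Counter
from typing import List

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues, one-letter codes


def amino_acid_composition(sequence: str) -> List[float]:
    """Return the 20 normalized residue frequencies of a protein sequence.

    Non-standard characters (e.g. 'X') are ignored rather than counted.
    """
    counts = Counter(res for res in sequence.upper() if res in AMINO_ACIDS)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    # Fixed residue order so every protein maps to a comparable 20-dim vector.
    return [counts[aa] / total for aa in AMINO_ACIDS]
```

    A fixed-length vector of this kind is what allows proteins of different lengths to be fed to a classifier such as an SVM.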

  1. Rapid metabolite discovery, identification, and accurate comparison of the stereoselective metabolism of metalaxyl in rat hepatic microsomes.

    Science.gov (United States)

    Wang, Xinru; Qiu, Jing; Xu, Peng; Zhang, Ping; Wang, Yao; Zhou, Zhiqiang; Zhu, Wentao

    2015-01-28

    Metabolite identification and quantitation impose great challenges on risk assessment of agrochemicals, as many metabolite standards are generally unavailable. In this study, metalaxyl metabolites were identified by time-of-flight mass spectrometry and semiquantified by triple quadrupole tandem mass spectrometry with self-prepared (13)C-labeled metalaxyl metabolites as internal standards. This methodology was successfully employed to characterize the stereoselective metabolism of metalaxyl in rat hepatic microsomes. Metabolites derived from hydroxylation, demethylation, and didemethylation were identified and semiquantified. The results indicated that (+)-S-metalaxyl was eliminated preferentially, as the enantiomer fraction was 0.32 after 60 min incubation. The amounts of hydroxymetalaxyl and demethylmetalaxyl derived from (-)-R-metalaxyl were 1.76 and 1.82 times higher than those from (+)-S-metalaxyl, whereas the amount of didemethylmetalaxyl derived from (+)-S-metalaxyl was 1.44 times larger than that from (-)-R-metalaxyl. This study highlights a new quantitation approach for stereoselective metabolism of chiral agrochemicals and provides more knowledge on metalaxyl risk assessment.
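
    For readers unfamiliar with the enantiomer fraction (EF), a minimal sketch follows. It assumes the commonly used definition EF = A / (A + B) for the concentrations of two enantiomers; the abstract does not say which enantiomer the authors place in the numerator, so the argument names here are deliberately neutral.

```python
def enantiomer_fraction(conc_a: float, conc_b: float) -> float:
    """Return EF = A / (A + B) for two enantiomer concentrations.

    An EF of 0.5 corresponds to a racemic mixture; values away from 0.5
    indicate that one enantiomer has been depleted or enriched.
    """
    total = conc_a + conc_b
    if total <= 0:
        raise ValueError("total concentration must be positive")
    return conc_a / total
```

    Under this definition, a value below 0.5 means that the enantiomer in the numerator is the less abundant one at the time of measurement.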

  2. Accurate high-throughput identification of parallel G-quadruplex topology by a new tetraaryl-substituted imidazole.

    Science.gov (United States)

    Hu, Ming-Hao; Chen, Shuo-Bin; Wang, Yu-Qing; Zeng, You-Mei; Ou, Tian-Miao; Li, Ding; Gu, Lian-Quan; Huang, Zhi-Shu; Tan, Jia-Heng

    2016-09-15

    G-quadruplex nucleic acids are four-stranded DNA or RNA secondary structures that are formed in guanine-rich sequences. These structures exhibit extensive structural polymorphism and play a pivotal role in the control of a variety of cellular processes. To date, diverse approaches for high-throughput identification of G-quadruplex structures have been successfully developed, but high-throughput methods for further characterization of their topologies are still lacking. In this study, we report a new tetra-arylimidazole probe psIZCM-1, which was found to display significant and distinctive changes in both the absorption and the fluorescence spectra in the presence of parallel G-quadruplexes but show insignificant changes upon interactions with anti-parallel G-quadruplexes or other non-quadruplex oligonucleotides. In view of this dual-output feature, we used psIZCM-1 to identify the parallel G-quadruplexes from a large set of 314 oligonucleotides (including 300 G-quadruplex-forming oligonucleotides and 14 non-quadruplex oligonucleotides) via a microplate reader and accordingly established a high-throughput method for the characterization of parallel G-quadruplex topologies. The accuracy of this method was greater than 95%, which was much higher than that of the commercial probe NMM. To make the approach more practical, we further combined psIZCM-1 with another G-quadruplex probe IZCM-7 to realize the high-throughput classification of parallel, anti-parallel G-quadruplexes and non-quadruplex structures.

  3. Improving protein identification sensitivity by combining MS and MS/MS information for shotgun proteomics using LTQ-Orbitrap high mass accuracy data.

    Science.gov (United States)

    Lu, Bingwen; Motoyama, Akira; Ruse, Cristian; Venable, John; Yates, John R

    2008-03-15

    We investigated and compared three approaches for shotgun protein identification by combining MS and MS/MS information using LTQ-Orbitrap high mass accuracy data. In the first approach, we employed a unique mass identifier method where MS peaks matched to peptides predicted from proteins identified from an MS/MS database search are first subtracted before using the MS peaks as unique mass identifiers for protein identification. In the second method, we used an accurate mass and time tag method by building a potential mass and retention time database from previous MudPIT analyses. For the third method, we used a peptide mass fingerprinting-like approach in combination with a randomized database for protein identification. We show that we can improve protein identification sensitivity for low-abundance proteins by combining MS and MS/MS information. Furthermore, "one-hit wonders" from MS/MS database searching can be further substantiated by MS information and the approach improves the identification of low-abundance proteins. The advantages and disadvantages for the three approaches are then discussed.
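
    The accurate-mass matching that underlies approaches of this kind can be sketched as a parts-per-million (ppm) tolerance check between observed MS peaks and predicted peptide masses. This is an illustrative sketch only: the function names and the 5 ppm default tolerance are assumptions, not values taken from the study.

```python
from typing import List, Sequence, Tuple


def ppm_error(observed_mz: float, theoretical_mz: float) -> float:
    """Signed mass error of an observed peak in parts per million."""
    return (observed_mz - theoretical_mz) / theoretical_mz * 1e6


def match_peaks(
    observed: Sequence[float],
    theoretical: Sequence[float],
    tolerance_ppm: float = 5.0,  # assumed tolerance, not from the abstract
) -> List[Tuple[float, float]]:
    """Return (observed, theoretical) pairs that agree within the tolerance."""
    matches = []
    for obs in observed:
        for theo in theoretical:
            if abs(ppm_error(obs, theo)) <= tolerance_ppm:
                matches.append((obs, theo))
    return matches
```

    The nested loop is quadratic and is kept for clarity; for large peak lists, sorting the theoretical masses and using a binary search would be the more usual choice.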

  4. Phevor Combines Multiple Biomedical Ontologies for Accurate Identification of Disease-Causing Alleles in Single Individuals and Small Nuclear Families

    Science.gov (United States)

    Singleton, Marc V.; Guthery, Stephen L.; Voelkerding, Karl V.; Chen, Karin; Kennedy, Brett; Margraf, Rebecca L.; Durtschi, Jacob; Eilbeck, Karen; Reese, Martin G.; Jorde, Lynn B.; Huff, Chad D.; Yandell, Mark

    2014-01-01

    Phevor integrates phenotype, gene function, and disease information with personal genomic data for improved power to identify disease-causing alleles. Phevor works by combining knowledge resident in multiple biomedical ontologies with the outputs of variant-prioritization tools. It does so by using an algorithm that propagates information across and between ontologies. This process enables Phevor to accurately reprioritize potentially damaging alleles identified by variant-prioritization tools in light of gene function, disease, and phenotype knowledge. Phevor is especially useful for single-exome and family-trio-based diagnostic analyses, the most commonly occurring clinical scenarios and ones for which existing personal genome diagnostic tools are most inaccurate and underpowered. Here, we present a series of benchmark analyses illustrating Phevor’s performance characteristics. Also presented are three recent Utah Genome Project case studies in which Phevor was used to identify disease-causing alleles. Collectively, these results show that Phevor improves diagnostic accuracy not only for individuals presenting with established disease phenotypes but also for those with previously undescribed and atypical disease presentations. Importantly, Phevor is not limited to known diseases or known disease-causing alleles. As we demonstrate, Phevor can also use latent information in ontologies to discover genes and disease-causing alleles not previously associated with disease. PMID:24702956

  5. Accurate Promoter and Enhancer Identification in 127 ENCODE and Roadmap Epigenomics Cell Types and Tissues by GenoSTAN

    Science.gov (United States)

    Zacher, Benedikt; Michel, Margaux; Schwalb, Björn; Cramer, Patrick; Tresch, Achim

    2017-01-01

    Accurate maps of promoters and enhancers are required for understanding transcriptional regulation. Promoters and enhancers are usually mapped by integration of chromatin assays charting histone modifications, DNA accessibility, and transcription factor binding. However, current algorithms are limited by unrealistic data distribution assumptions. Here we propose GenoSTAN (Genomic STate ANnotation), a hidden Markov model overcoming these limitations. We map promoters and enhancers for 127 cell types and tissues from the ENCODE and Roadmap Epigenomics projects, today’s largest compendium of chromatin assays. Extensive benchmarks demonstrate that GenoSTAN generally identifies promoters and enhancers with significantly higher accuracy than previous methods. Moreover, GenoSTAN-derived promoters and enhancers showed significantly higher enrichment of complex trait-associated genetic variants than current annotations. Altogether, GenoSTAN provides an easy-to-use tool to define promoters and enhancers in any system, and our annotation of human transcriptional cis-regulatory elements constitutes a rich resource for future research in biology and medicine. PMID:28056037

  6. Identification and validation of reference genes for accurate normalization of real-time quantitative PCR data in kiwifruit.

    Science.gov (United States)

    Ferradás, Yolanda; Rey, Laura; Martínez, Óscar; Rey, Manuel; González, Ma Victoria

    2016-05-01

    Identification and validation of reference genes are required for the normalization of qPCR data. We studied the expression stability produced by eight primer pairs amplifying four common genes used as references for normalization. Samples representing different tissues, organs and developmental stages in kiwifruit (Actinidia chinensis var. deliciosa (A. Chev.) A. Chev.) were used. A total of 117 kiwifruit samples were divided into five sample sets (mature leaves, axillary buds, stigmatic arms, fruit flesh and seeds). All samples were also analysed as a single set. The expression stability of the candidate primer pairs was tested using three algorithms (geNorm, NormFinder and BestKeeper). The minimum number of reference genes necessary for normalization was also determined. A unique primer pair was selected for amplifying the 18S rRNA gene. The primer pair selected for amplifying the ACTIN gene was different depending on the sample set. 18S 2 and ACT 2 were the candidate primer pairs selected for normalization in the three sample sets (mature leaves, fruit flesh and stigmatic arms). 18S 2 and ACT 3 were the primer pairs selected for normalization in axillary buds. No primer pair could be selected for use as the reference for the seed sample set. The analysis of all samples in a single set did not produce the selection of any stably expressing primer pair. Considering data previously reported in the literature, we validated the selected primer pairs amplifying the FLOWERING LOCUS T gene for use in the normalization of gene expression in kiwifruit.

  7. LC-MS/MS methods for absolute quantification and identification of proteins associated with chimeric plant oil bodies.

    Science.gov (United States)

    Capuano, Floriana; Bond, Nicholas J; Gatto, Laurent; Beaudoin, Frédéric; Napier, Johnathan A; Benvenuto, Eugenio; Lilley, Kathryn S; Baschieri, Selene

    2011-12-15

    Oil bodies (OBs) are plant cell organelles that consist of a lipid core surrounded by a phospholipid monolayer embedded with specialized proteins such as oleosins. Recombinant proteins expressed in plants can be targeted to OBs as fusions with oleosin. This expression strategy is attractive because OBs are easily enriched and purified from other cellular components, based on their unique physicochemical properties. For recombinant OBs to be a potential therapeutic agent in biomedical applications, it is necessary to comprehensively analyze and quantify both endogenous and heterologously expressed OB proteins. In this study, a mass spectrometry (MS)-based method was developed to accurately quantify an OB-targeted heterologously expressed fusion protein that has potential as a therapeutic agent. The effect of the chimeric oleosin expression upon the OB proteome in transgenic plants was also investigated, and the identification of new potential OB residents was pursued through a variety of liquid chromatography (LC)-MS/MS approaches. The results showed that the accumulation of the fusion protein on OBs was low. Moreover, no significant differences in the accumulation of OB proteins were revealed between transgenic and wild-type seeds. The identification of five new putative components of OB proteome was also reported.

  8. Accurate prediction of secreted substrates and identification of a conserved putative secretion signal for type III secretion systems

    Energy Technology Data Exchange (ETDEWEB)

    Samudrala, Ram; Heffron, Fred; McDermott, Jason E.

    2009-04-24

    The type III secretion system is an essential component for virulence in many Gram-negative bacteria. Though components of the secretion system apparatus are conserved, its substrates, effector proteins, are not. We have used a machine learning approach to identify new secreted effectors. The method integrates evolutionary measures, such as the pattern of homologs in a range of other organisms, and sequence-based features, such as G+C content, amino acid composition and the N-terminal 30 residues of the protein sequence. The method was trained on known effectors from Salmonella typhimurium and validated on a corresponding set of effectors from Pseudomonas syringae, after eliminating effectors with detectable sequence similarity. The method was able to identify all of the known effectors in P. syringae with a specificity of 84% and sensitivity of 82%. The reciprocal validation, training on P. syringae and validating on S. typhimurium, gave similar results with a specificity of 86% when the sensitivity level was 87%. These results show that type III effectors in disparate organisms share common features. We found that maximal performance is attained by including an N-terminal sequence of only 30 residues, which agrees with previous studies indicating that this region contains the secretion signal. We then used the method to define the most important residues in this putative secretion signal. Finally, we present novel predictions of secreted effectors in S. typhimurium, some of which have been experimentally validated, and apply the method to predict secreted effectors in the genetically intractable human pathogen Chlamydia trachomatis. This approach is a novel and effective way to identify secreted effectors in a broad range of pathogenic bacteria for further experimental characterization and provides insight into the nature of the type III secretion signal.
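
    The specificity and sensitivity figures quoted in this record follow the standard confusion-matrix definitions, sketched below. The function names are illustrative, and no counts from the study are assumed, since the abstract reports only percentages.

```python
def sensitivity(true_pos: int, false_neg: int) -> float:
    """Fraction of real effectors that the classifier recovers (recall)."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg: int, false_pos: int) -> float:
    """Fraction of non-effectors that the classifier correctly rejects."""
    return true_neg / (true_neg + false_pos)
```

    Both values depend on the score threshold chosen for the classifier, which is why the abstract reports specificity at a stated sensitivity level.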

  9. Hexapeptide libraries for enhanced protein PTM identification and relative abundance profiling in whole human saliva

    OpenAIRE

    Bandhakavi, Sricharan; van Riper, Susan K.; Tawfik, Pierre N; Matthew D Stone; Haddad, Tufia; Rhodus, Nelson L.; Carlis, John V.; Griffin, Timothy J.

    2011-01-01

    Dynamic range compression (DRC) by hexapeptide libraries increases MS/MS-based identification of lower-abundance proteins in complex mixtures. However, two unanswered questions impede fully realizing DRC’s potential in shotgun proteomics. First, does DRC enhance identification of post-translationally modified proteins? Second, can DRC be incorporated into a workflow enabling relative protein abundance profiling? We sought to answer both questions analyzing human whole saliva. Addressing quest...

  10. Identification, Purification and Characterization of Major Antigenic Proteins of Campylobacter jejuni

    Science.gov (United States)

    1991-01-01

    ELISA - We next examined the potential application of antibodies to C. jejuni proteins for identification and diagnosis of Campylobacter and/or Helico... Fig. 5. Recognition of Campylobacter and Helicobacter cells by antisera to C. jejuni proteins by ELISA. Whole... AD-A271 905, 5 April 1991, Reprint: Identification, Purification, and Characterization of Major Antigenic Proteins of Campylobacter (Army Project Order)

  11. Mass spectrometric identification of proteins and characterization of their post-translational modifications in proteome analysis

    DEFF Research Database (Denmark)

    Roepstorff, P; Larsen, Martin Røssel

    2001-01-01

    dominant strategies for identification of proteins from gels based on peptide mass spectrometric fingerprinting and partial sequencing by mass spectrometry are described. After identification of the proteins the next challenge in proteome analysis is characterization of their post-translational...... modifications. The general problems associated with characterization of these directly from gel separated proteins are described and the current state of art for the determination of phosphorylation, glycosylation and proteolytic processing is illustrated....

  12. Microscopic method in processed animal proteins identification in feed: applications of image analysis

    Directory of Open Access Journals (Sweden)

    Savoini G

    2004-01-01

    Detection and identification of processed animal proteins (PAP) in feedstuffs can be difficult when distinguishing among land animals, i.e. poultry and mammals. Thus, the aim of this study was to evaluate the potential application of image analysis in PAP identification. For this purpose four reference samples containing poultry meals and four reference samples containing mammalian meat and bone meals were used. Each sample was analyzed using the microscopic method (98/88/EC). Bone fragments are characterized by similar morphological features (colours, shape, lacunae shape, lacunae distribution, etc.) that make it difficult to distinguish between poultry and mammals. Using a digital camera and image analysis software, a total of 30 bone fragment lacunae images at ×400 were obtained. For each image, 29 geometric parameters related to the lacunae and 3 geometric parameters related to the canaliculae of the lacunae were measured using the image analysis software, yielding 960 observations. Of the 32 descriptors used, two, the area of the lacunae and their perimeter, were able to explain 96.15% of the total variability of the data, even though their contributions differed (83.97% vs. 12.18%, respectively). Through these two descriptors it was possible to distinguish between mammalian and poultry lacunae, except in two cases (6.6%), in which poultry lacunae were wrongly classified as mammalian. This may be related to the higher variability in lacunae area recorded for mammals compared to poultry. On the basis of the present study, it can be concluded that image analysis is a promising tool for PAP identification that may provide accurate and reliable results in feedstuffs characterisation, analysis and control.

  13. In silico identification of essential proteins in Corynebacterium pseudotuberculosis based on protein-protein interaction networks

    DEFF Research Database (Denmark)

    Folador, Edson Luiz; de Carvalho, Paulo Vinícius Sanches Daltro; Silva, Wanderson Marques;

    2016-01-01

    and decreased production of meat, wool, and milk. Current diagnosis or treatment protocols are not fully effective and, thus, require further research of Cp pathogenesis. RESULTS: Here, we mapped known protein-protein interactions (PPI) from various species to nine Cp strains to reconstruct parts...

  14. Identification of a putative protein profile associated with tamoxifen therapy resistance in breast cancer.

    Science.gov (United States)

    Umar, Arzu; Kang, Hyuk; Timmermans, Annemieke M; Look, Maxime P; Meijer-van Gelder, Marion E; den Bakker, Michael A; Jaitly, Navdeep; Martens, John W M; Luider, Theo M; Foekens, John A; Pasa-Tolić, Ljiljana

    2009-06-01

    Tamoxifen resistance is a major cause of death in patients with recurrent breast cancer. Current clinical factors can correctly predict therapy response in only half of the treated patients. Identification of proteins that are associated with tamoxifen resistance is a first step toward better response prediction and tailored treatment of patients. In the present study we intended to identify putative protein biomarkers indicative of tamoxifen therapy resistance in breast cancer using nano-LC coupled with FTICR MS. Comparative proteome analysis was performed on approximately 5,500 pooled tumor cells (corresponding to approximately 550 ng of protein lysate/analysis) obtained through laser capture microdissection (LCM) from two independently processed data sets (n = 24 and n = 27) containing both tamoxifen therapy-sensitive and therapy-resistant tumors. Peptides and proteins were identified by matching mass and elution time of newly acquired LC-MS features to information in previously generated accurate mass and time tag reference databases. A total of 17,263 unique peptides were identified that corresponded to 2,556 non-redundant proteins identified with ≥ 2 peptides. 1,713 overlapping proteins between the two data sets were used for further analysis. Comparative proteome analysis revealed 100 putatively differentially abundant proteins between tamoxifen-sensitive and tamoxifen-resistant tumors. The presence and relative abundance for 47 differentially abundant proteins were verified by targeted nano-LC-MS/MS in a selection of unpooled, non-microdissected discovery set tumor tissue extracts. ENPP1, EIF3E, and GNB4 were significantly associated with progression-free survival upon tamoxifen treatment for recurrent disease. Differential abundance of our top discriminating protein, extracellular matrix metalloproteinase inducer, was validated by tissue microarray in an independent patient cohort (n = 156).
Extracellular matrix metalloproteinase inducer levels were

  15. Identification of a novel resident centrosomal protein

    Institute of Scientific and Technical Information of China (English)

    2001-01-01

    One human autoimmune serum was identified to react with centrosomes by immunofluorescence. We applied the affinity purification of membrane-bound antibody technique and demonstrated that the antibodies present in this antiserum reacted with a 31/29 ku centrosomal antigen. Immunofluorescence showed that this antigen is located at the centrosome in a cell-cycle-independent manner, and thereby it belongs to the family of centrosomal residents. We then utilized this autoimmune serum and antibodies against centrin and gamma-tubulin to investigate changes of centrosome cycle kinetics during premature chromosome condensation (PCC) artificially induced in V79-8 cells. We show here that centrosomal proteins continue to be expressed when cells are synchronized at the G1/S boundary and in S phase by hydroxyurea (HU). During this time, the addition of caffeine causes cells with unreplicated genomes to go into mitosis, and induces the separation of the replicated centrosomes. These results suggest that the coordination of DNA synthesis and centrosome replication in the normal cell cycle can be uncoupled. Cells ensure that the centrosome duplicates once, and only once, during each DNA synthesis cycle through the tight and subtle coordination of cell cycle engine molecules, thereby ensuring the assembly of a bipolar spindle and the accurate transmission of genetic information.

  16. Identification of metastasis-associated proteins in a human tumor metastasis model using the mass-mapping technique

    Science.gov (United States)

    Kreunin, Paweena; Urquidi, Virginia; Lubman, David M; Goodison, Steve

    2005-01-01

    For most cancer cell types, the acquisition of metastatic ability leads to clinically incurable disease. The identification of molecules whose expression is specifically correlated with the metastatic spread of cancer would facilitate the design of therapeutic interventions to inhibit this lethal process. In order to facilitate metastasis gene discovery we have previously characterized a pair of monoclonal cell lines from the human breast carcinoma cell line MDA-MB-435 that have different metastatic phenotypes in immune-compromised mice. In this study, serum-free conditioned media was collected from the cultured monoclonal cell lines and a mass mapping technique was applied in order to profile a component of each cell line proteome. We utilized chromatofocusing in the first dimension to obtain a high resolution separation based on protein pI, and nonporous silica reverse-phase high performance liquid chromatography was used for the second dimension. Selected proteins were identified on the basis of electrospray ionization time of flight mass spectrometry (ESI-TOF MS) intact protein mapping and matrix-assisted laser desorption/ionization time of flight mass spectrometry (MALDI-TOF MS) peptide mass fingerprinting. Using this approach we were able to map over 400 proteins and plot them as a 2-D map of pI versus accurate Mr. This was performed over a pI range of 4.0–6.2, and a mass range of 6–80 kDa. ESI-TOF MS data and further analysis using MALDI-TOF MS confirmed and identified 27 differentially expressed proteins. Proteins associated with the metastatic phenotype included osteopontin and extracellular matrix protein 1, whereas the matrix metalloproteinase-1 and annexin 1 proteins were associated with the non-metastatic phenotype. These findings demonstrate that the mass mapping technique is a powerful tool for the detection and identification of proteins in complex biological samples and which are specifically associated with a cellular phenotype. PMID:15352249

  17. Identification and localization of the structural proteins of anguillid herpesvirus 1

    Directory of Open Access Journals (Sweden)

    van Beurden Steven J

    2011-10-01

    Many of the known fish herpesviruses have important aquaculture species as their natural host, and may cause serious disease and mortality. Anguillid herpesvirus 1 (AngHV-1) causes a hemorrhagic disease in European eel, Anguilla anguilla. Despite their importance, fundamental molecular knowledge of fish herpesviruses is still limited. In this study we describe the identification and localization of the structural proteins of AngHV-1. Purified virions were fractionated into a capsid-tegument and an envelope fraction, and premature capsids were isolated from infected cells. Proteins were extracted by different methods and identified by mass spectrometry. A total of 40 structural proteins were identified, of which 7 could be assigned to the capsid, 11 to the envelope, and 22 to the tegument. The identification and localization of these proteins allowed functional predictions. Our findings include the identification of the putative capsid triplex protein 1, the predominant tegument protein, and the major antigenic envelope proteins. Eighteen of the 40 AngHV-1 structural proteins had sequence homologues in the related Cyprinid herpesvirus 3 (CyHV-3). Conservation of fish herpesvirus structural genes seemed to be high for the capsid proteins, limited for the tegument proteins, and low for the envelope proteins. The identification and localization of the structural proteins of AngHV-1 in this study add to the fundamental knowledge of members of the Alloherpesviridae family, especially of the Cyprinivirus genus.

  18. Decision peptide-driven: a free software tool for accurate protein quantification using gel electrophoresis and matrix assisted laser desorption ionization time of flight mass spectrometry.

    Science.gov (United States)

    Santos, Hugo M; Reboiro-Jato, Miguel; Glez-Peña, Daniel; Nunes-Miranda, J D; Fdez-Riverola, Florentino; Carvallo, R; Capelo, J L

    2010-09-15

    The decision peptide-driven (DPD) tool is a software application that assists the user with a protocol for accurate protein quantification based on the following steps: (1) protein separation through gel electrophoresis; (2) in-gel protein digestion; (3) direct and inverse (18)O-labeling; and (4) matrix-assisted laser desorption ionization time-of-flight mass spectrometry (MALDI) analysis. The DPD software compares the MALDI results of the direct and inverse (18)O-labeling experiments and quickly identifies those peptides with paralleled losses in different sets of a typical proteomic workflow. Those peptides are used for subsequent accurate protein quantification. Interpreting the MALDI data from direct and inverse labeling experiments is time-consuming when all comparisons are done manually. The DPD software shortens the search for the peptides to be used for quantification from a week to a few minutes. To do so, it takes several MALDI spectra as input and automatically helps the researcher (i) to compare data from direct and inverse (18)O-labeling experiments, calculating the corresponding ratios to determine those peptides with paralleled losses throughout different sets of experiments; and (ii) to use those peptides as internal standards for subsequent accurate protein quantification using (18)O-labeling. In this work the DPD software is presented and explained with the quantification of the protein carbonic anhydrase.
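
    The ratio comparison described in this abstract can be illustrated with a minimal sketch. The abstract does not state the selection rule used by the DPD software, so the criterion below (the direct and inverse 18O/16O ratios should multiply to about 1), the `tolerance` value, the class and function names, and the peptide values are all assumptions made for illustration only.

```python
from dataclasses import dataclass


@dataclass
class PeptidePair:
    mz: float             # peptide m/z (hypothetical values)
    direct_ratio: float   # 18O/16O intensity ratio, direct labeling
    inverse_ratio: float  # 18O/16O intensity ratio, inverse labeling


def select_decision_peptides(pairs, tolerance=0.15):
    """Keep peptides whose direct and inverse ratios mirror each other.

    Assumed criterion (not taken from the DPD paper): the product of the
    two ratios lies within `tolerance` of 1.
    """
    selected = []
    for p in pairs:
        if p.direct_ratio <= 0 or p.inverse_ratio <= 0:
            continue  # a non-positive ratio cannot be compared
        if abs(p.direct_ratio * p.inverse_ratio - 1.0) <= tolerance:
            selected.append(p)
    return selected


peptides = [
    PeptidePair(1013.6, 2.00, 0.50),  # product 1.00, kept
    PeptidePair(1581.8, 1.90, 0.70),  # product 1.33, rejected
    PeptidePair(2198.2, 0.95, 1.10),  # product 1.045, kept
]
decision = select_decision_peptides(peptides)
print([p.mz for p in decision])
```

    Peptides that pass such a filter would then be the candidates for use as internal standards; the real tool may apply a different or stricter test.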

  19. Identification of peptide and protein doping related drug compounds confiscated in Denmark between 2007-2013

    DEFF Research Database (Denmark)

    Hartvig, Rune Andersen; Holm, Niels Bjerre; Dalsgaard, Petur Weihe

    2014-01-01

    We present an overview of protein and peptide compounds confiscated in Denmark from late 2007 till late 2013 together with a description of a newly developed HRAM-LC-MS method used for identification. As examples of identification, we present data for the peptides AOD-9604, [D-Ala2, Gln8, Ala15, ...

  20. Choosing an Optimal Database for Protein Identification from Tandem Mass Spectrometry Data.

    Science.gov (United States)

    Kumar, Dhirendra; Yadav, Amit Kumar; Dash, Debasis

    2017-01-01

    Database searching is the preferred method for protein identification from digital spectra of mass-to-charge ratios (m/z) detected for protein samples by mass spectrometers. The search database is one of the major factors influencing which proteins are discovered in the sample, and thus the biological conclusions derived. In most cases the choice of search database is arbitrary. Here we describe common search databases used in proteomic studies and their impact on the final list of identified proteins. We also elaborate on factors, such as the composition and size of the search database, that can influence the protein identification process. In conclusion, we suggest that the choice of database depends on the type of inferences to be derived from the proteomics data. However, making the additional effort to build a compact and concise database for a targeted question should generally be rewarded with confident protein identifications.

  1. Identification of Novel Perfluoroalkyl Ether Carboxylic Acids (PFECAs) and Sulfonic Acids (PFESAs) in Natural Waters Using Accurate Mass Time-of-Flight Mass Spectrometry (TOFMS).

    Science.gov (United States)

    Strynar, Mark; Dagnino, Sonia; McMahen, Rebecca; Liang, Shuang; Lindstrom, Andrew; Andersen, Erik; McMillan, Larry; Thurman, Michael; Ferrer, Imma; Ball, Carol

    2015-10-06

    Recent scientific scrutiny and concerns over exposure, toxicity, and risk have led to international regulatory efforts resulting in the reduction or elimination of certain perfluorinated compounds from various products and waste streams. Some manufacturers have started producing shorter chain per- and polyfluorinated compounds to try to reduce the potential for bioaccumulation in humans and wildlife. Some of these new compounds contain central ether oxygens or other minor modifications of traditional perfluorinated structures. At present, there has been very limited information published on these "replacement chemistries" in the peer-reviewed literature. In this study we used a time-of-flight mass spectrometry detector (LC-ESI-TOFMS) to identify fluorinated compounds in natural waters collected from locations with historical perfluorinated compound contamination. Our workflow for discovery of chemicals included sequential sampling of surface water for identification of potential sources, nontargeted TOFMS analysis, molecular feature extraction (MFE) of samples, and evaluation of features unique to the sample with source inputs. Specifically, compounds were tentatively identified by (1) accurate mass determination of parent and/or related adducts and fragments from in-source collision-induced dissociation (CID), (2) in-depth evaluation of in-source adducts formed during analysis, and (3) confirmation with authentic standards when available. We observed groups of compounds in homologous series that differed by multiples of CF2 (m/z 49.9968) or CF2O (m/z 65.9917). Compounds in each series were chromatographically separated and had comparable fragments and adducts produced during analysis. We detected 12 novel perfluoroalkyl ether carboxylic and sulfonic acids in surface water in North Carolina, USA using this approach. A key piece of evidence was the discovery of accurate mass in-source n-mer formation (H(+) and Na(+)) differing by m/z 21.9819, corresponding to the
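
    The homologous-series observation in this abstract (series members differing by multiples of CF2 or CF2O) can be sketched as a simple mass-difference search. This is only an illustration, not the authors' workflow: the two repeat-unit masses come from the abstract, while the function name, the tolerance, the maximum number of repeat units, and the feature list are hypothetical.

```python
CF2 = 49.9968    # repeat-unit mass difference quoted in the abstract
CF2O = 65.9917   # repeat-unit mass difference quoted in the abstract


def homologous_pairs(mz_values, unit, max_units=4, tol=0.002):
    """Return (lighter, heavier, n) where heavier - lighter is about n * unit."""
    mzs = sorted(mz_values)
    hits = []
    for i, low in enumerate(mzs):
        for high in mzs[i + 1:]:
            diff = high - low
            n = round(diff / unit)  # nearest whole number of repeat units
            if 1 <= n <= max_units and abs(diff - n * unit) <= tol:
                hits.append((low, high, n))
    return hits


# Hypothetical accurate-mass features, not data from the study.
features = [200.0000, 249.9968, 299.9936, 265.9917, 310.5000]

cf2_hits = homologous_pairs(features, CF2)
cf2o_hits = homologous_pairs(features, CF2O)
print(cf2_hits)
print(cf2o_hits)
```

    A real nontargeted analysis would also require matching retention-time trends and fragments, as the abstract notes, before a series is accepted.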

  2. Identification of proteins in the postsynaptic density fraction by mass spectrometry

    DEFF Research Database (Denmark)

    Walikonis, R S; Jensen, Ole Nørregaard; Mann, M

    2000-01-01

    Our understanding of the organization of postsynaptic signaling systems at excitatory synapses has been aided by the identification of proteins in the postsynaptic density (PSD) fraction, a subcellular fraction enriched in structures with the morphology of PSDs. In this study, we have completed...... the identification of most major proteins in the PSD fraction with the use of an analytical method based on mass spectrometry coupled with searching of the protein sequence databases. At least one protein in each of 26 prominent protein bands from the PSD fraction has now been identified. We found 7 proteins...... not previously known to be constituents of the PSD fraction and 24 that had previously been associated with the PSD by other methods. The newly identified proteins include the heavy chain of myosin-Va (dilute myosin), a motor protein thought to be involved in vesicle trafficking, and the mammalian homolog...

  3. PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

    Directory of Open Access Journals (Sweden)

    Liqi Li

    Protein structure prediction is critical to functional annotation of the massively accumulated biological sequences, which prompts an imperative need for the development of high-throughput technologies. As a first and key step in protein structure prediction, protein structural class prediction becomes an increasingly challenging task. Amongst most homological-based approaches, the accuracies of protein structural class prediction are sufficiently high for high similarity datasets, but still far from being satisfactory for low similarity datasets, i.e., below 40% in pairwise sequence similarity. Therefore, we present a novel method for accurate and reliable protein structural class prediction for both high and low similarity datasets. This method is based on Support Vector Machine (SVM) in conjunction with integrated features from position-specific score matrix (PSSM), PROFEAT and Gene Ontology (GO). A feature selection approach, SVM-RFE, is also used to rank the integrated feature vectors through recursively removing the feature with the lowest ranking score. The definitive top features selected by SVM-RFE are input into the SVM engines to predict the structural class of a query protein. To validate our method, jackknife tests were applied to seven widely used benchmark datasets, reaching overall accuracies between 84.61% and 99.79%, which are significantly higher than those achieved by state-of-the-art tools. These results suggest that our method could serve as an accurate and cost-effective alternative to existing methods in protein structural classification, especially for low similarity datasets.

  4. PSSP-RFE: accurate prediction of protein structural class by recursive feature extraction from PSI-BLAST profile, physical-chemical property and functional annotations.

    Science.gov (United States)

    Li, Liqi; Cui, Xiang; Yu, Sanjiu; Zhang, Yuan; Luo, Zhong; Yang, Hua; Zhou, Yue; Zheng, Xiaoqi

    2014-01-01

    Protein structure prediction is critical to functional annotation of the massively accumulated biological sequences, which prompts an imperative need for the development of high-throughput technologies. As a first and key step in protein structure prediction, protein structural class prediction becomes an increasingly challenging task. Amongst most homological-based approaches, the accuracies of protein structural class prediction are sufficiently high for high similarity datasets, but still far from being satisfactory for low similarity datasets, i.e., below 40% in pairwise sequence similarity. Therefore, we present a novel method for accurate and reliable protein structural class prediction for both high and low similarity datasets. This method is based on Support Vector Machine (SVM) in conjunction with integrated features from position-specific score matrix (PSSM), PROFEAT and Gene Ontology (GO). A feature selection approach, SVM-RFE, is also used to rank the integrated feature vectors through recursively removing the feature with the lowest ranking score. The definitive top features selected by SVM-RFE are input into the SVM engines to predict the structural class of a query protein. To validate our method, jackknife tests were applied to seven widely used benchmark datasets, reaching overall accuracies between 84.61% and 99.79%, which are significantly higher than those achieved by state-of-the-art tools. These results suggest that our method could serve as an accurate and cost-effective alternative to existing methods in protein structural classification, especially for low similarity datasets.
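
    The jackknife (leave-one-out) validation mentioned in this abstract can be sketched in a few lines. To keep the example dependency-free, a nearest-centroid classifier stands in for the SVM, so this is not the PSSP-RFE method itself; the feature vectors, class labels, and function names are hypothetical.

```python
def nearest_centroid_predict(train_X, train_y, x):
    """Predict the label whose class centroid is closest to x (SVM stand-in)."""
    sums, counts = {}, {}
    for features, label in zip(train_X, train_y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for j, value in enumerate(features):
            acc[j] += value
        counts[label] = counts.get(label, 0) + 1
    best_label, best_dist = None, float("inf")
    for label, acc in sums.items():
        centroid = [s / counts[label] for s in acc]
        dist = sum((c - v) ** 2 for c, v in zip(centroid, x))
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label


def jackknife_accuracy(X, y):
    """Hold out each sample once, train on the rest, and score the held-out one."""
    correct = 0
    for i in range(len(X)):
        train_X = X[:i] + X[i + 1:]
        train_y = y[:i] + y[i + 1:]
        if nearest_centroid_predict(train_X, train_y, X[i]) == y[i]:
            correct += 1
    return correct / len(X)


# Hypothetical two-feature vectors for two structural classes.
X = [[0.90, 0.10], [0.80, 0.20], [0.85, 0.15],
     [0.10, 0.90], [0.20, 0.80], [0.15, 0.85]]
y = ["all-alpha"] * 3 + ["all-beta"] * 3

accuracy = jackknife_accuracy(X, y)
print(f"jackknife accuracy: {accuracy:.2%}")
```

    In the setting the abstract describes, feature selection would have to be repeated inside each held-out round to avoid an optimistic estimate; the sketch omits feature selection entirely.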

  5. Identification and characterization of the surface proteins of Clostridium difficile

    Energy Technology Data Exchange (ETDEWEB)

    Dailey, D.C.

    1988-01-01

    Several clostridial proteins were detected on the clostridial cell surface by sensitive radioiodination techniques. Two major proteins and six minor proteins comprised the radioiodinated proteins on the clostridial cell surface. Cellular fractionation of surface radiolabeled C. difficile determined that the radioiodinated proteins were found in the cell wall fraction of C. difficile and surprisingly were also present in the clostridial membrane. Furthermore, an interesting phenomenon of disulfide-crosslinking of the cell surface proteins of C. difficile was observed. Disulfide-linked protein complexes were found in both the membrane and cell wall fractions. In addition, the cell surface proteins of C. difficile were found to be released into the culture medium. In attempts to further characterize the clostridial proteins recombinant DNA techniques were employed. In addition, the role of the clostridial cell surface proteins in the interactions of C. difficile with human PMNs was also investigated.

  6. Identification and characterization of secreted proteins in Eimeria tenella

    Science.gov (United States)

    Ramlee, Intan Azlinda; Firdaus-Raih, Mohd; Wan, Kiew-Lian

    2015-09-01

    Eimeria tenella is a protozoan parasite that causes coccidiosis, an economically important disease in the poultry industry. Proteins secreted by parasites have been shown to play important roles in parasite invasion and are considered potential control agents. In this study, 775 proteins potentially secreted by E. tenella were identified. These proteins were further filtered to remove mitochondrial proteins. Of the remaining 763 putative secreted proteins, 259 possess transmembrane domains and another 150 have GPI (glycosylphosphatidylinositol) anchors. A homology search revealed that 315 and 448 proteins match known and hypothetical proteins in the database, respectively. Within this data set, previously characterized secretory proteins such as microneme proteins, rhoptry kinases and dense granule proteins were detected.

  7. microTSS: accurate microRNA transcription start site identification reveals a significant number of divergent pri-miRNAs.

    Science.gov (United States)

    Georgakilas, Georgios; Vlachos, Ioannis S; Paraskevopoulou, Maria D; Yang, Peter; Zhang, Yuhong; Economides, Aris N; Hatzigeorgiou, Artemis G

    2014-12-10

    A large fraction of microRNAs (miRNAs) are derived from intergenic non-coding loci and the identification of their promoters remains 'elusive'. Here, we present microTSS, a machine-learning algorithm that provides highly accurate, single-nucleotide resolution predictions for intergenic miRNA transcription start sites (TSSs). MicroTSS integrates high-resolution RNA-sequencing data with active transcription marks derived from chromatin immunoprecipitation and DNase-sequencing to enable the characterization of tissue-specific promoters. MicroTSS is validated with a specifically designed Drosha-null/conditional-null mouse model, generated using the conditional by inversion (COIN) methodology. Analyses of global run-on sequencing data revealed numerous pri-miRNAs in human and mouse either originating from divergent transcription at promoters of active genes or partially overlapping with annotated long non-coding RNAs. MicroTSS is readily applicable to any cell or tissue samples and constitutes the missing part towards integrating the regulation of miRNA transcription into the modelling of tissue-specific regulatory networks.

  8. Identification of Bile Duct Paucity in Alagille Syndrome: Using CK7 and EMA Immunohistochemistry as a Reliable Panel for Accurate Diagnosis.

    Science.gov (United States)

    Herman, Haley K; Abramowsky, Carlos R; Caltharp, Shelley; Metry, Diana; Cundiff, Caitlin A; Romero, Rene; Gillespie, Scott E; Shehata, Bahig M

    2016-01-01

    Bile duct paucity is the absence or marked reduction in the number of interlobular bile ducts (ILBD) within portal tracts. Its syndromic variant, Alagille syndrome (ALGS), is a multisystem disorder with effects on the liver, cardiovascular system, skeleton, face, and eyes. It is inherited as an autosomal dominant trait due to defects in NOTCH signaling pathway. ALGS is characterized by vanishing ILBD with subsequent chronic obstructive cholestasis in approximately 89% of cases. Cholestasis stimulates formation of new bile ductules through a process of neoductular reaction, making it difficult to evaluate the presence or absence of ILBD. Therefore, finding a method to differentiate clearly between ILBD and the ductular proliferation is essential for accurate diagnosis. A database search identified 28 patients with confirmed diagnosis of ALGS between 1992 and 2014. Additionally, 7 controls were used. A panel of two immunostains, cytokeratin 7 (CK7) and epithelial membrane antigen (EMA), was performed. CK7 highlighted the bile duct epithelium of ILBD and ductular proliferation, while EMA stained only the brush border of ILBD. In our ALGS group, the ratio of EMA-positive ILBD to identified portal tracts was 12.6% (range, 0%-41%). However, this same ratio was 95.0% (range, 90%-100%) among control cases (P EMA, to differentiate ILBD from ductular proliferation in patients with cholestasis. With this panel, identification of bile duct paucity can be achieved. Additional studies, including molecular confirmation and clinical correlation, would provide a definitive diagnosis of ALGS.

  9. An accurate and reliable method for identification and quantification of fatty acids and trans fatty acids in food fats samples using gas chromatography

    Directory of Open Access Journals (Sweden)

    Jumat Salimon

    2017-05-01

    A method was developed for the separation, identification and quantification of fatty acids (FAs) and trans fatty acids (TFAs) by gas chromatography (GC), combining lipid extraction with derivatization by the base-catalysed method followed by trimethylsilyl-diazomethane (TMS-DM). The proposed method was found to allow sensitive and accurate determination of a wide range of different types of FAs, including TFA isomers. The method was validated on real samples of dietary fat from hydrogenated edible oils (margarine) and nine standard FAs as representatives of margarines. For this purpose, response linearity, limit of detection (LOD), limit of quantification (LOQ), precision and recovery (R%) were all determined. Recovery values for all samples were close to 100%, repeatability RSD ranged between 0.89% and 2.34%, and reproducibility RSD ranged between 1.46% and 3.72%. The applicability of the method was demonstrated on four margarine samples, and it was compared with the reference method. In general, the results showed that the proposed method is suitable for the analysis of FAs and is more effective in TFA analysis than the classic methods. Thus, it could be an effective tool for analysing dietary fats and oils in complex mixtures of food products, for monitoring low levels of FAs and TFAs, and for controlling labelling authenticity.
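
    The two validation figures reported in this abstract, recovery (R%) and relative standard deviation (RSD), follow standard definitions, which the short sketch below computes. The replicate values and the spiked amount are invented for illustration and are not data from the study.

```python
from statistics import mean, stdev


def recovery_percent(measured, spiked):
    """Recovery: measured amount as a percentage of the known spiked amount."""
    return 100.0 * measured / spiked


def rsd_percent(values):
    """Relative standard deviation: sample standard deviation over the mean."""
    return 100.0 * stdev(values) / mean(values)


# Hypothetical replicate measurements (mg/g) of a standard spiked at 10.0 mg/g.
replicates = [9.8, 10.1, 10.0, 9.9, 10.2]
spiked_amount = 10.0

recovery = recovery_percent(mean(replicates), spiked_amount)
rsd = rsd_percent(replicates)
print(f"recovery = {recovery:.1f}%, RSD = {rsd:.2f}%")
```

    Repeatability RSD would be computed from replicates run under the same conditions, and reproducibility RSD from replicates run across days or analysts, using the same formula.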

  10. An accurate binding interaction model in de novo computational protein design of interactions: if you build it, they will bind.

    Science.gov (United States)

    London, Nir; Ambroggio, Xavier

    2014-02-01

    Computational protein design efforts aim to create novel proteins and functions in an automated manner and, in the process, these efforts shed light on the factors shaping natural proteins. The focus of these efforts has progressed from the interior of proteins to their surface and the design of functions, such as binding or catalysis. Here we examine progress in the development of robust methods for the computational design of non-natural interactions between proteins and molecular targets such as other proteins or small molecules. This problem is referred to as the de novo computational design of interactions. Recent successful efforts in de novo enzyme design and the de novo design of protein-protein interactions open a path towards solving this problem. We examine the common themes in these efforts, and review recent studies aimed at understanding the nature of successes and failures in the de novo computational design of interactions. While several approaches culminated in success, the use of a well-defined structural model for a specific binding interaction in particular has emerged as a key strategy for a successful design, and is therefore reviewed with special consideration. Copyright © 2013 Elsevier Inc. All rights reserved.

  11. Identification of phosphorylation sites in protein kinase A substrates using artificial neural networks and mass spectrometry

    DEFF Research Database (Denmark)

    Hjerrild, Majbrit; Stensballe, Allan; Rasmussen, Thomas E

    2011-01-01

    Protein phosphorylation plays a key role in cell regulation and identification of phosphorylation sites is important for understanding their functional significance. Here, we present an artificial neural network algorithm: NetPhosK (http://www.cbs.dtu.dk/services/NetPhosK/) that predicts protein...

  12. Identification of cardiac myofilament protein isoforms using multiple mass spectrometry based approaches

    NARCIS (Netherlands)

    Kooij, V.; Venkatraman, V.; Kirk, J.A.; Ubaida-Mohien, C.; Graham, D.R.; Faber, M.J.; Eyk, J.E. Van

    2014-01-01

    PURPOSE: The identification of protein isoforms in complex biological samples is challenging. We, therefore, used an MS approach to unambiguously identify cardiac myofilament protein isoforms based on the observation of a tryptic peptide consisting of a sequence unique to a particular isoform. EXPER
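
    The idea of identifying an isoform through a tryptic peptide unique to it can be sketched with an in-silico digest. The cleavage rule below (cut after K or R unless the next residue is P) is the usual simplified trypsin rule; the sequences, isoform names, and minimum peptide length are hypothetical and are not taken from the study.

```python
import re


def tryptic_peptides(sequence, min_length=5):
    """Cleave after K or R unless followed by P (simplified trypsin rule)."""
    pieces = re.split(r"(?<=[KR])(?!P)", sequence)
    return {p for p in pieces if len(p) >= min_length}


def isoform_unique_peptides(isoforms, min_length=5):
    """Map each isoform name to the peptides found in no other isoform."""
    digests = {name: tryptic_peptides(seq, min_length)
               for name, seq in isoforms.items()}
    unique = {}
    for name, peptides in digests.items():
        others = set().union(
            *(d for other, d in digests.items() if other != name))
        unique[name] = peptides - others
    return unique


# Hypothetical sequences that differ by a single residue (E versus D).
isoforms = {
    "isoform_A": "MTDAQKLEEAGRSVVPLKGGYTR",
    "isoform_B": "MTDAQKLEDAGRSVVPLKGGYTR",
}

unique = isoform_unique_peptides(isoforms)
print(unique)
```

    Observing one of these unique peptides in an MS/MS experiment would then point to a single isoform, whereas shared peptides such as MTDAQK cannot distinguish between them.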

  13. Identification of phosphorylation sites in protein kinase A substrates using artificial neural networks and mass spectrometry

    DEFF Research Database (Denmark)

    Hjerrild, M.; Stensballe, A.; Rasmussen, T.E.;

    2004-01-01

    Protein phosphorylation plays a key role in cell regulation and identification of phosphorylation sites is important for understanding their functional significance. Here, we present an artificial neural network algorithm: NetPhosK (http://www.cbs.dtu.dk/services/NetPhosK/) that predicts protein...

  15. Identification of fibrin clot-bound plasma proteins.

    Directory of Open Access Journals (Sweden)

    Simone Talens

    Several proteins are known to bind to a fibrin network and to change clot properties or function. In this study we aimed to get an overview of fibrin clot-bound plasma proteins. A plasma clot was formed by adding thrombin, CaCl2 and aprotinin to citrated platelet-poor plasma, and unbound proteins were washed away with Tris-buffered saline. Non-covalently bound proteins were extracted, separated with 2D gel electrophoresis and visualized with Sypro Ruby. Excised protein spots were analyzed with mass spectrometry. The identity of the proteins was verified by checking the mass of the protein and, if necessary, by Western blot analysis. In addition to established fibrin-binding proteins we identified several novel fibrin clot-bound plasma proteins, including α2-macroglobulin, carboxypeptidase N, α1-antitrypsin, haptoglobin, serum amyloid P, and the apolipoproteins A-I, E, J, and A-IV. The latter six proteins are associated with high-density lipoprotein particles. In addition we showed that high-density lipoprotein associated proteins were also present in fibrinogen preparations purified from plasma. Most plasma proteins in a fibrin clot can be classified into three groups: blood coagulation, protease inhibition, or high-density lipoprotein metabolism. The presence of high-density lipoprotein in clots might point to a role in hemostasis.

  16. Gel Electrophoresis of Proteins for the Identification of Crop Varieties

    Institute of Scientific and Technical Information of China (English)

    LAN Hai-yan; LI Li-hui

    2002-01-01

    With the development of international trade and of agricultural science and technology, and especially since the rules on the protection of new plant varieties came into force, considerable emphasis has been placed on variety identification. Much evidence suggests that gel electrophoresis has had a great influence in this area. This paper reviews the state of research on the various gel electrophoresis techniques, including the development of the methods, a comparison of the techniques, influencing factors, practical applications, the achievements obtained, and directions for future study. As the protection of new plant varieties widens in China, electrophoresis will play a more important role in variety identification.

  17. Identification of proteins associated with amyloidosis by polarity index method.

    Science.gov (United States)

    Polanco, Carlos; Samaniego, José Lino; Uversky, Vladimir N; Castañón-González, Jorge Alberto; Buhse, Thomas; Leopold-Sordo, Marili; Madero-Arteaga, Alejandro; Morales-Reyes, Alicia; Tavera-Sierra, Lourdes; González-Bernal, Jesus A; Arias-Estrada, Miguel

    2015-01-01

    There is a natural protein form, insoluble and resistant to proteolysis, adopted by many proteins independently of their amino acid sequences via specific misfolding-aggregation process. This dynamic process occurs in parallel with or as an alternative to physiologic folding, generating toxic protein aggregates that are deposited and accumulated in various organs and tissues. These proteinaceous deposits typically represent bundles of β-sheet-enriched fibrillar species known as the amyloid fibrils that are responsible for serious pathological conditions, including but not limited to neurodegenerative diseases, grouped under the term amyloidoses. The proteins that might adopt this fibrillar conformation are some globular proteins and natively unfolded (or intrinsically disordered) proteins. Our work shows that intrinsically disordered and intrinsically ordered proteins can be reliably identified, discriminated, and differentiated by analyzing their polarity profiles generated using a computational tool known as the polarity index method (Polanco & Samaniego, 2009; Polanco et al., 2012; 2013; 2013a; 2014; 2014a; 2014b; 2014c; 2014d). We also show that proteins expressed in neurons can be differentiated from proteins in these two groups based on their polarity profiles, and also that this computational tool can be used to identify proteins associated with amyloidoses. The efficiency of the proposed method is high (i.e. 70%) as evidenced by the analysis of peptides and proteins in the APD2 database (2012), AVPpred database (2013), and CPPsite database (2013), the set of selective antibacterial peptides from del Rio et al. (2001), the sets of natively unfolded and natively folded proteins from Oldfield et al. (2005), the set of human revised proteins expressed in neurons, and non-human revised proteins expressed in neurons, from the Uniprot database (2014), and also the set of amyloidogenic proteins from the AmyPDB database (2014).

  18. Identification of a Protein that Purifies with the Scrapie Prion

    Science.gov (United States)

    Bolton, David C.; McKinley, Michael P.; Prusiner, Stanley B.

    1982-12-01

    Purification of prions from scrapie-infected hamster brain yielded a protein that was not found in a similar fraction from uninfected brain. The protein migrated with an apparent molecular size of 27,000 to 30,000 daltons in sodium dodecyl sulfate polyacrylamide gels. The resistance of this protein to digestion by proteinase K distinguished it from proteins of similar molecular weight found in normal hamster brain. Initial results suggest that the amount of this protein correlates with the titer of the agent.

  19. Proteins of human milk. I. Identification of major components

    Energy Technology Data Exchange (ETDEWEB)

    Anderson, N.G.; Powers, M.T.; Tollaksen, S.L.

    1982-04-01

    Traditionally, human milk proteins are identified largely by reference to bovine milk. Hence, to identify the major proteins in human milk, we subjected human and bovine milk, in parallel, to high-resolution two-dimensional electrophoresis. Isoelectric precipitation at pH 4.6 was our criterion for distinguishing whey proteins from those of the casein complex. The α- and β-caseins were identified on the basis of relative abundance, relative molecular mass, and relative isoelectric points. No protein disappeared from ISO-DALT patterns of human milk after rennin treatment, and no new protein comparable to bovine para-κ-casein appeared in the BASO-DALT patterns; this suggests that κ-casein is absent from human milk. The proteins identified in human milk patterns include the α- and β-casein families, lactalbumin, albumin, transferrin, IgA, and lactoferrin. Numerous additional proteins seen in patterns for human milk remain to be identified.

  20. Identification of Topological Network Modules in Perturbed Protein Interaction Networks

    Science.gov (United States)

    Sardiu, Mihaela E.; Gilmore, Joshua M.; Groppe, Brad; Florens, Laurence; Washburn, Michael P.

    2017-01-01

    Biological networks consist of functional modules; however, detecting and characterizing such modules in networks remains challenging. Perturbing networks is one strategy for identifying modules. Here we used an advanced mathematical approach named topological data analysis (TDA) to interrogate two perturbed networks. In one, we disrupted the S. cerevisiae INO80 protein interaction network by isolating complexes after protein complex components were deleted from the genome. In the second, we reanalyzed previously published data demonstrating the disruption of the human Sin3 network with a histone deacetylase inhibitor. Here we show that disrupted networks contained topological network modules (TNMs) with shared properties that mapped onto distinct locations in networks. We define TNMs as proteins that occupy close network positions depending on their coordinates in a topological space. TNMs provide new insight into networks by capturing proteins from different categories including proteins within a complex, proteins with shared biological functions, and proteins disrupted across networks. PMID:28272416

  1. AutoDock-GIST: Incorporating Thermodynamics of Active-Site Water into Scoring Function for Accurate Protein-Ligand Docking.

    Science.gov (United States)

    Uehara, Shota; Tanaka, Shigenori

    2016-11-23

    Water plays a significant role in the binding process between protein and ligand. However, the thermodynamics of water molecules are often underestimated, or even ignored, in protein-ligand docking. Usually, the free energies of active-site water molecules are substantially different from those of waters in the bulk region. The binding of a ligand to a protein causes a displacement of these waters from an active site to bulk, and this displacement process substantially contributes to the free energy change of protein-ligand binding. The free energy of active-site water molecules can be calculated by grid inhomogeneous solvation theory (GIST), using molecular dynamics (MD) and the trajectory of a target protein and water molecules. Here, we show a case study of the combination of GIST and a docking program and discuss the effectiveness of the displacing gain of unfavorable water in protein-ligand docking. We combined the GIST-based desolvation function with the scoring function of AutoDock4, which is called AutoDock-GIST. The proposed scoring function was assessed employing 51 ligands of coagulation factor Xa (FXa), and results showed that both scoring accuracy and docking success rate were improved. We also evaluated virtual screening performance of AutoDock-GIST using FXa ligands in the Directory of Useful Decoys-Enhanced (DUD-E), thus finding that the displacing gain of unfavorable water is effective for a successful docking campaign.

  2. Identification of Posttranslational Modification-Dependent Protein Interactions Using Yeast Surface Displayed Human Proteome Libraries.

    Science.gov (United States)

    Bidlingmaier, Scott; Liu, Bin

    2015-01-01

    The identification of proteins that interact specifically with posttranslational modifications such as phosphorylation is often necessary to understand cellular signaling pathways. Numerous methods for identifying proteins that interact with posttranslational modifications have been utilized, including affinity-based purification and analysis, protein microarrays, phage display, and tethered catalysis. Although these techniques have been used successfully, each has limitations. Recently, yeast surface-displayed human proteome libraries have been utilized to identify protein fragments with affinity for various target molecules, including phosphorylated peptides. When coupled with fluorescently activated cell sorting and high throughput methods for the analysis of selection outputs, yeast surface-displayed human proteome libraries can rapidly and efficiently identify protein fragments with affinity for any soluble ligand that can be fluorescently detected, including posttranslational modifications. In this review we compare the use of yeast surface display libraries to other methods for the identification of interactions between proteins and posttranslational modifications and discuss future applications of the technology.

  3. Identification of Differentially Expressed Serum Proteins in Infectious Purpura Fulminans

    Directory of Open Access Journals (Sweden)

    Ting He

    2014-01-01

    Purpura fulminans (PF) is a life-threatening hemorrhagic condition. Because of the rarity and randomness of the disease, no improvement in treatment has been made for a long time. In this study, we assessed the serum proteome response to PF by comparing serum proteins between healthy controls and PF patient. A liquid chromatography with tandem mass spectrometry (LC-MS/MS) approach was used after depleting 6 abundant serum proteins. In total, 262 proteins were confidently identified with 2 unique peptides, and 38 proteins were identified as significantly up- (≥2) or downregulated (≤0.5) based on spectral counting ratios (SpCPF/N). Among the 38 proteins with significant abundance changes, 11 were previously known to be associated with burn or sepsis response, but 27 potentially novel proteins may be specifically associated with the PF process. Two differentially expressed proteins, alpha-1-antitrypsin (SERPINA1) and alpha-2 antiplasmin (SERPINF2), were validated by Western blot. This is the first study where PF patient and healthy controls are compared in a proteomic study to elucidate proteins involved in the response to PF. This study provides an initial basis for future studies of PF, and the differentially expressed proteins might provide new therapeutic targets to decrease the mortality of PF.
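
    The fold-change rule quoted in this abstract (spectral counting ratio ≥2 for upregulation, ≤0.5 for downregulation) can be sketched in a few lines of Python. This is only an illustrative sketch: the function names, the handling of zero counts, and the example counts are assumptions and are not taken from the study.

```python
import math


def spc_ratio(spc_pf: float, spc_normal: float) -> float:
    """Spectral count ratio of patient over control (SpC_PF / SpC_N).

    Zero handling is an assumption of this sketch: a protein seen only in
    the patient gets an infinite ratio, and a protein seen in neither
    sample gets NaN.
    """
    if spc_normal == 0:
        return math.inf if spc_pf > 0 else math.nan
    return spc_pf / spc_normal


def classify(spc_pf: float, spc_normal: float,
             up: float = 2.0, down: float = 0.5) -> str:
    """Label a protein using the >=2 / <=0.5 thresholds from the abstract."""
    ratio = spc_ratio(spc_pf, spc_normal)
    if math.isnan(ratio):
        return "undetected"
    if ratio >= up:
        return "up"
    if ratio <= down:
        return "down"
    return "unchanged"


if __name__ == "__main__":
    # Hypothetical spectral counts, for illustration only.
    for name, pf, normal in [("protein_a", 10, 4), ("protein_b", 3, 6), ("protein_c", 5, 4)]:
        print(name, classify(pf, normal))
```

    Real spectral-count workflows usually normalize counts and add a statistical test before calling a change significant; the abstract does not say which were used here, so none is assumed.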

  4. Identification of IgE-binding proteins in soy lecithin.

    Science.gov (United States)

    Gu, X; Beardslee, T; Zeece, M; Sarath, G; Markwell, J

    2001-11-01

    Soy lecithin is widely used as an emulsifier in processed foods, pharmaceuticals and cosmetics. Soy lecithin is composed principally of phospholipids; however, it has also been shown to contain IgE-binding proteins, albeit at a low level. A few clinical cases involving allergic reactions to soy lecithin have been reported. The purpose of this investigation is to better characterize the IgE-binding proteins typically found in lecithin. Soy lecithin proteins were isolated following solvent extraction of lipid components and then separated on sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). The separated lecithin proteins were immunoblotted with sera from soy-sensitive individuals to determine the pattern of IgE-binding proteins. The identity of IgE-reactive bands was determined from their N-terminal sequence. The level of protein in six lecithin samples obtained from commercial suppliers ranged from 100 to 1,400 ppm. Lecithin samples showed similar protein patterns when examined by SDS-PAGE. Immunoblotting with sera from soy-sensitive individuals showed IgE binding to bands corresponding to 7, 12, 20, 39 and 57 kD. N-terminal analysis of these IgE-binding bands resulted in sequences for 3 components. The 12-kD band was identified as a methionine-rich protein (MRP) and a member of the 2S albumin class of soy proteins. The 20-kD band was found to be soybean Kunitz trypsin inhibitor. The 39-kD band was matched to a soy protein with unknown function. Soy lecithin contains a number of IgE-binding proteins; thus, it might represent a source of hidden allergens. These allergens are a more significant concern for soy-allergic individuals consuming lecithin products as a health supplement. In addition, the MRP and the 39-kD protein identified in this study represent newly identified IgE-binding proteins. Copyright 2001 S. Karger AG, Basel

  5. Phytochip: development of a DNA-microarray for rapid and accurate identification of Pseudo-nitzschia spp and other harmful algal species.

    Science.gov (United States)

    Noyer, Charlotte; Abot, Anne; Trouilh, Lidwine; Leberre, Véronique Anton; Dreanno, Catherine

    2015-05-01

    Detection of harmful algal blooms has become a challenging concern because of the direct impacts on public health and economy. The identification of toxic dinoflagellates and diatoms in monitoring programs requires an extensive taxonomic expertise and is time consuming. Advances in molecular biology have allowed the development of new approaches, more rapid, accurate and cost-effective for detecting these microorganisms. In this context, we developed a new DNA microarray (called, Phytochip) for the simultaneous detection of multiple HAB species with a particular emphasis on Pseudo-nitzschia species. Oligonucleotide probes were designed along the rRNA operon. After DNA extraction, the target rDNA genes were amplified and labeled using an asymmetric PCR; then, the amplicons were hybridized to the oligonucleotide probes present on the chips. The total assay from seawater sampling to data acquisition can be performed within a working day. Specificity and sensitivity were assessed by using monoclonal cultures, mixtures of species and field samples spiked with a known amount of cultured cells. The Phytochip with its 81 validated oligonucleotide probes was able to detect 12 species of Pseudo-nitzschia and 11 species of dinoflagellates among which were 3 species of Karenia and 3 species of Alexandrium. The Phytochip was applied to environmental samples already characterized by light microscopy and cloned into DNA libraries. The hybridizations on the Phytochip were in good agreement with the sequences retrieved from the clone libraries and the microscopic observations. The Phytochip enables a reliable multiplex detection of phytoplankton and can assist a water quality monitoring program as well as more general ecological research.

  6. Comprehensive Identification of Immunodominant Proteins of Brucella abortus and Brucella melitensis Using Antibodies in the Sera from Naturally Infected Hosts

    Directory of Open Access Journals (Sweden)

    Gamal Wareth

    2016-04-01

    Brucellosis is a debilitating zoonotic disease that affects humans and animals. The diagnosis of brucellosis is challenging, as accurate species level identification is not possible with any of the currently available serology-based diagnostic methods. The present study aimed at identifying Brucella (B.) species-specific proteins from the closely related species B. abortus and B. melitensis using sera collected from naturally infected host species. Unlike earlier reported investigations with either laboratory-grown species or vaccine strains, in the present study, field strains were utilized for analysis. The label-free quantitative proteomic analysis of the naturally isolated strains of these two closely related species revealed 402 differentially expressed proteins, among which 63 and 103 proteins were found exclusively in the whole cell extracts of B. abortus and B. melitensis field strains, respectively. The sera from four different naturally infected host species, i.e., cattle, buffalo, sheep, and goat were applied to identify the immune-binding protein spots present in the whole protein extracts from the isolated B. abortus and B. melitensis field strains and resolved on two-dimensional gel electrophoresis. Comprehensive analysis revealed that 25 proteins of B. abortus and 20 proteins of B. melitensis were distinctly immunoreactive. Dihydrodipicolinate synthase, glyceraldehyde-3-phosphate dehydrogenase and lactate/malate dehydrogenase from B. abortus, amino acid ABC transporter substrate-binding protein from B. melitensis and fumarylacetoacetate hydrolase from both species were reactive with the sera of all the tested naturally infected host species. The identified proteins could be used for the design of serological assays capable of detecting pan-Brucella, B. abortus- and B. melitensis-specific antibodies.

  7. Comprehensive Identification of Immunodominant Proteins of Brucella abortus and Brucella melitensis Using Antibodies in the Sera from Naturally Infected Hosts.

    Science.gov (United States)

    Wareth, Gamal; Eravci, Murat; Weise, Christoph; Roesler, Uwe; Melzer, Falk; Sprague, Lisa D; Neubauer, Heinrich; Murugaiyan, Jayaseelan

    2016-04-30

    Brucellosis is a debilitating zoonotic disease that affects humans and animals. The diagnosis of brucellosis is challenging, as accurate species level identification is not possible with any of the currently available serology-based diagnostic methods. The present study aimed at identifying Brucella (B.) species-specific proteins from the closely related species B. abortus and B. melitensis using sera collected from naturally infected host species. Unlike earlier reported investigations with either laboratory-grown species or vaccine strains, in the present study, field strains were utilized for analysis. The label-free quantitative proteomic analysis of the naturally isolated strains of these two closely related species revealed 402 differentially expressed proteins, among which 63 and 103 proteins were found exclusively in the whole cell extracts of B. abortus and B. melitensis field strains, respectively. The sera from four different naturally infected host species, i.e., cattle, buffalo, sheep, and goat were applied to identify the immune-binding protein spots present in the whole protein extracts from the isolated B. abortus and B. melitensis field strains and resolved on two-dimensional gel electrophoresis. Comprehensive analysis revealed that 25 proteins of B. abortus and 20 proteins of B. melitensis were distinctly immunoreactive. Dihydrodipicolinate synthase, glyceraldehyde-3-phosphate dehydrogenase and lactate/malate dehydrogenase from B. abortus, amino acid ABC transporter substrate-binding protein from B. melitensis and fumarylacetoacetate hydrolase from both species were reactive with the sera of all the tested naturally infected host species. The identified proteins could be used for the design of serological assays capable of detecting pan-Brucella, B. abortus- and B. melitensis-specific antibodies.

  8. An effective approach for identification of in vivo protein-DNA binding sites from paired-end ChIP-Seq data

    Directory of Open Access Journals (Sweden)

    Wilson Zoe A

    2010-02-01

    Background: ChIP-Seq, which combines chromatin immunoprecipitation (ChIP) with high-throughput massively parallel sequencing, is increasingly being used to identify protein-DNA interactions in vivo across the genome. However, maximizing the effectiveness of data analysis of such sequences requires the development of new algorithms that can accurately predict DNA-protein binding sites. Results: Here, we present SIPeS (Site Identification from Paired-end Sequencing), a novel algorithm for precise identification of binding sites from short reads generated by paired-end Solexa ChIP-Seq technology. We used ChIP-Seq data from the Arabidopsis basic helix-loop-helix transcription factor ABORTED MICROSPORES (AMS), which is expressed within the anther during pollen development. The results show that SIPeS has better resolution for binding site identification than two existing ChIP-Seq peak detection algorithms, Cisgenome and MACS. Conclusions: When compared to Cisgenome and MACS, SIPeS shows better resolution for binding site discovery. Moreover, SIPeS is designed to calculate the mappable genome length accurately using the fragment length derived from the paired-end reads. Dynamic baselines are also employed to discriminate closely adjacent binding sites effectively, which is of particular value when working with high-density genomes.
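
    The idea of using the full fragment defined by each read pair can be illustrated with a simple pileup. The sketch below is not the SIPeS algorithm: it does not model dynamic baselines or mappable genome length, and the function names and example coordinates are hypothetical. It only shows how fragment intervals taken from paired-end reads give a per-base coverage profile whose maximum marks a candidate binding site.

```python
from typing import List, Tuple


def fragment_pileup(fragments: List[Tuple[int, int]], length: int) -> List[int]:
    """Per-base coverage from fragments given as 0-based half-open (start, end).

    Each fragment is assumed to span from the leftmost base of one mate to
    the rightmost base of the other, so no fragment length is estimated.
    """
    diff = [0] * (length + 1)
    for start, end in fragments:
        if not 0 <= start < end <= length:
            raise ValueError(f"fragment ({start}, {end}) outside region of length {length}")
        diff[start] += 1   # coverage rises where the fragment begins
        diff[end] -= 1     # and falls just past its last base
    coverage = []
    depth = 0
    for delta in diff[:length]:
        depth += delta
        coverage.append(depth)
    return coverage


def summit(coverage: List[int]) -> Tuple[int, int]:
    """Return (position, depth) of the highest point; ties resolve to the middle."""
    peak = max(coverage)
    positions = [i for i, depth in enumerate(coverage) if depth == peak]
    return positions[len(positions) // 2], peak


if __name__ == "__main__":
    # Hypothetical fragments on a 10-base region, for illustration only.
    cov = fragment_pileup([(0, 5), (3, 8), (4, 10)], length=10)
    print(cov, summit(cov))
```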

  9. Identification and characterization of Euphorbia nivulia latex proteins.

    Science.gov (United States)

    Badgujar, Shamkant B; Mahajan, Raghunath T

    2014-03-01

    The protein profile of the latex of Euphorbia nivulia Buch.-Ham. is established. Three new proteins, Nivulian-I, II and III, have been purified to homogeneity from the latex. The relative molecular masses of Nivulian-I, II and III are 31,486.985, 43,670.846 and 52,803.470 Da, respectively. Nivulian-I is a simple protein, while Nivulian-II and III are glycoproteins. Peptide mass fingerprint analysis revealed that peptides of these proteins match the tubulin alpha-1 chain of Eleusine indica, maturase K of Banksia quercifolia and a hypothetical protein of Zea mays, respectively. The tryptic digestion profiles of Nivulian-I, II and III suggest that these proteins are exclusive to latex and may be new additions to the catalog of known plant proteins. This is the first report on the characterization and validation of Nivulian-I, II and III by peptide sequencing.

  10. Identification of new centrosome proteins by autoimmune patient sera

    Institute of Scientific and Technical Information of China (English)

    XIA Liang; LI Yan; YANG Dong; WANG LiMin; HE Fang; ZHOU ChunYuan; LI YongZhe; ZENG ChangQing; He DaCheng

    2007-01-01

    Compared to other subcellular organelles, the centrosome proteome is difficult to study because of the difficulties in separating and purifying centrosomes. Auto-antisera from 6 autoimmune patients, which recognized the centrosome specifically in immunofluorescence, were used to identify the corresponding centrosomal proteins. The sera were first tested by Western blot on whole cell lysate, and all bound antibodies were then eluted from each single band on the Western blot membrane to determine which antibody was responsible for the centrosome-specific immunofluorescence staining. The corresponding proteins were obtained by immunoprecipitation and identified by mass spectrometry. Six centrosomal proteins, including 2 known centrosomal proteins and 4 proteins with unknown localization or reportedly non-centrosomal localization, were identified. These proteins are apparently involved in cell cycle regulation, signal transduction pathways, molecular chaperones, and metabolic enzymes, which may reflect the expected functional diversity of the centrosome.

  11. Identification of an epitope of SARS-coronavirus nucleocapsid protein

    Institute of Scientific and Technical Information of China (English)

    YING LIN; JIN WANG; HONG XIA WANG; HUA LIANG JIANG; JIAN HUA SHEN; YOU HUA XIE; YUAN WANG; GANG PEI; BEI FEN SHEN; JIA RUI WU; BING SUN; XU SHEN; RUI FU YANG; YI XUE LI; YONG YONG JI; YOU YU HE; MUDE SHI; WEI LU; TIE LIU SHI

    2003-01-01

    The nucleocapsid (N) protein of severe acute respiratory syndrome-coronavirus (SARS-CoV) is a major virion structural protein. In this study, two epitopes (N1 and N2) of the N protein of SARS-CoV were predicted by bioinformatics analysis. After immunization with two peptides, the peptides-specific antibodies were isolated from the immunized rabbits. The further experiments demonstrated that N1 peptide-induced polyclonal antibodies had a high affinity to bind to E. coli expressed N protein of SARS-CoV. Furthermore, it was confirmed that N1 peptide-specific IgG antibodies were detectable in the sera of severe acute respiratory syndrome (SARS) patients. The results indicated that an epitope of the N protein has been identified and N protein specific Abs were produced by peptide immunization, which will be useful for the study of SARS-CoV.

  12. Toward accurate prediction of pKa values for internal protein residues: the importance of conformational relaxation and desolvation energy.

    Science.gov (United States)

    Wallace, Jason A; Wang, Yuhang; Shi, Chuanyin; Pastoor, Kevin J; Nguyen, Bao-Linh; Xia, Kai; Shen, Jana K

    2011-12-01

    Proton uptake or release controls many important biological processes, such as energy transduction, virus replication, and catalysis. Accurate pK(a) prediction informs about proton pathways, thereby revealing detailed acid-base mechanisms. Physics-based methods in the framework of molecular dynamics simulations not only offer pK(a) predictions but also inform about the physical origins of pK(a) shifts and provide details of ionization-induced conformational relaxation and large-scale transitions. One such method is the recently developed continuous constant pH molecular dynamics (CPHMD) method, which has been shown to be an accurate and robust pK(a) prediction tool for naturally occurring titratable residues. To further examine the accuracy and limitations of CPHMD, we blindly predicted the pK(a) values for 87 titratable residues introduced in various hydrophobic regions of staphylococcal nuclease and variants. The predictions gave a root-mean-square deviation of 1.69 pK units from experiment, and there were only two pK(a)'s with errors greater than 3.5 pK units. Analysis of the conformational fluctuation of titrating side-chains in the context of the errors of calculated pK(a) values indicates that explicit treatment of conformational flexibility and the associated dielectric relaxation gives CPHMD a distinct advantage. Analysis of the sources of errors suggests that more accurate pK(a) predictions can be obtained for the most deeply buried residues by improving the accuracy in calculating desolvation energies. Furthermore, it is found that the generalized Born implicit-solvent model underlying the current CPHMD implementation slightly distorts the local conformational environment such that the inclusion of an explicit-solvent representation may offer improvement of accuracy.
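
    The two accuracy figures reported in this abstract, a root-mean-square deviation and a count of residues with errors above 3.5 pK units, are standard summary statistics. A minimal sketch of how such figures could be computed is shown below; the function names and the example values are assumptions for illustration and are not data from the study.

```python
import math
from typing import Sequence


def rmsd(predicted: Sequence[float], experimental: Sequence[float]) -> float:
    """Root-mean-square deviation between paired predicted and measured pKa values."""
    if len(predicted) != len(experimental) or len(predicted) == 0:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(p - e) ** 2 for p, e in zip(predicted, experimental)]
    return math.sqrt(sum(squared) / len(squared))


def count_large_errors(predicted: Sequence[float], experimental: Sequence[float],
                       cutoff: float = 3.5) -> int:
    """Number of residues whose absolute error exceeds the cutoff (in pK units)."""
    return sum(1 for p, e in zip(predicted, experimental) if abs(p - e) > cutoff)


if __name__ == "__main__":
    # Made-up pKa values, for illustration only.
    pred = [7.1, 5.4, 9.8, 4.0]
    expt = [6.5, 5.9, 10.4, 8.1]
    print(round(rmsd(pred, expt), 2), count_large_errors(pred, expt))
```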

  13. Identification of proteins in fluid collected from nerve regeneration chambers

    Directory of Open Access Journals (Sweden)

    Ye Yilin

    2014-01-01

    We examined whether there are novel neurotrophic factors (NTFs) in nerve regeneration conditioned fluid (NRCF). Nerve regeneration chamber models were established in the sciatic nerves of 25 New Zealand rabbits, and NRCF was extracted from the chambers 1 week postoperatively. Proteins in NRCF were separated by native polyacrylamide gel electrophoresis (PAGE), and Western blot and ELISA were used to identify the proteins. A novel NTF was identified in a protein fraction corresponding to 220 kDa.

  14. Novel identification of matrix proteins involved in calcitic biomineralization.

    Science.gov (United States)

    Rose-Martel, Megan; Smiley, Sandy; Hincke, Maxwell T

    2015-02-26

    Calcitic biomineralization is essential for otoconia formation in vertebrates. This process is characterized by protein-crystal interactions that modulate crystal growth on an extracellular matrix. An excellent model for the study of calcitic biomineralization is the avian eggshell, the fastest known biomineralization process. The objective of this study is to identify and characterize matrix proteins associated with the eggshell mammillary cones, which are hypothesized to regulate the earliest stage of eggshell calcification. Mammillary cones were isolated from 2 models, fertilized and unfertilized, and the released proteins were identified by RP-nanoLC and ES-MS/MS proteomics. Proteomics analysis identified 49 proteins associated with the eggshell membrane fibers and, importantly, 18 mammillary cone-specific proteins with an additional 18 proteins identified as enriched in the mammillary cones. Among the most promising candidates for modulating protein-crystal interactions were extracellular matrix proteins, including ABI family member 3 (NESH) binding protein (ABI3BP), tiarin-like, hyaluronan and proteoglycan link protein 3 (HAPLN3), collagen alpha-1(X), collagen alpha-1(II) and fibronectin, in addition to the calcium binding proteins calumenin, EGF-like repeats and discoidin 1-like domains 3 (EDIL3), nucleobindin-2 and SPARC. In conclusion, we identified several cone-resident proteins that are candidates to regulate initiation of eggshell calcification. Further study of these proteins will determine their roles in modulating calcitic biomineralization and lead to insight into the process of otoconia formation/regeneration. Biomineralization is essential for the development of hard tissues in vertebrates, which includes both calcium phosphate and calcium carbonate structures. 
Calcitic mineralization by calcium carbonate is an important process in the formation of otoconia, which are gravity receptor organs located in the inner ear and are responsible for balance

  15. Identification of ultramodified proteins using top-down spectra

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Xiaowen; Hengel, Shawna M.; Wu, Si; Tolic, Nikola; Pasa-Tolic, Ljiljana; Pevzner, Pavel A.

    2013-04-10

    Post-translational modifications (PTMs) play an important role in various biological processes through changing protein structure and function. Some ultramodified proteins (like histones) have multiple PTMs forming PTM patterns that define the functionality of a protein. While bottom-up mass spectrometry (MS) has been successful in identifying individual PTMs within short peptides, it is unable to identify PTM patterns spread along entire proteins in a coordinated fashion. In contrast, top-down MS analyzes intact proteins and reveals PTM patterns along the entire proteins. However, while recent advances in instrumentation have made top-down MS accessible to many laboratories, most computational tools for top-down MS focus on proteins with few PTMs and are unable to identify complex PTM patterns. We propose a new algorithm, MS-Align-E, that identifies both expected and unexpected PTMs in ultramodified proteins. We demonstrate that MS-Align-E identifies many protein forms of histone H4 and benchmark it against the currently accepted software tools.

  16. Identification of Ultramodified Proteins Using Top-Down Mass Spectra

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Xiaowen; Hengel, Shawna M.; Wu, Si; Tolic, Nikola; Pasa-Tolic, Ljiljana; Pevzner, Pavel A.

    2013-11-05

    Post-translational modifications (PTMs) play an important role in various biological processes through changing protein structure and function. Some ultramodified proteins (like histones) have multiple PTMs forming PTM patterns that define the functionality of a protein. While bottom-up mass spectrometry (MS) has been successful in identifying individual PTMs within short peptides, it is unable to identify PTM patterns spread along entire proteins in a coordinated fashion. In contrast, top-down MS analyzes intact proteins and reveals PTM patterns along the entire proteins. However, while recent advances in instrumentation have made top-down MS accessible to many laboratories, most computational tools for top-down MS focus on proteins with few PTMs and are unable to identify complex PTM patterns. We propose a new algorithm, MS-Align-E, that identifies both expected and unexpected PTMs in ultramodified proteins. We demonstrate that MS-Align-E identifies many protein forms of histone H4 and benchmark it against the currently accepted software tools.

  17. Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.

    Directory of Open Access Journals (Sweden)

    Eric Venner

    High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks.

  18. Accurate protein structure annotation through competitive diffusion of enzymatic functions over a network of local evolutionary similarities.

    Science.gov (United States)

    Venner, Eric; Lisewski, Andreas Martin; Erdin, Serkan; Ward, R Matthew; Amin, Shivas R; Lichtarge, Olivier

    2010-12-13

    High-throughput Structural Genomics yields many new protein structures without known molecular function. This study aims to uncover these missing annotations by globally comparing select functional residues across the structural proteome. First, Evolutionary Trace Annotation, or ETA, identifies which proteins have local evolutionary and structural features in common; next, these proteins are linked together into a proteomic network of ETA similarities; then, starting from proteins with known functions, competing functional labels diffuse link-by-link over the entire network. Every node is thus assigned a likelihood z-score for every function, and the most significant one at each node wins and defines its annotation. In high-throughput controls, this competitive diffusion process recovered enzyme activity annotations with 99% and 97% accuracy at half-coverage for the third and fourth Enzyme Commission (EC) levels, respectively. This corresponds to false positive rates 4-fold lower than nearest-neighbor and 5-fold lower than sequence-based annotations. In practice, experimental validation of the predicted carboxylesterase activity in a protein from Staphylococcus aureus illustrated the effectiveness of this approach in the context of an increasingly drug-resistant microbe. This study further links molecular function to a small number of evolutionarily important residues recognizable by Evolutionary Tracing and it points to the specificity and sensitivity of functional annotation by competitive global network diffusion. A web server is at http://mammoth.bcm.tmc.edu/networks.
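
    The "competing labels diffuse link-by-link" step can be illustrated with a generic label-propagation sketch. This is not the authors' ETA network code: the damping factor, the iteration count, and the use of the highest raw score instead of a likelihood z-score to pick the winner are all simplifying assumptions, and the node and label names are hypothetical.

```python
from typing import Dict, Optional, Tuple

Graph = Dict[str, Dict[str, float]]  # node -> {neighbor: link weight}, assumed symmetric


def diffuse_labels(edges: Graph, known: Dict[str, str], iterations: int = 20,
                   damping: float = 0.5) -> Tuple[Dict[str, Optional[str]], Dict[str, Dict[str, float]]]:
    """Spread each known label over the network and let the strongest one win per node."""
    labels = sorted(set(known.values()))
    nodes = sorted(edges)
    # Annotated nodes start with score 1 for their own label; everything else starts at 0.
    scores = {n: {lab: 1.0 if known.get(n) == lab else 0.0 for lab in labels} for n in nodes}
    for _ in range(iterations):
        updated = {}
        for n in nodes:
            if n in known:
                updated[n] = scores[n]  # annotated nodes stay clamped as sources
                continue
            total = sum(edges[n].values())
            updated[n] = {
                lab: (damping * sum(w * scores[m][lab] for m, w in edges[n].items()) / total
                      if total else 0.0)
                for lab in labels
            }
        scores = updated
    winners: Dict[str, Optional[str]] = {}
    for n in nodes:
        best = max(labels, key=lambda lab: scores[n][lab])
        winners[n] = best if scores[n][best] > 0 else None  # unreachable nodes stay unlabeled
    return winners, scores


if __name__ == "__main__":
    # Hypothetical four-protein chain: A and D are annotated, B and C are not.
    net = {"A": {"B": 1.0}, "B": {"A": 1.0, "C": 0.2},
           "C": {"B": 0.2, "D": 1.0}, "D": {"C": 1.0}}
    print(diffuse_labels(net, {"A": "esterase", "D": "kinase"})[0])
```

    In this toy network each unannotated protein takes the label of the annotated neighbor it is most strongly linked to, which is the competitive behavior the abstract describes in words.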

  19. Identification of urinary proteins potentially associated with diabetic kidney disease

    Directory of Open Access Journals (Sweden)

    R K Marikanty

    2016-01-01

    Diabetic nephropathy (DN) is the most common cause of chronic kidney disease. Although several parameters are used to evaluate renal damage, in many instances, there is no pathological change until damage is already advanced. Mass spectrometry-based proteomics is a novel tool to identify newer diagnostic markers. To identify urinary proteins associated with renal complications in diabetes, we collected urine samples from 10 type 2 diabetes patients each with normoalbuminuria, micro- and macro-albuminuria and compared their urinary proteome with that of 10 healthy individuals. Urinary proteins were concentrated, depleted of albumin and five other abundant plasma proteins and in-gel trypsin digested after prefractionation on sodium dodecyl sulfate polyacrylamide gel electrophoresis. The peptides were analyzed using a nanoflow reverse phase liquid chromatography system coupled to linear trap quadrupole-Orbitrap mass spectrometer. We identified a large number of proteins in each group, of which many were exclusively present in individual patient groups. A total of 53 proteins were common in all patients but were absent in the controls. The majority of the proteins were functionally binding, biologically involved in metabolic processes, and showed enrichment of alternative complement and blood coagulation pathways. In addition to identifying reported proteins such as α2-HS-glycoprotein and Vitamin D binding protein, we detected novel proteins such as CD59, extracellular matrix protein 1 (ECM1), factor H, and myoglobin in the urine of macroalbuminuria patients. ECM1 and factor H are known to influence mesangial cell proliferation, and CD59 causes microvascular damage by influencing membrane attack complex deposition, suggesting their biological relevance to DN. 
Thus, we have developed a proteome database where various proteins exclusively present in the patients may be further investigated for their role as stage-specific markers and possible therapeutic

  20. Proteomics of Soil and Sediment: Protein Identification by De Novo Sequencing of Mass Spectra Complements Traditional Database Searching

    Science.gov (United States)

    Miller, S.; Rizzo, A. I.; Waldbauer, J.

    2015-12-01

    Proteomics has the potential to elucidate the metabolic pathways and taxa responsible for in situ biogeochemical transformations. However, low rates of protein identification from high resolution mass spectra have been a barrier to the development of proteomics in complex environmental samples. Much of the difficulty lies in the computational challenge of linking mass spectra to their corresponding proteins. Traditional database search methods for matching peptide sequences to mass spectra are often inadequate due to the complexity of environmental proteomes and the large database search space, as we demonstrate with soil and sediment proteomes generated via a range of extraction methods. One alternative to traditional database searching is de novo sequencing, which identifies peptide sequences without the need for a database. BLAST can then be used to match de novo sequences to similar genetic sequences. Assigning confidence to putative identifications has been one hurdle for the implementation of de novo sequencing. We found that accurate de novo sequences can be screened by quality score and length. Screening criteria are verified by comparing the results of de novo sequencing and traditional database searching for well-characterized proteomes from simple biological systems. The BLAST hits of screened sequences are interrogated for taxonomic and functional information. We applied de novo sequencing to organic topsoil and marine sediment proteomes. Peak-rich proteomes, which can result from various extraction techniques, yield thousands of high-confidence protein identifications, an improvement over previous proteomic studies of soil and sediment. User-friendly software tools for de novo metaproteomics analysis have been developed. This "De Novo Analysis" Pipeline is also a faster method of data analysis than constructing a tailored sequence database for traditional database searching.
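
    The screening step described above (filtering de novo sequences by quality score and length before a BLAST search) can be sketched as follows. This is a minimal illustration, not the authors' "De Novo Analysis" Pipeline: the `DeNovoPeptide` record, the 0-100 score scale, and the cutoffs `MIN_SCORE` and `MIN_LENGTH` are all assumptions, since the abstract does not state the actual values.

```python
from dataclasses import dataclass


@dataclass
class DeNovoPeptide:
    sequence: str         # peptide sequence proposed by a de novo tool
    quality_score: float  # per-peptide confidence, assumed to be on a 0-100 scale


# Hypothetical cutoffs for illustration only; the abstract gives no values.
MIN_SCORE = 80.0
MIN_LENGTH = 8


def screen(peptides, min_score=MIN_SCORE, min_length=MIN_LENGTH):
    """Keep peptides that pass both the quality-score and the length cutoff."""
    return [
        p for p in peptides
        if p.quality_score >= min_score and len(p.sequence) >= min_length
    ]


def to_fasta(peptides):
    """Format screened peptides as FASTA text, e.g. as input for a BLAST search."""
    lines = []
    for i, p in enumerate(peptides, start=1):
        lines.append(f">denovo_{i} score={p.quality_score:.1f}")
        lines.append(p.sequence)
    return "\n".join(lines)


candidates = [
    DeNovoPeptide("LVNELTEFAK", 92.5),  # long and high scoring: kept
    DeNovoPeptide("AEFVEVTK", 61.0),    # long enough but low scoring: dropped
    DeNovoPeptide("GLSDGEW", 95.0),     # high scoring but too short: dropped
]
kept = screen(candidates)
print(to_fasta(kept))
```

    In practice the cutoffs would be calibrated as the abstract describes, by comparing de novo results with database-search results on a well-characterized proteome.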

  1. Identification of highly active flocculant proteins in bovine blood.

    Science.gov (United States)

    Piazza, George J; Nuñez, Alberto; Garcia, Rafael A

    2012-03-01

    Synthetic polymeric flocculants are used extensively for wastewater remediation, soil stabilization, and reduction in water leakage from unlined canals. Sources of highly active, inexpensive, renewable flocculants are needed to replace synthetic flocculants. High kaolin flocculant activity was documented for bovine blood (BB) and blood plasma with several anticoagulant treatments. BB serum also had high flocculant activity. To address the hypothesis that some blood proteins have strong flocculating activity, the BB proteins were separated by SEC. Then, the major proteins of the flocculant-active fractions were separated by SDS-PAGE. The identity of the major protein components was determined by tryptic digestion and peptide analysis by MALDI TOF MS. The sequence of selected peptides was confirmed using TOF/TOF-MS/MS fragmentation. Hemoglobin dimer (subunits α and β) was identified as the major protein component of the active fraction in BB; its high flocculation activity was confirmed by testing a commercial sample of hemoglobin. In the same manner, three proteins from blood plasma (fibrinogen, γ-globulin, α-2-macroglobulin) were found to be highly active flocculants, but bovine serum albumin, α-globulin, and β-globulin were not flocculants. On a mass basis, hemoglobin, γ-globulin, and α-2-macroglobulin were as effective as anionic polyacrylamide (PAM), a widely used synthetic flocculant. The blood proteins acted faster than PAM, and unlike PAM, the blood protein flocculants did not require calcium salts for their activity.

  2. Identification of vitreous proteins in retinopathy of prematurity.

    Science.gov (United States)

    Sugioka, Koji; Saito, Akio; Kusaka, Shunji; Kuniyoshi, Kazuki; Shimomura, Yoshikazu

    2017-07-01

    Retinopathy of prematurity (ROP) is a disorder of the retinal blood vessels that develops in premature infants and is the leading cause of blindness in children. Proteomic analysis was performed to identify vitreous proteins specific to patients with ROP. Vitreous humor samples were obtained from three patients with ROP and two patients with congenital cataract, the latter included as a control group. The vitreous samples were separated by 2D-PAGE and the proteins running as definitive spots were identified by MALDI-TOF mass spectrometry. We identified 13 and 6 proteins in the vitreous from ROP and cataract patients, respectively. Albumin, transferrin, pigment epithelium-derived factor (PEDF) and transthyretin were found in both patient groups. In the samples from ROP patients, PEDF and transthyretin levels were lower than in those from cataract patients, and retinol binding protein 3 and prostaglandin D synthase were not detected. Of the 13 proteins, 9 proteins (α-2-macroglobulin, ceruloplasmin, α-fetoprotein, vitamin D-binding protein, α-1-antitrypsin, α-1-β-glycoprotein, hemopexin, apolipoprotein A-1 and A-IV) were found only in vitreous samples from the ROP patients. PEDF has anti-angiogenic and neurotrophic functions. Whether PEDF is increased or decreased in diabetic retinopathy has been controversial, but we observed lower PEDF in the ROP samples than in the controls. The proteins specific to or decreased in ROP, if confirmed in future studies, may provide clues to understanding its pathogenesis. Copyright © 2017 Elsevier Inc. All rights reserved.

  3. Identification and properties of Trichomonas vaginalis proteins involved in cytadherence.

    Science.gov (United States)

    Alderete, J F; Garza, G E

    1988-01-01

    Trichomonas vaginalis NYH286 surface proteins which are candidates for mediating parasite cytadherence (adhesins) were identified. At least four trichomonad protein ligands ranging in relative molecular mass from 65 to less than or equal to 21 kilodaltons were found to selectively bind to chemically stabilized HeLa cells. The proteins were present on the surfaces of 10 different isolates of T. vaginalis examined; however, the nonpathogenic trichomonad T. tenax did not possess similar HeLa cell-binding proteins under identical experimental conditions, suggesting that these proteins are unique to the pathogenic human trichomonads. The surface nature of the candidate adhesins was confirmed by the ability of the proteins on intact, live organisms to be radioiodinated and to be removed with trypsin treatment. Rabbit antiserum (immunoglobulin G fraction) generated against adhesin proteins electroeluted from acrylamide preparations inhibited cytadherence compared with control immunoglobulin G. An adherence-negative subpopulation of T. vaginalis NYH286 organisms was also isolated. These nonadherent trichomonads did not synthesize the adhesin proteins. Interestingly, absence of adhesins from these parasites paralleled expression of a major immunogen known to undergo phenotypic variation. Revertant organisms derived from the adherence-minus subpopulation synthesized the adhesins and attached to HeLa cells. The emergence of revertant adherent T. vaginalis organisms also corresponded with the appearance of parasites which were without the major immunogen on their surface. Finally, it was determined that only those parasites lacking the major surface immunogen were capable of adherence and toxicity to HeLa cells.

  4. Identification, cloning, and purification of protein antigens of Treponema pallidum.

    Science.gov (United States)

    Stamm, L V; Dallas, W S; Ray, P H; Bassford, P J

    1988-01-01

    Difficulties in culturing the bacterium Treponema pallidum have greatly hindered syphilis research. In recent years, several laboratories have begun applying recombinant DNA technology to the study of this organism. Recent work is summarized concerning the expression of T. pallidum DNA in Escherichia coli. A number of E. coli clones expressing treponemal protein antigens have been identified. In one instance, a recombinant protein was purified to homogeneity and shown to be identical to a highly immunogenic, native T. pallidum membrane protein of molecular weight 39,000, which was designated the basic membrane protein (BMP) of this organism. In addition, recent experiments are described that were designed to identify cell-surface proteins that would serve as the primary focus of our cloning efforts. Results obtained with the use of several different approaches strongly suggest that the outer membrane of T. pallidum is an antigenically inert structure largely devoid of protein. However, a class of low-molecular-weight protein antigens has been identified that is actively secreted into the extracellular medium. Attempts currently are being made to clone these secreted proteins and investigate their roles in the pathogenesis and immunobiology of syphilis.

  5. Identification of fibrin clot-bound plasma proteins

    NARCIS (Netherlands)

    S. Talens (Simone); F.W.G. Leebeek (Frank); J.A.A. Demmers (Jeroen); D.C. Rijken (Dingeman)

    2012-01-01

    Several proteins are known to bind to a fibrin network and to change clot properties or function. In this study we aimed to get an overview of fibrin clot-bound plasma proteins. A plasma clot was formed by adding thrombin, CaCl2 and aprotinin to citrated platelet-poor plasma and unbound

  6. Cross-Species Genome-Wide Identification of Evolutionary Conserved MicroProteins

    Science.gov (United States)

    Straub, Daniel

    2017-01-01

    MicroProteins are small single-domain proteins that act by engaging their targets into different, sometimes nonproductive protein complexes. In order to identify novel microProteins in any sequenced genome of interest, we have developed miPFinder, a program that identifies and classifies potential microProteins. In the past years, several microProteins have been discovered in plants where they are mainly involved in the regulation of development by fine-tuning transcription factor activities. The miPFinder algorithm identifies all plant microProteins known to date and extends the microProtein concept beyond transcription factors to other protein families. Here, we reveal potential microProtein candidates in several plant and animal reference genomes. A large number of these microProteins are species-specific while others evolved early and are evolutionarily highly conserved. Most known microProtein genes originated from large ancestral genes by gene duplication, mutation and subsequent degradation. Gene ontology analysis shows that putative microProtein ancestors are often located in the nucleus, and involved in DNA binding and formation of protein complexes. Additionally, microProtein candidates act in plant transcriptional regulation, signal transduction and anatomical structure development. MiPFinder is freely available to find microProteins in any genome and will aid in the identification of novel microProteins in plants and animals. PMID:28338802

  7. Identification of SNARE proteins in fish-Tilapia Oreochromis niloticus

    Institute of Scientific and Technical Information of China (English)

    HUANG Xiaohang; LAM Patrick P L; LIN Xuezheng; LIU Chenlin; BIAN Ji; GAISANO Herbert

    2007-01-01

    SNARE proteins are a group of membrane-associated proteins involved in exocytosis, secretion and membrane trafficking events in eukaryotic cells. Research on SNARE protein biology has become an increasingly attractive field in recent years; here it is applied to marine biology, specifically to the fish Tilapia (Oreochromis niloticus). Plasma membrane fractions of different tissues of Tilapia, including brain, liver-pancreas, intestine, skin and muscle, were extracted and immuno-decorated with isoform-specific antibodies to the SNARE families and associated proteins. Syntaxins 1A, 2 and 3, SNAP-23 and SNAP-25, VAMP-2, Munc-18-1 and Munc-13 were identified in the brain and were differentially distributed in the other organ tissues of Tilapia. The distinct distribution of SNARE and associated proteins will serve as the basis for further investigation into their special secretory function in these tissues of the fish.

  8. Identification of new centrosome proteins by autoimmune patient sera

    Institute of Scientific and Technical Information of China (English)

    2007-01-01

    Compared to other subcellular organelles, the centrosome proteome is difficult to study because of the difficulties in separating and purifying centrosomes. Auto-antisera from 6 autoimmune patients, which specifically recognized the centrosome in immunofluorescence, were used to identify the corresponding centrosomal proteins. The sera were first tested by Western blot on whole cell lysate, and all bound antibodies were then eluted from each single band on the Western blot membrane to determine which antibody was responsible for the centrosome-specific immunofluorescence staining. The corresponding proteins were obtained by immunoprecipitation and identified by mass spectrometry. Six centrosomal proteins, including 2 known centrosomal proteins and 4 proteins with unknown localization or reportedly non-centrosomal localization, were identified. These proteins are apparently involved in cell cycle regulation and signal transduction pathways or act as molecular chaperones and metabolic enzymes, which may reflect the expected functional diversity of the centrosome.

  9. Microdosing of a Carbon-14 Labeled Protein in Healthy Volunteers Accurately Predicts Its Pharmacokinetics at Therapeutic Dosages

    NARCIS (Netherlands)

    Vlaming, M.L.; Duijn, E. van; Dillingh, M.R.; Brands, R.; Windhorst, A.D.; Hendrikse, N.H.; Bosgra, S.; Burggraaf, J.; Koning, M.C. de; Fidder, A.; Mocking, J.A.; Sandman, H.; Ligt, R.A. de; Fabriek, B.O.; Pasman, W.J.; Seinen, W.; Alves, T.; Carrondo, M.; Peixoto, C.; Peeters, P.A.; Vaes, W.H.

    2015-01-01

    Preclinical development of new biological entities (NBEs), such as human protein therapeutics, requires considerable expenditure of time and costs. Poor prediction of pharmacokinetics in humans further reduces net efficiency. In this study, we show for the first time that pharmacokinetic data of

  11. A highly accurate protein structural class prediction approach using auto cross covariance transformation and recursive feature elimination.

    Science.gov (United States)

    Li, Xiaowei; Liu, Taigang; Tao, Peiying; Wang, Chunhua; Chen, Lanming

    2015-12-01

    Structural class characterizes the overall folding type of a protein or its domain. Many methods have been proposed to improve the prediction accuracy of protein structural class in recent years, but it is still a challenge for the low-similarity sequences. In this study, we introduce a feature extraction technique based on auto cross covariance (ACC) transformation of position-specific score matrix (PSSM) to represent a protein sequence. Then support vector machine-recursive feature elimination (SVM-RFE) is adopted to select top K features according to their importance and these features are input to a support vector machine (SVM) to conduct the prediction. Performance evaluation of the proposed method is performed using the jackknife test on three low-similarity datasets, i.e., D640, 1189 and 25PDB. By means of this method, the overall accuracies of 97.2%, 96.2%, and 93.3% are achieved on these three datasets, which are higher than those of most existing methods. This suggests that the proposed method could serve as a very cost-effective tool for predicting protein structural class especially for low-similarity datasets.
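
    The auto cross covariance (ACC) transformation mentioned above turns a variable-length PSSM into a fixed-length feature vector. The sketch below follows one common formulation, in which each feature is the lag-g covariance between two mean-centred PSSM columns, averaged over the L - g valid positions; the abstract does not give the exact formula, so the normalization and the feature ordering here are assumptions. The function name `acc_features` and the toy matrix are hypothetical.

```python
def acc_features(pssm, max_lag):
    """Return ACC features for a PSSM given as L rows of equal-length score lists.

    For every lag g in 1..max_lag and every ordered column pair (c1, c2), the
    feature is the mean over positions j of
        (S[j][c1] - mean[c1]) * (S[j + g][c2] - mean[c2]).
    Pairs with c1 == c2 are the auto covariance (AC) terms; the others are the
    cross covariance (CC) terms.
    """
    length = len(pssm)
    n_cols = len(pssm[0])
    if not 0 < max_lag < length:
        raise ValueError("max_lag must be between 1 and sequence length - 1")

    # Mean of each column over the whole sequence.
    means = [sum(row[c] for row in pssm) / length for c in range(n_cols)]
    # Centre every column on its mean.
    centred = [[row[c] - means[c] for c in range(n_cols)] for row in pssm]

    features = []
    for lag in range(1, max_lag + 1):
        span = length - lag  # number of position pairs available at this lag
        for c1 in range(n_cols):
            for c2 in range(n_cols):
                total = sum(
                    centred[j][c1] * centred[j + lag][c2] for j in range(span)
                )
                features.append(total / span)
    return features


# Toy 3-residue "PSSM" with 2 columns; a real PSSM has 20 columns per residue.
toy = [[1, 0], [2, 4], [3, 2]]
print(acc_features(toy, max_lag=1))
```

    With 20 columns this yields 400 features per lag, regardless of sequence length, which is what allows a feature-selection step such as SVM-RFE to operate on a fixed-size vector.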

  12. Analytical approaches for the characterization and identification of olive (Olea europaea) oil proteins.

    Science.gov (United States)

    Esteve, Clara; D'Amato, Alfonsina; Marina, María Luisa; García, María Concepción; Righetti, Pier Giorgio

    2013-10-30

    Proteins in olive oil have been scarcely investigated probably due to the difficulty of working with such a lipidic matrix and the dramatically low abundance of proteins in this biological material. Additionally, this scarce information has generated contradictory results, thus requiring further investigations. This work treats this subject from a comprehensive point of view and proposes the use of different analytical approaches to delve into the characterization and identification of proteins in olive oil. Different extraction methodologies, including capture via combinational hexapeptide ligand libraries (CPLLs), were tried. A sequence of methodologies, starting with off-gel isoelectric focusing (IEF) followed by sodium dodecyl sulfate-polyacrylamide gel electrophoresis (SDS-PAGE) or high-performance liquid chromatography (HPLC) using an ultraperformance liquid chromatography (UPLC) column, was applied to profile proteins from olive seed, pulp, and oil. Besides this, and for the first time, a tentative identification of oil proteins by mass spectrometry has been attempted.

  13. Proteomic identification of proteins in exosomes of patients with atherosclerosis

    Institute of Scientific and Technical Information of China (English)

    JIANG Mei; QUAN Jing; ZHANG Heng; DING Qian-qian; XIANG Meng; MENG Dan; SUN Ning; CHEN Si-feng

    2016-01-01

    AIM: Atherosclerosis primarily involves systemic arteries. The luminal surface of an artery, a monolayer of endothelial cells, is directly exposed to blood and is susceptible to active substances in the blood. Exosomes contain significant amounts of proteins and RNAs. Exosomes can be good or bad for cells, depending on their components. Thus, exosomes may contribute to atherosclerosis by affecting endothelial cells. This study analyzed the relationship between exosome proteins and atherosclerosis. METHODS: Fifty-six patients and healthy subjects were recruited and divided into two comparisons: healthy subjects vs atherosclerosis (HS vs AS), and hypertension vs hypertension plus atherosclerosis (HT vs HT+AS). Serum exosomes were analyzed by protein mass spectrometry. The protein profile and function were analyzed by gene ontology (GO). RESULTS: Five child terms appeared repeatedly under "response to stimulus" and "immune system process" of BP in the two comparisons (HS vs AS and HT vs HT+AS): "positive regulation of innate immune response", "immune response-activating signal transduction", "activation of innate immune response", "innate immune response-activating signal transduction" and "innate immune response activating cell surface receptor signaling pathway". Two child terms appeared repeatedly under "binding" of MF in the two comparisons: "antigen binding" and "enzyme binding". Two proteins, PSMA6 and PSMA7, appeared repeatedly in the two comparisons. CONCLUSION: GO analysis was used to build a hierarchical "tree" illustrating the involvement of these proteins in various terms in BP, CC and MF. The PPI analysis supplied proteins which may play potentially important roles in the AS process. The innate immune system and the blood coagulation pathway contribute to AS formation. The proteins PSMA6, PSMA7 and Annexin A2 may be new target proteins for the prevention and treatment of AS.

  14. Identification of novel amelogenin-binding proteins by proteomics analysis.

    Directory of Open Access Journals (Sweden)

    Takao Fukuda

    Emdogain (enamel matrix derivative, EMD) is well recognized in periodontology. It is used in periodontal surgery to regenerate cementum, periodontal ligament, and alveolar bone. However, the precise molecular mechanisms underlying periodontal regeneration are still unclear. In this study, we investigated the proteins bound to amelogenin, which are suggested to play a pivotal role in promoting periodontal tissue regeneration. To identify new molecules that interact with amelogenin and are involved in osteoblast activation, we employed coupling affinity chromatography with proteomic analysis in fractionated SaOS-2 osteoblastic cell lysate. In SaOS-2 cells, the amelogenin-interacting proteins in the cytoplasm were mainly cytoskeletal proteins and several chaperone molecules of the heat shock protein 70 (HSP70) family. On the other hand, the proteomic profiles of amelogenin-interacting proteins in the membrane fraction of the cell extracts were quite different from those of the cytosolic fraction. They were mainly endoplasmic reticulum (ER)-associated proteins, with lesser quantities of mitochondrial proteins and nucleoprotein. Among the identified amelogenin-interacting proteins, we validated the biological interaction of amelogenin with glucose-regulated protein 78 (Grp78/Bip), which was identified in both cytosolic and membrane-enriched fractions. A confocal co-localization experiment strongly suggested that Grp78/Bip could be an amelogenin receptor candidate. Further biological evaluations were performed by Grp78/Bip knockdown analysis with and without amelogenin. Within the limits of the present study, the interaction of amelogenin with Grp78/Bip contributed to cell proliferation, rather than correlating with osteogenic differentiation in SaOS-2 cells. Although the biological significance of other interactions is not yet explored, these findings suggest that the differential effects of amelogenin-derived osteoblast activation could be of

  15. On-tissue protein identification and imaging by MALDI-ion mobility mass spectrometry.

    Science.gov (United States)

    Stauber, Jonathan; MacAleese, Luke; Franck, Julien; Claude, Emmanuelle; Snel, Marten; Kaletas, Basak Kükrer; Wiel, Ingrid M V D; Wisztorski, Maxence; Fournier, Isabelle; Heeren, Ron M A

    2010-03-01

    MALDI imaging mass spectrometry (MALDI-IMS) has become a powerful tool for the detection and localization of drugs, proteins, and lipids on-tissue. Nevertheless, this approach can only perform identification of low mass molecules such as lipids, pharmaceuticals, and peptides. In this article, a combination of approaches for the detection and imaging of proteins and their identification directly on-tissue after tryptic digestion is described. Enzymatic digestion protocols for different kinds of tissues--formalin fixed paraffin embedded (FFPE) and frozen tissues--are combined with MALDI-ion mobility mass spectrometry (IM-MS). This combination enables localization and identification of proteins via their related digested peptides. In a number of cases, ion mobility separates isobaric ions that cannot be identified by conventional MALDI time-of-flight (TOF) mass spectrometry. The number of detected peaks per measurement increases (versus conventional MALDI-TOF), which enables mass and time selected ion images and the identification of separated ions. These experiments demonstrate the feasibility of direct protein identification by ion-mobility-TOF IMS from tissue. The tissue digestion combined with the MALDI-IM-TOF-IMS approach allows a proteomics "bottom-up" strategy with different kinds of tissue samples, especially FFPE tissues conserved for a long time in hospital sample banks. The combination of IM with IMS marks the development of IMS approaches as real proteomic tools, which brings new perspectives to biological studies.

  16. PSI/TM-Coffee: a web server for fast and accurate multiple sequence alignments of regular and transmembrane proteins using homology extension on reduced databases.

    Science.gov (United States)

    Floden, Evan W; Tommaso, Paolo D; Chatzou, Maria; Magis, Cedrik; Notredame, Cedric; Chang, Jia-Ming

    2016-07-08

    The PSI/TM-Coffee web server performs multiple sequence alignment (MSA) of proteins by combining homology extension with a consistency based alignment approach. Homology extension is performed with Position Specific Iterative (PSI) BLAST searches against a choice of redundant and non-redundant databases. The main novelty of this server is to allow databases of reduced complexity to rapidly perform homology extension. This server also gives the possibility to use transmembrane proteins (TMPs) reference databases to allow even faster homology extension on this important category of proteins. Aside from an MSA, the server also outputs topological prediction of TMPs using the HMMTOP algorithm. Previous benchmarking of the method has shown this approach outperforms the most accurate alignment methods such as MSAProbs, Kalign, PROMALS, MAFFT, ProbCons and PRALINE™. The web server is available at http://tcoffee.crg.cat/tmcoffee. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  17. Identification and Characterization of Proteins Associated with Plant Tolerance to Heat Stress

    Institute of Scientific and Technical Information of China (English)

    Bingru Huang; Chenping Xu

    2008-01-01

    Heat stress is a major abiotic stress limiting plant growth and productivity in many areas of the world. Understanding mechanisms of plant adaptation to heat stress would facilitate the development of heat-tolerant cultivars for improving productivity in warm climatic regions. Protein metabolism involving protein synthesis and degradation is one of the most sensitive processes to heat stress. Changes in the level and expression pattern of some proteins may play an important role in plant adaptation to heat stress. The identification of stress-responsive proteins and pathways has been facilitated by an increasing number of tools and resources, including two-dimensional electrophoresis and mass spectrometry, and the rapidly expanding nucleotide and amino acid sequence databases. Heat stress may induce or enhance protein expression or cause protein degradation. The induction of heat-responsive proteins, particularly heat shock proteins (HSPs), plays a key role in plant tolerance to heat stress. Protein degradation involving various proteases is also important in regulating plant responses to heat stress. This review provides an overview of recent research on proteomic profiling for the identification of heat-responsive proteins associated with heat tolerance, heat induction and characteristics of HSPs, and protein degradation in relation to plant responses to heat stress.

  18. A Graph-Centric Approach for Metagenome-Guided Peptide and Protein Identification in Metaproteomics.

    Science.gov (United States)

    Tang, Haixu; Li, Sujun; Ye, Yuzhen

    2016-12-01

    Metaproteomic studies adopt the common bottom-up proteomics approach to investigate the protein composition and the dynamics of protein expression in microbial communities. When matched metagenomic and/or metatranscriptomic data of the microbial communities are available, metaproteomic data analyses often employ a metagenome-guided approach, in which complete or fragmental protein-coding genes are first directly predicted from metagenomic (and/or metatranscriptomic) sequences or from their assemblies, and the resulting protein sequences are then used as the reference database for peptide/protein identification from MS/MS spectra. This approach is often limited because protein coding genes predicted from metagenomes are incomplete and fragmental. In this paper, we present a graph-centric approach to improving metagenome-guided peptide and protein identification in metaproteomics. Our method exploits the de Bruijn graph structure reported by metagenome assembly algorithms to generate a comprehensive database of protein sequences encoded in the community. We tested our method using several public metaproteomic datasets with matched metagenomic and metatranscriptomic sequencing data acquired from complex microbial communities in a biological wastewater treatment plant. The results showed that many more peptides and proteins can be identified when assembly graphs were utilized, improving the characterization of the proteins expressed in the microbial communities. The additional proteins we identified contribute to the characterization of important pathways such as those involved in degradation of chemical hazards. Our tools are released as open-source software on github at https://github.com/COL-IU/Graph2Pro.
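
    To make the role of the assembly graph concrete, the sketch below builds a small de Bruijn graph from reads and spells every sequence reachable from a start node. It is an illustration of the general idea only and not the Graph2Pro algorithm: the function names, the toy reads, and the bounded path enumeration are assumptions. The point it shows is that paths through the graph can span several reads and keep both branches of a variant, whereas linear contigs typically break at such branch points.

```python
from collections import defaultdict


def build_de_bruijn(reads, k):
    """Map each (k-1)-mer to the set of (k-1)-mers that follow it in some read."""
    graph = defaultdict(set)
    for read in reads:
        for i in range(len(read) - k + 1):
            kmer = read[i:i + k]
            graph[kmer[:-1]].add(kmer[1:])
    return graph


def spell_paths(graph, start, max_len):
    """Enumerate sequences spelled by paths from `start`.

    A path stops at a node with no outgoing edge, or once the spelled
    sequence reaches max_len bases (this bound also guards against cycles).
    """
    results = []
    stack = [start]
    while stack:
        seq = stack.pop()
        node = seq[-len(start):]  # the last (k-1) bases identify the current node
        successors = sorted(graph.get(node, ()))
        if not successors or len(seq) >= max_len:
            results.append(seq)
            continue
        for nxt in successors:
            stack.append(seq + nxt[-1])  # each edge adds exactly one base
    return sorted(results)


# Three toy reads; no single read covers a full open reading frame.
reads = ["ATGGCG", "GCGTAA", "TGGAGTAA"]
graph = build_de_bruijn(reads, k=4)
print(spell_paths(graph, start="ATG", max_len=20))
```

    Each spelled sequence could then be translated and added to the protein database used for the MS/MS search; that step is omitted here.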

  19. Identification and characterization of N-glycosylated proteins using proteomics

    DEFF Research Database (Denmark)

    Selby, David S; Larsen, Martin R; Calvano, Cosima Damiana;

    2008-01-01

    Glycoproteins constitute a large fraction of the proteome. The fundamental role of protein glycosylation in cellular development, growth, and differentiation, tissue development, and in host-pathogen interactions is by now widely accepted. Proteome-wide characterization of glycoproteins...

  20. Identification of differentially expressed proteins in response to Pb ...

    African Journals Online (AJOL)

    use

    in many forms in natural sources throughout the world. According to the environmental protection ... has been attached to the problems of Pb pollution with ..... structure of proteins and/or increased degradation, thus ..... Plant Soil, 200: 241-250.

  1. Systematic identification of proteins that elicit drug side effects

    DEFF Research Database (Denmark)

    Kuhn, Michael; Al Banchaabouchi, Mumna; Campillos, Monica

    2013-01-01

    Side effect similarities of drugs have recently been employed to predict new drug targets, and networks of side effects and targets have been used to better understand the mechanism of action of drugs. Here, we report a large-scale analysis to systematically predict and characterize proteins...... that cause drug side effects. We integrated phenotypic data obtained during clinical trials with known drug-target relations to identify overrepresented protein-side effect combinations. Using independent data, we confirm that most of these overrepresentations point to proteins which, when perturbed, cause...... side effects. Of 1428 side effects studied, 732 were predicted to be predominantly caused by individual proteins, at least 137 of them backed by existing pharmacological or phenotypic data. We prove this concept in vivo by confirming our prediction that activation of the serotonin 7 receptor (HTR7...
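
    The notion of an "overrepresented protein-side effect combination" can be illustrated with a one-sided hypergeometric test, a common choice for this kind of enrichment question. The abstract does not say which statistic the authors used, so the test itself, the function name, and the counts below are assumptions made for illustration.

```python
from math import comb


def hypergeom_upper_tail(n_total, n_target, n_effect, n_both):
    """Probability of seeing at least n_both drugs that both hit the protein
    and show the side effect, if the two properties were independent.

    n_total  : number of drugs considered
    n_target : drugs known to act on the protein
    n_effect : drugs reported to cause the side effect
    n_both   : drugs in both groups
    """
    max_overlap = min(n_target, n_effect)
    favourable = sum(
        comb(n_target, x) * comb(n_total - n_target, n_effect - x)
        for x in range(n_both, max_overlap + 1)
    )
    return favourable / comb(n_total, n_effect)


# Hypothetical counts: 10 drugs, 3 hit the protein, 4 cause the side effect,
# and all 3 protein binders are among those 4.
p_value = hypergeom_upper_tail(n_total=10, n_target=3, n_effect=4, n_both=3)
print(f"{p_value:.4f}")
```

    A small p-value flags the pair as overrepresented; across many protein-side effect pairs, a multiple-testing correction would be needed before drawing conclusions.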

  2. Proteomic identification of S-nitrosylated proteins in Arabidopsis

    DEFF Research Database (Denmark)

    Lindermayr, C.; Saalbach, G.; Durner, J.

    2005-01-01

    Although nitric oxide (NO) has grown into a key signaling molecule in plants during the last few years, less is known about how NO regulates different events in plants. Analyses of NO-dependent processes in animal systems have demonstrated protein S-nitrosylation of cysteine (Cys) residues to be one of the dominant regulation mechanisms for many animal proteins. For plants, the principle of S-nitrosylation remained to be elucidated. We generated S-nitrosothiols by treating extracts from Arabidopsis (Arabidopsis thaliana) cell suspension cultures with the NO-donor S-nitrosoglutathione. Furthermore, Arabidopsis plants were treated with gaseous NO to analyze whether S-nitrosylation can occur in the specific redox environment of a plant cell in vivo. S-Nitrosylated proteins were detected by a biotin switch method, converting S-nitrosylated Cys to biotinylated Cys. Biotin-labeled proteins were...

  3. Three BUB1 and BUBR1/MAD3-related spindle assembly checkpoint proteins are required for accurate mitosis in Arabidopsis.

    Science.gov (United States)

    Paganelli, Laetitia; Caillaud, Marie-Cécile; Quentin, Michaël; Damiani, Isabelle; Govetto, Benjamin; Lecomte, Philippe; Karpov, Pavel A; Abad, Pierre; Chabouté, Marie-Edith; Favery, Bruno

    2015-01-01

    The spindle assembly checkpoint (SAC) is a refined surveillance mechanism which ensures that chromosomes undergoing mitosis do not segregate until they are properly attached to the spindle microtubules (MT). The SAC has been extensively studied in metazoans and yeast, but little is known about its role in plants. We identified proteins interacting with a MT-associated protein MAP65-3, which plays a critical role in organising mitotic MT arrays, and carried out a functional analysis of previously and newly identified SAC components. We show that Arabidopsis SAC proteins BUB3.1, MAD2, BUBR1/MAD3s and BRK1 interact with each other and with MAP65-3. We found that two BUBR1/MAD3s interacted specifically at centromeres. When stably expressed in Arabidopsis, BRK1 localised to the kinetochores during all stages of the mitotic cell cycle. Early in mitosis, BUB3.1 and BUBR1/MAD3.1 localise to the mitotic spindle, where MAP65-3 organises spindle MTs. A double-knockout mad3.1 mad3.2 mutant presented spindle MT abnormalities, chromosome misalignments on the metaphase plate and the production of lagging chromosomes and micronuclei during mitosis. We conclude that BRK1 and BUBR1/MAD3-related proteins play a key role in ensuring faithful chromosome segregation during mitosis and that their interaction with MAP65-3 may be important for the regulation of MT-chromosome attachment. © 2014 The Authors. New Phytologist © 2014 New Phytologist Trust.

  4. Microdosing of a Carbon-14 Labeled Protein in Healthy Volunteers Accurately Predicts Its Pharmacokinetics at Therapeutic Dosages.

    Science.gov (United States)

    Vlaming, M L H; van Duijn, E; Dillingh, M R; Brands, R; Windhorst, A D; Hendrikse, N H; Bosgra, S; Burggraaf, J; de Koning, M C; Fidder, A; Mocking, J A J; Sandman, H; de Ligt, R A F; Fabriek, B O; Pasman, W J; Seinen, W; Alves, T; Carrondo, M; Peixoto, C; Peeters, P A M; Vaes, W H J

    2015-08-01

    Preclinical development of new biological entities (NBEs), such as human protein therapeutics, requires considerable expenditure of time and costs. Poor prediction of pharmacokinetics in humans further reduces net efficiency. In this study, we show for the first time that pharmacokinetic data of NBEs in humans can be successfully obtained early in the drug development process by the use of microdosing in a small group of healthy subjects combined with ultrasensitive accelerator mass spectrometry (AMS). After only minimal preclinical testing, we performed a first-in-human phase 0/phase 1 trial with a human recombinant therapeutic protein (RESCuing Alkaline Phosphatase, human recombinant placental alkaline phosphatase [hRESCAP]) to assess its safety and kinetics. Pharmacokinetic analysis showed dose linearity from microdose (53 μg) [(14) C]-hRESCAP to therapeutic doses (up to 5.3 mg) of the protein in healthy volunteers. This study demonstrates the value of a microdosing approach in a very small cohort for accelerating the clinical development of NBEs.

  5. Backbone building from quadrilaterals: a fast and accurate algorithm for protein backbone reconstruction from alpha carbon coordinates.

    Science.gov (United States)

    Gront, Dominik; Kmiecik, Sebastian; Kolinski, Andrzej

    2007-07-15

    In this contribution, we present an algorithm for protein backbone reconstruction that combines very high computational efficiency with high accuracy. Reconstruction of the main chain atomic coordinates from the alpha carbon trace is a common task in protein modeling, including de novo structure prediction, comparative modeling, and processing experimental data. The method employed in this work follows the main idea of some earlier approaches to the problem. The details and careful design of the present approach are new and lead to an algorithm that outperforms all commonly used earlier applications. The BBQ (Backbone Building from Quadrilaterals) program has been extensively tested both on native structures and on near-native decoy models, and compared with the other available methods. The results provide a comprehensive benchmark of existing tools and evaluate their applicability to large-scale modeling using a reduced representation of protein conformational space. The BBQ package is available for downloading from our website at http://biocomp.chem.uw.edu.pl/services/BBQ/. This webpage also provides a user manual that describes BBQ functions in detail.

  6. Fast and Accurate Protein False Discovery Rates on Large-Scale Proteomics Data Sets with Percolator 3.0

    Science.gov (United States)

    The, Matthew; MacCoss, Michael J.; Noble, William S.; Käll, Lukas

    2016-11-01

    Percolator is a widely used software tool that increases yield in shotgun proteomics experiments and assigns reliable statistical confidence measures, such as q values and posterior error probabilities, to peptides and peptide-spectrum matches (PSMs) from such experiments. Percolator's processing speed has been sufficient for typical data sets consisting of hundreds of thousands of PSMs. With our new scalable approach, we can now also analyze millions of PSMs in a matter of minutes on a commodity computer. Furthermore, with the increasing awareness for the need for reliable statistics on the protein level, we compared several easy-to-understand protein inference methods and implemented the best-performing method—grouping proteins by their corresponding sets of theoretical peptides and then considering only the best-scoring peptide for each protein—in the Percolator package. We used Percolator 3.0 to analyze the data from a recent study of the draft human proteome containing 25 million spectra (PM:24870542). The source code and Ubuntu, Windows, MacOS, and Fedora binary packages are available from http://percolator.ms/ under an Apache 2.0 license.
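    The protein inference rule described above (group proteins by their sets of theoretical peptides, then score each group by its best-scoring peptide) can be sketched as follows. This is an illustrative sketch, not Percolator's code or API: the function names, the "decoy_" naming convention, the toy peptides and scores, and the simple decoy/target ratio used for q values are all assumptions.

```python
from collections import defaultdict


def group_proteins(theoretical_peptides):
    """Merge proteins whose theoretical peptide sets are identical."""
    groups = defaultdict(list)
    for protein, peptides in theoretical_peptides.items():
        groups[frozenset(peptides)].append(protein)
    return {tuple(sorted(names)): peps for peps, names in groups.items()}


def score_groups(groups, peptide_scores):
    """Score each group by its single best-scoring observed peptide."""
    scored = {}
    for names, peptides in groups.items():
        observed = [peptide_scores[p] for p in peptides if p in peptide_scores]
        if observed:
            scored[names] = max(observed)
    return scored


def q_values(scored, is_decoy):
    """Target-decoy q values: running decoy/target ratio, made monotone."""
    ranked = sorted(scored, key=scored.get, reverse=True)
    fdrs, targets, decoys = [], 0, 0
    for names in ranked:
        if is_decoy(names):
            decoys += 1
        else:
            targets += 1
        fdrs.append(decoys / max(targets, 1))
    q, best = {}, float("inf")
    # Walk from the worst score upward so each q value is the minimum
    # FDR at which that group would still be accepted.
    for names, fdr in zip(reversed(ranked), reversed(fdrs)):
        best = min(best, fdr)
        q[names] = best
    return q


# Toy input (hypothetical): P1 and P2 share the same theoretical peptides.
theoretical = {
    "P1": {"AAK", "CCR"},
    "P2": {"AAK", "CCR"},
    "P3": {"DDK"},
    "decoy_P1": {"KAA"},
}
scores = {"AAK": 3.0, "CCR": 5.0, "DDK": 1.0, "KAA": 2.0}

scored = score_groups(group_proteins(theoretical), scores)
q = q_values(scored, lambda names: all(n.startswith("decoy_") for n in names))
```

    Using only the best peptide per group keeps the score from growing simply because a long protein has many weak peptide matches, which is one reason such a rule is easy to interpret.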

  7. Biomarkers for ragwort poisoning in horses: identification of protein targets

    Directory of Open Access Journals (Sweden)

    Beynon Robert J

    2008-08-01

    Background: Ingestion of the poisonous weed ragwort (Senecio jacobea) by horses leads to irreversible liver damage. The principal toxins of ragwort are the pyrrolizidine alkaloids, which are rapidly metabolised to highly reactive and cytotoxic pyrroles that can escape into the circulation and bind to proteins. In this study a non-invasive in vitro model system has been developed to investigate whether pyrrole toxins induce specific modifications of equine blood proteins that are detectable by proteomic methods. Results: One-dimensional gel electrophoresis revealed a significant alteration in the equine plasma protein profile following pyrrole exposure and the formation of a high molecular weight protein aggregate. Using mass spectrometry and confirmation by western blotting, the major components of this aggregate were identified as fibrinogen, serum albumin and transferrin. Conclusion: These findings demonstrate that pyrrolic metabolites can modify equine plasma proteins. The high molecular weight aggregate may result from extensive inter- and intra-molecular cross-linking of fibrinogen with the pyrrole. This model has the potential to form the basis of a novel proteomic strategy aimed at identifying surrogate protein biomarkers of ragwort exposure in horses and other livestock.

  8. Identification of novel cyclic nucleotide binding proteins in Trypanosoma cruzi.

    Science.gov (United States)

    Jäger, Adriana V; De Gaudenzi, Javier G; Mild, Jesica G; Mc Cormack, Bárbara; Pantano, Sergio; Altschuler, Daniel L; Edreira, Martin M

    2014-12-01

    Cyclic AMP has been implicated as a second messenger in a wide range of cellular processes. In the protozoan parasite Trypanosoma cruzi, cAMP is involved in the development of the parasite's life cycle. While cAMP effectors have been widely studied in other eukaryotic cells, little is known about cAMP's mechanism of action in T. cruzi. To date, only a cAMP-dependent protein kinase A (PKA) has been cloned and characterised in this parasite; however, experimental evidence indicates the existence of cAMP-dependent, PKA-independent events. In order to identify new cAMP binding proteins as potential cAMP effectors, we carried out in silico studies using the predicted T. cruzi proteome. Using a combination of search methods, 27 proteins with putative cNMP binding domains (CBDs) were identified. Phylogenetic analysis of the CBDs presented a homogeneous distribution, with sequences segregated into two main branches: one containing kinase-like proteins and the other gathering hypothetical proteins with different or no known functions. Comparative modelling of the strongest candidates provides support for the hypothesis that these proteins may give rise to structurally viable cyclic nucleotide binding domains. Pull-down and nucleotide displacement assays strongly suggest that TcCLB.508523.80 could bind cAMP and eventually be a new putative PKA-independent cAMP effector in T. cruzi.

  9. Identification of a conserved interface between PUF and CPEB proteins.

    Science.gov (United States)

    Campbell, Zachary T; Menichelli, Elena; Friend, Kyle; Wu, Joann; Kimble, Judith; Williamson, James R; Wickens, Marvin

    2012-05-25

    Members of the PUF (Pumilio and FBF) and CPEB (cytoplasmic polyadenylation element-binding) protein families collaborate to regulate mRNA expression throughout eukaryotes. Here, we focus on the physical interactions between members of these two families, concentrating on Caenorhabditis elegans FBF-2 and CPB-1. To localize the site of interaction on FBF-2, we identified conserved amino acids within C. elegans PUF proteins. Deletion of an extended loop containing several conserved residues abolished binding to CPB-1. We analyzed alanine substitutions at 13 individual amino acids in FBF-2, each identified via its conservation. Multiple single point mutations disrupted binding to CPB-1 but not to RNA. Position Tyr-479 was particularly critical as multiple substitutions to other amino acids at this position did not restore binding. The complex of FBF-2 and CPB-1 repressed translation of an mRNA containing an FBF binding element. Repression required both proteins and was disrupted by FBF-2 alleles that failed to bind CPB-1 or RNA. The equivalent loop in human PUM2 is required for binding to human CPEB3 in vitro, although the primary sequences of the human and C. elegans PUF proteins have diverged in that region. Our findings define a key region in PUF/CPEB interactions and imply a conserved platform through which PUF proteins interact with their protein partners.

  10. A new strategy for protein interface identification using manifold learning method.

    Science.gov (United States)

    Wang, Bing; Huang, De-Shuang; Jiang, Changjun

    2014-06-01

    Protein interactions play vital roles in biological processes. The study of protein interfaces helps to elucidate the mechanism of protein interaction. However, a large portion of protein interface data is incorrectly collected in current studies. In this paper, a novel strategy of dataset reconstruction using a manifold learning method is proposed for dealing with the noise in interaction interface data, where the interface is defined by the residue distances between the different chains within protein complexes. Three support vector machine-based predictors are constructed using different protein features to identify the functional sites involved in the formation of protein interfaces. The experimental results demonstrate that our strategy can remove noise and thereby improve the identification of protein interfaces, reaching 77.8% accuracy.
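    A distance-based interface definition of the kind mentioned above can be sketched as follows. This is only an illustration under stated assumptions: the 5.0 Å cutoff, the input layout (residue id mapped to a list of atom coordinates), and the function name are hypothetical and are not taken from the paper, which also goes on to denoise the resulting labels with manifold learning.

```python
import math


def interface_residues(chain_a, chain_b, cutoff=5.0):
    """Return the residues of each chain that lie within `cutoff` of the other.

    chain_a, chain_b -- dict mapping residue id to a list of (x, y, z) atoms
    cutoff           -- distance threshold in angstroms (assumed value)
    """
    in_a, in_b = set(), set()
    for res_a, atoms_a in chain_a.items():
        for res_b, atoms_b in chain_b.items():
            # A residue pair is in contact if any two atoms are close enough.
            if any(math.dist(p, q) <= cutoff for p in atoms_a for q in atoms_b):
                in_a.add(res_a)
                in_b.add(res_b)
    return in_a, in_b


# Toy coordinates (hypothetical): only A1 and B1 are close to each other.
chain_a = {"A1": [(0.0, 0.0, 0.0)], "A2": [(20.0, 0.0, 0.0)]}
chain_b = {"B1": [(3.0, 0.0, 0.0)], "B2": [(40.0, 0.0, 0.0)]}
contacts_a, contacts_b = interface_residues(chain_a, chain_b)
```

    Because the labels depend on a hard threshold, residues near the cutoff can flip between interface and non-interface, which is one way noise of the kind the abstract describes can enter such a dataset.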

  11. Identification of cancer protein biomarkers using proteomic techniques

    Energy Technology Data Exchange (ETDEWEB)

    Mor, Gil G; Ward, David C; Bray-Ward, Patricia

    2015-03-10

    The claimed invention describes methods to diagnose or aid in the diagnosis of cancer. The claimed methods are based on the identification of biomarkers which are particularly well suited to discriminate between cancer subjects and healthy subjects. These biomarkers were identified using a unique and novel screening method described herein. The biomarkers identified herein can also be used in the prognosis and monitoring of cancer. The invention comprises the use of leptin, prolactin, OPN and IGF-II for diagnosing, prognosis and monitoring of ovarian cancer.

  12. Identification of cancer protein biomarkers using proteomic techniques

    Energy Technology Data Exchange (ETDEWEB)

    Mor, Gil G. (Cheshire, CT); Ward, David C. (Las Vegas, NV); Bray-Ward, Patricia (Las Vegas, NV)

    2010-02-23

    The claimed invention describes methods to diagnose or aid in the diagnosis of cancer. The claimed methods are based on the identification of biomarkers which are particularly well suited to discriminate between cancer subjects and healthy subjects. These biomarkers were identified using a unique and novel screening method described herein. The biomarkers identified herein can also be used in the prognosis and monitoring of cancer. The invention comprises the use of leptin, prolactin, OPN and IGF-II for diagnosing, prognosis and monitoring of ovarian cancer.

  13. Identification of cancer protein biomarkers using proteomic techniques

    Energy Technology Data Exchange (ETDEWEB)

    Mor, Gil G.; Ward, David C.; Bray-Ward, Patricia

    2016-10-18

    The claimed invention describes methods to diagnose or aid in the diagnosis of cancer. The claimed methods are based on the identification of biomarkers which are particularly well suited to discriminate between cancer subjects and healthy subjects. These biomarkers were identified using a unique and novel screening method described herein. The biomarkers identified herein can also be used in the prognosis and monitoring of cancer. The invention comprises the use of leptin, prolactin, OPN and IGF-II for diagnosing, prognosis and monitoring of ovarian cancer.

  14. Identification of novel Drosophila centromere-associated proteins.

    Science.gov (United States)

    Barth, Teresa K; Schade, Georg O M; Schmidt, Andreas; Vetter, Irene; Wirth, Marc; Heun, Patrick; Thomae, Andreas W; Imhof, Axel

    2014-10-01

    Centromeres are chromosomal regions crucial for correct chromosome segregation during mitosis and meiosis. They are epigenetically defined by centromeric proteins such as the centromere-specific histone H3-variant centromere protein A (CENP-A). In humans, 16 additional proteins have been described to be constitutively associated with centromeres throughout the cell cycle, known as the constitutive centromere-associated network (CCAN). In contrast, only one additional constitutive centromeric protein is known in Drosophila melanogaster (D.mel), the conserved CCAN member CENP-C. To gain further insights into D.mel centromere composition and biology, we analyzed affinity-purified chromatin prepared from D.mel cell lines expressing green fluorescent protein tagged histone three variants by MS. In addition to already-known centromeric proteins, we identified novel factors that were repeatedly enriched in affinity purification-MS experiments. We analyzed the cellular localization of selected candidates by immunocytochemistry and confirmed localization to the centromere and other genomic regions for ten factors. Furthermore, RNA interference mediated depletion of CG2051, CG14480, and hyperplastic discs, three of our strongest candidates, leads to elevated mitotic defects. Knockdowns of these candidates neither impair the localization of several known kinetochore proteins nor CENP-A(CID) loading, suggesting their involvement in alternative pathways that contribute to proper centromere function. In summary, we provide a comprehensive analysis of the proteomic composition of Drosophila centromeres. All MS data have been deposited in the ProteomeXchange with identifier PXD000758 (http://proteomecentral.proteomexchange.org/dataset/PXD000758).

  15. Proteomic identification of secreted proteins of Propionibacterium acnes

    Directory of Open Access Journals (Sweden)

    Holland Carsten

    2010-08-01

    Background: The anaerobic Gram-positive bacterium Propionibacterium acnes is a human skin commensal that resides preferentially within sebaceous follicles; however, it also exhibits many traits of an opportunistic pathogen, playing roles in a variety of inflammatory diseases such as acne vulgaris. To date, the underlying disease-causing mechanisms remain ill-defined and knowledge of P. acnes virulence factors remains scarce. Here, we identified proteins secreted during anaerobic cultivation of a range of skin and clinical P. acnes isolates, spanning the four known phylogenetic groups. Results: Culture supernatant proteins of P. acnes were separated by two-dimensional electrophoresis (2-DE) and all Coomassie-stained spots were subsequently identified by MALDI mass spectrometry (MALDI-MS). A set of 20 proteins was secreted in the mid-exponential growth phase by the majority of strains tested. Functional annotation revealed that many of these common proteins possess degrading activities, including glycoside hydrolases with similarities to endoglycoceramidase, β-N-acetylglucosaminidase and muramidase; esterases such as lysophospholipase and triacylglycerol lipase; and several proteases. Other secreted factors included Christie-Atkins-Munch-Petersen (CAMP) factors, glyceraldehyde 3-phosphate dehydrogenase (GAPDH), and several hypothetical proteins, a few of which are unique to P. acnes. Strain-specific differences were apparent, mostly in the secretion of putative adhesins, whose genes exhibit variable phase variation-like sequence signatures. Conclusions: Our proteomic investigations have revealed that the P. acnes secretome harbors several proteins likely to play a role in host-tissue degradation and inflammation. Despite a large overlap between the secretomes of all four P. acnes phylotypes, distinct differences between predicted host-tissue interacting proteins were identified, providing potential insight into the differential virulence

  16. Identification of Anaplasma marginale type IV secretion system effector proteins.

    Directory of Open Access Journals (Sweden)

    Svetlana Lockwood

    BACKGROUND: Anaplasma marginale, an obligate intracellular alphaproteobacterium in the order Rickettsiales, is a tick-borne pathogen and the leading cause of anaplasmosis in cattle worldwide. Complete genome sequencing of A. marginale revealed that it has a type IV secretion system (T4SS). The T4SS is one of seven known types of secretion systems utilized by bacteria, with the type III and IV secretion systems particularly prevalent among pathogenic Gram-negative bacteria. The T4SS is predicted to play an important role in the invasion and pathogenesis of A. marginale by translocating effector proteins across its membrane into eukaryotic target cells. However, T4SS effector proteins have not been identified and tested in the laboratory until now. RESULTS: By combining computational methods with phylogenetic analysis and sequence identity searches, we identified a subset of potential T4SS effectors in A. marginale strain St. Maries and chose six for laboratory testing. Four (AM185, AM470, AM705 [AnkA], and AM1141) of these six proteins were translocated in a T4SS-dependent manner using Legionella pneumophila as a reporter system. CONCLUSIONS: The algorithm employed to find T4SS effector proteins in A. marginale identified four such proteins that were verified by laboratory testing. L. pneumophila was shown to work as a model system for A. marginale and thus can be used as a screening tool for A. marginale effector proteins. The first T4SS effector proteins for A. marginale have been identified in this work.

  17. Identification of Actin-Binding Proteins from Maize Pollen

    Energy Technology Data Exchange (ETDEWEB)

    Staiger, C.J.

    2004-01-13

    Specific Aims: The goal of this project was to gain an understanding of how actin filament organization and dynamics are controlled in flowering plants. Specifically, we proposed to identify unique proteins with novel functions by investigating biochemical strategies for the isolation and characterization of actin-binding proteins (ABPs). In particular, our hunt was designed to identify capping proteins and nucleation factors. The specific aims were: (1) to use F-actin affinity chromatography (FAAC) as a general strategy to isolate pollen ABPs; (2) to produce polyclonal antisera and perform subcellular localization in pollen tubes; (3) to isolate cDNA clones for the most promising ABPs; and (4) to further purify and characterize ABP interactions with actin in vitro. Summary of Progress: By employing affinity chromatography on F-actin or DNase I columns, we have identified at least two novel ABPs from pollen, PrABP80 (gelsolin-like) and ZmABP30. We have also cloned and expressed recombinant protein, as well as generated polyclonal antisera, for 6 interesting ABPs from Arabidopsis (fimbrin AtFIM1, capping protein a/b (AtCP), adenylyl cyclase-associated protein (AtCAP), AtCapG & AtVLN1). We performed quantitative analyses of the biochemical properties for two of these previously uncharacterized ABPs (fimbrin and capping protein). Our studies provide the first evidence for fimbrin activity in plants, demonstrate the existence of barbed-end capping factors and a gelsolin-like severing activity, and provide the quantitative data necessary to establish and test models of F-actin organization and dynamics in plant cells.

  18. Improved Recovery and Identification of Membrane Proteins from Rat Hepatic Cells using a Centrifugal Proteomic Reactor

    Science.gov (United States)

    Zhou, Hu; Wang, Fangjun; Wang, Yuwei; Ning, Zhibin; Hou, Weimin; Wright, Theodore G.; Sundaram, Meenakshi; Zhong, Shumei; Yao, Zemin; Figeys, Daniel

    2011-01-01

    Despite their importance in many biological processes, membrane proteins are underrepresented in proteomic analysis because of their poor solubility (hydrophobicity) and often low abundance. We describe a novel approach for the identification of plasma membrane proteins and intracellular microsomal proteins that combines membrane fractionation, a centrifugal proteomic reactor for streamlined protein extraction, protein digestion and fractionation by centrifugation, and high performance liquid chromatography-electrospray ionization-tandem MS. The performance of this approach was illustrated for the study of the proteome of ER and Golgi microsomal membranes in rat hepatic cells. The centrifugal proteomic reactor identified 945 plasma membrane proteins and 955 microsomal membrane proteins, of which 63 and 47% were predicted as bona fide membrane proteins, respectively. Among these proteins, >800 proteins were undetectable by the conventional in-gel digestion approach. The majority of the membrane proteins only identified by the centrifugal proteomic reactor were proteins with ≥2 transmembrane segments or proteins with high molecular mass (e.g. >150 kDa) and hydrophobicity. The improved proteomic reactor allowed the detection of a group of endocytic and/or signaling receptor proteins on the plasma membrane, as well as apolipoproteins and glycerolipid synthesis enzymes that play a role in the assembly and secretion of apolipoprotein B100-containing very low density lipoproteins. Thus, the centrifugal proteomic reactor offers a new analytical tool for structure and function studies of membrane proteins involved in lipid and lipoprotein metabolism. PMID:21749988

  19. Improved recovery and identification of membrane proteins from rat hepatic cells using a centrifugal proteomic reactor.

    Science.gov (United States)

    Zhou, Hu; Wang, Fangjun; Wang, Yuwei; Ning, Zhibin; Hou, Weimin; Wright, Theodore G; Sundaram, Meenakshi; Zhong, Shumei; Yao, Zemin; Figeys, Daniel

    2011-10-01

    Despite their importance in many biological processes, membrane proteins are underrepresented in proteomic analysis because of their poor solubility (hydrophobicity) and often low abundance. We describe a novel approach for the identification of plasma membrane proteins and intracellular microsomal proteins that combines membrane fractionation, a centrifugal proteomic reactor for streamlined protein extraction, protein digestion and fractionation by centrifugation, and high performance liquid chromatography-electrospray ionization-tandem MS. The performance of this approach was illustrated for the study of the proteome of ER and Golgi microsomal membranes in rat hepatic cells. The centrifugal proteomic reactor identified 945 plasma membrane proteins and 955 microsomal membrane proteins, of which 63 and 47% were predicted as bona fide membrane proteins, respectively. Among these proteins, >800 proteins were undetectable by the conventional in-gel digestion approach. The majority of the membrane proteins only identified by the centrifugal proteomic reactor were proteins with ≥ 2 transmembrane segments or proteins with high molecular mass (e.g. >150 kDa) and hydrophobicity. The improved proteomic reactor allowed the detection of a group of endocytic and/or signaling receptor proteins on the plasma membrane, as well as apolipoproteins and glycerolipid synthesis enzymes that play a role in the assembly and secretion of apolipoprotein B100-containing very low density lipoproteins. Thus, the centrifugal proteomic reactor offers a new analytical tool for structure and function studies of membrane proteins involved in lipid and lipoprotein metabolism.

  20. Large-format imaging plate and Weissenberg camera for accurate protein crystallographic data collection using synchrotron radiation.

    Science.gov (United States)

    Sakabe, K; Sasaki, K; Watanabe, N; Suzuki, M; Wang, Z G; Miyahara, J; Sakabe, N

    1997-05-01

    Off-line and on-line protein data-collection systems using an imaging plate as a detector are described and their components reported. The off-line scanner IPR4080 was developed for a large-format imaging plate 'BASIII' of dimensions 400 x 400 mm and 400 x 800 mm. The characteristics of this scanner are a dynamic range of 10^5 photons pixel^-1, low background noise and high sensitivity. A means of reducing electronic noise and a method for finding the origin of the noise are discussed in detail. A dedicated screenless Weissenberg camera matching IPR4080 with synchrotron radiation was developed and installed on beamline BL6B at the Photon Factory. This camera can attach one or two sheets of 400 x 800 mm large-format imaging plate inside the film cassette by evacuation. The positional reproducibility of the imaging plate on the cassette is so good that the data can be processed by batch job. Data of 93% completeness up to 1.6 Å resolution were collected on a single axis rotation, with an R_merge of 4%, from a tetragonal lysozyme crystal using a set of two imaging-plate sheets. Comparing two types of imaging plates, the signal-to-noise ratio of the ST-VIP-type imaging plate is 25% better than that of the BASIII-type imaging plate for protein data collection using 1.0 and 0.7 Å X-rays. A new on-line protein data-collection system with imaging plates is specially designed to use synchrotron radiation X-rays at maximum efficiency.

  1. Rapid and accurate identification of isolates of Candida species by melting peak and melting curve analysis of the internally transcribed spacer region 2 fragment (ITS2-MCA)

    NARCIS (Netherlands)

    Decat, E.; van Mechelen, E.; Saerens, B.; Vermeulen, S.J.T.; Boekhout, T.; de Blaiser, S.; Vaneechoutte, M.; Deschaght, P.

    2013-01-01

    Rapid identification of clinically important yeasts can facilitate the initiation of anti-fungal therapy, since susceptibility is largely species-dependent. We evaluated melting peak and melting curve analysis of the internally transcribed spacer region 2 fragment (ITS2-MCA) as an identification too

  2. Rapid and accurate identification of isolates of Candida species by melting peak and melting curve analysis of the internally transcribed spacer region 2 fragment (ITS2-MCA)

    NARCIS (Netherlands)

    Decat, E.; van Mechelen, E.; Saerens, B.; Vermeulen, S.J.T.; Boekhout, T.; de Blaiser, S.; Vaneechoutte, M.; Deschaght, P.

    2013-01-01

    Rapid identification of clinically important yeasts can facilitate the initiation of anti-fungal therapy, since susceptibility is largely species-dependent. We evaluated melting peak and melting curve analysis of the internally transcribed spacer region 2 fragment (ITS2-MCA) as an identification

  3. Identification of Redox and Glucose-Dependent Txnip Protein Interactions

    Directory of Open Access Journals (Sweden)

    Benjamin J. Forred

    2016-01-01

    Thioredoxin-interacting protein (Txnip) acts as a negative regulator of thioredoxin function and is a critical modulator of several diseases including, but not limited to, diabetes, ischemia-reperfusion cardiac injury, and carcinogenesis. Therefore, Txnip has become an attractive therapeutic target to alleviate disease pathologies. Although Txnip has been implicated in numerous cellular processes such as proliferation, fatty acid and glucose metabolism, inflammation, and apoptosis, the molecular mechanisms underlying these processes are largely unknown. The objective of these studies was to identify Txnip interacting proteins using the proximity-based labeling method, BioID, to understand differential regulation of pleiotropic Txnip cellular functions. The BioID transgene fused to Txnip expressed in HEK293 identified 31 interacting proteins. Many protein interactions were redox-dependent and were disrupted through mutation of a previously described reactive cysteine (C247S). Furthermore, we demonstrate that this model can be used to identify dynamic Txnip interactions due to known physiological regulators such as hyperglycemia. These data identify novel Txnip protein interactions and demonstrate dynamic interactions dependent on redox and glucose perturbations, providing clarification to the pleiotropic cellular functions of Txnip.

  4. Identification of giant Mimivirus protein functions using RNA interference

    Directory of Open Access Journals (Sweden)

    Haitham Sobhy

    2015-04-01

    Genomic analysis of giant viruses, such as Mimivirus, has revealed that more than half of the putative genes have no known functions (ORFans). We knocked down Mimivirus genes using short interfering RNA (siRNA) as a proof of concept to determine the functions of giant virus ORFans. As fibers are easy to observe, we targeted a gene encoding a protein absent in a Mimivirus mutant devoid of fibers as well as 3 genes encoding products identified in a protein concentrate of fibers, including one ORFan and one gene of unknown function. We found that knocking down these four genes was associated with depletion or modification of the fibers. Our strategy of silencing ORFan genes in giant viruses opens a way to identify their complete gene repertoire and may clarify the role of these genes, differentiating between junk DNA and truly used genes. Using this strategy, we were able to annotate 4 proteins in Mimivirus and 30 homologous proteins in other giant viruses. In addition, we were able to annotate >500 proteins from cellular organisms and 100 from metagenomic databases.

  5. Non-targeted identification of prions and amyloid-forming proteins from yeast and mammalian cells.

    Science.gov (United States)

    Kryndushkin, Dmitry; Pripuzova, Natalia; Burnett, Barrington G; Shewmaker, Frank

    2013-09-20

    The formation of amyloid aggregates is implicated both as a primary cause of cellular degeneration in multiple human diseases and as a functional mechanism for providing extraordinary strength to large protein assemblies. The recent identification and characterization of several amyloid proteins from diverse organisms argues that the amyloid phenomenon is widespread in nature. Yet identifying new amyloid-forming proteins usually requires a priori knowledge of specific candidates. Amyloid fibers can resist heat, pressure, proteolysis, and denaturation by reagents such as urea or sodium dodecyl sulfate. Here we show that these properties can be exploited to identify naturally occurring amyloid-forming proteins directly from cell lysates. This proteomic-based approach utilizes a novel purification of amyloid aggregates followed by identification by mass spectrometry without the requirement for special genetic tools. We have validated this technique by blind identification of three amyloid-based yeast prions from laboratory and wild strains and disease-related polyglutamine proteins expressed in both yeast and mammalian cells. Furthermore, we found that polyglutamine aggregates specifically recruit some stress granule components, revealing a possible mechanism of toxicity. Therefore, core amyloid-forming proteins as well as strongly associated proteins can be identified directly from cells of diverse origin.

  6. Proteome identification of proteins interacting with histone methyltransferase SET8

    Institute of Scientific and Technical Information of China (English)

    Yi Qin; Huafang Ouyang; Jing Liu; Youhua Xie

    2013-01-01

    SET8 (also known as PR-Set7/9, SETD8, KMT5A), a member of the SET domain containing methyltransferase family, which specifically catalyzes mono-methylation of K20 on histone H4 (H4K20me1), has been implicated in multiple biological processes, such as gene transcriptional regulation, cell cycle control, genomic integrity maintenance and development. In this study, we used GST-SET8 fusion protein as bait to search for SET8 interaction partners to elucidate physiological functions of SET8. In combination with mass spectrometry, we identified 40 proteins that potentially interact with SET8. DDX21, a nucleolar protein, was further confirmed to associate with SET8. Furthermore, we discovered a novel function of SET8 in the regulation of rRNA transcription.

  7. Identification of Disulfide Bonds in Protein Proteolytic Degradation Products Using de Novo-Protein Unique Sequence Tags Approach

    Energy Technology Data Exchange (ETDEWEB)

    Shen, Yufeng; Tolic, Nikola; Purvine, Samuel O.; Smith, Richard D.

    2010-08-01

    Disulfide bonds are a form of posttranslational modification that often determines protein structure(s) and function(s). In this work, we report a mass spectrometry method for identification of disulfides in degradation products of proteins, and specifically endogenous peptides in the human blood plasma peptidome. LC-Fourier transform tandem mass spectrometry (FT MS/MS) was used for acquiring mass spectra that were de novo sequenced and then searched against the IPI human protein database. Through the use of unique sequence tags (UStags) we unambiguously correlated the spectra to specific database proteins. Examination of the UStags’ prefix and/or suffix sequences that contain cysteine(s) in conjunction with sequences of the UStags-specified database proteins is shown to enable the unambiguous determination of disulfide bonds. Using this method, we identified the intermolecular and intramolecular disulfides in human blood plasma peptidome peptides that have molecular weights of up to ~10 kDa.

  8. Identification of Dominant Immunogenic Bacteria and Bacterial Proteins in Periodontitis

    DEFF Research Database (Denmark)

    Agerbæk, Mette Rylev; Haubek, Dorte; Birkelund, Svend

    Marginal periodontitis is considered an infectious disease that triggers host inflammatory responses resulting in destruction of the periodontium. A complex biofilm of bacteria is associated with periodontitis. Some species have been identified as putative pathogens such as Porphyromonas gingivalis...... (P.g) and Actinobacillus actinomycetemcomitans (A.a), but the identity of dominant immunogens of these bacteria is poorly elucidated. The aim of the study was to identify dominant immunogenic proteins of P.g and A.a in patients suffering from chronic and aggressive periodontitis by proteomic analysis...... will be able to identify immunodominant proteins and potentially important virulence factors of putative periodontal pathogens....

  9. Identification of Chlamydia trachomatis outer membrane complex proteins by differential proteomics.

    Science.gov (United States)

    Liu, Xiaoyun; Afrane, Mary; Clemmer, David E; Zhong, Guangming; Nelson, David E

    2010-06-01

    The extracellular chlamydial infectious particle, or elementary body (EB), is enveloped by an intra- and intermolecular cysteine cross-linked protein shell called the chlamydial outer membrane complex (COMC). A few abundant proteins, including the major outer membrane protein and cysteine-rich proteins (OmcA and OmcB), constitute the overwhelming majority of COMC proteins. The identification of less-abundant COMC proteins has been complicated by limitations of proteomic methodologies and the contamination of COMC fractions with abundant EB proteins. Here, we used parallel liquid chromatography-mass spectrometry/mass spectrometry (LC-MS/MS) analyses of Chlamydia trachomatis serovar L2 434/Bu EB, COMC, and Sarkosyl-soluble EB fractions to identify proteins enriched or depleted from COMC. All well-described COMC proteins were specifically enriched in the COMC fraction. In contrast, multiple COMC-associated proteins found in previous studies were strongly enriched in the Sarkosyl-soluble fraction, suggesting that these proteins are not COMC components or are not stably associated with COMC. Importantly, we also identified novel proteins enriched in COMC. The list of COMC proteins identified in this study has provided reliable information for further understanding chlamydial protein secretion systems and modeling COMC and EB structures.

  10. Screening and identification of proteins interacting with nucleostemin

    Institute of Scientific and Technical Information of China (English)

    Hai-Xia Yang; Geng-Lin Jin; Ling Meng; Jian-Zhi Zhang; Wen-Bin Liu; Cheng-Chao Shou

    2005-01-01

    AIM: To identify the proteins interacting with nucleostemin (NS), thereby gaining an insight into the function of NS. METHODS: Yeast two-hybrid assay was performed to screen a human placenta cDNA library with the full length of NS as a bait. X-Gal assay and β-galactosidase filter assay were subsequently conducted to check the positive clones and the gene was identified by DNA sequencing. To further confirm the interaction of two proteins, the DNA fragment coding NS and the DNA fragment isolated from the positive clone were inserted into the mammalian expression vector pcDNA3 and pcDNA3-myc, respectively. Then, two plasmids were cotransfected into the COS-7 cells by DEAE-dextran. The total protein from the cotransfected cells was extracted and coimmunoprecipitation and Western blot were performed with suitable antibodies sequentially. RESULTS: Two positive clones that interacted with NS were obtained from human placenta cDNA library. One was an alpha isoform of human protein phosphatase 2 regulatory subunit B (B56) (PPP2R5A) and the other was a novel gene being highly homologous to the gene associated with spondylo paralysis. The co-immunoprecipitation also showed that NS specifically interacted with PPP2R5A. CONCLUSION: NS and PPP2R5A interact in yeast and mammalian cells, respectively, which is helpful for addressing the function of NS in cancer development and progression.

  11. Identification of Dominant Immunogenic Bacteria and Bacterial Proteins in Periodontitis

    DEFF Research Database (Denmark)

    Agerbæk, Mette Rylev; Haubek, Dorte; Birkelund, Svend

    . 2-dimensional gel electrophoresis of outer membrane and secreted proteins from P.g strain W83, A.a strain HK1651, and Streptococcus gordonii strain SK7 was performed. The gels were blotted onto membranes and immunogens were detected by incubation with sera collected from patients, healthy controls...

  12. Identification of serum protein biomarkers for utrophin based DMD therapy

    Science.gov (United States)

    Guiraud, Simon; Edwards, Benjamin; Squire, Sarah E.; Babbs, Arran; Shah, Nandini; Berg, Adam; Chen, Huijia; Davies, Kay E.

    2017-01-01

    Despite promising therapeutic avenues, there is currently no effective treatment for Duchenne muscular dystrophy (DMD), a lethal monogenic disorder caused by the loss of the large cytoskeletal protein, dystrophin. A highly promising approach to therapy, applicable to all DMD patients irrespective of their genetic defect, is to modulate utrophin, a functional paralogue of dystrophin, able to compensate for the primary defects of DMD restoring sarcolemmal stability. One of the major difficulties in assessing the effectiveness of therapeutic strategies is to define appropriate outcome measures. In the present study, we utilised an aptamer based proteomics approach to profile 1,310 proteins in plasma of wild-type, mdx and Fiona (mdx overexpressing utrophin) mice. Comparison of the C57 and mdx sera revealed 83 proteins with statistically significant >2 fold changes in dystrophic serum abundance. A large majority of previously described biomarkers (ANP32B, THBS4, CAMK2A/B/D, CYCS, CAPNI) were normalised towards wild-type levels in Fiona animals. This work also identified potential mdx markers specific to increased utrophin (DUS3, TPI1) and highlights novel mdx biomarkers (GITR, MYBPC1, HSP60, SIRT2, SMAD3, CNTN1). We define a panel of putative protein mdx biomarkers to evaluate utrophin based strategies which may help to accelerate their translation to the clinic. PMID:28252048

  13. Identification of cell wall-associated proteins from Phytophthora ramorum

    NARCIS (Netherlands)

    Meijer, H.J.G.; Vondervoort, van de P.J.I.; Yin, Q.Y.; Koster, de C.G.; Klis, F.M.; Govers, F.; Groot, de P.W.J.

    2006-01-01

    The oomycete genus Phytophthora comprises a large group of fungal-like plant pathogens. Two Phytophthora genomes recently have been sequenced; one of them is the genome of Phytophthora ramorum, the causal agent of sudden oak death. During plant infection, extracellular proteins, either soluble secre

  14. Identification of differentially expressed proteins in vitamin B12

    Directory of Open Access Journals (Sweden)

    Swati Varshney

    2015-01-01

    Background: Vitamin B12 (cobalamin) is a water-soluble vitamin generally synthesized by microorganisms. Mammals cannot synthesize this vitamin but have evolved processes for absorption, transport and cellular uptake of this vitamin. Only about 30% of vitamin B12, which is bound to the protein transcobalamin (TC) (Holo-TC [HoloTC]), enters into the cell and hence is referred to as the biologically active form of vitamin B12. Vitamin B12 deficiency leads to several complex disorders, including neurological disorders and anemia. We had earlier shown that vitamin B12 deficiency is associated with coronary artery disease (CAD) in the Indian population. In the current study, using a proteomics approach we identified proteins that are differentially expressed in the plasma of individuals with low HoloTC levels. Materials and Methods: We used the isobaric tagging method for relative and absolute quantitation to identify proteins that are differentially expressed in individuals with low HoloTC levels when compared to those with normal HoloTC levels. Results: In two replicate isobaric tags for relative and absolute quantitation experiments, several proteins involved in lipid metabolism, blood coagulation, cholesterol metabolic process, and lipoprotein metabolic process were found to be altered in individuals having low HoloTC levels. Conclusions: Our study indicates that low HoloTC levels could be a risk factor in the development of CAD.

  15. Identification of cell wall-associated proteins from Phytophthora ramorum

    NARCIS (Netherlands)

    Meijer, H.J.G.; Vondervoort, van de P.J.I.; Yin, Q.Y.; Koster, de C.G.; Klis, F.M.; Govers, F.; Groot, de P.W.J.

    2006-01-01

    The oomycete genus Phytophthora comprises a large group of fungal-like plant pathogens. Two Phytophthora genomes recently have been sequenced; one of them is the genome of Phytophthora ramorum, the causal agent of sudden oak death. During plant infection, extracellular proteins, either soluble

  16. Identification of antigenic proteins of the nosocomial pathogen Klebsiella pneumoniae.

    Directory of Open Access Journals (Sweden)

    Sebastian Hoppe

    The continuous expansion of nosocomial infections around the globe has become a precarious situation. Key challenges include mounting dissemination of multiple resistances to antibiotics, the easy transmission and the growing mortality rates of hospital-acquired bacterial diseases. Thus, new ways to rapidly detect these infections are vital. Consequently, researchers around the globe pursue innovative approaches for point-of-care devices. In many cases the specific interaction of an antigen and a corresponding antibody is pivotal. However, the knowledge about suitable antigens is lacking. The aim of this study was to identify novel antigens as specific diagnostic markers. Additionally, these proteins might be aptly used for the generation of vaccines to improve current treatment options. Hence, a cDNA-based expression library was constructed and screened via microarrays to detect novel antigens of Klebsiella pneumoniae, a prominent agent of nosocomial infections well-known for its extensive antibiotic resistance, especially by extended-spectrum beta-lactamases (ESBL). After screening 1536 clones, 14 previously unknown immunogenic proteins were identified. Subsequently, each protein was expressed in full-length and its immunodominant character examined by ELISA and microarray analyses. Consequently, six proteins were selected for epitope mapping and three thereof possessed linear epitopes. After specificity analysis, homology survey and 3D structural modelling, one epitope sequence GAVVALSTTFA of KPN_00363, an ion channel protein, was identified harboring specificity for K. pneumoniae. The remaining epitopes showed ambiguous results regarding the specificity for K. pneumoniae. The approach adopted herein has been successfully utilized to discover novel antigens of Campylobacter jejuni and Salmonella enterica before. Now, we have transferred this knowledge to the key nosocomial agent, K. pneumoniae. By identifying several novel antigens

  17. Identification of Antigenic Proteins of the Nosocomial Pathogen Klebsiella pneumoniae

    Science.gov (United States)

    Hoppe, Sebastian; Bier, Frank F.; von Nickisch-Rosenegk, Markus

    2014-01-01

    The continuous expansion of nosocomial infections around the globe has become a precarious situation. Key challenges include mounting dissemination of multiple resistances to antibiotics, the easy transmission and the growing mortality rates of hospital-acquired bacterial diseases. Thus, new ways to rapidly detect these infections are vital. Consequently, researchers around the globe pursue innovative approaches for point-of-care devices. In many cases the specific interaction of an antigen and a corresponding antibody is pivotal. However, the knowledge about suitable antigens is lacking. The aim of this study was to identify novel antigens as specific diagnostic markers. Additionally, these proteins might be aptly used for the generation of vaccines to improve current treatment options. Hence, a cDNA-based expression library was constructed and screened via microarrays to detect novel antigens of Klebsiella pneumoniae, a prominent agent of nosocomial infections well-known for its extensive antibiotic resistance, especially by extended-spectrum beta-lactamases (ESBL). After screening 1536 clones, 14 previously unknown immunogenic proteins were identified. Subsequently, each protein was expressed in full-length and its immunodominant character examined by ELISA and microarray analyses. Consequently, six proteins were selected for epitope mapping and three thereof possessed linear epitopes. After specificity analysis, homology survey and 3D structural modelling, one epitope sequence GAVVALSTTFA of KPN_00363, an ion channel protein, was identified harboring specificity for K. pneumoniae. The remaining epitopes showed ambiguous results regarding the specificity for K. pneumoniae. The approach adopted herein has been successfully utilized to discover novel antigens of Campylobacter jejuni and Salmonella enterica before. Now, we have transferred this knowledge to the key nosocomial agent, K. pneumoniae. By identifying several novel antigens and their linear

  18. Extraction and identification of membrane proteins from black widow spider eggs.

    Science.gov (United States)

    Fu, Si-Ling; Li, Jiang-Lin; Chen, Jia; Wang, Qiu-Ting; Li, Jian-Jun; Wang, Xian-Chun

    2015-07-18

    The eggs of oviparous animals are storehouses of maternal proteins required for embryonic development. Identification and molecular characterization of such proteins will provide much insight into the regulation of embryonic development. We previously analyzed soluble proteins in the eggs of the black widow spider (Latrodectus tredecimguttatus), and report here on the extraction and mass spectrometric identification of the egg membrane proteins. Comparison of different lysis solutions indicated that the highest extraction of the membrane proteins was achieved with 3%-4% sodium laurate in 40 mmol/L Tris-HCl buffer containing 4% CHAPS and 2% DTT (pH 7.4). SDS-PAGE combined with nLC-MS/MS identified 39 proteins with membrane-localization annotation, including those with structural, catalytic, and regulatory activities. Nearly half of the identified membrane proteins were metabolic enzymes involved in various cellular processes, particularly energy metabolism and biosynthesis, suggesting that relevant metabolic processes were active during the embryonic development of the eggs. Several identified cell membrane proteins were involved in the special structure formation and function of the egg cell membranes. The present proteomic analysis of the egg membrane proteins provides new insight into the molecular mechanisms of spider embryonic development.

  19. Identification of proteins interacting with ammodytoxins in Vipera ammodytes ammodytes venom by immuno-affinity chromatography.

    Science.gov (United States)

    Brgles, Marija; Kurtović, Tihana; Kovačič, Lidija; Križaj, Igor; Barut, Miloš; Lang Balija, Maja; Allmaier, Günter; Marchetti-Deschmann, Martina; Halassy, Beata

    2014-01-01

    In order to perform their function, proteins frequently interact with other proteins. Various methods are used to reveal protein interacting partners, and affinity chromatography is one of them. Snake venom is composed mostly of proteins, and various protein complexes in the venom have been found to exhibit higher toxicity levels than respective components separately. Complexes can modulate envenomation activity of a venom and/or potentiate its effect. Our previous data indicate that the most toxic components of the Vipera ammodytes ammodytes (Vaa) venom isolated so far, the ammodytoxins (Atxs), contribute only moderately to the venom's toxicity; therefore, we aimed to explore whether they have some interacting partner(s) potentiating toxicity. For screening of possible interactions, immuno-affinity chromatography combined with identification by mass spectrometry was used. Various chemistries (epoxy, carbonyldiimidazole, ethylenediamine) as well as protein G functionality were used to immobilize antibodies on monolith support, a Convective Interaction Media disk. Monoliths have been demonstrated to better suit the separation of large biomolecules. Using such an approach, several proteins were indicated as potential Atx-binding proteins. Among these, the interaction of Atxs with a Kunitz-type inhibitor was confirmed by far-Western dot-blot and surface plasmon resonance measurement. It can be concluded that affinity chromatography on monolithic columns combined with mass spectrometry identification is a successful approach for screening of protein interactions, and it resulted in detection of the interaction of Atx with a Kunitz-type inhibitor in Vaa venom for the first time.

  20. A novel protein complex identification algorithm based on Connected Affinity Clique Extension (CACE).

    Science.gov (United States)

    Li, Peng; He, Tingting; Hu, Xiaohua; Zhao, Junmin; Shen, Xianjun; Zhang, Ming; Wang, Yan

    2014-06-01

    A novel algorithm based on Connected Affinity Clique Extension (CACE) for mining overlapping functional modules in protein interaction networks is proposed in this paper. In this approach, the value of protein connected affinity, which is inferred from protein complexes, is interpreted as the reliability and possibility of interaction. The protein interaction network is constructed as a weighted graph, and the weight is dependent on the connected affinity coefficient. Experimental results on two test data sets show that CACE detects functional modules more effectively and accurately than the other state-of-the-art algorithms CPM and IPC-MCE.
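
    The abstract describes the network as a weighted graph from which modules are grown. The Python sketch below is not the published CACE algorithm: the abstract does not define the connected affinity coefficient or the extension rule, so the edge weights, the mean-weight score, the threshold, and the names `build_weighted_graph`, `mean_affinity`, `extend_module`, and `TOY_EDGES` are all assumptions made here to illustrate a generic greedy seed extension on a weighted graph.

```python
# Generic greedy module extension on a weighted graph.
# NOT the published CACE method: weights, score, and threshold are hypothetical.

# Toy interactions (protein_a, protein_b, weight); the weights are invented.
TOY_EDGES = [
    ("A", "B", 0.90),
    ("A", "C", 0.80),
    ("B", "C", 0.85),
    ("C", "D", 0.20),
    ("D", "E", 0.90),
]


def build_weighted_graph(edges):
    """Return an undirected adjacency map: node -> {neighbour: weight}."""
    graph = {}
    for a, b, weight in edges:
        graph.setdefault(a, {})[b] = weight
        graph.setdefault(b, {})[a] = weight
    return graph


def mean_affinity(graph, node, module):
    """Average edge weight from node to the module; missing edges count as 0."""
    return sum(graph[node].get(member, 0.0) for member in module) / len(module)


def extend_module(graph, seed, threshold):
    """Grow a module from seed by repeatedly adding the best-scoring neighbour."""
    module = set(seed)
    while True:
        # Candidates are neighbours of current members that are not yet members.
        candidates = {n for member in module for n in graph[member]} - module
        if not candidates:
            return module
        # Ties are broken by node name so the result is deterministic.
        best = max(candidates, key=lambda n: (mean_affinity(graph, n, module), n))
        if mean_affinity(graph, best, module) < threshold:
            return module
        module.add(best)


if __name__ == "__main__":
    g = build_weighted_graph(TOY_EDGES)
    print(sorted(extend_module(g, ("A", "B"), threshold=0.5)))
    print(sorted(extend_module(g, ("D", "E"), threshold=0.5)))
```

    Running the extension from several seeds yields one module per seed, and nothing in this sketch prevents a protein from appearing in more than one module, which is how overlapping modules can arise in this style of method.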

  1. THE IDENTIFICATION AND CHARACTERIZATION OF AN IGE-INDUCING PROTEIN IN METARHIZIUM ANISOPLIAE EXTRACT

    Science.gov (United States)

    The Identification and Characterization of an IgE-Inducing Protein in Metarhizium anisopliae Extract. Marsha D.W. Ward (1), Lisa B. Copeland (1), Maura J. Donahue (2), and Jody A. Shoemaker (3). (1) ORD, NHEERL, US EPA, RTP, NC; (2) Oak Ridge Institute for Science and Education, Cincinnati...

  2. Application of difference gel electrophoresis to the identification of inner medullary collecting duct proteins

    NARCIS (Netherlands)

    Hoffert, J.D.; Balkom, B.W.M. van; Chou, C.L.; Knepper, M.A.

    2004-01-01

    In this study, we present a standardized approach to purification of native inner medullary collecting duct (IMCD) cells from rat kidney for proteomic analysis and apply the approach to identification of abundant proteins utilizing two-dimensional difference gel electrophoresis (DIGE) coupled with m

  3. Identification of a putative protein-profile associating with tamoxifen therapy-resistance in breast cancer

    NARCIS (Netherlands)

    A. Umar (Arzu); J.W.M. Martens (John); J.A. Foekens (John); L. Paša-Tolić (Ljiljana); H. Kang; A.M. Timmermans (Mieke); M.P. Look (Maxime); M.E. Meijer van Gelder (Marion); N. Jaitly (Navdeep); M.A. den Bakker (Michael)

    2009-01-01

    Tamoxifen-resistance is a major cause of death in patients with recurrent breast cancer. Current clinical parameters can correctly predict therapy response in only half of the treated patients. Identification of proteins that associate with tamoxifen-resistance is a first step towards

  4. Identification of a 5-protein biomarker molecular signature for predicting Alzheimer's disease.

    Directory of Open Access Journals (Sweden)

    Martín Gómez Ravetti

    BACKGROUND: Alzheimer's disease (AD) is a progressive brain disease with a huge cost to human lives. The impact of the disease is also a growing concern for the governments of developing countries, in particular due to the increasingly high number of elderly citizens at risk. Alzheimer's is the most common form of dementia, a common term for memory loss and other cognitive impairments. There is no current cure for AD, but there are drug and non-drug based approaches for its treatment. In general the drug-treatments are directed at slowing the progression of symptoms. They have proved to be effective in a large group of patients but success is directly correlated with identifying the disease carriers at its early stages. This justifies the need for timely and accurate forms of diagnosis via molecular means. We report here a 5-protein biomarker molecular signature that achieves, on average, a 96% total accuracy in predicting clinical AD. The signature is composed of the abundances of IL-1alpha, IL-3, EGF, TNF-alpha and G-CSF. METHODOLOGY/PRINCIPAL FINDINGS: Our results are based on a recent molecular dataset that has attracted worldwide attention. Our paper illustrates that improved results can be obtained with the abundance of only five proteins. Our methodology consisted of the application of an integrative data analysis method. This four-step process included: (a) abundance quantization, (b) feature selection, (c) literature analysis, (d) selection of a classifier algorithm which is independent of the feature selection process. These steps were performed without using any sample of the test datasets. For the first two steps, we used the application of Fayyad and Irani's discretization algorithm for selection and quantization, which in turn creates an instance of the (alpha-beta)-k-Feature Set problem; a numerical solution of this problem led to the selection of only 10 proteins. CONCLUSIONS/SIGNIFICANCE: the previous study has provided an extremely

  5. Novel Accurate Bacterial Discrimination by MALDI-Time-of-Flight MS Based on Ribosomal Proteins Coding in S10-spc-alpha Operon at Strain Level S10-GERMS

    Science.gov (United States)

    Tamura, Hiroto; Hotta, Yudai; Sato, Hiroaki

    2013-08-01

    Matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) is one of the most widely used mass-based approaches for bacterial identification and classification because of the simple sample preparation and extremely rapid analysis within a few minutes. To establish the accurate MALDI-TOF MS bacterial discrimination method at strain level, the ribosomal subunit proteins coded in the S10-spc-alpha operon, which encodes half of the ribosomal subunit protein and is highly conserved in eubacterial genomes, were selected as reliable biomarkers. This method, named the S10-GERMS method, revealed that the strains of genus Pseudomonas were successfully identified and discriminated at species and strain levels, respectively; therefore, the S10-GERMS method was further applied to discriminate the pathovar of P. syringae. The eight selected biomarkers (L24, L30, S10, S12, S14, S16, S17, and S19) suggested the rapid discrimination of P. syringae at the strain (pathovar) level. The S10-GERMS method appears to be a powerful tool for rapid and reliable bacterial discrimination and successful phylogenetic characterization. In this article, an overview of the utilization of results from the S10-GERMS method is presented, highlighting the characterization of the Lactobacillus casei group and discrimination of the bacteria of genera Bacillus and Sphingopyxis despite only two and one base difference in the 16S rRNA gene sequence, respectively.

  6. Novel accurate bacterial discrimination by MALDI-time-of-flight MS based on ribosomal proteins coding in S10-spc-alpha operon at strain level S10-GERMS.

    Science.gov (United States)

    Tamura, Hiroto; Hotta, Yudai; Sato, Hiroaki

    2013-08-01

    Matrix-assisted laser-desorption/ionization time-of-flight mass spectrometry (MALDI-TOF MS) is one of the most widely used mass-based approaches for bacterial identification and classification because of the simple sample preparation and extremely rapid analysis within a few minutes. To establish the accurate MALDI-TOF MS bacterial discrimination method at strain level, the ribosomal subunit proteins coded in the S10-spc-alpha operon, which encodes half of the ribosomal subunit protein and is highly conserved in eubacterial genomes, were selected as reliable biomarkers. This method, named the S10-GERMS method, revealed that the strains of genus Pseudomonas were successfully identified and discriminated at species and strain levels, respectively; therefore, the S10-GERMS method was further applied to discriminate the pathovar of P. syringae. The eight selected biomarkers (L24, L30, S10, S12, S14, S16, S17, and S19) suggested the rapid discrimination of P. syringae at the strain (pathovar) level. The S10-GERMS method appears to be a powerful tool for rapid and reliable bacterial discrimination and successful phylogenetic characterization. In this article, an overview of the utilization of results from the S10-GERMS method is presented, highlighting the characterization of the Lactobacillus casei group and discrimination of the bacteria of genera Bacillus and Sphingopyxis despite only two and one base difference in the 16S rRNA gene sequence, respectively.

  7. A new coarse-grained model for E. coli cytoplasm: accurate calculation of the diffusion coefficient of proteins and observation of anomalous diffusion.

    Directory of Open Access Journals (Sweden)

    Sabeeha Hasnain

    A new coarse-grained model of the E. coli cytoplasm is developed by describing the proteins of the cytoplasm as flexible units consisting of one or more spheres that follow Brownian dynamics (BD), with hydrodynamic interactions (HI) accounted for by a mean-field approach. Extensive BD simulations were performed to calculate the diffusion coefficients of three different proteins in the cellular environment. The results are in close agreement with experimental or previously simulated values, where available. Control simulations without HI showed that use of HI is essential to obtain accurate diffusion coefficients. Anomalous diffusion inside the crowded cellular medium was investigated with fractional Brownian motion analysis, and found to be present in this model. By running a series of control simulations in which various forces were removed systematically, it was found that repulsive interactions (volume exclusion) are the main cause of anomalous diffusion, with a secondary contribution from HI.
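
    For orientation, the quantities named in this abstract are usually related through the mean-squared displacement. The relations below are standard textbook definitions, not equations taken from the paper; the symbols D (diffusion coefficient), τ (lag time), α (anomalous exponent), and H (Hurst exponent) are introduced here for illustration only.

```latex
% Normal (Fickian) diffusion in three dimensions:
\[
  \left\langle \lvert \mathbf{r}(t+\tau) - \mathbf{r}(t) \rvert^{2} \right\rangle = 6 D \tau
\]
% Anomalous diffusion: the mean-squared displacement follows a power law;
% alpha < 1 corresponds to subdiffusion.
\[
  \left\langle \lvert \mathbf{r}(t+\tau) - \mathbf{r}(t) \rvert^{2} \right\rangle \propto \tau^{\alpha},
  \qquad \alpha \neq 1
\]
% For fractional Brownian motion with Hurst exponent H:
\[
  \alpha = 2H
\]
```

    Under these definitions, a diffusion coefficient is read from the slope of the mean-squared displacement against lag time, and a log-log slope different from 1 signals anomalous diffusion.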

  8. Identification and characterization of immunogenic proteins of Mycoplasma genitalium

    DEFF Research Database (Denmark)

    Svenstrup, Helle Friis; Jensen, J.S.; Gevaert, K.

    2006-01-01

    . genitalium strains were isolated (J. S. Jensen, H. T. Hansen, and K. Lind, J. Clin. Microbiol. 34:286-291, 1996). The objective of this study was to characterize immunogenic proteins of M. genitalium by sodium dodecyl sulfate-polyacrylamide gel electrophoresis and immunoblotting by using a hyperimmune rabbit.......0383) and they had significantly higher antibody titers. By use of the rMgPa ELISA, this study further substantiates the importance of M. genitalium as a cause of male urethritis....

  9. MALDI-TOF MS is more accurate than VITEK II ANC card and API Rapid ID 32 A system for the identification of Clostridium species.

    Science.gov (United States)

    Kim, Young Jin; Kim, Si Hyun; Park, Hyun-Jung; Park, Hae-Geun; Park, Dongchul; Song, Sae Am; Lee, Hee Joo; Yong, Dongeun; Choi, Jun Yong; Kook, Joong-Ki; Kim, Hye Ran; Shin, Jeong Hwan

    2016-08-01

    All 50 Clostridium difficile strains were definitely identified by Vitek2 system, Rapid ID 32A system, and MALDI-TOF. For 18 non-difficile Clostridium strains, the identification results were correct in 0, 2, and 17 strains by Vitek2, Rapid ID 32A, and MALDI-TOF, respectively. MALDI-TOF could be used as the primary tool for identification of Clostridium species.
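
    The counts reported above can be turned into per-system identification rates. The Python sketch below only illustrates that arithmetic; the data layout and the names `CORRECT`, `identification_rate`, and `summarize` are assumptions made here, while the counts (50 of 50 C. difficile strains for all three systems; 0, 2, and 17 of 18 non-difficile strains) come from the abstract.

```python
# Counts are taken from the abstract; the data layout is an illustrative assumption.
N_DIFFICILE = 50
N_NON_DIFFICILE = 18

# system -> (correct C. difficile identifications, correct non-difficile identifications)
CORRECT = {
    "Vitek2": (50, 0),
    "Rapid ID 32A": (50, 2),
    "MALDI-TOF": (50, 17),
}


def identification_rate(n_correct, n_total):
    """Percentage of strains identified correctly."""
    return 100.0 * n_correct / n_total


def summarize(counts):
    """Return non-difficile and overall identification rates per system."""
    rows = {}
    for system, (difficile_ok, non_difficile_ok) in counts.items():
        rows[system] = {
            "non_difficile_pct": identification_rate(non_difficile_ok, N_NON_DIFFICILE),
            "overall_pct": identification_rate(
                difficile_ok + non_difficile_ok, N_DIFFICILE + N_NON_DIFFICILE
            ),
        }
    return rows


if __name__ == "__main__":
    for system, row in summarize(CORRECT).items():
        print(
            f"{system}: non-difficile {row['non_difficile_pct']:.1f}%, "
            f"overall {row['overall_pct']:.1f}%"
        )
```

    Separating the non-difficile rate from the overall rate makes the difference between systems visible, since all three systems identified every C. difficile strain.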

  10. Accurate small and wide angle x-ray scattering profiles from atomic models of proteins and nucleic acids

    Energy Technology Data Exchange (ETDEWEB)

    Nguyen, Hung T. [BioMaPS Institute for Quantitative Biology, Rutgers University, Piscataway, New Jersey 08854 (United States); Pabit, Suzette A.; Meisburger, Steve P.; Pollack, Lois [School of Applied and Engineering Physics, Cornell University, Ithaca, New York 14853 (United States); Case, David A., E-mail: case@biomaps.rutgers.edu [BioMaPS Institute for Quantitative Biology, Rutgers University, Piscataway, New Jersey 08854 (United States); Department of Chemistry and Chemical Biology, Rutgers University, Piscataway, New Jersey 08854 (United States)

    2014-12-14

    A new method is introduced to compute X-ray solution scattering profiles from atomic models of macromolecules. The three-dimensional version of the Reference Interaction Site Model (RISM) from liquid-state statistical mechanics is employed to compute the solvent distribution around the solute, including both water and ions. X-ray scattering profiles are computed from this distribution together with the solute geometry. We describe an efficient procedure for performing this calculation employing a Lebedev grid for the angular averaging. The intensity profiles (which involve no adjustable parameters) match experiment and molecular dynamics simulations up to wide angle for two proteins (lysozyme and myoglobin) in water, as well as the small-angle profiles for a dozen biomolecules taken from the BioIsis.net database. The RISM model is especially well-suited for studies of nucleic acids in salt solution. Use of fiber-diffraction models for the structure of duplex DNA in solution yields close agreement with the observed scattering profiles in both the small and wide angle scattering (SAXS and WAXS) regimes. In addition, computed profiles of anomalous SAXS signals (for Rb⁺ and Sr²⁺) emphasize the ionic contribution to scattering and are in reasonable agreement with experiment. In cases where an absolute calibration of the experimental data at q = 0 is available, one can extract a count of the excess number of waters and ions; computed values depend on the closure that is assumed in the solution of the Ornstein–Zernike equations, with results from the Kovalenko–Hirata closure being closest to experiment for the cases studied here.

  11. Protein markers for identification of Yersinia pestis and their variation related to culture

    Energy Technology Data Exchange (ETDEWEB)

    Wunschel, David S. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Engelmann, Heather E. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Victry, Kristin D. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Clowers, Brian H. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Sorensen, Christina M. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Valentine, Nancy B. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Mahoney Fahey, Christine M. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Wietsma, Thomas W. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Wahl, Karen L. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

    2013-12-11

    The detection of high consequence pathogens, such as Yersinia pestis, is well established in biodefense laboratories for bioterror situations. Laboratory protocols are well established using specified culture media and a growth temperature of 37 °C for expression of specific antigens. Direct detection of Y. pestis protein markers, without prior culture, depends on their expression. Unfortunately protein expression can be impacted by the culture medium which cannot be predicted ahead of time. Furthermore, higher biomass yields are obtained at the optimal growth temperature (i.e. 28 °C–30 °C) and therefore are more likely to be used for bulk production. Analysis of Y. pestis grown on several types of media at 30 °C showed that several protein markers were found to be differentially detected in different media. Analysis of the identified proteins against a comprehensive database provided an additional level of organism identification. Peptides corresponding to variable regions of some proteins could separate large groups of strains and aid in organism identification. This work illustrates the need to understand variability of protein expression for detection targets. The potential for relating expression changes of known proteins to specific media factors, even in nutrient rich and chemically complex culture medium, may provide the opportunity to draw forensic information from protein profiles.

  12. Extraction methods of red blood cell membrane proteins for Multidimensional Protein Identification Technology (MudPIT) analysis.

    Science.gov (United States)

    De Palma, Antonella; Roveri, Antonella; Zaccarin, Mattia; Benazzi, Louise; Daminelli, Simone; Pantano, Giorgia; Buttarello, Mauro; Ursini, Fulvio; Gion, Massimo; Mauri, Pier Luigi

    2010-08-13

    Since red blood cells (RBCs) lack nuclei and organelles, the cell membrane is their main load-bearing component and, through a dynamic interaction with the cytoskeleton compartment, plays a pivotal role in their functioning. Even if erythrocyte membranes are available in large quantities, the low abundance and the hydrophobic nature of cell membrane proteins complicate their purification and detection by conventional 2D gel-based proteomic approaches. So, in order to increase the efficiency of RBC membrane proteome identification, here we took advantage of a simple and reproducible membrane sub-fractionation method coupled to Multidimensional Protein Identification Technology (MudPIT). In addition, the adoption of a stringent strategy for filtering RBCs from whole blood made it possible to exhaustively remove contaminants, such as platelets and white blood cells, and to identify a total of 275 proteins in the three RBC membrane fractions collected and analysed. Finally, by means of software for processing the large quantity of data obtained and programs for statistical analysis and protein classification, it was possible to determine the validity of the entire system workflow and to assign the proper sub-cellular localization and function for the greatest number of the identified proteins.

  13. Accurate flexible fitting of high-resolution protein structures to small-angle x-ray scattering data using a coarse-grained model with implicit hydration shell.

    Science.gov (United States)

    Zheng, Wenjun; Tekpinar, Mustafa

    2011-12-21

    Small-angle x-ray scattering (SAXS) is a powerful technique widely used to explore conformational states and transitions of biomolecular assemblies in solution. For accurate model reconstruction from SAXS data, one promising approach is to flexibly fit a known high-resolution protein structure to low-resolution SAXS data by computer simulations. This is a highly challenging task due to low information content in SAXS data. To meet this challenge, we have developed what we believe to be a novel method based on a coarse-grained (one-bead-per-residue) protein representation and a modified form of the elastic network model that allows large-scale conformational changes while maintaining pseudobonds and secondary structures. Our method optimizes a pseudoenergy that combines the modified elastic-network model energy with a SAXS-fitting score and a collision energy that penalizes steric collisions. Our method uses what we consider a new implicit hydration shell model that accounts for the contribution of hydration shell to SAXS data accurately without explicitly adding waters to the system. We have rigorously validated our method using five test cases with simulated SAXS data and three test cases with experimental SAXS data. Our method has successfully generated high-quality structural models with root mean-squared deviation of 1 ∼ 3 Å from the target structures.

  14. Identification of Novel Immunogenic Proteins of Neisseria gonorrhoeae by Phage Display.

    Directory of Open Access Journals (Sweden)

    Daniel O Connor

    Full Text Available Neisseria gonorrhoeae is one of the most prevalent sexually transmitted diseases worldwide with more than 100 million new infections per year. A lack of intense research over the last decades and increasing resistances to the recommended antibiotics call for a better understanding of gonococcal infection, fast diagnostics and therapeutic measures against N. gonorrhoeae. Therefore, the aim of this work was to identify novel immunogenic proteins as a first step to advance those unresolved problems. For the identification of immunogenic proteins, pHORF oligopeptide phage display libraries of the entire N. gonorrhoeae genome were constructed. Several immunogenic oligopeptides were identified using polyclonal rabbit antibodies against N. gonorrhoeae. Corresponding full-length proteins of the identified oligopeptides were expressed and their immunogenic character was verified by ELISA. The immunogenic character of six proteins was identified for the first time. Additional 13 proteins were verified as immunogenic proteins in N. gonorrhoeae.

  15. Identification of Novel Immunogenic Proteins of Neisseria gonorrhoeae by Phage Display.

    Science.gov (United States)

    Connor, Daniel O; Zantow, Jonas; Hust, Michael; Bier, Frank F; von Nickisch-Rosenegk, Markus

    2016-01-01

    Neisseria gonorrhoeae is one of the most prevalent sexually transmitted diseases worldwide with more than 100 million new infections per year. A lack of intense research over the last decades and increasing resistances to the recommended antibiotics call for a better understanding of gonococcal infection, fast diagnostics and therapeutic measures against N. gonorrhoeae. Therefore, the aim of this work was to identify novel immunogenic proteins as a first step to advance those unresolved problems. For the identification of immunogenic proteins, pHORF oligopeptide phage display libraries of the entire N. gonorrhoeae genome were constructed. Several immunogenic oligopeptides were identified using polyclonal rabbit antibodies against N. gonorrhoeae. Corresponding full-length proteins of the identified oligopeptides were expressed and their immunogenic character was verified by ELISA. The immunogenic character of six proteins was identified for the first time. Additional 13 proteins were verified as immunogenic proteins in N. gonorrhoeae.

  16. Identification of Thylakoid Membrane Protein Complexes by Using a BN-Chip/MS Approach

    Institute of Scientific and Technical Information of China (English)

    Longquan Fan; Yinghong Pan

    2012-01-01

    Thylakoid membrane protein complexes of wheat (Triticum aestivum Linn.) play crucial roles in growth and crop production. Knowledge of the composition and structure of protein complexes, as well as protein interactions, will result in a much deeper understanding of metabolic pathways and cellular processes than protein identities alone, especially if the complexes can be separated in their native forms. However, the analysis of membrane protein complexes is a significant challenge due to their hydrophobic properties and relatively low abundance. A rapid and efficient method of identifying membrane protein complexes will greatly facilitate agricultural research. The present work developed a BN-Chip/MS approach for exhaustive separation and identification of protein complexes by combining blue-native polyacrylamide gel electrophoresis (BN-PAGE) and chip-based high-performance liquid chromatography quadrupole time-of-flight tandem mass spectrometry (HPLC-Chip/ESI-QTOF-MS, Chip/MS). By using this approach, seventy-five nonredundant proteins of wheat thylakoid membrane complexes were identified from 13 digested bands of the BN gel. When the BN separation protocol was not used, only 37 nonredundant proteins were identified, and among them 9 proteins were uniquely identified. This BN-Chip/MS approach is rapid and efficient for identifying protein complexes in wheat thylakoid membranes, and also provides a reliable foundation for further functional research on wheat chloroplasts and for identifying protein complexes of other species.

  17. Advances in identification and validation of protein targets of natural products without chemical modification.

    Science.gov (United States)

    Chang, J; Kim, Y; Kwon, H J

    2016-05-04

    Covering: up to February 2016. Identification of the target proteins of natural products is pivotal to understanding the mechanisms of action to develop natural products for use as molecular probes and potential therapeutic drugs. Affinity chromatography of immobilized natural products has been conventionally used to identify target proteins, and has yielded good results. However, this method has limitations, in that labeling or tagging for immobilization and affinity purification often result in reduced or altered activity of the natural product. New strategies have recently been developed and applied to identify the target proteins of natural products and synthetic small molecules without chemical modification of the natural product. These direct and indirect methods for target identification of label-free natural products include drug affinity responsive target stability (DARTS), stability of proteins from rates of oxidation (SPROX), cellular thermal shift assay (CETSA), thermal proteome profiling (TPP), and bioinformatics-based analysis of connectivity. This review focuses on and reports case studies of the latest advances in target protein identification methods for label-free natural products. The integration of newly developed technologies will provide new insights and highlight the value of natural products for use as biological probes and new drug candidates.

  18. Small acid soluble proteins for rapid spore identification.

    Energy Technology Data Exchange (ETDEWEB)

    Branda, Steven S.; Lane, Todd W.; VanderNoot, Victoria A.; Jokerst, Amanda S.

    2006-12-01

    This one year LDRD addressed the problem of rapid characterization of bacterial spores such as those from the genus Bacillus, the group that contains pathogenic spores such as B. anthracis. In this effort we addressed the feasibility of using a proteomics based approach to spore characterization using a subset of conserved spore proteins known as the small acid soluble proteins or SASPs. We proposed developing techniques that built on our previous expertise in microseparations to rapidly characterize or identify spores. An alternative SASP extraction method was developed that was amenable to both the subsequent fluorescent labeling required for laser-induced fluorescence detection and the low ionic strength requirements for isoelectric focusing. For the microseparations, both capillary isoelectric focusing and chip gel electrophoresis were employed. A variety of methods were evaluated to improve the molecular weight resolution for the SASPs, which are in a molecular weight range that is not well resolved by the current methods. Isoelectric focusing was optimized and employed to resolve the SASPs using UV absorbance detection. Proteomic signatures of native wild type Bacillus spores and clones genetically engineered to produce altered SASP patterns were assessed by slab gel electrophoresis, capillary isoelectric focusing with absorbance detection as well as microchip based gel electrophoresis employing sensitive laser-induced fluorescence detection.

  19. Protein-protein interface analysis and hot spots identification for chemical ligand design.

    Science.gov (United States)

    Chen, Jing; Ma, Xiaomin; Yuan, Yaxia; Pei, Jianfeng; Lai, Luhua

    2014-01-01

    Rational design for chemical compounds targeting protein-protein interactions has grown from a dream to reality after a decade of efforts. There are an increasing number of successful examples, though major challenges remain in the field. In this paper, we will first give a brief review of the available methods that can be used to analyze protein-protein interface and predict hot spots for chemical ligand design. New developments of binding sites detection, ligandability and hot spots prediction from the author's group will also be described. Pocket V.3 is an improved program for identifying hot spots in protein-protein interface using only an apo protein structure. It has been developed based on Pocket V.2 that can derive receptor-based pharmacophore model for ligand binding cavity. Given similarities and differences between the essence of pharmacophore and hot spots for guiding design of chemical compounds, not only energetic but also spatial properties of protein-protein interface are used in Pocket V.3 for dealing with protein-protein interface. In order to illustrate the capability of Pocket V.3, two datasets have been used. One is taken from ASEdb and BID having experimental alanine scanning results for testing hot spots prediction. The other is taken from the 2P2I database containing complex structures of protein-ligand binding at the original protein-protein interface for testing hot spots application in ligand design.

  20. PDTD: a web-accessible protein database for drug target identification

    Directory of Open Access Journals (Sweden)

    Gao Zhenting

    2008-02-01

    Full Text Available Abstract Background: Target identification is important for modern drug discovery. With the advances in the development of molecular docking, potential binding proteins may be discovered by docking a small molecule to a repository of proteins with three-dimensional (3D) structures. To complete this task, a reverse docking program and a drug target database with 3D structures are necessary. To this end, we have developed a web server tool, TarFisDock (Target Fishing Docking; http://www.dddc.ac.cn/tarfisdock), which has been used widely by others. Recently, we have constructed a protein target database, Potential Drug Target Database (PDTD), and have integrated PDTD with TarFisDock. This combination aims to assist target identification and validation. Description: PDTD is a web-accessible protein database for in silico target identification. It currently contains >1100 protein entries with 3D structures presented in the Protein Data Bank. The data are extracted from the literature and several online databases such as TTD, DrugBank and Thomson Pharma. The database covers diverse information of >830 known or potential drug targets, including protein and active sites structures in both PDB and mol2 formats, related diseases, biological functions as well as associated regulating (signaling) pathways. Each target is categorized by both nosology and biochemical function. PDTD supports keyword search function, such as PDB ID, target name, and disease name. Data set generated by PDTD can be viewed with the plug-in of molecular visualization tools and also can be downloaded freely. Remarkably, PDTD is specially designed for target identification. In conjunction with TarFisDock, PDTD can be used to identify binding proteins for small molecules. The results can be downloaded in the form of mol2 file with the binding pose of the probe compound and a list of potential binding targets according to their ranking scores. Conclusion: PDTD serves as a comprehensive and

  1. Identification of bitter peptides in whey protein hydrolysate.

    Science.gov (United States)

    Liu, Xiaowei; Jiang, Deshou; Peterson, Devin G

    2014-06-25

    Bitterness of whey protein hydrolysates (WPH) can negatively affect product quality and limit utilization in food and pharmaceutical applications. Four main bitter peptides were identified in a commercial WPH by means of sensory-guided fractionation techniques that included ultrafiltration and offline two-dimensional reverse phase chromatography. LC-TOF-MS/MS analysis revealed the amino acid sequences of the bitter peptides were YGLF, IPAVF, LLF, and YPFPGPIPN that originated from α-lactalbumin, β-lactoglobulin, serum albumin, and β-casein, respectively. Quantitative LC-MS/MS analysis reported the concentrations of YGLF, IPAVF, LLF, and YPFPGPIPN to be 0.66, 0.58, 1.33, and 2.64 g/kg powder, respectively. Taste recombination analysis of an aqueous model consisting of all four peptides was reported to explain 88% of the bitterness intensity of the 10% WPH solution.

  2. Identification of Protein Palmitoylation Inhibitors from a Scaffold Ranking Library

    Science.gov (United States)

    Hamel, Laura D.; Lenhart, Brian J.; Mitchell, David A.; Santos, Radleigh G.; Giulianotti, Marc A.; Deschenes, Robert J.

    2016-01-01

    The addition of palmitoyl moieties to proteins regulates their membrane targeting, subcellular localization, and stability. Dysregulation of the enzymes that catalyze palmitoyl addition and/or of the substrates of these enzymes has been linked to cancer, cardiovascular, and neurological disorders, implying these enzymes and substrates are valid targets for pharmaceutical intervention. However, current chemical modulators of zDHHC PAT enzymes lack specificity and affinity, underscoring the need for screening campaigns to identify new specific, high affinity modulators. This report describes a mixture-based screening approach to identify inhibitors of Erf2 activity. Erf2 is the Saccharomyces cerevisiae PAT responsible for catalyzing the palmitoylation of Ras2, an ortholog of the human Ras oncogene proteins. A chemical library developed by the Torrey Pines Institute for Molecular Studies consists of more than 30 million compounds designed around 68 molecular scaffolds that are systematically arranged into positional scanning and scaffold ranking formats. We have used this approach to identify and characterize several scaffold backbones and R-groups that reduce or eliminate the activity of Erf2 in vitro. Here, we present the analysis of one of the scaffold backbones, bis-cyclic piperazine. We identified compounds that inhibited Erf2 auto-palmitoylation activity using a fluorescence-based, coupled assay in a high throughput screening (HTS) format and validated the hits utilizing an orthogonal gel-based assay. Finally, we examined the effects of the compounds on cell growth in a yeast cell-based assay. Based on our results, we have identified specific, high affinity palmitoyl transferase inhibitors that will serve as a foundation for future compound design. PMID:27009891

  3. Surface protein composition of Aeromonas hydrophila strains virulent for fish: identification of a surface array protein

    Energy Technology Data Exchange (ETDEWEB)

    Dooley, J.S.G.; Trust, T.J.

    1988-02-01

    The surface protein composition of members of a serogroup of Aeromonas hydrophila was examined. Immunoblotting with antiserum raised against formalinized whole cells of A. hydrophila TF7 showed a 52K S-layer protein to be the major surface protein antigen, and impermeant Sulfo-NHS-Biotin cell surface labeling showed that the 52K S-layer protein was the only protein accessible to the Sulfo-NHS-Biotin label and effectively masked underlying outer membrane (OM) proteins. In its native surface conformation the 52K S-layer protein was only weakly reactive with a lactoperoxidase ¹²⁵I surface iodination procedure. A UV-induced rough lipopolysaccharide (LPS) mutant of TF7 was found to produce an intact S layer, but a deep rough LPS mutant was unable to maintain an array on the cell surface and excreted the S-layer protein into the growth medium, indicating that a minimum LPS oligosaccharide size is required for A. hydrophila S-layer anchoring. The native S layer was permeable to ¹²⁵I in the lactoperoxidase radiolabeling procedure, and two major OM proteins of molecular weights 30,000 and 48,000 were iodinated. The 48K species was a peptidoglycan-associated, transmembrane protein which exhibited heat-modifiable SDS solubilization behavior characteristic of a porin protein. A 50K major peptidoglycan-associated OM protein which was not radiolabeled exhibited similar SDS heat modification characteristics and possibly represents a second porin protein.

  4. Identification of Arsenic Direct-Binding Proteins in Acute Promyelocytic Leukaemia Cells

    Directory of Open Access Journals (Sweden)

    Tao Zhang

    2015-11-01

    Full Text Available The identification of arsenic direct-binding proteins is essential for determining the mechanism by which arsenic trioxide achieves its chemotherapeutic effects. At least two cysteines close together in the amino acid sequence are crucial to the binding of arsenic and essential to the identification of arsenic-binding proteins. In the present study, arsenic binding proteins were pulled down with streptavidin and identified using a liquid chromatograph-mass spectrometer (LC-MS/MS). More than 40 arsenic-binding proteins were separated, and redox-related proteins, glutathione S-transferase P1 (GSTP1), heat shock 70 kDa protein 9 (HSPA9) and pyruvate kinase M2 (PKM2), were further studied using binding assays in vitro. Notably, PKM2 has a high affinity for arsenic. In contrast to PKM2, GSTP1 and HSPA9 did not combine with arsenic directly in vitro. These observations suggest that arsenic-mediated acute promyelocytic leukaemia (APL) suppressive effects involve PKM2. In summary, we identified several arsenic binding proteins in APL cells and investigated the therapeutic mechanisms of arsenic trioxide for APL. Further investigation into specific signal pathways by which PKM2 mediates APL developments may lead to a better understanding of arsenic effects on APL.
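
    The sequence criterion mentioned above (at least two cysteines close together) lends itself to a simple scan. The sketch below lists cysteine pairs that lie within a given number of residues of each other. The abstract does not state how close the cysteines must be, so the `max_gap` value is an assumption, and the toy sequence is hypothetical.

```python
def close_cysteine_pairs(sequence, max_gap=5):
    """Return 0-based index pairs (i, j) of cysteines with j - i <= max_gap.

    max_gap is an illustrative threshold, not a value from the study.
    """
    positions = [i for i, residue in enumerate(sequence.upper()) if residue == "C"]
    pairs = []
    for idx, i in enumerate(positions):
        for j in positions[idx + 1:]:
            if j - i > max_gap:
                break  # positions are sorted, so later ones are farther away
            pairs.append((i, j))
    return pairs

# Hypothetical toy sequence with cysteines at indices 2, 6, and 19.
toy_sequence = "MACDEFCGHIKLMNPQRSTCVWY"
pairs = close_cysteine_pairs(toy_sequence)
print(pairs)
```

    A scan like this only flags candidates; whether a flagged protein actually binds arsenic still has to be established experimentally, as the contrast between PKM2 and GSTP1/HSPA9 in the abstract shows.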

  5. Pooled protein immunization for identification of cell surface antigens in Streptococcus sanguinis.

    Directory of Open Access Journals (Sweden)

    Xiuchun Ge

    Full Text Available BACKGROUND: Available bacterial genomes provide opportunities for screening vaccines by reverse vaccinology. Efficient identification of surface antigens is required to reduce time and animal cost in this technology. We developed an approach to identify surface antigens rapidly in Streptococcus sanguinis, a common infective endocarditis causative species. METHODS AND FINDINGS: We applied bioinformatics for antigen prediction and pooled antigens for immunization. Forty-seven surface-exposed proteins including 28 lipoproteins and 19 cell wall-anchored proteins were chosen based on computer algorithms and comparative genomic analyses. Eight proteins among these candidates and 2 other proteins were pooled together to immunize rabbits. The antiserum reacted strongly with each protein and with S. sanguinis whole cells. Affinity chromatography was used to purify the antibodies to 9 of the antigen pool components. Competitive ELISA and FACS results indicated that these 9 proteins were exposed on S. sanguinis cell surfaces. The purified antibodies had demonstrable opsonic activity. CONCLUSIONS: The results indicate that immunization with pooled proteins, in combination with affinity purification, and comprehensive immunological assays may facilitate cell surface antigen identification to combat infectious diseases.

  6. Use of sequential chemical extractions to purify nuclear membrane proteins for proteomics identification.

    Science.gov (United States)

    Korfali, Nadia; Fairley, Elizabeth A L; Swanson, Selene K; Florens, Laurence; Schirmer, Eric C

    2009-01-01

    The nuclear envelope (NE) is a double membrane system that is both a part of the endoplasmic reticulum and part of the nucleus. As its constituent proteins tend to be highly complexed with nuclear and cytoplasmic components, it is notoriously difficult to purify. Two methods can reduce this difficulty for the identification of nuclear membrane proteins: comparison to contaminating membranes and chemical extractions to enrich for certain groups of proteins. The purification of nuclear envelopes and contaminating microsomal membranes is described here along with procedures for chemical extraction using salt and detergent, chaotropes, or alkaline solutions. Each extraction method enriches for different combinations of nuclear envelope proteins. Finally, we describe the analysis of these fractions with MudPIT, a proteomics methodology that avoids gel extraction of bands to facilitate identification of minor proteins and membrane proteins that do not resolve well on gels. Together these three approaches can significantly increase the output of proteomics studies aimed at identifying the protein complement of subcellular membrane systems.

  7. Efficient identification of critical residues based only on protein structure by network analysis.

    Directory of Open Access Journals (Sweden)

    Michael P Cusack

    Full Text Available Despite the increasing number of published protein structures, and the fact that each protein's function relies on its three-dimensional structure, there is limited access to automatic programs used for the identification of critical residues from the protein structure, compared with those based on protein sequence. Here we present a new algorithm based on network analysis applied exclusively on protein structures to identify critical residues. Our results show that this method identifies critical residues for protein function with high reliability and improves automatic sequence-based approaches and previous network-based approaches. The reliability of the method depends on the conformational diversity screened for the protein of interest. We have designed a web site to give access to this software at http://bis.ifc.unam.mx/jamming/. In summary, a new method is presented that relates critical residues for protein function with the most traversed residues in networks derived from protein structures. A unique feature of the method is the inclusion of the conformational diversity of proteins in the prediction, thus reproducing a basic feature of the structure/function relationship of proteins.
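
    To make the idea of "most traversed residues" concrete, the sketch below builds a residue contact network from coordinates and ranks residues by betweenness centrality (the number of shortest paths passing through each node, computed with Brandes' algorithm). Betweenness is used here as a plausible stand-in; the abstract does not specify the exact traversal measure, and the distance cutoff, function names, and toy coordinates are assumptions.

```python
import math
from collections import deque

def contact_graph(coords, cutoff=4.0):
    """Link residues whose representative atoms lie within `cutoff` (assumed, in Å)."""
    adj = {i: [] for i in range(len(coords))}
    for i in range(len(coords)):
        for j in range(i + 1, len(coords)):
            if math.dist(coords[i], coords[j]) <= cutoff:
                adj[i].append(j)
                adj[j].append(i)
    return adj

def betweenness(adj):
    """Brandes' algorithm for an unweighted, undirected graph."""
    score = {v: 0.0 for v in adj}
    for source in adj:
        order = []
        preds = {v: [] for v in adj}
        n_paths = {v: 0 for v in adj}
        dist = {v: -1 for v in adj}
        n_paths[source] = 1
        dist[source] = 0
        queue = deque([source])
        while queue:  # breadth-first search from the source
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    n_paths[w] += n_paths[v]
                    preds[w].append(v)
        dependency = {v: 0.0 for v in adj}
        while order:  # accumulate dependencies from the farthest node back
            w = order.pop()
            for v in preds[w]:
                dependency[v] += n_paths[v] / n_paths[w] * (1 + dependency[w])
            if w != source:
                score[w] += dependency[w]
    # Each unordered pair was counted from both endpoints.
    return {v: s / 2 for v, s in score.items()}

# Hypothetical toy chain: five residues spaced 3.8 Å apart along a line.
coords = [(3.8 * i, 0.0, 0.0) for i in range(5)]
scores = betweenness(contact_graph(coords))
ranking = sorted(scores, key=scores.get, reverse=True)
print(ranking[0], scores)
```

    For real structures the same ranking could be repeated over several conformations of the protein, in the spirit of the conformational diversity the abstract emphasizes.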

  8. Identification of Novel O-Linked Glycosylated Toxoplasma Proteins by Vicia villosa Lectin Chromatography.

    Science.gov (United States)

    Wang, Kevin; Peng, Eric D; Huang, Amy S; Xia, Dong; Vermont, Sarah J; Lentini, Gaelle; Lebrun, Maryse; Wastling, Jonathan M; Bradley, Peter J

    2016-01-01

    Toxoplasma gondii maintains its intracellular life cycle using an extraordinary arsenal of parasite-specific organelles including the inner membrane complex (IMC), rhoptries, micronemes, and dense granules. While these unique compartments play critical roles in pathogenesis, many of their protein constituents have yet to be identified. We exploited the Vicia villosa lectin (VVL) to identify new glycosylated proteins that are present in these organelles. Purification of VVL-binding proteins by lectin affinity chromatography yielded a number of novel proteins that were subjected to further study, resulting in the identification of proteins from the dense granules, micronemes, rhoptries and IMC. We then chose to focus on three proteins identified by this approach, the SAG1 repeat containing protein SRS44, the rhoptry neck protein RON11 as well as a novel IMC protein we named IMC25. To assess function, we disrupted their genes by homologous recombination or CRISPR/Cas9. The knockouts were all successful, demonstrating that these proteins are not essential for invasion or intracellular survival. We also show that IMC25 undergoes substantial proteolytic processing that separates the C-terminal domain from the predicted glycosylation site. Together, we have demonstrated that lectin affinity chromatography is an efficient method of identifying new glycosylated parasite-specific proteins.

  9. Identification of Novel O-Linked Glycosylated Toxoplasma Proteins by Vicia villosa Lectin Chromatography.

    Directory of Open Access Journals (Sweden)

    Kevin Wang

    Full Text Available Toxoplasma gondii maintains its intracellular life cycle using an extraordinary arsenal of parasite-specific organelles including the inner membrane complex (IMC), rhoptries, micronemes, and dense granules. While these unique compartments play critical roles in pathogenesis, many of their protein constituents have yet to be identified. We exploited the Vicia villosa lectin (VVL) to identify new glycosylated proteins that are present in these organelles. Purification of VVL-binding proteins by lectin affinity chromatography yielded a number of novel proteins that were subjected to further study, resulting in the identification of proteins from the dense granules, micronemes, rhoptries and IMC. We then chose to focus on three proteins identified by this approach, the SAG1 repeat containing protein SRS44, the rhoptry neck protein RON11 as well as a novel IMC protein we named IMC25. To assess function, we disrupted their genes by homologous recombination or CRISPR/Cas9. The knockouts were all successful, demonstrating that these proteins are not essential for invasion or intracellular survival. We also show that IMC25 undergoes substantial proteolytic processing that separates the C-terminal domain from the predicted glycosylation site. Together, we have demonstrated that lectin affinity chromatography is an efficient method of identifying new glycosylated parasite-specific proteins.

  10. Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

    Directory of Open Access Journals (Sweden)

    Li Weizhong

    2008-04-01

    Full Text Available Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA (http://camera.calit2.net). Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.

  11. PredPPCrys: accurate prediction of sequence cloning, protein production, purification and crystallization propensity from protein sequences using multi-step heterogeneous feature fusion and selection.

    Directory of Open Access Journals (Sweden)

    Huilin Wang

    Full Text Available X-ray crystallography is the primary approach to solve the three-dimensional structure of a protein. However, a major bottleneck of this method is the failure of multi-step experimental procedures to yield diffraction-quality crystals, including sequence cloning, protein material production, purification, crystallization and ultimately, structural determination. Accordingly, prediction of the propensity of a protein to successfully undergo these experimental procedures based on the protein sequence may help narrow down laborious experimental efforts and facilitate target selection. A number of bioinformatics methods based on protein sequence information have been developed for this purpose. However, our knowledge on the important determinants of propensity for a protein sequence to produce high diffraction-quality crystals remains largely incomplete. In practice, most of the existing methods display poorer performance when evaluated on larger and updated datasets. To address this problem, we constructed an up-to-date dataset as the benchmark, and subsequently developed a new approach termed 'PredPPCrys' using the support vector machine (SVM). Using a comprehensive set of multifaceted sequence-derived features in combination with a novel multi-step feature selection strategy, we identified and characterized the relative importance and contribution of each feature type to the prediction performance of five individual experimental steps required for successful crystallization. The resulting optimal candidate features were used as inputs to build the first-level SVM predictor (PredPPCrys I). Next, prediction outputs of PredPPCrys I were used as the input to build second-level SVM classifiers (PredPPCrys II), which led to significantly enhanced prediction performance. Benchmarking experiments indicated that our PredPPCrys method outperforms most existing procedures on both up-to-date and previous datasets. In addition, the predicted crystallization

  12. Identification of host proteins associated with HIV-1 preintegration complexes isolated from infected CD4+ cells.

    Science.gov (United States)

    Raghavendra, Nidhanapati K; Shkriabai, Nikolozi; Graham, Robert Lj; Hess, Sonja; Kvaratskhelia, Mamuka; Wu, Li

    2010-08-11

    An integrated HIV-1 genomic DNA leads to an infected cell becoming either an active or a latent virus-producing cell. Upon appropriate activation, a latently infected cell can result in production of progeny viruses that spread the infection to uninfected cells. The host proteins influence several steps of HIV-1 infection including formation of the preintegration complex (PIC), a key nucleoprotein intermediate essential for integration of reverse transcribed viral DNA into the chromosome. Much effort has gone into the identification of host proteins contributing to the assembly of functional PICs. Experimental approaches included the use of yeast two-hybrid system, co-immunoprecipitation, affinity tagged HIV-1 viral proteins and in vitro reconstitution of salt-stripped PIC activity. Several host proteins identified using these approaches have been shown to affect HIV-1 replication in cells and influence catalytic activities of recombinant IN in vitro. However, the comprehensive identification and characterization of host proteins associated with HIV-1 PICs of infected cells have been hindered in part by the technical limitation in acquiring sufficient amount of catalytically active PICs. To efficiently identify additional host factors associated with PICs in infected cells, we have developed the following novel approach. The catalytically active PICs from HIV-1-infected CD4+ cells were isolated using biotinylated target DNA, and the proteins selectively co-purifying with PICs have been analyzed by mass spectrometry. This technology enabled us to reveal at least 19 host proteins that are associated with HIV-1 PICs, of which 18 proteins have not been described previously with respect to HIV-1 integration. Physiological functions of the identified proteins range from chromatin organization to protein transport. A detailed characterization of these host proteins could provide new insights into the mechanism of HIV-1 integration and uncover new antiviral targets to

  13. Immunoproteomic analysis of Brucella melitensis and identification of a new immunogenic candidate protein for the development of brucellosis subunit vaccine.

    Science.gov (United States)

    Yang, Yanling; Wang, Lin; Yin, Jigang; Wang, Xinglong; Cheng, Shipeng; Lang, Xulong; Wang, Xiuran; Qu, Hailong; Sun, Chunhui; Wang, Jinglong; Zhang, Rui

    2011-10-01

    In order to screen immunogenic candidate antigens for the development of a brucellosis subunit vaccine, an immunoproteomic assay was used to identify immunogenic proteins from Brucella melitensis 16 M soluble proteins. In this study, a total of 56 immunodominant proteins were identified from the two-dimensional electrophoresis immunoblot profiles by liquid chromatography tandem mass spectrometry (LC-MS/MS). Two proteins of interest, riboflavin synthase alpha chain (RS-α) and Loraine synthase (LS-2), which are both involved in riboflavin synthesis, were detected by two-dimensional immunoblots using antisera obtained from Brucella-infected humans and goats. LS-2, however, is an already well-known vaccine candidate. Therefore, we focussed our studies on the novel vaccine candidate RS-α. B. melitensis RS-α and LS-2 were then expressed in Escherichia coli as fusion proteins with a His tag. The humoral and cellular immune responses to the recombinant (r)RS-α were characterized. In response to in vitro stimulation by rRS-α, splenocytes from mice vaccinated with rRS-α were able to produce γ-interferon (IFN-γ) and interleukin (IL)-2 but not interleukin (IL)-4 and interleukin (IL)-10. Furthermore, rRS-α or rLS-2-vaccinated mice were partially protected against B. melitensis infection. Our results suggested that we have developed a high-throughput, accurate, rapid and highly efficient method for the identification of candidate antigens by a combination of immunoproteomics with immunisation and bacterial challenge, and that rRS-α could be a useful candidate for the development of subunit vaccines against B. melitensis.

  14. A novel spectral library workflow to enhance protein identifications.

    Science.gov (United States)

    Li, Haomin; Zong, Nobel C; Liang, Xiangbo; Kim, Allen K; Choi, Jeong Ho; Deng, Ning; Zelaya, Ivette; Lam, Maggie; Duan, Huilong; Ping, Peipei

    2013-04-09

    The innovations in mass spectrometry-based investigations in proteome biology enable systematic characterization of molecular details in pathophysiological phenotypes. However, the process of delineating large-scale raw proteomic datasets into a biological context requires high-throughput data acquisition and processing. A spectral library search engine makes use of previously annotated experimental spectra as references for subsequent spectral analyses. This workflow delivers many advantages, including elevated analytical efficiency and specificity as well as reduced demands in computational capacity. In this study, we created a spectral matching engine to address challenges commonly associated with a library search workflow. In particular, we introduce an improved sliding dot product algorithm that is robust to systematic drifts of mass measurement in spectra. Furthermore, a noise management protocol distinguishes spectral correlation attributable to noise from correlation attributable to peptide fragments. It enables elevated separation between target spectral matches and false matches, thereby suppressing the possibility of propagating inaccurate peptide annotations from library spectra to query spectra. Moreover, preservation of original spectra also accommodates user contributions to further enhance the quality of the library. Collectively, this search engine supports reproducible data analyses using curated references, thereby broadening the accessibility of proteomics resources to biomedical investigators. This article is part of a Special Issue entitled: From protein structures to clinical applications.
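
    The idea of a sliding dot product can be illustrated with a minimal sketch. This is not the authors' improved algorithm, only the basic technique under stated assumptions: spectra are binned at a fixed width, each vector is scaled to unit length, and the query is compared with the library spectrum at several integer bin offsets so that a small systematic mass drift does not destroy the match. The function names, `bin_width`, `max_mz`, and `max_shift` are hypothetical choices made for illustration.

```python
import math


def bin_spectrum(peaks, bin_width=1.0, max_mz=2000.0):
    """Turn (m/z, intensity) pairs into a unit-length intensity vector.

    bin_width and max_mz are illustrative assumptions, not values from the abstract.
    """
    n_bins = int(max_mz / bin_width) + 1
    vec = [0.0] * n_bins
    for mz, intensity in peaks:
        idx = int(round(mz / bin_width))
        if 0 <= idx < n_bins:
            vec[idx] += intensity
    norm = math.sqrt(sum(v * v for v in vec))
    return [v / norm for v in vec] if norm > 0 else vec


def sliding_dot_product(query, library, max_shift=2):
    """Return (best_score, best_shift) over integer bin shifts.

    A shift s pairs query bin i with library bin i + s, so a query whose
    masses read one bin too high matches best at s = -1.
    """
    best_score, best_shift = -1.0, 0
    n = len(query)
    for shift in range(-max_shift, max_shift + 1):
        score = 0.0
        for i in range(n):
            j = i + shift
            if 0 <= j < n:
                score += query[i] * library[j]
        if score > best_score:
            best_score, best_shift = score, shift
    return best_score, best_shift


# Usage: the query is the library spectrum with every peak drifted by +1 m/z unit.
library_vec = bin_spectrum([(100.0, 10.0), (200.0, 5.0), (300.0, 1.0)])
query_vec = bin_spectrum([(101.0, 10.0), (201.0, 5.0), (301.0, 1.0)])
score, shift = sliding_dot_product(query_vec, library_vec)
```

    A plain dot product at zero shift would score these two spectra as unrelated; scanning a small window of shifts recovers the match and reports the offset at which it occurs.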

  15. MASCOT HTML and XML parser: an implementation of a novel object model for protein identification data.

    Science.gov (United States)

    Yang, Chunguang G; Granite, Stephen J; Van Eyk, Jennifer E; Winslow, Raimond L

    2006-11-01

    Protein identification using MS is an important technique in proteomics as well as a major generator of proteomics data. We have designed the protein identification data object model (PDOM) and developed a parser based on this model to facilitate the analysis and storage of these data. The parser works with HTML or XML files saved or exported from MASCOT MS/MS ions search in peptide summary report or MASCOT PMF search in protein summary report. The program creates PDOM objects, eliminates redundancy in the input file, and has the capability to output any PDOM object to a relational database. This program facilitates additional analysis of MASCOT search results and aids the storage of protein identification information. The implementation is extensible and can serve as a template to develop parsers for other search engines. The parser can be used as a stand-alone application or can be driven by other Java programs. It is currently being used as the front end for a system that loads HTML and XML result files of MASCOT searches into a relational database. The source code is freely available at http://www.ccbm.jhu.edu and the program uses only free and open-source Java libraries.

  16. Metal affinity enrichment increases the range and depth of proteome identification for extracellular microbial proteins

    Energy Technology Data Exchange (ETDEWEB)

    Wheeler, Korin [Lawrence Livermore National Laboratory (LLNL)]; Erickson, Brian K [ORNL]; Mueller, Ryan [University of California, Berkeley]; Singer, Steven [Lawrence Livermore National Laboratory (LLNL)]; Verberkmoes, Nathan C [ORNL]; Hwang, Mona [Lawrence Livermore National Laboratory (LLNL)]; Thelen, Michael P. [University of California, Berkeley]; Hettich, Robert {Bob} L [ORNL]

    2012-01-01

    Many key proteins, such as those involved in cellular signaling or transcription, are difficult to measure in microbial proteomic experiments due to the interfering presence of more abundant, dominant proteins. In an effort to enhance the identification of previously undetected proteins, as well as provide a methodology for selective enrichment, we evaluated and optimized immobilized metal affinity chromatography (IMAC) coupled with mass spectrometric characterization of extracellular proteins from an extremophilic microbial community. Seven different metals were tested for IMAC enrichment. The combined results added 20% greater proteomic depth to the extracellular proteome. Although this IMAC enrichment could not be conducted at the physiological pH of the environmental system, this approach did yield a reproducible and specific enrichment of groups of proteins with functions potentially vital to the community, thereby providing a more extensive biochemical characterization. Notably, 40 unknown proteins previously annotated as hypothetical were enriched and identified for the first time. Examples of identified proteins include a predicted TonB signal sensing protein homologous to other known TonB proteins and a protein with a COXG domain previously identified in many chemolithoautotrophic microbes as having a function in the oxidation of CO.

  17. An approach to large scale identification of non-obvious structural similarities between proteins

    Directory of Open Access Journals (Sweden)

    Cherkasov Artem

    2004-05-01

    Full Text Available Abstract Background A new sequence independent bioinformatics approach allowing genome-wide search for proteins with similar three dimensional structures has been developed. By utilizing the numerical output of the sequence threading it establishes putative non-obvious structural similarities between proteins. When applied to the testing set of proteins with known three dimensional structures the developed approach was able to recognize structurally similar proteins with high accuracy. Results The method has been developed to identify pathogenic proteins with low sequence identity and high structural similarity to host analogues. Such protein structure relationships would be hypothesized to arise through convergent evolution or through ancient horizontal gene transfer events, now undetectable using current sequence alignment techniques. The pathogen proteins, which could mimic or interfere with host activities, would represent candidate virulence factors. The developed approach utilizes the numerical outputs from the sequence-structure threading. It identifies the potential structural similarity between a pair of proteins by correlating the threading scores of the corresponding two primary sequences against the library of the standard folds. This approach allowed up to 64% sensitivity and 99.9% specificity in distinguishing protein pairs with high structural similarity. Conclusion Preliminary results obtained by comparison of the genomes of Homo sapiens and several strains of Chlamydia trachomatis have demonstrated the potential usefulness of the method in the identification of bacterial proteins with known or potential roles in virulence.
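
    The core step described above, correlating the threading scores of two sequences against a library of standard folds, can be sketched as follows. This is a minimal illustration and not the published implementation: it assumes each protein is already represented by a list of numerical threading scores in the same fold order, uses the Pearson coefficient as the correlation measure, and applies a hypothetical threshold of 0.8 to flag candidate pairs. The names `pearson`, `similar_pairs`, and the example profiles are invented for illustration.

```python
import math
from itertools import combinations


def pearson(x, y):
    """Pearson correlation between two equal-length score vectors."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("score vectors must have equal length >= 2")
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant profile carries no fold preference
    return cov / math.sqrt(var_x * var_y)


def similar_pairs(profiles, threshold=0.8):
    """List (name_a, name_b, r) for protein pairs whose profiles correlate.

    profiles maps a protein name to its threading scores against the same
    ordered fold library. The threshold is an assumed, illustrative value.
    """
    hits = []
    for a, b in combinations(sorted(profiles), 2):
        r = pearson(profiles[a], profiles[b])
        if r >= threshold:
            hits.append((a, b, r))
    return hits


# Usage with made-up scores against a four-fold library.
profiles = {
    "host": [1.0, 2.0, 3.0, 4.0],
    "pathogen": [2.0, 4.0, 6.0, 8.0],
    "other": [4.0, 3.0, 2.0, 1.0],
}
hits = similar_pairs(profiles)
```

    Because the comparison uses only the score profiles, two proteins can be flagged as structurally similar even when their primary sequences share little identity, which is the property the abstract relies on.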

  18. Development of BIATECH-54 standard mixtures for assessment of protein identification and relative expression.

    Science.gov (United States)

    Kolker, Eugene; Hogan, Jason M; Higdon, Roger; Kolker, Natali; Landorf, Elizabeth; Yakunin, Alexander F; Collart, Frank R; van Belle, Gerald

    2007-10-01

    Mixtures of known proteins have been very useful in the assessment and validation of methods for high-throughput (HTP) MS (MS/MS) proteomics experiments. However, these test mixtures have generally consisted of few proteins at near equal concentration or of a single protein at varied concentrations. Such mixtures are too simple to effectively assess the validity of error rates for protein identification and differential expression in HTP MS/MS studies. This work aimed at overcoming these limitations and simulating studies of complex biological samples. We introduced a pair of 54-protein standard mixtures of variable concentrations with up to a 1000-fold dynamic range in concentration and up to ten-fold expression ratios with additional negative controls (infinite expression ratios). These test mixtures comprised 16 off-the-shelf Sigma-Aldrich proteins and 38 Shewanella oneidensis proteins produced in-house. The standard proteins were systematically distributed into three main concentration groups (high, medium, and low) and then the concentrations were varied differently for each mixture within the groups to generate different expression ratios. The mixtures were analyzed with both low mass accuracy LCQ and high mass accuracy FT-LTQ instruments. In addition, these 54 standard proteins closely follow the molecular weight distributions of both bacterial and human proteomes. As a result, these new standard mixtures allow for a much more realistic assessment of approaches for protein identification and label-free differential expression than previous mixtures. Finally, methodology and experimental design developed in this work can be readily applied in future to development of more complex standard mixtures for HTP proteomics studies.

  19. Hexapeptide libraries for enhanced protein PTM identification and relative abundance profiling in whole human saliva

    Science.gov (United States)

    Bandhakavi, Sricharan; Van Riper, Susan K; Tawfik, Pierre N; Stone, Matthew D; Haddad, Tufia; Rhodus, Nelson L.; Carlis, John V.; Griffin, Timothy J

    2011-01-01

    SUMMARY Dynamic range compression (DRC) by hexapeptide libraries increases MS/MS-based identification of lower-abundance proteins in complex mixtures. However, two unanswered questions impede fully realizing DRC’s potential in shotgun proteomics. First, does DRC enhance identification of post-translationally modified proteins? Second, can DRC be incorporated into a workflow enabling relative protein abundance profiling? We sought to answer both questions analyzing human whole saliva. Addressing question one, we coupled DRC with covalent glycopeptide enrichment and MS/MS. With DRC we identified ~2 times more N-linked glycoproteins and their glycosylation sites than without DRC, dramatically increasing the known salivary glycoprotein catalog. Addressing question two, we compared differentially stable isotope-labeled saliva samples pooled from healthy and metastatic breast cancer women using a multidimensional peptide fractionation-based workflow, analyzing in parallel one sample portion with DRC and one portion without. Our workflow categorizes proteins with higher absolute abundance, whose relative abundance ratios are altered by DRC, from proteins of lower absolute abundance detected only after DRC. Within each of these salivary protein categories we identified novel abundance changes putatively associated with breast cancer, demonstrating feasibility and benefits of DRC for relative abundance profiling. Collectively, our results bring us closer to realizing the full potential of DRC for proteomic studies. PMID:21142092

  20. Identification and characterization of proteins involved in nuclear organization using Drosophila GFP protein trap lines.

    Directory of Open Access Journals (Sweden)

    Margaret Rohrbaugh

    Full Text Available BACKGROUND: Strains from a collection of Drosophila GFP protein trap lines express GFP in the normal tissues where the endogenous protein is present. This collection can be used to screen for proteins distributed in the nucleus in a non-uniform pattern. METHODOLOGY/PRINCIPAL FINDINGS: We analyzed four lines that show peripheral or punctate nuclear staining. One of these lines affects an uncharacterized gene named CG11138. The CG11138 protein shows a punctate distribution in the nuclear periphery similar to that of Drosophila insulator proteins but does not co-localize with known insulators. Interestingly, mutations in Lamin proteins result in alterations in CG11138 localization, suggesting that this protein may be a novel component of the nuclear lamina. A second line affects the Decondensation factor 31 (Df31) gene, which encodes a protein with a unique nuclear distribution that appears to segment the nucleus into four different compartments. The X-chromosome of males is confined to one of these compartments. We also find that Drosophila Nucleoplasmin (dNlp) is present in regions of active transcription. Heat shock leads to loss of dNlp from previously transcribed regions of polytene chromosome without redistribution to the heat shock genes. Analysis of Stonewall (Stwl), a protein previously found to be necessary for the maintenance of germline stem cells, shows that Stwl is present in a punctate pattern in the nucleus that partially overlaps with that of known insulator proteins. Finally we show that Stwl, dNlp, and Df31 form part of a highly interactive network. The properties of other components of this network may help understand the role of these proteins in nuclear biology. CONCLUSIONS/SIGNIFICANCE: These results establish screening of GFP protein trap alleles as a strategy to identify factors with novel cellular functions. Information gained from the analysis of CG11138, Stwl, dNlp, and Df31 sets the stage for future studies of these

  1. Accurate recapture identification for genetic mark-recapture studies with error-tolerant likelihood-based match calling and sample clustering.

    Science.gov (United States)

    Sethi, Suresh A; Linden, Daniel; Wenburg, John; Lewis, Cara; Lemons, Patrick; Fuller, Angela; Hare, Matthew P

    2016-12-01

    Error-tolerant likelihood-based match calling presents a promising technique to accurately identify recapture events in genetic mark-recapture studies by combining probabilities of latent genotypes and probabilities of observed genotypes, which may contain genotyping errors. Combined with clustering algorithms to group samples into sets of recaptures based upon pairwise match calls, these tools can be used to reconstruct accurate capture histories for mark-recapture modelling. Here, we assess the performance of a recently introduced error-tolerant likelihood-based match-calling model and sample clustering algorithm for genetic mark-recapture studies. We assessed both biallelic (i.e. single nucleotide polymorphisms; SNP) and multiallelic (i.e. microsatellite; MSAT) markers using a combination of simulation analyses and case study data on Pacific walrus (Odobenus rosmarus divergens) and fishers (Pekania pennanti). A novel two-stage clustering approach is demonstrated for genetic mark-recapture applications. First, repeat captures within a sampling occasion are identified. Subsequently, recaptures across sampling occasions are identified. The likelihood-based matching protocol performed well in simulation trials, demonstrating utility for use in a wide range of genetic mark-recapture studies. Moderately sized SNP (64+) and MSAT (10-15) panels produced accurate match calls for recaptures and accurate non-match calls for samples from closely related individuals in the face of low to moderate genotyping error. Furthermore, matching performance remained stable or increased as the number of genetic markers increased, genotyping error notwithstanding.

  2. Identification of membrane proteins by tandem mass spectrometry of protein ions.

    Science.gov (United States)

    Carroll, Joe; Altman, Matthew C; Fearnley, Ian M; Walker, John E

    2007-09-04

    The most common way of identifying proteins in proteomic analyses is to use short segments of sequence ("tags") determined by mass spectrometric analysis of proteolytic fragments. The approach is effective with globular proteins and with membrane proteins with significant polar segments between membrane-spanning alpha-helices, but it is ineffective with other hydrophobic proteins where protease cleavage sites are either infrequent or absent. By developing methods to purify hydrophobic proteins in organic solvents and by fragmenting ions of these proteins by collision induced dissociation with argon, we have shown that partial sequences of many membrane proteins can be deduced easily by manual inspection. The spectra from small proteolipids (1-4 transmembrane alpha-helices) are dominated usually by fragment ions arising from internal amide cleavages, from which internal sequences can be obtained, whereas the spectra from larger membrane proteins (5-18 transmembrane alpha-helices) often contain fragment ions from N- and/or C-terminal parts yielding sequences in those regions. With these techniques, we have, for example, identified an abundant protein of unknown function from inner membranes of mitochondria that to our knowledge has escaped detection in proteomic studies, and we have produced sequences from 10 of 13 proteins encoded in mitochondrial DNA. They include the ND6 subunit of complex I, the last of its 45 subunits to be analyzed. The procedures have the potential to be developed further, for example by using newly introduced methods for protein ion dissociation to induce fragmentation of internal regions of large membrane proteins, which may remain partially folded in the gas phase.

  3. Normalization with genes encoding ribosomal proteins but not GAPDH provides an accurate quantification of gene expressions in neuronal differentiation of PC12 cells

    Directory of Open Access Journals (Sweden)

    Lim Qing-En

    2010-01-01

    Full Text Available Abstract Background Gene regulation at transcript level can provide a good indication of the complex signaling mechanisms underlying physiological and pathological processes. Transcriptomic methods such as microarray and quantitative real-time PCR require stable reference genes for accurate normalization of gene expression. Some but not all studies have shown that housekeeping genes (HKGs), β-actin (ACTB) and glyceraldehyde-3-phosphate dehydrogenase (GAPDH), which are routinely used for normalization, may vary significantly depending on the cell/tissue type and experimental conditions. It is currently unclear if these genes are stably expressed in cells undergoing drastic morphological changes during neuronal differentiation. Recent meta-analysis of microarray datasets showed that some but not all of the ribosomal protein genes are stably expressed. To test the hypothesis that some ribosomal protein genes can serve as reference genes for neuronal differentiation, a genome-wide analysis was performed and putative reference genes were identified based on stability of expressions. The stabilities of these potential reference genes were then analyzed by reverse transcription quantitative real-time PCR in six differentiation conditions. Results Twenty stably expressed genes, including thirteen ribosomal protein genes, were selected from microarray analysis of the gene expression profiles of GDNF and NGF induced differentiation of PC12 cells. The expression levels of these candidate genes as well as ACTB and GAPDH were further analyzed by reverse transcription quantitative real-time PCR in PC12 cells differentiated with a variety of stimuli including NGF, GDNF, Forskolin, KCl and ROCK inhibitor, Y27632. The performances of these candidate genes as stable reference genes were evaluated with two independent statistical approaches, geNorm and NormFinder. Conclusions The ribosomal protein genes, RPL19 and RPL29, were identified as suitable reference genes

  4. Protein feature based identification of cell cycle regulated proteins in yeast

    DEFF Research Database (Denmark)

    de Lichtenberg, Ulrik; Jensen, Thomas Skøt; Jensen, Lars Juhl;

    2003-01-01

    DNA microarrays have been used extensively to identify cell cycle regulated genes in yeast; however, the overlap in the genes identified is surprisingly small. We show that certain protein features can be used to distinguish cell cycle regulated genes from other genes with high confidence (features include protein phosphorylation, glycosylation, subcellular location and instability/degradation). We demonstrate that co-expressed, periodic genes encode proteins which share combinations of features, and provide an overview of the proteome dynamics during the cycle. A large set of novel putative cell cycle regulated proteins were identified, many of which have no known function.

  5. Identification of Immunodominant B-cell Epitope Regions of Reticulocyte Binding Proteins in Plasmodium vivax by Protein Microarray Based Immunoscreening.

    Science.gov (United States)

    Han, Jin-Hee; Li, Jian; Wang, Bo; Lee, Seong-Kyun; Nyunt, Myat Htut; Na, Sunghun; Park, Jeong-Hyun; Han, Eun-Taek

    2015-08-01

    Plasmodium falciparum can invade all stages of red blood cells, while Plasmodium vivax can invade only reticulocytes. Although many P. vivax proteins have been discovered, their functions are largely unknown. Among them, P. vivax reticulocyte binding proteins (PvRBP1 and PvRBP2) recognize and bind to reticulocytes. Both proteins possess a C-terminal hydrophobic transmembrane domain, which drives adhesion to reticulocytes. PvRBP1 and PvRBP2 are large (> 326 kDa), which hinders identification of the functional domains. In this study, the complete genome information of the P. vivax RBP family was thoroughly analyzed using a prediction server with bioinformatics data to predict B-cell epitope domains. Eleven pvrbp family genes that included 2 pseudogenes and 9 full or partial length genes were selected and used to express recombinant proteins in a wheat germ cell-free system. The expressed proteins were used to evaluate the humoral immune response in serum samples from vivax malaria patients and healthy individuals by protein microarray. The recombinant fragments of 9 PvRBP proteins were successfully expressed; the soluble proteins ranged in molecular weight from 16 to 34 kDa. Evaluation of the humoral immune response to each recombinant PvRBP protein indicated a high antigenicity, with 38-88% sensitivity and 100% specificity. Of them, N-terminal parts of PvRBP2c (PVX_090325-1) and PvRBP2 like partial A (PVX_090330-1) elicited high antigenicity. In addition, the PvRBP2-like homologue B (PVX_116930) fragment was newly identified as highly antigenic and may be exploited as a potential antigenic candidate among the PvRBP family. The functional activity of the PvRBP family on merozoite invasion remains unknown.

  6. Identification of Differentially Abundant Proteins of Edwardsiella ictaluri during Iron Restriction.

    Directory of Open Access Journals (Sweden)

    Pradeep R Dumpala

    Full Text Available Edwardsiella ictaluri is a Gram-negative facultative anaerobe intracellular bacterium that causes enteric septicemia in channel catfish. Iron is an essential inorganic nutrient of bacteria and is crucial for bacterial invasion. Reduced availability of iron by the host may cause significant stress for bacterial pathogens and is considered a signal that leads to significant alteration in virulence gene expression. However, the precise effect of iron-restriction on E. ictaluri protein abundance is unknown. The purpose of this study was to identify differentially abundant proteins of E. ictaluri during in vitro iron-restricted conditions. We applied two-dimensional difference in gel electrophoresis (2D-DIGE) for determining differentially abundant proteins and matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI TOF/TOF MS) for protein identification. Gene ontology and pathway-based functional modeling of differentially abundant proteins was also conducted. A total of 50 unique differentially abundant proteins at a minimum of 2-fold (p ≤ 0.05) difference in abundance due to iron-restriction were detected. The numbers of up- and down-regulated proteins were 37 and 13, respectively. We noted several proteins, including EsrB, LamB, MalM, MalE, FdaA, and TonB-dependent heme/hemoglobin receptor family proteins responded to iron restriction in E. ictaluri.

  7. The Effect of Edge Definition of Complex Networks on Protein Structure Identification

    Directory of Open Access Journals (Sweden)

    Jing Sun

    2013-01-01

    The main objective of this study is to explore how complex networks, with different definitions of vertexes and edges, can describe the structure of proteins. A protein folds into a specific conformation for its function depending on interactions between residues. Consequently, in many studies, a protein structure was treated as a complex system in which the individual components (vertexes) were residues and the edges were interactions between residues. What is the proper time for representing a protein structure as a network? To confirm the effect of different definitions of vertexes and edges in constructing the amino acid interaction networks, protein domains and the structural unit of proteins were described using this method. The identification performance of 2847 proteins with domain/domains proved that the structure of proteins was described well when the distance cutoff was around 5.0–7.5 Å, and the optimal cutoff value for constructing the protein structure networks was 5.0 Å (applied to inter-residue distances), while the ideal community division method was community structure detection based on edge betweenness in this study.
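
    The cutoff-based network construction described in this record can be sketched as follows. This is a minimal illustration, not the authors' implementation: the function name `contact_network`, the use of one 3D point per residue, and the toy coordinates are all assumptions made for the example. The abstract does not state which atoms the distances are measured between.

```python
import math
from itertools import combinations

def contact_network(coords, cutoff=5.0):
    """Build an undirected residue network.

    coords: one (x, y, z) point per residue, in angstroms (assumed representation).
    Returns the set of index pairs (i, j), i < j, whose distance is <= cutoff.
    """
    edges = set()
    for (i, a), (j, b) in combinations(enumerate(coords), 2):
        if math.dist(a, b) <= cutoff:
            edges.add((i, j))
    return edges

# Hypothetical toy chain: four residues on a line, 3.7 angstroms apart.
toy = [(0.0, 0.0, 0.0), (3.7, 0.0, 0.0), (7.4, 0.0, 0.0), (11.1, 0.0, 0.0)]

edges_tight = contact_network(toy, cutoff=5.0)   # only sequential neighbours
edges_loose = contact_network(toy, cutoff=7.5)   # also residues two steps apart
```

    Raising the cutoff from 5.0 Å to 7.5 Å adds edges between residues two positions apart in this toy chain, which shows how the edge definition changes the network that any later community detection would operate on.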

  8. Identification of phosphorylation sites in the nucleocapsid protein (N protein) of SARS-coronavirus

    Science.gov (United States)

    Lin, Liang; Shao, Jianmin; Sun, Maomao; Liu, Jinxiu; Xu, Gongjin; Zhang, Xumin; Xu, Ningzhi; Wang, Rong; Liu, Siqi

    2007-12-01

    After decoding the genome of SARS-coronavirus (SARS-CoV), the next challenge is to understand how this virus causes the illness at the molecular level. Of the viral structural proteins, the N protein plays a pivotal role in the assembly process of viral particles as well as viral replication and transcription. The SARS-CoV N proteins expressed in eukaryotes, such as yeast and HEK293 cells, appeared as multiple spots on two-dimensional electrophoresis (2DE), whereas the proteins expressed in E. coli showed a single 2DE spot. These 2DE spots were further examined by Western blot and MALDI-TOF/TOF MS, and identified as the N proteins with different apparent pI values and a similar molecular mass of 50 kDa. In the light of these observations and other evidence, a hypothesis was postulated that the SARS-CoV N protein could be phosphorylated in eukaryotes. To locate the plausible regions of phosphorylation in the N protein, two truncated N proteins were generated in E. coli and treated with PKC[alpha]. The two truncated N proteins after incubation with PKC[alpha] exhibited different electrophoretic behaviors on 2DE, suggesting that the region of 1-256 aa in the N protein was the possible target for PKC[alpha] phosphorylation. Moreover, the SARS-CoV N protein expressed in yeast was partially digested with trypsin and carefully analyzed by MALDI-TOF/TOF MS. In contrast to the complete tryptic digestion, these partially digested fragments generated two new peptide mass signals with neutral loss, and MS/MS analysis revealed two phosphorylated peptides located at the "dense serine" island in the N protein with amino acid sequences GFYAEGSRGGSQASSRSSSR and GNSGNSTPGSSRGNSPARMASGGGK. With the PKC[alpha] phosphorylation treatment and the partial tryptic digestion, the N protein expressed in E. coli released the same peptides as observed in yeast cells. Thus, this investigation provided the preliminary data to determine the phosphorylation sites in the SARS-CoV N protein, and

  9. Identification of protein secretion systems and novel secreted proteins in Rhizobium leguminosarum bv. viciae

    Directory of Open Access Journals (Sweden)

    Krehenbrink Martin

    2008-01-01

    Background: Proteins secreted by bacteria play an important role in infection of eukaryotic hosts. Rhizobia infect the roots of leguminous plants and establish a mutually beneficial symbiosis. Proteins secreted during the infection process by some rhizobial strains can influence infection and modify the plant defence signalling pathways. The aim of this study was to systematically analyse protein secretion in the recently sequenced strain Rhizobium leguminosarum bv. viciae 3841. Results: Similarity searches using defined protein secretion systems from other Gram-negative bacteria as query sequences revealed that R. l. bv. viciae 3841 has ten putative protein secretion systems. These are the general export pathway (GEP), a twin-arginine translocase (TAT) secretion system, four separate Type I systems, one putative Type IV system and three Type V autotransporters. Mutations in genes encoding each of these (except the GEP) were generated, but only mutations affecting the PrsDE (Type I) and TAT systems were observed to affect the growth phenotype and the profile of proteins in the culture supernatant. Bioinformatic analysis and mass fingerprinting of tryptic fragments of culture supernatant proteins identified 14 putative Type I substrates, 12 of which are secreted via the PrsDE secretion system. The TAT mutant was defective for the symbiosis, forming nodules incapable of nitrogen fixation. Conclusion: None of the R. l. bv. viciae 3841 protein secretion systems putatively involved in the secretion of proteins to the extracellular space (Type I, Type IV, Type V) is required for establishing the symbiosis with legumes. The PrsDE (Type I) system was shown to be the major route of protein secretion in non-symbiotic cells and to secrete proteins of widely varied size and predicted function. This is in contrast to many Type I systems from other bacteria, which typically secrete specific substrates encoded by genes often localised in close proximity to

  10. Identification of a novel hypocholesterolemic protein, major royal jelly protein 1, derived from royal jelly.

    Directory of Open Access Journals (Sweden)

    Yuri Kashima

    Royal jelly (RJ) intake lowers serum cholesterol levels in animals and humans, but the active component in RJ that lowers serum cholesterol level and its molecular mechanism are unclear. In this study, we set out to identify the bile acid-binding protein contained in RJ, because dietary bile acid-binding proteins including soybean protein and its peptide are effective in ameliorating hypercholesterolemia. Using a cholic acid-conjugated column, we separated some bile acid-binding proteins from RJ and identified the major RJ protein 1 (MRJP1), MRJP2, and MRJP3 as novel bile acid-binding proteins from RJ, based on matrix-assisted laser desorption ionization time-of-flight mass spectrometry. Purified MRJP1, which is the most abundant of the bile acid-binding proteins in RJ, exhibited taurocholate-binding activity in vitro. The micellar solubility of cholesterol was significantly decreased in the presence of MRJP1 compared with casein in vitro. Liver bile acid levels were significantly increased, and cholesterol 7α-hydroxylase (CYP7A1) mRNA and protein tended to increase with MRJP1 feeding compared with the control. CYP7A1 mRNA and protein levels were significantly increased by MRJP1 tryptic hydrolysate treatment compared with casein tryptic hydrolysate treatment in hepatocytes. The hypocholesterolemic effect of MRJP1 was investigated in rats. The cholesterol-lowering action induced by MRJP1 occurs because MRJP1 interacts with bile acids, induces a significant increase in fecal bile acid excretion and a tendency toward increased fecal cholesterol excretion, and also enhances hepatic cholesterol catabolism. We have identified, for the first time, a novel hypocholesterolemic protein, MRJP1, in RJ. Interestingly, MRJP1 exhibits greater hypocholesterolemic activity than the medicine β-sitosterol in rats.

  11. Identification of differentially expressed proteins and phosphorylated proteins in rice seedlings in response to strigolactone treatment.

    Directory of Open Access Journals (Sweden)

    Fangyu Chen

    Strigolactones (SLs) are recently identified plant hormones that inhibit shoot branching and control various aspects of plant growth, development and interaction with parasites. Previous studies have shown that plant D10 protein is a carotenoid cleavage dioxygenase that functions in SL biosynthesis. In this work, we used an allelic SL-deficient d10 mutant XJC of rice (Oryza sativa L. spp. indica) to investigate proteins that were responsive to SL treatment. When grown in darkness, d10 mutant seedlings exhibited elongated mesocotyl that could be rescued by exogenous application of SLs. Soluble protein extracts were prepared from d10 mutant seedlings grown in darkness in the presence of GR24, a synthetic SL analog. Soluble proteins were separated on two-dimensional gels and subjected to proteomic analysis. Proteins that were expressed differentially and phosphoproteins whose phosphorylation status changed in response to GR24 treatment were identified. Eight proteins were found to be induced or down-regulated by GR24, and a different set of 8 phosphoproteins were shown to change their phosphorylation intensities in the dark-grown d10 seedlings in response to GR24 treatment. Analysis of these proteins revealed that they are important enzymes of the carbohydrate and amino acid metabolic pathways and key components of the cellular energy generation machinery. These proteins may represent potential targets of the SL signaling pathway. This study provides new insight into the complex and negative regulatory mechanism by which SLs control shoot branching and plant development.

  12. Redox proteomics identification of oxidatively modified myocardial proteins in human heart failure: implications for protein function.

    Directory of Open Access Journals (Sweden)

    Maura Brioschi

    Increased oxidative stress in a failing heart may contribute to the pathogenesis of heart failure (HF). The aim of this study was to identify the oxidised proteins in the myocardium of HF patients and analyse the consequences of oxidation on protein function. The carbonylated proteins in left ventricular tissue from failing (n = 14) and non-failing human hearts (n = 13) were measured by immunoassay and identified by proteomics. HL-1 cardiomyocytes were incubated in the presence of stimuli relevant for HF in order to assess the generation of reactive oxygen species (ROS), the induction of protein carbonylation, and its consequences on protein function. The levels of carbonylated proteins were significantly higher in the HF patients than in the controls (p<0.01). We identified two proteins that mainly underwent carbonylation: M-type creatine kinase (M-CK), whose activity is impaired, and, to a lesser extent, α-cardiac actin. Exposure of cardiomyocytes to angiotensin II and norepinephrine led to ROS generation and M-CK carbonylation with loss of its enzymatic activity. Our findings indicate that protein carbonylation is increased in the myocardium during HF and that these oxidative changes may help to explain the decreased CK activity and consequent defects in energy metabolism observed in HF.

  13. Redox proteomics identification of oxidatively modified myocardial proteins in human heart failure: implications for protein function.

    Science.gov (United States)

    Brioschi, Maura; Polvani, Gianluca; Fratto, Pasquale; Parolari, Alessandro; Agostoni, Piergiuseppe; Tremoli, Elena; Banfi, Cristina

    2012-01-01

    Increased oxidative stress in a failing heart may contribute to the pathogenesis of heart failure (HF). The aim of this study was to identify the oxidised proteins in the myocardium of HF patients and analyse the consequences of oxidation on protein function. The carbonylated proteins in left ventricular tissue from failing (n = 14) and non-failing human hearts (n = 13) were measured by immunoassay and identified by proteomics. HL-1 cardiomyocytes were incubated in the presence of stimuli relevant for HF in order to assess the generation of reactive oxygen species (ROS), the induction of protein carbonylation, and its consequences on protein function. The levels of carbonylated proteins were significantly higher in the HF patients than in the controls (p<0.01). We identified two proteins that mainly underwent carbonylation: M-type creatine kinase (M-CK), whose activity is impaired, and, to a lesser extent, α-cardiac actin. Exposure of cardiomyocytes to angiotensin II and norepinephrine led to ROS generation and M-CK carbonylation with loss of its enzymatic activity. Our findings indicate that protein carbonylation is increased in the myocardium during HF and that these oxidative changes may help to explain the decreased CK activity and consequent defects in energy metabolism observed in HF.

  14. Identification of active pocket and protein druggability within envelope glycoprotein GP2 from Ebola virus

    Institute of Scientific and Technical Information of China (English)

    Beuy Joob; Viroj Wiwanitkit

    2014-01-01

    The search for drugs to combat the present outbreak of Ebola virus infection is an urgent activity. Finding a new effective drug must be based on molecular analysis of the pathogenic virus. In-depth analysis of the viral protein is needed to find the binding site (active pocket). Here, the authors analyzed the envelope glycoprotein GP2 from Ebola virus and identified its active pockets and protein druggability. According to this assessment, 7 active pockets with varied druggability could be identified.

  15. Identification of active pocket and protein druggability within envelope glycoprotein GP2 from Ebola virus

    Institute of Scientific and Technical Information of China (English)

    Beuy Joob; Viroj Wiwanitkit

    2014-01-01

    The drug searching for combating the present outbreak of Ebola virus infection is the urgent activity at present. Finding the new effective drug at present must base on the molecular analysis of the pathogenic virus. The in-depth analysis of the viral protein to find the binding site, active pocket is needed. Here, the authors analyzed the envelope glycoprotein GP2 from Ebola virus. Identification of active pocket and protein druggability within envelope glycoprotein GP2 from Ebola virus was done. According to this assessment, 7 active pockets with varied druggability could be identified.

  16. Bioinformatics-Based Identification of Chemosensory Proteins in African Malaria Mosquito, Anopheles gambiae

    Institute of Scientific and Technical Information of China (English)

    Zhengxi Li; Zuorui Shen; Jingjiang Zhou; Lin Field

    2003-01-01

    Chemosensory proteins (CSPs) are identifiable by four spatially conserved cysteine residues in their primary structure or by two disulfide bridges in their tertiary structure, according to the previously identified olfactory specific-D related proteins. A genomics- and bioinformatics-based approach is taken in the present study to identify the putative CSPs in the malaria-carrying mosquito, Anopheles gambiae. The results show that five out of the nine annotated candidates are the most probable CSPs of A. gambiae. This study lays the foundation for further functional identification of Anopheles CSPs, though all of these candidates need additional experimental verification.

  17. Separating the Wheat from the Chaff: Unbiased Filtering of Background Tandem Mass Spectra Improves Protein Identification

    Science.gov (United States)

    Junqueira, Magno; Spirin, Victor; Balbuena, Tiago Santana; Waridel, Patrice; Surendranath, Vineeth; Kryukov, Grigoriy; Adzhubei, Ivan; Thomas, Henrik; Sunyaev, Shamil; Shevchenko, Andrej

    2009-01-01

    Only a small fraction of spectra acquired in LC-MS/MS runs matches peptides from target proteins upon database searches. The remaining, operationally termed background, spectra originate from a variety of poorly controlled sources and affect the throughput and confidence of database searches. Here, we report an algorithm and its software implementation that rapidly removes background spectra, regardless of their precise origin. The method estimates the dissimilarity distance between screened MS/MS spectra and unannotated spectra from a partially redundant background library compiled from several control and blank runs. Filtering MS/MS queries enhanced the protein identification capacity when searches lacked spectrum to sequence matching specificity. In sequence-similarity searches it reduced by, on average, 30-fold the number of orphan hits, which were not explicitly related to background protein contaminants and required manual validation. Removing high quality background MS/MS spectra, while preserving in the data set the genuine spectra from target proteins, decreased the false positive rate of stringent database searches and improved the identification of low-abundance proteins. PMID:18558732
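
    The filtering idea in this record (discard query spectra that lie close to a spectrum in a background library) can be sketched as below. The abstract does not specify the dissimilarity measure, so the cosine distance on binned intensities, the 1.0 m/z bin width, the 0.2 threshold, and every function name here are assumptions chosen for illustration only.

```python
import math
from collections import defaultdict

def binned(spectrum, bin_width=1.0):
    """Sum peak intensities into m/z bins; spectrum is a list of (mz, intensity)."""
    bins = defaultdict(float)
    for mz, intensity in spectrum:
        bins[int(mz // bin_width)] += intensity
    return bins

def cosine_distance(a, b):
    """1 - cosine similarity of two binned spectra (assumed dissimilarity measure)."""
    va, vb = binned(a), binned(b)
    dot = sum(v * vb.get(k, 0.0) for k, v in va.items())
    norm = (math.sqrt(sum(v * v for v in va.values()))
            * math.sqrt(sum(v * v for v in vb.values())))
    return 1.0 - dot / norm if norm else 1.0

def filter_background(queries, library, threshold=0.2):
    """Keep only query spectra farther than `threshold` from every library spectrum."""
    return [q for q in queries
            if all(cosine_distance(q, b) > threshold for b in library)]

# Hypothetical toy data: one background spectrum, one near-copy, one unrelated spectrum.
background_library = [[(100.2, 10.0), (200.5, 5.0)]]
near_copy = [(100.3, 9.0), (200.4, 5.5)]
unrelated = [(350.1, 8.0), (420.7, 3.0)]

kept = filter_background([near_copy, unrelated], background_library)
```

    In this toy case the near-copy of the background spectrum is dropped and the unrelated spectrum is passed on to the database search.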

  18. Identification of Protein Secretion Systems in Bacterial Genomes Using MacSyFinder.

    Science.gov (United States)

    Abby, Sophie S; Rocha, Eduardo P C

    2017-01-01

    Protein secretion systems are complex molecular machineries that translocate proteins through the outer membrane, and sometimes through multiple other barriers. They have evolved by co-option of components from other envelope-associated cellular machineries, making them sometimes difficult to identify and discriminate. Here, we describe how to identify protein secretion systems in bacterial genomes using MacSyFinder. This flexible computational tool uses the knowledge stemming from experimental studies to identify homologous systems in genome data. It can be used with a set of predefined models-"TXSScan"-to identify all major secretion systems of diderm bacteria (i.e., with inner and with LPS-containing outer membranes). For this, it identifies and clusters colocalized components of secretion systems using sequence similarity searches with hidden Markov model protein profiles. Finally, it checks whether the genetic content and organization of clusters satisfy the constraints of the model. TXSScan models can be customized to search for variants of known systems. The models can also be built from scratch to identify novel systems. In this chapter, we describe a complete pipeline of analysis, including the identification of a reference set of experimentally studied systems, the identification of components and the construction of their protein profiles, the definition of the models, their optimization, and, finally, their use as tools to search genomic data.

  19. Purification, identification and preliminary crystallographic studies of Pru du amandin, an allergenic protein from Prunus dulcis

    Energy Technology Data Exchange (ETDEWEB)

    Gaur, Vineet; Sethi, Dhruv K.; Salunke, Dinakar M., E-mail: dinakar@nii.res.in [National Institute of Immunology, Aruna Asaf Ali Marg, New Delhi 110 067 (India)

    2008-01-01

    The purification, identification, crystallization and preliminary crystallographic studies of an allergy-related protein, Pru du amandin, from P. dulcis nuts are reported. Food allergies appear to be one of the foremost causes of hypersensitivity reactions. Nut allergies account for most food allergies and are often permanent. The 360 kDa hexameric protein Pru du amandin, a known allergen, was purified from almonds (Prunus dulcis) by ammonium sulfate fractionation and ion-exchange chromatography. The protein was identified by a BLAST homology search against the nonredundant sequence database. Pru du amandin belongs to the 11S legumin family of seed storage proteins characterized by the presence of a cupin motif. Crystals were obtained by the hanging-drop vapour-diffusion method. The crystals belong to space group P4₁ (or P4₃), with unit-cell parameters a = b = 150.7, c = 164.9 Å.

  20. Identification and modification of dynamical regions in proteins for alteration of enzyme catalytic effect

    Energy Technology Data Exchange (ETDEWEB)

    Agarwal, Pratul K.

    2015-11-24

    A method for analysis, control, and manipulation for improvement of the chemical reaction rate of a protein-mediated reaction is provided. Enzymes, which typically comprise protein molecules, are very efficient catalysts that enhance chemical reaction rates by many orders of magnitude. Enzymes are widely used for a number of functions in chemical, biochemical, pharmaceutical, and other purposes. The method identifies key protein vibration modes that control the chemical reaction rate of the protein-mediated reaction, providing identification of the factors that enable the enzymes to achieve the high rate of reaction enhancement. By controlling these factors, the function of enzymes may be modulated, i.e., the activity can either be increased for faster enzyme reaction or it can be decreased when a slower enzyme is desired. This method provides an inexpensive and efficient solution by utilizing computer simulations, in combination with available experimental data, to build suitable models and investigate the enzyme activity.

  1. Identification and modification of dynamical regions in proteins for alteration of enzyme catalytic effect

    Science.gov (United States)

    Agarwal, Pratul K.

    2013-04-09

    A method for analysis, control, and manipulation for improvement of the chemical reaction rate of a protein-mediated reaction is provided. Enzymes, which typically comprise protein molecules, are very efficient catalysts that enhance chemical reaction rates by many orders of magnitude. Enzymes are widely used for a number of functions in chemical, biochemical, pharmaceutical, and other purposes. The method identifies key protein vibration modes that control the chemical reaction rate of the protein-mediated reaction, providing identification of the factors that enable the enzymes to achieve the high rate of reaction enhancement. By controlling these factors, the function of enzymes may be modulated, i.e., the activity can either be increased for faster enzyme reaction or it can be decreased when a slower enzyme is desired. This method provides an inexpensive and efficient solution by utilizing computer simulations, in combination with available experimental data, to build suitable models and investigate the enzyme activity.

  2. Bayesian mixture modeling using a hybrid sampler with application to protein subfamily identification.

    Science.gov (United States)

    Fong, Youyi; Wakefield, Jon; Rice, Kenneth

    2010-01-01

    Predicting protein function is essential to advancing our knowledge of biological processes. This article is focused on discovering the functional diversification within a protein family. A Bayesian mixture approach is proposed to model a protein family as a mixture of profile hidden Markov models. For a given mixture size, a hybrid Markov chain Monte Carlo sampler comprising both Gibbs sampling steps and hierarchical clustering-based split/merge proposals is used to obtain posterior inference. Inference for mixture size concentrates on comparing the integrated likelihoods. The choice of priors is critical with respect to the performance of the procedure. Through simulation studies, we show that 2 priors that are based on independent data sets allow correct identification of the mixture size, both when the data are homogeneous and when the data are generated from a mixture. We illustrate our method using 2 sets of real protein sequences.

  3. Fast and accurate method for identifying high-quality protein-interaction modules by clique merging and its application to yeast.

    Science.gov (United States)

    Zhang, Chi; Liu, Song; Zhou, Yaoqi

    2006-04-01

    Molecular networks in cells are organized into functional modules, where genes in the same module interact densely with each other and participate in the same biological process. Thus, identification of modules from molecular networks is an important step toward a better understanding of how cells function through the molecular networks. Here, we propose a simple, automatic method, called MC(2), to identify functional modules by enumerating and merging cliques in the protein-interaction data from large-scale experiments. Application of MC(2) to the S. cerevisiae protein-interaction data produces 84 modules, whose sizes range from 4 to 69 genes. The majority of the discovered modules are significantly enriched with a highly specific process term (at least 4 levels below root) and a specific cellular component in Gene Ontology (GO) tree. The average fraction of genes with the most enriched GO term for all modules is 82% for specific biological processes and 78% for specific cellular components. In addition, the predicted modules are enriched with coexpressed proteins. These modules are found to be useful for annotating unknown genes and uncovering novel functions of known genes. MC(2) is efficient, and takes only about 5 min to identify modules from the current yeast gene interaction network with a typical PC (Intel Xeon 2.5 GHz CPU and 512 MB memory). The CPU time of MC(2) is affordable (12 h) even when the number of interactions is increased by a factor of 10. MC(2) and its results are publicly available on http://theory.med.buffalo.edu/MC2.
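
    The two stages named in this record, enumerating cliques and then merging them into modules, can be sketched as follows. This is not the MC(2) algorithm itself: the abstract does not give the merging criterion, so the overlap rule (shared nodes divided by the smaller set, threshold 0.5) and both function names are hypothetical stand-ins.

```python
def maximal_cliques(adj):
    """Bron-Kerbosch enumeration (no pivoting) of maximal cliques.

    adj maps each node to the set of its neighbours.
    """
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(set(r))
            return
        for v in list(p):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return cliques

def merge_cliques(cliques, min_overlap=0.5):
    """Greedily merge node sets whose overlap reaches min_overlap (assumed rule)."""
    modules = [set(c) for c in cliques]
    merged = True
    while merged:
        merged = False
        for i in range(len(modules)):
            for j in range(i + 1, len(modules)):
                shared = len(modules[i] & modules[j])
                if shared / min(len(modules[i]), len(modules[j])) >= min_overlap:
                    modules[i] |= modules[j]
                    del modules[j]
                    merged = True
                    break
            if merged:
                break
    return modules

# Hypothetical toy interaction graph: two triangles sharing an edge, plus a separate triangle.
toy_adj = {
    "A": {"B", "C"}, "B": {"A", "C", "D"}, "C": {"A", "B", "D"}, "D": {"B", "C"},
    "X": {"Y", "Z"}, "Y": {"X", "Z"}, "Z": {"X", "Y"},
}
modules = merge_cliques(maximal_cliques(toy_adj))
```

    Here the cliques {A, B, C} and {B, C, D} share two of three nodes and are merged into one module, while {X, Y, Z} stays separate.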

  4. A rapid and accurate method for determining protein content in dairy products based on asynchronous-injection alternating merging zone flow-injection spectrophotometry.

    Science.gov (United States)

    Liang, Qin-Qin; Li, Yong-Sheng

    2013-12-01

    An accurate and rapid method and a system to determine protein content using asynchronous-injection alternating merging zone flow-injection spectrophotometry, based on the reaction between coomassie brilliant blue G250 (CBBG) and protein, were established. The main merit of our approach is that it avoids interference from other nitrogen-containing compounds in samples, such as melamine and urea. The optimized conditions are as follows: concentrations of CBBG, polyvinyl alcohol (PVA), NaCl and HCl are 150 mg/l, 30 mg/l, 0.1 mol/l and 1.0% (v/v), respectively; volumes of the sample and reagent are 150 μl and 30 μl, respectively; the length of the reaction coil is 200 cm; the total flow rate is 2.65 ml/min. The linear range of the method is 0.5-15 mg/l (BSA), its detection limit is 0.05 mg/l, its relative standard deviation is less than 1.87% (n=11), and its analytical speed is 60 samples per hour.

  5. Improved protein extraction and protein identification from archival formalin-fixed paraffin-embedded human aortas.

    Science.gov (United States)

    Fu, Zongming; Yan, Kun; Rosenberg, Avraham; Jin, Zhicheng; Crain, Barbara; Athas, Grace; Heide, Richard S Vander; Howard, Timothy; Everett, Allen D; Herrington, David; Van Eyk, Jennifer E

    2013-04-01

    The aim was to evaluate a combination of heat and elevated pressure to enhance protein extraction and quality of formalin-fixed (FF) and FF paraffin-embedded (FFPE) aorta for proteomics. Proteins were extracted from fresh frozen aorta at room temperature (RT). FF and FFPE aortas (3 months and 15 years) were extracted at RT, with heat alone, or with a combination of heat and high pressure. Protein yields were compared, and digested peptides from the extracts were analyzed with MS. Combined heat and elevated pressure increased protein yield from human FF or FFPE aorta compared to matched tissues with heat alone (1.5-fold) or at RT (8.3-fold), resulting in more proteins identified and with more sequence coverage. The length of storage did adversely affect the quality of proteins from FF tissue. For long-term storage, aorta was preserved better with FFPE than FF alone. Periostin and MFG-E8 were demonstrated to be suitable for MRM assays from FFPE aorta. A combination of heat and high pressure is an effective method to extract proteins from FFPE aorta for downstream proteomics. This method opens the possibility of using archival and often rare FFPE aortas, and possibly other tissues, for biomarker discovery and quantification by proteomics. © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  6. Development of a Rapid and Accurate Identification Method for Citrobacter Species Isolated from Pork Products Using a Matrix-Assisted Laser-Desorption Ionization Time-of-Flight Mass Spectrometry (MALDI-TOF MS).

    Science.gov (United States)

    Kwak, Hye-Lim; Han, Sun-Kyung; Park, Sunghoon; Park, Si Hong; Shim, Jae-Yong; Oh, Mihwa; Ricke, Steven C; Kim, Hae-Yeong

    2015-09-01

    Previous detection methods for Citrobacter are considered time consuming and laborious. In this study, we have developed a rapid and accurate detection method for Citrobacter species in pork products, using matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) mass spectrometry (MS). A total of 35 Citrobacter strains were isolated from 30 pork products and identified by both MALDI-TOF MS and 16S rRNA gene sequencing approaches. All isolates were identified to the species level by the MALDI-TOF MS, while 16S rRNA gene sequencing results could not discriminate them clearly. These results confirmed that MALDI-TOF MS is a more accurate and rapid detection method for the identification of Citrobacter species.

  7. Identification of conserved surface proteins as novel antigenic vaccine candidates of Actinobacillus pleuropneumoniae.

    Science.gov (United States)

    Chen, Xiabing; Xu, Zhuofei; Li, Lu; Chen, Huanchun; Zhou, Rui

    2012-12-01

    Actinobacillus pleuropneumoniae is an important swine respiratory pathogen causing great economic losses worldwide. Identification of conserved surface antigenic proteins is helpful for developing effective vaccines. In this study, a genome-wide strategy combining bioinformatic and experimental approaches was applied to discover and characterize surface-associated immunogenic proteins of A. pleuropneumoniae. Thirty-nine genes encoding outer membrane proteins (OMPs) and lipoproteins were identified by comparative genomics and gene expression profiling as being highly conserved and stably transcribed in the different serotypes of A. pleuropneumoniae reference strains. Twelve of these conserved proteins were successfully expressed in Escherichia coli and their immunogenicity was estimated by homologous challenge in the mouse model, and then three of these proteins (APJL_0126, HbpA and OmpW) were further tested in the natural host (swine) by homologous and heterologous challenges. The results showed that these proteins could induce high titers of antibodies, but vaccination with each protein individually elicited low protective immunity against A. pleuropneumoniae. This study gives novel insights into the immunogenicity of the conserved OMPs and lipoproteins of A. pleuropneumoniae. Although none of the surface proteins characterized in this study could individually induce effective protective immunity against A. pleuropneumoniae, they are potential candidates for subunit vaccines in combination with Apx toxins.

  8. Identification of obscure yet conserved actin-associated proteins in Giardia lamblia.

    Science.gov (United States)

    Paredez, Alexander R; Nayeri, Arash; Xu, Jennifer W; Krtková, Jana; Cande, W Zacheus

    2014-06-01

    Consistent with its proposed status as an early branching eukaryote, Giardia has the most divergent actin of any eukaryote and lacks core actin regulators. Although conserved actin-binding proteins are missing from Giardia, its actin is utilized similarly to that of other eukaryotes and functions in core cellular processes such as cellular organization, endocytosis, and cytokinesis. We set out to identify actin-binding proteins in Giardia using affinity purification coupled with mass spectroscopy (multidimensional protein identification technology [MudPIT]) and have identified >80 putative actin-binding proteins. Several of these have homology to conserved proteins known to complex with actin for functions in the nucleus and flagella. We validated localization and interaction for seven of these proteins, including 14-3-3, a known cytoskeletal regulator with a controversial relationship to actin. Our results indicate that although Giardia lacks canonical actin-binding proteins, there is a conserved set of actin-interacting proteins that are evolutionarily indispensable and perhaps represent some of the earliest functions of the actin cytoskeleton.

  9. pH fractionation and identification of proteins: comparing column chromatofocusing versus liquid isoelectric focusing techniques.

    Science.gov (United States)

    Gunther, Nereus W; Paul, Moushumi; Nuñez, Alberto; Liu, Yanhong

    2012-06-01

    In proteomic investigations, a number of different separation techniques can be applied to fractionate whole cell proteomes into more manageable fractions for subsequent analysis. In this work, utilizing HPLC and mass spectrometry for protein identification, two different fractionation methods were compared and contrasted to determine the potential of each method for the simple and reproducible fractionation of a bacterial proteome. Column-based chromatofocusing and liquid-based isoelectric focusing both utilized pH gradients to produce similar results in terms of the numbers of proteins successfully identified (402 and 378 proteins) and the consistency of proteins identified from one experiment to the next (<10% change). However, there was limited overlap in the protein sets, with <50% of the proteins identified as common between the sets of proteins identified by the different systems. In addition to the numbers of proteins identified and the consistency of those identified, the reduced monetary costs of experimentation and the increased assay flexibility produced by using isoelectric focusing were considered in order to adopt a system best suited for comparative proteomic projects. © 2012 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Comprehensive identification of protein substrates of the Dot/Icm type IV transporter of Legionella pneumophila.

    Directory of Open Access Journals (Sweden)

    Wenhan Zhu

    A large number of proteins transferred by the Legionella pneumophila Dot/Icm system have been identified by various strategies. With no exceptions, these strategies are based on one or more characteristics associated with the tested proteins. Given the high level of diversity exhibited by the identified proteins, it is possible that some substrates have been missed in these screenings. In this study, we used a systematic method to survey the L. pneumophila genome by testing hypothetical orfs larger than 300 base pairs for Dot/Icm-dependent translocation. 798 of the 832 analyzed orfs were successfully fused to the carboxyl end of β-lactamase. The transfer of the fusions into mammalian cells was determined using the β-lactamase reporter substrate CCF4-AM. These efforts led to the identification of 164 proteins positive in translocation. Among these, 70 proteins are novel substrates of the Dot/Icm system. These results brought the total number of experimentally confirmed Dot/Icm substrates to 275. Sequence analysis of the C-termini of these identified proteins revealed that Lpg2844, which contains few features known to be important for Dot/Icm-dependent protein transfer, can be translocated at a high efficiency. Thus, our efforts have identified a large number of novel substrates of the Dot/Icm system and have revealed the diverse features recognizable by this protein transporter.

  11. Preliminary identification of secreted proteins by Leptospira interrogans serovar Kennewicki strain Pomona Fromm

    Energy Technology Data Exchange (ETDEWEB)

    Ricardi, L.M.P.; Portaro, F.C.; Abreu, P.A.E.; Barbosa, A.S. [Instituto Butantan, Sao Paulo, SP (Brazil)]; Morais, Z.M.; Vasconcellos, S.A. [Universidade de Sao Paulo (USP), SP (Brazil)]

    2012-07-01

    This project aimed to identify proteins secreted by pathogenic Leptospira interrogans serovar Kennewicki strain Pomona Fromm (LPF) by proteomic analyses. The strain LPF, whose virulence was maintained by passages in hamsters, was cultured in EMJH medium. The supernatants were centrifuged, dialyzed and subjected to lyophilization. Protein samples were resolved first by IEF at pH 3 to 10 on 13-cm immobilized pH gradient strips. Strips were then processed for the second-dimension separation on SDS-polyacrylamide gels. Proteins from gel spots were subjected to reduction, cysteine alkylation, and in-gel tryptic digestion, and analyzed by LC/MS/MS. Liquid chromatography-based separation followed by automated tandem mass spectrometry was also used to identify secreted proteins. In silico analyses were performed using the PSORTb v.3.0 program and the SignalP server. One major obstacle to secretome studies is the difficulty of obtaining extracts of secreted proteins without cytoplasmic contamination. In addition, the extraction of low-concentration proteins from large volumes of culture media, which are rich in salts, BSA and other compounds, frequently interferes with most proteomics techniques. For these reasons, several experimental approaches were used to optimize the protocol applied. Despite these obstacles, our analysis resulted in the identification of 200 proteins with high confidence. Only 5 of 63 secreted proteins predicted by in silico analysis were found. Other classes identified included proteins that possess a signal peptide but whose cellular localization prediction is unknown or may have multiple localization sites, and proteins that lack a signal peptide and are thus thought to be secreted via non-conventional mechanisms or to result from cytoplasmic contamination by cell lysis. Many of these are hypothetical proteins with no putative conserved domains detected. To our knowledge, this is the first study to identify secreted proteins by

  12. Rapid and accurate identification of species belonging to the Candida parapsilosis complex by real-time PCR and melting curve analysis.

    Science.gov (United States)

    Hays, Constantin; Duhamel, Chantal; Cattoir, Vincent; Bonhomme, Julie

    2011-04-01

    Candida parapsilosis is the second most frequent Candida species isolated from blood cultures. Since 2005, C. parapsilosis has been divided into three distinct species based on genetic traits: Candida parapsilosis, Candida metapsilosis and Candida orthopsilosis. The aim of this study was to develop a rapid real-time PCR assay able to distinguish these closely related species via a melting curve analysis. This identification method was optimized by using reference strains and well-characterized clinical isolates of Candida species. A single set of consensus primers was designed to amplify a 184 bp portion of the SADH gene in order to identify species based on the unique melt profile resulting from DNA sequence variations from each species of the complex. PCR products were detected with SYBR Green fluorescent dye and identification was established by melting curve analysis. For validation of the technique, a total of 116 clinical isolates, phenotypically identified as C. parapsilosis, were tested by real-time PCR and results were further compared with PCR-RFLP patterns of the SADH gene, used as the reference method. The melting curve analysis of amplified DNA could differentiate between C. parapsilosis (83.5 °C), C. metapsilosis (82.9 °C) and C. orthopsilosis (82.1 °C), with a sensitivity and specificity comparable to those of the reference method. One hundred and fourteen C. parapsilosis and two C. orthopsilosis isolates were identified among the clinical isolates. This method provides a simple, rapid and reliable identification of species belonging to the C. parapsilosis complex. This novel approach could be helpful for clinical and epidemiological investigations.
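
    The melting-curve readout described above can be illustrated with a small nearest-peak lookup. This is only a sketch, not the authors' analysis pipeline: the three reference melting temperatures are taken from the abstract, while the function name `classify_by_tm` and the 0.25 °C tolerance are hypothetical choices made for illustration.

```python
# Reference melting temperatures (°C) as reported in the abstract.
REFERENCE_TM = {
    "C. parapsilosis": 83.5,
    "C. metapsilosis": 82.9,
    "C. orthopsilosis": 82.1,
}


def classify_by_tm(observed_tm, tolerance=0.25):
    """Return the species whose reference Tm is nearest to observed_tm.

    Returns None when the nearest reference is further away than
    `tolerance` (a hypothetical cut-off, not a value from the study).
    """
    species, reference = min(
        REFERENCE_TM.items(), key=lambda item: abs(item[1] - observed_tm)
    )
    if abs(reference - observed_tm) > tolerance:
        return None
    return species


if __name__ == "__main__":
    for tm in (83.4, 82.9, 82.2, 80.0):
        print(tm, classify_by_tm(tm))
```

    Because the reported peaks are only 0.6 to 0.8 °C apart, the tolerance is kept below half the smallest gap so that no observed value can match two species.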

  13. Application of an Accurate and Validated Method for Identification and Quantification of Acrylamide in Bread, Biscuits and Other Bakery Products Using GC-MS/MS System

    OpenAIRE

    Negoiță, Mioara; Culețu, Alina

    2016-01-01

    A gas chromatography tandem mass spectrometry method has been developed and validated for the separation, detection, identification and quantification of acrylamide in bread, biscuits and similar products. The method showed good precision with values lower than 6%. A good sensitivity was achieved for bread with 2.41 and 7.23 µg kg⁻¹ limit of detection (LOD) and limit of quantification (LOQ), respectively, while for biscuits, LOD and LOQ were 4.63 and 13.89 µg kg⁻¹, respectively. Accuracy obtained th...

  14. Multi-Segment Direct Inject nano-ESI-LTQ-FT-ICR-MS/MS For Protein Identification.

    Science.gov (United States)

    Chen, Jing; Canales, Lorena; Neal, Rachel E

    2011-07-07

    Reversed phase high performance liquid chromatography (HPLC) interfaced to electrospray tandem mass spectrometry (MS/MS) is commonly used for the identification of peptides from proteolytically cleaved proteins embedded in a polyacrylamide gel matrix as well as for metabolomics screening. HPLC separations are time consuming (30-60 min average), costly (columns and mobile phase reagents), and carry the risk of column carry over between samples. The use of a chip-based nano-ESI platform (Advion NanoMate) based on replaceable nano-tips for sample introduction eliminates sample cross-contamination, provides unchanging sample matrix, and enhances spray stability with attendant increases in reproducibility. Recent papers have established direct infusion nano-ESI-MS/MS utilizing the NanoMate for protein identification of gel spots based on full range MS scans with data dependent MS/MS. In a full range scan, discontinuous ion suppression due to sample matrix can impair identification of putative mass features of interest in both the proteomic and metabolomic workflows. In the current study, an extension of an established direct inject nano-ESI-MS/MS method is described that utilizes the mass filtering capability of an ion-trap for ion packet separation into four narrow mass ranges (50 amu overlap) with segment specific dynamic data dependent peak inclusion for MS/MS fragmentation (total acquisition time of 3 minutes). Comparison of this method with a more traditional nanoLC-MS/MS based protocol utilizing solvent/sample stream splitting to achieve nanoflow demonstrated comparable results for protein identification from polyacrylamide gel matrices. The advantages of this method include full automation, lack of cross-contamination, low cost, and high throughput.
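
    The segmented acquisition described above (four narrow mass ranges with a 50 amu overlap) can be sketched as a simple window calculation. The abstract does not state the overall m/z range, so the 400 to 1750 range used below is a hypothetical placeholder, and `overlapping_segments` is an illustrative helper rather than part of any instrument software.

```python
def overlapping_segments(low, high, n_segments=4, overlap=50.0):
    """Split [low, high] into n equal-width windows that overlap by `overlap`.

    The window width w satisfies n*w - (n-1)*overlap = high - low, so that
    the first window starts at `low` and the last one ends at `high`.
    """
    if n_segments < 1:
        raise ValueError("n_segments must be at least 1")
    width = (high - low + (n_segments - 1) * overlap) / n_segments
    if width <= overlap:
        raise ValueError("overlap is too large for the requested range")
    step = width - overlap  # distance between consecutive window starts
    return [(low + i * step, low + i * step + width) for i in range(n_segments)]


if __name__ == "__main__":
    # Hypothetical scan range; the study's actual limits are not given above.
    for start, end in overlapping_segments(400.0, 1750.0):
        print(f"{start:.0f}-{end:.0f}")
```

    With these assumed limits each window is 375 units wide and shares 50 units with its neighbour.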

  15. Multi-Segment Direct Inject nano-ESI-LTQ-FT-ICR-MS/MS For Protein Identification

    Directory of Open Access Journals (Sweden)

    Neal Rachel E

    2011-07-01

    Reversed phase high performance liquid chromatography (HPLC) interfaced to electrospray tandem mass spectrometry (MS/MS) is commonly used for the identification of peptides from proteolytically cleaved proteins embedded in a polyacrylamide gel matrix as well as for metabolomics screening. HPLC separations are time consuming (30-60 min average), costly (columns and mobile phase reagents), and carry the risk of column carry over between samples. The use of a chip-based nano-ESI platform (Advion NanoMate) based on replaceable nano-tips for sample introduction eliminates sample cross-contamination, provides unchanging sample matrix, and enhances spray stability with attendant increases in reproducibility. Recent papers have established direct infusion nano-ESI-MS/MS utilizing the NanoMate for protein identification of gel spots based on full range MS scans with data dependent MS/MS. In a full range scan, discontinuous ion suppression due to sample matrix can impair identification of putative mass features of interest in both the proteomic and metabolomic workflows. In the current study, an extension of an established direct inject nano-ESI-MS/MS method is described that utilizes the mass filtering capability of an ion-trap for ion packet separation into four narrow mass ranges (50 amu overlap) with segment specific dynamic data dependent peak inclusion for MS/MS fragmentation (total acquisition time of 3 minutes). Comparison of this method with a more traditional nanoLC-MS/MS based protocol utilizing solvent/sample stream splitting to achieve nanoflow demonstrated comparable results for protein identification from polyacrylamide gel matrices. The advantages of this method include full automation, lack of cross-contamination, low cost, and high throughput.

  16. Technical advance: identification of plant actin-binding proteins by F-actin affinity chromatography

    Science.gov (United States)

    Hu, S.; Brady, S. R.; Kovar, D. R.; Staiger, C. J.; Clark, G. B.; Roux, S. J.; Muday, G. K.

    2000-01-01

    Proteins that interact with the actin cytoskeleton often modulate the dynamics or organization of the cytoskeleton or use the cytoskeleton to control their localization. In plants, very few actin-binding proteins have been identified and most are thought to modulate cytoskeleton function. To identify actin-binding proteins that are unique to plants, the development of new biochemical procedures will be critical. Affinity columns using actin monomers (globular actin, G-actin) or actin filaments (filamentous actin, F-actin) have been used to identify actin-binding proteins from a wide variety of organisms. Monomeric actin from zucchini (Cucurbita pepo L.) hypocotyl tissue was purified to electrophoretic homogeneity and shown to be native and competent for polymerization to actin filaments. G-actin, F-actin and bovine serum albumin affinity columns were prepared and used to separate samples enriched in either soluble or membrane-associated actin-binding proteins. Extracts of soluble actin-binding proteins yield distinct patterns when eluted from the G-actin and F-actin columns, respectively, leading to the identification of a putative F-actin-binding protein of approximately 40 kDa. When plasma membrane-associated proteins were applied to these columns, two abundant polypeptides eluted selectively from the F-actin column and cross-reacted with antiserum against pea annexins. Additionally, a protein that binds auxin transport inhibitors, the naphthylphthalamic acid binding protein, which has been previously suggested to associate with the actin cytoskeleton, was eluted in a single peak from the F-actin column. These experiments provide a new approach that may help to identify novel actin-binding proteins from plants.

  17. Classification and Identification of Plant Fibrous Material with Different Species Using near Infrared Technique—A New Way to Approach Determining Biomass Properties Accurately within Different Species

    Science.gov (United States)

    Jiang, Wei; Zhou, Chengfeng; Han, Guangting; Via, Brian; Swain, Tammy; Fan, Zhaofei; Liu, Shaoyang

    2017-01-01

    Plant fibrous material is a good resource for textile and other industries. Normally, the several kinds of plant fibrous material used in one process need to be identified and characterized in advance. They are easy to identify in raw condition. However, most of the materials are semi-finished products that have been ground, rotted or pre-hydrolyzed. Classifying such samples, which include different species, with high accuracy is a big challenge. In this research, both qualitative and quantitative analysis methods were chosen to classify six different species of samples, including softwood, hardwood, bast, and aquatic plant. Soft Independent Modeling of Class Analogy (SIMCA) and partial least squares (PLS) were used. The algorithm for classifying different species of samples using PLS was developed independently in this research. The results showed that the six species can be successfully classified using the SIMCA and PLS methods, and that the two methods give similar results. The identification rates of kenaf, ramie and pine are 100%, and the identification rates of lotus, eucalyptus and tallow are higher than 94%. It was also found that spectra loadings can help select the best wavenumber ranges for constructing the NIR model. Inter-material distance shows how close two species are. The scores graph is helpful for choosing the number of principal components during model construction. PMID:28105037

  18. Isolation and Identification of Concanavalin A Binding Glycoproteins from Human Seminal Plasma: A Step Towards Identification of Male Infertility Marker Proteins

    Directory of Open Access Journals (Sweden)

    Anil Kumar Tomar

    2011-01-01

    Human seminal plasma contains a large array of proteins of clinical importance which are essentially needed to maintain the reproductive physiology of spermatozoa and for successful fertilization. Thus, isolation and identification of seminal plasma proteins is of paramount significance for their biophysical characterization and functional analysis in reproductive physiological processes. In this study, we have isolated Concanavalin-A binding glycoproteins from human seminal plasma and subsequently identified them by MALDI-TOF/MS analysis. The major proteins, as identified in this study, are Aminopeptidase N, lactoferrin, prostatic acid phosphatase, zinc-alpha-2-glycoprotein, prostate specific antigen, progestagen-associated endometrial protein, Izumo sperm-egg fusion protein and prolactin inducible protein. This paper also reports preliminary studies to identify altered expression of these proteins in oligospermia and azoospermia in comparison to normospermia. In oligospermia, five proteins were found to be downregulated while in azoospermia, four proteins were downregulated and two proteins were upregulated. Thus, this study is of immense biomedical interest towards identification of potential male infertility marker proteins in seminal plasma.

  19. Isolation and Identification of Concanavalin A Binding Glycoproteins from Human Seminal Plasma: A Step Towards Identification of Male Infertility Marker Proteins

    Science.gov (United States)

    Tomar, Anil Kumar; Sooch, Balwinder Singh; Raj, Isha; Singh, Sarman; Singh, Tej P.; Yadav, Savita

    2011-01-01

    Human seminal plasma contains a large array of proteins of clinical importance which are essentially needed to maintain the reproductive physiology of spermatozoa and for successful fertilization. Thus, isolation and identification of seminal plasma proteins is of paramount significance for their biophysical characterization and functional analysis in reproductive physiological processes. In this study, we have isolated Concanavalin-A binding glycoproteins from human seminal plasma and subsequently identified them by MALDI-TOF/MS analysis. The major proteins, as identified in this study, are Aminopeptidase N, lactoferrin, prostatic acid phosphatase, zinc-alpha-2-glycoprotein, prostate specific antigen, progestagen-associated endometrial protein, Izumo sperm-egg fusion protein and prolactin inducible protein. This paper also reports preliminary studies to identify altered expression of these proteins in oligospermia and azoospermia in comparison to normospermia. In oligospermia, five proteins were found to be downregulated while in azoospermia, four proteins were downregulated and two proteins were upregulated. Thus, this study is of immense biomedical interest towards identification of potential male infertility marker proteins in seminal plasma. PMID:22182811

  20. enDNA-Prot: Identification of DNA-Binding Proteins by Applying Ensemble Learning

    Directory of Open Access Journals (Sweden)

    Ruifeng Xu

    2014-01-01

    DNA-binding proteins are crucial for various cellular processes, such as recognition of specific nucleotides, regulation of transcription, and regulation of gene expression. Developing an effective model for identifying DNA-binding proteins is an urgent research problem. Up to now, many methods have been proposed, but most of them focus on only one classifier and cannot make full use of the large number of negative samples to improve prediction performance. This study proposed a predictor called enDNA-Prot for DNA-binding protein identification by employing the ensemble learning technique. Experimental results showed that enDNA-Prot was comparable with DNA-Prot and outperformed DNAbinder and iDNA-Prot with performance improvement in the range of 3.97–9.52% in ACC and 0.08–0.19 in MCC. Furthermore, when the benchmark dataset was expanded with negative samples, enDNA-Prot outperformed the three existing methods by 2.83–16.63% in terms of ACC and 0.02–0.16 in terms of MCC. This indicates that enDNA-Prot is an effective method for DNA-binding protein identification and that expanding the training dataset with negative samples can improve its performance. For the convenience of the vast majority of experimental scientists, we developed a user-friendly web-server for enDNA-Prot which is freely accessible to the public.
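
    The two metrics quoted above, accuracy (ACC) and the Matthews correlation coefficient (MCC), are both computed from the counts of a binary confusion matrix. The sketch below uses the standard textbook definitions; the counts in the usage example are made up for illustration and are not results from the enDNA-Prot study.

```python
import math


def acc_and_mcc(tp, tn, fp, fn):
    """Return (accuracy, MCC) for a binary confusion matrix.

    tp/tn/fp/fn are the counts of true positives, true negatives,
    false positives and false negatives.
    """
    total = tp + tn + fp + fn
    if total == 0:
        raise ValueError("confusion matrix is empty")
    accuracy = (tp + tn) / total
    denominator = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    # By convention MCC is reported as 0 when any marginal sum is zero.
    mcc = (tp * tn - fp * fn) / denominator if denominator else 0.0
    return accuracy, mcc


if __name__ == "__main__":
    # Illustrative counts only.
    accuracy, mcc = acc_and_mcc(tp=40, tn=45, fp=5, fn=10)
    print(f"ACC = {accuracy:.2%}, MCC = {mcc:.3f}")
```

    MCC is less sensitive than ACC to class imbalance, which is relevant when a benchmark is expanded with many negative samples.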

  1. Identification and characterization of RBM44 as a novel intercellular bridge protein.

    Directory of Open Access Journals (Sweden)

    Tokuko Iwamori

    Intercellular bridges are evolutionarily conserved structures that connect differentiating germ cells. We previously reported the identification of TEX14 as the first essential intercellular bridge protein, the demonstration that intercellular bridges are required for male fertility, and the finding that intercellular bridges utilize components of the cytokinesis machinery to form. Herein, we report the identification of RNA binding motif protein 44 (RBM44) as a novel germ cell intercellular bridge protein. RBM44 was identified by proteomic analysis after intercellular bridge enrichment using TEX14 as a marker protein. RBM44 is highly conserved between mouse and human and contains an RNA recognition motif of unknown function. RBM44 mRNA is enriched in testis, and immunofluorescence confirms that RBM44 is an intercellular bridge component. However, RBM44 only partially localizes to TEX14-positive intercellular bridges. RBM44 is expressed most highly in pachytene and secondary spermatocytes, but disappears abruptly in spermatids. We discovered that RBM44 interacts with itself and TEX14 using yeast two-hybrid, mammalian two-hybrid, and immunoprecipitation. To define the in vivo function of RBM44, we generated a targeted deletion of Rbm44 in mice. Rbm44 null male mice produce somewhat increased sperm, and show enhanced fertility of unknown etiology. Thus, although RBM44 localizes to intercellular bridges during meiosis, RBM44 is not required for fertility in contrast to TEX14.

  2. Identification and Characterization of RBM44 as a Novel Intercellular Bridge Protein

    Science.gov (United States)

    Iwamori, Tokuko; Lin, Yi-Nan; Ma, Lang; Iwamori, Naoki; Matzuk, Martin M.

    2011-01-01

    Intercellular bridges are evolutionarily conserved structures that connect differentiating germ cells. We previously reported the identification of TEX14 as the first essential intercellular bridge protein, the demonstration that intercellular bridges are required for male fertility, and the finding that intercellular bridges utilize components of the cytokinesis machinery to form. Herein, we report the identification of RNA binding motif protein 44 (RBM44) as a novel germ cell intercellular bridge protein. RBM44 was identified by proteomic analysis after intercellular bridge enrichment using TEX14 as a marker protein. RBM44 is highly conserved between mouse and human and contains an RNA recognition motif of unknown function. RBM44 mRNA is enriched in testis, and immunofluorescence confirms that RBM44 is an intercellular bridge component. However, RBM44 only partially localizes to TEX14-positive intercellular bridges. RBM44 is expressed most highly in pachytene and secondary spermatocytes, but disappears abruptly in spermatids. We discovered that RBM44 interacts with itself and TEX14 using yeast two-hybrid, mammalian two-hybrid, and immunoprecipitation. To define the in vivo function of RBM44, we generated a targeted deletion of Rbm44 in mice. Rbm44 null male mice produce somewhat increased sperm, and show enhanced fertility of unknown etiology. Thus, although RBM44 localizes to intercellular bridges during meiosis, RBM44 is not required for fertility in contrast to TEX14. PMID:21364893

  3. Identification of proteins associated with Mycobacterium tuberculosis virulence pathway by their polar profile.

    Science.gov (United States)

    Polanco, Carlos; Castañón-González, Jorge Alberto; Mancilla, Raul; Buhse, Thomas; Samaniego, José Lino; Gimbel, Arturo

    2015-01-01

    With almost one third of the world population infected, tuberculosis is one of the most devastating diseases worldwide and it is a major threat to any healthcare system. With the mathematical-computational method named "Polarity Index Method", already published by this group, we identified, with high accuracy (70%), proteins related to Mycobacterium tuberculosis bacteria virulence pathway from the Tuberculist Database. The test considered the totality of proteins cataloged in the main domains: fungi, bacteria, and viruses from three databases: Antimicrobial Peptide Database (APD2), Tuberculist Database, Uniprot Database, and four antigens of Mycobacterium tuberculosis: PstS-1, 38-kDa, 19-kDa, and H37Rv ORF. The method described was calibrated with each database to achieve the same performance, showing a high percentage of coincidence in the identification of proteins associated with Mycobacterium tuberculosis bacteria virulence pathway located in the Tuberculist Database, and identifying a polar pattern regardless of the group studied. This method has already been used in the identification of diverse groups of proteins and peptides, showing that it is an effective discriminant. Its metric considers only one physico-chemical property, i.e. polarity.

  4. Proteomics meets blood banking: identification of protein targets for the improvement of platelet quality.

    Science.gov (United States)

    Schubert, Peter; Devine, Dana V

    2010-01-01

    Proteomics has brought new perspectives to the fields of hematology and transfusion medicine in the last decade. The steady improvement of proteomic technology is propelling novel discoveries of molecular mechanisms by studying protein expression, post-translational modifications and protein interactions. This review article focuses on the application of proteomics to the identification of molecular mechanisms leading to the deterioration of blood platelets during storage - a critical aspect in the provision of platelet transfusion products. Several proteomic approaches have been employed to analyse changes in the platelet protein profile during storage and the obtained data now need to be translated into platelet biochemistry in order to connect the results to platelet function. Targeted biochemical applications then allow the identification of points for intervention in signal transduction pathways. Once validated and placed in a transfusion context, these data will provide further understanding of the underlying molecular mechanisms leading to platelet storage lesion. Future aspects of proteomics in blood banking will aim to make use of protein markers identified for platelet storage lesion development to monitor proteome changes when alterations such as the use of additive solutions or pathogen reduction strategies are put in place in order to improve platelet quality for patients.

  5. Identification of bovine sperm acrosomal proteins that interact with a 32-kDa acrosomal matrix protein.

    Science.gov (United States)

    Nagdas, Subir K; Smith, Linda; Medina-Ortiz, Ilza; Hernandez-Encarnacion, Luisa; Raychoudhury, Samir

    2016-03-01

    Mammalian fertilization is accomplished by the interaction between sperm and egg. Previous studies from this laboratory have identified a stable acrosomal matrix assembly from the bovine sperm acrosome termed the outer acrosomal membrane-matrix complex (OMC). This stable matrix assembly exhibits precise binding activity for acrosin and N-acetylglucosaminidase. A highly purified OMC fraction comprises three major (54, 50, and 45 kDa) and several minor (38-19 kDa) polypeptides. The set of minor polypeptides (38-19 kDa) termed "OMCrpf polypeptides" is selectively solubilized by high-pH extraction (pH 10.5), while the three major polypeptides (55, 50, and 45 kDa) remain insoluble. Proteomic identification of the OMC32 polypeptide (32 kDa polypeptide isolated from high-pH soluble fraction of OMC) yielded two peptides that matched the NCBI database sequence of acrosin-binding protein. Anti-OMC32 recognized an antigenically related family of polypeptides (OMCrpf polypeptides) in the 38-19-kDa range with isoelectric points ranging between 4.0 and 5.1. Other than glycohydrolases, OMC32 may also be complexed to other acrosomal proteins. The present study was undertaken to identify and localize the OMC32 binding polypeptides and to elucidate the potential role of the acrosomal protein complex in sperm function. OMC32 affinity chromatography of a detergent-soluble fraction of bovine cauda sperm acrosome followed by mass spectrometry-based identification of bound proteins identified acrosin, lactadherin, SPACA3, and IZUMO1. Co-immunoprecipitation analysis also demonstrated the interaction of OMC32 with acrosin, lactadherin, SPACA3, and IZUMO1. Our immunofluorescence studies revealed the presence of SPACA3 and lactadherin over the apical segment, whereas IZUMO1 is localized over the equatorial segment of Triton X-100 permeabilized cauda sperm. Immunoblot analysis showed that a significant portion of SPACA3 was released after the lysophosphatidylcholine (LPC)-induced acrosome

  6. Identification of pneumococcal surface protein A as a lactoferrin-binding protein of Streptococcus pneumoniae.

    Science.gov (United States)

    Hammerschmidt, S; Bethe, G; Remane, P H; Chhatwal, G S

    1999-04-01

    Lactoferrin (Lf), an iron-sequestering glycoprotein, predominates in mucosal secretions, where the level of free extracellular iron (10⁻¹⁸ M) is not sufficient for bacterial growth. This represents a mechanism of resistance to bacterial infections by prevention of colonization of the host by pathogens. In this study we were able to show that Streptococcus pneumoniae specifically recognizes and binds the iron carrier protein human Lf (hLf). Pretreatment of pneumococci with proteases reduced hLf binding significantly, indicating that the hLf receptor is proteinaceous. Binding assays performed with 63 clinical isolates belonging to different serotypes showed that 88% of the tested isolates interacted with hLf. Scatchard analysis showed the existence of two hLf-binding proteins with dissociation constants of 5.7 × 10⁻⁸ and 2.74 × 10⁻⁷ M. The receptors were purified by affinity chromatography, and internal sequence analysis revealed that one of the S. pneumoniae proteins was homologous to pneumococcal surface protein A (PspA). The function of PspA as an hLf-binding protein was confirmed by the ability of purified PspA to bind hLf and to competitively inhibit hLf binding to pneumococci. S. pneumoniae may use the hLf-PspA interaction to overcome the iron limitation at mucosal surfaces, and this might represent a potential virulence mechanism.
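
    To give a feel for what the two reported dissociation constants imply, the sketch below evaluates the standard single-site occupancy relation, fraction bound = [L] / (Kd + [L]), for each constant. It assumes simple, independent, non-cooperative binding and is not a reconstruction of the authors' Scatchard analysis; the ligand concentration in the usage example is an arbitrary illustrative value.

```python
# Dissociation constants (M) reported in the abstract.
KD_HIGH_AFFINITY = 5.7e-8
KD_LOW_AFFINITY = 2.74e-7


def fractional_occupancy(ligand_molar, kd_molar):
    """Fraction of binding sites occupied at free ligand concentration [L].

    Uses the single-site relation [L] / (Kd + [L]), which assumes
    independent, non-cooperative sites (an assumption of this sketch).
    """
    if ligand_molar < 0 or kd_molar <= 0:
        raise ValueError("concentrations must be non-negative and Kd positive")
    return ligand_molar / (kd_molar + ligand_molar)


if __name__ == "__main__":
    free_hlf = 1e-7  # hypothetical free hLf concentration in M
    for label, kd in (("high", KD_HIGH_AFFINITY), ("low", KD_LOW_AFFINITY)):
        print(f"{label}-affinity site: {fractional_occupancy(free_hlf, kd):.2f}")
```

    By definition, each site is half occupied when the free ligand concentration equals its Kd.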

  7. Identification of actin binding protein, ABP-280, as a binding partner of human Lnk adaptor protein.

    Science.gov (United States)

    He, X; Li, Y; Schembri-King, J; Jakes, S; Hayashi, J

    2000-08-01

    Human Lnk (hLnk) is an adaptor protein with multiple functional domains that regulates T cell activation signaling. In order to identify cellular Lnk binding partners, a yeast two-hybrid screening of a human spleen cDNA library was carried out using hLnk as bait. A polypeptide sequence identical to the C-terminal segment of the actin binding protein (ABP-280) was identified as an hLnk binding protein. The expressed hLnk and the FLAG-tagged C-terminal 673 amino acid residues of ABP-280 or the endogenous ABP-280 in COS-7 cells could be co-immunoprecipitated using antibodies to hLnk, FLAG or ABP-280, respectively. Furthermore, immunofluorescence confocal microscopy showed that hLnk and ABP-280 co-localized at the plasma membrane and at the juxtanuclear region of COS-7 cells. In Jurkat cells, the endogenous hLnk also associates with the endogenous ABP-280, indicating that the association of these two proteins is physiological. The interacting domains of both proteins were mapped using yeast two-hybrid assays. Our results indicate that hLnk binds to residues 2006-2454 (repeats 19-23C) of ABP-280. The domain in hLnk that associates with ABP-280 was mapped to an interdomain region of 56 amino acids between the pleckstrin homology and Src homology 2 domains. These results suggest that hLnk may exert its regulatory role through its association with ABP-280.

  8. Identification and characterization of a microneme protein (NcMIC6) in Neospora caninum.

    Science.gov (United States)

    Li, Wensheng; Liu, Jing; Wang, Jing; Fu, Yong; Nan, Huizhu; Liu, Qun

    2015-08-01

    Neospora caninum, an Apicomplexa parasite, is the causative agent of neosporosis. As described for other members of Apicomplexa, microneme proteins (MICs) play a key role in attachment and invasion of host cells by N. caninum. Herein we identified N. caninum microneme protein 6 (NcMIC6) that is orthologous to Toxoplasma gondii microneme protein 6 (TgMIC6). The open reading frame of the NcMIC6 gene is 984 bp and encodes a 327 amino acid peptide. Sequence analysis showed that NcMIC6 included a signal peptide, a transmembrane region, three epidermal growth factor-like (EGF) domains, and two low complexity regions. Antibodies raised against recombinant NcMIC6 recognized an approximately 35-kDa native MIC6 protein in Western blots of N. caninum tachyzoites. Immunofluorescence analysis showed that NcMIC6 had a polar labeling pattern, which was consistent with localization of micronemes in the apical region. Pulse invasion assays showed that NcMIC6 translocated from the apical tip to the posterior end of the parasites. Secretion assays demonstrated that NcMIC6 was released into the supernatants. Importantly, it was clearly revealed by co-immunoprecipitation that NcMIC6 formed a complex with other two soluble microneme proteins (NcMIC1 and NcMIC4). In conclusion, identification and characterization of the novel microneme protein NcMIC6 may contribute to understanding how this protein functions during the parasite motility and host cell invasion.
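
The relationship between the 984 bp open reading frame and the 327-residue product can be checked with a short sketch. It assumes that the reported ORF length includes the terminal stop codon, which the abstract does not state explicitly; the function name is illustrative.

```python
def residues_from_orf(orf_length_bp: int, includes_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by an ORF of the given length in base pairs."""
    if orf_length_bp % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    codons = orf_length_bp // 3
    # The stop codon occupies one codon but adds no residue.
    return codons - 1 if includes_stop_codon else codons
```

Under that assumption, 984 bp corresponds to 328 codons, one of which is the stop codon, leaving 327 residues.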

  9. Identification of Newly Synthesized Proteins by Echinococcus granulosus Protoscoleces upon Induction of Strobilation.

    Directory of Open Access Journals (Sweden)

    João Antonio Debarba

    2015-09-01

    Full Text Available The proteins responsible for the key molecular events leading to the structural changes between the developmental stages of Echinococcus granulosus remain unknown. In this work, azidohomoalanine (AHA)-specific labeling was used to identify proteins expressed by E. granulosus protoscoleces (PSCs) upon the induction of strobilar development. The in vitro incorporation of AHA with different tags into newly synthesized proteins (NSPs) by PSCs was analyzed using SDS-PAGE and confocal microscopy. The LC-MS/MS analysis of AHA-labeled NSPs by PSCs undergoing strobilation allowed for the identification of 365 proteins, of which 75 were differentially expressed in comparison between the presence or absence of strobilation stimuli and 51 were expressed exclusively in either condition. These proteins were mainly involved in metabolic, regulatory and signaling processes. After the controlled labeling of proteins during the induction of strobilar development, we identified modifications in protein expression. The changes in the metabolism and the activation of control and signaling pathways may be important for the correct parasite development and be targets for further studies.

  10. Identification of an immunogenic protein of Giardia lamblia using monoclonal antibodies generated from infected mice

    Directory of Open Access Journals (Sweden)

    Jael Quintero

    2013-08-01

    Full Text Available The humoral immune response plays an important role in the clearance of Giardia lamblia. However, our knowledge about the specific antigens of G. lamblia that induce a protective immune response is limited. The purpose of this study was to identify and characterise the immunogenic proteins of G. lamblia in a mouse model. We generated monoclonal antibodies (moAbs) specific to G. lamblia (1B10, 2C9.D11, 3C10.E5, 3D10, 5G8.B5, 5F4, 4C7, 3C5 and 3C6) by fusing splenocytes derived from infected mice. Most of these moAbs recognised a band of ± 71 kDa (5G8 protein) and this protein was also recognised by serum from the infected mice. We found that the moAbs recognised conformational epitopes of the 5G8 protein and that this antigen is expressed on the cell surface and inside trophozoites. Additionally, antibodies specific to the 5G8 protein induced strong agglutination (> 70-90%) of trophozoites. We have thus identified a highly immunogenic antigen of G. lamblia that is recognised by the immune system of infected mice. In summary, this study describes the identification and partial characterisation of an immunogenic protein of G. lamblia. Additionally, we generated a panel of moAbs specific for this protein that will be useful for the biochemical and immunological characterisation of this immunologically interesting Giardia molecule.

  11. YahO protein as a calibrant for top-down proteomic identification of Shiga toxin using MALDI-TOF-TOF-MS/MS and post-source decay

    Science.gov (United States)

    Matrix-assisted laser desorption/ionization tandem time-of-flight (MALDI-TOF-TOF) mass spectrometry is increasingly utilized for rapid top-down proteomic identification of proteins. This identification may involve analysis of either a pure protein or a protein mixture. For analysis of a pure protein...

  12. Identification of Protein-Protein Interactions Involved in Pectin Biosynthesis in the golgi Apparatus

    DEFF Research Database (Denmark)

    Lund, Christian Have

    GALACTURONOSYLTRANSFERASE1 (GAUT1) and GAUT7 has been identified and is essential for pectin biosynthesis. Interestingly, GAUT1 has been shown to be proteolytically processed from its transmembrane anchor domain and its catalytic domain is retained by GAUT7, thus ensuring biosynthesis of HG in the Golgi apparatus. Many...... methods exist for identifying protein-protein interactions (PPI) but many of these were developed for organisms other than plants and are most applicable to PPI detection in organelles other than the Golgi apparatus, where pectin biosynthesis occurs. In this work, different PPI detection methods are examined...... for their ability to detect PPI inside the Golgi lumen. The first method tested was the commercially available split-ubiquitin system from Dualsystems Biotech AG. This was applied to test binary interactions between proteins involved in HG and Rhamnogalacturonan I (RG-I) biosynthesis (see Manuscript II...

  13. Automating proteome analysis: improvements in throughput, quality and accuracy of protein identification by peptide mass fingerprinting.

    Science.gov (United States)

    Canelle, Ludovic; Pionneau, Cédric; Marie, Arul; Bousquet, Jordane; Bigeard, Jean; Lutomski, Didier; Kadri, Tewfik; Caron, Michel; Joubert-Caron, Raymonde

    2004-01-01

    The use of robots has major effects on maximizing the proteomic workflow required in an increasing number of high-throughput projects and on increasing the quality of the data. In peptide mass fingerprinting (PMF), automation of steps downstream of two-dimensional gel electrophoresis is essential. To achieve this goal, the workflow must be fluid. We have developed tools using macros written in Microsoft Excel and Word to complete the automation of our platform. Additionally, because sample preparation is crucial for identification of proteins by matrix-assisted laser desorption/ionization (MALDI) mass spectrometry, we optimized a sandwich method usable by any robot for spotting digests on a MALDI target. This procedure enables further efficient automated washing steps directly on the MALDI target. The success rate of PMF identification was evaluated for the automated sandwich method, and for the dried-droplet method implemented on the robot as recommended by the manufacturer. Of the two methods, the sandwich method achieved the higher identification success rate and sequence coverage of proteins. 2004 John Wiley & Sons, Ltd.

  14. Bacillus anthracis secretome time course under host-simulated conditions and identification of immunogenic proteins

    Directory of Open Access Journals (Sweden)

    Whittington Jessica

    2007-07-01

    accumulation may be relevant in elucidation of the progression of pathogenicity, identification of therapeutics and diagnostic markers, and vaccine development. This study also adds to the continuously growing list of identified Bacillus anthracis secretome proteins.

  15. Identification of genes involved in radioresistance of nasopharyngeal carcinoma by integrating gene ontology and protein-protein interaction networks.

    Science.gov (United States)

    Guo, Ya; Zhu, Xiao-Dong; Qu, Song; Li, Ling; Su, Fang; Li, Ye; Huang, Shi-Ting; Li, Dan-Rong

    2012-01-01

    Radioresistance remains one of the important factors in relapse and metastasis of nasopharyngeal carcinoma. Thus, it is imperative to identify genes involved in radioresistance and explore the underlying biological processes in the development of radioresistance. In this study, we used cDNA microarrays to select differential genes between radioresistant CNE-2R and parental CNE-2 cell lines. One hundred and eighty-three significantly differentially expressed genes (pgenes were upregulated and 45 genes were downregulated in CNE-2R. We further employed publicly available bioinformatics related software, such as GOEAST and STRING to examine the relationship among differentially expressed genes. The results show that these genes were involved in type I interferon-mediated signaling pathway biological processes; the nodes tended to have high connectivity with the EGFR pathway, IFN-related pathways, NF-κB. The node STAT1 has high connectivity with other nodes in the protein-protein interaction (PPI) networks. Finally, the reliability of microarray data was validated for selected genes by semi-quantitative RT-PCR and Western blotting. The results were consistent with the microarray data. Our study suggests that microarrays combined with gene ontology and protein interaction networks have great value in the identification of genes of radioresistance in nasopharyngeal carcinoma; genes involved in several biological processes and protein interaction networks may be relevant to NPC radioresistance; in particular, the verified genes CCL5, STAT1-α, STAT2 and GSTP1 may become potential biomarkers for predicting NPC response to radiotherapy.

  16. Identification of a chitinase-modifying protein from Fusarium verticillioides: truncation of a host resistance protein by a fungalysin metalloprotease.

    Science.gov (United States)

    Naumann, Todd A; Wicklow, Donald T; Price, Neil P J

    2011-10-14

    Chitinase-modifying proteins (cmps) are proteases secreted by fungal pathogens that truncate the plant class IV chitinases ChitA and ChitB during maize ear rot. cmp activity has been characterized for Bipolaris zeicola and Stenocarpella maydis, but the identities of the proteases are not known. Here, we report that cmps are secreted by multiple species from the genus Fusarium, that cmp from Fusarium verticillioides (Fv-cmp) is a fungalysin metalloprotease, and that it cleaves within a sequence that is conserved in class IV chitinases. Protein extracts from Fusarium cultures were found to truncate ChitA and ChitB in vitro. Based on this activity, Fv-cmp was purified from F. verticillioides. N-terminal sequencing of truncated ChitA and MALDI-TOF-MS analysis of reaction products showed that Fv-cmp is an endoprotease that cleaves a peptide bond on the C-terminal side of the lectin domain. The N-terminal sequence of purified Fv-cmp was determined and compared with a set of predicted proteins, resulting in its identification as a zinc metalloprotease of the fungalysin family. Recombinant Fv-cmp also truncated ChitA, confirming its identity, but had reduced activity, suggesting that the recombinant protease did not mature efficiently from its propeptide-containing precursor. This is the first report of a fungalysin that targets a nonstructural host protein and the first to implicate this class of virulence-related proteases in plant disease.

  17. High-accuracy identification and bioinformatic analysis of in vivo protein phosphorylation sites in yeast

    DEFF Research Database (Denmark)

    Gnad, Florian; de Godoy, Lyris M F; Cox, Jürgen

    2009-01-01

    mapped to 1118 proteins, representatively covering the yeast kinome and a multitude of transcription factors. We show that a single false discovery rate for all peptide identifications significantly overestimates occurrence of rare modifications, such as tyrosine phosphorylation in yeast. The identified...... phosphorylation sites are predominantly located on irregularly structured and accessible protein regions. We found high evolutionary conservation of phosphorylated proteins and a large overlap of significantly over-represented motifs with the human phosphoproteome. Nevertheless, phosphorylation events at the site...... level were not highly conserved between yeast and higher eukaryotes, which points to metazoan-specific kinase and substrate families. We constructed a yeast-specific phosphorylation sites predictor on the basis of a support vector machine, which - together with the yeast phosphorylation data...

  18. Hidden Markov Models Incorporating Fuzzy Measures and Integrals for Protein Sequence Identification and Alignment

    Institute of Scientific and Technical Information of China (English)

    Niranjan P.Bidargaddi; Madlhu Chetty; Joarder Kamruzzaman

    2008-01-01

    Profile hidden Markov models (HMMs) based on classical HMMs have been widely applied for protein sequence identification. The formulation of the forward and backward variables in profile HMMs is made under the statistical independence assumption of probability theory. We propose a fuzzy profile HMM to overcome the limitations of that assumption and to achieve an improved alignment for protein sequences belonging to a given family. The proposed model fuzzifies the forward and backward variables by incorporating Sugeno fuzzy measures and Choquet integrals, thus further extending the generalized HMM. Based on the fuzzified forward and backward variables, we propose a fuzzy Baum-Welch parameter estimation algorithm for profiles. The strong correlations and the sequence preference involved in protein structures make this fuzzy-architecture-based model a suitable candidate for building profiles of a given family, since the fuzzy set can handle uncertainties better than classical methods.
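
For orientation, the classical forward variable that the abstract takes as its starting point can be sketched as follows. This is the standard probabilistic recursion for a plain HMM, not the fuzzy (Sugeno measure and Choquet integral) variant the authors propose, and the two-state toy parameters are invented for illustration only.

```python
def forward_likelihood(observations, initial, transition, emission):
    """Classical forward algorithm: P(observations) under a discrete HMM.

    initial[s]        probability of starting in state s
    transition[r][s]  probability of moving from state r to state s
    emission[s][o]    probability of emitting symbol o in state s
    """
    if not observations:
        raise ValueError("need at least one observation")
    n_states = len(initial)
    # Initialisation: alpha_1(s) = pi_s * b_s(o_1)
    alpha = [initial[s] * emission[s][observations[0]] for s in range(n_states)]
    # Recursion: alpha_t(s) = (sum_r alpha_{t-1}(r) * a_rs) * b_s(o_t)
    for obs in observations[1:]:
        alpha = [
            sum(alpha[r] * transition[r][s] for r in range(n_states)) * emission[s][obs]
            for s in range(n_states)
        ]
    # Termination: sum over the final forward variables.
    return sum(alpha)


# Hypothetical two-state, two-symbol toy model.
INITIAL = [0.6, 0.4]
TRANSITION = [[0.7, 0.3], [0.4, 0.6]]
EMISSION = [[0.9, 0.1], [0.2, 0.8]]
```

The sum-and-product structure of the recursion is where the statistical independence assumption enters; according to the abstract, the fuzzy model replaces this step with fuzzy measures and integrals.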

  19. Novel small protein identification and quantitative proteomic analysis in Pseudomonas putida KT-2440

    DEFF Research Database (Denmark)

    Yang, Xiaochen

    This thesis investigated an industrial bacterium, Pseudomonas putida KT-2440, in two aspects. First, the research focused on discovering novel small proteins (s-proteins) in the bacterium. With large-scale approaches for gene identification, groups of novel s-proteins were identified and validated from...... the genome, transcriptome and proteome of the bacterium. The application of a new research approach, ribosome profiling, enabled us to analyze novel open reading frames (ORFs) from a different standpoint. Second, by a quantitative proteomic approach, the differential expression of genes was analyzed at the proteome...... level under different environmental conditions. The results yield insights into the adaptation of P. putida KT-2440 to different environments. Based on bioinformatic, proteomic and transcriptomic approaches, global gene expression was analyzed on both transcriptional and translational levels. Our research...

  20. Identification of Asp isomerization in proteins by ¹⁸O labeling and tandem mass spectrometry.

    Science.gov (United States)

    Zhang, Jennifer; Katta, Viswanatham

    2012-01-01

    Isomerization of aspartic acid (Asp) to isoaspartic acid (isoAsp) via succinimide intermediate is a common route of degradation for proteins that can affect their structural integrity. As Asp/isoAsp is isobaric in mass, it is difficult to identify the site of modification by LC-MS/MS peptide mapping. Here, we describe an approach to label the Asp residue involved in isomerization at the protein level by hydrolyzing the succinimide intermediate in H₂¹⁸O. Tryptic digestion of this labeled protein will result in peptides containing the site of isomerization being 2 Da heavier than the ¹⁶O-containing counterparts, due to ¹⁸O incorporation during the hydrolysis process. Comparison of tandem mass spectra of isomerized peptides with and without ¹⁸O incorporation allows easy identification of the Asp residue involved. This method proved to be especially useful in identifying the sites when isomerization occurs in Asp-Asp motifs.
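
The "2 Da heavier" figure follows from the mass difference between the two oxygen isotopes. A minimal sketch of the expected shift at different charge states is given below; the isotope masses are standard monoisotopic values, while the function name and the single-label default are illustrative assumptions.

```python
# Monoisotopic masses in Da.
MASS_16O = 15.9949146
MASS_18O = 17.9991596


def mz_shift(n_labels: int = 1, charge: int = 1) -> float:
    """Expected m/z shift of a peptide ion carrying n_labels 18O atoms in place of 16O."""
    if charge < 1:
        raise ValueError("charge must be a positive integer")
    return n_labels * (MASS_18O - MASS_16O) / charge
```

A singly labeled peptide therefore appears about 2.004 m/z units higher as a 1+ ion and about 1.002 m/z units higher as a 2+ ion.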

  1. Proteomic identification of novel differentiation plasma protein markers in hypobaric hypoxia-induced rat model.

    Directory of Open Access Journals (Sweden)

    Yasmin Ahmad

    Full Text Available BACKGROUND: Hypobaric hypoxia causes complex changes in the expression of genes, including stress related genes and corresponding proteins that are necessary to maintain homeostasis. Whereas most prior studies focused on single proteins, newer methods allowing the simultaneous study of many proteins could lead to a better understanding of complex and dynamic changes that occur during hypobaric hypoxia. METHODS: In this study we investigated the temporal plasma protein alterations induced in rats by hypobaric hypoxia at a simulated altitude of 7620 m (25,000 ft, 282 mm Hg) in a hypobaric chamber. Total plasma proteins were collected at different time points (0, 6, 12 and 24 h), separated by two-dimensional electrophoresis (2-DE) and identified using matrix assisted laser desorption ionization time of flight (MALDI-TOF/TOF). Biological processes that were enriched in the plasma proteins during hypobaric hypoxia were identified using Gene Ontology (GO) analysis. According to their properties and obvious alterations during hypobaric hypoxia, changes of plasma concentrations of Ttr, Prdx-2, Gpx-3, Apo A-I, Hp, Apo-E, Fetub and Nme were selected to be validated by Western blot analysis. RESULTS: Bioinformatics analysis of 25 differentially expressed proteins showed that 23 had corresponding candidates in the database. The expression patterns of the eight selected proteins observed by Western blot were in agreement with 2-DE results, thus confirming the reliability of the proteomic analysis. Most of the proteins identified are related to cellular defense mechanisms involving anti-inflammatory and antioxidant activity. Their presence reflects the consequence of serial cascades initiated by hypobaric hypoxia. CONCLUSION/SIGNIFICANCE: This study provides information about the plasma proteome changes induced in response to hypobaric hypoxia and thus identification of the candidate proteins which can act as novel biomarkers.

  2. Enrichment of Functional Redox Reactive Proteins and Identification by Mass Spectrometry Results in Several Terminal Fe(III)-reducing Candidate Proteins in Shewanella oneidensis MR-1.

    Energy Technology Data Exchange (ETDEWEB)

    Elias, Dwayne A.; Yang, Feng; Mottaz, Heather M.; Beliaev, Alex S.; Lipton, Mary S.

    2007-02-01

    Identification of the proteins directly involved in microbial metal-reduction is important to understanding the biochemistry involved in heavy metal reduction/immobilization and the ultimate cleanup of DOE contaminated sites. Although previous strategies for the identification of these proteins have traditionally required laborious protein purification/characterization of metal-reducing capability, activity is often lost before the final purification step, thus creating a significant knowledge gap. In the current study, subcellular fractions of S. oneidensis MR-1 were enriched for Fe(III)-NTA reducing proteins in a single step using several orthogonal column matrices. The protein content of eluted fractions that demonstrated activity was determined by ultra high pressure liquid chromatography coupled with tandem mass spectrometry (LC-MS/MS). A comparison of the proteins identified from active fractions in all separations produced 30 proteins that may act as the terminal electron-accepting protein for Fe(III)-reduction. These include MtrA, MtrB, MtrC and OmcA as well as a number of other proteins not previously associated with Fe(III)-reduction. This is the first report of such an approach where the laborious procedures for protein purification are not required for identification of metal-reducing proteins. Such work provides the basis for a similar approach with other cultured organisms as well as analysis of sediment and groundwater samples from biostimulation efforts at contaminated sites.

  3. Identification of potential molecular associations between chikungunya virus non-structural protein 2 and human host proteins.

    Science.gov (United States)

    Rana, J; Gulati, S; Rajasekharan, S; Gupta, A; Chaudhary, V; Gupta, S

    2017-01-01

    Chikungunya virus (CHIKV) non-structural protein 2 (nsP2) is considered to be the master regulator of viral RNA replication and host responses generated during viral infection. This protein has two main functional domains: an N-terminal domain which exhibits NTPase, RNA triphosphatase and helicase activities and a C-terminal protease domain. Understanding how CHIKV nsP2 interacts with its host proteins is essential for elucidating all the required processes for viral replication and pathogenesis along with the identification of potential targets for antiviral therapy. In the current study, yeast two-hybrid (Y2H) screening of a human fetal brain cDNA library was performed using nsP2 protein as bait. The analysis identified seven host proteins (CCDC130, CPNE6, POLR2C, MAPK9, EIF4A2, EEF1A1 and EIF3I) as putative interactors of CHIKV nsP2 which were selected for further analysis based on their roles in host cellular machinery. The gene ontology analysis indicates that these proteins are mainly involved in apoptosis, transcription and translation mechanisms of the host cell. Domain mapping of nsP2 revealed that these associations are not random connections but instead have functional significance. Further studies were performed to identify the amino acid residues and their chemical interactions, which may open new possibilities for preventing these interactions and thus reducing the chances of chikungunya infection. This study expands the understanding of CHIKV-host interactions and is important for rational approaches to discovering new antiviral agents.

  4. Identification of maturation and protein synthesis related proteins from porcine oocytes during in vitro maturation

    Directory of Open Access Journals (Sweden)

    Seo Kang

    2011-06-01

    Full Text Available Background: In vitro maturation (IVM) of mammalian oocytes is divided into the GV (germinal vesicle), MI (metaphase I) and MII (metaphase II) stages, and only fully mature oocytes have acquired the ability to be fertilized and initiate zygotic development. These observations have been mostly based on morphological evaluations, but the molecular events governing these processes are not fully understood. The aim of the present study was to better understand the processes involved in the molecular regulation of IVM using 2-DE analysis followed by mass spectrometry to identify proteins that are differentially expressed during oocyte IVM. Results: A total of 16 up-regulated and 12 down-regulated proteins were identified. To investigate the IVM process, we specifically focused on the proteins that were up-regulated during the MII stage when compared with the GV stage, which included PRDX 2, GST, SPSY, myomegalin, PDE4D, PRKAB 1, and DTNA. These up-regulated proteins were functionally involved in redox regulation and the cAMP-dependent pathway, which are essential for the intracellular signaling involved in oocyte maturation. Interestingly, the up-regulation of PDE4D and its partner, myomegalin, during the MII stage was consistently confirmed by western blot analyses. Conclusion: These results could be used to better understand some aspects of the molecular mechanisms underlying porcine oocyte maturation. This study identified some regulatory proteins that may have important roles in the molecular events involved in porcine oocyte maturation, particularly with respect to the regulation of oocyte meiotic resumption, MII arrest and oocyte activation. In addition, this study may have beneficial applications not only to basic science with respect to the improvement of oocyte culture conditions but also to mammalian reproductive biotechnology with potential implications.

  5. Identification of a 251 gene expression signature that can accurately detect M. tuberculosis in patients with and without HIV co-infection.

    Directory of Open Access Journals (Sweden)

    Noor Dawany

    Full Text Available BACKGROUND: Co-infection with tuberculosis (TB) is the leading cause of death in HIV-infected individuals. However, diagnosis of TB, especially in the presence of an HIV co-infection, can be limiting due to the high inaccuracy associated with the use of conventional diagnostic methods. Here we report a gene signature that can identify a tuberculosis infection in patients co-infected with HIV as well as in the absence of HIV. METHODS: We analyzed global gene expression data from peripheral blood mononuclear cell (PBMC) samples of patients that were either mono-infected with HIV or co-infected with HIV/TB and used support vector machines to identify a gene signature that can distinguish between the two classes. We then validated our results using publicly available gene expression data from patients mono-infected with TB. RESULTS: Our analysis successfully identified a 251-gene signature that accurately distinguishes patients co-infected with HIV/TB from those infected with HIV only, with an overall accuracy of 81.4% (sensitivity = 76.2%, specificity = 86.4%). Furthermore, we show that our 251-gene signature can also accurately distinguish patients with active TB in the absence of an HIV infection from both patients with a latent TB infection and healthy controls (88.9-94.7% accuracy; 69.2-90% sensitivity and 90.3-100% specificity). We also demonstrate that the expression levels of the 251-gene signature diminish as a correlate of the length of TB treatment. CONCLUSIONS: A 251-gene signature is described to (a) detect TB in the presence or absence of an HIV co-infection, and (b) assess response to treatment following anti-TB therapy.
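
The relationship between the reported accuracy, sensitivity and specificity can be illustrated with a small sketch. The confusion-matrix counts below are hypothetical: they were chosen only because they reproduce the rounded percentages in the abstract, and they are not the study's actual sample counts.

```python
def classification_metrics(tp: int, fn: int, tn: int, fp: int) -> dict[str, float]:
    """Sensitivity, specificity and overall accuracy from confusion-matrix counts."""
    return {
        "sensitivity": tp / (tp + fn),  # true positive rate
        "specificity": tn / (tn + fp),  # true negative rate
        "accuracy": (tp + tn) / (tp + fn + tn + fp),
    }


# Hypothetical counts (not taken from the paper): 21 co-infected, 22 HIV-only.
EXAMPLE = classification_metrics(tp=16, fn=5, tn=19, fp=3)
```

With these illustrative counts the overall accuracy sits between sensitivity and specificity, as it must, since it is their average weighted by class size.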

  6. Accurate identification of UDP-glucuronosyltransferase 1A1 (UGT1A1) inhibitors using UGT1A1-overexpressing HeLa cells.

    Science.gov (United States)

    Sun, Hua; Zhou, Xiaotong; Wu, Baojian

    2015-01-01

    1. UDP-glucuronosyltransferase 1A1 (UGT1A1) plays an irreplaceable role in detoxification of bilirubin and many drugs (e.g., SN-38). Here we aimed to explore the potential of UGT1A1-overexpressing HeLa cells (or HeLa1A1 cells) as a tool to accurately identify UGT1A1 inhibitors. 2. Determination of glucuronidation rates (β-estradiol and SN-38 as the substrates) was performed using HeLa1A1 cells and uridine diphosphoglucuronic acid (UDPGA)-supplemented cDNA expressed UGT1A1 enzyme (or microsomes). The inhibitory effects (IC50 values) of 20 structurally diverse compounds on the UGT1A1 activity were determined using HeLa1A1 cells and microsomal incubations. 3. In HeLa1A1 cells, the IC50 values for inhibition of β-estradiol glucuronidation by the tested compounds ranged from 0.33 to 94.6 µM. In the microsomal incubations, the IC50 values ranged from 0.47 to 155 µM. It was found that the IC50 values of all test compounds derived from the cells were well consistent with those from the microsomes (deviated by less than two-fold). Further, the IC50 values from the cells were strongly correlated with those from microsomes (r = 0.944, p HeLa cells were an appropriate tool to accurately depict the inhibition profiles of chemicals against UGT1A1.

  7. Metabolite signal identification in accurate mass metabolomics data with MZedDB, an interactive m/z annotation tool utilising predicted ionisation behaviour 'rules'

    Directory of Open Access Journals (Sweden)

    Snowdon Stuart

    2009-07-01

    Full Text Available Background: Metabolomics experiments using Mass Spectrometry (MS) technology measure the mass to charge ratio (m/z) and intensity of ionised molecules in crude extracts of complex biological samples to generate high dimensional metabolite 'fingerprint' or metabolite 'profile' data. High resolution MS instruments perform routinely with a mass accuracy of Results: Metabolite 'structures' harvested from publicly accessible databases were converted into a common format to generate a comprehensive archive in MZedDB. 'Rules' were derived from chemical information that allowed MZedDB to generate a list of adducts and neutral loss fragments putatively able to form for each structure and calculate, on the fly, the exact molecular weight of every potential ionisation product to provide targets for annotation searches based on accurate mass. We demonstrate that data matrices representing populations of ionisation products generated from different biological matrices contain a large proportion (sometimes > 50%) of molecular isotopes, salt adducts and neutral loss fragments. Correlation analysis of ESI-MS data features confirmed the predicted relationships of m/z signals. An integrated isotope enumerator in MZedDB allowed verification of exact isotopic pattern distributions to corroborate experimental data. Conclusion: We conclude that although ultra-high accurate mass instruments provide major insight into the chemical diversity of biological extracts, the facile annotation of a large proportion of signals is not possible by simple, automated query of current databases using computed molecular formulae. Parameterising MZedDB to take into account predicted ionisation behaviour and the biological source of any sample improves greatly both the frequency and accuracy of potential annotation 'hits' in ESI-MS data.
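
The idea of computing the accurate mass of each potential ionisation product from a neutral structure can be sketched as below. This is a generic adduct calculation with a small, hypothetical rule table; it does not reproduce MZedDB's actual rules, interface or data format.

```python
# Hypothetical adduct rules: name -> (mass shift in Da, absolute charge).
# Shifts account for the electron mass (e.g. Na minus one electron).
ADDUCT_RULES = {
    "[M+H]+": (1.007276, 1),
    "[M+Na]+": (22.989218, 1),
    "[M+K]+": (38.963158, 1),
    "[M-H]-": (-1.007276, 1),
}


def adduct_mz(neutral_mass: float, adduct: str) -> float:
    """Expected m/z of an adduct ion formed from a neutral monoisotopic mass."""
    shift, charge = ADDUCT_RULES[adduct]
    return (neutral_mass + shift) / charge


def candidate_targets(neutral_mass: float) -> dict[str, float]:
    """All adduct m/z targets for one structure, for accurate-mass annotation searches."""
    return {name: adduct_mz(neutral_mass, name) for name in ADDUCT_RULES}


# Monoisotopic mass of glucose (C6H12O6), used here only as a worked example.
GLUCOSE_MASS = 180.063388
```

A measured signal can then be compared against every target in `candidate_targets(...)` within the instrument's mass tolerance, instead of against the neutral mass alone.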

  8. Accurate Chromosome Segregation at First Meiotic Division Requires AGO4, a Protein Involved in RNA-Dependent DNA Methylation in Arabidopsis thaliana.

    Science.gov (United States)

    Oliver, Cecilia; Santos, Juan Luis; Pradillo, Mónica

    2016-10-01

    The RNA-directed DNA methylation (RdDM) pathway is important for the transcriptional repression of transposable elements and for heterochromatin formation. Small RNAs are key players in this process by regulating both DNA and histone methylation. Taking into account that methylation underlies gene silencing and that there are genes with meiosis-specific expression profiles, we have wondered whether genes involved in RdDM could play a role during this specialized cell division. To address this issue, we have characterized meiosis progression in pollen mother cells from Arabidopsis thaliana mutant plants defective for several proteins related to RdDM. The most relevant results were obtained for ago4-1. In this mutant, meiocytes display a slight reduction in chiasma frequency, alterations in chromatin conformation around centromeric regions, lagging chromosomes at anaphase I, and defects in spindle organization. These abnormalities lead to the formation of polyads instead of tetrads at the end of meiosis, and might be responsible for the fertility defects observed in this mutant. Findings reported here highlight an involvement of AGO4 during meiosis by ensuring accurate chromosome segregation at anaphase I.

  9. Identification of Two Secondary Ligand Binding Sites in 14-3-3 Proteins Using Fragment Screening.

    Science.gov (United States)

    Sijbesma, Eline; Skora, Lukasz; Leysen, Seppe; Brunsveld, Luc; Koch, Uwe; Nussbaumer, Peter; Jahnke, Wolfgang; Ottmann, Christian

    2017-08-01

    Proteins typically interact with multiple binding partners, and often different parts of their surfaces are employed to establish these protein-protein interactions (PPIs). Members of the class of 14-3-3 adapter proteins bind to several hundred other proteins in the cell. Multiple small molecules for the modulation of 14-3-3 PPIs have been disclosed; however, they all target the conserved phosphopeptide binding channel, so that selectivity is difficult to achieve. Here we report on the discovery of two individual secondary binding sites that have been identified by combining nuclear magnetic resonance-based fragment screening and X-ray crystallography. The two pockets that these fragments occupy are part of at least three physiologically relevant and structurally characterized 14-3-3 PPI interfaces, including those with serotonin N-acetyltransferase and plant transcription factor FT. In addition, the high degree of conservation of the two sites implies their relevance for 14-3-3 PPIs. This first identification of secondary sites on 14-3-3 proteins bound by small molecule ligands might facilitate the development of new chemical tool compounds for more selective PPI modulation.

  10. Proteomic Identification of Altered Cerebral Proteins in the Complex Regional Pain Syndrome Animal Model

    Directory of Open Access Journals (Sweden)

    Francis Sahngun Nahm

    2014-01-01

    Background. Complex regional pain syndrome (CRPS) is a rare but debilitating pain disorder. Although the exact pathophysiology of CRPS is not fully understood, central and peripheral mechanisms might be involved in the development of this disorder. To reveal the central mechanism of CRPS, we conducted a proteomic analysis of rat cerebrum using the chronic postischemia pain (CPIP) model, a novel experimental model of CRPS. Materials and Methods. After generating the CPIP animal model, we performed a proteomic analysis of the rat cerebrum using a multidimensional protein identification technology, and screened the proteins differentially expressed between the CPIP and control groups. Results. A total of 155 proteins were differentially expressed between the CPIP and control groups: 125 increased and 30 decreased; expression of proteins related to cell signaling, synaptic plasticity, regulation of cell proliferation, and cytoskeletal formation was increased in the CPIP group. However, proenkephalin A, cereblon, and neuroserpin were decreased in the CPIP group. Conclusion. Altered expression of cerebral proteins in the CPIP model indicates cerebral involvement in the pathogenesis of CRPS. Further study is required to elucidate the roles of these proteins in the development and maintenance of CRPS.

  11. Identification of ZASP, a novel protein associated to Zona occludens-2

    Energy Technology Data Exchange (ETDEWEB)

    Lechuga, Susana; Alarcon, Lourdes; Solano, Jesus [Department of Physiology, Biophysics and Neuroscience, Center for Research and Advanced Studies (Cinvestav), Mexico, D.F. 07360 (Mexico); Huerta, Miriam; Lopez-Bayghen, Esther [Department of Genetics and Molecular Biology, Center for Research and Advanced Studies (Cinvestav), Mexico, D.F. 07360 (Mexico); Gonzalez-Mariscal, Lorenza, E-mail: lorenza@fisio.cinvestav.mx [Department of Physiology, Biophysics and Neuroscience, Center for Research and Advanced Studies (Cinvestav), Mexico, D.F. 07360 (Mexico)

    2010-11-15

    With the aim of discovering new molecular interactions of the tight junction protein ZO-2, a two-hybrid screen was performed on a human kidney cDNA library using as bait the middle segment of ZO-2. Through this assay we identified a novel 24-kDa protein, herein named ZASP for ZO-2 associated speckle protein. The ZO-2/ZASP interaction, further confirmed by pull-down and immunoprecipitation experiments, requires the presence of the intact PDZ binding motif SQV of ZASP and the third PDZ domain of ZO-2. ZASP mRNA and protein are present in the kidney and in several epithelial cell lines. Endogenous ZASP is expressed primarily in nuclear speckles in co-localization with splicing factor SC-35. Nocodazole treatment and wash out reveals that ZASP disappears from the nucleus during mitosis in accordance with speckle disassembly during metaphase. The ZASP amino acid sequence exhibits a canonical nuclear exportation signal and, in agreement, the protein exits the nucleus through a process mediated by exportin/CRM1. ZASP over-expression blocks the inhibitory activity of ZO-2 on cyclin D1 gene transcription and protein expression. The identification of ZASP helps to unfold the complex nuclear molecular arrays that form on ZO-2 scaffolds.

  12. Identification of ZASP, a novel protein associated to Zona occludens-2.

    Science.gov (United States)

    Lechuga, Susana; Alarcón, Lourdes; Solano, Jesús; Huerta, Miriam; Lopez-Bayghen, Esther; González-Mariscal, Lorenza

    2010-11-15

    With the aim of discovering new molecular interactions of the tight junction protein ZO-2, a two-hybrid screen was performed on a human kidney cDNA library using as bait the middle segment of ZO-2. Through this assay we identified a novel 24-kDa protein, herein named ZASP for ZO-2 associated speckle protein. The ZO-2/ZASP interaction, further confirmed by pull-down and immunoprecipitation experiments, requires the presence of the intact PDZ binding motif SQV of ZASP and the third PDZ domain of ZO-2. ZASP mRNA and protein are present in the kidney and in several epithelial cell lines. Endogenous ZASP is expressed primarily in nuclear speckles in co-localization with splicing factor SC-35. Nocodazole treatment and wash out reveals that ZASP disappears from the nucleus during mitosis in accordance with speckle disassembly during metaphase. The ZASP amino acid sequence exhibits a canonical nuclear exportation signal and, in agreement, the protein exits the nucleus through a process mediated by exportin/CRM1. ZASP over-expression blocks the inhibitory activity of ZO-2 on cyclin D1 gene transcription and protein expression. The identification of ZASP helps to unfold the complex nuclear molecular arrays that form on ZO-2 scaffolds.

  13. Closed-loop spontaneous baroreflex transfer function is inappropriate for system identification of neural arc but partly accurate for peripheral arc: predictability analysis.

    Science.gov (United States)

    Kamiya, Atsunori; Kawada, Toru; Shimizu, Shuji; Sugimachi, Masaru

    2011-04-01

    Although the dynamic characteristics of the baroreflex system have been described by baroreflex transfer functions obtained from open-loop analysis, the predictability of time-series output dynamics from input signals, which should confirm the accuracy of system identification, remains to be elucidated. Moreover, despite theoretical concerns over closed-loop system identification, the accuracy and the predictability of the closed-loop spontaneous baroreflex transfer function have not been evaluated compared with the open-loop transfer function. Using urethane and α-chloralose anaesthetized, vagotomized and aortic-denervated rabbits (n = 10), we identified open-loop baroreflex transfer functions by recording renal sympathetic nerve activity (SNA) while varying the vascularly isolated intracarotid sinus pressure (CSP) according to a binary random (white-noise) sequence (operating pressure ± 20 mmHg), and used a simplified equation to calculate the closed-loop spontaneous baroreflex transfer function while matching CSP with systemic arterial pressure (AP). Our results showed that the open-loop baroreflex transfer functions for the neural and peripheral arcs predicted the time-series SNA and AP outputs from measured CSP and SNA inputs, with r² of 0.8 ± 0.1 and 0.8 ± 0.1, respectively. In contrast, the closed-loop spontaneous baroreflex transfer function for the neural arc was markedly different from the open-loop transfer function (enhanced gain increase and a phase lead), and did not predict the time-series SNA dynamics (r², 0.1 ± 0.1). However, the closed-loop spontaneous baroreflex transfer function of the peripheral arc partially matched the open-loop transfer function in gain and phase functions, and had limited but reasonable predictability of the time-series AP dynamics (r², 0.7 ± 0.1). A numerical simulation suggested that noise arising predominantly in the neural arc under resting conditions might be a possible mechanism responsible for our findings. Furthermore
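    The predictability check described in this record, in which an identified transfer function is used to predict an output time series from an input and the agreement is scored as r², can be sketched roughly as follows. This is an illustration only and not the authors' analysis: the impulse response, the binary white-noise input, the noise level, the deliberately mismatched model, and the use of the squared Pearson correlation as r² are all assumptions made for the example.

```python
import random


def convolve(x, h):
    """Predict the output of a linear system: y[n] = sum_k h[k] * x[n - k]."""
    y = []
    for n in range(len(x)):
        acc = 0.0
        for k, hk in enumerate(h):
            if n - k >= 0:
                acc += hk * x[n - k]
        y.append(acc)
    return y


def r_squared(measured, predicted):
    """Squared Pearson correlation between measured and predicted series."""
    n = len(measured)
    mean_m = sum(measured) / n
    mean_p = sum(predicted) / n
    cov = sum((a - mean_m) * (b - mean_p) for a, b in zip(measured, predicted))
    var_m = sum((a - mean_m) ** 2 for a in measured)
    var_p = sum((b - mean_p) ** 2 for b in predicted)
    return cov * cov / (var_m * var_p)


rng = random.Random(0)

# Hypothetical binary white-noise input (switching between -20 and +20 units).
x = [rng.choice((-20.0, 20.0)) for _ in range(2000)]

# Hypothetical impulse response of a low-pass-like system (invented values).
h_true = [0.5 * (0.7 ** k) for k in range(20)]

# "Measured" output: true response plus additive Gaussian noise.
measured = [v + rng.gauss(0.0, 5.0) for v in convolve(x, h_true)]

# A mismatched model: the same taps in reverse order (an arbitrary choice).
h_wrong = list(reversed(h_true))

r2_good = r_squared(measured, convolve(x, h_true))
r2_bad = r_squared(measured, convolve(x, h_wrong))
print(f"matched model r2 = {r2_good:.2f}, mismatched model r2 = {r2_bad:.2f}")
```

    In this toy setting the matched model scores high and the mismatched one scores near zero, which mirrors the kind of contrast the record reports between open-loop and closed-loop neural-arc transfer functions.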

  14. In silico identification of essential proteins in Corynebacterium pseudotuberculosis based on protein

    DEFF Research Database (Denmark)

    Folador, Edson Luiz; de Carvalho, Paulo Vinícius Sanches Daltro; Silva, Wanderson Marques

    2016-01-01

    BACKGROUND: Corynebacterium pseudotuberculosis (Cp) is a gram-positive bacterium that is classified into equi and ovis serovars. The serovar ovis is the etiological agent of caseous lymphadenitis, a chronic infection affecting sheep and goats, causing economic losses due to carcass condemnation...... of the potential Cp interactome and to identify potentially essential proteins serving as putative drug targets. On average, we predict 16,669 interactions for each of the nine strains (with 15,495 interactions shared among all strains). An in silico sanity check suggests that the potential networks were...... not formed by spurious interactions but have a strong biological bias. With the inferred Cp networks we identify 181 essential proteins, among which 41 are non-host homologous. CONCLUSIONS: The list of candidate interactions of the Cp strains lays the basis for developing novel hypotheses and designing...

  15. Polyproline II structure in proteins: identification by chiroptical spectroscopies, stability, and functions.

    Science.gov (United States)

    Bochicchio, Brigida; Tamburro, Antonio Mario

    2002-11-01

    In recent years polyproline II (PPII) structure has been demonstrated to be essential to biological activities such as signal transduction, transcription, cell motility, and immune response. The polyproline left-handed helical structure was nearly unknown until now and often confused with unordered, disordered, irregular, unstructured, extended, or random coil conformations because it is neither alpha-helical nor beta-turn nor beta-sheet, i.e., a classical structure. In spite of the regularity of the PPII structure and, more precisely, its well-defined dihedral angle values, a typical feature of PPII structure is the absence of any intramolecular hydrogen bonds, which renders the PPII structure indistinguishable from an irregular backbone structure by (1)H-NMR spectroscopy. The only way to unambiguously reveal PPII structure in solution is to use spectroscopies based on optical activity, such as circular dichroism (CD), vibrational circular dichroism (VCD), and Raman optical activity (ROA). Herein we focus on the identification of PPII structure by CD, widely considered to be the most reliable methodology. Then we report on VCD and ROA spectroscopies as tools in the identification of PPII structure. A third section is dedicated to the analysis of the stabilization of PPII conformation in aqueous solution. Finally, the significance of PPII in self-assembly processes, in elasticity of elastomeric proteins, and in proteins-(peptides) proteins molecular recognition processes is considered.

  16. Identification, characterization and antigenicity of the Plasmodium vivax rhoptry neck protein 1 (PvRON1)

    Directory of Open Access Journals (Sweden)

    Patarroyo Manuel E

    2011-10-01

    Background: Plasmodium vivax malaria remains a major health problem in tropical and sub-tropical regions worldwide. Several rhoptry proteins which are important for interaction with and/or invasion of red blood cells, such as PfRONs, Pf92, Pf38, Pf12 and Pf34, have been described during the last few years and are being considered as potential anti-malarial vaccine candidates. This study describes the identification and characterization of the P. vivax rhoptry neck protein 1 (PvRON1) and examines its antigenicity in natural P. vivax infections. Methods: The PvRON1 encoding gene, which is homologous to that encoding the P. falciparum apical sushi protein (ASP) according to the PlasmoDB database, was selected as our study target. The pvron1 gene transcription was evaluated by RT-PCR using RNA obtained from the P. vivax VCG-1 strain. Two peptides derived from the deduced P. vivax Sal-I PvRON1 sequence were synthesized and inoculated in rabbits to obtain anti-PvRON1 antibodies, which were used to confirm the protein expression in VCG-1 strain schizonts along with its association with detergent-resistant microdomains (DRMs) by Western blot, and its localization by immunofluorescence assays. The antigenicity of the PvRON1 protein was assessed by ELISA using human sera from individuals previously exposed to P. vivax malaria. Results: In the P. vivax VCG-1 strain, RON1 is a 764 amino acid-long protein. In silico analysis has revealed that PvRON1 shares essential characteristics with different antigens involved in invasion, such as the presence of a secretory signal, a GPI-anchor sequence and a putative sushi domain. The PvRON1 protein is expressed in the parasite's schizont stage, is localized in rhoptry necks, and is associated with DRMs. Recombinant protein recognition by human sera indicates that this antigen can trigger an immune response during a natural infection with P. vivax. Conclusions: This study shows the identification and characterization of

  17. Stealth proteins: in silico identification of a novel protein family rendering bacterial pathogens invisible to host immune defense.

    Directory of Open Access Journals (Sweden)

    Peter Sperisen

    2005-11-01

    There are a variety of bacterial defense strategies to survive in a hostile environment. Generation of extracellular polysaccharides has proved to be a simple but effective strategy against the host's innate immune system. A comparative genomics approach led us to identify a new protein family termed Stealth, most likely involved in the synthesis of extracellular polysaccharides. This protein family is characterized by a series of domains conserved across phylogeny from bacteria to eukaryotes. In bacteria, Stealth (previously characterized as SacB, XcbA, or WefC) is encoded by subsets of strains mainly colonizing multicellular organisms, with evidence for a protective effect against the host innate immune defense. More specifically, integrating all the available information about Stealth proteins in bacteria, we propose that Stealth is a D-hexose-1-phosphoryl transferase involved in the synthesis of polysaccharides. In the animal kingdom, Stealth is strongly conserved across evolution from social amoebas to simple and complex multicellular organisms, such as Dictyostelium discoideum, hydra, and human. Based on the occurrence of Stealth in most Eukaryotes and a subset of Prokaryotes together with its potential role in extracellular polysaccharide synthesis, we propose that metazoan Stealth functions to regulate the innate immune system. Moreover, there is good reason to speculate that the acquisition and spread of Stealth could be responsible for future epidemic outbreaks of infectious diseases caused by a large variety of eubacterial pathogens. Our in silico identification of a homologous protein in the human host will help to elucidate the causes of Stealth-dependent virulence. At a more basic level, the characterization of the molecular and cellular function of Stealth proteins may shed light on fundamental mechanisms of innate immune defense against microbial invasion.

  18. Stealth Proteins: In Silico Identification of a Novel Protein Family Rendering Bacterial Pathogens Invisible to Host Immune Defense.

    Directory of Open Access Journals (Sweden)

    2005-11-01

    There are a variety of bacterial defense strategies to survive in a hostile environment. Generation of extracellular polysaccharides has proved to be a simple but effective strategy against the host's innate immune system. A comparative genomics approach led us to identify a new protein family termed Stealth, most likely involved in the synthesis of extracellular polysaccharides. This protein family is characterized by a series of domains conserved across phylogeny from bacteria to eukaryotes. In bacteria, Stealth (previously characterized as SacB, XcbA, or WefC) is encoded by subsets of strains mainly colonizing multicellular organisms, with evidence for a protective effect against the host innate immune defense. More specifically, integrating all the available information about Stealth proteins in bacteria, we propose that Stealth is a D-hexose-1-phosphoryl transferase involved in the synthesis of polysaccharides. In the animal kingdom, Stealth is strongly conserved across evolution from social amoebas to simple and complex multicellular organisms, such as Dictyostelium discoideum, hydra, and human. Based on the occurrence of Stealth in most Eukaryotes and a subset of Prokaryotes together with its potential role in extracellular polysaccharide synthesis, we propose that metazoan Stealth functions to regulate the innate immune system. Moreover, there is good reason to speculate that the acquisition and spread of Stealth could be responsible for future epidemic outbreaks of infectious diseases caused by a large variety of eubacterial pathogens. Our in silico identification of a homologous protein in the human host will help to elucidate the causes of Stealth-dependent virulence. At a more basic level, the characterization of the molecular and cellular function of Stealth proteins may shed light on fundamental mechanisms of innate immune defense against microbial invasion.

  20. In silico re-identification of properties of drug target proteins.

    Science.gov (United States)

    Kim, Baeksoo; Jo, Jihoon; Han, Jonghyun; Park, Chungoo; Lee, Hyunju

    2017-05-31

    Computational approaches in the identification of drug targets are expected to reduce time and effort in drug development. Advances in genomics and proteomics provide the opportunity to uncover properties of druggable genomes. Although several studies have been conducted for distinguishing drug targets from non-drug targets, they mainly focus on the sequences and functional roles of proteins. Many other properties of proteins have not been fully investigated. Using the DrugBank (version 3.0) database containing nearly 6,816 drug entries, including 760 FDA-approved drugs and 1822 of their targets, and human UniProt/Swiss-Prot databases, we defined 1578 non-redundant drug target and 17,575 non-drug target proteins. To select these non-redundant protein datasets, we built four datasets (A, B, C, and D) by considering clustering of paralogous proteins. We first reassessed the widely used properties of drug target proteins. We confirmed and extended the findings that drug target proteins (1) are likely to be more hydrophobic and less polar, with fewer PEST sequences and more signal peptide sequences, and (2) are more involved in enzyme catalysis, oxidation and reduction in cellular respiration, and operational genes. In this study, we proposed new properties (essentiality, expression pattern, PTMs, and solvent accessibility) for effectively identifying drug target proteins. We found that (1) drug targetability and protein essentiality are decoupled, (2) druggable proteins have high expression levels and tissue specificity, and (3) functional post-translational modification residues are enriched in drug target proteins. In addition, to predict the drug targetability of proteins, we exploited two machine learning methods (Support Vector Machine and Random Forest). When we predicted drug targets by combining previously known protein properties and proposed new properties, an F-score of 0.8307 was obtained. When the newly proposed properties are integrated, the prediction performance
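    For readers unfamiliar with the metric quoted in this record, the sketch below shows how an F-score is computed from classification counts. The record does not say which variant was used, so the balanced F1 form (beta = 1) is an assumption, and the counts are hypothetical values chosen for easy arithmetic, not data from the study.

```python
def f_score(tp, fp, fn, beta=1.0):
    """F-beta score from true positives, false positives and false negatives."""
    precision = tp / (tp + fp)  # fraction of predicted targets that are real
    recall = tp / (tp + fn)     # fraction of real targets that were predicted
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical counts, not taken from the study.
score = f_score(tp=80, fp=20, fn=20)
print(f"F1 = {score:.2f}")
```

    With precision and recall both equal to 0.8, the harmonic mean is also 0.8; the score drops faster than an arithmetic mean would when either of the two falls.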

  1. Accurate X-ray position of the Anomalous X-ray Pulsar XTE J1810-197 and identification of its likely IR counterpart

    CERN Document Server

    Israel, G L; Mangano, V; Testa, V; Perna, R; Hummel, W; Mignani, R P; Ageorges, N; Curto, G L; Marco, O; Angelini, L; Campana, S; Covino, S; Marconi, G; Mereghetti, S; Stella, L

    2004-01-01

    We report the accurate sub-arcsec X-ray position of the new Anomalous X-ray Pulsar (AXP) XTE J1810-197, derived with a Chandra-HRC Target of Opportunity observation carried out in November 2003. We also report the discovery of a likely IR counterpart based on a VLT (IR band) Target of Opportunity observation carried out in October 2003. Our proposed counterpart is the only IR source (Ks=20.8) in the X-ray error circle. Its IR colors as well as the X-ray/IR flux ratio, are consistent with those of the counterparts of all other AXPs (at variance with field star colors). Deep Gunn-i band images obtained at the 3.6m ESO telescope detected no sources down to a limiting magnitude of 24.3. Moreover, we find that the pulsed fraction and count rates of XTE J1810-197 remained nearly unchanged since the previous Chandra and XMM-Newton observations (2003 August 27th and September 8th, respectively). We briefly discuss the implications of these results. In particular, we note that the transient (or at least highly variable...

  2. Accurate Identification of ALK Positive Lung Carcinoma Patients: Novel FDA-Cleared Automated Fluorescence In Situ Hybridization Scanning System and Ultrasensitive Immunohistochemistry

    Science.gov (United States)

    Conde, Esther; Suárez-Gauthier, Ana; Benito, Amparo; Garrido, Pilar; García-Campelo, Rosario; Biscuola, Michele; Paz-Ares, Luis; Hardisson, David; de Castro, Javier; Camacho, M. Carmen; Rodriguez-Abreu, Delvys; Abdulkader, Ihab; Ramirez, Josep; Reguart, Noemí; Salido, Marta; Pijuán, Lara; Arriola, Edurne; Sanz, Julián; Folgueras, Victoria; Villanueva, Noemí; Gómez-Román, Javier; Hidalgo, Manuel; López-Ríos, Fernando

    2014-01-01

    Background: Based on the excellent results of the clinical trials with ALK-inhibitors, the importance of accurately identifying ALK positive lung cancer has never been greater. However, there is an increasing number of recent publications addressing discordances between FISH and IHC. The controversy is further fuelled by the different regulatory approvals. This situation prompted us to investigate two ALK IHC antibodies (using a novel ultrasensitive detection-amplification kit) and an automated ALK FISH scanning system (FDA-cleared) in a series of non-small cell lung cancer tumor samples. Methods: Forty-seven ALK FISH-positive and 56 ALK FISH-negative NSCLC samples were studied. All specimens were screened for ALK expression by two IHC antibodies (clone 5A4 from Novocastra and clone D5F3 from Ventana) and for ALK rearrangement by FISH (Vysis ALK FISH break-apart kit), which was automatically captured and scored by using Bioview's automated scanning system. Results: All positive cases with the IHC antibodies were FISH-positive. There was only one IHC-negative case with both antibodies which showed a FISH-positive result. The overall sensitivity and specificity of the IHC in comparison with FISH were 98% and 100%, respectively. Conclusions: The specificity of these ultrasensitive IHC assays may obviate the need for FISH confirmation in positive IHC cases. However, the likelihood of false negative IHC results strengthens the case for FISH testing, at least in some situations. PMID:25248157
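    The sensitivity and specificity quoted in this record can be reproduced from the counts it gives. The sketch below treats FISH as the reference test and assumes, as the abstract states, that the single discordant case was IHC-negative with both antibodies, so the same counts apply to each antibody; the function and variable names are illustrative.

```python
def sensitivity_specificity(tp, fn, tn, fp):
    """Return (sensitivity, specificity) for a test scored against a reference."""
    return tp / (tp + fn), tn / (tn + fp)


fish_positive = 47  # ALK FISH-positive samples (reference positives)
fish_negative = 56  # ALK FISH-negative samples (reference negatives)
fn = 1              # IHC-negative but FISH-positive
fp = 0              # every IHC-positive case was also FISH-positive
tp = fish_positive - fn
tn = fish_negative - fp

sens, spec = sensitivity_specificity(tp, fn, tn, fp)
print(f"sensitivity = {sens:.1%}, specificity = {spec:.1%}")
```

    The sensitivity is 46/47, which rounds to the reported 98%, and the specificity is 56/56, or 100%.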

  3. Accurate identification of ALK positive lung carcinoma patients: novel FDA-cleared automated fluorescence in situ hybridization scanning system and ultrasensitive immunohistochemistry.

    Directory of Open Access Journals (Sweden)

    Esther Conde

    BACKGROUND: Based on the excellent results of the clinical trials with ALK-inhibitors, the importance of accurately identifying ALK positive lung cancer has never been greater. However, there is an increasing number of recent publications addressing discordances between FISH and IHC. The controversy is further fuelled by the different regulatory approvals. This situation prompted us to investigate two ALK IHC antibodies (using a novel ultrasensitive detection-amplification kit) and an automated ALK FISH scanning system (FDA-cleared) in a series of non-small cell lung cancer tumor samples. METHODS: Forty-seven ALK FISH-positive and 56 ALK FISH-negative NSCLC samples were studied. All specimens were screened for ALK expression by two IHC antibodies (clone 5A4 from Novocastra and clone D5F3 from Ventana) and for ALK rearrangement by FISH (Vysis ALK FISH break-apart kit), which was automatically captured and scored by using Bioview's automated scanning system. RESULTS: All positive cases with the IHC antibodies were FISH-positive. There was only one IHC-negative case with both antibodies which showed a FISH-positive result. The overall sensitivity and specificity of the IHC in comparison with FISH were 98% and 100%, respectively. CONCLUSIONS: The specificity of these ultrasensitive IHC assays may obviate the need for FISH confirmation in positive IHC cases. However, the likelihood of false negative IHC results strengthens the case for FISH testing, at least in some situations.

  4. Identification of a novel Plasmopara halstedii elicitor protein combining de novo peptide sequencing algorithms and RACE-PCR

    Directory of Open Access Journals (Sweden)

    Madlung Johannes

    2010-05-01

    Background: Often, high-quality MS/MS spectra of tryptic peptides do not match any database entry because genomes are only partially sequenced; protein identification therefore requires de novo peptide sequencing. To achieve protein identification for the economically important but still unsequenced plant pathogenic oomycete Plasmopara halstedii, we first evaluated the performance of three different de novo peptide sequencing algorithms applied to protein digests of standard proteins using a quadrupole TOF (QStar Pulsar i). Results: The performance order of the algorithms was PEAKS online > PepNovo > CompNovo. In summary, PEAKS online correctly predicted 45% of measured peptides for a protein test data set. All three de novo peptide sequencing algorithms were used to identify MS/MS spectra of tryptic peptides of an unknown 57 kDa protein of P. halstedii. We found ten de novo sequenced peptides that showed homology to a protein of Phytophthora infestans, an organism closely related to P. halstedii. Employing a second complementary approach, verification of peptide prediction and protein identification was performed by creation of degenerate primers for RACE-PCR and led to an ORF of 1,589 bp for a hypothetical phosphoenolpyruvate carboxykinase. Conclusions: Our study demonstrated that identification of proteins within minute amounts of sample material improved significantly by combining sensitive LC-MS methods with different de novo peptide sequencing algorithms. In addition, this is the first study that verified protein prediction from MS data by also employing a second complementary approach, in which RACE-PCR led to identification of a novel elicitor protein in P. halstedii.
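    A benchmark of the kind described in this record, where de novo predictions are compared with the known peptides of standard proteins, might be scored along the following lines. The record does not state the scoring rule, so counting only exact full-length matches is an assumption; treating leucine and isoleucine as equivalent reflects the fact that the two residues have identical mass. The peptide lists are invented for illustration.

```python
def normalize(peptide):
    """Leucine (L) and isoleucine (I) have identical mass, so treat them alike."""
    return peptide.upper().replace("I", "L")


def fraction_correct(predicted, reference):
    """Fraction of reference peptides reproduced exactly (up to I/L ambiguity)."""
    hits = sum(normalize(p) == normalize(r) for p, r in zip(predicted, reference))
    return hits / len(reference)


# Hypothetical peptides for illustration only.
reference = ["LVNELTEFAK", "YLYEIAR", "AEFVEVTK", "QTALVELLK"]
predicted = ["IVNELTEFAK", "YLYELAR", "AEFVTVEK", "QTALVELLK"]

accuracy = fraction_correct(predicted, reference)
print(f"correctly predicted: {accuracy:.0%}")
```

    Here three of the four predictions count as correct; the third differs by a swapped pair of residues, a typical de novo error that a stricter or looser scoring rule could treat differently.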

  5. Bottom–up protein identifications from microliter quantities of individual human tear samples. Important steps towards clinical relevance.

    Directory of Open Access Journals (Sweden)

    Peter Raus

    2015-12-01

    With 375 confidently identified proteins in the healthy adult tear, the obtained results are comprehensive and in large agreement with previously published observations on pooled samples of multiple patients. We conclude that, to a limited extent, bottom–up tear protein identifications from individual patients may have clinical relevance.

  6. Identification of new hematopoietic cell subsets with a polyclonal antibody library specific for neglected proteins.

    Directory of Open Access Journals (Sweden)

    Monica Moro

    The identification of new markers, the expression of which defines new phenotypically and functionally distinct cell subsets, is a main objective in cell biology. We have addressed the issue of identifying new cell specific markers with a reverse proteomic approach whereby approximately 1700 human open reading frames encoding proteins predicted to be transmembrane or secreted have been selected in silico for being poorly known, cloned and expressed in bacteria. These proteins have been purified and used to immunize mice with the aim of obtaining polyclonal antisera mostly specific for linear epitopes. Such a library, made of about 1600 different polyclonal antisera, has been obtained and screened by flow cytometry on cord blood derived CD34+CD45dim cells and on peripheral blood derived mature lymphocytes (PBLs). We identified three new proteins expressed by fractions of CD34+CD45dim cells and eight new proteins expressed by fractions of PBLs. Remarkably, we identified proteins the presence of which had not been demonstrated previously by transcriptomic analysis. From the functional point of view, looking at new proteins expressed on CD34+CD45dim cells, we identified one cell surface protein (MOSC-1), the expression of which on a minority of CD34+ progenitors marks those CD34+CD45dim cells that will go toward monocyte/granulocyte differentiation. In conclusion, we show a new way of looking at the membranome by assessing expression of generally neglected proteins with a library of polyclonal antisera, and in so doing we have identified new potential subsets of hematopoietic progenitors and of mature PBLs.

  7. Automated design of probes for rRNA-targeted fluorescence in situ hybridization reveals the advantages of using dual probes for accurate identification.

    Science.gov (United States)

    Wright, Erik S; Yilmaz, L Safak; Corcoran, Andrew M; Ökten, Hatice E; Noguera, Daniel R

    2014-08-01

    Fluorescence in situ hybridization (FISH) is a common technique for identifying cells in their natural environment and is often used to complement next-generation sequencing approaches as an integral part of the full-cycle rRNA approach. A major challenge in FISH is the design of oligonucleotide probes with high sensitivity and specificity to their target group. The rapidly expanding number of rRNA sequences has increased awareness of the number of potential nontargets for every FISH probe, making the design of new FISH probes challenging using traditional methods. In this study, we conducted a systematic analysis of published probes that revealed that many have insufficient coverage or specificity for their intended target group. Therefore, we developed an improved thermodynamic model of FISH that can be applied at any taxonomic level, used the model to systematically design probes for all recognized genera of bacteria and archaea, and identified potential cross-hybridizations for the selected probes. This analysis resulted in high-specificity probes for 35.6% of the genera when a single probe was used in the absence of competitor probes and for 60.9% when up to two competitor probes were used. Requiring the hybridization of two independent probes for positive identification further increased specificity. In this case, we could design highly specific probe sets for up to 68.5% of the genera without the use of competitor probes and 87.7% when up to two competitor probes were used. The probes designed in this study, as well as tools for designing new probes, are available online (http://DECIPHER.cee.wisc.edu).
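    The benefit of requiring two independent probes, as reported in this record, can be illustrated with a toy example. This sketch is not the thermodynamic model the authors developed: it assumes a probe "hits" a sequence only when its target site occurs as a perfect substring, and the sequence names, rRNA fragments, and probe sites are all invented.

```python
def hits(site, sequences):
    """Names of sequences containing the probe's target site (perfect match only)."""
    return {name for name, seq in sequences.items() if site in seq}


# Hypothetical toy rRNA fragments; names and sequences are invented.
sequences = {
    "target_1": "ACGGUCAUGGCUAAGCUUCCGA",
    "target_2": "ACGGUCAUGGCUUAGCUUCCGA",
    "nontarget_a": "ACGGUCAUGGAAAAGGGUUUGA",
    "nontarget_b": "UUUGGCAUAACUAAGCUUCCGA",
}
targets = {"target_1", "target_2"}

site_1 = "GGUCAUGG"  # target site of the first hypothetical probe
site_2 = "GCUUCCGA"  # target site of the second hypothetical probe

single_1 = hits(site_1, sequences)  # also picks up nontarget_a
single_2 = hits(site_2, sequences)  # also picks up nontarget_b
dual = single_1 & single_2          # positive only if both probes hybridize

print("probe 1 false positives:", sorted(single_1 - targets))
print("probe 2 false positives:", sorted(single_2 - targets))
print("dual-probe false positives:", sorted(dual - targets))
```

    Each probe alone cross-hybridizes with a different nontarget, but the intersection contains only the intended targets, because a nontarget must share both sites to be misidentified.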

  8. Accurate and Practical Identification of 20 Fusarium Species by Seven-Locus Sequence Analysis and Reverse Line Blot Hybridization, and an In Vitro Antifungal Susceptibility Study

    Science.gov (United States)

    Wang, He; Xiao, Meng; Kong, Fanrong; Chen, Sharon; Dou, Hong-Tao; Sorrell, Tania; Li, Ruo-Yu; Xu, Ying-Chun

    2011-01-01

    Eleven reference and 25 clinical isolates of Fusarium were subject to multilocus DNA sequence analysis to determine the species and haplotypes of the fusarial isolates from Beijing and Shandong, China. Seven loci were analyzed: the translation elongation factor 1 alpha gene (EF-1α); the nuclear rRNA internal transcribed spacer (ITS), large subunit (LSU), and intergenic spacer (IGS) regions; the second largest subunit of the RNA polymerase gene (RPB2); the calmodulin gene (CAM); and the mitochondrial small subunit (mtSSU) rRNA gene. We also evaluated an IGS-targeted PCR/reverse line blot (RLB) assay for species/haplotype identification of Fusarium. Twenty Fusarium species and seven species complexes were identified. Of 25 clinical isolates (10 species), the Gibberella (Fusarium) fujikuroi species complex was the commonest (40%) and was followed by the Fusarium solani species complex (FSSC) (36%) and the F. incarnatum-F. equiseti species complex (12%). Six FSSC isolates were identified to the species level as FSSC-3+4, and three as FSSC-5. Twenty-nine IGS, 27 EF-1α, 26 RPB2, 24 CAM, 18 ITS, 19 LSU, and 18 mtSSU haplotypes were identified; 29 were unique, and haplotypes for 24 clinical strains were novel. By parsimony informative character analysis, the IGS locus was the most phylogenetically informative, and the rRNA gene regions were the least. Results by RLB were concordant with multilocus sequence analysis for all isolates. Amphotericin B was the most active drug against all species. Voriconazole MICs were high (>8 μg/ml) for 15 (42%) isolates, including FSSC. Analysis of larger numbers of isolates is required to determine the clinical utility of the seven-locus sequence analysis and RLB assay in species classification of fusaria. PMID:21389150

  9. Identification of sites of ubiquitination in proteins: a fourier transform ion cyclotron resonance mass spectrometry approach.

    Science.gov (United States)

    Cooper, Helen J; Heath, John K; Jaffray, Ellis; Hay, Ronald T; Lam, Tukiet T; Marshall, Alan G

    2004-12-01

    Structural elucidation of posttranslationally modified peptides and proteins is of key importance in the understanding of an array of biological processes. Ubiquitination is a reversible modification that regulates many cellular functions. Consequences of ubiquitination depend on whether a single ubiquitin or polyubiquitin chain is added to the tagged protein. The lysine residue through which the polyubiquitin chain is formed is also critical for biological activity. Robust methods are therefore required to identify sites of ubiquitination modification, both in the target protein and in ubiquitin. Here, we demonstrate the suitability of Fourier transform ion cyclotron resonance (FT-ICR) mass spectrometry, in conjunction with activated ion electron capture dissociation (AI ECD) or infrared multiphoton dissociation (IRMPD), for the analysis of ubiquitinated proteins. Polyubiquitinated substrate protein GST-Ubc5 was generated in vitro. Tryptic digests of polyubiquitinated species contain modified peptides in which the ubiquitin C-terminal Gly-Gly residues are retained on the modified lysine residues. Direct infusion microelectrospray FT-ICR of the digest and comparison with an in silico digest enables identification of modified peptides and therefore sites of ubiquitination. Fifteen sites of ubiquitination were identified in GST-Ubc5 and four sites in ubiquitin. Assignments were confirmed by AI ECD or IRMPD. The Gly-Gly modification is stable and both tandem mass spectrometric techniques are suitable, providing extensive sequence coverage and retention of the modification on backbone fragments.

  10. Ambient ionization-accurate mass spectrometry (AMI-AMS) for the identification of nonvisible set-off in food-contact materials.

    Science.gov (United States)

    Bentayeb, Karim; Ackerman, Luke K; Begley, Timothy H

    2012-02-29

    Set-off is the unintentional transfer of substances used in printing from the external printed surface of food packaging to the inner, food-contact surface. Ambient ionization-accurate mass spectrometry (AMI-AMS) detected and identified compounds from print set-off not visible to the human eye. AMI mass spectra from inner and outer surfaces of printed and nonprinted food packaging were compared to detect and identify nonvisible set-off components. A protocol to identify unknowns was developed using a custom open-source database of printing inks and food-packaging compounds. The protocol matched print-related food-contact surface ions with the molecular formulas of common ions, isotopes, and fragments of compounds from the database. AMI-AMS was able to detect print set-off and identify seven different compounds. Set-off on the packaging samples was confirmed using gas chromatographic-mass spectrometric (GC-MS) analysis of single-sided solvent extracts. N-Ethyl-2(and 4)-methylbenzenesulfonamide, 2,4-diphenyl-4-methyl-1(and 2)-pentene, and 2,4,7,9-tetramethyl-5-decyne-4,7-diol were present on the food-contact layer at concentrations from 0.21 to 2.7 ± 1.6 μg dm⁻², corresponding to nearly milligram per kilogram concentrations in the packaged food. Other minor set-off compounds were detected only by AMI-AMS, a fast, simple, and thorough technique to detect and identify set-off in food packaging.

  11. Large-scale proteomic identification of S100 proteins in breast cancer tissues

    Directory of Open Access Journals (Sweden)

    Cancemi Patrizia

    2010-09-01

    Full Text Available Abstract Background Attempts to reduce morbidity and mortality in breast cancer are based on efforts to identify novel biomarkers to support prognosis and therapeutic choices. The present study has focussed on S100 proteins as a potentially promising group of markers in cancer development and progression. One reason for interest in this family of proteins is that the majority of the S100 genes are clustered on a region of human chromosome 1q21 that is prone to genomic rearrangements. Moreover, there is increasing evidence that S100 proteins are often up-regulated in many cancers, including breast, and this is frequently associated with tumour progression. Methods Samples of breast cancer tissues were obtained during surgical intervention, according to the bioethical recommendations, and cryo-preserved until used. Tissue extracts were submitted to proteomic preparations for 2D-IPG. Protein identification was performed by N-terminal sequencing and/or peptide mass fingerprinting. Results The majority of the detected S100 proteins were absent, or present at very low levels, in the non-tumoral tissues adjacent to the primary tumor. This finding strengthens the role of S100 proteins as putative biomarkers. The proteomic screening of 100 cryo-preserved breast cancer tissues showed that some proteins were ubiquitously expressed in almost all patients while others appeared more sporadic. Most, if not all, of the detected S100 members appeared reciprocally correlated. Finally, from the perspective of biomarker establishment, a promising finding was the observation that patients who developed distant metastases after a three-year follow-up showed a general tendency toward higher S100 protein expression, compared to the disease-free group. Conclusions This article reports for the first time the comparative proteomic screening of several S100 protein members among a large group of breast cancer patients.
The results obtained strongly support the hypothesis

  12. The Identification of Novel Protein-Protein Interactions in Liver that Affect Glucagon Receptor Activity.

    Directory of Open Access Journals (Sweden)

    Junfeng Han

    Full Text Available Glucagon regulates glucose homeostasis by controlling glycogenolysis and gluconeogenesis in the liver. Exaggerated and dysregulated glucagon secretion can exacerbate hyperglycemia, contributing to type 2 diabetes (T2D). Thus, it is important to understand how glucagon receptor (GCGR) activity and signaling are controlled in hepatocytes. To better understand this, we sought to identify proteins that interact with the GCGR to affect ligand-dependent receptor activation. A Flag-tagged human GCGR was recombinantly expressed in Chinese hamster ovary (CHO) cells, and GCGR complexes were isolated by affinity purification (AP). Complexes were then analyzed by mass spectrometry (MS), and protein-GCGR interactions were validated by co-immunoprecipitation (Co-IP) and Western blot. This was followed by studies in primary hepatocytes to assess the effects of each interactor on glucagon-dependent glucose production and intracellular cAMP accumulation, and then in immortalized CHO and liver cell lines to further examine cell signaling. Thirty-three unique interactors were identified from the AP-MS screening of GCGR expressing CHO cells in both glucagon liganded and unliganded states. These studies revealed a particularly robust interaction between GCGR and 5 proteins, further validated by Co-IP, Western blot and qPCR. Overexpression of selected interactors in mouse hepatocytes indicated that two interactors, LDLR and TMED2, significantly enhanced glucagon-stimulated glucose production, while YWHAB inhibited glucose production. This was mirrored with glucagon-stimulated cAMP production, with LDLR and TMED2 enhancing and YWHAB inhibiting cAMP accumulation. To further link these interactors to glucose production, key gluconeogenic genes were assessed. Both LDLR and TMED2 stimulated while YWHAB inhibited PEPCK and G6Pase gene expression. In the present study, we have probed the GCGR interactome and found three novel GCGR interactors that control glucagon

  13. A comparative analysis of computational approaches and algorithms for protein subcomplex identification.

    Science.gov (United States)

    Zaki, Nazar; Mora, Antonio

    2014-01-01

    High-throughput AP-MS methods have allowed the identification of many protein complexes. However, most post-processing methods for this type of data have focused on the detection of protein complexes and not their subcomplexes. Here, we review the results of some existing methods that may allow subcomplex detection and propose alternative methods to detect subcomplexes from AP-MS data. We assessed and compared overlapping clustering methods, methods based on the core-attachment model and our own prediction strategy (TRIBAL). The hypothesis behind TRIBAL is that subcomplex-building information may be concealed in the multiple edges generated by an interaction repeated in different contexts in raw data. The CACHET method offered the best results when the evaluation of the predicted subcomplexes was carried out using both the hypergeometric and geometric scores. TRIBAL offered the best performance when using a strict meet-min score.
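
    As an illustration of the meet-min score mentioned above, the following minimal sketch assumes the usual definition of the meet/min coefficient, the size of the intersection divided by the size of the smaller set. The abstract does not state the cutoff behind a "strict" score, so the `threshold` parameter, the function names and the protein labels are hypothetical.

```python
def meet_min(predicted, reference):
    """Meet/min overlap: |A & B| / min(|A|, |B|); 0.0 if either set is empty."""
    a, b = set(predicted), set(reference)
    if not a or not b:
        return 0.0
    return len(a & b) / min(len(a), len(b))


def matches(predicted, reference, threshold=1.0):
    """Treat a prediction as a hit when the overlap reaches the threshold.

    threshold=1.0 is an assumed reading of "strict": the smaller set must be
    fully contained in the larger one.
    """
    return meet_min(predicted, reference) >= threshold


# Hypothetical predicted subcomplex and reference complex.
predicted = {"P1", "P2", "P3"}
reference = {"P1", "P2", "P3", "P4", "P5"}
score = meet_min(predicted, reference)
is_hit = matches(predicted, reference)
```

    Because the denominator is the smaller set, a predicted subcomplex that lies entirely inside a larger reference complex scores 1.0, which is one reason this measure is convenient for subcomplex evaluation.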

  14. The Dictyostelium discoideum cellulose synthase: Structure/function analysis and identification of interacting proteins

    Energy Technology Data Exchange (ETDEWEB)

    Richard L. Blanton

    2004-02-19

    OAK-B135 The major accomplishments of this project were: (1) the initial characterization of dcsA, the gene for the putative catalytic subunit of cellulose synthase in the cellular slime mold Dictyostelium discoideum; (2) the detection of a developmentally regulated event (unidentified, but perhaps a protein modification or association with a protein partner) that is required for cellulose synthase activity (i.e., the dcsA product is necessary, but not sufficient for cellulose synthesis); (3) the continued exploration of the developmental context of cellulose synthesis and DcsA; (4) the isolation of a GFP-DcsA-expressing strain (work in progress); and (5) the identification of Dictyostelium homologues for plant genes whose products play roles in cellulose biosynthesis. Although our progress was slow and many of our results negative, we did develop a number of promising avenues of investigation that can serve as the foundation for future projects.

  15. pMD-Membrane: A Method for Ligand Binding Site Identification in Membrane-Bound Proteins.

    Directory of Open Access Journals (Sweden)

    Priyanka Prakash

    2015-10-01

    Full Text Available Probe-based or mixed solvent molecular dynamics simulation is a useful approach for the identification and characterization of druggable sites in drug targets. However, thus far the method has been applied only to soluble proteins. A major reason for this is the potential effect of the probe molecules on membrane structure. We have developed a technique to overcome this limitation that entails modification of force field parameters to reduce a few pairwise non-bonded interactions between selected atoms of the probe molecules and bilayer lipids. We used the resulting technique, termed pMD-membrane, to identify allosteric ligand binding sites on the G12D and G13D oncogenic mutants of the K-Ras protein bound to a negatively charged lipid bilayer. In addition, we show that differences in probe occupancy can be used to quantify changes in the accessibility of druggable sites due to conformational changes induced by membrane binding or mutation.

  16. Computational identification of protein methylation sites through bi-profile Bayes feature extraction.

    Directory of Open Access Journals (Sweden)

    Jianlin Shao

    Full Text Available Protein methylation is one type of reversible post-translational modification (PTM), which plays vital roles in many cellular processes such as transcriptional activity and DNA repair. Experimental identification of methylation sites on proteins without prior knowledge is costly and time-consuming. In silico prediction of methylation sites might not only provide researchers with information on the candidate sites for further determination, but also facilitate downstream characterizations and site-specific investigations. In the present study, a novel approach based on Bi-profile Bayes feature extraction combined with support vector machines (SVMs) was employed to develop the model for Prediction of Protein Methylation Sites (BPB-PPMS) from primary sequence. Methylation can occur at many residues including arginine, lysine, histidine, glutamine, and proline. At present, BPB-PPMS is only designed to predict the methylation status of lysine and arginine residues on polypeptides, owing to the lack of enough experimentally verified data to build and train prediction models for other residues. The performance of BPB-PPMS is measured with a sensitivity of 74.71%, a specificity of 94.32% and an accuracy of 87.98% for arginine as well as a sensitivity of 70.05%, a specificity of 77.08% and an accuracy of 75.51% for lysine in 5-fold cross validation experiments. Results obtained from cross-validation experiments and tests on independent data sets suggest that BPB-PPMS presented here might facilitate the identification and annotation of protein methylation. Besides, BPB-PPMS can be extended to build predictors for other types of PTM sites with ease. For public access, BPB-PPMS is available at http://www.bioinfo.bio.cuhk.edu.hk/bpbppms.
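
    The abstract does not spell out the encoding, so the following is only a rough sketch of how a bi-profile feature vector might be built. It assumes that each residue of a fixed-length peptide is encoded twice, once by its position-specific frequency among positive training peptides and once by its frequency among negative ones. The function names and toy peptides are hypothetical, and the SVM training step is omitted.

```python
from collections import Counter


def position_frequencies(peptides):
    """Per-position amino-acid frequencies for equal-length peptides."""
    length = len(peptides[0])
    tables = []
    for i in range(length):
        counts = Counter(p[i] for p in peptides)
        total = sum(counts.values())
        tables.append({aa: n / total for aa, n in counts.items()})
    return tables


def bi_profile_encode(peptide, pos_tables, neg_tables):
    """Concatenate the positive-profile and negative-profile frequencies.

    A peptide of length n becomes a vector of length 2n; residues never seen
    at a position in a training set get frequency 0.0.
    """
    pos = [pos_tables[i].get(aa, 0.0) for i, aa in enumerate(peptide)]
    neg = [neg_tables[i].get(aa, 0.0) for i, aa in enumerate(peptide)]
    return pos + neg


# Toy training peptides (hypothetical, centred on a lysine).
positives = ["AKR", "GKR", "AKS"]
negatives = ["LKD", "LKE", "AKD"]
pos_tables = position_frequencies(positives)
neg_tables = position_frequencies(negatives)
features = bi_profile_encode("AKR", pos_tables, neg_tables)
```

    In such a scheme the resulting vectors would then be passed to a classifier such as an SVM; real implementations typically use longer sequence windows around the candidate residue.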

  17. Identification of novel DNA repair proteins via primary sequence, secondary structure, and homology

    Directory of Open Access Journals (Sweden)

    Akutsu Tatsuya

    2009-01-01

    functionally unconfirmed proteins that are highly likely to be involved in the repair process. A new web service, INTREPED, has been made available for the immediate search and annotation of DNA repair proteins in newly sequenced genomes. Conclusion Despite complexity due to a multitude of repair pathways, combinations of sequence, structure, and homology with Support Vector Machines offer good methods in addition to existing homology searches for DNA repair protein identification and functional annotation. Most importantly, this study has uncovered relationships between the size of a genome and a genome's available repair repertoire, and offers a number of new predictions as well as a prediction service, both of which reduce the search time and cost for novel repair genes and proteins.

  18. Isolation and Identification of Alicyclobacillus with High Dipicolinic Acid and Heat Resistant Proteins from Mango Juice

    Directory of Open Access Journals (Sweden)

    Hamid Reza Akhbariyoon

    2016-10-01

    Full Text Available Background and Objectives: Microbial spoilage of juices by Alicyclobacillus and its effects on related industries are considerable international issues. This spore-forming bacterium causes changes in the odor and taste of juices. The isolation and identification of Alicyclobacillus contamination in juice producing and packaging industries has an essential role in the prevention and control of this type of spoilage bacterium in a HACCP (Hazard analysis and critical control points) manner. Materials and Methods: A thermo-acidophilic, non-pathogenic and spore-forming bacterium was isolated from mango juice. Preliminary identification of the isolates was based on morphological, biochemical and physiological properties. Identification at species level was made by PCR amplification. The influence of temperature in the range of 25-65°C on the growth of the bacterium and in the range of 80-120°C on spore resistance and heat resistant proteins was investigated and compared with other spore producing bacteria. Results and Conclusion: Phylogenetic analysis of the 16S rRNA gene sequencing indicated that the isolated strain constituted a distinct lineage in the Alicyclobacillus cluster; it was submitted to NCBI with accession number Alicyclobacillus HRM-5 KM983424.1. The spores resisted 110°C for 3 h, and produced 28% dipicolinic acid more comparable to Bacillus licheniformis. They could also produce 0.69 mg heat resistance protein after 1.5 h of treatment at 100°C. The results showed that this strain could have biotechnological applications. Conflict of interests: The authors declare no conflict of interest.

  19. Identification of oral cancer related candidate genes by integrating protein-protein interactions, gene ontology, pathway analysis and immunohistochemistry.

    Science.gov (United States)

    Kumar, Ravindra; Samal, Sabindra K; Routray, Samapika; Dash, Rupesh; Dixit, Anshuman

    2017-05-30

    In recent years, bioinformatics methods have been reported with a high degree of success for candidate gene identification. In this milieu, we have used an integrated bioinformatics approach assimilating information from gene ontologies (GO), protein-protein interaction (PPI) and network analysis to predict candidate genes related to oral squamous cell carcinoma (OSCC). A total of 40973 PPIs were considered for 4704 cancer-related genes to construct a human cancer gene network (HCGN). The importance of each node in the HCGN was measured by ten different centrality measures. We have shown that the top ranking genes are related to a significantly higher number of diseases as compared to other genes in the HCGN. A total of 39 candidate oral cancer target genes were predicted by combining top ranked genes and the genes corresponding to significantly enriched oral cancer related GO terms. Initial verification using literature and available experimental data indicated that 29 genes were related to OSCC. A detailed pathway analysis led us to propose a role for the selected candidate genes in invasion and metastasis in OSCC. We further validated our predictions using immunohistochemistry (IHC) and found that the gene FLNA was upregulated while the genes ARRB1 and HTT were downregulated in the OSCC tissue samples.
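
    The abstract does not list the ten centrality measures used. As a hedged illustration of the general idea of ranking nodes in an interaction network, the sketch below computes normalized degree centrality, one common measure that may or may not be among them, on a toy undirected network. The gene labels `G1` to `G4` and the helper names are hypothetical.

```python
from collections import defaultdict


def degree_centrality(edges):
    """Normalized degree centrality for an undirected interaction network.

    Each node's score is its number of distinct neighbours divided by (n - 1),
    where n is the number of nodes. Self-interactions are ignored.
    """
    neighbors = defaultdict(set)
    for a, b in edges:
        if a == b:
            continue
        neighbors[a].add(b)
        neighbors[b].add(a)
    n = len(neighbors)
    if n < 2:
        return {node: 0.0 for node in neighbors}
    return {node: len(adj) / (n - 1) for node, adj in neighbors.items()}


def top_ranked(scores, k):
    """Return the k highest-scoring nodes, breaking ties alphabetically."""
    return sorted(scores, key=lambda g: (-scores[g], g))[:k]


# Toy protein-protein interaction list (hypothetical).
edges = [("G1", "G2"), ("G1", "G3"), ("G1", "G4"), ("G2", "G3")]
scores = degree_centrality(edges)
hubs = top_ranked(scores, 2)
```

    A study that combines several centrality measures would typically compute each one in this fashion and then merge the per-measure rankings.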

  20. Proteomic Investigation of Falciparum and Vivax Malaria for Identification of Surrogate Protein Markers

    Science.gov (United States)

    Ray, Sandipan; Renu, Durairaj; Srivastava, Rajneesh; Gollapalli, Kishore; Taur, Santosh; Jhaveri, Tulip; Dhali, Snigdha; Chennareddy, Srinivasarao; Potla, Ankit; Dikshit, Jyoti Bajpai; Srikanth, Rapole; Gogtay, Nithya; Thatte, Urmila; Patankar, Swati; Srivastava, Sanjeeva

    2012-01-01

    This study was conducted to analyze alterations in the human serum proteome as a consequence of infection by malaria parasites Plasmodium falciparum and P. vivax to obtain mechanistic insights about disease pathogenesis, host immune response, and identification of potential protein markers. Serum samples from patients diagnosed with falciparum malaria (FM) (n = 20), vivax malaria (VM) (n = 17) and healthy controls (HC) (n = 20) were investigated using multiple proteomic techniques and results were validated by employing immunoassay-based approaches. Specificity of the identified malaria related serum markers was evaluated by means of analysis of leptospirosis as a febrile control (FC). Compared to HC, 30 and 31 differentially expressed and statistically significant (pphenotypic classes (FM, VM, FC and HC) were predicted with over 95% prediction accuracy. Individual performance of three classifier proteins; haptoglobin, apolipoprotein A-I and retinol-binding protein in diagnosis of malaria was analyzed using receiver operating characteristic (ROC) curves. The discrimination of FM, VM, FC and HC groups on the basis of differentially expressed serum proteins demonstrates the potential of this analytical approach for the detection of malaria as well as other human diseases. PMID:22912677

  1. Proteomic identification of early salicylate- and flg22-responsive redox-sensitive proteins in Arabidopsis

    KAUST Repository

    Liu, Pei

    2015-02-27

    Accumulation of reactive oxygen species (ROS) is one of the early defense responses against pathogen infection in plants. The mechanism by which ROS initially and directly regulate the defense signaling pathway remains elusive. Perturbation of cellular redox homeostasis by ROS is believed to alter functions of redox-sensitive proteins through their oxidative modifications. Here we report an OxiTRAQ-based proteomic study identifying proteins whose cysteines underwent oxidative modifications in Arabidopsis cells during the early response to salicylate or flg22, two defense pathway elicitors that are known to disturb cellular redox homeostasis. Among the salicylate- and/or flg22-responsive redox-sensitive proteins are those involved in transcriptional regulation, chromatin remodeling, RNA processing, post-translational modifications, and nucleocytoplasmic shuttling. The identification of the salicylate-/flg22-responsive redox-sensitive proteins provides a foundation for further study of the biological significance of their oxidative modifications during the plant defense response.

  2. Identification of Protein Pupylation Sites Using Bi-Profile Bayes Feature Extraction and Ensemble Learning

    Directory of Open Access Journals (Sweden)

    Xiaowei Zhao

    2013-01-01

    Full Text Available Pupylation, one of the most important posttranslational modifications of proteins, typically takes place when prokaryotic ubiquitin-like protein (Pup) is attached to specific lysine residues on a target protein. Identification of pupylation substrates and their corresponding sites will facilitate the understanding of the molecular mechanism of pupylation. Compared with labor-intensive and time-consuming experimental approaches, computational prediction of pupylation sites is highly desirable for its convenience and speed. In this study, a new bioinformatics tool named EnsemblePup was developed that used an ensemble of support vector machine classifiers to predict pupylation sites. The highlight of EnsemblePup was the use of Bi-profile Bayes feature extraction as the encoding scheme. The performance of EnsemblePup was measured with a sensitivity of 79.49%, a specificity of 82.35%, an accuracy of 85.43%, and a Matthews correlation coefficient of 0.617 using 5-fold cross validation on the training dataset. When compared with other existing methods on a benchmark dataset, EnsemblePup provided better predictive performance, with a sensitivity of 80.00%, a specificity of 83.33%, an accuracy of 82.00%, and a Matthews correlation coefficient of 0.629. The experimental results suggested that EnsemblePup presented here might be useful to identify and annotate potential pupylation sites in proteins of interest. A web server for predicting pupylation sites was developed.
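
    For readers unfamiliar with the reported measures, the following sketch shows how sensitivity, specificity, accuracy and the Matthews correlation coefficient are conventionally derived from a binary confusion matrix. The counts used here are made-up illustrative values, not data from the study, and the function name is hypothetical.

```python
import math


def classification_metrics(tp, fp, tn, fn):
    """Standard binary-classification metrics from confusion-matrix counts."""
    sensitivity = tp / (tp + fn) if (tp + fn) else 0.0
    specificity = tn / (tn + fp) if (tn + fp) else 0.0
    accuracy = (tp + tn) / (tp + fp + tn + fn)
    # MCC is defined as 0.0 here when any marginal total is zero.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "accuracy": accuracy,
        "mcc": mcc,
    }


# Illustrative counts only (not taken from the abstract).
metrics = classification_metrics(tp=40, fp=10, tn=40, fn=10)
```

    Accuracy is a weighted average of sensitivity and specificity, while the Matthews correlation coefficient also accounts for class imbalance, which is why prediction studies often report it alongside the other three.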

  3. Identification and evaluation of reference genes for accurate gene expression normalization of fresh and frozen-thawed spermatozoa of water buffalo (Bubalus bubalis).

    Science.gov (United States)

    Ashish, Shende; Bhure, S K; Harikrishna, Pillai; Ramteke, S S; Muhammed Kutty, V H; Shruthi, N; Ravi Kumar, G V P P S; Manish, Mahawar; Ghosh, S K; Mihir, Sarkar

    2017-04-01

    Quantitative real-time PCR (qRT-PCR) has become an important tool for gene-expression analysis of a selected number of genes in life science. Although the dynamic range, sensitivity and reproducibility of qRT-PCR are good, its reliability depends largely on the selection of proper reference genes (RGs) for normalization. Although RG expression has been reported to vary considerably within the same cell type under different experimental treatments, no systematic study has been conducted to identify and evaluate appropriate RGs in spermatozoa of domestic animals. Therefore, this study was conducted to identify suitable stable RGs in fresh and frozen-thawed spermatozoa. We assessed 13 candidate RGs (BACT, RPS18s, RPS15A, ATP5F1, HMBS, ATP2B4, RPL13, EEF2, TBP, EIF2B2, MDH1, B2M and GLUT5) of different functions and pathways using five algorithms. Regardless of the approach, the ranking of the most and the least stable candidate RGs remained almost the same. The comprehensive ranking by RefFinder showed GLUT5 and ATP2B4 as the two most stable RGs and B2M and MDH1 as the two least stable. Four heat shock protein (HSP) genes were used as target genes to evaluate the efficiency of the RGs for normalization. The results demonstrated an exponential difference in expression levels of the four HSP genes upon normalization of the data with the most stable and the least stable RGs. Our study provides convenient RGs for normalization of gene expression of key metabolic pathways affected during freezing and thawing of spermatozoa of buffalo and other closely related bovines.
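
    The abstract does not say how normalization against the reference genes was computed. As an assumption-laden sketch, the example below uses the widely applied 2^-ΔΔCt calculation, averaging the Ct values of several reference genes and assuming 100% amplification efficiency. All Ct values, names and the choice of two reference genes are hypothetical.

```python
from statistics import mean


def relative_expression(target_ct_sample, ref_cts_sample,
                        target_ct_control, ref_cts_control):
    """Fold change of a target gene by the 2^-ddCt method.

    Ct values of the reference genes are averaged per condition; averaging Ct
    values corresponds to a geometric mean of the underlying quantities.
    Assumes perfect doubling per PCR cycle.
    """
    d_sample = target_ct_sample - mean(ref_cts_sample)
    d_control = target_ct_control - mean(ref_cts_control)
    return 2.0 ** -(d_sample - d_control)


# Hypothetical Ct values: frozen-thawed sample versus fresh control,
# normalized against two reference genes.
fold_change = relative_expression(
    target_ct_sample=24.0, ref_cts_sample=[20.0, 22.0],
    target_ct_control=26.0, ref_cts_control=[20.0, 22.0],
)
```

    Because the result is exponential in the Ct difference, an unstable reference gene whose own Ct shifts by a cycle or two between conditions can change the apparent fold change several-fold, which is consistent with the exponential difference the authors describe.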

  4. Effective identification of Akt interacting proteins by two-step chemical crosslinking, co-immunoprecipitation and mass spectrometry.

    Science.gov (United States)

    Huang, Bill X; Kim, Hee-Yong

    2013-01-01

    Akt is a critical protein for cell survival and is known to interact with various proteins. However, Akt binding partners that modulate or regulate Akt activation have not been fully elucidated. Identification of Akt-interacting proteins has customarily been achieved by co-immunoprecipitation combined with western blot and/or MS analysis. An intrinsic problem of the method is the loss of interacting proteins during procedures to remove non-specific proteins. Moreover, antibody contamination often interferes with the detection of less abundant proteins. Here, we developed a novel two-step chemical crosslinking strategy to overcome these problems, which resulted in a dramatic improvement in identifying Akt interacting partners. Akt antibody was first immobilized on protein A/G beads using disuccinimidyl suberate and allowed to bind to cellular Akt along with its interacting proteins. Subsequently, dithiobis[succinimidylpropionate], a cleavable crosslinker, was introduced to produce stable complexes between Akt and binding partners prior to SDS-PAGE and nanoLC-MS/MS analysis. This approach enabled identification of ten Akt partners from cell lysates containing as little as 1.5 mg of protein, including two new potential Akt interacting partners. Only one of these proteins was detectable without the crosslinking procedures. The present method provides a sensitive and effective tool to probe Akt-interacting proteins. This strategy should also prove useful for other protein interactions, particularly those involving less abundant or weakly associating partners.

  5. Identification and characterization of an Eimeria-conserved protein in Eimeria tenella.

    Science.gov (United States)

    Dong, Hui; Wang, Yange; Han, Hongyu; Li, Ting; Zhao, Qiping; Zhu, Shunhai; Li, Liujia; Wu, Youling; Huang, Bing

    2014-02-01

    The precocious lines of Eimeria spp. have unique phenotypes. However, the genetic basis of the precocious phenotype is still poorly understood. The identification of Eimeria genes controlling the precocious phenotype is of immense importance in the fight against coccidiosis. In the present study, a novel gene of Eimeria maxima was cloned using rapid amplification of cDNA ends (RACE) based on the expressed sequence tag (EST). Homologous genes were also found in Eimeria tenella and Eimeria acervulina. Alignment of the amino acid sequences from E. tenella, E. maxima, and E. acervulina showed 80-86 % identity, demonstrating a conserved protein in different Eimeria spp. This gene, designated Eimeria-conserved protein (ECP), contained 235 amino acids with a predicted molecular mass of 25.4 kDa and had 100 % identity with one annotated protein from E. maxima (Emax_0517). Real-time PCR and Western blot analysis revealed that the expression of ECP at mRNA and protein level in E. tenella is developmentally regulated. Messenger RNA levels from the ECP gene were higher in sporozoites than in other developmental stages (unsporulated oocysts, sporulated oocysts, and second-generation merozoites). Expression of ECP protein was detected in unsporulated oocysts, increased in abundance in sporulated oocysts, and was most prominent in sporozoites. Thereafter, the level of the ECP protein decreased, and no ECP-specific protein was detected in second-generation merozoites. Immunostaining with anti-rECP indicated that ECP is highly concentrated in both refractile bodies (RB) of free sporozoites, but is located at the apical end of the sporozoites after invasion of DF-1 cells. The specific staining of the ECP protein becomes more intense in trophozoites and immature first-generation schizonts, but decreases in mature first-generation schizonts. Inhibition of the function of ECP using specific antibodies reduced the ability of E. tenella sporozoites to invade host cells. 
Compared with the

  6. Identification of protein tyrosine phosphatase 1B and casein as substrates for 124-v-Mos

    Directory of Open Access Journals (Sweden)

    Stabel Silvia

    2002-04-01

    Full Text Available Abstract Background The mos proto-oncogene encodes a cytoplasmic serine/threonine-specific protein kinase with a crucial function during meiotic cell division in vertebrates. Owing to oncogenic amino acid substitutions, the viral derivative, 124-v-Mos, displays constitutive protein kinase activity and functions independently of unknown upstream effectors of mos protein kinase. We have utilized this property of 124-v-Mos and screened for novel mos substrates in immunocomplex kinase assays in vitro. Results We generated recombinant 124-v-Mos using the baculovirus expression system in Spodoptera frugiperda cells and demonstrated constitutive kinase activity by the ability of 124-v-Mos to auto-phosphorylate and to phosphorylate vimentin, a known substrate of c-Mos. Using this approach we analyzed a panel of acidic and basic substrates in immunocomplex protein kinase assays and identified novel in vitro substrates for 124-v-Mos: the protein tyrosine phosphatase 1B (PTP1B), alpha-casein and beta-casein. We controlled for mos-specific phosphorylation of PTP1B and casein in comparative assays using a synthetic kinase-inactive 124-v-Mos mutant; further, tryptic digests of mos-phosphorylated beta-casein identified a phosphopeptide specifically targeted by wild-type 124-v-Mos. Two-dimensional phosphoamino acid analyses showed that 124-v-mos targets serine and threonine residues for phosphorylation in casein at a 1:1 ratio but auto-phosphorylation occurs predominantly on serine residues. Conclusion The mos substrates identified in this study represent a basis to approach the identification of the mos-consensus phosphorylation motif, important for the development of specific inhibitors of the Mos protein kinase.

  7. Identification of Rv3852 as an Agrimophol-Binding Protein in Mycobacterium tuberculosis.

    Directory of Open Access Journals (Sweden)

    Nan Zhao

    Full Text Available Mycobacterium tuberculosis (Mtb) is able to preserve its intrabacterial pH (pHIB) near neutrality in the acidic phagosomes of immunologically activated macrophages and to cause lethal pathology in immunocompetent mice. In contrast, when its ability to maintain pHIB homeostasis is genetically compromised, Mtb dies in acidic phagosomes and is attenuated in the mouse. Compounds that phenocopy the genetic disruption of Mtb's pHIB homeostasis could serve as starting points for drug development in their own right or through identification of their targets. A previously reported screen of a natural product library identified a phloroglucinol, agrimophol, that lowered Mtb's pHIB and killed Mtb at an acidic extrabacterial pH. Inability to identify agrimophol-resistant mutants of Mtb suggested that the compound may have more than one target. Given that polyphenolic compounds may undergo covalent reactions, we attempted an affinity-based method for target identification. The structure-activity relationship of synthetically tractable polyhydroxy diphenylmethane analogs with equivalent bioactivity informed the design of a bioactive agrimophol alkyne. After click-chemistry reaction with azido-biotin and capture on streptavidin, the biotinylated agrimophol analog pulled down the Mtb protein Rv3852, a predicted membrane protein that binds DNA in vitro. A ligand-protein interaction between agrimophol and recombinant Rv3852 was confirmed by isothermal calorimetry (ITC) and led to disruption of Rv3852's DNA binding function. However, genetic deletion of rv3852 in Mtb did not phenocopy the effect of agrimophol on Mtb, perhaps because of redundancy of its function.

  8. Identification of similar regions of protein structures using integrated sequence and structure analysis tools

    Directory of Open Access Journals (Sweden)

    Heiland Randy

    2006-03-01

    Full Text Available Abstract Background: Understanding protein function from its structure is a challenging problem. Sequence-based approaches for finding homology have broad use for annotation of both structure and function. 3D structural information on protein domains and their interactions provides a view of structure-function relationships that is complementary to sequence information. We have developed a web site (http://www.sblest.org/) and an API of web services that enable users to submit protein structures and identify statistically significant neighbors and the underlying structural environments that make that match, using a suite of sequence and structure analysis tools. To do this, we have integrated S-BLEST, PSI-BLAST and HMMer-based superfamily predictions to give a unique integrated view of the prediction of SCOP superfamilies, EC number, and GO term, as well as identification of the protein structural environments that are associated with that prediction. Additionally, we have extended UCSF Chimera and PyMOL to support our web services, so that users can characterize their own proteins of interest. Results: Users are able to submit their own queries or use a structure already in the PDB. Currently the databases that a user can query include the popular structural datasets ASTRAL 40 v1.69, ASTRAL 95 v1.69, CLUSTER50, CLUSTER70, CLUSTER90 and PDBSELECT25. The results can be downloaded directly from the site and include function prediction, analysis of the most conserved environments and automated annotation of query proteins. These results reflect the hits found with PSI-BLAST, HMMer and S-BLEST. We have evaluated how well annotation transfer can be performed on SCOP IDs, Gene Ontology (GO) IDs and EC numbers. The method is very efficient and totally automated, generally taking around fifteen minutes for a 400-residue protein. Conclusion: With structural genomics initiatives determining structures with little, if any, functional characterization

  9. Identification and characterization of a cleavage site in the proteolysis of orf virus 086 protein

    Directory of Open Access Journals (Sweden)

    Xiaoping Wang

    2016-04-01

    Full Text Available The ORF virus (ORFV) is a member of the parapoxvirus genus of the poxviridae family, but little is known about the proteolytic pathways of ORFV-encoded proteins. By contrast, the proteolysis mechanism of the vaccinia virus has been extensively explored. Vaccinia virus core protein P4a undergoes a proteolytic process that takes place at a conserved cleavage site Ala-Gly-X (where X is any amino acid) and participates in virus assembly. Bioinformatics analysis revealed that an ORFV-encoded protein, ORFV086, has a similar structure to the vaccinia virus P4a core protein. In this study, we focus on the kinetic analysis and proteolysis mechanism of ORFV086. We found, via kinetic analysis, that ORFV086 is a late gene that starts to express at 8 hours post infection at the mRNA level and 12 to 24 hours post infection at the protein level. The ORFV086 precursor and a 21 kDa fragment can be observed in mature ORFV virions. The same bands were detected at only 3 hours post infection, suggesting that both the ORFV086 precursor and the 21 kDa fragment are viral structural proteins. ORFV086 was cleaved from 12 to 24 hours post infection. The cleavage took place at different sites, resulting in seven bands with differing molecular weights. Sequence alignment predicted five putative cleavage sites at the C-terminal and internal regions of ORFV086. To investigate whether those cleavage sites are involved in proteolytic processing, full-length and several deletion mutant ORFV086 recombinant proteins were expressed and probed. The GGS site that produced a 21 kDa cleavage fragment was confirmed by identification of N/C-terminal FLAG epitope recombinant proteins, site-directed mutagenesis and pulse-chase analysis. Interestingly, chase results demonstrated that, at late times, ORFV086 is partially cleaved. Taken together, we concluded that GGS is a cleavage site in ORFV086 and produces a 21 kDa fragment post infection. Both ORFV086 precursor and the 21 kDa fragment

  10. Identification of an osteopontin-like protein in fish associated with mineral formation.

    Science.gov (United States)

    Fonseca, Vera G; Laizé, Vincent; Valente, Marta S; Cancela, M Leonor

    2007-09-01

    Fish have recently been recognized as a suitable vertebrate model and represent a promising alternative to mammals for studying mechanisms of tissue mineralization and unravelling specific questions related to vertebrate bone formation. The recently developed Sparus aurata (gilthead seabream) osteoblast-like cell line VSa16 was used to construct a cDNA subtractive library aimed at the identification of genes associated with fish tissue mineralization. Suppression subtractive hybridization, combined with mirror orientation selection, identified 194 cDNA clones representing 20 different genes up-regulated during the mineralization of the VSa16 extracellular matrix. One of these genes accounted for 69% of the total number of clones obtained and was later identified as the S. aurata osteopontin-like gene. The 2138-bp full-length S. aurata osteopontin-like cDNA was shown to encode a 374 amino-acid protein containing domains and motifs characteristic of osteopontins, such as an integrin receptor-binding RGD motif, a negatively charged domain and numerous post-translational modifications (e.g. phosphorylations and glycosylations). The common origin of mammalian osteopontin and fish osteopontin-like proteins was indicated through an in silico analysis of available sequences showing similar gene and protein structures and was further demonstrated by their specific expression in mineralized tissues and cell cultures. Accordingly, and given its proven association with mineral formation and its characteristic protein domains, we propose that the fish osteopontin-like protein may play a role in hard tissue mineralization, in a manner similar to osteopontin in higher vertebrates.

  11. Rapid identification of novel immunodominant proteins and characterization of a specific linear epitope of Campylobacter jejuni.

    Directory of Open Access Journals (Sweden)

    Sebastian Hoppe

    Full Text Available Campylobacter jejuni remains one of the major gut pathogens of our time. Its zoonotic nature and widespread distribution in industrialized countries call for a quick and reliable diagnostic tool. Antibody-based detection presents a suitable means to identify pathogenic bacteria. However, the knowledge about immunodominant targets is limited. Thus, an approach is presented which allows for the rapid screening of numerous cDNA-derived expression clones to identify novel antigens. A deeper understanding of immunodominant proteins assists in the design of diagnostic tools and furthers the insight into the bacterium's pathogenicity, as well as revealing potential candidates for vaccination. We have successfully screened 1536 clones of an expression library to identify 22 proteins that have not been described as immunodominant before. After subcloning the corresponding 22 genes and expressing the full-length proteins, we investigated the immunodominant character by microarrays and ELISA. Subsequently, seven proteins were selected for epitope mapping. For cj0669 and cj0920c, linear epitopes were identified. For cj0669, specificity assays revealed a specific linear epitope site. Consequently, an eleven amino acid residue sequence, TLIKELKRLGI, was analyzed via alanine scan, which revealed the glycine residue to be significant for binding of the antibody. The innovative approach presented herein, generating cDNAs of prokaryotes in combination with a microarray platform that renders time-consuming purification steps obsolete, has helped to illuminate novel immunodominant proteins of C. jejuni. The finding of a specific linear epitope paves the way for a plethora of future research and the potential use in diagnostic applications such as serological screenings. Moreover, the current approach is easily adaptable to other highly relevant bacteria, making it a formidable tool for the future discovery of antigens and potential biomarkers. Consequently, it is

  12. Rapid identification of novel immunodominant proteins and characterization of a specific linear epitope of Campylobacter jejuni.

    Science.gov (United States)

    Hoppe, Sebastian; Bier, Frank F; von Nickisch-Rosenegk, Markus; Nickisch-Rosenegk, Markus V

    2013-01-01

    Campylobacter jejuni remains one of the major gut pathogens of our time. Its zoonotic nature and widespread distribution in industrialized countries call for a quick and reliable diagnostic tool. Antibody-based detection presents a suitable means to identify pathogenic bacteria. However, the knowledge about immunodominant targets is limited. Thus, an approach is presented which allows for the rapid screening of numerous cDNA-derived expression clones to identify novel antigens. A deeper understanding of immunodominant proteins assists in the design of diagnostic tools and furthers the insight into the bacterium's pathogenicity, as well as revealing potential candidates for vaccination. We have successfully screened 1536 clones of an expression library to identify 22 proteins that have not been described as immunodominant before. After subcloning the corresponding 22 genes and expressing the full-length proteins, we investigated the immunodominant character by microarrays and ELISA. Subsequently, seven proteins were selected for epitope mapping. For cj0669 and cj0920c, linear epitopes were identified. For cj0669, specificity assays revealed a specific linear epitope site. Consequently, an eleven amino acid residue sequence, TLIKELKRLGI, was analyzed via alanine scan, which revealed the glycine residue to be significant for binding of the antibody. The innovative approach presented herein, generating cDNAs of prokaryotes in combination with a microarray platform that renders time-consuming purification steps obsolete, has helped to illuminate novel immunodominant proteins of C. jejuni. The finding of a specific linear epitope paves the way for a plethora of future research and the potential use in diagnostic applications such as serological screenings. Moreover, the current approach is easily adaptable to other highly relevant bacteria, making it a formidable tool for the future discovery of antigens and potential biomarkers. Consequently, it is desirable to simplify the
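
    The alanine scan mentioned in this abstract can be illustrated with a minimal sketch. It only enumerates the single-residue alanine variants of the peptide TLIKELKRLGI quoted in the record; the function name `alanine_scan` and the choice to skip positions that are already alanine are assumptions made for illustration, not the authors' protocol, and no binding data are modelled.

```python
def alanine_scan(peptide: str) -> list[tuple[int, str, str]]:
    """Return (1-based position, original residue, variant) for each
    single alanine substitution. Positions that already hold alanine
    are skipped (an illustrative assumption)."""
    variants = []
    for i, residue in enumerate(peptide):
        if residue == "A":
            continue
        variant = peptide[:i] + "A" + peptide[i + 1:]
        variants.append((i + 1, residue, variant))
    return variants


# Epitope sequence quoted in the abstract.
EPITOPE = "TLIKELKRLGI"
scan = alanine_scan(EPITOPE)

# The variant that replaces the single glycine residue, which the
# abstract reports as significant for antibody binding.
glycine_variant = next(v for v in scan if v[1] == "G")

for position, residue, variant in scan:
    print(f"{residue}{position}A  {variant}")
```

    In a real experiment each variant would be synthesized and its antibody binding compared with the wild-type peptide; a marked loss of signal for one variant points to that residue as important for binding.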

  13. Identification of pancreatic cancer invasion-related proteins by proteomic analysis

    Directory of Open Access Journals (Sweden)

    Clynes Martin

    2009-02-01

    Full Text Available Abstract Background: Markers of pancreatic cancer invasion were investigated in two clonal populations of the cell line MiaPaCa-2, Clone #3 (high invasion) and Clone #8 (low invasion), using proteomic profiling of an in vitro model of pancreatic cancer. Materials and methods: Two clonal sub-populations of the pancreatic cancer cell line MiaPaCa-2, with high and low invasive capacities, were incubated on matrigel for 24 hours prior to analysis, to stimulate cell-ECM contact and mimic in vivo interaction with the basement membrane, and were then analyzed using 2D-DIGE followed by MALDI-TOF MS. Results: Sixty proteins were identified as being differentially expressed (> 1.2-fold change and p ≤ 0.05) between Clone #3 and Clone #8. Proteins found to have higher abundance levels in the highly invasive Clone #3 compared to the low invasive Clone #8 include members of the chaperone activity proteins and cytoskeleton constituents, whereas metabolism-associated and catalytic proteins had lower abundance levels. Differential protein expression levels of ALDH1A1, VIM, STIP1, KRT18 and GAPDH were confirmed by immunoblot. Using RNAi technology, STIP1 knockdown significantly reduced invasion and proliferation of the highly invasive Clone #3. Knockdown of another target, VIM, by siRNA in Clone #3 cells also resulted in decreased invasion. Elevated expression of STIP1 was observed in pancreatic tumour tissue compared to normal pancreas, whereas ALDH1A1 stained at lower levels in pancreatic tumours, as detected by immunohistochemistry. Conclusion: Identification of targets which play a role in the highly invasive phenotype of pancreatic cancer may help to understand the biological behaviour and the rapid progression of this cancer, and may be of importance in the development of new therapeutic strategies for pancreatic cancer.
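
    The selection criterion quoted above (> 1.2-fold change and p ≤ 0.05) can be sketched as a simple filter. The protein names and values below are hypothetical placeholders, not data from the study, and treating ratios below 1 as negative reciprocal fold changes is an assumption about how down-regulation might be scored.

```python
from dataclasses import dataclass


@dataclass
class Spot:
    protein: str     # hypothetical identifier
    ratio: float     # hypothetical abundance ratio, Clone #3 / Clone #8
    p_value: float   # hypothetical p-value for the comparison


def fold_change(ratio: float) -> float:
    """Signed fold change: ratios below 1 become negative reciprocals
    (assumed convention, so 0.5 reads as -2.0)."""
    return ratio if ratio >= 1 else -1.0 / ratio


def is_differential(spot: Spot, min_fold: float = 1.2, max_p: float = 0.05) -> bool:
    """Apply the thresholds stated in the abstract: fold change > 1.2 in
    either direction and p <= 0.05."""
    return abs(fold_change(spot.ratio)) > min_fold and spot.p_value <= max_p


# Made-up example spots, for illustration only.
spots = [
    Spot("protein_A", 1.8, 0.01),   # up, significant
    Spot("protein_B", 0.5, 0.03),   # down (-2.0 fold), significant
    Spot("protein_C", 1.1, 0.001),  # change too small
    Spot("protein_D", 2.5, 0.2),    # not significant
]

selected = [s.protein for s in spots if is_differential(s)]
print(selected)
```

    Both conditions must hold at once, so a large but statistically unsupported change (protein_D) and a significant but small change (protein_C) are both excluded.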

  14. Rapid Identification of Novel Immunodominant Proteins and Characterization of a Specific Linear Epitope of Campylobacter jejuni

    Science.gov (United States)

    Hoppe, Sebastian; Bier, Frank F.; Nickisch-Rosenegk, Markus v.

    2013-01-01

    Campylobacter jejuni remains one of the major gut pathogens of our time. Its zoonotic nature and widespread distribution in industrialized countries call for a quick and reliable diagnostic tool. Antibody-based detection presents a suitable means to identify pathogenic bacteria. However, the knowledge about immunodominant targets is limited. Thus, an approach is presented which allows for the rapid screening of numerous cDNA-derived expression clones to identify novel antigens. A deeper understanding of immunodominant proteins assists in the design of diagnostic tools and furthers the insight into the bacterium’s pathogenicity, as well as revealing potential candidates for vaccination. We have successfully screened 1536 clones of an expression library to identify 22 proteins that have not been described as immunodominant before. After subcloning the corresponding 22 genes and expressing the full-length proteins, we investigated the immunodominant character by microarrays and ELISA. Subsequently, seven proteins were selected for epitope mapping. For cj0669 and cj0920c, linear epitopes were identified. For cj0669, specificity assays revealed a specific linear epitope site. Consequently, an eleven amino acid residue sequence, TLIKELKRLGI, was analyzed via alanine scan, which revealed the glycine residue to be significant for binding of the antibody. The innovative approach presented herein, generating cDNAs of prokaryotes in combination with a microarray platform that renders time-consuming purification steps obsolete, has helped to illuminate novel immunodominant proteins of C. jejuni. The finding of a specific linear epitope paves the way for a plethora of future research and the potential use in diagnostic applications such as serological screenings. Moreover, the current approach is easily adaptable to other highly relevant bacteria, making it a formidable tool for the future discovery of antigens and potential biomarkers. Consequently, it is desirable to simplify

  15. Identification of salivary mucin MUC7 binding proteins from Streptococcus gordonii

    Directory of Open Access Journals (Sweden)

    Thornton David J

    2009-08-01

    Full Text Available Abstract Background: The salivary mucin MUC7 (previously known as MG2) can adhere to various strains of streptococci that are primary colonizers and predominant microorganisms of the oral cavity. Although there is a growing interest in interaction between oral pathogens and salivary mucins, studies reporting the specific binding sites on the bacteria are rather limited. Identification and characterization of the specific interacting proteins on the bacterial cell surface, termed adhesins, are crucial to further understand host-pathogen interactions. Results: Using purified MUC7 to overlay blots of SDS-extracts of Streptococcus gordonii cell surface proteins, we demonstrate here 4 MUC7-binding bands, with apparent molecular masses of 62, 78, 84 and 133 kDa, from the Streptococcus gordonii strain PK488. Putative adhesins were identified by in-gel digestion and subsequent nanoLC-tandem mass spectrometry analysis of resultant peptides. The 62 kDa and 84 kDa bands were identified as elongation factor (EF) Tu and EF-G, respectively. The 78 kDa band was a hppA gene product, the 74 kDa oligopeptide-binding lipoprotein. The 133 kDa band contained two proteins: alpha enolase and DNA-directed RNA polymerase, beta' subunit. Some of these proteins, for example alpha enolase, are expected to be intracellular; however, flow cytometric analysis confirmed its location on the bacterial surface. Conclusion: Our data demonstrated that S. gordonii expressed a number of putative MUC7-recognizing proteins and that these contribute to MUC7 mucin binding of this streptococcal strain.

  16. Identification of Surface Protein Biomarkers of Listeria monocytogenes via Bioinformatics and Antibody-Based Protein Detection Tools

    Science.gov (United States)

    Zhang, Cathy X. Y.; Brooks, Brian W.; Huang, Hongsheng; Pagotto, Franco

    2016-01-01

    ABSTRACT The Gram-positive bacterium Listeria monocytogenes causes a significant percentage of the fatalities among foodborne illnesses in humans. Surface proteins specifically expressed in a wide range of L. monocytogenes serotypes under selective enrichment culture conditions could serve as potential biomarkers for detection and isolation of this pathogen via antibody-based methods. Our study aimed to identify such biomarkers. Interrogation of the L. monocytogenes serotype 4b strain F2365 genome identified 130 putative or known surface proteins. The homologues of four surface proteins, LMOf2365_0578, LMOf2365_0581, LMOf2365_0639, and LMOf2365_2117, were assessed as biomarkers due to the presence of conserved regions among strains of L. monocytogenes which are variable among other Listeria species. Rabbit polyclonal antibodies against the four recombinant proteins revealed the expression of only LMOf2365_0639 on the surface of serotype 4b strain LI0521 cells despite PCR detection of mRNA transcripts for all four proteins in the organism. Three of 35 monoclonal antibodies (MAbs) to LMOf2365_0639, MAbs M3643, M3644, and M3651, specifically recognized 42 (91.3%) of 46 L. monocytogenes lineage I and II isolates grown in nonselective brain heart infusion medium. While M3644 and M3651 reacted with 14 to 15 (82.4 to 88.2%) of 17 L. monocytogenes lineage I and II isolates, M3643 reacted with 22 (91.7%) of 24 lineage I, II, and III isolates grown in selective enrichment media (UVM1, modified Fraser, Palcam, and UVM2 media). The three MAbs exhibited only weak reactivities (the optical densities at 414 nm were close to the cutoff value) to some other Listeria species grown in selective enrichment media. Collectively, the data indicate the potential of LMOf2365_0639 as a surface biomarker of L. monocytogenes, with the aid of specific MAbs, for pathogen detection, identification, and isolation in clinical, environmental, and food samples. IMPORTANCE L. monocytogenes is
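
    The reactivity percentages quoted in this abstract follow directly from the isolate counts, as the short check below shows. The helper name `percent` is introduced here for illustration; the counts are the ones stated in the record.

```python
def percent(hits: int, total: int) -> float:
    """Percentage of reactive isolates, rounded to one decimal place."""
    return round(100 * hits / total, 1)


# Counts as reported in the abstract.
nonselective = percent(42, 46)      # three MAbs, lineage I and II, BHI medium
m3644_m3651_low = percent(14, 17)   # M3644/M3651, lower bound, selective media
m3644_m3651_high = percent(15, 17)  # M3644/M3651, upper bound, selective media
m3643 = percent(22, 24)             # M3643, lineage I, II and III, selective media

print(nonselective, m3644_m3651_low, m3644_m3651_high, m3643)
```

    The computed values agree with the figures given in the text (91.3%, 82.4 to 88.2%, and 91.7%).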

  17. Identification of immunogenic proteins and generation of antibodies against Salmonella Typhimurium using phage display

    Directory of Open Access Journals (Sweden)

    Meyer Torsten

    2012-06-01

    Full Text Available Abstract Background: In Europe alone, Salmonella Typhimurium causes more than 100,000 infections per year. Improved detection of livestock colonised with S. Typhimurium is necessary to prevent foodborne diseases. Currently, commercially available ELISA assays are based on a mixture of O-antigens (LPS) or total cell lysate of Salmonella and are hampered by cross-reaction. The identification of novel immunogenic proteins would be useful to develop ELISA-based diagnostic assays with a higher specificity. Results: A phage display library of the entire Salmonella Typhimurium genome was constructed and 47 immunogenic o