WorldWideScience

Sample records for protein interaction databases

  1. Protein-Protein Interaction Databases

    DEFF Research Database (Denmark)

    Szklarczyk, Damian; Jensen, Lars Juhl

    2015-01-01

    Years of meticulous curation of scientific literature and increasingly reliable computational predictions have resulted in creation of vast databases of protein interaction data. Over the years, these repositories have become a basic framework in which experiments are analyzed and new directions...

  2. Database of Interacting Proteins (DIP)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The DIP database catalogs experimentally determined interactions between proteins. It combines information from a variety of sources to create a single, consistent...

  3. Update History of This Database - Yeast Interacting Proteins Database | LSDB Archive

    Lifescience Database Archive (English)

    Full Text Available Update History of This Database (Yeast Interacting Proteins Database). 201...0/03/29: Yeast Interacting Proteins Database English archive site is opened. 2000/12/4: Yeast Interacting Proteins Database...( http://itolab.cb.k.u-tokyo.ac.jp/Y2H/ ) is released.

  4. HCVpro: Hepatitis C virus protein interaction database

    KAUST Repository

    Kwofie, Samuel K.

    2011-12-01

    It is essential to catalog characterized hepatitis C virus (HCV) protein-protein interaction (PPI) data and the associated plethora of vital functional information to augment the search for therapies, vaccines and diagnostic biomarkers. In furtherance of these goals, we have developed the hepatitis C virus protein interaction database (HCVpro) by integrating manually verified hepatitis C virus-virus and virus-human protein interactions curated from literature and databases. HCVpro is a comprehensive and integrated HCV-specific knowledgebase housing consolidated information on PPIs, functional genomics and molecular data obtained from a variety of virus databases (VirHostNet, VirusMint, HCVdb and euHCVdb), and from BIND and other relevant biology repositories. HCVpro is further populated with information on hepatocellular carcinoma (HCC) related genes that are mapped onto their encoded cellular proteins. Incorporated proteins have been mapped onto Gene Ontologies, canonical pathways, Online Mendelian Inheritance in Man (OMIM) and extensively cross-referenced to other essential annotations. The database is enriched with exhaustive reviews on structure and functions of HCV proteins, current state of drug and vaccine development and links to recommended journal articles. Users can query the database using specific protein identifiers (IDs), chromosomal locations of a gene, interaction detection methods, indexed PubMed sources as well as HCVpro, BIND and VirusMint IDs. The use of HCVpro is free and the resource can be accessed via http://apps.sanbi.ac.za/hcvpro/ or http://cbrc.kaust.edu.sa/hcvpro/. © 2011 Elsevier B.V.

  5. Full Data of Yeast Interacting Proteins Database (Original Version) - Yeast Interacting Proteins Database | LSDB Archive

    Lifescience Database Archive (English)

    Full Text Available Yeast Interacting Proteins Database Full Data of Yeast Interacting Proteins Database (Origin...al Version) Data detail Data name Full Data of Yeast Interacting Proteins Database (Original Version) DOI 10....18908/lsdba.nbdc00742-004 Description of data contents The entire data in the Yeast Interacting Proteins Database...eir interactions are required. Several sources including YPD (Yeast Proteome Database, Costanzo, M. C., Hoga...ematic name in the SGD (Saccharomyces Genome Database; http://www.yeastgenome.org/). Bait gene name The gen

  6. Database Description - Yeast Interacting Proteins Database | LSDB Archive

    Lifescience Database Archive (English)

    Full Text Available Yeast Interacting Proteins Database Database Description General information of database Database... name Yeast Interacting Proteins Database Alternative name - DOI 10.18908/lsdba.nbdc00742-000 Creator C...-ken 277-8561 Tel: +81-4-7136-3989 FAX: +81-4-7136-3979 E-mail : Database classif...s cerevisiae Taxonomy ID: 4932 Database description Information on interactions and related information obta...l Acad Sci U S A. 2001 Apr 10;98(8):4569-74. Epub 2001 Mar 13. External Links: Original website information Database

  7. Core Data of Yeast Interacting Proteins Database (Original Version) - Yeast Interacting Proteins Database | LSDB Archive

    Lifescience Database Archive (English)

    Full Text Available y are in the reverse direction. *1 A comprehensive two-hybrid analysis to explore the yeast protein interact...s. 2000 Jan 1;28(1):73-6. *2 The yeast proteome database (YPD) and Caenorhabditis elegans proteome database (WormPD): comprehensive...000 Jan 1;28(1):73-6. *3 A comprehensive analysis of protein-protein interactions in Saccharomyces cerevisia

  8. HCVpro: Hepatitis C virus protein interaction database

    KAUST Repository

    Kwofie, Samuel K.; Schaefer, Ulf; Sundararajan, Vijayaraghava Seshadri; Bajic, Vladimir B.; Christoffels, Alan G.

    2011-01-01

    It is essential to catalog characterized hepatitis C virus (HCV) protein-protein interaction (PPI) data and the associated plethora of vital functional information to augment the search for therapies, vaccines and diagnostic biomarkers

  9. Yeast Interacting Proteins Database: YPR103W, YOR047C

    Lifescience Database Archive (English)

    Full Text Available tein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors...gulated gene expression; interacts with protein kinase Snf1p, glucose sensors Snf

  10. Yeast Interacting Proteins Database: YLR447C, YOR047C

    Lifescience Database Archive (English)

    Full Text Available xpression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Sp...; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; act

  11. Yeast Interacting Proteins Database: YGR013W, YKL012W

    Lifescience Database Archive (English)

    Full Text Available tion U1 snRNP protein involved in splicing, interacts with the branchpoint-binding protein during the formation of the second commitm... PRP40 U1 snRNP protein involved in splicing, interacts with the branchpoint-binding protein during the form...ation of the second commitment complex Rows with this prey as prey (1) Rows with

  12. Yeast Interacting Proteins Database: YGL237C, YOR047C

    Lifescience Database Archive (English)

    Full Text Available ene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding prote... expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein

  13. Yeast Interacting Proteins Database: YKL002W, YOR047C

    Lifescience Database Archive (English)

    Full Text Available ene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding prote...xpression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Sp

  14. Yeast Interacting Proteins Database: YGL145W, YNL258C

    Lifescience Database Archive (English)

    Full Text Available ripheral membrane protein required for Golgi-to-ER retrograde traffic; component ... membrane protein required for Golgi-to-ER retrograde traffic; component of the ER target site that interact

  15. Yeast Interacting Proteins Database: YOR047C, YKL038W

    Lifescience Database Archive (English)

    Full Text Available racts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a...Bait description Protein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose senso...rs Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regulator of the tra

  16. Yeast Interacting Proteins Database: YFR049W, YOR047C

    Lifescience Database Archive (English)

    Full Text Available protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regulator... (0) YOR047C STD1 Protein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sens...ors Snf3p and Rgt2p, and TATA-binding protein Spt15p; ac

  17. Yeast Interacting Proteins Database: YMR280C, YOR047C

    Lifescience Database Archive (English)

    Full Text Available olved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensor... glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, an

  18. Yeast Interacting Proteins Database: YOR358W, YOR047C

    Lifescience Database Archive (English)

    Full Text Available ; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; act...rotein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regulator o

  19. Yeast Interacting Proteins Database: YGL127C, YOR047C

    Lifescience Database Archive (English)

    Full Text Available ith protein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt15p; acts as a regula...rotein involved in control of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors

  20. Yeast Interacting Proteins Database: YOR302W, YOR047C

    Lifescience Database Archive (English)

    Full Text Available rol of glucose-regulated gene expression; interacts with protein kinase Snf1p, glucose sensors Snf3p and Rgt...tein kinase Snf1p, glucose sensors Snf3p and Rgt2p, and TATA-binding protein Spt1

  1. Yeast Interacting Proteins Database: YNL258C, YKR022C

    Lifescience Database Archive (English)

    Full Text Available YNL258C DSL1 Peripheral membrane protein required for Golgi-to-ER retrograde traffi...equired for Golgi-to-ER retrograde traffic; component of the ER target site that interacts with coatomer, th...it ORF YNL258C Bait gene name DSL1 Bait description Peripheral membrane protein r

  2. Yeast Interacting Proteins Database: YDR176W, YDL239C

    Lifescience Database Archive (English)

    Full Text Available a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle pole...ining structure at the leading edge of the prospore membrane via interaction with spindle pole body componen...DY3 Prey description Protein required for spore wall formation, thought to mediate assembly of a Don1p-conta

  3. Yeast Interacting Proteins Database: YDL239C, YDR273W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...it as prey (1) YDR273W DON1 Meiosis-specific component of the spindle pole body, part of the leading... edge protein (LEP) coat, forms a ring-like structure at the leading edge of the prospore...ption Protein required for spore wall formation, thought to mediate assembly of a Don1p-containing structure at the leading...description Meiosis-specific component of the spindle pole body, part of the leading edge protein (LEP) coat

  4. Yeast Interacting Proteins Database: YDL239C, YLR423C

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...cription Protein required for spore wall formation, thought to mediate assembly of a Don1p-containing structure at the leading

  5. Yeast Interacting Proteins Database: YDL239C, YPL070W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...cription Protein required for spore wall formation, thought to mediate assembly of a Don1p-containing structure at the leading

  6. Yeast Interacting Proteins Database: YDL239C, YML042W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...iption Protein required for spore wall formation, thought to mediate assembly of a Don1p-containing structure at the leading

  7. Yeast Interacting Proteins Database: YDL239C, YKL103C

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...ait description Protein required for spore wall formation, thought to mediate assembly of a Don1p-containing structure at the leading

  8. License - Yeast Interacting Proteins Database | LSDB Archive

    Lifescience Database Archive (English)

    Full Text Available List Contact us Yeast Interacting Proteins Database License to Use This Database Last updated : 2010/02/15 You may use this database...nal License described below. The Standard License specifies the license terms regarding the use of this database... and the requirements you must follow in using this database. The Additional ...the Standard License. Standard License The Standard License for this database is the license specified in th...e Creative Commons Attribution-Share Alike 2.1 Japan . If you use data from this database

  9. A protein domain interaction interface database: InterPare

    Directory of Open Access Journals (Sweden)

    Lee Jungsul

    2005-08-01

    Full Text Available Abstract Background: Most proteins function by interacting with other molecules. Their interaction interfaces are highly conserved throughout evolution to avoid undesirable interactions that lead to fatal disorders in cells. Rational drug discovery includes computational methods to identify the interaction sites of lead compounds to the target molecules. Identifying and classifying protein interaction interfaces on a large scale can help researchers discover drug targets more efficiently. Description: We introduce a large-scale protein domain interaction interface database called InterPare (http://interpare.net). It contains both inter-chain (between chains) interfaces and intra-chain (within chain) interfaces. InterPare uses three methods to detect interfaces: 1) the geometric distance method for checking the distance between atoms that belong to different domains, 2) Accessible Surface Area (ASA), a method for detecting the buried region of a protein that is detached from a solvent when forming multimers or complexes, and 3) the Voronoi diagram, a computational geometry method that uses a mathematical definition of interface regions. InterPare includes visualization tools to display protein interior, surface, and interaction interfaces. It also provides statistics such as the amino acid propensities of queried protein according to its interior, surface, and interface region. The atom coordinates that belong to interface, surface, and interior regions can be downloaded from the website. Conclusion: InterPare is an open and public database server for protein interaction interface information. It contains the large-scale interface data for proteins whose 3D-structures are known. As of November 2004, there were 10,583 (Geometric distance), 10,431 (ASA), and 11,010 (Voronoi diagram) entries in the Protein Data Bank (PDB) containing interfaces, according to the above three methods. In the case of the geometric distance method, there are 31,620 inter-chain domain
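The geometric distance method named in this record can be illustrated with a minimal Python sketch. It is not InterPare's implementation: the function name `interface_atoms`, the 5.0 cutoff (treated here as ångströms), and the toy coordinates are all assumptions made for illustration. An atom is counted as an interface atom when at least one atom of the other domain lies within the cutoff.

```python
from itertools import product
from math import dist  # Euclidean distance, Python 3.8+


def interface_atoms(domain_a, domain_b, cutoff=5.0):
    """Return the indices of atoms in each domain that lie within
    `cutoff` of any atom in the other domain.

    `domain_a` and `domain_b` are sequences of (x, y, z) tuples.
    The default cutoff is an illustrative assumption, not a value
    taken from the InterPare record.
    """
    a_hits, b_hits = set(), set()
    for (i, p), (j, q) in product(enumerate(domain_a), enumerate(domain_b)):
        if dist(p, q) <= cutoff:
            a_hits.add(i)
            b_hits.add(j)
    return sorted(a_hits), sorted(b_hits)


# Hypothetical toy coordinates: only the first atom of each domain is close.
domain_a = [(0.0, 0.0, 0.0), (10.0, 0.0, 0.0)]
domain_b = [(3.0, 0.0, 0.0), (30.0, 0.0, 0.0)]
print(interface_atoms(domain_a, domain_b))
```

The all-pairs loop is quadratic in the number of atoms; for real structures a spatial index (a k-d tree or a grid) would usually replace it.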

  10. Yeast Interacting Proteins Database: YNR006W, YHL002W

    Lifescience Database Archive (English)

    Full Text Available ling Golgi proteins, forming lumenal membranes and sorting ubiquitinated proteins destined for degradation; ..., as well as for recycling of Golgi proteins and formation of lumenal membranes Rows with this prey as prey ...1p; required for recycling Golgi proteins, forming lumenal membranes and sorting ubiquitinated proteins dest...degradation, as well as for recycling of Golgi proteins and formation of lumenal membranes

  11. Yeast Interacting Proteins Database: YHL002W, YNR006W

    Lifescience Database Archive (English)

    Full Text Available ycling of Golgi proteins and formation of lumenal membranes Rows with this bait as bait (1) Rows with this b...required for recycling Golgi proteins, forming lumenal membranes and sorting ubiquitinated proteins destined...on, as well as for recycling of Golgi proteins and formation of lumenal membranes...ith Hse1p; required for recycling Golgi proteins, forming lumenal membranes and sorting ubiquitinated protei

  12. Yeast Interacting Proteins Database: YNL152W, YMR032W

    Lifescience Database Archive (English)

    Full Text Available YNL152W INN1 Essential protein that associates with the contractile actomyosin ring... Bait description Essential protein that associates with the contractile actomyosin ring, required for ingre

  13. Yeast Interacting Proteins Database: YNL258C, YGL145W

    Lifescience Database Archive (English)

    Full Text Available YNL258C DSL1 Peripheral membrane protein required for Golgi-to-ER retrograde traffi...t description Peripheral membrane protein required for Golgi-to-ER retrograde traffic; component of the ER t

  14. Yeast Interacting Proteins Database: YNL216W, YLR453C

    Lifescience Database Archive (English)

    Full Text Available YNL216W RAP1 DNA-binding protein involved in either activation or repression of transcription, depending...NA-binding protein involved in either activation or repression of transcription, depending on binding site c

  15. Yeast Interacting Proteins Database: YOL006C, YMR233W

    Lifescience Database Archive (English)

    Full Text Available fusion protein localizes to the cytoplasm, nucleus and nucleolus Rows with this prey as prey (1) Rows with t...on protein localizes to the cytoplasm, nucleus and nucleolus Rows with this prey

  16. Yeast Interacting Proteins Database: YKL002W, YFL034C-B

    Lifescience Database Archive (English)

    Full Text Available integral membrane proteins into lumenal vesicles of multivesicular bodies, and for delivery of newly synthes...ntegral membrane proteins into lumenal vesicles of multivesicular bodies, and for delivery of newly synthesi

  17. Yeast Interacting Proteins Database: YJR091C, YKL002W

    Lifescience Database Archive (English)

    Full Text Available g of integral membrane proteins into lumenal vesicles of multivesicular bodies, and for delivery of newly sy... integral membrane proteins into lumenal vesicles of multivesicular bodies, and for delivery of newly synthe

  18. Yeast Interacting Proteins Database: YCL046W, YGL115W

    Lifescience Database Archive (English)

    Full Text Available YCL046W - Dubious open reading frame unlikely to encode a protein, based on availab...ading frame unlikely to encode a protein, based on available experimental and comparative sequence data; par

  19. Yeast Interacting Proteins Database: YLR347C, YLR377C

    Lifescience Database Archive (English)

    Full Text Available gy-mediated degradation depending on growth conditions; interacts with Vid30p Rows with this prey as prey (4...r autophagy-mediated degradation depending on growth conditions; interacts with V

  20. Yeast Interacting Proteins Database: YNL189W, YLR377C

    Lifescience Database Archive (English)

    Full Text Available phagy-mediated degradation depending on growth conditions; interacts with Vid30p Rows with this prey as prey...ated or autophagy-mediated degradation depending on growth conditions; interacts

  1. Yeast Interacting Proteins Database: YML064C, YLR377C

    Lifescience Database Archive (English)

    Full Text Available d or autophagy-mediated degradation depending on growth conditions; interacts with Vid30p Rows with this pre...er proteasome-mediated or autophagy-mediated degradation depending on growth conditions; interacts with Vid3

  2. Yeast Interacting Proteins Database: YOR117W, YJL184W

    Lifescience Database Archive (English)

    Full Text Available c stress response, telomere uncapping and elongation, transcription; component of the EKC/KEOPS protein comp...n proposed to be involved in the modification of N-linked oligosaccharides, osmotic stress response, telomere uncap

  3. Yeast Interacting Proteins Database: YER081W, YDR105C

    Lifescience Database Archive (English)

    Full Text Available YDR105C TMS1 Vacuolar membrane protein of unknown function that is conserved in mammals; predicted to contai...tion that is conserved in mammals; predicted to contain eleven transmembrane heli

  4. Yeast Interacting Proteins Database: YKL002W, YLR423C

    Lifescience Database Archive (English)

    Full Text Available integral membrane proteins into lumenal vesicles of multivesicular bodies, and for delivery of newly synthes... into lumenal vesicles of multivesicular bodies, and for delivery of newly synthesized vacuolar enzymes to t

  5. Yeast Interacting Proteins Database: YKL002W, YDL165W

    Lifescience Database Archive (English)

    Full Text Available integral membrane proteins into lumenal vesicles of multivesicular bodies, and for delivery of newly synthes...ins into lumenal vesicles of multivesicular bodies, and for delivery of newly synthesized vacuolar enzymes t

  6. Yeast Interacting Proteins Database: YLR447C, YDR277C

    Lifescience Database Archive (English)

    Full Text Available uction pathway, required for repression of transcription by Rgt1p; interacts with Rgt1p and the Snf3p and Rgt2p glucose sensors...transduction pathway, required for repression of transcription by Rgt1p; interacts with Rgt1p and the Snf3p and Rgt2p glucose sensors

  7. Yeast Interacting Proteins Database: YMR125W, YPL178W

    Lifescience Database Archive (English)

    Full Text Available so contains Sto1p, component of the spliceosomal commitment complex; interacts with Npl3p, possibly to packa...lso contains Sto1p, component of the spliceosomal commitment complex; interacts with Npl3p, possibly to pack

  8. Yeast Interacting Proteins Database: YDL239C, YGR268C

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...sembly of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spi

  9. Yeast Interacting Proteins Database: YDL239C, YPL255W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...ediate assembly of a Don1p-containing structure at the leading edge of the prospore membrane via interaction

  10. Yeast Interacting Proteins Database: YDL239C, YOR324C

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p... a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle pole

  11. Yeast Interacting Proteins Database: YDL239C, YDR148C

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...mbly of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spind

  12. Yeast Interacting Proteins Database: YNR051C, YER151C

    Lifescience Database Archive (English)

    Full Text Available that coregulates anterograde and retrograde transport between the endoplasmic reticulum and Golgi compartme...C UBP3 Ubiquitin-specific protease that interacts with Bre5p to co-regulate anterograde and retrograde...t coregulates anterograde and retrograde transport between the endoplasmic reticulum and Golgi compartments;...3 Prey description Ubiquitin-specific protease that interacts with Bre5p to co-regulate anterograde and retrograde

  13. Yeast Interacting Proteins Database: YLR377C, YLR377C

    Lifescience Database Archive (English)

    Full Text Available sis pathway, required for glucose metabolism; undergoes either proteasome-mediated or autophagy-mediated degradation depending...utophagy-mediated degradation depending on growth conditions; interacts with Vid30p Rows with this prey as p...d or autophagy-mediated degradation depending on growth conditions; interacts wit...me-mediated or autophagy-mediated degradation depending on growth conditions; int

  14. Yeast Interacting Proteins Database: YDL239C, YHR184W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...e wall formation, thought to mediate assembly of a Don1p-containing structure at the leading edge of the pro

  15. Yeast Interacting Proteins Database: YDL239C, YPL124W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...ore wall formation, thought to mediate assembly of a Don1p-containing structure at the leading edge of the p

  16. Yeast Interacting Proteins Database: YDL239C, YAL028W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p... to mediate assembly of a Don1p-containing structure at the leading edge of the prospore membrane via intera

  17. Yeast Interacting Proteins Database: YDL239C, YLR072W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p... wall formation, thought to mediate assembly of a Don1p-containing structure at the leading edge of the pros

  18. Yeast Interacting Proteins Database: YDL239C, YBR072W

    Lifescience Database Archive (English)

    Full Text Available of a Don1p-containing structure at the leading edge of the prospore membrane via interaction with spindle p...ht to mediate assembly of a Don1p-containing structure at the leading edge of the prospore membrane via inte

  19. Yeast Interacting Proteins Database: YOL069W, YIL144W

    Lifescience Database Archive (English)

    Full Text Available complex (Ndc80p-Nuf2p-Spc24p-Spc25p); involved in chromosome segregation, spindle checkpoint activity and kinetochore clustering...vity, kinetochore assembly and clustering Rows with this prey as prey (2) Rows with this prey as bait (0) 12...-Nuf2p-Spc24p-Spc25p); involved in chromosome segregation, spindle checkpoint activity and kinetochore clustering...d coiled-coil protein involved in chromosome segregation, spindle checkpoint activity, kinetochore assembly and clustering

  20. SynechoNET: integrated protein-protein interaction database of a model cyanobacterium Synechocystis sp. PCC 6803

    OpenAIRE

    Kim, Woo-Yeon; Kang, Sungsoo; Kim, Byoung-Chul; Oh, Jeehyun; Cho, Seongwoong; Bhak, Jong; Choi, Jong-Soon

    2008-01-01

    Background: Cyanobacteria are model organisms for studying photosynthesis, carbon and nitrogen assimilation, evolution of plant plastids, and adaptability to environmental stresses. Despite many studies on cyanobacteria, there is no web-based database of their regulatory and signaling protein-protein interaction networks to date. Description: We report a database and website SynechoNET that provides predicted protein-protein interactions. SynechoNET shows cyanobacterial domain-domain interactio...

  1. iPfam: a database of protein family and domain interactions found in the Protein Data Bank.

    Science.gov (United States)

    Finn, Robert D; Miller, Benjamin L; Clements, Jody; Bateman, Alex

    2014-01-01

    The database iPfam, available at http://ipfam.org, catalogues Pfam domain interactions based on known 3D structures that are found in the Protein Data Bank, providing interaction data at the molecular level. Previously, the iPfam domain-domain interaction data was integrated within the Pfam database and website, but it has now been migrated to a separate database. This allows for independent development, improving data access and giving clearer separation between the protein family and interactions datasets. In addition to domain-domain interactions, iPfam has been expanded to include interaction data for domain bound small molecule ligands. Functional annotations are provided from source databases, supplemented by the incorporation of Wikipedia articles where available. iPfam (version 1.0) contains >9500 domain-domain and 15 500 domain-ligand interactions. The new website provides access to this data in a variety of ways, including interactive visualizations of the interaction data.

  2. The human interactome knowledge base (hint-kb): An integrative human protein interaction database enriched with predicted protein–protein interaction scores using a novel hybrid technique

    KAUST Repository

    Theofilatos, Konstantinos A.; Dimitrakopoulos, Christos M.; Likothanassis, Spiridon D.; Kleftogiannis, Dimitrios A.; Moschopoulos, Charalampos N.; Alexakos, Christos; Papadimitriou, Stergios; Mavroudi, Seferina P.

    2013-01-01

    Proteins are the functional components of many cellular processes and the identification of their physical protein–protein interactions (PPIs) is an area of mature academic research. Various databases have been developed containing information about

  3. STITCH 2: an interaction network database for small molecules and proteins

    DEFF Research Database (Denmark)

    Kuhn, Michael; Szklarczyk, Damian; Franceschini, Andrea

    2010-01-01

    Over the last years, the publicly available knowledge on interactions between small molecules and proteins has been steadily increasing. To create a network of interactions, STITCH aims to integrate the data dispersed over the literature and various databases of biological pathways, drug-target relationships and binding affinities. In STITCH 2, the number of relevant interactions is increased by incorporation of BindingDB, PharmGKB and the Comparative Toxicogenomics Database. The resulting network can be explored interactively or used as the basis for large-scale analyses. To facilitate links to other chemical databases, we adopt InChIKeys that allow identification of chemicals with a short, checksum-like string. STITCH 2.0 connects proteins from 630 organisms to over 74,000 different chemicals, including 2200 drugs. STITCH can be accessed at http://stitch.embl.de/.

  4. Genes2Networks: connecting lists of gene symbols using mammalian protein interactions databases

    Directory of Open Access Journals (Sweden)

    Ma'ayan Avi

    2007-10-01

    Full Text Available Abstract Background In recent years, mammalian protein-protein interaction network databases have been developed. The interactions in these databases are either extracted manually from low-throughput experimental biomedical research literature, extracted automatically from literature using techniques such as natural language processing (NLP), generated experimentally using high-throughput methods such as yeast-2-hybrid screens, or predicted using an assortment of computational approaches. Genes or proteins identified as significantly changing in proteomic experiments, or identified as susceptibility disease genes in genomic studies, can be placed in the context of protein interaction networks in order to assign these genes and proteins to pathways and protein complexes. Results Genes2Networks is a software system that integrates the content of ten mammalian interaction network datasets. Filtering techniques to prune low-confidence interactions were implemented. Genes2Networks is delivered as a web-based service using AJAX. The system can be used to extract relevant subnetworks created from "seed" lists of human Entrez gene symbols. The output includes a dynamic linkable three-color web-based network map, with a statistical analysis report that identifies significant intermediate nodes used to connect the seed list. Conclusion Genes2Networks is powerful web-based software that can help experimental biologists to interpret lists of genes and proteins such as those commonly produced through genomic and proteomic experiments, as well as lists of genes and proteins associated with disease processes. This system can be used to find relationships between genes and proteins from seed lists, and predict additional genes or proteins that may play key roles in common pathways or protein complexes.
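
    The idea of connecting a seed list through intermediate nodes can be illustrated with a small Python sketch. This is not the Genes2Networks algorithm, which the abstract does not specify: it assumes an undirected interaction graph, a breadth-first shortest-path search, and a cap on path length, and it leaves out the statistical analysis of intermediates. The gene names and the `intermediates` helper are hypothetical.

```python
from collections import deque
from itertools import combinations


def build_graph(interactions):
    """Build an undirected adjacency map from (a, b) interaction pairs."""
    graph = {}
    for a, b in interactions:
        graph.setdefault(a, set()).add(b)
        graph.setdefault(b, set()).add(a)
    return graph


def shortest_path(graph, start, goal, max_edges):
    """Breadth-first search; return one shortest path of at most max_edges edges, else None."""
    if start not in graph or goal not in graph:
        return None
    queue = deque([[start]])
    seen = {start}
    while queue:
        path = queue.popleft()
        if path[-1] == goal:
            return path
        if len(path) - 1 == max_edges:
            continue  # do not extend paths that already use the full budget
        for neighbor in sorted(graph[path[-1]]):
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append(path + [neighbor])
    return None


def intermediates(interactions, seeds, max_edges=2):
    """Collect non-seed nodes lying on short paths between pairs of seeds."""
    graph = build_graph(interactions)
    found = set()
    for a, b in combinations(sorted(seeds), 2):
        path = shortest_path(graph, a, b, max_edges)
        if path:
            found.update(node for node in path if node not in seeds)
    return found


# Hypothetical toy network: SEED_A and SEED_B share the neighbor X1;
# SEED_B and SEED_C are three edges apart via Y1 and Z1.
interactions = [
    ("SEED_A", "X1"), ("X1", "SEED_B"),
    ("SEED_B", "Y1"), ("Y1", "Z1"), ("Z1", "SEED_C"),
]
seeds = {"SEED_A", "SEED_B", "SEED_C"}

print(sorted(intermediates(interactions, seeds, max_edges=2)))
print(sorted(intermediates(interactions, seeds, max_edges=3)))
```

    Raising the path-length cap pulls in more distant intermediates, which is one reason a tool of this kind would need filtering to keep the resulting subnetwork interpretable.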

  5. PARPs database: A LIMS systems for protein-protein interaction data mining or laboratory information management system

    Directory of Open Access Journals (Sweden)

    Picard-Cloutier Aude

    2007-12-01

    Full Text Available Abstract Background In the "post-genome" era, mass spectrometry (MS) has become an important method for the analysis of proteins and the rapid advancement of this technique, in combination with other proteomics methods, results in an increasing amount of proteome data. This data must be archived and analysed using specialized bioinformatics tools. Description We herein describe "PARPs database," a data analysis and management pipeline for liquid chromatography tandem mass spectrometry (LC-MS/MS) proteomics. PARPs database is a web-based tool whose features include experiment annotation, protein database searching, protein sequence management, as well as data-mining of the peptides and proteins identified. Conclusion Using this pipeline, we have successfully identified several interactions of biological significance between PARP-1 and other proteins, namely RFC-1, 2, 3, 4 and 5.

  6. MEGADOCK-Web: an integrated database of high-throughput structure-based protein-protein interaction predictions.

    Science.gov (United States)

    Hayashi, Takanori; Matsuzaki, Yuri; Yanagisawa, Keisuke; Ohue, Masahito; Akiyama, Yutaka

    2018-05-08

    Protein-protein interactions (PPIs) play several roles in living cells, and computational PPI prediction is a major focus of many researchers. The three-dimensional (3D) structure and binding surface are important for the design of PPI inhibitors. Therefore, rigid body protein-protein docking calculations for two protein structures are expected to allow elucidation of PPIs different from known complexes in terms of 3D structures because known PPI information is not explicitly required. We have developed rapid PPI prediction software based on protein-protein docking, called MEGADOCK. In order to fully utilize the benefits of computational PPI predictions, it is necessary to construct a comprehensive database to gather prediction results and their predicted 3D complex structures and to make them easily accessible. Although several databases exist that provide predicted PPIs, the previous databases do not contain a sufficient number of entries for the purpose of discovering novel PPIs. In this study, we constructed an integrated database of MEGADOCK PPI predictions, named MEGADOCK-Web. MEGADOCK-Web provides more than 10 times the number of PPI predictions than previous databases and enables users to conduct PPI predictions that cannot be found in conventional PPI prediction databases. In MEGADOCK-Web, there are 7528 protein chains and 28,331,628 predicted PPIs from all possible combinations of those proteins. Each protein structure is annotated with PDB ID, chain ID, UniProt AC, related KEGG pathway IDs, and known PPI pairs. Additionally, MEGADOCK-Web provides four powerful functions: 1) searching precalculated PPI predictions, 2) providing annotations for each predicted protein pair with an experimentally known PPI, 3) visualizing candidates that may interact with the query protein on biochemical pathways, and 4) visualizing predicted complex structures through a 3D molecular viewer. MEGADOCK-Web provides a huge amount of comprehensive PPI predictions based on
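
    The two figures quoted in this abstract can be checked against each other. The short Python sketch below assumes that "all possible combinations" means unordered pairs of distinct chains; the abstract does not state this explicitly.

```python
import math

# Figures quoted in the abstract above.
n_chains = 7528
reported_pairs = 28_331_628

# Assumption: every unordered pair of distinct chains is counted once,
# i.e. "n choose 2" = n * (n - 1) / 2.
all_pairs = math.comb(n_chains, 2)

print(all_pairs, all_pairs == reported_pairs)
```

    Under that assumption the count matches the reported number of predicted PPIs exactly.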

  7. Core Data of Yeast Interacting Proteins Database (Annotation Updated Version) - Yeast Interacting Proteins Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available hod As the indicator of reliability of the interactions obtained by the experiment, the literature informati...ic Acids Res. 28, 73-76.) are used for literature collection. Number of data entries 841 interactions Data i...d from the YPD. Literature sharing score The score concerning co-occurrence of Prey and Bait in the litera...ture, calculated by the calculation formula. Calculation formula for the score: Cur

  8. HippDB: a database of readily targeted helical protein-protein interactions.

    Science.gov (United States)

    Bergey, Christina M; Watkins, Andrew M; Arora, Paramjit S

    2013-11-01

    HippDB catalogs every protein-protein interaction whose structure is available in the Protein Data Bank and which exhibits one or more helices at the interface. The Web site accepts queries on variables such as helix length and sequence, and it provides computational alanine scanning and change in solvent-accessible surface area values for every interfacial residue. HippDB is intended to serve as a starting point for structure-based small molecule and peptidomimetic drug development. HippDB is freely available on the web at http://www.nyu.edu/projects/arora/hippdb. The Web site is implemented in PHP, MySQL and Apache. Source code is freely available for download at http://code.google.com/p/helidb, implemented in Perl and supported on Linux. Contact: arora@nyu.edu.

  9. Full Data of Yeast Interacting Proteins Database (Annotation Updated Version) - Yeast Interacting Proteins Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available As the indicator of reliability of the interactions obtained by the experiment, the literature...cids Res. 28, 73-76.) are used for literature collection. Number of data entries ...ber of articles obtained from the YPD. Literature sharing score The score concerning co-occurrence of Prey and Bait in the literature

  10. The human interactome knowledge base (hint-kb): An integrative human protein interaction database enriched with predicted protein–protein interaction scores using a novel hybrid technique

    KAUST Repository

    Theofilatos, Konstantinos A.

    2013-07-12

    Proteins are the functional components of many cellular processes and the identification of their physical protein–protein interactions (PPIs) is an area of mature academic research. Various databases have been developed containing information about experimentally and computationally detected human PPIs as well as their corresponding annotation data. However, these databases contain many false positive interactions, are partial and only a few of them incorporate data from various sources. To overcome these limitations, we have developed HINT-KB (http://biotools.ceid.upatras.gr/hint-kb/), a knowledge base that integrates data from various sources, provides a user-friendly interface for their retrieval, calculates a set of features of interest and computes a confidence score for every candidate protein interaction. This confidence score is essential for filtering the false positive interactions which are present in existing databases, predicting new protein interactions and measuring the frequency of each true protein interaction. For this reason, a novel machine learning hybrid methodology, called Evolutionary Kalman Mathematical Modelling (EvoKalMaModel), was used to achieve an accurate and interpretable scoring methodology. The experimental results indicated that the proposed scoring scheme outperforms existing computational methods for the prediction of PPIs.

  11. The drug-minded protein interaction database (DrumPID) for efficient target analysis and drug development.

    Science.gov (United States)

    Kunz, Meik; Liang, Chunguang; Nilla, Santosh; Cecil, Alexander; Dandekar, Thomas

    2016-01-01

    The drug-minded protein interaction database (DrumPID) has been designed to provide fast, tailored information on drugs and their protein networks including indications, protein targets and side-targets. Starting queries include compound, target and protein interactions and organism-specific protein families. Furthermore, drug name, chemical structures and their SMILES notation, affected proteins (potential drug targets), organisms as well as diseases can be queried including various combinations and refinement of searches. Drugs and protein interactions are analyzed in detail with reference to protein structures and catalytic domains, related compound structures as well as potential targets in other organisms. DrumPID considers drug functionality, compound similarity, target structure, interactome analysis and organismic range for a compound, useful for drug development, predicting drug side-effects and structure-activity relationships. Database URL: http://drumpid.bioapps.biozentrum.uni-wuerzburg.de. © The Author(s) 2016. Published by Oxford University Press.

  12. A database and tool, IM Browser, for exploring and integrating emerging gene and protein interaction data for Drosophila

    Directory of Open Access Journals (Sweden)

    Parrish Jodi R

    2006-04-01

    Full Text Available Abstract Background Biological processes are mediated by networks of interacting genes and proteins. Efforts to map and understand these networks are resulting in the proliferation of interaction data derived from both experimental and computational techniques for a number of organisms. The volume of this data combined with the variety of specific forms it can take has created a need for comprehensive databases that include all of the available data sets, and for exploration tools to facilitate data integration and analysis. One powerful paradigm for the navigation and analysis of interaction data is an interaction graph or map that represents proteins or genes as nodes linked by interactions. Several programs have been developed for graphical representation and analysis of interaction data, yet there remains a need for alternative programs that can provide casual users with rapid easy access to many existing and emerging data sets. Description Here we describe a comprehensive database of Drosophila gene and protein interactions collected from a variety of sources, including low and high throughput screens, genetic interactions, and computational predictions. We also present a program for exploring multiple interaction data sets and for combining data from different sources. The program, referred to as the Interaction Map (IM) Browser, is a web-based application for searching and visualizing interaction data stored in a relational database system. Use of the application requires no downloads and minimal user configuration or training, thereby enabling rapid initial access to interaction data. IM Browser was designed to readily accommodate and integrate new types of interaction data as it becomes available. Moreover, all information associated with interaction measurements or predictions and the genes or proteins involved are accessible to the user. This allows combined searches and analyses based on either common or technique-specific attributes

  13. TcoF-DB: dragon database for human transcription co-factors and transcription factor interacting proteins

    KAUST Repository

    Schaefer, Ulf

    2010-10-21

    The initiation and regulation of transcription in eukaryotes is complex and involves a large number of transcription factors (TFs), which are known to bind to the regulatory regions of eukaryotic DNA. Apart from TF-DNA binding, protein-protein interaction involving TFs is an essential component of the machinery facilitating transcriptional regulation. Proteins that interact with TFs in the context of transcription regulation but do not bind to the DNA themselves, we consider transcription co-factors (TcoFs). The influence of TcoFs on transcriptional regulation and initiation, although indirect, has been shown to be significant with the functionality of TFs strongly influenced by the presence of TcoFs. While the role of TFs and their interaction with regulatory DNA regions has been well-studied, the association between TFs and TcoFs has so far been given less attention. Here, we present a resource that is comprised of a collection of human TFs and the TcoFs with which they interact. Other proteins that have a proven interaction with a TF, but are not considered TcoFs are also included. Our database contains 157 high-confidence TcoFs and additionally 379 hypothetical TcoFs. These have been identified and classified according to the type of available evidence for their involvement in transcriptional regulation and their presence in the cell nucleus. We have divided TcoFs into four groups, one of which contains high-confidence TcoFs and three others contain TcoFs which are hypothetical to different extents. We have developed the Dragon Database for Human Transcription Co-Factors and Transcription Factor Interacting Proteins (TcoF-DB). A web-based interface for this resource can be freely accessed at http://cbrc.kaust.edu.sa/tcof/ and http://apps.sanbi.ac.za/tcof/. © The Author(s) 2010.

  14. TcoF-DB: dragon database for human transcription co-factors and transcription factor interacting proteins

    KAUST Repository

    Schaefer, Ulf; Schmeier, Sebastian; Bajic, Vladimir B.

    2010-01-01

    The initiation and regulation of transcription in eukaryotes is complex and involves a large number of transcription factors (TFs), which are known to bind to the regulatory regions of eukaryotic DNA. Apart from TF-DNA binding, protein-protein interaction involving TFs is an essential component of the machinery facilitating transcriptional regulation. Proteins that interact with TFs in the context of transcription regulation but do not bind to the DNA themselves, we consider transcription co-factors (TcoFs). The influence of TcoFs on transcriptional regulation and initiation, although indirect, has been shown to be significant with the functionality of TFs strongly influenced by the presence of TcoFs. While the role of TFs and their interaction with regulatory DNA regions has been well-studied, the association between TFs and TcoFs has so far been given less attention. Here, we present a resource that is comprised of a collection of human TFs and the TcoFs with which they interact. Other proteins that have a proven interaction with a TF, but are not considered TcoFs are also included. Our database contains 157 high-confidence TcoFs and additionally 379 hypothetical TcoFs. These have been identified and classified according to the type of available evidence for their involvement in transcriptional regulation and their presence in the cell nucleus. We have divided TcoFs into four groups, one of which contains high-confidence TcoFs and three others contain TcoFs which are hypothetical to different extents. We have developed the Dragon Database for Human Transcription Co-Factors and Transcription Factor Interacting Proteins (TcoF-DB). A web-based interface for this resource can be freely accessed at http://cbrc.kaust.edu.sa/tcof/ and http://apps.sanbi.ac.za/tcof/. © The Author(s) 2010.

  15. The PMDB Protein Model Database

    Science.gov (United States)

    Castrignanò, Tiziana; De Meo, Paolo D'Onorio; Cozzetto, Domenico; Talamo, Ivano Giuseppe; Tramontano, Anna

    2006-01-01

    The Protein Model Database (PMDB) is a public resource aimed at storing manually built 3D models of proteins. The database is designed to provide access to models published in the scientific literature, together with validating experimental data. It is a relational database and it currently contains >74 000 models for ∼240 proteins. The system is accessible at and allows predictors to submit models along with related supporting evidence and users to download them through a simple and intuitive interface. Users can navigate in the database and retrieve models referring to the same target protein or to different regions of the same protein. Each model is assigned a unique identifier that allows interested users to directly access the data. PMID:16381873

  16. InterAction Database (IADB)

    Science.gov (United States)

    The InterAction Database includes demographic and prescription information for more than 500,000 patients in the northern and middle Netherlands and has been integrated with other systems to enhance data collection and analysis.

  17. Interactive bibliographical database on color

    Science.gov (United States)

    Caivano, Jose L.

    2002-06-01

    The paper describes the methodology and results of a project under development, aimed at the elaboration of an interactive bibliographical database on color in all fields of application: philosophy, psychology, semiotics, education, anthropology, physical and natural sciences, biology, medicine, technology, industry, architecture and design, arts, linguistics, geography, history. The project is initially based upon an already developed bibliography, published in different journals, updated in various opportunities, and now available at the Internet, with more than 2,000 entries. The interactive database will amplify that bibliography, incorporating hyperlinks and contents (indexes, abstracts, keywords, introductions, or eventually the complete document), and devising mechanisms for information retrieval. The sources to be included are: books, doctoral dissertations, multimedia publications, reference works. The main arrangement will be chronological, but the design of the database will allow rearrangements or selections by different fields: subject, Decimal Classification System, author, language, country, publisher, etc. A further project is to develop another database, including color-specialized journals or newsletters, and articles on color published in international journals, arranged in this case by journal name and date of publication, but allowing also rearrangements or selections by author, subject and keywords.

  18. SpirPro: A Spirulina proteome database and web-based tools for the analysis of protein-protein interactions at the metabolic level in Spirulina (Arthrospira) platensis C1.

    Science.gov (United States)

    Senachak, Jittisak; Cheevadhanarak, Supapon; Hongsthong, Apiradee

    2015-07-29

    Spirulina (Arthrospira) platensis is the only cyanobacterium that in addition to being studied at the molecular level and subjected to gene manipulation, can also be mass cultivated in outdoor ponds for commercial use as a food supplement. Thus, encountering environmental changes, including temperature stresses, is common during the mass production of Spirulina. The use of cyanobacteria as an experimental platform, especially for photosynthetic gene manipulation in plants and bacteria, is becoming increasingly important. Understanding the mechanisms and protein-protein interaction networks that underlie low- and high-temperature responses is relevant to Spirulina mass production. To accomplish this goal, high-throughput techniques such as OMICs analyses are used. Thus, large datasets must be collected, managed and subjected to information extraction. Therefore, databases including (i) proteomic analysis and protein-protein interaction (PPI) data and (ii) domain/motif visualization tools are required for potential use in temperature response models for plant chloroplasts and photosynthetic bacteria. A web-based repository was developed including an embedded database, SpirPro, and tools for network visualization. Proteome data were analyzed integrated with protein-protein interactions and/or metabolic pathways from KEGG. The repository provides various information, ranging from raw data (2D-gel images) to associated results, such as data from interaction and/or pathway analyses. This integration allows in silico analyses of protein-protein interactions affected at the metabolic level and, particularly, analyses of interactions between and within the affected metabolic pathways under temperature stresses for comparative proteomic analysis. The developed tool, which is coded in HTML with CSS/JavaScript and depicted in Scalable Vector Graphics (SVG), is designed for interactive analysis and exploration of the constructed network. SpirPro is publicly available on the web

  19. MIPS: a database for protein sequences and complete genomes.

    Science.gov (United States)

    Mewes, H W; Hani, J; Pfeiffer, F; Frishman, D

    1998-01-01

    The MIPS group [Munich Information Center for Protein Sequences of the German National Center for Environment and Health (GSF)] at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, is involved in a number of data collection activities, including a comprehensive database of the yeast genome, a database reflecting the progress in sequencing the Arabidopsis thaliana genome, the systematic analysis of other small genomes and the collection of protein sequence data within the framework of the PIR-International Protein Sequence Database (described elsewhere in this volume). Through its WWW server (http://www.mips.biochem.mpg.de) MIPS provides access to a variety of generic databases, including a database of protein families as well as automatically generated data by the systematic application of sequence analysis algorithms. The yeast genome sequence and its related information was also compiled on CD-ROM to provide dynamic interactive access to the 16 chromosomes of the first eukaryotic genome unraveled. PMID:9399795

  20. [Validation of interaction databases in psychopharmacotherapy].

    Science.gov (United States)

    Hahn, M; Roll, S C

    2018-03-01

    Drug-drug interaction databases are an important tool to increase drug safety in polypharmacy. There are several drug interaction databases available but it is unclear which one shows the best results and therefore increases safety for the user of the databases and the patients. So far, there has been no validation of German drug interaction databases. Validation of German drug interaction databases regarding the number of hits, mechanisms of drug interaction, references, clinical advice, and severity of the interaction. A total of 36 drug interactions which were published in the last 3-5 years were checked in 5 different databases. Besides the number of hits, it was also documented if the mechanism was correct, clinical advice was given, primary literature was cited, and the severity level of the drug-drug interaction was given. All databases showed weaknesses regarding the hit rate of the tested drug interactions, with a maximum of 67.7% hits. The highest score in this validation was achieved by MediQ with 104 out of 180 points. PsiacOnline achieved 83 points, arznei-telegramm® 58, ifap index® 54 and the ABDA-database 49 points. Based on this validation MediQ seems to be the most suitable databank for the field of psychopharmacotherapy. The best results in this comparison were achieved by MediQ but this database also needs improvement with respect to the hit rate so that the users can rely on the results and therefore increase drug therapy safety.
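
    The point totals reported above are easier to compare as shares of the 180-point maximum. The Python sketch below only restates the figures from the abstract; the reading of 180 as 36 interactions times 5 criteria is an assumption and is noted as such in the code.

```python
# Points reported in the abstract above, out of a maximum of 180.
# Assumption: 180 = 36 tested interactions x 5 criteria; the abstract
# gives the total but does not spell out this breakdown.
MAX_POINTS = 180

points = {
    "MediQ": 104,
    "PsiacOnline": 83,
    "arznei-telegramm": 58,
    "ifap index": 54,
    "ABDA-database": 49,
}


def share_of_maximum(score, maximum=MAX_POINTS):
    """Return the score as a percentage of the maximum, rounded to one decimal."""
    return round(100 * score / maximum, 1)


shares = {name: share_of_maximum(score) for name, score in points.items()}
ranking = sorted(points, key=points.get, reverse=True)

for name in ranking:
    print(f"{name}: {points[name]}/{MAX_POINTS} = {shares[name]}%")
```

    On these figures even the top-ranked database reaches under 60% of the available points, which matches the authors' remark that it still needs improvement.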

  1. Improving decoy databases for protein folding algorithms

    KAUST Repository

    Lindsey, Aaron

    2014-01-01

    Copyright © 2014 ACM. Predicting protein structures and simulating protein folding are two of the most important problems in computational biology today. Simulation methods rely on a scoring function to distinguish the native structure (the most energetically stable) from non-native structures. Decoy databases are collections of non-native structures used to test and verify these functions. We present a method to evaluate and improve the quality of decoy databases by adding novel structures and removing redundant structures. We test our approach on 17 different decoy databases of varying size and type and show significant improvement across a variety of metrics. We also test our improved databases on a popular modern scoring function and show that they contain a greater number of native-like structures than the original databases, thereby producing a more rigorous database for testing scoring functions.

  2. Aquaporin Protein-Protein Interactions

    Directory of Open Access Journals (Sweden)

    Jennifer Virginia Roche

    2017-10-01

    Full Text Available Aquaporins are tetrameric membrane-bound channels that facilitate transport of water and other small solutes across cell membranes. In eukaryotes, they are frequently regulated by gating or trafficking, allowing for the cell to control membrane permeability in a specific manner. Protein–protein interactions play crucial roles in both regulatory processes and also mediate alternative functions such as cell adhesion. In this review, we summarize recent knowledge about aquaporin protein–protein interactions; dividing the interactions into three types: (1) interactions between aquaporin tetramers; (2) interactions between aquaporin monomers within a tetramer (hetero-tetramerization); and (3) transient interactions with regulatory proteins. We particularly focus on the structural aspects of the interactions, discussing the small differences within a conserved overall fold that allow for aquaporins to be differentially regulated in an organism-, tissue- and trigger-specific manner. A deep knowledge about these differences is needed to fully understand aquaporin function and regulation in many physiological processes, and may enable design of compounds targeting specific aquaporins for treatment of human disease.

  3. Proteomics: Protein Identification Using Online Databases

    Science.gov (United States)

    Eurich, Chris; Fields, Peter A.; Rice, Elizabeth

    2012-01-01

    Proteomics is an emerging area of systems biology that allows simultaneous study of thousands of proteins expressed in cells, tissues, or whole organisms. We have developed this activity to enable high school or college students to explore proteomic databases using mass spectrometry data files generated from yeast proteins in a college laboratory…

  4. DB-PABP: a database of polyanion-binding proteins.

    Science.gov (United States)

    Fang, Jianwen; Dong, Yinghua; Salamat-Miller, Nazila; Middaugh, C Russell

    2008-01-01

    The interactions between polyanions (PAs) and polyanion-binding proteins (PABPs) have been found to play significant roles in many essential biological processes including intracellular organization, transport and protein folding. Furthermore, many neurodegenerative disease-related proteins are PABPs. Thus, a better understanding of PA/PABP interactions may not only enhance our understandings of biological systems but also provide new clues to these deadly diseases. The literature in this field is widely scattered, suggesting the need for a comprehensive and searchable database of PABPs. The DB-PABP is a comprehensive, manually curated and searchable database of experimentally characterized PABPs. It is freely available and can be accessed online at http://pabp.bcf.ku.edu/DB_PABP/. The DB-PABP was implemented as a MySQL relational database. An interactive web interface was created using Java Server Pages (JSP). The search page of the database is organized into a main search form and a section for utilities. The main search form enables custom searches via four menus: protein names, polyanion names, the source species of the proteins and the methods used to discover the interactions. Available utilities include a commonality matrix, a function of listing PABPs by the number of interacting polyanions and a string search for author surnames. The DB-PABP is maintained at the University of Kansas. We encourage users to provide feedback and submit new data and references.

  5. Our interests in protein-protein interactions

    Indian Academy of Sciences (India)

    protein interactions. Evolution of P-P partnerships. Evolution of P-P structures. Evolutionary dynamics of P-P interactions. Dynamics of P-P interaction network. Host-pathogen interactions. CryoEM mapping of gigantic protein assemblies.

  6. The Protein Identifier Cross-Referencing (PICR) service: reconciling protein identifiers across multiple source databases

    Directory of Open Access Journals (Sweden)

    Leinonen Rasko

    2007-10-01

    Full Text Available Abstract Background Each major protein database uses its own conventions when assigning protein identifiers. Resolving the various, potentially unstable, identifiers that refer to identical proteins is a major challenge. This is a common problem when attempting to unify datasets that have been annotated with proteins from multiple data sources or querying data providers with one flavour of protein identifiers when the source database uses another. Partial solutions for protein identifier mapping exist but they are limited to specific species or techniques and to a very small number of databases. As a result, we have not found a solution that is generic enough and broad enough in mapping scope to suit our needs. Results We have created the Protein Identifier Cross-Reference (PICR) service, a web application that provides interactive and programmatic (SOAP and REST) access to a mapping algorithm that uses the UniProt Archive (UniParc) as a data warehouse to offer protein cross-references based on 100% sequence identity to proteins from over 70 distinct source databases loaded into UniParc. Mappings can be limited by source database, taxonomic ID and activity status in the source database. Users can copy/paste or upload files containing protein identifiers or sequences in FASTA format to obtain mappings using the interactive interface. Search results can be viewed in simple or detailed HTML tables or downloaded as comma-separated values (CSV) or Microsoft Excel (XLS) files suitable for use in a local database or a spreadsheet. Alternatively, a SOAP interface is available to integrate PICR functionality in other applications, as is a lightweight REST interface. Conclusion We offer a publicly available service that can interactively map protein identifiers and protein sequences to the majority of commonly used protein databases. Programmatic access is available through a standards-compliant SOAP interface or a lightweight REST interface. The PICR
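
    The mapping principle described in this abstract, cross-referencing identifiers whose sequences are 100% identical, can be sketched in a few lines of Python. This is an illustration only and not the PICR implementation or its API: the database names, identifiers, sequences and helper functions are all hypothetical, and a SHA-256 digest stands in for whatever sequence key the underlying archive actually uses.

```python
import hashlib
from collections import defaultdict


def sequence_key(sequence):
    """Digest of the normalized sequence; identical sequences share one key."""
    normalized = "".join(sequence.split()).upper()
    return hashlib.sha256(normalized.encode("ascii")).hexdigest()


def cross_reference(records):
    """Group (database, identifier, sequence) records by exact sequence identity."""
    groups = defaultdict(list)
    for database, identifier, sequence in records:
        groups[sequence_key(sequence)].append((database, identifier))
    return groups


def map_identifier(records, query_id, databases=None):
    """Return identifiers sharing the query's sequence, optionally limited by database."""
    for members in cross_reference(records).values():
        if any(identifier == query_id for _, identifier in members):
            return sorted(
                (db, ident)
                for db, ident in members
                if ident != query_id and (databases is None or db in databases)
            )
    return []


# Hypothetical records: three entries share one sequence, one differs by a residue.
records = [
    ("DB_A", "A0001", "MKTAYIAKQR"),
    ("DB_B", "B_17", "mktayiakqr"),
    ("DB_C", "C-9", "MKTAYIAKQR"),
    ("DB_B", "B_18", "MKTAYIAKQK"),
]

print(map_identifier(records, "A0001"))
print(map_identifier(records, "A0001", databases={"DB_C"}))
```

    Keying on the full sequence means a single-residue difference yields no mapping, which mirrors the strict 100% identity criterion; the optional `databases` argument mimics limiting mappings by source database.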

  7. GPCR & company: databases and servers for GPCRs and interacting partners.

    Science.gov (United States)

    Kowalsman, Noga; Niv, Masha Y

    2014-01-01

    G-protein-coupled receptors (GPCRs) are a large superfamily of membrane receptors that are involved in a wide range of signaling pathways. To fulfill their tasks, GPCRs interact with a variety of partners, including small molecules, lipids and proteins. They are accompanied by different proteins during all phases of their life cycle. Therefore, GPCR interactions with their partners are of great interest in basic cell-signaling research and in drug discovery. Due to the rapid development of computers and internet communication, knowledge and data can be easily shared within the worldwide research community via freely available databases and servers. These provide an abundance of biological, chemical and pharmacological information. This chapter describes the available web resources for investigating GPCR interactions. We review about 40 freely available databases and servers, and provide a few sentences about the essence and the data they supply. For simplification, the databases and servers were grouped under the following topics: general GPCR-ligand interactions; particular families of GPCRs and their ligands; GPCR oligomerization; GPCR interactions with intracellular partners; and structural information on GPCRs. In conclusion, a multitude of useful tools are currently available. Summary tables are provided to ease navigation between the numerous and partially overlapping resources. Suggestions for future enhancements of the online tools include the addition of links from general to specialized databases and enabling usage of user-supplied template for GPCR structural modeling.

  8. Protein Annotation from Protein Interaction Networks and Gene Ontology

    OpenAIRE

    Nguyen, Cao D.; Gardiner, Katheleen J.; Cios, Krzysztof J.

    2011-01-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precis...

  9. Development of human protein reference database as an initial platform for approaching systems biology in humans

    DEFF Research Database (Denmark)

    Peri, Suraj; Navarro, J Daniel; Amanchy, Ramars

    2003-01-01

    Human Protein Reference Database (HPRD) is an object database that integrates a wealth of information relevant to the function of human proteins in health and disease. Data pertaining to thousands of protein-protein interactions, posttranslational modifications, enzyme/substrate relationships...

  10. Discerning molecular interactions: A comprehensive review on biomolecular interaction databases and network analysis tools.

    Science.gov (United States)

    Miryala, Sravan Kumar; Anbarasu, Anand; Ramaiah, Sudha

    2018-02-05

    Computational analysis of biomolecular interaction networks is gaining importance as a way to understand the functions of novel genes/proteins. Gene interaction (GI) network analysis and protein-protein interaction (PPI) network analysis play a major role in predicting the functionality of interacting genes or proteins and give an insight into the functional relationships and evolutionary conservation of interactions among the genes. An interaction network is a graphical representation of the gene/protein interactome, where each gene/protein is a node, and each interaction between genes/proteins is an edge. In this review, we discuss the popular open source databases that serve as data repositories to search and collect protein/gene interaction data, and also the tools available for interaction network generation, visualization and network analysis. Various network analysis approaches, such as the topological approach and the clustering approach to study network properties, and functional enrichment servers that illustrate the functions and pathways of the genes and proteins are also discussed. Hence the distinctive attribute of this review is not only to provide an overview of tools and web servers for gene and protein-protein interaction (PPI) network analysis but also to show how to extract useful and meaningful information from the interaction networks. Copyright © 2017 Elsevier B.V. All rights reserved.
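
    The node-and-edge representation described in this abstract can be sketched in a few lines of Python. This is an illustrative example only, not taken from the review: the protein names are hypothetical, and node degree is used as the simplest topological measure.

```python
from collections import defaultdict


def build_network(interactions):
    """Build an undirected adjacency map from (protein_a, protein_b) pairs."""
    adjacency = defaultdict(set)
    for a, b in interactions:
        if a == b:
            continue  # ignore self-interactions in this simple sketch
        adjacency[a].add(b)
        adjacency[b].add(a)
    return adjacency


def degree_ranking(adjacency):
    """Rank nodes by degree (number of partners), highest first."""
    degrees = ((node, len(partners)) for node, partners in adjacency.items())
    return sorted(degrees, key=lambda item: (-item[1], item[0]))


# Hypothetical interaction list: each pair is one edge of the network.
interactions = [("P1", "P2"), ("P1", "P3"), ("P1", "P4"), ("P2", "P3")]
network = build_network(interactions)
ranking = degree_ranking(network)
print(ranking)
```

    Nodes at the top of such a ranking are candidate hubs; clustering and enrichment analyses of the kind the review covers would start from the same adjacency structure.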

  11. cuticleDB: a relational database of Arthropod cuticular proteins

    Directory of Open Access Journals (Sweden)

    Willis Judith H

    2004-09-01

    Full Text Available Abstract Background The insect exoskeleton or cuticle is a bi-partite composite of proteins and chitin that provides protective, skeletal and structural functions. Little information is available about the molecular structure of this important complex that exhibits a helicoidal architecture. Scores of sequences of cuticular proteins have been obtained from direct protein sequencing, from cDNAs, and from genomic analyses. Most of these cuticular protein sequences contain motifs found only in arthropod proteins. Description cuticleDB is a relational database containing all structural proteins of Arthropod cuticle identified to date. Many come from direct sequencing of proteins isolated from cuticle and from sequences from cDNAs that share common features with these authentic cuticular proteins. It also includes proteins from the Drosophila melanogaster and the Anopheles gambiae genomes that have been predicted to be cuticular proteins, based on a Pfam motif (PF00379) responsible for chitin binding in Arthropod cuticle. The total number of the database entries is 445: 370 derive from insects, 60 from Crustacea and 15 from Chelicerata. The database can be accessed from our web server at http://bioinformatics.biol.uoa.gr/cuticleDB. Conclusions CuticleDB was primarily designed to contain correct and full annotation of cuticular protein data. The database will be of help to future genome annotators. Users will be able to test hypotheses for the existence of known and also of yet unknown motifs in cuticular proteins. An analysis of motifs may contribute to understanding how proteins contribute to the physical properties of cuticle as well as to the precise nature of their interaction with chitin.

  12. Oligomeric protein structure networks: insights into protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Brinda KV

    2005-12-01

    Full Text Available Abstract Background Protein-protein association is essential for a variety of cellular processes and hence a large number of investigations are being carried out to understand the principles of protein-protein interactions. In this study, oligomeric protein structures are viewed from a network perspective to obtain new insights into protein association. Structure graphs of proteins have been constructed from a non-redundant set of protein oligomer crystal structures by considering amino acid residues as nodes and the edges are based on the strength of the non-covalent interactions between the residues. The analysis of such networks has been carried out in terms of amino acid clusters and hubs (highly connected residues) with special emphasis on protein interfaces. Results A variety of interactions such as hydrogen bonds, salt bridges, aromatic and hydrophobic interactions, which occur at the interfaces, are identified in a consolidated manner as amino acid clusters at the interface. Moreover, the characterization of the highly connected hub-forming residues at the interfaces and their comparison with the hubs from the non-interface regions and the non-hubs in the interface regions show that there is a predominance of charged interactions at the interfaces. Further, strong and weak interfaces are identified on the basis of the interaction strength between amino acid residues and the sizes of the interface clusters, which also show that many protein interfaces are stronger than their monomeric protein cores. The interface strengths evaluated based on the interface clusters and hubs also correlate well with experimentally determined dissociation constants for known complexes. Finally, the interface hubs identified using the present method correlate very well with experimentally determined hotspots in the interfaces of protein complexes obtained from the Alanine Scanning Energetics database (ASEdb). A few predictions of interface hot

  13. Drug interaction databases in medical literature

    DEFF Research Database (Denmark)

    Kongsholm, Gertrud Gansmo; Nielsen, Anna Katrine Toft; Damkier, Per

    2015-01-01

    PURPOSE: It is well documented that drug-drug interaction databases (DIDs) differ substantially with respect to classification of drug-drug interactions (DDIs). The aim of this study was to study online available transparency of ownership, funding, information, classifications, staff training...... available transparency of ownership, funding, information, classifications, staff training, and underlying documentation varies substantially among various DIDs. Open access DIDs had a statistically lower score on parameters assessed....... and the three most commonly used subscription DIDs in the medical literature. The following parameters were assessed for each of the databases: Ownership, classification of interactions, primary information sources, and staff qualification. We compared the overall proportion of yes/no answers from open access...

  14. Protein structure database search and evolutionary classification.

    Science.gov (United States)

    Yang, Jinn-Moon; Tung, Chi-Hua

    2006-01-01

    As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at http://3d-blast.life.nctu.edu.tw].

  15. Interactive protein manipulation

    Energy Technology Data Exchange (ETDEWEB)

    SNCrivelli@lbl.gov

    2003-07-01

    We describe an interactive visualization and modeling program for the creation of protein structures "from scratch". The input to our program is an amino acid sequence (decoded from a gene) and a sequence of predicted secondary structure types for each amino acid (provided by external structure prediction programs). Our program can be used in the set-up phase of a protein structure prediction process; the structures created with it serve as input for a subsequent global internal energy minimization, or another method of protein structure prediction. Our program supports basic visualization methods for protein structures, interactive manipulation based on inverse kinematics, and visualization guides to aid a user in creating "good" initial structures.

  16. Interactive protein manipulation

    International Nuclear Information System (INIS)

    2003-01-01

    We describe an interactive visualization and modeling program for the creation of protein structures "from scratch". The input to our program is an amino acid sequence (decoded from a gene) and a sequence of predicted secondary structure types for each amino acid (provided by external structure prediction programs). Our program can be used in the set-up phase of a protein structure prediction process; the structures created with it serve as input for a subsequent global internal energy minimization, or another method of protein structure prediction. Our program supports basic visualization methods for protein structures, interactive manipulation based on inverse kinematics, and visualization guides to aid a user in creating "good" initial structures.

  17. Bioinformatic Prediction of WSSV-Host Protein-Protein Interaction

    Directory of Open Access Journals (Sweden)

    Zheng Sun

    2014-01-01

    Full Text Available WSSV is one of the most dangerous pathogens in shrimp aquaculture. However, the molecular mechanism of how WSSV interacts with shrimp is still not very clear. In the present study, bioinformatic approaches were used to predict interactions between proteins from WSSV and shrimp. The genome data of WSSV (NC_003225.1) and the constructed transcriptome data of F. chinensis were used to screen potentially interacting proteins by searching in protein interaction databases, including STRING, Reactome, and DIP. Forty-four pairs of proteins were suggested to have interactions between WSSV and the shrimp. Gene ontology analysis revealed that 6 pairs of these interacting proteins were classified into “extracellular region” or “receptor complex” GO-terms. KEGG pathway analysis showed that they were involved in the “ECM-receptor interaction pathway.” In the 6 pairs of interacting proteins, an envelope protein called “collagen-like protein” (WSSV-CLP) encoded by an early virus gene “wsv001” in WSSV interacted with 6 deduced proteins from the shrimp, including three integrin alpha (ITGA), two integrin beta (ITGB), and one syndecan (SDC). Sequence analysis on WSSV-CLP, ITGA, ITGB, and SDC revealed that they possessed the sequence features for protein-protein interactions. This study might provide new insights into the interaction mechanisms between WSSV and shrimp.

  18. RAIN: RNA-protein Association and Interaction Networks

    DEFF Research Database (Denmark)

    Junge, Alexander; Refsgaard, Jan Christian; Garde, Christian

    2017-01-01

    is challenging due to data heterogeneity. Here, we present a database of ncRNA-RNA and ncRNA-protein interactions and its integration with the STRING database of protein-protein interactions. These ncRNA associations cover four organisms and have been established from curated examples, experimental data...

  19. A protein relational database and protein family knowledge bases to facilitate structure-based design analyses.

    Science.gov (United States)

    Mobilio, Dominick; Walker, Gary; Brooijmans, Natasja; Nilakantan, Ramaswamy; Denny, R Aldrin; Dejoannis, Jason; Feyfant, Eric; Kowticwar, Rupesh K; Mankala, Jyoti; Palli, Satish; Punyamantula, Sairam; Tatipally, Maneesh; John, Reji K; Humblet, Christine

    2010-08-01

    The Protein Data Bank is the most comprehensive source of experimental macromolecular structures. It can, however, be difficult at times to locate relevant structures with the Protein Data Bank search interface. This is particularly true when searching for complexes containing specific interactions between protein and ligand atoms. Moreover, searching within a family of proteins can be tedious. For example, one cannot search for some conserved residue as residue numbers vary across structures. We describe herein three databases, Protein Relational Database, Kinase Knowledge Base, and Matrix Metalloproteinase Knowledge Base, containing protein structures from the Protein Data Bank. In Protein Relational Database, atom-atom distances between protein and ligand have been precalculated allowing for millisecond retrieval based on atom identity and distance constraints. Ring centroids, centroid-centroid and centroid-atom distances and angles have also been included permitting queries for pi-stacking interactions and other structural motifs involving rings. Other geometric features can be searched through the inclusion of residue pair and triplet distances. In Kinase Knowledge Base and Matrix Metalloproteinase Knowledge Base, the catalytic domains have been aligned into common residue numbering schemes. Thus, by searching across Protein Relational Database and Kinase Knowledge Base, one can easily retrieve structures wherein, for example, a ligand of interest is making contact with the gatekeeper residue.
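
    Why precalculated distances make such queries fast can be shown with a small sketch using Python's built-in `sqlite3` module. The table layout, column names, atom labels and distance values below are hypothetical and are not the schema of Protein Relational Database; the point is only that a distance constraint becomes an indexed range lookup instead of a geometric computation at query time.

```python
import sqlite3

# Hypothetical schema: one row per precomputed protein-ligand atom pair.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE contacts ("
    "structure_id TEXT, protein_atom TEXT, ligand_atom TEXT, distance REAL)"
)
conn.executemany(
    "INSERT INTO contacts VALUES (?, ?, ?, ?)",
    [
        ("S1", "N", "O", 2.9),
        ("S1", "CA", "C", 4.1),
        ("S2", "N", "O", 3.6),
        ("S3", "N", "O", 2.7),
    ],
)
# The index lets the engine answer atom-identity plus distance queries
# without scanning every stored pair.
conn.execute(
    "CREATE INDEX idx_contacts ON contacts (protein_atom, ligand_atom, distance)"
)


def find_structures(conn, protein_atom, ligand_atom, max_distance):
    """Return structures with the given atom pair within max_distance."""
    rows = conn.execute(
        "SELECT DISTINCT structure_id FROM contacts "
        "WHERE protein_atom = ? AND ligand_atom = ? AND distance <= ? "
        "ORDER BY structure_id",
        (protein_atom, ligand_atom, max_distance),
    )
    return [row[0] for row in rows]


hits = find_structures(conn, "N", "O", 3.0)
print(hits)
```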

  20. Thermodynamic database for proteins: features and applications.

    Science.gov (United States)

    Gromiha, M Michael; Sarai, Akinori

    2010-01-01

    We have developed a thermodynamic database for proteins and mutants, ProTherm, which is a collection of a large number of thermodynamic data on protein stability along with the sequence and structure information, experimental methods and conditions, and literature information. This is a valuable resource for understanding/predicting the stability of proteins, and it is accessible at http://www.gibk26.bse.kyutech.ac.jp/jouhou/Protherm/protherm.html . ProTherm has several features including various search, display, and sorting options and visualization tools. We have analyzed the data in ProTherm to examine the relationship among thermodynamics, structure, and function of proteins. We describe the progress on the development of methods for understanding/predicting protein stability, such as (i) relationship between the stability of protein mutants and amino acid properties, (ii) average assignment method, (iii) empirical energy functions, (iv) torsion, distance, and contact potentials, and (v) machine learning techniques. The list of online resources for predicting protein stability has also been provided.

  1. Interactive Exploration for Continuously Expanding Neuron Databases.

    Science.gov (United States)

    Li, Zhongyu; Metaxas, Dimitris N; Lu, Aidong; Zhang, Shaoting

    2017-02-15

    This paper proposes a novel framework to help biologists explore and analyze neurons based on retrieval of data from neuron morphological databases. In recent years, the continuously expanding neuron databases have provided a rich source of information to associate neuronal morphologies with their functional properties. We design a coarse-to-fine framework for efficient and effective data retrieval from large-scale neuron databases. At the coarse level, for efficiency at large scale, we employ a binary coding method to compress morphological features into binary codes of tens of bits. Short binary codes allow for real-time similarity searching in Hamming space. Because the neuron databases are continuously expanding, it is inefficient to re-train the binary coding model from scratch when adding new neurons. To solve this problem, we extend binary coding with online updating schemes, which consider only the newly added neurons and update the model on-the-fly, without accessing the whole neuron database. At the fine-grained level, we introduce domain experts/users into the framework, who can give relevance feedback on the binary-coding-based retrieval results. This interactive strategy can improve the retrieval performance by re-ranking the coarse results, for which we design a new similarity measure that takes the feedback into account. Our framework is validated on more than 17,000 neuron cells, showing promising retrieval accuracy and efficiency. Moreover, we demonstrate its use case in assisting biologists to identify and explore unknown neurons. Copyright © 2017 Elsevier Inc. All rights reserved.
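
    The coarse-level step, similarity search over short binary codes in Hamming space, can be sketched as follows. This is a minimal illustration, not the authors' implementation: the 8-bit codes and neuron names are made up, and the learning of the codes and the online update scheme are not shown.

```python
def hamming_distance(a: int, b: int) -> int:
    """Count the bit positions at which two binary codes differ."""
    return bin(a ^ b).count("1")


def nearest_codes(query: int, database: dict, k: int = 2) -> list:
    """Return the names of the k codes closest to the query in Hamming space."""
    ranked = sorted(
        database.items(),
        key=lambda item: (hamming_distance(query, item[1]), item[0]),
    )
    return [name for name, _ in ranked[:k]]


# Hypothetical 8-bit codes standing in for compressed morphological features.
database = {
    "neuron_a": 0b10110010,
    "neuron_b": 0b10110011,
    "neuron_c": 0b01001101,
}
print(nearest_codes(0b10110000, database))
```

    Because the distance is a single XOR followed by a bit count, even a linear scan over many thousands of codes is cheap, which is what makes real-time coarse retrieval plausible before any finer re-ranking.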

  2. Data management of protein interaction networks

    CERN Document Server

    Cannataro, Mario

    2012-01-01

    Interactomics: a complete survey from data generation to knowledge extraction. With the increasing use of high-throughput experimental assays, more and more protein interaction databases are becoming available. As a result, computational analysis of protein-to-protein interaction (PPI) data and networks, now known as interactomics, has become an essential tool to determine functionally associated proteins. From wet lab technologies to data management to knowledge extraction, this timely book guides readers through the new science of interactomics, giving them the tools needed to: Generate

  3. MVP: a microbe-phage interaction database.

    Science.gov (United States)

    Gao, Na L; Zhang, Chengwei; Zhang, Zhanbing; Hu, Songnian; Lercher, Martin J; Zhao, Xing-Ming; Bork, Peer; Liu, Zhi; Chen, Wei-Hua

    2018-01-04

    Phages invade microbes, accomplish host lysis and are of vital importance in shaping the community structure of environmental microbiota. More importantly, most phages have very specific hosts; they are thus ideal tools to manipulate environmental microbiota at species-resolution. The main purpose of MVP (Microbe Versus Phage) is to provide a comprehensive catalog of phage-microbe interactions and assist users to select phage(s) that can target (and potentially to manipulate) specific microbes of interest. We first collected 50 782 viral sequences from various sources and clustered them into 33 097 unique viral clusters based on sequence similarity. We then identified 26 572 interactions between 18 608 viral clusters and 9245 prokaryotes (i.e. bacteria and archaea); we established these interactions based on 30 321 evidence entries that we collected from published datasets, public databases and re-analysis of genomic and metagenomic sequences. Based on these interactions, we calculated the host range for each of the phage clusters and accordingly grouped them into subgroups such as 'species-', 'genus-' and 'family-' specific phage clusters. MVP is equipped with a modern, responsive and intuitive interface, and is freely available at: http://mvp.medgenius.info. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.
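
    The grouping of phage clusters by host range can be illustrated with a short Python sketch. The rule used here (a cluster is assigned the lowest taxonomic rank shared by all of its known hosts) is an assumption made for illustration and may differ from the criteria MVP actually applies; the function name `host_range_level` and the placeholder taxon names are hypothetical.

```python
def host_range_level(host_lineages):
    """Return the lowest rank shared by all hosts of one phage cluster.

    host_lineages: list of dicts with 'species', 'genus' and 'family' keys.
    Assumed rule: 'species' if every host is the same species, otherwise
    'genus', otherwise 'family', otherwise 'broad'.
    """
    for level in ("species", "genus", "family"):
        if len({host[level] for host in host_lineages}) == 1:
            return level
    return "broad"


# Placeholder taxa: two species of one genus, plus a second genus
# belonging to the same family.
host_1 = {"species": "sp_1", "genus": "genus_A", "family": "family_X"}
host_2 = {"species": "sp_2", "genus": "genus_A", "family": "family_X"}
host_3 = {"species": "sp_3", "genus": "genus_B", "family": "family_X"}

print(host_range_level([host_1]))
print(host_range_level([host_1, host_2]))
print(host_range_level([host_1, host_2, host_3]))
```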

  4. Protein annotation from protein interaction networks and Gene Ontology.

    Science.gov (United States)

    Nguyen, Cao D; Gardiner, Katheleen J; Cios, Krzysztof J

    2011-10-01

    We introduce a novel method for annotating protein function that combines Naïve Bayes and association rules, and takes advantage of the underlying topology in protein interaction networks and the structure of graphs in the Gene Ontology. We apply our method to proteins from the Human Protein Reference Database (HPRD) and show that, in comparison with other approaches, it predicts protein functions with significantly higher recall with no loss of precision. Specifically, it achieves 51% precision and 60% recall versus 45% and 26% for Majority and 24% and 61% for χ²-statistics, respectively. Copyright © 2011 Elsevier Inc. All rights reserved.

  5. Evolution of protein-protein interactions

    Indian Academy of Sciences (India)

    Evolution of protein-protein interactions · Our interests in protein-protein interactions

  6. Toward an interactive article: integrating journals and biological databases

    Directory of Open Access Journals (Sweden)

    Marygold Steven J

    2011-05-01

    Full Text Available Abstract Background Journal articles and databases are two major modes of communication in the biological sciences, and thus integrating these critical resources is of urgent importance to increase the pace of discovery. Projects focused on bridging the gap between journals and databases have been on the rise over the last five years and have resulted in the development of automated tools that can recognize entities within a document and link those entities to a relevant database. Unfortunately, automated tools cannot resolve ambiguities that arise from one term being used to signify entities that are quite distinct from one another. Instead, resolving these ambiguities requires some manual oversight. Finding the right balance between the speed and portability of automation and the accuracy and flexibility of manual effort is a crucial goal to making text markup a successful venture. Results We have established a journal article mark-up pipeline that links GENETICS journal articles and the model organism database (MOD) WormBase. This pipeline uses a lexicon built with entities from the database as a first step. The entity markup pipeline results in links from over nine classes of objects including genes, proteins, alleles, phenotypes and anatomical terms. New entities and ambiguities are discovered and resolved by a database curator through a manual quality control (QC) step, along with help from authors via a web form that is provided to them by the journal. New entities discovered through this pipeline are immediately sent to an appropriate curator at the database. Ambiguous entities that do not automatically resolve to one link are resolved by hand, ensuring an accurate link. This pipeline has been extended to other databases, namely Saccharomyces Genome Database (SGD) and FlyBase, and has been implemented in marking up a paper with links to multiple databases. Conclusions Our semi-automated pipeline hyperlinks articles published in GENETICS to

  7. Protein - AT Atlas | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ..._protein.zip File URL: ftp://ftp.biosciencedbc.jp/archive/at_atlas/LATEST/at_atla...

  8. SCOWLP: a web-based database for detailed characterization and visualization of protein interfaces

    Directory of Open Access Journals (Sweden)

    Schroeder Michael

    2006-03-01

    Full Text Available Abstract Background Currently there is a strong need for methods that help to obtain an accurate description of protein interfaces in order to be able to understand the principles that govern molecular recognition and protein function. Many of the recent efforts to computationally identify and characterize protein networks extract protein interaction information at atomic resolution from the PDB. However, they pay little or no attention to small protein ligands and solvent. They are key components and mediators of protein interactions and fundamental for a complete description of protein interfaces. Interactome profiling requires the development of computational tools to extract and analyze protein-protein, protein-ligand and detailed solvent interaction information from the PDB in an automatic and comparative fashion. Adding this information to the existing one on protein-protein interactions will allow us to better understand protein interaction networks and protein function. Description SCOWLP (Structural Characterization Of Water, Ligands and Proteins) is a user-friendly and publicly accessible web-based relational database for detailed characterization and visualization of the PDB protein interfaces. The SCOWLP database includes proteins, peptidic-ligands and interface water molecules as descriptors of protein interfaces. It currently contains 74,907 protein interfaces and 2,093,976 residue-residue interactions formed by 60,664 structural units (protein domains and peptidic-ligands) and their interacting solvent. The SCOWLP web-server allows detailed structural analysis and comparisons of protein interfaces at atomic level by text query of PDB codes and/or by navigating a SCOP-based tree. It includes a visualization tool to interactively display the interfaces and label interacting residues and interface solvent by atomic physicochemical properties. SCOWLP is automatically updated with every SCOP release. Conclusion SCOWLP enriches

  9. Prediction of Protein-Protein Interactions Related to Protein Complexes Based on Protein Interaction Networks

    Directory of Open Access Journals (Sweden)

    Peng Liu

    2015-01-01

    Full Text Available A method for predicting protein-protein interactions based on detected protein complexes is proposed to repair deficient interactions derived from high-throughput biological experiments. Protein complexes are pruned and decomposed into small parts based on the adaptive k-cores method to predict protein-protein interactions associated with the complexes. The proposed method is adaptive to protein complexes with different structure, number, and size of nodes in a protein-protein interaction network. Based on different complex sets detected by various algorithms, we can obtain different prediction sets of protein-protein interactions. The reliability of the predicted interaction sets is proved by using estimations with statistical tests and direct confirmation of the biological data. In comparison with the approaches which predict the interactions based on the cliques, the overlap of the predictions is small. Similarly, the overlaps among the predicted sets of interactions derived from various complex sets are also small. Thus, every predicted set of interactions may complement and improve the quality of the original network data. Meanwhile, the predictions from the proposed method replenish protein-protein interactions associated with protein complexes using only the network topology.
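
    The abstract relies on an adaptive k-cores method whose details are not given here. As background, the standard (non-adaptive) k-core pruning that such methods build on can be sketched in Python: nodes with fewer than k partners are removed repeatedly until every remaining node has at least k partners. The function name `k_core` and the example complex are hypothetical.

```python
def k_core(adjacency, k):
    """Return the k-core of an undirected graph given as {node: set(neighbors)}.

    Nodes with degree below k are removed repeatedly; the input is not modified.
    """
    core = {node: set(neighbors) for node, neighbors in adjacency.items()}
    changed = True
    while changed:
        changed = False
        low_degree = [node for node, nbrs in core.items() if len(nbrs) < k]
        for node in low_degree:
            # Drop the node and erase it from its remaining neighbors.
            for neighbor in core.pop(node):
                if neighbor in core:
                    core[neighbor].discard(node)
            changed = True
    return core


# Hypothetical complex: a triangle A-B-C with a loosely attached protein D.
adjacency = {
    "A": {"B", "C", "D"},
    "B": {"A", "C"},
    "C": {"A", "B"},
    "D": {"A"},
}
print(sorted(k_core(adjacency, 2)))
```

    With k = 2 the pendant protein D is pruned and the triangle remains; raising k to 3 empties this small graph, which shows why a fixed k rarely suits complexes of different size and density.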

  10. The MAHNOB Mimicry Database - a database of naturalistic human interactions

    NARCIS (Netherlands)

    Bilakhia, Sanjay; Petridis, Stavros; Nijholt, Antinus; Pantic, Maja

    2015-01-01

    People mimic verbal and nonverbal expressions and behaviour of their counterparts in various social interactions. Research in psychology and social sciences has shown that mimicry has the power to influence social judgment and various social behaviours, including negotiation and debating, courtship,

  11. PROXiMATE: a database of mutant protein-protein complex thermodynamics and kinetics.

    Science.gov (United States)

    Jemimah, Sherlyn; Yugandhar, K; Michael Gromiha, M

    2017-09-01

    We have developed PROXiMATE, a database of thermodynamic data for more than 6000 missense mutations in 174 heterodimeric protein-protein complexes, supplemented with interaction network data from STRING database, solvent accessibility, sequence, structural and functional information, experimental conditions and literature information. Additional features include complex structure visualization, search and display options, download options and a provision for users to upload their data. The database is freely available at http://www.iitm.ac.in/bioinfo/PROXiMATE/ . The website is implemented in Python, and supports recent versions of major browsers such as IE10, Firefox, Chrome and Opera. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved.

  12. Protein - Trypanosomes Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available switchLanguage; BLAST Search Image Search Home About Archive Update History Data List Contact us Trypanoso...nhibitor of the protein. Data file File name: trypanosome.zip File URL: ftp://ftp....biosciencedbc.jp/archive/trypanosome/LATEST/trypanosome.zip File size: 1.4 KB Simple search URL http://togo...db.biosciencedbc.jp/togodb/view/trypanosome#en Data acquisition method - Data analysis method - Number of da...ndelian inheritance in Man ) map Location of the gene on a chromosome or its chromosome number pdb PDB ID (P

  13. Improving decoy databases for protein folding algorithms

    KAUST Repository

    Lindsey, Aaron; Yeh, Hsin-Yi (Cindy); Wu, Chih-Peng; Thomas, Shawna; Amato, Nancy M.

    2014-01-01

    energetically stable) from non-native structures. Decoy databases are collections of non-native structures used to test and verify these functions. We present a method to evaluate and improve the quality of decoy databases by adding novel structures and removing

  14. MIPS: a database for genomes and protein sequences.

    Science.gov (United States)

    Mewes, H W; Frishman, D; Güldener, U; Mannhaupt, G; Mayer, K; Mokrejs, M; Morgenstern, B; Münsterkötter, M; Rudd, S; Weil, B

    2002-01-01

    The Munich Information Center for Protein Sequences (MIPS-GSF, Neuherberg, Germany) continues to provide genome-related information in a systematic way. MIPS supports both national and European sequencing and functional analysis projects, develops and maintains automatically generated and manually annotated genome-specific databases, develops systematic classification schemes for the functional annotation of protein sequences, and provides tools for the comprehensive analysis of protein sequences. This report updates the information on the yeast genome (CYGD), the Neurospora crassa genome (MNCDB), the databases for the comprehensive set of genomes (PEDANT genomes), the database of annotated human EST clusters (HIB), the database of complete cDNAs from the DHGP (German Human Genome Project), as well as the project specific databases for the GABI (Genome Analysis in Plants) and HNB (Helmholtz-Netzwerk Bioinformatik) networks. The Arabidopsis thaliana database (MATDB), the database of mitochondrial proteins (MITOP) and our contribution to the PIR International Protein Sequence Database have been described elsewhere [Schoof et al. (2002) Nucleic Acids Res., 30, 91-93; Scharfe et al. (2000) Nucleic Acids Res., 28, 155-158; Barker et al. (2001) Nucleic Acids Res., 29, 29-32]. All databases described, the protein analysis tools provided and the detailed descriptions of our projects can be accessed through the MIPS World Wide Web server (http://mips.gsf.de).

  15. Toxicological relationships between proteins obtained from protein target predictions of large toxicity databases

    International Nuclear Information System (INIS)

    Nigsch, Florian; Mitchell, John B.O.

    2008-01-01

    The combination of models for protein target prediction with large databases containing toxicological information for individual molecules allows the derivation of 'toxicological' profiles, i.e., to what extent are molecules of known toxicity predicted to interact with a set of protein targets. To predict protein targets of drug-like and toxic molecules, we built a computational multiclass model using the Winnow algorithm based on a dataset of protein targets derived from the MDL Drug Data Report. A 15-fold Monte Carlo cross-validation using 50% of each class for training, and the remaining 50% for testing, provided an assessment of the accuracy of that model. We retained the 3 top-ranking predictions and found that in 82% of all cases the correct target was predicted within these three predictions. The first prediction was the correct one in almost 70% of cases. A model built on the whole protein target dataset was then used to predict the protein targets for 150 000 molecules from the MDL Toxicity Database. We analysed the frequency of the predictions across the panel of protein targets for experimentally determined toxicity classes of all molecules. This allowed us to identify clusters of proteins related by their toxicological profiles, as well as toxicities that are related. Literature-based evidence is provided for some specific clusters to show the relevance of the relationships identified.
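
    The evaluation protocol described in this abstract (repeated random splits with half of each class used for training, scored by whether the true target is among the three top-ranked predictions) can be sketched as follows. This is a minimal illustration, not the authors' code: the feature-overlap scorer is a placeholder standing in for the Winnow model, and the toy molecules, feature names and function names are all hypothetical.

```python
import random
from collections import defaultdict


def stratified_half_split(labels, rng):
    """Split indices so that half of each class goes to training, the rest to testing."""
    by_class = defaultdict(list)
    for i, label in enumerate(labels):
        by_class[label].append(i)
    train, test = [], []
    for indices in by_class.values():
        indices = indices[:]
        rng.shuffle(indices)
        half = len(indices) // 2
        train.extend(indices[:half])
        test.extend(indices[half:])
    return train, test


def top_k_hit(scores, true_label, k=3):
    """True if true_label is among the k highest-scoring classes."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    return true_label in ranked[:k]


def monte_carlo_cv(X, y, fit, score, n_rounds=15, k=3, seed=0):
    """Mean top-k accuracy over n_rounds independent random 50/50 splits."""
    rng = random.Random(seed)
    accuracies = []
    for _ in range(n_rounds):
        train, test = stratified_half_split(y, rng)
        model = fit([X[i] for i in train], [y[i] for i in train])
        hits = sum(top_k_hit(score(model, X[i]), y[i], k) for i in test)
        accuracies.append(hits / len(test))
    return sum(accuracies) / len(accuracies)


# Placeholder classifier (NOT Winnow): per-class feature counts.
def fit_overlap(X_train, y_train):
    model = defaultdict(lambda: defaultdict(int))
    for features, label in zip(X_train, y_train):
        for f in features:
            model[label][f] += 1
    return model


def score_overlap(model, features):
    return {label: sum(counts.get(f, 0) for f in features)
            for label, counts in model.items()}


# Hypothetical toy data: 4 target classes, 4 molecules each, each molecule
# carrying one class-specific feature and one molecule-specific feature.
y = [c for c in "ABCD" for _ in range(4)]
X = [{f"{c}_core", f"{c}{i}"} for c in "ABCD" for i in range(4)]

accuracy = monte_carlo_cv(X, y, fit_overlap, score_overlap, n_rounds=15, k=3)
print(f"mean top-3 accuracy over 15 rounds: {accuracy:.2f}")
```

    Any classifier that returns one score per class can be passed in through `fit` and `score`; setting `k=1` gives the first-prediction accuracy that the abstract reports separately.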

  16. Deciphering peculiar protein-protein interacting modules in Deinococcus radiodurans

    Directory of Open Access Journals (Sweden)

    Barkallah Insaf

    2009-04-01

    Full Text Available Abstract Interactomes of proteins under positive selection from ionizing-radiation-resistant bacteria (IRRB) might be a part of the answer to the question as to how IRRB, particularly Deinococcus radiodurans R1 (Deira), resist ionizing radiation. Here, using the Database of Interacting Proteins (DIP) and the Protein Structural Interactome (PSI)-base server for PSI map, we have predicted novel interactions of orthologs of the 58 proteins under positive selection in Deira and other IRRB, but which are absent in IRSB. Among these, 18 domains and their interactomes have been identified in DNA checkpoint and repair; kinases pathways; energy and nucleotide metabolisms were the important biological processes that were found to be involved. This finding provides new clues to the cellular pathways that may be important for ionizing-radiation resistance in Deira.

  17. Interactive searching of facial image databases

    Science.gov (United States)

    Nicholls, Robert A.; Shepherd, John W.; Shepherd, Jean

    1995-09-01

    A set of psychological facial descriptors has been devised to enable computerized searching of criminal photograph albums. The descriptors have been used to encode image databases of up to twelve thousand images. Using a system called FACES, the databases are searched by translating a witness' verbal description into corresponding facial descriptors. Trials of FACES have shown that this coding scheme is more productive and efficient than searching traditional photograph albums. An alternative method of searching the encoded database using a genetic algorithm is currently being tested. The genetic search method does not require the witness to verbalize a description of the target but merely to indicate a degree of similarity between the target and a limited selection of images from the database. The major drawback of FACES is that it requires a manual encoding of images. Research is being undertaken to automate the process; however, it will require an algorithm which can predict human descriptive values. Alternatives to human derived coding schemes exist using statistical classifications of images. Since databases encoded using statistical classifiers do not have an obvious direct mapping to human derived descriptors, a search method which does not require the entry of human descriptors is required. A genetic search algorithm is being tested for such a purpose.

  18. Targeting protein-protein interactions for parasite control.

    Directory of Open Access Journals (Sweden)

    Christina M Taylor

    2011-04-01

    Full Text Available Finding new drug targets for pathogenic infections would be of great utility for humanity, as there is a large need to develop new drugs to fight infections due to the developing resistance and side effects of current treatments. Current drug targets for pathogen infections involve only a single protein. However, proteins rarely act in isolation, and the majority of biological processes occur via interactions with other proteins, so protein-protein interactions (PPIs) offer a realm of unexplored potential drug targets and are thought to be the next generation of drug targets. Parasitic worms were chosen for this study because they have deleterious effects on human health, livestock, and plants, costing society billions of dollars annually, and many sequenced genomes are available. In this study, we present a computational approach that utilizes whole genomes of 6 parasitic and 1 free-living worm species and 2 hosts. The species were placed in orthologous groups, then binned in species-specific orthologous groups. Proteins that are essential and conserved among species that span a phylum are of greatest value, as they provide foundations for developing broad-control strategies. Two PPI databases were used to find PPIs within the species-specific bins. PPIs with unique helminth proteins and helminth proteins with unique features relative to the host, such as indels, were prioritized as drug targets. The PPIs were scored based on RNAi phenotype and homology to the PDB (Protein Data Bank). EST data for the various life stages, GO annotation, and druggability were also taken into consideration. Several PPIs emerged from this study as potential drug targets. A few interactions were supported by co-localization of expression in M. incognita (plant parasite) and B. malayi (H. sapiens parasite), which have extremely different modes of parasitism. As more genomes of pathogens are sequenced and PPI databases expanded, this methodology will become increasingly

  19. Detecting mutually exclusive interactions in protein-protein interaction maps.

    KAUST Repository

    Sánchez Claros, Carmen; Tramontano, Anna

    2012-01-01

    Comprehensive protein interaction maps can complement genetic and biochemical experiments and allow the formulation of new hypotheses to be tested in the system of interest. The computational analysis of the maps may help to focus on interesting cases and thereby to appropriately prioritize the validation experiments. We show here that, by automatically comparing and analyzing structurally similar regions of proteins of known structure interacting with a common partner, it is possible to identify mutually exclusive interactions present in the maps with a sensitivity of 70% and a specificity higher than 85% and that, in about three fourths of the correctly identified complexes, we also correctly recognize at least one residue (five on average) belonging to the interaction interface. Given the present and continuously increasing number of proteins of known structure, the requirement of the knowledge of the structure of the interacting proteins does not substantially impact on the coverage of our strategy, which can be estimated to be around 25%. We also introduce here the Estrella server, which embodies this strategy, is designed for users interested in validating specific hypotheses about the functional role of a protein-protein interaction, and also allows access to pre-computed data for seven organisms.
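
    For readers unfamiliar with the two figures of merit quoted above, the short sketch below shows how sensitivity and specificity are computed from confusion-matrix counts. The counts are hypothetical values chosen only to reproduce rates of the same order as those in the abstract; they are not taken from the study.

```python
def sensitivity(tp, fn):
    """Fraction of truly mutually exclusive pairs that are detected: TP / (TP + FN)."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """Fraction of compatible pairs correctly left unflagged: TN / (TN + FP)."""
    return tn / (tn + fp)


# Hypothetical counts, for illustration only.
tp, fn = 70, 30   # mutually exclusive pairs: detected / missed
tn, fp = 86, 14   # compatible pairs: correctly rejected / wrongly flagged

print(f"sensitivity = {sensitivity(tp, fn):.2f}")
print(f"specificity = {specificity(tn, fp):.2f}")
```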

  20. Detecting mutually exclusive interactions in protein-protein interaction maps.

    KAUST Repository

    Sánchez Claros, Carmen

    2012-06-08

    Comprehensive protein interaction maps can complement genetic and biochemical experiments and allow the formulation of new hypotheses to be tested in the system of interest. The computational analysis of the maps may help to focus on interesting cases and thereby to appropriately prioritize the validation experiments. We show here that, by automatically comparing and analyzing structurally similar regions of proteins of known structure interacting with a common partner, it is possible to identify mutually exclusive interactions present in the maps with a sensitivity of 70% and a specificity higher than 85% and that, in about three fourths of the correctly identified complexes, we also correctly recognize at least one residue (five on average) belonging to the interaction interface. Given the present and continuously increasing number of proteins of known structure, the requirement of the knowledge of the structure of the interacting proteins does not substantially impact on the coverage of our strategy, which can be estimated to be around 25%. We also introduce here the Estrella server, which embodies this strategy, is designed for users interested in validating specific hypotheses about the functional role of a protein-protein interaction, and also allows access to pre-computed data for seven organisms.

  1. MIPS: a database for protein sequences, homology data and yeast genome information.

    Science.gov (United States)

    Mewes, H W; Albermann, K; Heumann, K; Liebl, S; Pfeiffer, F

    1997-01-01

    The MIPS group (Martinsried Institute for Protein Sequences) at the Max-Planck-Institute for Biochemistry, Martinsried near Munich, Germany, collects, processes and distributes protein sequence data within the framework of the tripartite association of the PIR-International Protein Sequence Database. MIPS contributes nearly 50% of the data input to the PIR-International Protein Sequence Database. The database is distributed on CD-ROM together with PATCHX, an exhaustive supplement of unique, unverified protein sequences from external sources compiled by MIPS. Through its WWW server (http://www.mips.biochem.mpg.de/) MIPS permits internet access to sequence databases, homology data and to yeast genome information. (i) Sequence similarity results from the FASTA program are stored in the FASTA database for all proteins from PIR-International and PATCHX. The database is dynamically maintained and permits instant access to FASTA results. (ii) Starting with FASTA database queries, proteins have been classified into families and superfamilies (PROT-FAM). (iii) The HPT (hashed position tree) data structure developed at MIPS is a new approach for rapid sequence and pattern searching. (iv) MIPS provides access to the sequence and annotation of the complete yeast genome, the functional classification of yeast genes (FunCat) and its graphical display, the 'Genome Browser'. A CD-ROM based on the JAVA programming language providing dynamic interactive access to the yeast genome and the related protein sequences has been compiled and is available on request. PMID:9016498

  2. CVcat: An interactive database on cataclysmic variables

    Science.gov (United States)

    Kube, J.; Gänsicke, B. T.; Euchner, F.; Hoffmann, B.

    2003-06-01

    CVcat is a database that contains published data on cataclysmic variables and related objects. Unlike in the existing online sources, the users are allowed to add data to the catalogue. The concept of an "open catalogue" approach is reviewed together with the experience from one year of public usage of CVcat. New concepts to be included in the upcoming AstroCat framework and the next CVcat implementation are presented. CVcat can be found at http://www.cvcat.org.

  3. SHEETSPAIR: A Database of Amino Acid Pairs in Protein Sheet Structures

    Directory of Open Access Journals (Sweden)

    Ning Zhang

    2007-10-01

    Full Text Available Within folded strands of a protein, amino acids (AAs) on every adjacent two strands form a pair of AAs. To explore the interactions between strands in a protein sheet structure, we have established an Internet-accessible relational database named SheetsPairs based on SQL Server 2000. The database has collected AA pairs in proteins with detailed information. Furthermore, it utilizes a non-freetext database structure to store protein sequences and a specific database table with a unique number to store strands, which provides more searching options and rapid and accurate access to data queries. An IIS web server has been set up for data retrieval through a custom web interface, which enables complex data queries. Also searchable are parallel or anti-parallel folded strands and the list of strands in a specified protein.

  4. Scoring functions for protein-protein interactions.

    Science.gov (United States)

    Moal, Iain H; Moretti, Rocco; Baker, David; Fernández-Recio, Juan

    2013-12-01

    The computational evaluation of protein-protein interactions will play an important role in organising the wealth of data being generated by high-throughput initiatives. Here we discuss future applications, report recent developments and identify areas requiring further investigation. Many functions have been developed to quantify the structural and energetic properties of interacting proteins, finding use in interrelated challenges revolving around the relationship between sequence, structure and binding free energy. These include loop modelling, side-chain refinement, docking, multimer assembly, affinity prediction, affinity change upon mutation, hotspots location and interface design. Information derived from models optimised for one of these challenges can be used to benefit the others, and can be unified within the theoretical frameworks of multi-task learning and Pareto-optimal multi-objective learning. Copyright © 2013 Elsevier Ltd. All rights reserved.

  5. Inferring high-confidence human protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Yu Xueping

    2012-05-01

    Full Text Available Abstract Background As numerous experimental factors drive the acquisition, identification, and interpretation of protein-protein interactions (PPIs), aggregated assemblies of human PPI data invariably contain experiment-dependent noise. Ascertaining the reliability of PPIs collected from these diverse studies and scoring them to infer high-confidence networks is a non-trivial task. Moreover, a large number of PPIs share the same number of reported occurrences, making it impossible to distinguish the reliability of these PPIs and rank-order them. For example, for the data analyzed here, we found that the majority (>83%) of currently available human PPIs have been reported only once. Results In this work, we proposed an unsupervised statistical approach to score a set of diverse, experimentally identified PPIs from nine primary databases to create subsets of high-confidence human PPI networks. We evaluated this ranking method by comparing it with other methods and assessing their ability to retrieve protein associations from a number of diverse and independent reference sets. These reference sets contain known biological data that are either directly or indirectly linked to interactions between proteins. We quantified the average effect of using ranked protein interaction data to retrieve this information and showed that, when compared to randomly ranked interaction data sets, the proposed method created a larger enrichment (~134%) than either ranking based on the hypergeometric test (~109%) or occurrence ranking (~46%). Conclusions From our evaluations, it was clear that ranked interactions were always of value because higher-ranked PPIs had a higher likelihood of retrieving high-confidence experimental data. Reducing the noise inherent in aggregated experimental PPIs via our ranking scheme further increased the accuracy and enrichment of PPIs derived from a number of biologically relevant data sets. These results suggest that using our high

  6. Extended functions of the database machine FREND for interactive systems

    International Nuclear Information System (INIS)

    Hikita, S.; Kawakami, S.; Sano, K.

    1984-01-01

    Well-designed visual interfaces encourage non-expert users to use relational database systems. In systems such as office automation systems or engineering database systems, non-expert users interactively access the database from visual terminals. Depending on the situation, some users may want exclusive use of the database while others share it. Because those jobs take a long time to complete, concurrency control must be well designed to enhance concurrency. The extended method of concurrency control of FREND is presented in this paper. The authors assume that systems are composed of workstations, a local area network and the database machine FREND. This paper also stresses that those workstations and FREND must cooperate to complete concurrency control for interactive applications.

  7. An ontology-based search engine for protein-protein interactions.

    Science.gov (United States)

    Park, Byungkyu; Han, Kyungsook

    2010-01-18

    Keyword matching or ID matching is the most common searching method in a large database of protein-protein interactions. They are purely syntactic methods, and retrieve the records in the database that contain a keyword or ID specified in a query. Such syntactic search methods often retrieve too few search results or no results despite many potential matches present in the database. We have developed a new method for representing protein-protein interactions and the Gene Ontology (GO) using modified Gödel numbers. This representation is hidden from users but enables a search engine using the representation to efficiently search protein-protein interactions in a biologically meaningful way. Given a query protein with optional search conditions expressed in one or more GO terms, the search engine finds all the interaction partners of the query protein by unique prime factorization of the modified Gödel numbers representing the query protein and the search conditions. Representing the biological relations of proteins and their GO annotations by modified Gödel numbers makes a search engine efficiently find all protein-protein interactions by prime factorization of the numbers. Keyword matching or ID matching search methods often miss the interactions involving a protein that has no explicit annotations matching the search condition, but our search engine retrieves such interactions as well if they satisfy the search condition with a more specific term in the ontology.
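
    The idea of searching by prime factorization can be illustrated with a much simplified Gödel-number-style encoding. The abstract does not specify how the "modified Gödel numbers" are constructed, so the scheme below is an assumption made for illustration: each ontology term gets its own prime, a protein is encoded as the product of the primes of its annotated terms and all their ancestors, and a query matches when its number divides the protein's number. The mini-ontology, protein names and function names are hypothetical.

```python
from itertools import count


def primes():
    """Yield 2, 3, 5, ... by trial division (adequate for a toy ontology)."""
    found = []
    for n in count(2):
        if all(n % p for p in found):
            found.append(n)
            yield n


# Hypothetical mini-ontology: term -> list of parent terms.
PARENTS = {
    "binding": [],
    "protein binding": ["binding"],
    "kinase binding": ["protein binding"],
    "catalytic activity": [],
}

# One distinct prime per term: binding=2, protein binding=3,
# kinase binding=5, catalytic activity=7.
PRIME_OF = dict(zip(PARENTS, primes()))


def ancestors_and_self(term):
    """Return the term together with all of its ancestors in the ontology."""
    seen, stack = {term}, [term]
    while stack:
        for parent in PARENTS[stack.pop()]:
            if parent not in seen:
                seen.add(parent)
                stack.append(parent)
    return seen


def encode(terms):
    """Product of the primes of every annotated term and every ancestor."""
    closure = set()
    for term in terms:
        closure |= ancestors_and_self(term)
    number = 1
    for term in closure:
        number *= PRIME_OF[term]
    return number


def matches(protein_code, query_terms):
    """A protein satisfies the query if the query number divides its code."""
    query = 1
    for term in set(query_terms):
        query *= PRIME_OF[term]
    return protein_code % query == 0


# Hypothetical interaction partners of a query protein "Q" with their annotations.
PARTNERS = {"Q": {"P1": ["kinase binding"], "P2": ["catalytic activity"]}}


def find_partners(protein, query_terms):
    return sorted(name for name, terms in PARTNERS[protein].items()
                  if matches(encode(terms), query_terms))


print(find_partners("Q", ["binding"]))
```

    Here P1 is annotated only with the specific term "kinase binding", yet it is returned for the broader query "binding" because its code (2 × 3 × 5 = 30) is divisible by the prime assigned to "binding". A plain keyword match on "binding" as an exact annotation would have missed it, which is the behaviour the abstract contrasts against.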

  8. SwissPalm: Protein Palmitoylation database.

    Science.gov (United States)

    Blanc, Mathieu; David, Fabrice; Abrami, Laurence; Migliozzi, Daniel; Armand, Florence; Bürgi, Jérôme; van der Goot, Françoise Gisou

    2015-01-01

    Protein S-palmitoylation is a reversible post-translational modification that regulates many key biological processes, although the full extent and functions of protein S-palmitoylation remain largely unexplored. Recent developments of new chemical methods have allowed the establishment of palmitoyl-proteomes of a variety of cell lines and tissues from different species. As the amount of information generated by these high-throughput studies is increasing, the field requires centralization and comparison of this information. Here we present SwissPalm (http://swisspalm.epfl.ch), our open, comprehensive, manually curated resource to study protein S-palmitoylation. It currently encompasses more than 5000 S-palmitoylated protein hits from seven species, and contains more than 500 specific sites of S-palmitoylation. SwissPalm also provides curated information and filters that increase the confidence in true positive hits, and integrates predictions of S-palmitoylated cysteine scores, orthologs and isoform multiple alignments. Systems analysis of the palmitoyl-proteome screens indicates that 10% or more of the human proteome is susceptible to S-palmitoylation. Moreover, ontology and pathway analyses of the human palmitoyl-proteome reveal that key biological functions involve this reversible lipid modification. Comparative analysis finally shows a strong crosstalk between S-palmitoylation and other post-translational modifications. Through the compilation of data and continuous updates, SwissPalm will provide a powerful tool to unravel the global importance of protein S-palmitoylation.

  9. An Interactive Database of Cocaine-Responsive Gene Expression

    Directory of Open Access Journals (Sweden)

    Willard M. Freeman

    2002-01-01

    Full Text Available The postgenomic era of large-scale gene expression studies is inundating drug abuse researchers and many other scientists with findings related to gene expression. This information is distributed across many different journals, and requires laborious literature searches. Here, we present an interactive database that combines existing information related to cocaine-mediated changes in gene expression in an easy-to-use format. The database is limited to statistically significant changes in mRNA or protein expression after cocaine administration. The Flash-based program is integrated into a Web page, and organizes changes in gene expression based on neuroanatomical region, general function, and gene name. Accompanying each gene is a description of the gene, links to the original publications, and a link to the appropriate OMIM (Online Mendelian Inheritance in Man) entry. The nature of this review allows for timely modifications and rapid inclusion of new publications, and should help researchers build second-generation hypotheses on the role of gene expression changes in the physiology and behavior of cocaine abuse. Furthermore, this method of organizing large volumes of scientific information can easily be adapted to assist researchers in fields outside of drug abuse.

  10. Merging in-silico and in vitro salivary protein complex partners using the STRING database: A tutorial.

    Science.gov (United States)

    Crosara, Karla Tonelli Bicalho; Moffa, Eduardo Buozi; Xiao, Yizhi; Siqueira, Walter Luiz

    2018-01-16

    Protein-protein interaction is a common physiological mechanism for protection and actions of proteins in an organism. The identification and characterization of protein-protein interactions in different organisms is necessary to better understand their physiology and to determine their efficacy. In a previous in vitro study using mass spectrometry, we identified 43 proteins that interact with histatin 1. Six previously documented interactors were confirmed and 37 novel partners were identified. In this tutorial, we aimed to demonstrate the usefulness of the STRING database for studying protein-protein interactions. We used an in-silico approach along with the STRING database (http://string-db.org/) and successfully performed a fast simulation of a novel constructed histatin 1 protein-protein network, including both the previously known and the predicted interactors, along with our newly identified interactors. Our study highlights the advantages and importance of applying bioinformatics tools to merge in-silico tactics with experimental in vitro findings for rapid advancement of our knowledge about protein-protein interactions. Our findings also indicate that bioinformatics tools such as the STRING protein network database can help predict potential interactions between proteins and thus serve as a guide for future steps in our exploration of the Human Interactome. Our study highlights the usefulness of the STRING protein database for studying protein-protein interactions. The STRING database can collect and integrate data about known and predicted protein-protein associations from many organisms, including both direct (physical) and indirect (functional) interactions, in an easy-to-use interface. Copyright © 2017 Elsevier B.V. All rights reserved.
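
    As a rough illustration of how such a network could be requested programmatically rather than through the web interface, the sketch below only builds a request URL and performs no network access. It assumes the public STRING REST pattern `https://string-db.org/api/<format>/network` with `identifiers`, `species` and `required_score` parameters; these should be checked against the current STRING API documentation. The helper name is hypothetical, HTN1 is used as the gene symbol for histatin 1, and 9606 is the NCBI taxonomy identifier for human.

```python
from urllib.parse import urlencode

STRING_API = "https://string-db.org/api"  # assumed base URL of the STRING REST API


def string_network_url(identifiers, species=9606, fmt="tsv", required_score=None):
    """Build (but do not send) a STRING 'network' request URL.

    Multiple identifiers are joined with a carriage return, which
    urlencode percent-encodes as %0D.
    """
    params = {"identifiers": "\r".join(identifiers), "species": species}
    if required_score is not None:
        params["required_score"] = required_score
    return f"{STRING_API}/{fmt}/network?{urlencode(params)}"


print(string_network_url(["HTN1"]))
```

    The resulting URL can be opened in a browser or fetched with any HTTP client; the tab-separated response can then be merged with a list of experimentally identified interactors.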

  11. ARCPHdb: A comprehensive protein database for SF1 and SF2 helicase from archaea.

    Science.gov (United States)

    Moukhtar, Mirna; Chaar, Wafi; Abdel-Razzak, Ziad; Khalil, Mohamad; Taha, Samir; Chamieh, Hala

    2017-01-01

    Superfamily 1 and Superfamily 2 helicases, two of the largest helicase protein families, play vital roles in many biological processes including replication, transcription and translation. Study of helicase proteins in the model microorganisms of archaea has largely contributed to the understanding of their function, architecture and assembly. Based on a large phylogenomics approach, we have identified and classified all SF1 and SF2 protein families in ninety-five sequenced archaea genomes. Here we developed an online webserver linked to a specialized protein database named ARCPHdb to provide access for SF1 and SF2 helicase families from archaea. ARCPHdb was implemented using a MySQL relational database. Web interfaces were developed using Netbeans. Data were stored according to UniProt accession numbers, NCBI RefSeq IDs, PDB IDs and Entrez Databases. A user-friendly interactive web interface has been developed to browse, search and download archaeal helicase protein sequences, their available 3D structure models, and related documentation available in the literature provided by ARCPHdb. The database provides direct links to matching external databases. The ARCPHdb is the first online database to compile all protein information on SF1 and SF2 helicase from archaea in one platform. This database provides essential resource information for all researchers interested in the field. Copyright © 2016 Elsevier Ltd. All rights reserved.

  12. TOPDOM: database of conservatively located domains and motifs in proteins.

    Science.gov (United States)

    Varga, Julia; Dobson, László; Tusnády, Gábor E

    2016-09-01

    The TOPDOM database, originally created as a collection of domains and motifs located consistently on the same side of the membranes in α-helical transmembrane proteins, has been updated and extended by taking into consideration consistently localized domains and motifs in globular proteins, too. By taking advantage of the recently developed CCTOP algorithm to determine the type of a protein and predict topology in case of transmembrane proteins, and by applying a thorough search for domains and motifs as well as utilizing the most up-to-date version of all source databases, we managed to reach a 6-fold increase in the size of the whole database and a 2-fold increase in the number of transmembrane proteins. TOPDOM database is available at http://topdom.enzim.hu. The webpage utilizes the common Apache, PHP5 and MySQL software to provide the user interface for accessing and searching the database. The database itself is generated on a high performance computer. Contact: tusnady.gabor@ttk.mta.hu. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  13. Protein - TP Atlas | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available ...p_atlas_protein.zip File URL: ftp://ftp.biosciencedbc.jp/archive/tp_atlas/LATEST/...

  14. The DExH/D protein family database.

    Science.gov (United States)

    Jankowsky, E; Jankowsky, A

    2000-01-01

    DExH/D proteins are essential for all aspects of cellular RNA metabolism and processing, in the replication of many viruses and in DNA replication. DExH/D proteins are subject to current biological, biochemical and biophysical research which provides a continuous wealth of data. The DExH/D protein family database compiles this information and makes it available over the WWW (http://www.columbia.edu/ ej67/dbhome.htm ). The database can be fully searched by text based queries, facilitating fast access to specific information about this important class of enzymes.

  15. Fly-DPI: database of protein interactomes for D. melanogaster in the approach of systems biology

    Directory of Open Access Journals (Sweden)

    Lin Chieh-Hua

    2006-12-01

    Full Text Available Abstract Background Proteins control and mediate many biological activities of cells by interacting with other protein partners. This work presents a statistical model to predict protein interaction networks of Drosophila melanogaster based on insight into domain interactions. Results Three high-throughput yeast two-hybrid experiments and the collection in FlyBase were used as our starting datasets. The co-occurrences of domains in these interactive events are converted into a probability score of domain-domain interaction. These scores are used to infer putative interaction among all available open reading frames (ORFs) of fruit fly. Additionally, the likelihood function is used to estimate all potential protein-protein interactions. All parameters are successfully iterated and MLE is obtained for each pair of domains. Additionally, the maximized likelihood reaches its converged criteria and maintains the probability stable. The hybrid model achieves a high specificity with a loss of sensitivity, suggesting that the model may possess major features of protein-protein interactions. Several putative interactions predicted by the proposed hybrid model are supported by the literature, while experimental data with a low probability score indicate an uncertain reliability and require further proof of interaction. Fly-DPI is the online database used to present this work. It is an integrated proteomics tool with comprehensive protein annotation information from major databases as well as an effective means of predicting protein-protein interactions. As a novel search strategy, the ping-pong search is a naïve path map between two chosen proteins based on pre-computed shortest paths. Adopting effective filtering strategies will facilitate researchers in depicting the bird's eye view of the network of interest. Fly-DPI can be accessed at http://flydpi.nhri.org.tw. Conclusion This work provides two reference systems, statistical and biological, to evaluate

  16. AMYPdb: A database dedicated to amyloid precursor proteins

    Directory of Open Access Journals (Sweden)

    Delamarche Christian

    2008-06-01

    Full Text Available Abstract Background Misfolding and aggregation of proteins into ordered fibrillar structures is associated with a number of severe pathologies, including Alzheimer's disease, prion diseases, and type II diabetes. The rapid accumulation of knowledge about the sequences and structures of these proteins allows the use of in silico methods to investigate the molecular mechanisms of their abnormal conformational changes and assembly. However, such an approach requires the collection of accurate data, which are inconveniently dispersed among several generalist databases. Results We therefore created a free online knowledge database (AMYPdb) dedicated to amyloid precursor proteins and performed large-scale sequence analysis of the included data. Currently, AMYPdb integrates data on 31 families, including 1,705 proteins from nearly 600 organisms. It displays links to more than 2,300 bibliographic references and 1,200 3D structures. A Wiki system is available to insert data into the database, providing a sharing and collaboration environment. We generated and analyzed 3,621 amino acid sequence patterns, reporting highly specific patterns for each amyloid family, along with patterns likely to be involved in protein misfolding and aggregation. Conclusion AMYPdb is a comprehensive online database that aims to centralize bioinformatic data on all amyloid proteins and their precursors. Our sequence pattern discovery and analysis approach unveiled protein regions of significant interest. AMYPdb is freely accessible.

  17. UbiProt: a database of ubiquitylated proteins

    Directory of Open Access Journals (Sweden)

    Kondratieva Ekaterina V

    2007-04-01

    Full Text Available Abstract Background Post-translational protein modification with ubiquitin, or ubiquitylation, is one of the hottest topics in modern biology due to its dramatic impact on diverse metabolic pathways and its involvement in the pathogenesis of severe human diseases. A great number of eukaryotic proteins have been found to be ubiquitylated. However, data about particular ubiquitylated proteins are rather scattered. Description To fill a general need for collecting and systematizing experimental data concerning ubiquitylation we have developed a new resource, UbiProt Database, a knowledgebase of ubiquitylated proteins. The database contains retrievable information about overall characteristics of a particular protein, ubiquitylation features, related ubiquitylation and de-ubiquitylation machinery and literature references reflecting experimental evidence of ubiquitylation. UbiProt is available at http://ubiprot.org.ru for free. Conclusion UbiProt Database is a public resource offering comprehensive information on ubiquitylated proteins. The resource can serve as a general reference source both for researchers in the ubiquitin field and for those who work with particular ubiquitylated proteins of interest. Further development of the UbiProt Database is expected to be of common interest for research groups involved in studies of the ubiquitin system.

  18. Domain fusion analysis by applying relational algebra to protein sequence and domain databases.

    Science.gov (United States)

    Truong, Kevin; Ikura, Mitsuhiko

    2003-05-06

    Domain fusion analysis is a useful method to predict functionally linked proteins that may be involved in direct protein-protein interactions or in the same metabolic or signaling pathway. As separate domain databases like BLOCKS, PROSITE, Pfam, SMART, PRINTS-S, ProDom, TIGRFAMs, and amalgamated domain databases like InterPro continue to grow in size and quality, a computational method to perform domain fusion analysis that leverages these efforts will become increasingly powerful. This paper proposes a computational method employing relational algebra to find domain fusions in protein sequence databases. The feasibility of this method was illustrated on the SWISS-PROT+TrEMBL sequence database using domain predictions from the Pfam HMM (hidden Markov model) database. We identified 235 and 189 putative functionally linked protein partners in H. sapiens and S. cerevisiae, respectively. From the scientific literature, we were able to confirm many of these functional linkages, while the remainder offer testable experimental hypotheses. Results can be viewed at http://calcium.uhnres.utoronto.ca/pi. As the analysis can be computed quickly on any relational database that supports standard SQL (structured query language), it can be dynamically updated along with the sequence and domain databases, thereby improving the quality of predictions over time.
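
    The relational formulation described in this abstract can be illustrated with a small sketch. It is not the authors' schema or query: the table name `hits`, its columns, and the function `find_domain_fusions` are assumptions made for illustration. The idea shown is the usual domain fusion rule, in which two proteins that each carry one domain are linked when some third, composite protein carries both domains.

```python
import sqlite3


def find_domain_fusions(rows):
    """Return (protein_a, protein_b, composite) triples.

    rows: iterable of (protein, domain) pairs, e.g. taken from domain
    predictions. The single-table layout is a hypothetical simplification.
    """
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE hits (protein TEXT, domain TEXT)")
    con.executemany("INSERT INTO hits VALUES (?, ?)", rows)
    query = """
        SELECT DISTINCT a.protein  AS protein_a,
                        b.protein  AS protein_b,
                        c1.protein AS composite
        FROM hits AS a
        -- composite protein shares a domain with protein a
        JOIN hits AS c1 ON c1.domain = a.domain AND c1.protein <> a.protein
        -- the same composite protein carries a second, different domain
        JOIN hits AS c2 ON c2.protein = c1.protein AND c2.domain <> c1.domain
        -- protein b carries that second domain
        JOIN hits AS b  ON b.domain = c2.domain AND b.protein <> c1.protein
        WHERE a.protein < b.protein
          -- a and b must each lack the other's domain
          AND NOT EXISTS (SELECT 1 FROM hits AS x
                          WHERE x.protein = a.protein AND x.domain = c2.domain)
          AND NOT EXISTS (SELECT 1 FROM hits AS y
                          WHERE y.protein = b.protein AND y.domain = c1.domain)
        ORDER BY protein_a, protein_b, composite
    """
    result = con.execute(query).fetchall()
    con.close()
    return result
```

    Because the whole analysis is a single declarative query, it can be rerun whenever the underlying table of domain hits is refreshed, which matches the abstract's point about updating predictions along with the source databases.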

  19. GPCR Interaction - GRIPDB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available GRI...ed information (disease etc.). Data file File name: gripdb_main.zip File URL: ftp://ftp.biosciencedbc.jp/archive/gripdb/LATEST/gri...godb.biosciencedbc.jp/togodb/view/gripdb_main#en Data acquisition method PDB, Refseq Data analysis method - ...Number of data entries 409 entries Data item Description GRIP ID Interaction ID Main Title Interaction title...

  20. Intercellular protein-protein interactions at synapses.

    Science.gov (United States)

    Yang, Xiaofei; Hou, Dongmei; Jiang, Wei; Zhang, Chen

    2014-06-01

    Chemical synapses are asymmetric intercellular junctions through which neurons send nerve impulses to communicate with other neurons or excitable cells. The appropriate formation of synapses, both spatially and temporally, is essential for brain function and depends on the intercellular protein-protein interactions of cell adhesion molecules (CAMs) at synaptic clefts. The CAM proteins link pre- and post-synaptic sites, and play essential roles in promoting synapse formation and maturation, maintaining synapse number and type, accumulating neurotransmitter receptors and ion channels, controlling neuronal differentiation, and even regulating synaptic plasticity directly. Alteration of the interactions of CAMs leads to structural and functional impairments, which results in many neurological disorders, such as autism, Alzheimer's disease and schizophrenia. Therefore, it is crucial to understand the functions of CAMs during development and in the mature neural system, as well as in the pathogenesis of some neurological disorders. Here, we review the function of the major classes of CAMs, and how dysfunction of CAMs relates to several neurological disorders.

  1. Feature generation and representations for protein-protein interaction classification.

    Science.gov (United States)

    Lan, Man; Tan, Chew Lim; Su, Jian

    2009-10-01

    Automatically detecting protein-protein interaction (PPI) relevant articles is a crucial step for large-scale biological database curation. Previous work adopted POS tagging, shallow parsing and sentence splitting techniques, but these achieved worse performance than the simple bag-of-words representation. In this paper, we generated and investigated multiple types of feature representations in order to further improve the performance of the PPI text classification task. Besides the traditional domain-independent bag-of-words approach and term weighting methods, we also explored other domain-dependent features, i.e. protein-protein interaction trigger keywords, protein named entities and advanced ways of incorporating Natural Language Processing (NLP) output. The integration of these multiple features has been evaluated on the BioCreAtIvE II corpus. The experimental results showed that both the advanced way of using NLP output and the integration of bag-of-words and NLP output improved the performance of text classification. Specifically, in comparison with the best performance achieved in the BioCreAtIvE II IAS, the feature-level and classifier-level integration of multiple features improved classification performance by 2.71% and 3.95%, respectively.
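
    As a rough illustration of feature-level integration, the sketch below combines bag-of-words counts with a count of interaction trigger keywords in one feature vector. The keyword list and the `featurize` function are hypothetical; the abstract does not give the actual trigger lexicon, term weighting, or classifier.

```python
import re
from collections import Counter

# Hypothetical trigger lexicon; the paper's actual keyword list is not
# given in the abstract.
TRIGGER_KEYWORDS = {"interacts", "binds", "phosphorylates", "complex"}


def featurize(text, triggers=TRIGGER_KEYWORDS):
    """Build one sparse feature vector from bag-of-words and trigger counts."""
    tokens = re.findall(r"[a-z0-9]+", text.lower())
    # Domain-independent part: one feature per distinct token.
    features = Counter(f"bow={token}" for token in tokens)
    # Domain-dependent part: how many trigger keywords the text contains.
    n_triggers = sum(1 for token in tokens if token in triggers)
    if n_triggers:
        features["trigger_count"] = n_triggers
    return features
```

    The resulting dictionary could be passed to any classifier that accepts sparse features; which classifier and weighting scheme to use is left open here.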

  2. Protein-protein interaction detection system using fluorescent protein microdomains

    Science.gov (United States)

    Waldo, Geoffrey S.; Cabantous, Stephanie

    2010-02-23

    The invention provides a protein labeling and interaction detection system based on engineered fragments of fluorescent and chromophoric proteins that require fused interacting polypeptides to drive the association of the fragments, and further are soluble and stable, and do not change the solubility of polypeptides to which they are fused. In one embodiment, a test protein X is fused to a sixteen amino acid fragment of GFP (β-strand 10, amino acids 198-214), engineered to not perturb fusion protein solubility. A second test protein Y is fused to a sixteen amino acid fragment of GFP (β-strand 11, amino acids 215-230), engineered to not perturb fusion protein solubility. When X and Y interact, they bring the GFP strands into proximity, and are detected by complementation with a third GFP fragment consisting of GFP amino acids 1-198 (strands 1-9). When GFP strands 10 and 11 are held together by interaction of proteins X and Y, they spontaneously associate with GFP strands 1-9, resulting in structural complementation, folding, and concomitant GFP fluorescence.

  3. A Novel Approach for Protein-Named Entity Recognition and Protein-Protein Interaction Extraction

    Directory of Open Access Journals (Sweden)

    Meijing Li

    2015-01-01

    Full Text Available Many researchers focus on developing protein-named entity recognition (Protein-NER) or PPI extraction systems. However, studies on these two topics have not been merged well; as a result, the Protein-NER in existing PPI extraction systems still needs improvement. In this paper, we developed a protein-protein interaction extraction system named PPIMiner based on Support Vector Machine (SVM) and parsing tree. PPIMiner consists of three main models: a natural language processing (NLP) model, a Protein-NER model, and a PPI discovery model. The Protein-NER model, which is named ProNER, identifies protein names based on two methods: a dictionary-based method and a machine learning-based method. ProNER is capable of identifying more proteins than the dictionary-based Protein-NER models in other existing systems. The final PPIs extracted via the PPI discovery model are represented in detail, showing the protein interaction types and the occurrence frequency through two different methods. In the experiments, the results show that the performance achieved by our ProNER and PPI discovery model is better than that of other existing tools. PPIMiner applies this protein-named entity recognition approach and a parsing tree based PPI extraction method to improve the performance of PPI extraction. We also provide an easy-to-use interface to access the PPI database and an online system for PPI extraction and Protein-NER.

  4. Topology and weights in a protein domain interaction network--a novel way to predict protein interactions.

    Science.gov (United States)

    Wuchty, Stefan

    2006-05-23

    While the analysis of unweighted biological webs as diverse as genetic, protein and metabolic networks allowed spectacular insights into the inner workings of a cell, biological networks are not only determined by their static grid of links. In fact, we expect that the heterogeneity in the utilization of connections has a major impact on the organization of cellular activities as well. We consider a web of interactions between protein domains of the Protein Family database (PFAM), which are weighted by a probability score. We apply metrics that combine the static layout and the weights of the underlying interactions. We observe that unweighted measures as well as their weighted counterparts largely share the same trends in the underlying domain interaction network. However, we only find weak signals that weights and the static grid of interactions are connected entities. Therefore assuming that a protein interaction is governed by a single domain interaction, we observe strong and significant correlations of the highest scoring domain interaction and the confidence of protein interactions in the underlying interactions of yeast and fly. Modeling an interaction between proteins if we find a high scoring protein domain interaction, we obtain 1,428 protein interactions among 361 proteins in the human malaria parasite Plasmodium falciparum. Assessing their quality by a logistic regression method we observe that increasing confidence of predicted interactions is accompanied by high scoring domain interactions and elevated levels of functional similarity and evolutionary conservation. Our results indicate that probability scores are randomly distributed, allowing us to treat the static grid and the weights of domain interactions as separate entities. In particular, this finding confirms earlier observations that a protein interaction is a matter of a single interaction event on the domain level. As an immediate application, we show a simple way to predict potential protein interactions
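
    The prediction rule summarized in this abstract (call two proteins interacting when their highest-scoring domain pair scores high) can be sketched as follows. The data layout, the function name `predict_interactions`, and the explicit `threshold` parameter are assumptions for illustration; the abstract does not state the cutoff or data structures used by the author.

```python
from itertools import combinations, product


def predict_interactions(protein_domains, domain_scores, threshold):
    """Predict protein pairs from their best-scoring domain pair.

    protein_domains: dict mapping protein name -> set of domain names.
    domain_scores:   dict mapping frozenset of one or two domain names ->
                     probability score (one name means a same-domain pair).
    threshold:       hypothetical cutoff for a "high scoring" domain pair.
    """
    predictions = {}
    for p, q in combinations(sorted(protein_domains), 2):
        best = 0.0
        # A protein pair inherits the score of its single best domain pair.
        for d, e in product(protein_domains[p], protein_domains[q]):
            best = max(best, domain_scores.get(frozenset((d, e)), 0.0))
        if best >= threshold:
            predictions[(p, q)] = best
    return predictions
```

    Taking the maximum rather than a sum or product reflects the abstract's working assumption that a protein interaction is governed by a single domain interaction.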

  5. Topology and weights in a protein domain interaction network – a novel way to predict protein interactions

    Directory of Open Access Journals (Sweden)

    Wuchty Stefan

    2006-05-01

    Full Text Available Abstract Background While the analysis of unweighted biological webs as diverse as genetic, protein and metabolic networks allowed spectacular insights into the inner workings of a cell, biological networks are not only determined by their static grid of links. In fact, we expect that the heterogeneity in the utilization of connections has a major impact on the organization of cellular activities as well. Results We consider a web of interactions between protein domains of the Protein Family database (PFAM), which are weighted by a probability score. We apply metrics that combine the static layout and the weights of the underlying interactions. We observe that unweighted measures as well as their weighted counterparts largely share the same trends in the underlying domain interaction network. However, we only find weak signals that weights and the static grid of interactions are connected entities. Therefore assuming that a protein interaction is governed by a single domain interaction, we observe strong and significant correlations of the highest scoring domain interaction and the confidence of protein interactions in the underlying interactions of yeast and fly. Modeling an interaction between proteins if we find a high scoring protein domain interaction, we obtain 1,428 protein interactions among 361 proteins in the human malaria parasite Plasmodium falciparum. Assessing their quality by a logistic regression method we observe that increasing confidence of predicted interactions is accompanied by high scoring domain interactions and elevated levels of functional similarity and evolutionary conservation. Conclusion Our results indicate that probability scores are randomly distributed, allowing us to treat the static grid and the weights of domain interactions as separate entities. In particular, this finding confirms earlier observations that a protein interaction is a matter of a single interaction event on the domain level. As an immediate application, we

  6. MultitaskProtDB: a database of multitasking proteins.

    Science.gov (United States)

    Hernández, Sergio; Ferragut, Gabriela; Amela, Isaac; Perez-Pons, JosepAntoni; Piñol, Jaume; Mozo-Villarias, Angel; Cedano, Juan; Querol, Enrique

    2014-01-01

    We have compiled MultitaskProtDB, available online at http://wallace.uab.es/multitask, to provide a repository where the many multitasking proteins found in the literature can be stored. Multitasking or moonlighting is the capability of some proteins to execute two or more biological functions. Usually, multitasking proteins are revealed experimentally by serendipity. This ability of proteins to perform multiple functions helps us to understand one of the ways used by cells to perform many complex functions with a limited number of genes. Even so, the study of this phenomenon is complex because, among other things, there is no database of moonlighting proteins. The existence of such a tool facilitates the collection and dissemination of these important data. This work reports the database, MultitaskProtDB, which is designed as a user-friendly web page containing >288 multitasking proteins with their NCBI and UniProt accession numbers, canonical and additional biological functions, monomeric/oligomeric states, PDB codes when available and bibliographic references. This database also serves to gain insight into some characteristics of multitasking proteins such as frequencies of the different pairs of functions, phylogenetic conservation and so forth.

  7. Calculations on Noncovalent Interactions and Databases of Benchmark Interaction Energies

    Czech Academy of Sciences Publication Activity Database

    Hobza, Pavel

    2012-01-01

    Vol. 45, No. 4 (2012), pp. 663-672 ISSN 0001-4842 R&D Projects: GA ČR GBP208/12/G016 Grant - others: European Social Fund (XE) CZ.1.05/2.1.00/03.0058 Institutional research plan: CEZ:AV0Z40550506 Keywords: non-covalent interactions * covalent interactions * quantum chemical approach Subject RIV: CF - Physical; Theoretical Chemistry Impact factor: 20.833, year: 2012

  8. Predicting and validating protein interactions using network structure.

    Directory of Open Access Journals (Sweden)

    Pao-Yang Chen

    2008-07-01

    Full Text Available Protein interactions play a vital part in the function of a cell. As experimental techniques for detection and validation of protein interactions are time consuming, there is a need for computational methods for this task. Protein interactions appear to form a network with a relatively high degree of local clustering. In this paper we exploit this clustering by suggesting a score based on triplets of observed protein interactions. The score utilises both protein characteristics and network properties. Our score based on triplets is shown to complement existing techniques for predicting protein interactions, outperforming them on data sets which display a high degree of clustering. The predicted interactions score highly against test measures for accuracy. Compared to a similar score derived from pairwise interactions only, the triplet score displays higher sensitivity and specificity. By looking at specific examples, we show how an experimental set of interactions can be enriched and validated. As part of this work we also examine the effect of different prior databases upon the accuracy of prediction and find that the interactions from the same kingdom give better results than from across kingdoms, suggesting that there may be fundamental differences between the networks. These results all emphasize that network structure is important and helps in the accurate prediction of protein interactions. The protein interaction data set and the program used in our analysis, and a list of predictions and validations, are available at http://www.stats.ox.ac.uk/bioinfo/resources/PredictingInteractions.
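
    To make the idea of exploiting local clustering concrete, the sketch below counts, for each pair of proteins not yet observed to interact, the partners they share; each shared partner would close a triangle if the pair did interact. This is only a simplified, network-only proxy under stated assumptions: the authors' triplet score also uses protein characteristics, and its exact form is not given in the abstract. The function name `shared_partner_counts` is hypothetical.

```python
from collections import defaultdict


def shared_partner_counts(interactions):
    """Count shared interaction partners for non-interacting protein pairs.

    interactions: iterable of (protein_u, protein_v) observed interactions.
    Returns a dict mapping (u, v) with u < v -> number of common partners,
    listing only pairs with at least one common partner.
    """
    neighbours = defaultdict(set)
    for u, v in interactions:
        if u != v:  # ignore self-interactions in this sketch
            neighbours[u].add(v)
            neighbours[v].add(u)

    counts = {}
    proteins = sorted(neighbours)
    for i, u in enumerate(proteins):
        for v in proteins[i + 1:]:
            if v in neighbours[u]:
                continue  # already observed, nothing to predict
            shared = len(neighbours[u] & neighbours[v])
            if shared:
                counts[(u, v)] = shared
    return counts
```

    Pairs with higher counts sit in more densely clustered neighbourhoods and would be ranked as more plausible candidates under this simplified view.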

  9. Specificity and affinity quantification of protein-protein interactions.

    Science.gov (United States)

    Yan, Zhiqiang; Guo, Liyong; Hu, Liang; Wang, Jin

    2013-05-01

    Most biological processes are mediated by protein-protein interactions. Determination of protein-protein structures and insight into their interactions are vital to understanding the mechanisms of protein functions. Currently, compared with isolated protein structures, only a small fraction of protein-protein structures have been experimentally solved. Therefore, computational docking methods play an increasing role in predicting the structures and interactions of protein-protein complexes. The scoring function of protein-protein interactions is the key factor responsible for the accuracy of computational docking. Previous scoring functions were mostly developed by optimizing the binding affinity, which determines the stability of the protein-protein complex, but they often lack consideration of specificity, which determines the discrimination of the native protein-protein complex against competitive ones. We developed a scoring function (named SPA-PP, specificity and affinity of the protein-protein interactions) by incorporating both the specificity and the affinity into the optimization strategy. The testing results and comparisons with other scoring functions show that SPA-PP performs remarkably well in predicting both binding pose and binding affinity. Thus, SPA-PP is a promising quantification of protein-protein interactions, which can be implemented in protein docking tools and applied to the prediction of protein-protein structure and affinity. The algorithm is implemented in C language, and the code can be downloaded from http://dl.dropbox.com/u/1865642/Optimization.cpp.

  10. Filling and mining the reactive metabolite target protein database.

    Science.gov (United States)

    Hanzlik, Robert P; Fang, Jianwen; Koen, Yakov M

    2009-04-15

    The post-translational modification of proteins is a well-known endogenous mechanism for regulating protein function and activity. Cellular proteins are also susceptible to post-translational modification by xenobiotic agents that possess, or whose metabolites possess, significant electrophilic character. Such non-physiological modifications to endogenous proteins are sometimes benign, but in other cases they are strongly associated with, and are presumed to cause, lethal cytotoxic consequences via necrosis and/or apoptosis. The Reactive Metabolite Target Protein Database (TPDB) is a searchable, freely web-accessible (http://tpdb.medchem.ku.edu:8080/protein_database/) resource that attempts to provide a comprehensive, up-to-date listing of known reactive metabolite target proteins. In this report we characterize the TPDB by reviewing briefly how the information it contains came to be known. We also compare its information to that provided by other types of "-omics" studies relevant to toxicology, and we illustrate how bioinformatic analysis of target proteins may help to elucidate mechanisms of cytotoxic responses to reactive metabolites.

  11. Coevolution of interacting fertilization proteins.

    Directory of Open Access Journals (Sweden)

    Nathaniel L Clark

    2009-07-01

    Full Text Available Reproductive proteins are among the fastest evolving in the proteome, often due to the consequences of positive selection, and their rapid evolution is frequently attributed to a coevolutionary process between interacting female and male proteins. Such a process could leave characteristic signatures at coevolving genes. One signature of coevolution, predicted by sexual selection theory, is an association of alleles between the two genes. Another predicted signature is a correlation of evolutionary rates during divergence due to compensatory evolution. We studied female-male coevolution in the abalone by resequencing sperm lysin and its interacting egg coat protein, VERL, in populations of two species. As predicted, we found intergenic linkage disequilibrium between lysin and VERL, despite our demonstration that they are not physically linked. This finding supports a central prediction of sexual selection using actual genotypes, that of an association between a male trait and its female preference locus. We also created a novel likelihood method to show that lysin and VERL have experienced correlated rates of evolution. These two signatures of coevolution can provide statistical rigor to hypotheses of coevolution and could be exploited for identifying coevolving proteins a priori. We also present polymorphism-based evidence for positive selection and implicate recent selective events at the specific structural regions of lysin and VERL responsible for their species-specific interaction. Finally, we observed deep subdivision between VERL alleles in one species, which matches a theoretical prediction of sexual conflict. Thus, abalone fertilization proteins illustrate how coevolution can lead to reproductive barriers and potentially drive speciation.

  12. HitPredict version 4: comprehensive reliability scoring of physical protein-protein interactions from more than 100 species

    OpenAIRE

    López, Yosvany; Nakai, Kenta; Patil, Ashwini

    2015-01-01

    HitPredict is a consolidated resource of experimentally identified, physical protein-protein interactions with confidence scores to indicate their reliability. The study of genes and their inter-relationships using methods such as network and pathway analysis requires high quality protein-protein interaction information. Extracting reliable interactions from most of the existing databases is challenging because they either contain only a subset of the available interactions, or a mixture of p...

  13. Using the Pathogen-Host Interactions database (PHI-base) to investigate plant pathogen genomes and genes implicated in virulence

    Directory of Open Access Journals (Sweden)

    Martin Urban

    2015-08-01

    Full Text Available New pathogen-host interaction mechanisms can be revealed by integrating mutant phenotype data with genetic information. PHI-base is a multi-species manually curated database combining peer-reviewed published phenotype data from plant and animal pathogens and gene/protein information in a single database.

  14. Drosophila Protein interaction Map (DPiM)

    OpenAIRE

    Guruharsha, K.G.; Obar, Robert A.; Mintseris, Julian; Aishwarya, K.; Krishnan, R.T.; VijayRaghavan, K.; Artavanis-Tsakonas, Spyros

    2012-01-01

    Proteins perform essential cellular functions as part of protein complexes, often in conjunction with RNA, DNA, metabolites and other small molecules. The genome encodes thousands of proteins but not all of them are expressed in every cell type; and expressed proteins are not active at all times. Such diversity of protein expression and function accounts for the level of biological intricacy seen in nature. Defining protein-protein interactions in protein complexes, and establishing the when,...

  15. Exploring Protein Function Using the Saccharomyces Genome Database.

    Science.gov (United States)

    Wong, Edith D

    2017-01-01

    Elucidating the function of individual proteins will help to create a comprehensive picture of cell biology, as well as shed light on human disease mechanisms, possible treatments, and cures. Due to its compact genome, and extensive history of experimentation and annotation, the budding yeast Saccharomyces cerevisiae is an ideal model organism in which to determine protein function. This information can then be leveraged to infer functions of human homologs. Despite the large amount of research and biological data about S. cerevisiae, many proteins' functions remain unknown. Here, we explore ways to use the Saccharomyces Genome Database (SGD; http://www.yeastgenome.org) to predict the function of proteins and gain insight into their roles in various cellular processes.

  16. THPdb: Database of FDA-approved peptide and protein therapeutics.

    Directory of Open Access Journals (Sweden)

    Salman Sadullah Usmani

    Full Text Available THPdb (http://crdd.osdd.net/raghava/thpdb/) is a manually curated repository of Food and Drug Administration (FDA) approved therapeutic peptides and proteins. The information in THPdb has been compiled from 985 research publications, 70 patents and other resources like DrugBank. The current version of the database holds a total of 852 entries, providing comprehensive information on 239 US-FDA approved therapeutic peptides and proteins and their 380 drug variants. The information on each peptide and protein includes its sequence, chemical properties, composition, disease area, mode of activity, physical appearance, category or pharmacological class, pharmacodynamics, route of administration, toxicity, target of activity, etc. In addition, we have annotated the structure of most of the proteins and peptides. A number of user-friendly tools have been integrated to facilitate easy browsing and data analysis. To assist the scientific community, a web interface and a mobile app have also been developed.

  17. DenHunt - A Comprehensive Database of the Intricate Network of Dengue-Human Interactions.

    Directory of Open Access Journals (Sweden)

    Prashanthi Karyala

    2016-09-01

    Full Text Available Dengue virus (DENV) is a human pathogen and its etiology has been widely established. There are many interactions between DENV and human proteins that have been reported in literature. However, no publicly accessible resource for efficiently retrieving the information is yet available. In this study, we mined all publicly available dengue-human interactions that have been reported in the literature into a database called DenHunt. We retrieved 682 direct interactions of human proteins with dengue viral components, 382 indirect interactions and 4120 differentially expressed human genes in dengue infected cell lines and patients. We have illustrated the importance of DenHunt by mapping the dengue-human interactions on to the host interactome and observed that the virus targets multiple host functional complexes of important cellular processes such as metabolism, immune system and signaling pathways suggesting a potential role of these interactions in viral pathogenesis. We also observed that 7 percent of the dengue virus interacting human proteins are also associated with other infectious and non-infectious diseases. Finally, the understanding that comes from such analyses could be used to design better strategies to counteract the diseases caused by dengue virus. The whole dataset has been catalogued in a searchable database, called DenHunt (http://proline.biochem.iisc.ernet.in/DenHunt/).

  18. DenHunt - A Comprehensive Database of the Intricate Network of Dengue-Human Interactions.

    Science.gov (United States)

    Karyala, Prashanthi; Metri, Rahul; Bathula, Christopher; Yelamanchi, Syam K; Sahoo, Lipika; Arjunan, Selvam; Sastri, Narayan P; Chandra, Nagasuma

    2016-09-01

    Dengue virus (DENV) is a human pathogen and its etiology has been widely established. There are many interactions between DENV and human proteins that have been reported in literature. However, no publicly accessible resource for efficiently retrieving the information is yet available. In this study, we mined all publicly available dengue-human interactions that have been reported in the literature into a database called DenHunt. We retrieved 682 direct interactions of human proteins with dengue viral components, 382 indirect interactions and 4120 differentially expressed human genes in dengue infected cell lines and patients. We have illustrated the importance of DenHunt by mapping the dengue-human interactions on to the host interactome and observed that the virus targets multiple host functional complexes of important cellular processes such as metabolism, immune system and signaling pathways suggesting a potential role of these interactions in viral pathogenesis. We also observed that 7 percent of the dengue virus interacting human proteins are also associated with other infectious and non-infectious diseases. Finally, the understanding that comes from such analyses could be used to design better strategies to counteract the diseases caused by dengue virus. The whole dataset has been catalogued in a searchable database, called DenHunt (http://proline.biochem.iisc.ernet.in/DenHunt/).

  19. Reciprocal carbonyl-carbonyl interactions in small molecules and proteins.

    Science.gov (United States)

    Rahim, Abdur; Saha, Pinaki; Jha, Kunal Kumar; Sukumar, Nagamani; Sarma, Bani Kanta

    2017-07-19

    Carbonyl-carbonyl n→π* interactions where a lone pair (n) of the oxygen atom of a carbonyl group is delocalized over the π* orbital of a nearby carbonyl group have attracted a lot of attention in recent years due to their ability to affect the 3D structure of small molecules, polyesters, peptides, and proteins. In this paper, we report the discovery of a "reciprocal" carbonyl-carbonyl interaction with substantial back and forth n→π* and π→π* electron delocalization between neighboring carbonyl groups. We have carried out experimental studies, analyses of crystallographic databases and theoretical calculations to show the presence of this interaction in both small molecules and proteins. In proteins, these interactions are primarily found in polyproline II (PPII) helices. As PPII are the most abundant secondary structures in unfolded proteins, we propose that these local interactions may have implications in protein folding. Carbonyl-carbonyl π* non covalent interactions affect the structure and stability of small molecules and proteins. Here, the authors carry out experimental studies, analyses of crystallographic databases and theoretical calculations to describe an additional type of carbonyl-carbonyl interaction.

  20. VaProS: a database-integration approach for protein/genome information retrieval

    KAUST Repository

    Gojobori, Takashi; Ikeo, Kazuho; Katayama, Yukie; Kawabata, Takeshi; Kinjo, Akira R.; Kinoshita, Kengo; Kwon, Yeondae; Migita, Ohsuke; Mizutani, Hisashi; Muraoka, Masafumi; Nagata, Koji; Omori, Satoshi; Sugawara, Hideaki; Yamada, Daichi; Yura, Kei

    2016-01-01

    Life science research now heavily relies on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein–protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that facilitates life science researchers for retrieving experts’ knowledge stored in the databases and for building a new hypothesis of the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as structural effect of the gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in quest of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/.

  1. VaProS: a database-integration approach for protein/genome information retrieval

    KAUST Repository

    Gojobori, Takashi

    2016-12-24

    Life science research now heavily relies on all sorts of databases for genome sequences, transcription, protein three-dimensional (3D) structures, protein–protein interactions, phenotypes and so forth. The knowledge accumulated by all the omics research is so vast that a computer-aided search of data is now a prerequisite for starting a new study. In addition, a combinatory search throughout these databases has a chance to extract new ideas and new hypotheses that can be examined by wet-lab experiments. By virtually integrating the related databases on the Internet, we have built a new web application that facilitates life science researchers for retrieving experts’ knowledge stored in the databases and for building a new hypothesis of the research target. This web application, named VaProS, puts stress on the interconnection between the functional information of genome sequences and protein 3D structures, such as structural effect of the gene mutation. In this manuscript, we present the notion of VaProS, the databases and tools that can be accessed without any knowledge of database locations and data formats, and the power of search exemplified in quest of the molecular mechanisms of lysosomal storage disease. VaProS can be freely accessed at http://p4d-info.nig.ac.jp/vapros/.

  2. Yeast Interacting Proteins Database: YNL189W, YGL175C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ait as prey (0) YGL175C SAE2 Endonuclease that processes hairpin DNA structures w... (0) Prey ORF YGL175C Prey gene name SAE2 Prey description Endonuclease that processes hairpin DNA structures

  3. Yeast Interacting Proteins Database: YDR490C, YGR086C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available bait as prey (0) YGR086C PIL1 Primary component of eisosomes, which are large immobile cell cortex struct...ctures associated with endocytosis; null mutants show activation of Pkc1p/Ypk1p str...y (0) Prey ORF YGR086C Prey gene name PIL1 Prey description Primary component of eisosomes, which are large immobile cell cortex stru...ures associated with endocytosis; null mutants show activation of Pkc1p/Ypk1p stres

  4. Yeast Interacting Proteins Database: YGR086C, YKL142W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGR086C PIL1 Primary component of eisosomes, which are large immobile cell cortex structures... ORF YGR086C Bait gene name PIL1 Bait description Primary component of eisosomes, which are large immobile cell cortex structures

  5. Yeast Interacting Proteins Database: YPL022W, YLR135W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available repair; cleaves branched structures in a complex with Slx1p; involved in Rad1p/Rad10p-dependent removal of ... Prey gene name SLX4 Prey description Endonuclease involved in processing DNA during recombination and repair; cleaves branched struc...tures in a complex with Slx1p; involved in Rad1p/Rad10p-dependent removal of 3'-non

  6. Yeast Interacting Proteins Database: YNL189W, YDR318W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ponent of the COMA complex (Ctf19p, Okp1p, Mcm21p, Ame1p) that bridges kinetochore subunits that are in cont...t of the COMA complex (Ctf19p, Okp1p, Mcm21p, Ame1p) that bridges kinetochore subunits that are in contact w

  7. Yeast Interacting Proteins Database: YML042W, YML042W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available xisomes, transfers activated acetyl groups to carnitine to form acetylcarnitine which can be shuttled across membranes...etyl groups to carnitine to form acetylcarnitine which can be shuttled across membranes Rows with this prey ...ne which can be shuttled across membranes Rows with this bait as bait Rows with this bait as bait (1) Rows w...ansfers activated acetyl groups to carnitine to form acetylcarnitine which can be shuttled across membranes

  8. Yeast Interacting Proteins Database: YNL189W, YLR328W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ait as prey (0) YLR328W NMA1 Nicotinic acid mononucleotide adenylyltransferase, involved in pathways... of NAD biosynthesis, including the de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways... Nicotinic acid mononucleotide adenylyltransferase, involved in pathways of NAD biosynthesis, including the ...de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways Rows with this prey as prey Rows with th

  9. Yeast Interacting Proteins Database: YGR010W, YLR328W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available 1 Nicotinic acid mononucleotide adenylyltransferase, involved in pathways of NAD biosynthesis, including the... de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways Rows with th...ne name NMA1 Prey description Nicotinic acid mononucleotide adenylyltransferase, involved in pathways of NAD... biosynthesis, including the de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways

  10. Yeast Interacting Proteins Database: YLR328W, YGR010W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YLR328W NMA1 Nicotinic acid mononucleotide adenylyltransferase, involved in pathways... of NAD biosynthesis, including the de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways Row... ORF YLR328W Bait gene name NMA1 Bait description Nicotinic acid mononucleotide adenylyltransferase, involved in pathways...otinamide riboside salvage pathways Rows with this bait as bait Rows with this bait as bait (2) Rows with th

  11. Yeast Interacting Proteins Database: YLR328W, YLR328W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YLR328W NMA1 Nicotinic acid mononucleotide adenylyltransferase, involved in pathways... of NAD biosynthesis, including the de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways Row...ylyltransferase, involved in pathways of NAD biosynthesis, including the de novo,... NAD(+) salvage, and nicotinamide riboside salvage pathways Rows with this prey as prey (4) Rows with this p... description Nicotinic acid mononucleotide adenylyltransferase, involved in pathways

  12. Yeast Interacting Proteins Database: YML064C, YLR328W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available th this bait as prey (0) YLR328W NMA1 Nicotinic acid mononucleotide adenylyltransferase, involved in pathways... of NAD biosynthesis, including the de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways...ic acid mononucleotide adenylyltransferase, involved in pathways of NAD biosynthe...sis, including the de novo, NAD(+) salvage, and nicotinamide riboside salvage pathways Rows with this prey a

  13. Yeast Interacting Proteins Database: YJL137C, YLR258W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ait as prey (1) YLR258W GSY2 Glycogen synthase, similar to Gsy1p; expression induced by glucose limitation, ...ssion induced by glucose limitation, nitrogen starvation, heat shock, and stationary phase; activity regulat

  14. Yeast Interacting Proteins Database: YEL062W, YPL255W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available iates downregulation of TOR Complex 1 activity in response to amino acid limitation; transcription is induce...regulation of TOR Complex 1 activity in response to amino acid limitation; transcription is induced in respo

  15. Yeast Interacting Proteins Database: YFR015C, YLR258W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available yeast homolog; expression induced by glucose limitation, nitrogen starvation, environmental stress, and entr...n synthase, similar to Gsy1p; expression induced by glucose limitation, nitrogen ...; expression induced by glucose limitation, nitrogen starvation, environmental stress, and entry into statio...ogen synthase, similar to Gsy1p; expression induced by glucose limitation, nitrogen starvation, heat shock,

  16. Yeast Interacting Proteins Database: YNL189W, YHR216W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available g, expression is repressed by nutrient limitation Rows with this prey as prey (1) Rows with this prey as bai...osynthesis, expression is induced by mycophenolic acid resulting in resistance to the drug, expression is repressed by nutrient limit...ation Rows with this prey as prey Rows with this prey as

  17. Yeast Interacting Proteins Database: YLR258W, YLR258W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YLR258W GSY2 Glycogen synthase, similar to Gsy1p; expression induced by glucose limitation...bait as prey (3) YLR258W GSY2 Glycogen synthase, similar to Gsy1p; expression induced by glucose limitatio...pression induced by glucose limitation, nitrogen starvation, heat shock, and stationary phase; activity regu...LR258W Bait ORF YLR258W Bait gene name GSY2 Bait description Glycogen synthase, similar to Gsy1p; expression induced by glucose limit...ation, nitrogen starvation, heat shock, and stationary phase; activity regulated by

  18. Yeast Interacting Proteins Database: YDL153C, YBR041W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available Faa4p that imports and activates exogenous fatty acids Rows with this prey as prey (1) Rows with this prey ...d transporter and very long-chain fatty acyl-CoA synthetase, may form a complex with Faa1p or Faa4p that imports and activates exogen...ous fatty acids Rows with this prey as prey Rows with this prey as prey (1) Rows wi

  19. Yeast Interacting Proteins Database: YOR171C, YOR034C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ain base phosphates, which function as signaling molecules, regulates synthesis of ceramide from exogenous l...chain base phosphates, which function as signaling molecules, regulates synthesis of ceramide from exogenous

  20. Yeast Interacting Proteins Database: YPL151C, YOR036W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available t-SNARE) for vesicular intermediates traveling between the Golgi apparatus and the vacuole; controls entry o...e PEP12 Prey description Target membrane receptor (t-SNARE) for vesicular intermediates traveling between th

  1. Yeast Interacting Proteins Database: YOR036W, YBL102W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YOR036W PEP12 Target membrane receptor (t-SNARE) for vesicular intermediates travel...me PEP12 Bait description Target membrane receptor (t-SNARE) for vesicular intermediates traveling between t

  2. Yeast Interacting Proteins Database: YOL069W, YMR294W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available complex (Ndc80p-Nuf2p-Spc24p-Spc25p); involved in chromosome segregation, spindle checkpoint activity and kinetochore clustering...heckpoint activity and kinetochore clustering Rows with this bait as bait Rows with this bait as bait (3) Ro

  3. Yeast Interacting Proteins Database: YOL069W, YNL086W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available complex (Ndc80p-Nuf2p-Spc24p-Spc25p); involved in chromosome segregation, spindle checkpoint activity and kinetochore clustering...ved in chromosome segregation, spindle checkpoint activity and kinetochore clustering Rows with this bait as

  4. Yeast Interacting Proteins Database: YGR113W, YIL144W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available indle checkpoint activity, kinetochore assembly and clustering Rows with this prey as prey (2) Rows with thi...heckpoint activity, kinetochore assembly and clustering Rows with this prey as pr

  5. Yeast Interacting Proteins Database: YPL031C, YPL219W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ting the cellular response to nutrient levels and environmental conditions and progression through the cell ...e cellular response to nutrient levels and environmental conditions and progression through the cell cycle R

  6. Yeast Interacting Proteins Database: YER059W, YDL224C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ment for passage through Start and commitment to cell division Rows with this prey as prey (1) Rows with thi... passage through Start and commitment to cell division Rows with this prey as prey Rows with this prey as pr

  7. Yeast Interacting Proteins Database: YDR439W, YCR086W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available inetochores during meiosis I to mediate accurate homolog segregation; required for condensin recruitment to ...and then Mam1p at kinetochores during meiosis I to mediate accurate homolog segregation; required for condensin recruitment...p, and then Mam1p at kinetochores during meiosis I to mediate accurate homolog segregation; required for condensin recruitment...with Lrs4p and then Mam1p at kinetochores during meiosis I to mediate accurate homolog segregation; required for condensin recruitmen

  8. Yeast Interacting Proteins Database: YFR015C, YFR015C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available yeast homolog; expression induced by glucose limitation, nitrogen starvation, environmental stress, and entr...ression induced by glucose limitation, nitrogen starvation, environmental stress, and entry into stationary ...tion, nitrogen starvation, environmental stress, and entry into stationary phase Rows with this bait as bait..., the more highly expressed yeast homolog; expression induced by glucose limitation, nitrogen starvation, environmental

  9. Yeast Interacting Proteins Database: YFR015C, YJL137C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available yeast homolog; expression induced by glucose limitation, nitrogen starvation, environmental stress, and entr...pression induced by glucose limitation, nitrogen starvation, environmental stress, and entry into stationary

  10. Yeast Interacting Proteins Database: YPL031C, YER059W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ting the cellular response to nutrient levels and environmental conditions and progression through the cell ...ers; involved in regulating the cellular response to nutrient levels and environmental conditions and progre

  11. Yeast Interacting Proteins Database: YPL031C, YIL050W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ting the cellular response to nutrient levels and environmental conditions and progression through the cell ... with ten cyclin partners; involved in regulating the cellular response to nutrient levels and environmental

  12. Yeast Interacting Proteins Database: YNL189W, YPL111W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ession responds to both induction by arginine and nitrogen catabolite repression; disruption enhances freeze... catabolite repression; disruption enhances freeze tolerance Rows with this prey as prey Rows with this prey...ginase, responsible for arginine degradation, expression responds to both induction by arginine and nitrogen

  13. Yeast Interacting Proteins Database: YML064C, YPL111W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available both induction by arginine and nitrogen catabolite repression; disruption enhanc...inine and nitrogen catabolite repression; disruption enhances freeze tolerance Rows with this prey as prey R

  14. Yeast Interacting Proteins Database: YLR175W, YNL124W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available n ortholog dyskerin cause the disorder dyskeratosis congenita Rows with this bait as bait (1) Rows with this...dyskerin cause the disorder dyskeratosis congenita Rows with this bait as bait Ro

  15. Yeast Interacting Proteins Database: YHR114W, YER096W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available required for the synthesis of the chitosan layer of ascospores; has similarity to Skt5p, which activates Ch..., required for the synthesis of the chitosan layer of ascospores; has similarity to Skt5p, which activates C

  16. Yeast Interacting Proteins Database: YER127W, YDR299W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available to inhibited pre-rRNA processing and reduced polysome levels; localizes primarily to the nucleolus Rows with...of 18S rRNA; depletion leads to inhibited pre-rRNA processing and reduced polysome levels; localizes primarily to the nucleolus

  17. Yeast Interacting Proteins Database: YER127W, YLR423C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available to inhibited pre-rRNA processing and reduced polysome levels; localizes primarily to the nucleolus Rows with...etion leads to inhibited pre-rRNA processing and reduced polysome levels; localizes primarily to the nucleolus

  18. Yeast Interacting Proteins Database: YER081W, YDL168W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available n of long chain and complex alcohols, regulated by Hog1p-Sko1p Rows with this prey as prey (1) Rows with thi...ohols, regulated by Hog1p-Sko1p Rows with this prey as prey Rows with this prey as ...dependent formaldehyde dehydrogenase activities, functions in formaldehyde detoxification and formation of long chain and complex alc

  19. Yeast Interacting Proteins Database: YGL061C, YDR016C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGL061C DUO1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...6C DAD1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...it description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force p... Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produced by MT

  20. Yeast Interacting Proteins Database: YGR113W, YGL079W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGR113W DAM1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...ntial subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produced by MT depol

  1. Yeast Interacting Proteins Database: YKR037C, YGL061C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YKR037C SPC34 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...061C DUO1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...Bait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...ion Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produced by

  2. Yeast Interacting Proteins Database: YGR113W, YDR016C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGR113W DAM1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...DR016C DAD1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force... Bait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...ription Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produced

  3. Yeast Interacting Proteins Database: YGR113W, YLR424W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGR113W DAM1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...13W Bait gene name DAM1 Bait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force

  4. Yeast Interacting Proteins Database: YGR113W, YKR037C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGR113W DAM1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...KR037C SPC34 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...M1 Bait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...escription Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produ

  5. Yeast Interacting Proteins Database: YKR037C, YDR016C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YKR037C SPC34 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...016C DAD1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...Bait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...ion Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produced by

  6. Yeast Interacting Proteins Database: YGL061C, YER016W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGL061C DUO1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force... complex (aka DASH complex), couples kinetochores to the force produced by MT depolymerization thereby aidin

  7. Yeast Interacting Proteins Database: YKR037C, YLR423C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YKR037C SPC34 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force... subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produced by MT depolymeri

  8. Yeast Interacting Proteins Database: YNL189W, YDR201W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available plex), couples kinetochores to the force produced by MT depolymerization thereby aiding in chromosome segreg...), couples kinetochores to the force produced by MT depolymerization thereby aiding in chromosome segregatio

  9. Yeast Interacting Proteins Database: YGR113W, YGL061C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YGR113W DAM1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...GL061C DUO1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...M1 Bait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...scription Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produc

  10. Yeast Interacting Proteins Database: YKR083C, YKL052C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YKR083C DAD2 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...2C ASK1 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...e name DAD2 Bait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...ey description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force p

  11. Yeast Interacting Proteins Database: YLR423C, YKR083C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available of the Dam1 complex (aka DASH complex), couples kinetochores to the force produce...lex), couples kinetochores to the force produced by MT depolymerization thereby aiding in chromosome segrega

  12. Yeast Interacting Proteins Database: YKR083C, YDR201W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YKR083C DAD2 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...1W SPC19 Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force...ait description Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force ...on Essential subunit of the Dam1 complex (aka DASH complex), couples kinetochores to the force produced by M

  13. Yeast Interacting Proteins Database: YDR034C, YGR113W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available complex (aka DASH complex), couples kinetochores to the force produced by MT depolymerization thereby aidin...Rows with this bait as prey (0) YGR113W DAM1 Essential subunit of the Dam1 complex (aka DASH complex), coupl...es kinetochores to the force produced by MT depolymerization thereby aiding in ch

  14. Yeast Interacting Proteins Database: YLR288C, YLR125W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available rotrimeric complex (Rad17p-Mec3p-Ddc1p) that forms a sliding clamp, loaded onto partial duplex DNA by a clam... complex (Rad17p-Mec3p-Ddc1p) that forms a sliding clamp, loaded onto partial duplex DNA by a clamp loader c

  15. Yeast Interacting Proteins Database: YLR288C, YKL107W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available rotrimeric complex (Rad17p-Mec3p-Ddc1p) that forms a sliding clamp, loaded onto partial duplex DNA by a clam...c1p) that forms a sliding clamp, loaded onto partial duplex DNA by a clamp loader complex; homolog of human

  16. Yeast Interacting Proteins Database: YDR311W, YLR288C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available f a heterotrimeric complex (Rad17p-Mec3p-Ddc1p) that forms a sliding clamp, loaded onto partial duplex...liding clamp, loaded onto partial duplex DNA by a clamp loader complex; homolog of human and S. pombe Hus1 R

  17. Yeast Interacting Proteins Database: YLR288C, YKL044W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available rotrimeric complex (Rad17p-Mec3p-Ddc1p) that forms a sliding clamp, loaded onto partial duplex DNA by a clam...of a heterotrimeric complex (Rad17p-Mec3p-Ddc1p) that forms a sliding clamp, loaded onto partial duplex DNA

  18. Yeast Interacting Proteins Database: YLR288C, YMR159C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available rotrimeric complex (Rad17p-Mec3p-Ddc1p) that forms a sliding clamp, loaded onto partial duplex DNA by a clam...that forms a sliding clamp, loaded onto partial duplex DNA by a clamp loader complex; homolog of human and S

  19. Yeast Interacting Proteins Database: YBR228W, YLR135W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available YBR228W SLX1 Subunit of a complex, with Slx4p, that hydrolyzes 5' branches from duplex...of a complex, with Slx4p, that hydrolyzes 5' branches from duplex DNA in response to stalled or converging r

  20. Yeast Interacting Proteins Database: YDL139C, YCR077C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ing factor; also required for faithful chromosome transmission, maintenance of rDNA locus stability, and pro...ng factor; also required for faithful chromosome transmission, maintenance of rDNA locus stability, and prot

  1. Yeast Interacting Proteins Database: YJL124C, YCR077C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available ecapping factor; also required for faithful chromosome transmission, maintenance ... Topoisomerase II-associated deadenylation-dependent mRNA-decapping factor; also required for faithful chrom

  2. Yeast Interacting Proteins Database: YDL175C, YCR077C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available II-associated deadenylation-dependent mRNA-decapping factor; also required for faith...ciated deadenylation-dependent mRNA-decapping factor; also required for faithful chromosome transmission, ma

  3. Yeast Interacting Proteins Database: YBL026W, YCR077C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available nt mRNA-decapping factor; also required for faithful chromosome transmission, maintenance of rDNA locus stab...lation-dependent mRNA-decapping factor; also required for faithful chromosome transmission, maintenance of r

  4. Yeast Interacting Proteins Database: YHR114W, YDL217C [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available y (0) YDL217C TIM22 Component of the mitochondrial Tim54p-Tim22p complex involved in insertion of polytopi...rey gene name TIM22 Prey description Component of the mitochondrial Tim54p-Tim22p complex involved in insertion of polytopic

  5. Coarse-grain modelling of protein-protein interactions

    NARCIS (Netherlands)

    Baaden, Marc; Marrink, Siewert J.

    2013-01-01

    Here, we review recent advances towards the modelling of protein-protein interactions (PPI) at the coarse-grained (CG) level, a technique that is now widely used to understand protein affinity, aggregation and self-assembly behaviour. PPI models of soluble proteins and membrane proteins are

  6. ASAView: Database and tool for solvent accessibility representation in proteins

    Directory of Open Access Journals (Sweden)

    Fawareh Hamed

    2004-05-01

    Full Text Available Abstract Background Accessible surface area (ASA), or solvent accessibility, of amino acids in a protein has important implications. Knowledge of surface residues helps in locating potential candidates of active sites. Therefore, a method to quickly see the surface residues in a two-dimensional model would help to immediately understand the population of amino acid residues on the surface and in the inner core of the proteins. Results ASAView is an algorithm, an application and a database of schematic representations of solvent accessibility of amino acid residues within proteins. A characteristic two-dimensional spiral plot of solvent accessibility provides a convenient graphical view of residues in terms of their exposed surface areas. In addition, sequential plots in the form of bar charts are also provided. Online plots of the proteins included in the entire Protein Data Bank (PDB) are provided for the entire protein as well as their chains separately. Conclusions These graphical plots of solvent accessibility are likely to provide a quick view of the overall topological distribution of residues in proteins. Chain-wise computation of solvent accessibility is also provided.

  7. Completion of autobuilt protein models using a database of protein fragments

    International Nuclear Information System (INIS)

    Cowtan, Kevin

    2012-01-01

    Two developments in the process of automated protein model building in the Buccaneer software are described: the use of a database of protein fragments in improving the model completeness and the assembly of disconnected chain fragments into complete molecules. A general-purpose library for protein fragments of arbitrary size is described, with a highly optimized search method allowing the use of a larger database than in previous work. The problem of assembling an autobuilt model into complete chains is discussed. This involves the assembly of disconnected chain fragments into complete molecules and the use of the database of protein fragments in improving the model completeness. Assembly of fragments into molecules is a standard step in existing model-building software, but the methods have not received detailed discussion in the literature.

  8. Screening of cellular proteins that interact with the classical swine ...

    Indian Academy of Sciences (India)

    In the current study, aiming to find more clues in understanding the molecular mechanisms of CSFV NS5A's function, the yeast two-hybrid (Y2H) system was adopted to screen for CSFV NS5A interactive proteins in the cDNA library of the swine umbilical vein endothelial cell (SUVEC). Alignment with the NCBI database ...

  9. Proteins interacting with cloning scars: a source of false positive protein-protein interactions.

    Science.gov (United States)

    Banks, Charles A S; Boanca, Gina; Lee, Zachary T; Florens, Laurence; Washburn, Michael P

    2015-02-23

    A common approach for exploring the interactome, the network of protein-protein interactions in cells, uses a commercially available ORF library to express affinity tagged bait proteins; these can be expressed in cells and endogenous cellular proteins that copurify with the bait can be identified as putative interacting proteins using mass spectrometry. Control experiments can be used to limit false-positive results, but in many cases, there are still a surprising number of prey proteins that appear to copurify specifically with the bait. Here, we have identified one source of false-positive interactions in such studies. We have found that a combination of: 1) the variable sequence of the C-terminus of the bait with 2) a C-terminal valine "cloning scar" present in a commercially available ORF library, can in some cases create a peptide motif that results in the aberrant co-purification of endogenous cellular proteins. Control experiments may not identify false positives resulting from such artificial motifs, as aberrant binding depends on sequences that vary from one bait to another. It is possible that such cryptic protein binding might occur in other systems using affinity tagged proteins; this study highlights the importance of conducting careful follow-up studies where novel protein-protein interactions are suspected.

  10. Applications of Protein Thermodynamic Database for Understanding Protein Mutant Stability and Designing Stable Mutants.

    Science.gov (United States)

    Gromiha, M Michael; Anoosha, P; Huang, Liang-Tsung

    2016-01-01

    Protein stability is the free energy difference between unfolded and folded states of a protein, which lies in the range of 5-25 kcal/mol. Experimentally, protein stability is measured with circular dichroism, differential scanning calorimetry, and fluorescence spectroscopy using thermal and denaturant denaturation methods. These experimental data have been accumulated in the form of a database, ProTherm, a thermodynamic database for proteins and mutants. It also contains sequence and structure information of a protein, experimental methods and conditions, and literature information. Different features such as search, display, and sorting options and visualization tools have been incorporated in the database. ProTherm is a valuable resource for understanding/predicting the stability of proteins and it can be accessed at http://www.abren.net/protherm/ . ProTherm has been effectively used to examine the relationship among thermodynamics, structure, and function of proteins. We describe the recent progress on the development of methods for understanding/predicting protein stability, such as (1) general trends on mutational effects on stability, (2) relationship between the stability of protein mutants and amino acid properties, (3) applications of protein three-dimensional structures for predicting their stability upon point mutations, (4) prediction of protein stability upon single mutations from amino acid sequence, and (5) prediction methods for addressing double mutants. A list of online resources for predicting protein stability has also been provided.

  11. Can infrared spectroscopy provide information on protein-protein interactions?

    Science.gov (United States)

    Haris, Parvez I

    2010-08-01

    For most biophysical techniques, characterization of protein-protein interactions is challenging; this is especially true with methods that rely on a physical phenomenon that is common to both of the interacting proteins. Thus, for example, in IR spectroscopy, the carbonyl vibration (1600-1700 cm⁻¹) associated with the amide bonds from both of the interacting proteins will overlap extensively, making the interpretation of spectral changes very complicated. Isotope-edited infrared spectroscopy, where one of the interacting proteins is uniformly labelled with ¹³C or ¹³C,¹⁵N, has been introduced as a solution to this problem, enabling the study of protein-protein interactions using IR spectroscopy. The large shift of the amide I band (approx. 45 cm⁻¹ towards lower frequency) upon ¹³C labelling of one of the proteins reveals the amide I band of the unlabelled protein, enabling it to be used as a probe for monitoring conformational changes. With site-specific isotopic labelling, structural resolution at the level of individual amino acid residues can be achieved. Furthermore, the ability to record IR spectra of proteins in diverse environments means that isotope-edited IR spectroscopy can be used to structurally characterize difficult systems such as protein-protein complexes bound to membranes or large insoluble peptide/protein aggregates. In the present article, examples of application of isotope-edited IR spectroscopy for studying protein-protein interactions are provided.
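
    The direction and rough size of the isotope-induced amide I shift mentioned in this abstract can be illustrated with a back-of-the-envelope estimate. The sketch below is not taken from the article: it assumes the amide I mode behaves like an isolated C=O harmonic oscillator (wavenumber proportional to 1/sqrt(reduced mass)), uses integer isotope masses, and takes 1650 cm⁻¹ as an illustrative position for the unlabelled band. The function names are hypothetical.

```python
import math


def reduced_mass(m1, m2):
    """Reduced mass of a two-atom oscillator (atomic mass units)."""
    return m1 * m2 / (m1 + m2)


def shifted_wavenumber(nu, mu_old, mu_new):
    """Harmonic-oscillator scaling: wavenumber is proportional to 1/sqrt(mu)."""
    return nu * math.sqrt(mu_old / mu_new)


# Assumed, illustrative inputs (not values from the article).
nu_12c = 1650.0                      # unlabelled amide I position, cm^-1
mu_12 = reduced_mass(12.0, 16.0)     # 12C=16O
mu_13 = reduced_mass(13.0, 16.0)     # 13C=16O

nu_13c = shifted_wavenumber(nu_12c, mu_12, mu_13)
shift = nu_12c - nu_13c              # positive means a move to lower frequency

print(f"estimated 13C band: {nu_13c:.1f} cm^-1, shift: {shift:.1f} cm^-1")
```

    Under these assumptions the estimated shift is in the high 30s of cm⁻¹, the same direction and order of magnitude as the approx. 45 cm⁻¹ quoted in the abstract. The real amide I mode is not a pure C=O stretch, so exact agreement is not expected.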

  12. Protein-Protein Interactions Prediction Based on Iterative Clique Extension with Gene Ontology Filtering

    Directory of Open Access Journals (Sweden)

    Lei Yang

    2014-01-01

    Full Text Available Cliques (maximal complete subnets) in a protein-protein interaction (PPI) network are an important resource used to analyze protein complexes and functional modules. Clique-based methods of predicting PPIs compensate for the data deficiencies of biological experiments. However, clique-based prediction methods depend only on the topology of the network. The false-positive and false-negative interactions in a network usually interfere with prediction. Therefore, we propose a method combining clique-based prediction with gene ontology (GO) annotations to overcome this shortcoming and improve the accuracy of predictions. According to different GO correcting rules, we generate two predicted interaction sets which guarantee the quality and quantity of predicted protein interactions. The proposed method is applied to the PPI network from the Database of Interacting Proteins (DIP), and most of the predicted interactions are verified by another biological database, BioGRID. The predicted protein interactions are appended to the original protein network, which leads to clique extension and shows the significance of biological meaning.
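
    The general idea of combining clique-based prediction with a GO filter can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' algorithm: the abstract does not give the extension rule or the GO correcting rules, so the sketch assumes (a) an edge is proposed when a protein outside a maximal clique is linked to all but one clique member, and (b) a proposed edge is kept only if the two proteins share at least one GO term. All function names, protein names and term labels are hypothetical.

```python
def maximal_cliques(adj):
    """Enumerate maximal cliques with the basic Bron-Kerbosch recursion.

    adj maps each node to the set of its neighbours (undirected graph).
    """
    cliques = []

    def expand(r, p, x):
        if not p and not x:
            cliques.append(frozenset(r))
            return
        for v in list(p):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(adj), set())
    return cliques


def predict_edges(adj, go_terms, min_clique=3):
    """Propose edges that would extend a clique, then filter by shared GO terms."""
    predicted = set()
    for clique in maximal_cliques(adj):
        if len(clique) < min_clique:
            continue
        for v in set(adj) - clique:
            missing = clique - adj[v]          # clique members v is NOT linked to
            if len(missing) == 1:              # assumed rule: exactly one edge missing
                (u,) = missing
                # assumed GO rule: keep the edge only if annotations overlap
                if go_terms.get(u, set()) & go_terms.get(v, set()):
                    predicted.add(frozenset((u, v)))
    return predicted


# Toy network: A-B-C form a triangle, D is linked to A and B, E only to A.
adj = {
    "A": {"B", "C", "D", "E"},
    "B": {"A", "C", "D"},
    "C": {"A", "B"},
    "D": {"A", "B"},
    "E": {"A"},
}
go_terms = {"C": {"term_x"}, "D": {"term_x", "term_y"}, "E": {"term_z"}}

print(predict_edges(adj, go_terms))
```

    In the toy network only the C-D pair is proposed, because D is one edge short of joining the A-B-C clique and shares an annotation with C. Replacing the overlap test with a semantic-similarity threshold would be one way to tighten or loosen the filter.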

  13. Evaluation of clustering algorithms for protein-protein interaction networks

    Directory of Open Access Journals (Sweden)

    van Helden Jacques

    2006-11-01

    Full Text Available Abstract Background Protein interactions are crucial components of all cellular processes. Recently, high-throughput methods have been developed to obtain a global description of the interactome (the whole network of protein interactions for a given organism). In 2002, the yeast interactome was estimated to contain up to 80,000 potential interactions. This estimate is based on the integration of data sets obtained by various methods (mass spectrometry, two-hybrid methods, genetic studies). High-throughput methods are known, however, to yield a non-negligible rate of false positives, and to miss a fraction of existing interactions. The interactome can be represented as a graph where nodes correspond with proteins and edges with pairwise interactions. In recent years clustering methods have been developed and applied in order to extract relevant modules from such graphs. These algorithms require the specification of parameters that may drastically affect the results. In this paper we present a comparative assessment of four algorithms: Markov Clustering (MCL), Restricted Neighborhood Search Clustering (RNSC), Super Paramagnetic Clustering (SPC), and Molecular Complex Detection (MCODE). Results A test graph was built on the basis of 220 complexes annotated in the MIPS database. To evaluate the robustness to false positives and false negatives, we derived 41 altered graphs by randomly removing edges from or adding edges to the test graph in various proportions. Each clustering algorithm was applied to these graphs with various parameter settings, and the clusters were compared with the annotated complexes. We analyzed the sensitivity of the algorithms to the parameters and determined their optimal parameter values. We also evaluated their robustness to alterations of the test graph. We then applied the four algorithms to six graphs obtained from high-throughput experiments and compared the resulting clusters with the annotated complexes.
Conclusion This
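
    The robustness test described in the Results of this record (altered graphs derived by randomly removing edges from, or adding edges to, a test graph) can be sketched as follows. This is a hedged illustration, not the authors' procedure: the function name and parameters are hypothetical, and the sketch assumes that both proportions are expressed relative to the number of edges in the original test graph.

```python
import random
from itertools import combinations


def alter_graph(nodes, edges, remove_frac=0.0, add_frac=0.0, seed=0):
    """Return a copy of an undirected graph with random edge removals/additions.

    Removals mimic false negatives and additions mimic false positives.
    Both fractions are taken relative to the original edge count (an assumption).
    """
    rng = random.Random(seed)
    original = {frozenset(e) for e in edges}
    ordered = sorted(original, key=sorted)      # deterministic order for sampling
    n_remove = round(remove_frac * len(ordered))
    n_add = round(add_frac * len(ordered))

    kept = set(rng.sample(ordered, len(ordered) - n_remove))
    non_edges = [
        frozenset(pair)
        for pair in combinations(sorted(nodes), 2)
        if frozenset(pair) not in original
    ]
    added = set(rng.sample(non_edges, min(n_add, len(non_edges))))
    return kept | added


# Toy test graph: one fully connected 4-protein complex plus two isolated proteins.
nodes = ["P1", "P2", "P3", "P4", "P5", "P6"]
edges = list(combinations(nodes[:4], 2))

altered = alter_graph(nodes, edges, remove_frac=0.5, add_frac=0.5, seed=1)
print(sorted(sorted(e) for e in altered))
```

    Sweeping `remove_frac` and `add_frac` over a grid of values yields a family of altered graphs; each clustering algorithm can then be run on every graph and its clusters compared with the known complexes.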

  14. PSAIA – Protein Structure and Interaction Analyzer

    Directory of Open Access Journals (Sweden)

    Vlahoviček Kristian

    2008-04-01

    Full Text Available Abstract Background PSAIA (Protein Structure and Interaction Analyzer) was developed to compute geometric parameters for large sets of protein structures in order to predict and investigate protein-protein interaction sites. Results In addition to most relevant established algorithms, PSAIA offers a new method, PIADA (Protein Interaction Atom Distance Algorithm), for the determination of residue interaction pairs. We found that PIADA produced more satisfactory results than comparable algorithms implemented in PSAIA. Particular advantages of PSAIA include its capacity to combine different methods to detect the locations and types of interactions between residues and its ability, without any further automation steps, to handle large numbers of protein structures and complexes. Generally, the integration of a variety of methods enables PSAIA to offer easier automation of analysis and greater reliability of results. PSAIA can be used either via a graphical user interface or from the command-line. Results are generated in either tabular or XML format. Conclusion In a straightforward fashion and for large sets of protein structures, PSAIA enables the calculation of protein geometric parameters and the determination of location and type for protein-protein interaction sites. XML formatted output enables easy conversion of results to various formats suitable for statistical analysis. Results from smaller data sets demonstrated the influence of geometry on protein interaction sites. Comprehensive analysis of properties of large data sets led to new information useful in the prediction of protein-protein interaction sites.

  15. Understanding Protein-Protein Interactions Using Local Structural Features

    DEFF Research Database (Denmark)

    Planas-Iglesias, Joan; Bonet, Jaume; García-García, Javier

    2013-01-01

    Protein-protein interactions (PPIs) play a relevant role among the different functions of a cell. Identifying the PPI network of a given organism (interactome) is useful to shed light on the key molecular mechanisms within a biological system. In this work, we show the role of structural features...... interacting and non-interacting protein pairs to classify the structural features that sustain the binding (or non-binding) behavior. Our study indicates that not only the interacting region but also the rest of the protein surface are important for the interaction fate. The interpretation...... to score the likelihood of the interaction between two proteins and to develop a method for the prediction of PPIs. We have tested our method on several sets with unbalanced ratios of interactions and non-interactions to simulate real conditions, obtaining accuracies higher than 25% in the most unfavorable...

  16. Biodiversity of Antarctic echinoids: a comprehensive and interactive database

    Directory of Open Access Journals (Sweden)

    Bruno David

    2005-12-01

    Full Text Available Eighty-one echinoid species are present south of the Antarctic Convergence, and they represent an important component of the benthic fauna. “Antarctic echinoids” is an interactive database synthesising the results of more than 100 years of Antarctic expeditions, and comprising information about all echinoid species. It includes illustrated keys for determination of the species, and information about their morphology and ecology (text, illustrations and glossary) and their distribution (maps and histograms of bathymetrical distribution); the sources of the information (bibliography, collections and expeditions) are also provided. All these data (taxonomic, morphologic, geographic, bathymetric…) can be interactively queried in two main ways: (1) display of listings that can be browsed, sorted according to various criteria, or printed; and (2) interactive requests crossing the different kinds of data. Many other possibilities are offered, and an on-line help file is also available.

  17. InSilico Proteomics System: Integration and Application of Protein and Protein-Protein Interaction Data using Microsoft .NET

    Directory of Open Access Journals (Sweden)

    Straßer Wolfgang

    2006-12-01

    Full Text Available In the last decades, biological databases became the major knowledge resource for researchers in the field of molecular biology. The distribution of information among these databases is one of the major problems. An overview of data access and representation of protein and protein-protein interaction data within public biological databases is given. For a comprehensive and consistent way of searching and analysing integrated protein and protein-protein interaction data, the InSilico Proteomics (ISP) project has been initiated. Its three main objectives are (1) to provide an integrated knowledge pool for data investigation and global network analysis functions for a better understanding of a cell’s interactome, (2) employment of public data for plausibility analysis and validation of in-house experimental data and (3) testing the applicability of Microsoft’s .NET architecture for bioinformatics applications. Data integrated into the ISP database can be queried through the Web portal PRIMOS (PRotein Interaction and MOlecule Search), which is freely available at http://biomis.fh-hagenberg.at/isp/primos.

  18. Ebolavirus Database: Gene and Protein Information Resource for Ebolaviruses

    Directory of Open Access Journals (Sweden)

    Rayapadi G. Swetha

    2016-01-01

    Full Text Available Ebola Virus Disease (EVD) is a life-threatening haemorrhagic fever in humans. Even though there are many reports on EVD, the protein precursor functions and virulence factors of ebolaviruses remain poorly understood. Comparative analyses of Ebolavirus genomes will help in the identification of these important features. This prompted us to develop the Ebolavirus Database (EDB), and we have provided links to various tools that will aid researchers to locate important regions in both the genomes and proteomes of Ebolavirus. The genomic analyses of ebolaviruses will provide important clues for locating the essential and core functional genes. The aim of EDB is to act as an integrated resource for ebolaviruses and we strongly believe that the database will be a useful tool for clinicians, microbiologists, health care workers, and bioscience researchers.

  19. Integral UBL domain proteins: a family of proteasome interacting proteins

    DEFF Research Database (Denmark)

    Hartmann-Petersen, Rasmus; Gordon, Colin

    2004-01-01

    The family of ubiquitin-like (UBL) domain proteins (UDPs) comprises a conserved group of proteins involved in a multitude of different cellular activities. However, recent studies on UBL-domain proteins indicate that these proteins appear to share a common property in their ability to interact...

  20. Evolutionary reprograming of protein-protein interaction specificity.

    Science.gov (United States)

    Akiva, Eyal; Babbitt, Patricia C

    2015-10-22

    Using mutation libraries and deep sequencing, Aakre et al. study the evolution of protein-protein interactions using a toxin-antitoxin model. The results indicate probable trajectories via "intermediate" proteins that are promiscuous, thus avoiding transitions via non-interactions. These results extend observations about other biological interactions and enzyme evolution, suggesting broadly general principles. Copyright © 2015 Elsevier Inc. All rights reserved.

  1. Spatial interactions database development for effective probabilistic risk assessment

    International Nuclear Information System (INIS)

    Liming, J. K.; Dunn, R. F.

    2008-01-01

    In preparation for a subsequent probabilistic risk assessment (PRA) fire risk analysis update, the STP Nuclear Operating Company (STPNOC) is updating its spatial interactions database (SID). This work is being performed to support updating the spatial interactions analysis (SIA) initially performed for the original South Texas Project Electric Generating Station (STPEGS) probabilistic safety assessment (PSA) and updated in the STPEGS Level 2 PSA and IPE Report. SIA is a large-scope screening analysis performed for nuclear power plant PRA that serves as a prerequisite basis for more detailed location-dependent, hazard-specific analyses in the PRA, such as fire risk analysis, flooding risk analysis, etc. SIA is required to support the 'completeness' argument for the PRA scope. The objectives of the current SID development effort are to update the spatial interactions analysis data, to the greatest degree practical, to be consistent with the following: the as-built plant as of December 31, 2007; the in-effect STPNOC STPEGS Units 1 and 2 PRA; the current technology and intent of NUREG/CR-6850 guidance for fire risk analysis database support; and the requirements for PRA SIA, including fire and flooding risk analysis, established by NRC Regulatory Guide 1.200 and the ASME PRA Standard (ASME RA-S-2002, updated through ASME RA-Sc-2007). This paper presents the approach and methodology for state-of-the-art SID development and applications, including an overview of the SIA process for nuclear power plant PRA. The paper shows how current relational database technology and existing, conventional station information sources can be employed to collect, process, and analyze spatial interactions data for the plant in an effective and efficient manner to meet the often challenging requirements of industry guidelines and standards such as NUREG/CR-6850, NRC Regulatory Guide 1.200, and ASME RA-S-2002 (updated through ASME RA-Sc-2007).
This paper includes tables and figures illustrating how SIA

  2. A conserved mammalian protein interaction network.

    Directory of Open Access Journals (Sweden)

    Åsa Pérez-Bercoff

    Full Text Available Physical interactions between proteins mediate a variety of biological functions, including signal transduction, physical structuring of the cell and regulation. While extensive catalogs of such interactions are known from model organisms, their evolutionary histories are difficult to study given the lack of interaction data from phylogenetic outgroups. Using phylogenomic approaches, we infer an upper bound on the time of origin for a large set of human protein-protein interactions, showing that most such interactions appear relatively ancient, dating no later than the radiation of placental mammals. By analyzing paired alignments of orthologous and putatively interacting protein-coding genes from eight mammals, we find evidence for weak but significant co-evolution, as measured by relative selective constraint, between pairs of genes with interacting proteins. However, we find no strong evidence for shared instances of directional selection within an interacting pair. Finally, we use a network approach to show that the distribution of selective constraint across the protein interaction network is non-random, with a clear tendency for interacting proteins to share similar selective constraints. Collectively, the results suggest that, on the whole, protein interactions in mammals are under selective constraint, presumably due to their functional roles.

  3. Protein (Cyanobacteria) - PGDBj - Ortholog DB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)


  4. EXPANDING ACADEMIC VOCABULARY WITH AN INTERACTIVE ON-LINE DATABASE

    Directory of Open Access Journals (Sweden)

    Marlise Horst

    2005-05-01

    Full Text Available University students used a set of existing and purpose-built on-line tools for vocabulary learning in an experimental ESL course. The resources included concordance, dictionary, cloze-builder, hypertext, and a database with interactive self-quizzing feature (all freely available at www.lextutor.ca). The vocabulary targeted for learning consisted of (a) Coxhead's (2000) Academic Word List, a list of items that occur frequently in university textbooks, and (b) unfamiliar words students had met in academic texts and selected for entry into the class database. The suite of tools was designed to foster retention by engaging learners in deep processing, an aspect that is often described as missing in computer exercises for vocabulary learning. Database entries were examined to determine whether context sentences supported word meanings adequately and whether entered words reflected the unavailability of cognates in the various first languages of the participants. Pre- and post-treatment performance on tests of knowledge of words targeted for learning in the course was compared to establish learning gains. Regression analyses investigated connections between use of specific computer tools and gains.

  5. Prediction of heterodimeric protein complexes from weighted protein-protein interaction networks using novel features and kernel functions.

    Directory of Open Access Journals (Sweden)

    Peiying Ruan

    Full Text Available Since many proteins express their functional activity by interacting with other proteins and forming protein complexes, it is very useful to identify sets of proteins that form complexes. For that purpose, many prediction methods for protein complexes from protein-protein interactions have been developed such as MCL, MCODE, RNSC, PCP, RRW, and NWE. These methods have dealt with only complexes with size of more than three because the methods often are based on some density of subgraphs. However, heterodimeric protein complexes that consist of two distinct proteins occupy a large part according to several comprehensive databases of known complexes. In this paper, we propose several feature space mappings from protein-protein interaction data, in which each interaction is weighted based on reliability. Furthermore, we make use of prior knowledge on protein domains to develop feature space mappings, domain composition kernel and its combination kernel with our proposed features. We perform ten-fold cross-validation computational experiments. These results suggest that our proposed kernel considerably outperforms the naive Bayes-based method, which is the best existing method for predicting heterodimeric protein complexes.

  6. Protein-protein interactions and cancer: targeting the central dogma.

    Science.gov (United States)

    Garner, Amanda L; Janda, Kim D

    2011-01-01

    Between 40,000 and 200,000 protein-protein interactions have been predicted to exist within the human interactome. As these interactions are critical to many important cellular functions and their dysregulation can cause disease, the modulation of these binding events has emerged as a leading, yet difficult, therapeutic arena. In particular, the targeting of protein-protein interactions relevant to cancer is of fundamental importance, as the tumor-promoting function of several aberrantly expressed proteins in the cancerous state results directly from their ability to interact with a protein-binding partner. Of significance, these protein complexes play a crucial role in each of the steps of the central dogma of molecular biology, the fundamental processes of genetic transmission. With the many important discoveries being made regarding the mechanisms of these genetic processes, the identification of new chemical probes is needed to better understand and validate the druggability of protein-protein interactions related to the central dogma. In this review, we provide an overview of current small molecule-based protein-protein interaction inhibitors for each stage of the central dogma: transcription, mRNA splicing and translation. Importantly, through our analysis we have uncovered a lack of necessary probes targeting mRNA splicing and translation, thus opening up the possibility for expansion of these fields.

  7. HitPredict version 4: comprehensive reliability scoring of physical protein-protein interactions from more than 100 species.

    Science.gov (United States)

    López, Yosvany; Nakai, Kenta; Patil, Ashwini

    2015-01-01

    HitPredict is a consolidated resource of experimentally identified, physical protein-protein interactions with confidence scores to indicate their reliability. The study of genes and their inter-relationships using methods such as network and pathway analysis requires high quality protein-protein interaction information. Extracting reliable interactions from most of the existing databases is challenging because they either contain only a subset of the available interactions, or a mixture of physical, genetic and predicted interactions. Automated integration of interactions is further complicated by varying levels of accuracy of database content and lack of adherence to standard formats. To address these issues, the latest version of HitPredict provides a manually curated dataset of 398 696 physical associations between 70 808 proteins from 105 species. Manual confirmation was used to resolve all issues encountered during data integration. For improved reliability assessment, this version combines a new score derived from the experimental information of the interactions with the original score based on the features of the interacting proteins. The combined interaction score performs better than either of the individual scores in HitPredict as well as the reliability score of another similar database. HitPredict provides a web interface to search proteins and visualize their interactions, and the data can be downloaded for offline analysis. Data usability has been enhanced by mapping protein identifiers across multiple reference databases. Thus, the latest version of HitPredict provides a significantly larger, more reliable and usable dataset of protein-protein interactions from several species for the study of gene groups. Database URL: http://hintdb.hgc.jp/htp. © The Author(s) 2015. Published by Oxford University Press.

  8. Mass spectrometric analysis of protein interactions

    DEFF Research Database (Denmark)

    Borch, Jonas; Jørgensen, Thomas J. D.; Roepstorff, Peter

    2005-01-01

    Mass spectrometry is a powerful tool for identification of interaction partners and structural characterization of protein interactions because of its high sensitivity, mass accuracy and tolerance towards sample heterogeneity. Several tools that allow studies of protein interaction are now available and recent developments that increase the confidence of studies of protein interaction by mass spectrometry include quantification of affinity-purified proteins by stable isotope labeling and reagents for surface topology studies that can be identified by mass-contributing reporters (e.g. isotope labels, cleavable cross-linkers or fragment ions). The use of mass spectrometers to study protein interactions using deuterium exchange and for analysis of intact protein complexes recently has progressed considerably.

  9. Human cancer protein-protein interaction network: a structural perspective.

    Directory of Open Access Journals (Sweden)

    Gozde Kar

    2009-12-01

    Full Text Available Protein-protein interaction networks provide a global picture of cellular function and biological processes. Some proteins act as hub proteins, highly connected to others, whereas some others have few interactions. The dysfunction of some interactions causes many diseases, including cancer. Proteins interact through their interfaces. Therefore, studying the interface properties of cancer-related proteins will help explain their role in the interaction networks. Similar or overlapping binding sites should be used repeatedly in single interface hub proteins, making them promiscuous. Alternatively, multi-interface hub proteins make use of several distinct binding sites to bind to different partners. We propose a methodology to integrate protein interfaces into cancer interaction networks (ciSPIN, cancer structural protein interface network). The interactions in the human protein interaction network are replaced by interfaces, coming from either known or predicted complexes. We provide a detailed analysis of cancer related human protein-protein interfaces and the topological properties of the cancer network. The results reveal that cancer-related proteins have smaller, more planar, more charged and less hydrophobic binding sites than non-cancer proteins, which may indicate low affinity and high specificity of the cancer-related interactions. We also classified the genes in ciSPIN according to phenotypes. Within phenotypes, for breast cancer, colorectal cancer and leukemia, interface properties were found to be discriminating from non-cancer interfaces with an accuracy of 71%, 67%, 61%, respectively. In addition, cancer-related proteins tend to interact with their partners through distinct interfaces, corresponding mostly to multi-interface hubs, which comprise 56% of cancer-related proteins, and constituting the nodes with higher essentiality in the network (76%). We illustrate the interface related affinity properties of two cancer-related hub

  10. Integrating protein structures and precomputed genealogies in the Magnum database: Examples with cellular retinoid binding proteins

    Directory of Open Access Journals (Sweden)

    Bradley Michael E

    2006-02-01

    Full Text Available Abstract Background When accurate models for the divergent evolution of protein sequences are integrated with complementary biological information, such as folded protein structures, analyses of the combined data often lead to new hypotheses about molecular physiology. This represents an excellent example of how bioinformatics can be used to guide experimental research. However, progress in this direction has been slowed by the lack of a publicly available resource suitable for general use. Results The precomputed Magnum database offers a solution to this problem for ca. 1,800 full-length protein families with at least one crystal structure. The Magnum deliverables include (1) multiple sequence alignments, (2) mapping of alignment sites to crystal structure sites, (3) phylogenetic trees, (4) inferred ancestral sequences at internal tree nodes, and (5) amino acid replacements along tree branches. Comprehensive evaluations revealed that the automated procedures used to construct Magnum produced accurate models of how proteins divergently evolve, or genealogies, and correctly integrated these with the structural data. To demonstrate Magnum's capabilities, we asked for amino acid replacements requiring three nucleotide substitutions, located at internal protein structure sites, and occurring on short phylogenetic tree branches. In the cellular retinoid binding protein family a site that potentially modulates ligand binding affinity was discovered. Recruitment of cellular retinol binding protein to function as a lens crystallin in the diurnal gecko afforded another opportunity to showcase the predictive value of a browsable database containing branch replacement patterns integrated with protein structures. Conclusion We integrated two areas of protein science, evolution and structure, on a large scale and created a precomputed database, known as Magnum, which is the first freely available resource of its kind. Magnum provides evolutionary and structural

  11. Molecular simulations of lipid-mediated protein-protein interactions

    NARCIS (Netherlands)

    de Meyer, F.J.M.; Venturoli, M.; Smit, B.

    2008-01-01

    Recent experimental results revealed that lipid-mediated interactions due to hydrophobic forces may be important in determining the protein topology after insertion in the membrane, in regulating the protein activity, in protein aggregation and in signal transduction. To gain insight into the

  12. CPLA 1.0: an integrated database of protein lysine acetylation.

    Science.gov (United States)

    Liu, Zexian; Cao, Jun; Gao, Xinjiao; Zhou, Yanhong; Wen, Longping; Yang, Xiangjiao; Yao, Xuebiao; Ren, Jian; Xue, Yu

    2011-01-01

    As a reversible post-translational modification (PTM) discovered decades ago, protein lysine acetylation was known for its regulation of transcription through the modification of histones. Recent studies discovered that lysine acetylation targets broad substrates and especially plays an essential role in cellular metabolic regulation. Although acetylation is comparable with other major PTMs such as phosphorylation, an integrated resource still remains to be developed. In this work, we presented the compendium of protein lysine acetylation (CPLA) database for lysine acetylated substrates with their sites. From the scientific literature, we manually collected 7151 experimentally identified acetylation sites in 3311 targets. We statistically studied the regulatory roles of lysine acetylation by analyzing the Gene Ontology (GO) and InterPro annotations. Combined with protein-protein interaction information, we systematically discovered a potential human lysine acetylation network (HLAN) among histone acetyltransferases (HATs), substrates and histone deacetylases (HDACs). In particular, there are 1862 triplet relationships of HAT-substrate-HDAC retrieved from the HLAN, at least 13 of which were previously experimentally verified. The online service of the CPLA database was implemented in PHP + MySQL + JavaScript, while the local packages were developed in JAVA 1.5 (J2SE 5.0). The CPLA database is freely available for all users at: http://cpla.biocuckoo.org.

  13. An Interactive Multi-instrument Database of Solar Flares

    Energy Technology Data Exchange (ETDEWEB)

    Sadykov, Viacheslav M; Kosovichev, Alexander G; Oria, Vincent; Nita, Gelu M [Center for Computational Heliophysics, New Jersey Institute of Technology, Newark, NJ 07102 (United States)

    2017-07-01

    Solar flares are complicated physical phenomena that are observable in a broad range of the electromagnetic spectrum, from radio waves to γ-rays. For a more comprehensive understanding of flares, it is necessary to perform a combined multi-wavelength analysis using observations from many satellites and ground-based observatories. For an efficient data search, integration of different flare lists, and representation of observational data, we have developed the Interactive Multi-Instrument Database of Solar Flares (IMIDSF, https://solarflare.njit.edu/). The web-accessible database is fully functional and allows the user to search for uniquely identified flare events based on their physical descriptors and the availability of observations by a particular set of instruments. Currently, the data from three primary flare lists (Geostationary Operational Environmental Satellites, RHESSI, and HEK) and a variety of other event catalogs (Hinode, Fermi GBM, Konus-WIND, the OVSA flare catalogs, the CACTus CME catalog, the Filament eruption catalog) and observing logs (IRIS and Nobeyama coverage) are integrated, and an additional set of physical descriptors (temperature and emission measure) is provided along with an observing summary, data links, and multi-wavelength light curves for each flare event since 2002 January. We envision that this new tool will allow researchers to significantly speed up the search of events of interest for statistical and case studies.

  14. Protein Structural Change Data - PSCDB | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Full Text Available Data detail. Data name: Protein Structural Change Data DO...

  15. Information assessment on predicting protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Gerstein Mark

    2004-10-01

    Full Text Available Abstract Background Identifying protein-protein interactions is fundamental for understanding the molecular machinery of the cell. Proteome-wide studies of protein-protein interactions are of significant value, but the high-throughput experimental technologies suffer from high rates of both false positive and false negative predictions. In addition to high-throughput experimental data, many diverse types of genomic data can help predict protein-protein interactions, such as mRNA expression, localization, essentiality, and functional annotation. Evaluations of the information contributions from different evidences help to establish more parsimonious models with comparable or better prediction accuracy, and to obtain biological insights of the relationships between protein-protein interactions and other genomic information. Results Our assessment is based on the genomic features used in a Bayesian network approach to predict protein-protein interactions genome-wide in yeast. In the special case, when one does not have any missing information about any of the features, our analysis shows that there is a larger information contribution from the functional-classification than from expression correlations or essentiality. We also show that in this case alternative models, such as logistic regression and random forest, may be more effective than Bayesian networks for predicting interactions. Conclusions In the restricted problem posed by the complete-information subset, we identified the MIPS and Gene Ontology (GO) functional similarity datasets as the dominating information contributors for predicting the protein-protein interactions under the framework proposed by Jansen et al. Random forests based on the MIPS and GO information alone can give highly accurate classifications. In this particular subset of complete information, adding other genomic data does little for improving predictions. We also found that the data discretizations used in the
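
    The "information contribution" of a single feature, as discussed in the abstract above, can be illustrated with mutual information between a discretized feature and the interaction label. The sketch below is only an illustration under that assumption: the function name and the toy data are hypothetical, and the abstract does not state which estimator the authors used.

```python
import math
from collections import Counter


def mutual_information(features, labels):
    """Mutual information I(F; L) in bits between two equal-length
    sequences of discrete values (hypothetical helper, not from the paper)."""
    if len(features) != len(labels):
        raise ValueError("features and labels must have the same length")
    n = len(features)
    joint = Counter(zip(features, labels))  # counts of (feature, label) pairs
    f_counts = Counter(features)            # marginal counts of feature values
    l_counts = Counter(labels)              # marginal counts of label values
    mi = 0.0
    for (f, l), c in joint.items():
        p_joint = c / n
        # p(f, l) / (p(f) * p(l)) simplifies to c * n / (count_f * count_l)
        mi += p_joint * math.log2(c * n / (f_counts[f] * l_counts[l]))
    return mi


# Toy data: 1 = interacting pair, 0 = non-interacting pair (made up).
labels = [0, 1, 0, 1]
informative_feature = [0, 1, 0, 1]    # perfectly tracks the label
uninformative_feature = [0, 0, 1, 1]  # independent of the label

print(mutual_information(informative_feature, labels))
print(mutual_information(uninformative_feature, labels))
```

    A feature that tracks the label perfectly carries the full label entropy, while an independent feature carries none; real genomic features fall in between.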

  16. A Mesoscopic Model for Protein-Protein Interactions in Solution

    OpenAIRE

    Lund, Mikael; Jönsson, Bo

    2003-01-01

    Protein self-association may be detrimental in biological systems, but can be utilized in a controlled fashion for protein crystallization. It is hence of considerable interest to understand how factors like solution conditions prevent or promote aggregation. Here we present a computational model describing interactions between protein molecules in solution. The calculations are based on a molecular description capturing the detailed structure of the protein molecule using x-ray or nuclear ma...

  17. Interaction between plate make and protein in protein crystallisation screening.

    Directory of Open Access Journals (Sweden)

    Gordon J King

    Full Text Available BACKGROUND: Protein crystallisation screening involves the parallel testing of large numbers of candidate conditions with the aim of identifying conditions suitable as a starting point for the production of diffraction quality crystals. Generally, condition screening is performed in 96-well plates. While previous studies have examined the effects of protein construct, protein purity, or crystallisation condition ingredients on protein crystallisation, few have examined the effect of the crystallisation plate. METHODOLOGY/PRINCIPAL FINDINGS: We performed a statistically rigorous examination of protein crystallisation, and evaluated interactions between crystallisation success and plate row/column, different plates of same make, different plate makes and different proteins. From our analysis of protein crystallisation, we found a significant interaction between plate make and the specific protein being crystallised. CONCLUSIONS/SIGNIFICANCE: Protein crystal structure determination is the principal method for determining protein structure but is limited by the need to produce crystals of the protein under study. Many important proteins are difficult to crystallize, so that identification of factors that assist crystallisation could open up the structure determination of these more challenging targets. Our findings suggest that protein crystallisation success may be improved by matching a protein with its optimal plate make.

  18. Mapping Protein-Protein Interactions by Quantitative Proteomics

    DEFF Research Database (Denmark)

    Dengjel, Joern; Kratchmarova, Irina; Blagoev, Blagoy

    2010-01-01

    Proteins exert their function inside a cell generally in multiprotein complexes. These complexes are highly dynamic structures changing their composition over time and cell state. The same protein may thereby fulfill different functions depending on its binding partners. Quantitative mass spectrometry (MS)-based proteomics in combination with affinity purification protocols has become the method of choice to map and track the dynamic changes in protein-protein interactions, including the ones occurring during cellular signaling events. Different quantitative MS strategies have been used to characterize protein interaction networks. In this chapter we describe in detail the use of stable isotope labeling by amino acids in cell culture (SILAC) for the quantitative analysis of stimulus-dependent dynamic protein interactions.

  19. Building blocks for protein interaction devices

    Science.gov (United States)

    Grünberg, Raik; Ferrar, Tony S.; van der Sloot, Almer M.; Constante, Marco; Serrano, Luis

    2010-01-01

    Here, we propose a framework for the design of synthetic protein networks from modular protein–protein or protein–peptide interactions and provide a starter toolkit of protein building blocks. Our proof of concept experiments outline a general work flow for part–based protein systems engineering. We streamlined the iterative BioBrick cloning protocol and assembled 25 synthetic multidomain proteins each from seven standardized DNA fragments. A systematic screen revealed two main factors controlling protein expression in Escherichia coli: obstruction of translation initiation by mRNA secondary structure or toxicity of individual domains. Eventually, 13 proteins were purified for further characterization. Starting from well-established biotechnological tools, two general–purpose interaction input and two readout devices were built and characterized in vitro. Constitutive interaction input was achieved with a pair of synthetic leucine zippers. The second interaction was drug-controlled utilizing the rapamycin-induced binding of FRB(T2098L) to FKBP12. The interaction kinetics of both devices were analyzed by surface plasmon resonance. Readout was based on Förster resonance energy transfer between fluorescent proteins and was quantified for various combinations of input and output devices. Our results demonstrate the feasibility of parts-based protein synthetic biology. Additionally, we identify future challenges and limitations of modular design along with approaches to address them. PMID:20215443

  20. 1.15 - Structural Chemogenomics Databases to Navigate Protein–Ligand Interaction Space

    NARCIS (Netherlands)

    Kanev, G.K.; Kooistra, A.J.; de Esch, I.J.P.; de Graaf, C.

    2017-01-01

    Structural chemogenomics databases allow the integration and exploration of heterogeneous genomic, structural, chemical, and pharmacological data in order to extract useful information that is applicable for the discovery of new protein targets and biologically active molecules. Integrated databases

  1. ZifBASE: a database of zinc finger proteins and associated resources

    Directory of Open Access Journals (Sweden)

    Punetha Ankita

    2009-09-01

    Full Text Available Abstract Background Information on the occurrence of zinc finger protein motifs in genomes is crucial to the developing field of molecular genome engineering. The knowledge of their target DNA-binding sequences is vital to develop chimeric proteins for targeted genome engineering and site-specific gene correction. There is a need to develop a computational resource of zinc finger proteins (ZFP) to identify the potential binding sites and its location, which reduce the time of in vivo task, and overcome the difficulties in selecting the specific type of zinc finger protein and the target site in the DNA sequence. Description ZifBASE provides an extensive collection of various natural and engineered ZFP. It uses standard names and a genetic and structural classification scheme to present data retrieved from UniProtKB, GenBank, Protein Data Bank, ModBase, Protein Model Portal and the literature. It also incorporates specialized features of ZFP including finger sequences and positions, number of fingers, physiochemical properties, classes, framework, PubMed citations with links to experimental structures (PDB, if available) and modeled structures of natural zinc finger proteins. ZifBASE provides information on zinc finger proteins (both natural and engineered ones), the number of finger units in each of the zinc finger proteins (with multiple fingers), the synergy between the adjacent fingers and their positions. Additionally, it gives the individual finger sequence and their target DNA site to which it binds for better and clear understanding on the interactions of adjacent fingers. The current version of ZifBASE contains 139 entries of which 89 are engineered ZFPs, containing 3-7F totaling to 296 fingers. There are 50 natural zinc finger protein entries ranging from 2-13F, totaling to 307 fingers. It has sequences and structures from literature, Protein Data Bank, ModBase and Protein Model Portal. The interface is cross linked to other public

  2. Phthalic Acid Chemical Probes Synthesized for Protein-Protein Interaction Analysis

    Directory of Open Access Journals (Sweden)

    Chin-Jen Wu

    2013-06-01

    Full Text Available Plasticizers are additives that are used to increase the flexibility of plastic during manufacturing. However, in injection molding processes, plasticizers cannot be generated with monomers because they can peel off from the plastics into the surrounding environment, water, or food, or become attached to skin. Among the various plasticizers that are used, 1,2-benzenedicarboxylic acid (phthalic acid) is a typical precursor to generate phthalates. In addition, phthalic acid is a metabolite of diethylhexyl phthalate (DEHP). According to the Gene_Ontology gene/protein database, phthalates can cause genital diseases, cardiotoxicity, hepatotoxicity, nephrotoxicity, etc. In this study, a silanized linker (3-aminopropyl triethoxysilane, APTES) was deposited on silicon dioxide (SiO2) particles and phthalate chemical probes were manufactured from phthalic acid and APTES–SiO2. These probes could be used for detecting proteins that targeted phthalic acid and for protein-protein interactions. The phthalic acid chemical probes we produced were incubated with epithelioid cell lysates of normal rat kidney (NRK-52E) cells to detect the interactions between phthalic acid and NRK-52E extracted proteins. These chemical probes interacted with a number of chaperones such as protein disulfide-isomerase A6, heat shock proteins, and Serpin H1. Ingenuity Pathways Analysis (IPA) software showed that these chemical probes were a practical technique for protein-protein interaction analysis.

  3. Selection of peptides interfering with protein-protein interaction.

    Science.gov (United States)

    Gaida, Annette; Hagemann, Urs B; Mattay, Dinah; Räuber, Christina; Müller, Kristian M; Arndt, Katja M

    2009-01-01

    Cell physiology depends on a fine-tuned network of protein-protein interactions, and misguided interactions are often associated with various diseases. Consequently, peptides, which are able to specifically interfere with such adventitious interactions, are of high interest for analytical as well as medical purposes. One of the most abundant protein interaction domains is the coiled-coil motif, and thus provides a premier target. Coiled coils, which consist of two or more alpha-helices wrapped around each other, have one of the simplest interaction interfaces, yet they are able to confer highly specific homo- and heterotypic interactions involved in virtually any cellular process. While there are several ways to generate interfering peptides, the combination of library design with a powerful selection system seems to be one of the most effective and promising approaches. This chapter guides through all steps of such a process, starting with library options and cloning, detailing suitable selection techniques and ending with purification for further down-stream characterization. Such generated peptides will function as versatile tools to interfere with the natural function of their targets thereby illuminating their down-stream signaling and, in general, promoting understanding of factors leading to specificity and stability in protein-protein interactions. Furthermore, peptides interfering with medically relevant proteins might become important diagnostics and therapeutics.

  4. Characterising non-covalent interactions with the Cambridge Structural Database.

    Science.gov (United States)

    Lommerse, J P; Taylor, R

    1997-02-01

    This review describes how the CSD can be used to study non-covalent interactions. Several different types of information may be obtained. First, the relative frequencies of various interactions can be studied; for example, we have shown that the terminal oxygen atoms of phosphate groups accept hydrogen bonds far more often than the linkage oxygens. Secondly, information can be obtained about the geometries of nonbonded contacts; for example, hydrogen bonds to P-O groups rarely form along the extension of the P-O bond, whereas short contacts between oxygen and carbon-bound iodine show a strong preference for linear C-I ... O angles. Thirdly, the CSD can be searched for novel interactions which may be exploited in inhibitor design; for example, the I ... O contacts just mentioned, and N-H ... pi hydrogen bonds. Finally, the CSD can suggest synthetic targets for medicinal chemistry; for example, molecules containing delocalised electron deficient groups such as trimethylammonium, pyridinium, thiazolium and dinitrophenyl have a good chance of binding to an active-site tryptophan. Although the CSD contains small-molecule crystal structures, not protein-ligand complexes, there is considerable evidence that the contacts seen in the two types of structures are similar. We have illustrated this a number of times in the present review and additional evidence has been given previously by Klebe. The major advantages of the CSD are its size, diversity and experimental accuracy. For these reasons, it is a useful tool for modellers engaged in rational inhibitor design.

  5. On the role of electrostatics on protein-protein interactions

    Science.gov (United States)

    Zhang, Zhe; Witham, Shawn; Alexov, Emil

    2011-01-01

    The role of electrostatics on protein-protein interactions and binding is reviewed in this article. A brief outline of the computational modeling, in the framework of continuum electrostatics, is presented and basic electrostatic effects occurring upon the formation of the complex are discussed. The role of the salt concentration and pH of the water phase on protein-protein binding free energy is demonstrated and indicates that the increase of the salt concentration tends to weaken the binding, an observation that is attributed to the optimization of the charge-charge interactions across the interface. It is pointed out that the pH-optimum (pH of optimal binding affinity) varies among the protein-protein complexes, and perhaps is a result of their adaptation to particular subcellular compartment. At the end, the similarities and differences between hetero- and homo-complexes are outlined and discussed with respect to the binding mode and charge complementarity. PMID:21572182

  6. Protein-Protein Interactions (PPI) reagents: | Office of Cancer Genomics

    Science.gov (United States)

    The CTD2 Center at Emory University has a library of genes used to study protein-protein interactions in mammalian cells. These genes are cloned in different mammalian expression vectors. A list of available cancer-associated genes can be accessed below.

  7. Protein-Protein Interaction Reagents | Office of Cancer Genomics

    Science.gov (United States)

    The CTD2 Center at Emory University has a library of genes used to study protein-protein interactions in mammalian cells. These genes are cloned in different mammalian expression vectors. A list of available cancer-associated genes can be accessed below. Emory_CTD^2_PPI_Reagents.xlsx Contact: Haian Fu

  8. Efficiency of Database Search for Identification of Mutated and Modified Proteins via Mass Spectrometry

    OpenAIRE

    Pevzner, Pavel A.; Mulyukov, Zufar; Dancik, Vlado; Tang, Chris L

    2001-01-01

    Although protein identification by matching tandem mass spectra (MS/MS) against protein databases is a widespread tool in mass spectrometry, the question about reliability of such searches remains open. Absence of rigorous significance scores in MS/MS database search makes it difficult to discard random database hits and may lead to erroneous protein identification, particularly in the case of mutated or post-translationally modified peptides. This problem is especially important for high-thr...

  9. Non-interacting surface solvation and dynamics in protein-protein interactions

    NARCIS (Netherlands)

    Visscher, Koen M.; Kastritis, Panagiotis L.; Bonvin, Alexandre M J J

    2015-01-01

    Protein-protein interactions control a plethora of cellular processes, including cell proliferation, differentiation, apoptosis, and signal transduction. Understanding how and why proteins interact will inevitably lead to novel structure-based drug design methods, as well as design of de novo

  10. Databases

    Digital Repository Service at National Institute of Oceanography (India)

    Kunte, P.D.

    Information on bibliographic as well as numeric/textual databases relevant to coastal geomorphology has been included in a tabular form. Databases cover a broad spectrum of related subjects like coastal environment and population aspects, coastline...

  11. Water-Protein Interactions: The Secret of Protein Dynamics

    Directory of Open Access Journals (Sweden)

    Silvia Martini

    2013-01-01

    Full Text Available Water-protein interactions help to maintain flexible conformation conditions which are required for multifunctional protein recognition processes. The intimate relationship between the protein surface and hydration water can be analyzed by studying experimental water properties measured in protein systems in solution. In particular, proteins in solution modify the structure and the dynamics of the bulk water at the solute-solvent interface. The ordering effects of proteins on hydration water are extended for several angstroms. In this paper we propose a method for analyzing the dynamical properties of the water molecules present in the hydration shells of proteins. The approach is based on the analysis of the effects of protein-solvent interactions on water protons NMR relaxation parameters. NMR relaxation parameters, especially the nonselective (R1NS) and selective (R1SE) spin-lattice relaxation rates of water protons, are useful for investigating the solvent dynamics at the macromolecule-solvent interfaces as well as the perturbation effects caused by the water-macromolecule interactions on the solvent dynamical properties. In this paper we demonstrate that Nuclear Magnetic Resonance Spectroscopy can be used to determine the dynamical contributions of proteins to the water molecules belonging to their hydration shells.

  12. DSFL database: A hub of target proteins of Leishmania sp. to combat leishmaniasis

    Directory of Open Access Journals (Sweden)

    Ameer Khusro

    2017-07-01

    Full Text Available Leishmaniasis is a vector-borne chronic infectious tropical dermal disease caused by the protozoan parasite of the genus Leishmania that causes high mortality globally. Among three different clinical forms of leishmaniasis, visceral leishmaniasis (VL) or kala-azar is a systemic public health disease with high morbidity and mortality in developing countries, caused by Leishmania donovani, Leishmania infantum or Leishmania chagasi. Unfortunately, there is no vaccine available to date for the treatment of leishmaniasis. On the other hand, the therapeutics approved to treat this fatal disease are expensive, toxic, and associated with serious side effects. Furthermore, the emergence of drug-resistant Leishmania parasites in most endemic countries due to the incessant utilization of existing drugs is a major concern at present. Drug Search for Leishmaniasis (DSFL) is a unique database that involves 50 crystallized target proteins of varied Leishmania sp. in order to develop new drugs in future by interacting several antiparasitic compounds or molecules with specific protein through computational tools. The structure of target protein from different Leishmania sp. is available in this database. In this review, we spotlighted not only the current global status of leishmaniasis in brief but also detailed information about target proteins of various Leishmania sp. available in DSFL. DSFL has created a new expectation for mankind in order to combat leishmaniasis by targeting parasitic proteins and commence a new era to get rid of drug-resistant parasites. The database will substantiate to be a worthwhile project for further development of new, non-toxic, and cost-effective antileishmanial drugs as targeted therapies using in vitro/in vivo assays.

  13. Interactions between whey proteins and kaolinite surfaces

    International Nuclear Information System (INIS)

    Barral, S.; Villa-Garcia, M.A.; Rendueles, M.; Diaz, M.

    2008-01-01

    The nature of the interactions between whey proteins and kaolinite surfaces was investigated by adsorption-desorption experiments at room temperature, performed at the isoelectric point (IEP) of the proteins and at pH 7. It was found that kaolinite is a strong adsorbent for proteins, reaching the maximum adsorption capacity at the IEP of each protein. At pH 7.0, the retention capacity decreased considerably. The adsorption isotherms showed typical Langmuir characteristics. X-ray diffraction data for the protein-kaolinite complexes showed that protein molecules were not intercalated in the mineral structure, but immobilized at the external surfaces and the edges of the kaolinite. Fourier transform IR results indicate the absence of hydrogen bonding between kaolinite surfaces and the polypeptide chain. The adsorption patterns appear to be related to electrostatic interactions, although steric effects should be also considered.

  14. Interactions between whey proteins and kaolinite surfaces

    Energy Technology Data Exchange (ETDEWEB)

    Barral, S. [Department of Chemical Engineering and Environmental Technology, University of Oviedo, Julian Claveria 8, 33006 Oviedo (Spain); Villa-Garcia, M.A. [Department of Organic and Inorganic Chemistry, University of Oviedo, Julian Claveria 8, 33006 Oviedo (Spain)], E-mail: mavg@uniovi.es; Rendueles, M. [Project Management Area, University of Oviedo, Independencia 13, 33004 Oviedo (Spain); Diaz, M. [Department of Chemical Engineering and Environmental Technology, University of Oviedo, Julian Claveria 8, 33006 Oviedo (Spain)

    2008-07-15

    The nature of the interactions between whey proteins and kaolinite surfaces was investigated by adsorption-desorption experiments at room temperature, performed at the isoelectric point (IEP) of the proteins and at pH 7. It was found that kaolinite is a strong adsorbent for proteins, reaching the maximum adsorption capacity at the IEP of each protein. At pH 7.0, the retention capacity decreased considerably. The adsorption isotherms showed typical Langmuir characteristics. X-ray diffraction data for the protein-kaolinite complexes showed that protein molecules were not intercalated in the mineral structure, but immobilized at the external surfaces and the edges of the kaolinite. Fourier transform IR results indicate the absence of hydrogen bonding between kaolinite surfaces and the polypeptide chain. The adsorption patterns appear to be related to electrostatic interactions, although steric effects should be also considered.
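
    For readers unfamiliar with the term, "typical Langmuir characteristics" usually refers to the isotherm form sketched below. The symbols are generic textbook notation and are assumptions here, not taken from the abstract: q_e is the amount of protein adsorbed per unit mass of kaolinite at equilibrium, C_e the equilibrium protein concentration, q_max the monolayer (maximum) adsorption capacity, and K_L the Langmuir affinity constant.

```latex
q_e = \frac{q_{\max} K_L C_e}{1 + K_L C_e}
\qquad\Longleftrightarrow\qquad
\frac{C_e}{q_e} = \frac{1}{q_{\max} K_L} + \frac{C_e}{q_{\max}}
```

    In the linearized form on the right, a plot of C_e/q_e against C_e gives a straight line whose slope is 1/q_max, which is one common way to estimate the maximum adsorption capacity mentioned in the abstract.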

  15. EKPD: a hierarchical database of eukaryotic protein kinases and protein phosphatases.

    Science.gov (United States)

    Wang, Yongbo; Liu, Zexian; Cheng, Han; Gao, Tianshun; Pan, Zhicheng; Yang, Qing; Guo, Anyuan; Xue, Yu

    2014-01-01

    We present here EKPD (http://ekpd.biocuckoo.org), a hierarchical database of eukaryotic protein kinases (PKs) and protein phosphatases (PPs), the key molecules responsible for the reversible phosphorylation of proteins that are involved in almost all aspects of biological processes. As extensive experimental and computational efforts have been carried out to identify PKs and PPs, an integrative resource with detailed classification and annotation information would be of great value for both experimentalists and computational biologists. In this work, we first collected 1855 PKs and 347 PPs from the scientific literature and various public databases. Based on previously established rationales, we classified all of the known PKs and PPs into a hierarchical structure with three levels, i.e. group, family and individual PK/PP. There are 10 groups with 149 families for the PKs and 10 groups with 33 families for the PPs. We constructed 139 and 27 Hidden Markov Model profiles for PK and PP families, respectively. Then we systematically characterized ∼50,000 PKs and >10,000 PPs in eukaryotes. In addition, >500 PKs and >400 PPs were computationally identified by ortholog search. Finally, the online service of the EKPD database was implemented in PHP + MySQL + JavaScript.

  16. Dr. PIAS: an integrative system for assessing the druggability of protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Furuya Toshio

    2011-02-01

    Background: The amount of data on protein-protein interactions (PPIs) available in public databases and in the literature has rapidly expanded in recent years. PPI data can provide useful information for researchers in pharmacology and medicine as well as those in interactome studies. There is an urgent need for a novel methodology or software allowing the efficient utilization of PPI data in pharmacology and medicine. Results: To address this need, we have developed the 'Druggable Protein-protein Interaction Assessment System' (Dr. PIAS). Dr. PIAS has a meta-database that stores various types of information (tertiary structures, drugs/chemicals, and biological functions) associated with PPIs retrieved from public sources. By integrating this information, Dr. PIAS assesses whether a PPI is druggable as a target for small chemical ligands by using a supervised machine-learning method, support vector machine (SVM). Dr. PIAS holds not only known druggable PPIs but also all PPIs of human, mouse, rat, and human immunodeficiency virus (HIV) proteins identified to date. Conclusions: The design concept of Dr. PIAS is distinct from other published PPI databases in that it focuses on selecting the PPIs most likely to make good drug targets, rather than merely collecting PPI data.

  17. Inferring domain-domain interactions from protein-protein interactions with formal concept analysis.

    Directory of Open Access Journals (Sweden)

    Susan Khor

    Identifying reliable domain-domain interactions will increase our ability to predict novel protein-protein interactions, to unravel interactions in protein complexes, and thus gain more information about the function and behavior of genes. One of the challenges of identifying reliable domain-domain interactions is domain promiscuity. Promiscuous domains are domains that can occur in many domain architectures and are therefore found in many proteins. This becomes a problem for a method where the score of a domain-pair is the ratio between observed and expected frequencies because the protein-protein interaction network is sparse. As such, many protein-pairs will be non-interacting and domain-pairs with promiscuous domains will be penalized. This domain promiscuity challenge to the problem of inferring reliable domain-domain interactions from protein-protein interactions has been recognized, and a number of work-arounds have been proposed. This paper reports on an application of Formal Concept Analysis to this problem. It is found that the relationship between formal concepts provides a natural way for rare domains to elevate the rank of promiscuous domain-pairs and enrich highly ranked domain-pairs with reliable domain-domain interactions. This piggybacking of promiscuous domain-pairs onto less promiscuous domain-pairs is possible only with concept lattices whose attribute-labels are not reduced and is enhanced by the presence of proteins that comprise both promiscuous and rare domains.

  18. Inferring Domain-Domain Interactions from Protein-Protein Interactions with Formal Concept Analysis

    Science.gov (United States)

    Khor, Susan

    2014-01-01

    Identifying reliable domain-domain interactions will increase our ability to predict novel protein-protein interactions, to unravel interactions in protein complexes, and thus gain more information about the function and behavior of genes. One of the challenges of identifying reliable domain-domain interactions is domain promiscuity. Promiscuous domains are domains that can occur in many domain architectures and are therefore found in many proteins. This becomes a problem for a method where the score of a domain-pair is the ratio between observed and expected frequencies because the protein-protein interaction network is sparse. As such, many protein-pairs will be non-interacting and domain-pairs with promiscuous domains will be penalized. This domain promiscuity challenge to the problem of inferring reliable domain-domain interactions from protein-protein interactions has been recognized, and a number of work-arounds have been proposed. This paper reports on an application of Formal Concept Analysis to this problem. It is found that the relationship between formal concepts provides a natural way for rare domains to elevate the rank of promiscuous domain-pairs and enrich highly ranked domain-pairs with reliable domain-domain interactions. This piggybacking of promiscuous domain-pairs onto less promiscuous domain-pairs is possible only with concept lattices whose attribute-labels are not reduced and is enhanced by the presence of proteins that comprise both promiscuous and rare domains. PMID:24586450
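
    The penalty on promiscuous domains described above can be illustrated with a small sketch. It is not the Formal Concept Analysis method of the paper; it only shows one plausible observed/expected ratio score, assuming that the expected count of a domain pair is the number of protein pairs that could contain it multiplied by the overall interaction density. The function name and the toy proteins and domains are hypothetical.

```python
from collections import Counter
from itertools import product


def domain_pair_scores(protein_domains, interactions):
    """Score each domain pair as observed / expected frequency.

    observed: number of interacting protein pairs containing the domain pair
    expected: number of all protein pairs containing the domain pair,
              scaled by the density of the interaction network (assumption)
    """
    proteins = sorted(protein_domains)
    interacting = {frozenset(pair) for pair in interactions}
    observed = Counter()
    possible = Counter()
    n_pairs = 0
    for i, a in enumerate(proteins):
        for b in proteins[i + 1:]:
            n_pairs += 1
            # Unordered domain pairs spanned by this protein pair.
            domain_pairs = {
                frozenset((da, db))
                for da, db in product(protein_domains[a], protein_domains[b])
            }
            for dp in domain_pairs:
                possible[dp] += 1
                if frozenset((a, b)) in interacting:
                    observed[dp] += 1
    # observed / (possible * density), with density = interactions / n_pairs
    return {
        dp: observed[dp] * n_pairs / (possible[dp] * len(interacting))
        for dp in possible
    }


# Toy data: X is a promiscuous domain, A and B are rare.
toy_domains = {"P1": {"A", "X"}, "P2": {"B", "X"}, "P3": {"X"}, "P4": {"X"}}
toy_scores = domain_pair_scores(toy_domains, [("P1", "P2")])
for pair, score in sorted(toy_scores.items(), key=lambda kv: -kv[1]):
    print(sorted(pair), score)
```

    In this toy network the rare pair A-B receives the highest score, while the X-X pair, which occurs in every protein pair but is observed in only one interaction, is ranked last. This is the sparse-network penalty the abstract refers to.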

  19. Coevolution study of mitochondria respiratory chain proteins: toward the understanding of protein-protein interaction.

    Science.gov (United States)

    Yang, Ming; Ge, Yan; Wu, Jiayan; Xiao, Jingfa; Yu, Jun

    2011-05-20

    Coevolution can be seen as the interdependency between evolutionary histories. In the context of protein evolution, functionally correlated proteins consistently show coordinated evolutionary characters without disruption of organismal integrity. In complex systems, there are two forms of protein-protein interaction in vivo: inter-complex interaction and intra-complex interaction. In this paper, we studied the difference in coevolution characters between inter-complex and intra-complex interactions using the "Mirror tree" method on the respiratory chain (RC) proteins. We divided the correlation coefficients of all pairs of RC proteins into two groups, corresponding to binary protein-protein interactions within a complex and binary protein-protein interactions between complexes, respectively. A dramatic discrepancy is detected between the coevolution characters of the two sets of protein interactions (Wilcoxon test, p-value = 4.4 × 10^-6). Our finding reveals some critical information for coevolutionary studies and assists the mechanistic investigation of protein-protein interaction. Furthermore, the results also provide some unique clues to the supramolecular organization of protein complexes in the mitochondrial inner membrane. A more detailed map of binding sites and genome information for nuclear-encoded RC proteins will be extraordinarily valuable for further study of mitochondrial dynamics. Copyright © 2011. Published by Elsevier Ltd.
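
    The abstract gives no implementation details for the "Mirror tree" method. As it is commonly described, the method scores a pair of protein families by the linear correlation between their inter-species distance matrices, and the sketch below follows that description. The function names and the toy matrices are hypothetical, and both matrices are assumed to list the same species in the same order.

```python
from math import sqrt


def upper_triangle(matrix):
    """Flatten the strict upper triangle of a square distance matrix."""
    n = len(matrix)
    return [matrix[i][j] for i in range(n) for j in range(i + 1, n)]


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)


def mirror_tree_score(dist_a, dist_b):
    """Correlate the pairwise species distances of two protein families."""
    return pearson(upper_triangle(dist_a), upper_triangle(dist_b))


# Toy distance matrices for three species (illustrative values only).
family_a = [[0, 1, 2], [1, 0, 3], [2, 3, 0]]
family_b = [[0, 2, 4], [2, 0, 6], [4, 6, 0]]
print(mirror_tree_score(family_a, family_b))
```

    Scores computed this way for every pair of proteins could then be split into intra-complex and inter-complex groups and compared with a rank-based test, as the abstract describes.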

  20. Detecting protein-protein interactions in living cells

    DEFF Research Database (Denmark)

    Gottschalk, Marie; Bach, Anders; Hansen, Jakob Lerche

    2009-01-01

    The PDZ domain mediated interaction between the NMDA receptor and its intracellular scaffolding protein, PSD-95, is a potential target for treatment of ischemic brain diseases. We have recently developed a number of peptide analogues with improved affinity for the PDZ domains of PSD-95 compared to the endogenous C-terminal peptide of the NMDA receptor, as evaluated by a cell-free protein-protein interaction assay. However, it is important to address both membrane permeability and effect in living cells. Therefore a bioluminescence resonance energy transfer (BRET) assay was established, where the C-terminal of the NMDA receptor and PDZ2 of PSD-95 were fused to green fluorescent protein (GFP) and Renilla luciferase (Rluc) and expressed in COS7 cells. A robust and specific BRET signal was obtained by expression of the appropriate partner proteins and subsequently, the assay was used to evaluate a Tat...

  1. HAEdb: a novel interactive, locus-specific mutation database for the C1 inhibitor gene.

    Science.gov (United States)

    Kalmár, Lajos; Hegedüs, Tamás; Farkas, Henriette; Nagy, Melinda; Tordai, Attila

    2005-01-01

    Hereditary angioneurotic edema (HAE) is an autosomal dominant disorder characterized by episodic local subcutaneous and submucosal edema and is caused by the deficiency of the activated C1 esterase inhibitor protein (C1-INH or C1INH; approved gene symbol SERPING1). Published C1-INH mutations are represented in large universal databases (e.g., OMIM, HGMD), but these databases update their data rather infrequently, they are not interactive, and they do not allow searches according to different criteria. The HAEdb, a C1-INH gene mutation database (http://hae.biomembrane.hu) was created to contribute to the following expectations: 1) help the comprehensive collection of information on genetic alterations of the C1-INH gene; 2) create a database in which data can be searched and compared according to several flexible criteria; and 3) provide additional help in new mutation identification. The website uses MySQL, an open-source, multithreaded, relational database management system. The user-friendly graphical interface was written in the PHP web programming language. The website consists of two main parts, the freely browsable search function, and the password-protected data deposition function. Mutations of the C1-INH gene are divided in two parts: gross mutations involving DNA fragments >1 kb, and micro mutations encompassing all non-gross mutations. Several attributes (e.g., affected exon, molecular consequence, family history) are collected for each mutation in a standardized form. This database may facilitate future comprehensive analyses of C1-INH mutations and also provide regular help for molecular diagnostic testing of HAE patients in different centers.

  2. Gene composer: database software for protein construct design, codon engineering, and gene synthesis.

    Science.gov (United States)

    Lorimer, Don; Raymond, Amy; Walchli, John; Mixon, Mark; Barrow, Adrienne; Wallace, Ellen; Grice, Rena; Burgin, Alex; Stewart, Lance

    2009-04-21

    To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene assembly procedure with mis-match specific endonuclease
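
    The back-translation step described above can be sketched as follows. This is not Gene Composer's algorithm; it is a minimal illustration that, for each amino acid, picks the most frequent codon whose usage is at or above a minimum threshold. The codon usage fractions, the threshold value, and the function names are hypothetical, and the table covers only a few amino acids.

```python
# Hypothetical codon usage table: fraction of each codon per amino acid.
# The values are illustrative only, not a real expression-system table.
CODON_USAGE = {
    "M": {"ATG": 1.00},
    "K": {"AAA": 0.74, "AAG": 0.26},
    "L": {"CTG": 0.47, "TTA": 0.14, "TTG": 0.13,
          "CTT": 0.12, "CTC": 0.10, "CTA": 0.04},
    "*": {"TAA": 0.61, "TGA": 0.30, "TAG": 0.09},
}


def back_translate(protein, usage=CODON_USAGE, min_fraction=0.10):
    """Back-translate a protein using the most frequent permitted codon."""
    codons = []
    for aa in protein:
        # Discard codons below the minimal codon usage threshold.
        allowed = {c: f for c, f in usage[aa].items() if f >= min_fraction}
        if not allowed:
            raise ValueError(f"no codon for {aa!r} meets the threshold")
        codons.append(max(allowed, key=allowed.get))
    return "".join(codons)


def gc_content(dna):
    """Fraction of G and C bases in a DNA sequence."""
    return sum(base in "GC" for base in dna) / len(dna)


gene = back_translate("MKL*")
print(gene, gc_content(gene))
```

    A fuller design tool would also have to steer the G:C content toward a target and avoid unwanted sequence features by switching among synonymous codons, which this sketch does not attempt.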

  3. Gene Composer: database software for protein construct design, codon engineering, and gene synthesis

    Directory of Open Access Journals (Sweden)

    Mixon Mark

    2009-04-01

    Background: To improve efficiency in high throughput protein structure determination, we have developed a database software package, Gene Composer, which facilitates the information-rich design of protein constructs and their codon engineered synthetic gene sequences. With its modular workflow design and numerous graphical user interfaces, Gene Composer enables researchers to perform all common bio-informatics steps used in modern structure guided protein engineering and synthetic gene engineering. Results: An interactive Alignment Viewer allows the researcher to simultaneously visualize sequence conservation in the context of known protein secondary structure, ligand contacts, water contacts, crystal contacts, B-factors, solvent accessible area, residue property type and several other useful property views. The Construct Design Module enables the facile design of novel protein constructs with altered N- and C-termini, internal insertions or deletions, point mutations, and desired affinity tags. The modifications can be combined and permuted into multiple protein constructs, and then virtually cloned in silico into defined expression vectors. The Gene Design Module uses a protein-to-gene algorithm that automates the back-translation of a protein amino acid sequence into a codon engineered nucleic acid gene sequence according to a selected codon usage table with minimal codon usage threshold, defined G:C% content, and desired sequence features achieved through synonymous codon selection that is optimized for the intended expression system. The gene-to-oligo algorithm of the Gene Design Module plans out all of the required overlapping oligonucleotides and mutagenic primers needed to synthesize the desired gene constructs by PCR, and for physically cloning them into selected vectors by the most popular subcloning strategies. Conclusion: We present a complete description of Gene Composer functionality, and an efficient PCR-based synthetic gene

  4. PPI finder: a mining tool for human protein-protein interactions.

    Directory of Open Access Journals (Sweden)

    Min He

    BACKGROUND: The exponential increase of published biomedical literature prompts the use of text mining tools to manage the information overload automatically. One of the most common applications is to mine protein-protein interactions (PPIs) from PubMed abstracts. Currently, most tools for mining PPIs from literature use co-occurrence-based approaches or rule-based approaches. Hybrid methods (frame-based approaches) that combine these two methods may have better performance in predicting PPIs. However, the PPIs predicted by these methods are rarely evaluated against known PPI databases and co-occurring terms in the Gene Ontology (GO) database. METHODOLOGY/PRINCIPAL FINDINGS: We here developed a web-based tool, PPI Finder, to mine human PPIs from PubMed abstracts based on their co-occurrences and interaction words, followed by evidence in human PPI databases and shared terms in the GO database. Only 28% of the co-occurring pairs in PubMed abstracts appeared in any of the commonly used human PPI databases (HPRD, BioGRID and BIND). On the other hand, of the known PPIs in HPRD, 69% showed co-occurrences in the literature, and 65% shared GO terms. CONCLUSIONS: PPI Finder provides a useful tool for biologists to uncover potential novel PPIs. It is freely accessible at http://liweilab.genetics.ac.cn/tm/.
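
    A co-occurrence filter of the kind described above might look like the following sketch. It is not PPI Finder's implementation: the interaction word list, the sentence splitting rule, the exact-match name lookup, and the example sentences are all simplifying assumptions made for illustration.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical list of interaction words; a real tool would use a larger lexicon.
INTERACTION_WORDS = {"binds", "interacts", "phosphorylates", "activates"}


def cooccurring_pairs(abstracts, protein_names):
    """Count protein pairs that share a sentence containing an interaction word."""
    counts = Counter()
    for text in abstracts:
        # Naive sentence split on whitespace following ., ! or ?
        for sentence in re.split(r"(?<=[.!?])\s+", text):
            tokens = set(re.findall(r"[A-Za-z0-9\-]+", sentence))
            if not ({t.lower() for t in tokens} & INTERACTION_WORDS):
                continue
            found = sorted(name for name in protein_names if name in tokens)
            for a, b in combinations(found, 2):
                counts[(a, b)] += 1
    return counts


example_abstracts = [
    "MDM2 binds TP53 in vitro. BRCA1 and TP53 were both expressed.",
    "TP53 interacts with MDM2.",
]
pairs = cooccurring_pairs(example_abstracts, {"TP53", "MDM2", "BRCA1"})
print(pairs.most_common())
```

    Candidate pairs found this way would still need to be checked against curated PPI databases and shared GO terms, which is the validation step the abstract emphasizes.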

  5. The BioFragment Database (BFDb): An open-data platform for computational chemistry analysis of noncovalent interactions

    Science.gov (United States)

    Burns, Lori A.; Faver, John C.; Zheng, Zheng; Marshall, Michael S.; Smith, Daniel G. A.; Vanommeslaeghe, Kenno; MacKerell, Alexander D.; Merz, Kenneth M.; Sherrill, C. David

    2017-10-01

    Accurate potential energy models are necessary for reliable atomistic simulations of chemical phenomena. In the realm of biomolecular modeling, large systems like proteins comprise very many noncovalent interactions (NCIs) that can contribute to the protein's stability and structure. This work presents two high-quality chemical databases of common fragment interactions in biomolecular systems as extracted from high-resolution Protein DataBank crystal structures: 3380 sidechain-sidechain interactions and 100 backbone-backbone interactions that inaugurate the BioFragment Database (BFDb). Absolute interaction energies are generated with a computationally tractable explicitly correlated coupled cluster with perturbative triples [CCSD(T)-F12] "silver standard" (0.05 kcal/mol average error) for NCI that demands only a fraction of the cost of the conventional "gold standard," CCSD(T) at the complete basis set limit. By sampling extensively from biological environments, BFDb spans the natural diversity of protein NCI motifs and orientations. In addition to supplying a thorough assessment for lower scaling force-field (2), semi-empirical (3), density functional (244), and wavefunction (45) methods (comprising >1M interaction energies), BFDb provides interactive tools for running and manipulating the resulting large datasets and offers a valuable resource for potential energy model development and validation.

  6. The Mitochondrial Protein Atlas: A Database of Experimentally Verified Information on the Human Mitochondrial Proteome.

    Science.gov (United States)

    Godin, Noa; Eichler, Jerry

    2017-09-01

    Given its central role in various biological systems, as well as its involvement in numerous pathologies, the mitochondrion is one of the best-studied organelles. However, although the mitochondrial genome has been extensively investigated, protein-level information remains partial, and in many cases, hypothetical. The Mitochondrial Protein Atlas (MPA; URL: lifeserv.bgu.ac.il/wb/jeichler/MPA ) is a database that provides a complete, manually curated inventory of only experimentally validated human mitochondrial proteins. The MPA presently contains 911 unique protein entries, each of which is associated with at least one experimentally validated and referenced mitochondrial localization. The MPA also contains experimentally validated and referenced information defining function, structure, involvement in pathologies, interactions with other MPA proteins, as well as the method(s) of analysis used in each instance. Connections to relevant external data sources are offered for each entry, including links to NCBI Gene, PubMed, and Protein Data Bank. The MPA offers a prototype for other information sources that allow for a distinction between what has been confirmed and what remains to be verified experimentally.

  7. Biospecific protein immobilization for rapid analysis of weak protein interactions using self-interaction nanoparticle spectroscopy.

    Science.gov (United States)

    Bengali, Aditya N; Tessier, Peter M

    2009-10-01

    "Reversible" protein interactions govern diverse biological behavior ranging from intracellular transport and toxic protein aggregation to protein crystallization and inactivation of protein therapeutics. Much less is known about weak protein interactions than their stronger counterparts since they are difficult to characterize, especially in a parallel format (in contrast to a sequential format) necessary for high-throughput screening. We have recently introduced a highly efficient approach of characterizing protein self-association, namely self-interaction nanoparticle spectroscopy (SINS; Tessier et al., 2008; J Am Chem Soc 130:3106-3112). This approach exploits the separation-dependent optical properties of gold nanoparticles to detect weak self-interactions between proteins immobilized on nanoparticles. A limitation of our previous work is that differences in the sequence and structure of proteins can lead to significant differences in their affinity to adsorb to nanoparticle surfaces, which complicates analysis of the corresponding protein self-association behavior. In this work we demonstrate a highly specific approach for coating nanoparticles with proteins using biotin-avidin interactions to generate protein-nanoparticle conjugates that report protein self-interactions through changes in their optical properties. Using lysozyme as a model protein that is refractory to characterization by conventional SINS, we demonstrate that surface Plasmon wavelengths for gold-avidin-lysozyme conjugates over a range of solution conditions (i.e., pH and ionic strength) are well correlated with lysozyme osmotic second virial coefficient measurements. Since SINS requires orders of magnitude less protein and time than conventional methods (e.g., static light scattering), we envision this approach will find application in large screens of protein self-association aimed at either preventing (e.g., protein aggregation) or promoting (e.g., protein crystallization) these

  8. Analysis of Protein-Membrane Interactions

    DEFF Research Database (Denmark)

    Kemmer, Gerdi Christine

    Cellular membranes are complex structures, consisting of hundreds of different lipids and proteins. These membranes act as barriers between distinct environments, constituting hot spots for many essential functions of the cell, including signaling, energy conversion, and transport. These functions are implemented by soluble proteins reversibly binding to, as well as by integral membrane proteins embedded in, cellular membranes. The activity and interaction of these proteins is furthermore modulated by the lipids of the membrane. Here, liposomes were used as model membrane systems to investigate... Discovered interactions were then probed on the level of the membrane using liposome-based assays. In the second part, a transmembrane protein was investigated. Assays to probe activity of the plasma membrane ATPase (Arabidopsis thaliana H+-ATPase isoform 2 (AHA2)) in single liposomes using both giant...

  9. NMR Studies of Protein Hydration and Protein-Ligand Interactions

    Science.gov (United States)

    Chong, Yuan

    Water on the surface of a protein is called hydration water. Hydration water is known to play a crucial role in a variety of biological processes including protein folding, enzymatic activation, and drug binding. Although the significance of hydration water has been recognized, the underlying mechanism remains far from being understood. This dissertation employs a unique in-situ nuclear magnetic resonance (NMR) technique to study the mechanism of protein hydration and the role of hydration in alcohol-protein interactions. Water isotherms in proteins are measured at different temperatures via the in-situ NMR technique. Water is found to interact differently with hydrophilic and hydrophobic groups on the protein. Water adsorption on hydrophilic groups is hardly affected by the temperature, while water adsorption on hydrophobic groups strongly depends on the temperature around 10 °C, below which the adsorption is substantially reduced. This effect is induced by the dramatic decrease in the protein flexibility below 10 °C. Furthermore, nanosecond to microsecond protein dynamics and the free energy, enthalpy, and entropy of protein hydration are studied as a function of hydration level and temperature. A crossover at 10 °C in protein dynamics and thermodynamics is revealed. The effect of water at hydrophilic groups on protein dynamics and thermodynamics shows little temperature dependence, whereas water at hydrophobic groups has a stronger effect above 10 °C. In addition, I investigate the role of water in alcohol binding to the protein using the in-situ NMR detection. The isotherms of alcohols are first measured on dry proteins, then on proteins with a series of controlled hydration levels. The free energy, enthalpy, and entropy of alcohol binding are also determined. Two distinct types of alcohol binding are identified. On the one hand, alcohols can directly bind to a few specific sites on the protein. This type of binding is independent of temperature and can be

  10. CLIPZ: a database and analysis environment for experimentally determined binding sites of RNA-binding proteins.

    Science.gov (United States)

    Khorshid, Mohsen; Rodak, Christoph; Zavolan, Mihaela

    2011-01-01

    The stability, localization and translation rate of mRNAs are regulated by a multitude of RNA-binding proteins (RBPs) that find their targets directly or with the help of guide RNAs. Among the experimental methods for mapping RBP binding sites, cross-linking and immunoprecipitation (CLIP) coupled with deep sequencing provides transcriptome-wide coverage as well as high resolution. However, partly due to their vast volume, the data that were so far generated in CLIP experiments have not been put in a form that enables fast and interactive exploration of binding sites. To address this need, we have developed the CLIPZ database and analysis environment. Binding site data for RBPs such as Argonaute 1-4, Insulin-like growth factor II mRNA-binding protein 1-3, TNRC6 proteins A-C, Pumilio 2, Quaking and Polypyrimidine tract binding protein can be visualized at the level of the genome and of individual transcripts. Individual users can upload their own sequence data sets while being able to limit the access to these data to specific users, and analyses of the public and private data sets can be performed interactively. CLIPZ, available at http://www.clipz.unibas.ch, aims to provide an open access repository of information for post-transcriptional regulatory elements.

  11. Efficient extraction of protein-protein interactions from full-text articles.

    Science.gov (United States)

    Hakenberg, Jörg; Leaman, Robert; Vo, Nguyen Ha; Jonnalagadda, Siddhartha; Sullivan, Ryan; Miller, Christopher; Tari, Luis; Baral, Chitta; Gonzalez, Graciela

    2010-01-01

    Proteins and their interactions govern virtually all cellular processes, such as regulation, signaling, metabolism, and structure. Most experimental findings pertaining to such interactions are discussed in research papers, which, in turn, get curated by protein interaction databases. Authors, editors, and publishers benefit from efforts to alleviate the tasks of searching for relevant papers, evidence for physical interactions, and proper identifiers for each protein involved. The BioCreative II.5 community challenge addressed these tasks in a competition-style assessment to evaluate and compare different methodologies, to make aware of the increasing accuracy of automated methods, and to guide future implementations. In this paper, we present our approaches for protein-named entity recognition, including normalization, and for extraction of protein-protein interactions from full text. Our overall goal is to identify efficient individual components, and we compare various compositions to handle a single full-text article in between 10 seconds and 2 minutes. We propose strategies to transfer document-level annotations to the sentence-level, which allows for the creation of a more fine-grained training corpus; we use this corpus to automatically derive around 5,000 patterns. We rank sentences by relevance to the task of finding novel interactions with physical evidence, using a sentence classifier built from this training corpus. Heuristics for paraphrasing sentences help to further remove unnecessary information that might interfere with patterns, such as additional adjectives, clauses, or bracketed expressions. In BioCreative II.5, we achieved an f-score of 22 percent for finding protein interactions, and 43 percent for mapping proteins to UniProt IDs; disregarding species, f-scores are 30 percent and 55 percent, respectively. On average, our best-performing setup required around 2 minutes per full text. All data and pattern sets as well as Java classes that
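
    The f-scores quoted above combine precision and recall into one number. The abstract reports only the resulting scores, so the counts in the sketch below are hypothetical and were chosen merely to reproduce a value of 22 percent; the function name and the beta parameter are likewise illustrative.

```python
def f_score(true_pos, false_pos, false_neg, beta=1.0):
    """F-measure from raw counts; beta=1 gives the usual F1 score."""
    predicted = true_pos + false_pos
    actual = true_pos + false_neg
    precision = true_pos / predicted if predicted else 0.0
    recall = true_pos / actual if actual else 0.0
    if precision + recall == 0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)


# Hypothetical counts: 11 correct interactions, 39 spurious, 39 missed.
print(round(f_score(11, 39, 39), 2))
```

    Because F1 is the harmonic mean of precision and recall, a system cannot reach a high score by maximizing only one of the two.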

  12. Databases

    Directory of Open Access Journals (Sweden)

    Nick Ryan

    2004-01-01

    Databases are deeply embedded in archaeology, underpinning and supporting many aspects of the subject. However, as well as providing a means for storing, retrieving and modifying data, databases themselves must be a result of a detailed analysis and design process. This article looks at this process, and shows how the characteristics of data models affect the process of database design and implementation. The impact of the Internet on the development of databases is examined, and the article concludes with a discussion of a range of issues associated with the recording and management of archaeological data.

  13. Protein-protein interactions: an application of Tus-Ter mediated protein microarray system.

    Science.gov (United States)

    Sitaraman, Kalavathy; Chatterjee, Deb K

    2011-01-01

    In this chapter, we present a novel, cost-effective microarray strategy that utilizes expression-ready plasmid DNAs to generate protein arrays on-demand and its use to validate protein-protein interactions. These expression plasmids were constructed in such a way so as to serve a dual purpose of synthesizing the protein of interest as well as capturing the synthesized protein. The microarray system is based on the high affinity binding of Escherichia coli "Tus" protein to "Ter," a 20 bp DNA sequence involved in the regulation of DNA replication. The protein expression is carried out in a cell-free protein synthesis system, with rabbit reticulocyte lysates, and the target proteins are detected either by labeled incorporated tag specific or by gene-specific antibodies. This microarray system has been successfully used for the detection of protein-protein interaction because both the target protein and the query protein can be transcribed and translated simultaneously in the microarray slides. The utility of this system for detecting protein-protein interaction is demonstrated by a few well-known examples: Jun/Fos, FRB/FKBP12, p53/MDM2, and CDK4/p16. In all these cases, the presence of protein complexes resulted in the localization of fluorophores at the specific sites of the immobilized target plasmids. Interestingly, during our interactions studies we also detected a previously unknown interaction between CDK2 and p16. Thus, this Tus-Ter based system of protein microarray can be used for the validation of known protein interactions as well as for identifying new protein-protein interactions. In addition, it can be used to examine and identify targets of nucleic acid-protein, ligand-receptor, enzyme-substrate, and drug-protein interactions.

  14. STRING 8--a global view on proteins and their functional interactions in 630 organisms

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Kuhn, Michael; Stark, Manuel

    2008-01-01

    Functional partnerships between proteins are at the core of complex cellular phenotypes, and the networks formed by interacting proteins provide researchers with crucial scaffolds for modeling, data reduction and annotation. STRING is a database and web resource dedicated to protein-protein interactions, including both physical and functional interactions. It weights and integrates information from numerous sources, including experimental repositories, computational prediction methods and public text collections, thus acting as a meta-database that maps all interaction evidence onto a common set of genomes and proteins. The most important new developments in STRING 8 over previous releases include a URL-based programming interface, which can be used to query STRING from other resources, improved interaction prediction via genomic neighborhood in prokaryotes, and the inclusion of protein structures...

  15. Protein-protein interactions within late pre-40S ribosomes.

    Directory of Open Access Journals (Sweden)

    Melody G Campbell

    2011-01-01

    Ribosome assembly in eukaryotic organisms requires more than 200 assembly factors to facilitate and coordinate rRNA transcription, processing, and folding with the binding of the ribosomal proteins. Many of these assembly factors bind and dissociate at defined times giving rise to discrete assembly intermediates, some of which have been partially characterized with regards to their protein and RNA composition. Here, we have analyzed the protein-protein interactions between the seven assembly factors bound to late cytoplasmic pre-40S ribosomes using recombinant proteins in binding assays. Our data show that these factors form two modules: one comprising Enp1 and the export adaptor Ltv1 near the beak structure, and the second comprising the kinase Rio2, the nuclease Nob1, and a regulatory RNA binding protein Dim2/Pno1 on the front of the head. The GTPase-like Tsr1 and the universally conserved methylase Dim1 are also peripherally connected to this second module. Additionally, in an effort to further define the locations for these essential proteins, we have analyzed the interactions between these assembly factors and six ribosomal proteins: Rps0, Rps3, Rps5, Rps14, Rps15 and Rps29. Together, these results and previous RNA-protein crosslinking data allow us to propose a model for the binding sites of these seven assembly factors. Furthermore, our data show that the essential kinase Rio2 is located at the center of the pre-ribosomal particle and interacts, directly or indirectly, with every other assembly factor, as well as three ribosomal proteins required for cytoplasmic 40S maturation. These data suggest that Rio2 could play a central role in regulating cytoplasmic maturation steps.

  16. A rice kinase-protein interaction map.

    Science.gov (United States)

    Ding, Xiaodong; Richter, Todd; Chen, Mei; Fujii, Hiroaki; Seo, Young Su; Xie, Mingtang; Zheng, Xianwu; Kanrar, Siddhartha; Stevenson, Rebecca A; Dardick, Christopher; Li, Ying; Jiang, Hao; Zhang, Yan; Yu, Fahong; Bartley, Laura E; Chern, Mawsheng; Bart, Rebecca; Chen, Xiuhua; Zhu, Lihuang; Farmerie, William G; Gribskov, Michael; Zhu, Jian-Kang; Fromm, Michael E; Ronald, Pamela C; Song, Wen-Yuan

    2009-03-01

    Plants uniquely contain large numbers of protein kinases, and for the vast majority of the 1,429 kinases predicted in the rice (Oryza sativa) genome, little is known of their functions. Genetic approaches often fail to produce observable phenotypes; thus, new strategies are needed to delineate kinase function. We previously developed a cost-effective high-throughput yeast two-hybrid system. Using this system, we have generated a protein interaction map of 116 representative rice kinases and 254 of their interacting proteins. Overall, the resulting interaction map supports a large number of known or predicted kinase-protein interactions from both plants and animals and reveals many new functional insights. Notably, we found a potential widespread role for E3 ubiquitin ligases in pathogen defense signaling mediated by receptor-like kinases, particularly by the kinases that may have evolved from recently expanded kinase subfamilies in rice. We anticipate that the data provided here will serve as a foundation for targeted functional studies in rice and other plants. The application of yeast two-hybrid and TAPtag analyses for large-scale plant protein interaction studies is also discussed.

  17. The effect of protein-protein and protein-membrane interactions on membrane fouling in ultrafiltration

    NARCIS (Netherlands)

    Huisman, I.H.; Prádanos, P.; Hernández, A.

    2000-01-01

    It was studied how protein-protein and protein-membrane interactions influence the filtration performance during the ultrafiltration of protein solutions over polymeric membranes. This was done by measuring flux, streaming potential, and protein transmission during filtration of bovine serum albumin

  18. Interaction between policy measures. Analysis tool in the MURE database

    Energy Technology Data Exchange (ETDEWEB)

    Boonekamp, P.G.M. [ECN Policy Studies, Petten (Netherlands); Faberi, S. [Institute of Studies for the Integration of Systems ISIS, Rome (Italy)

    2013-12-15

    The ODYSSEE database on energy efficiency indicators (www.odyssee-indicators.org) has been set up to enable the monitoring and evaluation of realised energy efficiency improvements and related energy savings. The database covers the 27 EU countries as well as Norway and Croatia and data are available from 1990 on. This report describes how sets of mutually consistent impacts for packages as well as individual policy measures can be determined in the MURE database (MURE is the French abbreviation for Mesures d'Utilisation Rationnelle de l'Energie)

  19. Potential disruption of protein-protein interactions by graphene oxide

    Energy Technology Data Exchange (ETDEWEB)

    Feng, Mei [Department of Physics, Institute of Quantitative Biology, Zhejiang University, Hangzhou 310027 (China); Kang, Hongsuk; Luan, Binquan [Computational Biological Center, IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598 (United States); Yang, Zaixing [Institute of Quantitative Biology and Medicine, SRMP and RAD-X, and Collaborative Innovation Center of Radiation Medicine of Jiangsu Higher Education Institutions, Soochow University, Suzhou 215123 (China); Zhou, Ruhong, E-mail: ruhong@us.ibm.com [Department of Physics, Institute of Quantitative Biology, Zhejiang University, Hangzhou 310027 (China); Computational Biological Center, IBM Thomas J. Watson Research Center, Yorktown Heights, New York 10598 (United States); Department of Chemistry, Columbia University, New York, New York 10027 (United States)

    2016-06-14

    Graphene oxide (GO) is a promising novel nanomaterial with a wide range of potential biomedical applications due to its many intriguing properties. However, very little research has been conducted to study its possible adverse effects on protein-protein interactions (and thus subsequent toxicity to human). Here, the potential cytotoxicity of GO is investigated at molecular level using large-scale, all-atom molecular dynamics simulations to explore the interaction mechanism between a protein dimer and a GO nanosheet oxidized at different levels. Our theoretical results reveal that GO nanosheet could intercalate between the two monomers of HIV-1 integrase dimer, disrupting the protein-protein interactions and eventually lead to dimer disassociation as graphene does [B. Luan et al., ACS Nano 9(1), 663 (2015)], albeit its insertion process is slower when compared with graphene due to the additional steric and attractive interactions. This study helps to better understand the toxicity of GO to cell functions which could shed light on how to improve its biocompatibility and biosafety for its wide potential biomedical applications.

  20. Potential disruption of protein-protein interactions by graphene oxide

    International Nuclear Information System (INIS)

    Feng, Mei; Kang, Hongsuk; Luan, Binquan; Yang, Zaixing; Zhou, Ruhong

    2016-01-01

    Graphene oxide (GO) is a promising novel nanomaterial with a wide range of potential biomedical applications due to its many intriguing properties. However, very little research has been conducted to study its possible adverse effects on protein-protein interactions (and thus subsequent toxicity to human). Here, the potential cytotoxicity of GO is investigated at molecular level using large-scale, all-atom molecular dynamics simulations to explore the interaction mechanism between a protein dimer and a GO nanosheet oxidized at different levels. Our theoretical results reveal that GO nanosheet could intercalate between the two monomers of HIV-1 integrase dimer, disrupting the protein-protein interactions and eventually lead to dimer disassociation as graphene does [B. Luan et al., ACS Nano 9(1), 663 (2015)], albeit its insertion process is slower when compared with graphene due to the additional steric and attractive interactions. This study helps to better understand the toxicity of GO to cell functions which could shed light on how to improve its biocompatibility and biosafety for its wide potential biomedical applications.

  1. Specificity of molecular interactions in transient protein-protein interaction interfaces.

    Science.gov (United States)

    Cho, Kyu-il; Lee, KiYoung; Lee, Kwang H; Kim, Dongsup; Lee, Doheon

    2006-11-15

    In this study, we investigate what types of interactions are specific to their biological function, and what types of interactions are persistent regardless of their functional category in transient protein-protein heterocomplexes. This is the first approach to analyze protein-protein interfaces systematically at the molecular interaction level in the context of protein functions. We perform systematic analysis at the molecular interaction level using classification and feature subset selection techniques prevalent in the field of pattern recognition. To represent the physicochemical properties of protein-protein interfaces, we design 18 molecular interaction types using canonical and noncanonical interactions. Then, we construct an input vector using the frequency of each interaction type in a protein-protein interface. We analyze the 131 interfaces of transient protein-protein heterocomplexes in PDB: 33 protease-inhibitors, 52 antibody-antigens, 46 signaling proteins including 4 cyclin dependent kinase and 26 G-protein. Using kNN classification and feature subset selection techniques, we show that there are specific interaction types based on their functional category, and such interaction types are conserved through the common binding mechanism, rather than through the sequence or structure conservation. The extracted interaction types are C(alpha)--H...O==C interaction, cation...anion interaction, amine...amine interaction, and amine...cation interaction. With these four interaction types, we achieve the classification success rate up to 83.2% with leave-one-out cross-validation at k = 15. Of these four interaction types, C(alpha)--H...O==C shows binding specificity for protease-inhibitor complexes, while cation...anion interaction is predominant in signaling complexes. The amine...amine and amine...cation interactions give a minor contribution to the classification accuracy. When combined with these two interactions, they increase the accuracy by 3.8%.
In the case of

  2. Noise reduction in protein-protein interaction graphs by the implementation of a novel weighting scheme

    Directory of Open Access Journals (Sweden)

    Moschopoulos Charalampos

    2011-06-01

    Background: Recent technological advances applied to biology such as yeast-two-hybrid, phage display and mass spectrometry have enabled us to create a detailed map of protein interaction networks. These interaction networks represent a rich, yet noisy, source of data that could be used to extract meaningful information, such as protein complexes. Several interaction network weighting schemes have been proposed so far in the literature in order to eliminate the noise inherent in interactome data. In this paper, we propose a novel weighting scheme and apply it to the S. cerevisiae interactome. Complex prediction rates are improved by up to 39%, depending on the clustering algorithm applied. Results: We adopt a two-step procedure. During the first step, by applying both novel and well-established protein-protein interaction (PPI) weighting methods, weights are introduced to the original interactome graph based on the confidence level that a given interaction is a true-positive one. The second step applies clustering using established algorithms in the field of graph theory, as well as two variations of Spectral clustering. The clustered interactome networks are also cross-validated against the confirmed protein complexes present in the MIPS database. Conclusions: The results of our experimental work demonstrate that interactome graph weighting methods clearly improve the clustering results of several clustering algorithms. Moreover, our proposed weighting scheme outperforms other approaches of PPI graph weighting.
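
    The two-step idea in the abstract above (weight each edge by a confidence estimate, then cluster the weighted graph) can be illustrated with a minimal sketch. The abstract does not describe the authors' novel scheme, so the sketch substitutes a generic topological weight, the Jaccard similarity of the two proteins' closed neighbourhoods, purely as an assumption. The function names, the toy edge list and the 0.6 threshold are all hypothetical.

```python
def build_adjacency(edges):
    """Return a dict mapping each protein to the set of its interaction partners."""
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def jaccard_weights(edges):
    """Weight each edge by the Jaccard similarity of the closed neighbourhoods
    of its endpoints (a stand-in confidence score, not the paper's scheme)."""
    adjacency = build_adjacency(edges)
    weights = {}
    for a, b in edges:
        na = adjacency[a] | {a}
        nb = adjacency[b] | {b}
        weights[(a, b)] = len(na & nb) / len(na | nb)
    return weights


def filter_edges(weights, threshold):
    """Keep only edges whose weight reaches the threshold (noise reduction)."""
    return [edge for edge, w in weights.items() if w >= threshold]


# Toy interactome: a triangle A-B-C plus a pendant protein D attached to C.
edges = [("A", "B"), ("A", "C"), ("B", "C"), ("C", "D")]
weights = jaccard_weights(edges)
kept = filter_edges(weights, 0.6)
```

    On this toy graph the triangle edges score higher than the pendant edge C-D, so thresholding removes the weakly supported interaction before any clustering step is applied.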

  3. The Princeton Protein Orthology Database (P-POD): a comparative genomics analysis tool for biologists.

    OpenAIRE

    Sven Heinicke; Michael S Livstone; Charles Lu; Rose Oughtred; Fan Kang; Samuel V Angiuoli; Owen White; David Botstein; Kara Dolinski

    2007-01-01

    Many biological databases that provide comparative genomics information and tools are now available on the internet. While certainly quite useful, to our knowledge none of the existing databases combine results from multiple comparative genomics methods with manually curated information from the literature. Here we describe the Princeton Protein Orthology Database (P-POD, http://ortholog.princeton.edu), a user-friendly database system that allows users to find and visualize the phylogenetic r...

  4. KFC Server: interactive forecasting of protein interaction hot spots.

    Science.gov (United States)

    Darnell, Steven J; LeGault, Laura; Mitchell, Julie C

    2008-07-01

    The KFC Server is a web-based implementation of the KFC (Knowledge-based FADE and Contacts) model, a machine learning approach for the prediction of binding hot spots, or the subset of residues that account for most of a protein interface's binding free energy. The server facilitates the automated analysis of a user submitted protein-protein or protein-DNA interface and the visualization of its hot spot predictions. For each residue in the interface, the KFC Server characterizes its local structural environment, compares that environment to the environments of experimentally determined hot spots and predicts if the interface residue is a hot spot. After the computational analysis, the user can visualize the results using an interactive job viewer able to quickly highlight predicted hot spots and surrounding structural features within the protein structure. The KFC Server is accessible at http://kfc.mitchell-lab.org.

  5. PIWI Proteins and PIWI-Interacting RNA

    DEFF Research Database (Denmark)

    Han, Yi Neng; Li, Yuan; Xia, Sheng Qiang

    2017-01-01

    P-Element induced wimpy testis (PIWI)-interacting RNAs (piRNAs) are a type of noncoding RNAs (ncRNAs) and interact with PIWI proteins. piRNAs were primarily described in the germline, but emerging evidence revealed that piRNAs are expressed in a tissue-specific manner among multiple human somatic tissue types as well and play important roles in transposon silencing, epigenetic regulation, gene and protein regulation, genome rearrangement, spermatogenesis and germ stem-cell maintenance. PIWI proteins were first discovered in Drosophila and they play roles in spermatogenesis, germline stem-cell maintenance, self-renewal, retrotransposons silencing and the male germline mobility control. A growing number of studies have demonstrated that several piRNA and PIWI proteins are aberrantly expressed in various kinds of cancers and may probably serve as a novel biomarker and therapeutic target for cancer...

  6. Conformational dynamics data bank: a database for conformational dynamics of proteins and supramolecular protein assemblies.

    Science.gov (United States)

    Kim, Do-Nyun; Altschuler, Josiah; Strong, Campbell; McGill, Gaël; Bathe, Mark

    2011-01-01

    The conformational dynamics data bank (CDDB, http://www.cdyn.org) is a database that aims to provide comprehensive results on the conformational dynamics of high molecular weight proteins and protein assemblies. Analysis is performed using a recently introduced coarse-grained computational approach that is applied to the majority of structures present in the electron microscopy data bank (EMDB). Results include equilibrium thermal fluctuations and elastic strain energy distributions that identify rigid versus flexible protein domains generally, as well as those associated with specific functional transitions, and correlations in molecular motions that identify molecular regions that are highly coupled dynamically, with implications for allosteric mechanisms. A practical web-based search interface enables users to easily collect conformational dynamics data in various formats. The data bank is maintained and updated automatically to include conformational dynamics results for new structural entries as they become available in the EMDB. The CDDB complements static structural information to facilitate the investigation and interpretation of the biological function of proteins and protein assemblies essential to cell function.

  7. Protein structure determination by exhaustive search of Protein Data Bank derived databases.

    Science.gov (United States)

    Stokes-Rees, Ian; Sliz, Piotr

    2010-12-14

    Parallel sequence and structure alignment tools have become ubiquitous and invaluable at all levels in the study of biological systems. We demonstrate the application and utility of this same parallel search paradigm to the process of protein structure determination, benefitting from the large and growing corpus of known structures. Such searches were previously computationally intractable. Through the method of Wide Search Molecular Replacement, developed here, they can be completed in a few hours with the aid of national-scale federated cyberinfrastructure. By dramatically expanding the range of models considered for structure determination, we show that small (less than 12% structural coverage) and low sequence identity (less than 20% identity) template structures can be identified through multidimensional template scoring metrics and used for structure determination. Many new macromolecular complexes can benefit significantly from such a technique due to the lack of known homologous protein folds or sequences. We demonstrate the effectiveness of the method by determining the structure of a full-length p97 homologue from Trichoplusia ni. Example cases with the MHC/T-cell receptor complex and the EmoB protein provide systematic estimates of minimum sequence identity, structure coverage, and structural similarity required for this method to succeed. We describe how this structure-search approach and other novel computationally intensive workflows are made tractable through integration with the US national computational cyberinfrastructure, allowing, for example, rapid processing of the entire Structural Classification of Proteins protein fragment database.

  8. Receptor-interacting protein (RIP) kinase family

    OpenAIRE

    Zhang, Duanwu; Lin, Juan; Han, Jiahuai

    2010-01-01

    Receptor-interacting protein (RIP) kinases are a group of threonine/serine protein kinases with a relatively conserved kinase domain but distinct non-kinase regions. A number of different domain structures, such as death and caspase activation and recruitment domain (CARD) domains, were found in different RIP family members, and these domains should be keys in determining the specific function of each RIP kinase. It is known that RIP kinases participate in different biological processes, incl...

  9. Computational prediction of protein-protein interactions in Leishmania predicted proteomes.

    Directory of Open Access Journals (Sweden)

    Antonio M Rezende

    The trypanosomatid parasites Leishmania braziliensis, Leishmania major and Leishmania infantum are important human pathogens. Despite years of study and genome availability, an effective vaccine has not yet been developed, and chemotherapy is highly toxic. Therefore, it is clear that only interdisciplinary, integrated studies will succeed in the search for new targets for vaccine and drug development. An essential part of this rationale is the study of protein-protein interaction (PPI) networks, which can provide a better understanding of complex protein interactions in biological systems. Thus, we modeled PPIs for trypanosomatids through computational methods, using sequence comparison against public databases of protein or domain interactions for interaction prediction (Interolog Mapping), and developed a dedicated combined system score to address the robustness of the predictions. The confidence of the network prediction approach was evaluated using gold standard positive and negative datasets, and the AUC value obtained was 0.94. As a result, 39,420, 43,531 and 45,235 interactions were predicted for L. braziliensis, L. major and L. infantum, respectively. For each predicted network, the top 20 proteins were ranked by the MCC topological index. In addition, information on immunological potential, degree of protein sequence conservation among orthologs and degree of identity compared to proteins of potential parasite hosts was integrated. This information integration provides a better understanding and usefulness of the predicted networks that can be valuable to select new potential biological targets for drug and vaccine development. Network modularity, which is key when one is interested in destabilizing the PPIs for drug or vaccine purposes, along with multiple alignments of the predicted PPIs, were performed, revealing patterns associated with protein turnover. In addition, around 50% of hypothetical protein present in the networks

  10. Access to DNA and protein databases on the Internet.

    Science.gov (United States)

    Harper, R

    1994-02-01

    During the past year, the number of biological databases that can be queried via Internet has dramatically increased. This increase has resulted from the introduction of networking tools, such as Gopher and WAIS, that make it easy for research workers to index databases and make them available for on-line browsing. Biocomputing in the nineties will see the advent of more client/server options for the solution of problems in bioinformatics.

  11. GRIP: A web-based system for constructing Gold Standard datasets for protein-protein interaction prediction

    Directory of Open Access Journals (Sweden)

    Zheng Huiru

    2009-01-01

    Background: Information about protein interaction networks is fundamental to understanding protein function and cellular processes. Interaction patterns among proteins can suggest new drug targets and aid in the design of new therapeutic interventions. Efforts have been made to map interactions on a proteome-wide scale using both experimental and computational techniques. Reference datasets that contain known interacting proteins (positive cases) and non-interacting proteins (negative cases) are essential to support computational prediction and validation of protein-protein interactions. Information on known interacting and non-interacting proteins is usually stored within databases. Extraction of these data can be both complex and time consuming. Although the automatic construction of reference datasets for classification would be a useful resource for researchers, no public resource currently exists to perform this task. Results: GRIP (Gold Reference dataset constructor from Information on Protein complexes) is a web-based system that provides researchers with the functionality to create reference datasets for protein-protein interaction prediction in Saccharomyces cerevisiae. Both positive and negative cases for a reference dataset can be extracted, organised and downloaded by the user. GRIP also provides an upload facility whereby users can submit proteins to determine protein complex membership. A search facility is provided where a user can search for protein complex information in Saccharomyces cerevisiae. Conclusion: GRIP is developed to retrieve information on protein complex, cellular localisation, and physical and genetic interactions in Saccharomyces cerevisiae. Manual construction of reference datasets can be a time consuming process requiring programming knowledge. GRIP simplifies and speeds up this process by allowing users to automatically construct reference datasets. GRIP is free to access at http://rosalind.infj.ulst.ac.uk/GRIP/.
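
    To make the notion of a reference dataset concrete, the following is a minimal sketch of how positive and negative cases might be assembled from complex membership and cellular localisation. The selection rules used here (positives are pairs sharing a complex; negatives are non-positive pairs with no shared compartment) are a common convention assumed for illustration and are not taken from GRIP itself. All function names, protein identifiers and annotations are hypothetical.

```python
from itertools import combinations


def positive_pairs(complexes):
    """Pairs of proteins that are members of the same complex."""
    positives = set()
    for members in complexes.values():
        for a, b in combinations(sorted(members), 2):
            positives.add((a, b))
    return positives


def negative_pairs(localisation, positives):
    """Pairs that are not positives and share no cellular compartment
    (an assumed rule for choosing likely non-interacting proteins)."""
    negatives = set()
    for a, b in combinations(sorted(localisation), 2):
        if (a, b) in positives:
            continue
        if localisation[a].isdisjoint(localisation[b]):
            negatives.add((a, b))
    return negatives


# Made-up annotations for six hypothetical proteins.
complexes = {"cx1": {"P1", "P2", "P3"}, "cx2": {"P4", "P5"}}
localisation = {
    "P1": {"nucleus"},
    "P2": {"nucleus"},
    "P3": {"nucleus"},
    "P4": {"mitochondrion"},
    "P5": {"mitochondrion", "cytoplasm"},
    "P6": {"cytoplasm"},
}

positives = positive_pairs(complexes)
negatives = negative_pairs(localisation, positives)
```

    Sorting identifiers before pairing gives every pair a single canonical orientation, so a pair cannot appear in both sets under different orderings.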

  12. Detection of protein-protein interactions by ribosome display and protein in situ immobilisation.

    Science.gov (United States)

    He, Mingyue; Liu, Hong; Turner, Martin; Taussig, Michael J

    2009-12-31

    We describe a method for identification of protein-protein interactions by combining two cell-free protein technologies, namely ribosome display and protein in situ immobilisation. The method requires only PCR fragments as the starting material, the target proteins being made through cell-free protein synthesis, either associated with their encoding mRNA as ribosome complexes or immobilised on a solid surface. The use of ribosome complexes allows identification of interacting protein partners from their attached coding mRNA. To demonstrate the procedures, we have employed the lymphocyte signalling proteins Vav1 and Grb2 and confirmed the interaction between Grb2 and the N-terminal SH3 domain of Vav1. The method has promise for library screening of pairwise protein interactions, down to the analytical level of individual domain or motif mapping.

  13. UNcleProt (Universal Nuclear Protein database of barley): The first nuclear protein database that distinguishes proteins from different phases of the cell cycle

    Czech Academy of Sciences Publication Activity Database

    Blavet, Nicolas; Uřinovská, J.; Jeřábková, Hana; Chamrád, I.; Vrána, Jan; Lenobel, R.; Beinhauer, D.; Šebela, M.; Doležel, Jaroslav; Petrovská, Beáta

    2017-01-01

    Vol. 8, No. 1 (2017), pp. 70-80. ISSN 1949-1034. R&D Projects: GA ČR(CZ) GA14-28443S; GA MŠk(CZ) LO1204. Institutional support: RVO:61389030. Keywords: cicer-arietinum l. * rice oryza-sativa * chromatin-associated proteins * proteomic analysis * mitotic chromosomes * dehydration * localization * chickpea * network * phosphoproteome * barley * cell cycle * database * flow-cytometry * localization * mass spectrometry * nuclear proteome * nucleus. Subject RIV: CE - Biochemistry. OECD field: Cell biology. Impact factor: 2.387, year: 2016

  14. A membrane protein / signaling protein interaction network for Arabidopsis version AMPv2

    Directory of Open Access Journals (Sweden)

    Sylvie Lalonde

    2010-09-01

    Interactions between membrane proteins and the soluble fraction are essential for signal transduction and for regulating nutrient transport. To gain insights into the membrane-based interactome, 3,852 open reading frames (ORFs) out of a target list of 8,383 representing membrane and signaling proteins from Arabidopsis thaliana were cloned into a Gateway compatible vector. The mating-based split-ubiquitin system was used to screen for potential protein-protein interactions (pPPIs) among 490 Arabidopsis ORFs. A binary robotic screen between 142 receptor-like kinases, 72 transporters, 57 soluble protein kinases and phosphatases, 40 glycosyltransferases, 95 proteins of various functions and 89 proteins with unknown function detected 387 out of 90,370 possible PPIs. A secondary screen confirmed 343 (of 387) pPPIs between 179 proteins, yielding a scale-free network (r² = 0.863). Eighty of 142 transmembrane receptor-like kinases (RLKs) tested positive, identifying three homomers, 63 heteromers and 80 pPPIs with other proteins. Thirty-one out of 142 RLK interactors (including RLKs) had previously been found to be phosphorylated; thus interactors may be substrates for respective RLKs. None of the pPPIs described here had been reported in the major interactome databases, including potential interactors of G protein-coupled receptors, phospholipase C, and AMT ammonium transporters. Two RLKs found as putative interactors of AMT1;1 were independently confirmed using a split luciferase assay in Arabidopsis protoplasts. These RLKs may be involved in ammonium-dependent phosphorylation of the C-terminus and regulation of ammonium uptake activity. The robotic screening method established here will enable a systematic analysis of membrane protein interactions in fungi, plants and metazoa.

  15. PACSY, a relational database management system for protein structure and chemical shift analysis.

    Science.gov (United States)

    Lee, Woonghee; Yu, Wookyung; Kim, Suhkmann; Chang, Iksoo; Lee, Weontae; Markley, John L

    2012-10-01

    PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu.
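
    The abstract above describes relational tables linked by key identification numbers and queried through an RDBMS. A minimal sketch of that pattern follows. It uses Python's built-in sqlite3 module as a stand-in for MySQL or PostgreSQL, and the table names, column names and values are hypothetical illustrations, not PACSY's actual schema or data.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with two toy tables joined by key_id."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE atom_coord (
            key_id    INTEGER PRIMARY KEY,
            residue   TEXT,
            atom_name TEXT,
            x REAL, y REAL, z REAL
        );
        CREATE TABLE chem_shift (
            key_id    INTEGER REFERENCES atom_coord(key_id),
            shift_ppm REAL
        );
        """
    )
    # Made-up coordinates and chemical shifts for illustration only.
    conn.executemany(
        "INSERT INTO atom_coord VALUES (?, ?, ?, ?, ?, ?)",
        [
            (1, "ALA", "CA", 1.0, 2.0, 3.0),
            (2, "ALA", "N", 0.5, 1.5, 2.5),
            (3, "GLY", "CA", 4.0, 5.0, 6.0),
        ],
    )
    conn.executemany(
        "INSERT INTO chem_shift VALUES (?, ?)",
        [(1, 52.5), (2, 123.0), (3, 45.0)],
    )
    return conn


def shifts_for_atom(conn, atom_name):
    """Join coordinates and chemical shifts on the shared key for one atom type."""
    query = """
        SELECT c.residue, c.atom_name, s.shift_ppm
        FROM atom_coord AS c
        JOIN chem_shift AS s ON s.key_id = c.key_id
        WHERE c.atom_name = ?
        ORDER BY c.key_id
    """
    return conn.execute(query, (atom_name,)).fetchall()


conn = build_demo_db()
ca_shifts = shifts_for_atom(conn, "CA")
```

    Because both tables carry the same key, a single join combines structural and spectroscopic information for the same atom, which is the kind of cross-source query the abstract describes.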

  16. PACSY, a relational database management system for protein structure and chemical shift analysis

    Energy Technology Data Exchange (ETDEWEB)

    Lee, Woonghee, E-mail: whlee@nmrfam.wisc.edu [University of Wisconsin-Madison, National Magnetic Resonance Facility at Madison, and Biochemistry Department (United States); Yu, Wookyung [Center for Proteome Biophysics, Pusan National University, Department of Physics (Korea, Republic of); Kim, Suhkmann [Pusan National University, Department of Chemistry and Chemistry Institute for Functional Materials (Korea, Republic of); Chang, Iksoo [Center for Proteome Biophysics, Pusan National University, Department of Physics (Korea, Republic of); Lee, Weontae, E-mail: wlee@spin.yonsei.ac.kr [Yonsei University, Structural Biochemistry and Molecular Biophysics Laboratory, Department of Biochemistry (Korea, Republic of); Markley, John L., E-mail: markley@nmrfam.wisc.edu [University of Wisconsin-Madison, National Magnetic Resonance Facility at Madison, and Biochemistry Department (United States)

    2012-10-15

    PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu.

  17. PACSY, a relational database management system for protein structure and chemical shift analysis

    Science.gov (United States)

    Lee, Woonghee; Yu, Wookyung; Kim, Suhkmann; Chang, Iksoo

    2012-01-01

    PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu. PMID:22903636

  18. PACSY, a relational database management system for protein structure and chemical shift analysis

    International Nuclear Information System (INIS)

    Lee, Woonghee; Yu, Wookyung; Kim, Suhkmann; Chang, Iksoo; Lee, Weontae; Markley, John L.

    2012-01-01

    PACSY (Protein structure And Chemical Shift NMR spectroscopY) is a relational database management system that integrates information from the Protein Data Bank, the Biological Magnetic Resonance Data Bank, and the Structural Classification of Proteins database. PACSY provides three-dimensional coordinates and chemical shifts of atoms along with derived information such as torsion angles, solvent accessible surface areas, and hydrophobicity scales. PACSY consists of six relational table types linked to one another for coherence by key identification numbers. Database queries are enabled by advanced search functions supported by an RDBMS server such as MySQL or PostgreSQL. PACSY enables users to search for combinations of information from different database sources in support of their research. Two software packages, PACSY Maker for database creation and PACSY Analyzer for database analysis, are available from http://pacsy.nmrfam.wisc.edu.

  19. PROCARB: A Database of Known and Modelled Carbohydrate-Binding Protein Structures with Sequence-Based Prediction Tools

    Directory of Open Access Journals (Sweden)

    Adeel Malik

    2010-01-01

    Full Text Available Understanding of the three-dimensional structures of proteins that interact with carbohydrates covalently (glycoproteins) as well as noncovalently (protein-carbohydrate complexes) is essential to many biological processes and plays a significant role in normal and disease-associated functions. It is important to have a central repository of knowledge available about these protein-carbohydrate complexes as well as preprocessed data of predicted structures. This can be significantly enhanced by tools de novo which can predict carbohydrate-binding sites for proteins in the absence of structure of experimentally known binding site. PROCARB is an open-access database comprising three independently working components, namely, (i) Core PROCARB module, consisting of three-dimensional structures of protein-carbohydrate complexes taken from Protein Data Bank (PDB), (ii) Homology Models module, consisting of manually developed three-dimensional models of N-linked and O-linked glycoproteins of unknown three-dimensional structure, and (iii) CBS-Pred prediction module, consisting of web servers to predict carbohydrate-binding sites using single sequence or server-generated PSSM. Several precomputed structural and functional properties of complexes are also included in the database for quick analysis. In particular, information about function, secondary structure, solvent accessibility, hydrogen bonds and literature reference, and so forth, is included. In addition, each protein in the database is mapped to Uniprot, Pfam, PDB, and so forth.

  20. Identification of NAD interacting residues in proteins

    Directory of Open Access Journals (Sweden)

    Raghava Gajendra PS

    2010-03-01

    Full Text Available Abstract Background Small molecular cofactors or ligands play a crucial role in the proper functioning of cells. Accurate annotation of their target proteins and binding sites is required for the complete understanding of reaction mechanisms. Nicotinamide adenine dinucleotide (NAD+ or NAD) is one of the most commonly used organic cofactors in living cells, which plays a critical role in cellular metabolism, storage and regulatory processes. In the past, several NAD binding proteins (NADBP) have been reported in the literature, which are responsible for a wide range of activities in the cell. Attempts have been made to derive a rule for the binding of NAD+ to its target proteins. However, so far an efficient model could not be derived due to the time-consuming process of structure determination, and limitations of similarity based approaches. Thus a sequence and non-similarity based method is needed to characterize the NAD binding sites to help in the annotation. In this study attempts have been made to predict NAD binding proteins and their interacting residues (NIRs) from amino acid sequence using bioinformatics tools. Results We extracted 1556 protein chains from 555 NAD binding proteins whose structure is available in Protein Data Bank. Then we removed all redundant protein chains and finally obtained 195 non-redundant NAD binding protein chains, where no two chains have more than 40% sequence identity. In this study all models were developed and evaluated using five-fold cross validation technique on the above dataset of 195 NAD binding proteins. While certain types of residues are preferred (e.g. Gly, Tyr, Thr, His) in NAD interaction, residues like Ala, Glu, Leu, Lys are not preferred. A support vector machine (SVM) based method has been developed using various window lengths of amino acid sequence for predicting NAD interacting residues and obtained maximum Matthews correlation coefficient (MCC) 0.47 with accuracy 74.13% at window length 17
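
    The window-based input described in this abstract can be illustrated with a minimal sketch. It assumes a one-hot encoding over the 20 standard amino acids plus one extra slot, and a padding symbol "X" at the chain termini; the abstract specifies neither, and the function names are hypothetical. It shows only how a window of length 17 centred on each residue could be turned into a fixed-size vector for a classifier such as an SVM, not the authors' actual feature set.

```python
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"   # the 20 standard residues
PAD = "X"                              # assumed padding symbol beyond the termini


def windows(sequence, length=17):
    """Return one window of `length` residues centred on each position."""
    if length % 2 == 0:
        raise ValueError("window length must be odd so it has a centre")
    half = length // 2
    padded = PAD * half + sequence + PAD * half
    return [padded[i:i + length] for i in range(len(sequence))]


def one_hot(window):
    """Encode a window as a flat 0/1 vector with 21 slots per residue.

    Slot 20 is shared by the padding symbol and any non-standard residue.
    """
    slots = len(AMINO_ACIDS) + 1
    vector = []
    for residue in window:
        slot = AMINO_ACIDS.find(residue)
        if slot < 0:
            slot = len(AMINO_ACIDS)
        vector.extend(1 if k == slot else 0 for k in range(slots))
    return vector
```

    Each residue of a chain then yields one vector of 17 x 21 = 357 entries, labelled by whether the central residue is NAD-interacting.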

  1. Next-Generation Sequencing for Binary Protein-Protein Interactions

    Directory of Open Access Journals (Sweden)

    Bernhard eSuter

    2015-12-01

    Full Text Available The yeast two-hybrid (Y2H) system exploits host cell genetics in order to display binary protein-protein interactions (PPIs) via defined and selectable phenotypes. Numerous improvements have been made to this method, adapting the screening principle for diverse applications, including drug discovery and the scale-up for proteome wide interaction screens in human and other organisms. Here we discuss a systematic workflow and analysis scheme for screening data generated by Y2H and related assays that includes high-throughput selection procedures, readout of comprehensive results via next-generation sequencing (NGS), and the interpretation of interaction data via quantitative statistics. The novel assays and tools will serve the broader scientific community to harness the power of NGS technology to address PPI networks in health and disease. We discuss examples of how this next-generation platform can be applied to address specific questions in diverse fields of biology and medicine.

  2. PCNA Structure and Interactions with Partner Proteins

    KAUST Repository

    Oke, Muse; Zaher, Manal S.; Hamdan, Samir

    2018-01-01

    Proliferating cell nuclear antigen (PCNA) consists of three identical monomers that topologically encircle double-stranded DNA. PCNA stimulates the processivity of DNA polymerase δ and, to a lesser extent, the intrinsically highly processive DNA polymerase ε. It also functions as a platform that recruits and coordinates the activities of a large number of DNA processing proteins. Emerging structural and biochemical studies suggest that the nature of PCNA-partner protein interactions is complex. A hydrophobic groove at the front side of PCNA serves as a primary docking site for the consensus PIP box motifs present in many PCNA-binding partners. Sequences that immediately flank the PIP box motif or regions that are distant from it could also interact with the hydrophobic groove and other regions of PCNA. Posttranslational modifications on the backside of PCNA could add another dimension to its interaction with partner proteins. An encounter of PCNA with different DNA structures might also be involved in coordinating its interactions. Finally, the ability of PCNA to bind up to three proteins while topologically linked to DNA suggests that it would be a versatile toolbox in many different DNA processing reactions.

  3. PCNA Structure and Interactions with Partner Proteins

    KAUST Repository

    Oke, Muse

    2018-01-29

    Proliferating cell nuclear antigen (PCNA) consists of three identical monomers that topologically encircle double-stranded DNA. PCNA stimulates the processivity of DNA polymerase δ and, to a lesser extent, the intrinsically highly processive DNA polymerase ε. It also functions as a platform that recruits and coordinates the activities of a large number of DNA processing proteins. Emerging structural and biochemical studies suggest that the nature of PCNA-partner protein interactions is complex. A hydrophobic groove at the front side of PCNA serves as a primary docking site for the consensus PIP box motifs present in many PCNA-binding partners. Sequences that immediately flank the PIP box motif or regions that are distant from it could also interact with the hydrophobic groove and other regions of PCNA. Posttranslational modifications on the backside of PCNA could add another dimension to its interaction with partner proteins. An encounter of PCNA with different DNA structures might also be involved in coordinating its interactions. Finally, the ability of PCNA to bind up to three proteins while topologically linked to DNA suggests that it would be a versatile toolbox in many different DNA processing reactions.

  4. Semantic integration to identify overlapping functional modules in protein interaction networks

    Directory of Open Access Journals (Sweden)

    Ramanathan Murali

    2007-07-01

    Full Text Available Abstract Background The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms. Results We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO) annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches. Conclusion The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification.
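
    The conversion of an interaction network into a weighted graph, as described in this abstract, might look roughly like the sketch below. It does not reproduce the paper's semantic similarity or semantic interactivity metrics, which the abstract does not define; as a stand-in it uses the Jaccard overlap of two proteins' GO term sets, and the protein names, GO identifiers and function names are all hypothetical.

```python
def jaccard(terms_a, terms_b):
    """Overlap of two annotation sets, between 0.0 and 1.0."""
    a, b = set(terms_a), set(terms_b)
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def weight_interactions(interactions, annotations):
    """Build an undirected weighted adjacency map from protein pairs.

    `interactions` is an iterable of (protein, protein) pairs and
    `annotations` maps each protein to its set of GO term identifiers.
    The Jaccard score stands in for a reliability value.
    """
    weighted = {}
    for p, q in interactions:
        w = jaccard(annotations.get(p, ()), annotations.get(q, ()))
        weighted.setdefault(p, {})[q] = w
        weighted.setdefault(q, {})[p] = w
    return weighted


# Illustrative input only; identifiers are placeholders, not real annotations.
example_annotations = {
    "P1": {"GO:A", "GO:B"},
    "P2": {"GO:B", "GO:C"},
    "P3": set(),
}
example_graph = weight_interactions([("P1", "P2"), ("P2", "P3")], example_annotations)
```

    Interactions between proteins with no shared annotation receive weight 0.0, so a downstream module-detection step can down-weight them rather than discard them.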

  5. Integration and visualization of non-coding RNA and protein interaction networks

    OpenAIRE

    Junge, Alexander; Refsgaard, Jan Christian; Garde, Christian; Pan, Xiaoyong; Santos Delgado, Alberto; Anthon, Christian; Alkan, Ferhat; von Mering, Christian; Workman, Christopher; Jensen, Lars Juhl; Gorodkin, Jan

    2015-01-01

    Non-coding RNAs (ncRNAs) fulfill a diverse set of biological functions relying on interactions with other molecular entities. The advent of new experimental and computational approaches makes it possible to study ncRNAs and their associations on an unprecedented scale. We present RAIN (RNA Association and Interaction Networks) - a database that combines ncRNA-ncRNA, ncRNA-mRNA and ncRNA-protein interactions with large-scale protein association networks available in the STRING database. By int...

  6. Drosophila protein interaction map (DPiM): a paradigm for metazoan protein complex interactions.

    Science.gov (United States)

    Guruharsha, K G; Obar, Robert A; Mintseris, Julian; Aishwarya, K; Krishnan, R T; Vijayraghavan, K; Artavanis-Tsakonas, Spyros

    2012-01-01

    Proteins perform essential cellular functions as part of protein complexes, often in conjunction with RNA, DNA, metabolites and other small molecules. The genome encodes thousands of proteins but not all of them are expressed in every cell type; and expressed proteins are not active at all times. Such diversity of protein expression and function accounts for the level of biological intricacy seen in nature. Defining protein-protein interactions in protein complexes, and establishing the when, what and where of potential interactions, is therefore crucial to understanding the cellular function of any protein, especially those that have not been well studied by traditional molecular genetic approaches. We generated a large-scale resource of affinity-tagged expression-ready clones and used co-affinity purification combined with tandem mass-spectrometry to identify protein partners of nearly 5,000 Drosophila melanogaster proteins. The resulting protein complex "map" provided a blueprint of metazoan protein complex organization. Here we describe how the map has provided valuable insights into protein function in addition to generating hundreds of testable hypotheses. We also discuss recent technological advancements that will be critical in addressing the next generation of questions arising from the map.

  7. Cell penetrating peptides to dissect host-pathogen protein-protein interactions in Theileria -transformed leukocytes

    KAUST Repository

    Haidar, Malak; de Laté, Perle Latré; Kennedy, Eileen J.; Langsley, Gordon

    2017-01-01

    One powerful application of cell penetrating peptides is the delivery into cells of molecules that function as specific competitors or inhibitors of protein-protein interactions. Ablating defined protein-protein interactions is a refined way

  8. The PANTHER database of protein families, subfamilies, functions and pathways

    OpenAIRE

    Mi, Huaiyu; Lazareva-Ulitsky, Betty; Loo, Rozina; Kejariwal, Anish; Vandergriff, Jody; Rabkin, Steven; Guo, Nan; Muruganujan, Anushya; Doremieux, Olivier; Campbell, Michael J.; Kitano, Hiroaki; Thomas, Paul D.

    2004-01-01

    PANTHER is a large collection of protein families that have been subdivided into functionally related subfamilies, using human expertise. These subfamilies model the divergence of specific functions within protein families, allowing more accurate association with function (ontology terms and pathways), as well as inference of amino acids important for functional specificity. Hidden Markov models (HMMs) are built for each family and subfamily for classifying additional protein sequences. The l...

  9. Protein-protein interaction site predictions with three-dimensional probability distributions of interacting atoms on protein surfaces.

    Directory of Open Access Journals (Sweden)

    Ching-Tai Chen

    Full Text Available Protein-protein interactions are key to many biological processes. Computational methodologies devised to predict protein-protein interaction (PPI) sites on protein surfaces are important tools in providing insights into the biological functions of proteins and in developing therapeutics targeting the protein-protein interaction sites. One of the general features of PPI sites is that the core regions from the two interacting protein surfaces are complementary to each other, similar to the interior of proteins in packing density and in the physicochemical nature of the amino acid composition. In this work, we simulated the physicochemical complementarities by constructing three-dimensional probability density maps of non-covalent interacting atoms on the protein surfaces. The interacting probabilities were derived from the interior of known structures. Machine learning algorithms were applied to learn the characteristic patterns of the probability density maps specific to the PPI sites. The trained predictors for PPI sites were cross-validated with the training cases (consisting of 432 proteins) and were tested on an independent dataset (consisting of 142 proteins). The residue-based Matthews correlation coefficient for the independent test set was 0.423; the accuracy, precision, sensitivity, specificity were 0.753, 0.519, 0.677, and 0.779 respectively. The benchmark results indicate that the optimized machine learning models are among the best predictors in identifying PPI sites on protein surfaces. In particular, the PPI site prediction accuracy increases with increasing size of the PPI site and with increasing hydrophobicity in amino acid composition of the PPI interface; the core interface regions are more likely to be recognized with high prediction confidence.
The results indicate that the physicochemical complementarity patterns on protein surfaces are important determinants in PPIs, and a substantial portion of the PPI sites can be predicted

  10. Protein-Protein Interaction Site Predictions with Three-Dimensional Probability Distributions of Interacting Atoms on Protein Surfaces

    Science.gov (United States)

    Chen, Ching-Tai; Peng, Hung-Pin; Jian, Jhih-Wei; Tsai, Keng-Chang; Chang, Jeng-Yih; Yang, Ei-Wen; Chen, Jun-Bo; Ho, Shinn-Ying; Hsu, Wen-Lian; Yang, An-Suei

    2012-01-01

    Protein-protein interactions are key to many biological processes. Computational methodologies devised to predict protein-protein interaction (PPI) sites on protein surfaces are important tools in providing insights into the biological functions of proteins and in developing therapeutics targeting the protein-protein interaction sites. One of the general features of PPI sites is that the core regions from the two interacting protein surfaces are complementary to each other, similar to the interior of proteins in packing density and in the physicochemical nature of the amino acid composition. In this work, we simulated the physicochemical complementarities by constructing three-dimensional probability density maps of non-covalent interacting atoms on the protein surfaces. The interacting probabilities were derived from the interior of known structures. Machine learning algorithms were applied to learn the characteristic patterns of the probability density maps specific to the PPI sites. The trained predictors for PPI sites were cross-validated with the training cases (consisting of 432 proteins) and were tested on an independent dataset (consisting of 142 proteins). The residue-based Matthews correlation coefficient for the independent test set was 0.423; the accuracy, precision, sensitivity, specificity were 0.753, 0.519, 0.677, and 0.779 respectively. The benchmark results indicate that the optimized machine learning models are among the best predictors in identifying PPI sites on protein surfaces. In particular, the PPI site prediction accuracy increases with increasing size of the PPI site and with increasing hydrophobicity in amino acid composition of the PPI interface; the core interface regions are more likely to be recognized with high prediction confidence. 
The results indicate that the physicochemical complementarity patterns on protein surfaces are important determinants in PPIs, and a substantial portion of the PPI sites can be predicted correctly with
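
    The residue-level figures quoted in this abstract (Matthews correlation coefficient, accuracy, precision, sensitivity, specificity) all derive from the four cells of a binary confusion matrix. The sketch below uses the standard definitions; the function name is hypothetical and the counts in the usage note are made up for illustration, since the abstract does not report the underlying counts.

```python
import math


def binary_metrics(tp, fp, tn, fn):
    """Standard binary classification metrics from confusion-matrix counts."""
    total = tp + fp + tn + fn
    if total == 0:
        raise ValueError("at least one prediction is required")
    # Product of the four marginals; zero when any row or column is empty.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    return {
        "accuracy": (tp + tn) / total,
        "precision": tp / (tp + fp) if tp + fp else 0.0,
        "sensitivity": tp / (tp + fn) if tp + fn else 0.0,
        "specificity": tn / (tn + fp) if tn + fp else 0.0,
        "mcc": (tp * tn - fp * fn) / denom if denom else 0.0,
    }
```

    For example, `binary_metrics(50, 10, 30, 10)` gives an accuracy of 0.8 and a specificity of 0.75. Unlike accuracy, the MCC stays informative when interface residues are a small minority of all surface residues, which is presumably why it is reported first.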

  11. PreBIND and Textomy – mining the biomedical literature for protein-protein interactions using a support vector machine

    Directory of Open Access Journals (Sweden)

    Baskin Berivan

    2003-03-01

    Full Text Available Abstract Background The majority of experimentally verified molecular interaction and biological pathway data are present in the unstructured text of biomedical journal articles where they are inaccessible to computational methods. The Biomolecular interaction network database (BIND) seeks to capture these data in a machine-readable format. We hypothesized that the formidable task-size of backfilling the database could be reduced by using Support Vector Machine technology to first locate interaction information in the literature. We present an information extraction system that was designed to locate protein-protein interaction data in the literature and present these data to curators and the public for review and entry into BIND. Results Cross-validation estimated that the support vector machine's test-set precision, accuracy and recall for classifying abstracts describing interaction information were 92%, 90% and 92% respectively. We estimated that the system would be able to recall up to 60% of all non-high throughput interactions present in another yeast-protein interaction database. Finally, this system was applied to a real-world curation problem and its use was found to reduce the task duration by 70% thus saving 176 days. Conclusions Machine learning methods are useful as tools to direct interaction and pathway database back-filling; however, this potential can only be realized if these techniques are coupled with human review and entry into a factual database such as BIND. The PreBIND system described here is available to the public at http://bind.ca. Current capabilities allow searching for human, mouse and yeast protein-interaction information.

  12. Protein (Viridiplantae) - PGDBj - Ortholog DB | LSDB Archive [Life Science Database Archive metadata]

    Lifescience Database Archive (English)

    Full Text Available PGDBj - Ortholog DB. Protein (Viridiplantae). Data detail. Data name: Protein (Viridiplantae). DO...

  13. Tools for controlling protein interactions with light

    Science.gov (United States)

    Tucker, Chandra L.; Vrana, Justin D.; Kennedy, Matthew J.

    2014-01-01

    Genetically-encoded actuators that allow control of protein-protein interactions with light, termed ‘optical dimerizers’, are emerging as new tools for experimental biology. In recent years, numerous new and versatile dimerizer systems have been developed. Here we discuss the design of optical dimerizer experiments, including choice of a dimerizer system, photoexcitation sources, and coordinate use of imaging reporters. We provide detailed protocols for experiments using two dimerization systems we previously developed, CRY2/CIB and UVR8/UVR8, for use controlling transcription, protein localization, and protein secretion with light. Additionally, we provide instructions and software for constructing a pulse-controlled LED light device for use in experiments requiring extended light treatments. PMID:25181301

  14. [Interaction of protein with charged colloidal particles].

    Science.gov (United States)

    Durdenko, E V; Kuznetsova, S M; Basova, L V; Tikhonenko, S A; Saburova, E A

    2011-01-01

    The functional state of three proteins of different molecular weight (urease, lactate dehydrogenase, and hemoglobin) in the presence of the linear polyelectrolytes poly(allylamine hydrochloride) (PAA) and sodium poly(styrenesulfonate) (PSS) in the dissolved state and of the same polyelectrolytes bound to the surface of microspheres has been investigated. Microspheres were prepared by consecutive absorption of oppositely charged polyelectrolytes so that the outer layer of the shell was PAA for the acidic protein urease, and PSS for the alkaline proteins LDH and hemoglobin. It was shown that the dissolved polyelectrolyte completely inactivates all three proteins within one minute with a slight difference in the time constant. (By Hb inactivation are conventionally meant changes in the heme environment observed from the spectrum in the Soret band.) In the presence of microspheres, the proteins were adsorbed on their surface; in this case, more than 95% of the activity was retained within two hours. The proportion of the protein adsorbed on microspheres accounted for about 98% for urease, 72% for Hb, and 35% for LDH, as determined from the tryptophan fluorescence data. The interaction of hemoglobin with another type of charged colloidal particles, phospholipid vesicles, leads to the destruction of the tertiary structure of the protein, which made itself evident in the optical absorption spectra in the Soret band, as well as the spectra of tryptophan fluorescence and circular dichroism. In this case, according to circular dichroism, the percentage of alpha-helical structure of Hb was maintained. The differences in the physical and chemical mechanisms of interaction of proteins with these two types of charged colloidal particles that leads to differences in the degree of denaturing effects are discussed.

  15. Interactive Multi-Instrument Database of Solar Flares

    Science.gov (United States)

    Ranjan, Shubha S.; Spaulding, Ryan; Deardorff, Donald G.

    2018-01-01

    The fundamental motivation of the project is that the scientific output of solar research can be greatly enhanced by better exploitation of the existing solar/heliosphere space-data products jointly with ground-based observations. Our primary focus is on developing a specific innovative methodology based on recent advances in "big data" intelligent databases applied to the growing amount of high-spatial and multi-wavelength resolution, high-cadence data from NASA's missions and supporting ground-based observatories. Our flare database is not simply a manually searchable time-based catalog of events or list of web links pointing to data. It is a preprocessed metadata repository enabling fast search and automatic identification of all recorded flares sharing a specifiable set of characteristics, features, and parameters. The result is a new and unique database of solar flares and data search and classification tools for the Heliophysics community, enabling multi-instrument/multi-wavelength investigations of flare physics and supporting further development of flare-prediction methodologies.

  16. Bayesian network model for identification of pathways by integrating protein interaction with genetic interaction data.

    Science.gov (United States)

    Fu, Changhe; Deng, Su; Jin, Guangxu; Wang, Xinxin; Yu, Zu-Guo

    2017-09-21

    Molecular interaction data at proteomic and genetic levels provide physical and functional insights into a molecular biosystem and are helpful for the construction of pathway structures complementarily. Despite advances in inferring biological pathways using genetic interaction data, there still exists weakness in developed models, such as, activity pathway networks (APN), when integrating the data from proteomic and genetic levels. It is necessary to develop new methods to infer pathway structure by both of interaction data. We utilized probabilistic graphical model to develop a new method that integrates genetic interaction and protein interaction data and infers exquisitely detailed pathway structure. We modeled the pathway network as Bayesian network and applied this model to infer pathways for the coherent subsets of the global genetic interaction profiles, and the available data set of endoplasmic reticulum genes. The protein interaction data were derived from the BioGRID database. Our method can accurately reconstruct known cellular pathway structures, including SWR complex, ER-Associated Degradation (ERAD) pathway, N-Glycan biosynthesis pathway, Elongator complex, Retromer complex, and Urmylation pathway. By comparing N-Glycan biosynthesis pathway and Urmylation pathway identified from our approach with that from APN, we found that our method is able to overcome its weakness (certain edges are inexplicable). According to underlying protein interaction network, we defined a simple scoring function that only adopts genetic interaction information to avoid the balance difficulty in the APN. Using the effective stochastic simulation algorithm, the performance of our proposed method is significantly high. We developed a new method based on Bayesian network to infer detailed pathway structures from interaction data at proteomic and genetic levels. The results indicate that the developed method performs better in predicting signaling pathways than previously

  17. Prediction and characterization of protein-protein interaction networks in swine

    Directory of Open Access Journals (Sweden)

    Wang Fen

    2012-01-01

    Full Text Available Abstract Background Studying the large-scale protein-protein interaction (PPI) network is important in understanding biological processes. The current research presents the first PPI map of swine, which aims to give new insights into understanding their biological processes. Results We used three methods, Interolog-based prediction of porcine PPI network, domain-motif interactions from structural topology-based prediction of porcine PPI network and motif-motif interactions from structural topology-based prediction of porcine PPI network, to predict porcine protein interactions among 25,767 porcine proteins. We predicted 20,213, 331,484, and 218,705 porcine PPIs respectively, merged the three results into 567,441 PPIs, constructed four PPI networks, and analyzed the topological properties of the porcine PPI networks. Our predictions were validated with Pfam domain annotations and GO annotations. Averages of 70, 10,495, and 863 interactions were related to the Pfam domain-interacting pairs in iPfam database. For comparison, randomized networks were generated, and averages of only 4.24, 66.79, and 44.26 interactions were associated with Pfam domain-interacting pairs in iPfam database. In GO annotations, we found 52.68%, 75.54%, 27.20% of the predicted PPIs sharing GO terms respectively. However, the number of the 10,000 randomized networks in which the share of PPI pairs sharing GO terms reached 52.68%, 75.54%, 27.20% is 0. Finally, we determined the accuracy and precision of the methods. The methods yielded accuracies of 0.92, 0.53, and 0.50 at precisions of about 0.93, 0.74, and 0.75, respectively. Conclusion The results reveal that the predicted PPI networks are considerably reliable. The present research is an important pioneering work on protein function research. The porcine PPI data set, the confidence score of each interaction and a list of related data are available at (http://pppid.biositemap.com/).
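
    The comparison against 10,000 randomized networks reported in this abstract is, in effect, an empirical significance test: count how many randomized networks reach the statistic observed in the predicted network. The sketch below is one way to express that count, assuming the statistic is a single number per network (for instance the share of PPIs sharing GO terms). The function name and the add-one correction are assumptions, not taken from the paper.

```python
def empirical_p_value(observed, randomized):
    """Count randomized values reaching `observed` and estimate a p-value.

    Returns (count, p) where p = (count + 1) / (n + 1), a commonly used
    conservative estimate that never reports exactly zero.
    """
    values = list(randomized)
    if not values:
        raise ValueError("at least one randomized value is required")
    reaching = sum(1 for value in values if value >= observed)
    return reaching, (reaching + 1) / (len(values) + 1)
```

    With a count of 0 among 10,000 randomized networks, as the abstract reports, this estimate would be 1/10,001, i.e. just under 0.0001.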

  18. Topology-function conservation in protein-protein interaction networks.

    Science.gov (United States)

    Davis, Darren; Yaveroğlu, Ömer Nebil; Malod-Dognin, Noël; Stojmirovic, Aleksandar; Pržulj, Nataša

    2015-05-15

    Proteins underlie the functioning of a cell and the wiring of proteins in the protein-protein interaction network (PIN) relates to their biological functions. Proteins with similar wiring in the PIN (topology around them) have been shown to have similar functions. This property has been successfully exploited for predicting protein functions. Topological similarity is also used to guide network alignment algorithms that find similarly wired proteins between PINs of different species; these similarities are used to transfer annotation across PINs, e.g. from model organisms to human. To refine these functional predictions and annotation transfers, we need to gain insight into the variability of the topology-function relationships. For example, a function may be significantly associated with specific topologies, while another function may be weakly associated with several different topologies. Also, the topology-function relationships may differ between different species. To improve our understanding of topology-function relationships and of their conservation among species, we develop a statistical framework that is built upon canonical correlation analysis. Using the graphlet degrees to represent the wiring around proteins in PINs and gene ontology (GO) annotations to describe their functions, our framework: (i) characterizes statistically significant topology-function relationships in a given species, and (ii) uncovers the functions that have conserved topology in PINs of different species, which we term topologically orthologous functions. We apply our framework to PINs of yeast and human, identifying seven biological process and two cellular component GO terms to be topologically orthologous for the two organisms. © The Author 2015. Published by Oxford University Press.

  19. O-GLYCOBASE version 4.0: a revised database of O-glycosylated proteins

    DEFF Research Database (Denmark)

    Gupta, Ramneek; Birch, Hanne; Rapacki, Krzysztof

    1999-01-01

    O-GLYCBASE is a database of glycoproteins with O-linked glycosylation sites. Entries with at least one experimentally verified O-glycosylation site have been compiled from protein sequence databases and literature. Each entry contains information about the glycan involved, the species, sequence, ...

  20. Integrated Controlling System and Unified Database for High Throughput Protein Crystallography Experiments

    International Nuclear Information System (INIS)

    Gaponov, Yu.A.; Igarashi, N.; Hiraki, M.; Sasajima, K.; Matsugaki, N.; Suzuki, M.; Kosuge, T.; Wakatsuki, S.

    2004-01-01

    An integrated controlling system and a unified database for high throughput protein crystallography experiments have been developed. Main features of protein crystallography experiments (purification, crystallization, crystal harvesting, data collection, data processing) were integrated into the software under development. All information necessary to perform protein crystallography experiments is stored (except raw X-ray data that are stored in a central data server) in a MySQL relational database. The database contains four mutually linked hierarchical trees describing protein crystals, data collection of protein crystal and experimental data processing. A database editor was designed and developed. The editor supports basic database functions to view, create, modify and delete user records in the database. Two search engines were realized: direct search of necessary information in the database and object oriented search. The system is based on TCP/IP secure UNIX sockets with four predefined sending and receiving behaviors, which support communications between all connected servers and clients with remote control functions (creating and modifying data for experimental conditions, data acquisition, viewing experimental data, and performing data processing). Two secure login schemes were designed and developed: a direct method (using the developed Linux clients with secure connection) and an indirect method (using the secure SSL connection using secure X11 support from any operating system with X-terminal and SSH support). A part of the system has been implemented on a new MAD beam line, NW12, at the Photon Factory Advanced Ring for general user experiments

  1. Protein-Ligand Empirical Interaction Components for Virtual Screening.

    Science.gov (United States)

    Yan, Yuna; Wang, Weijun; Sun, Zhaoxi; Zhang, John Z H; Ji, Changge

    2017-08-28

    A major shortcoming of empirical scoring functions is that they often fail to predict binding affinity properly. Removing false positives of docking results is one of the most challenging works in structure-based virtual screening. Postdocking filters, making use of all kinds of experimental structure and activity information, may help in solving the issue. We describe a new method based on detailed protein-ligand interaction decomposition and machine learning. Protein-ligand empirical interaction components (PLEIC) are used as descriptors for support vector machine learning to develop a classification model (PLEIC-SVM) to discriminate false positives from true positives. Experimentally derived activity information is used for model training. An extensive benchmark study on 36 diverse data sets from the DUD-E database has been performed to evaluate the performance of the new method. The results show that the new method performs much better than standard empirical scoring functions in structure-based virtual screening. The trained PLEIC-SVM model is able to capture important interaction patterns between ligand and protein residues for one specific target, which is helpful in discarding false positives in postdocking filtering.

  2. Poly(ethylene glycol) interactions with proteins

    Czech Academy of Sciences Publication Activity Database

    Hašek, Jindřich

    2006-01-01

    Roč. 2, č. 23 (2006), s. 613-618 ISSN 0044-2968. [European Powder Diffraction Conference /9./. Prague, 02.09.2004-05.09.2004] R&D Projects: GA ČR(CZ) GA204/02/0843 Institutional research plan: CEZ:AV0Z40500505 Keywords : poly(ethylene glycol) * PEO * protein-polymer interaction Subject RIV: CD - Macromolecular Chemistry Impact factor: 1.897, year: 2006

  3. Identification of Protein-Protein Interactions with Glutathione-S-Transferase (GST) Fusion Proteins.

    Science.gov (United States)

    Einarson, Margret B; Pugacheva, Elena N; Orlinick, Jason R

    2007-08-01

    INTRODUCTION Glutathione-S-transferase (GST) fusion proteins have had a wide range of applications since their introduction as tools for synthesis of recombinant proteins in bacteria. GST was originally selected as a fusion moiety because of several desirable properties. First and foremost, when expressed in bacteria alone, or as a fusion, GST is not sequestered in inclusion bodies (in contrast to previous fusion protein systems). Second, GST can be affinity-purified without denaturation because it binds to immobilized glutathione, which provides the basis for simple purification. Consequently, GST fusion proteins are routinely used for antibody generation and purification, protein-protein interaction studies, and biochemical analysis. This article describes the use of GST fusion proteins as probes for the identification of protein-protein interactions.

  4. Notable Aspects of Glycan-Protein Interactions

    Directory of Open Access Journals (Sweden)

    Miriam Cohen

    2015-09-01

    Full Text Available This mini review highlights several interesting aspects of glycan-mediated interactions that are common between cells, bacteria, and viruses. Glycans are ubiquitously found on all living cells, and in the extracellular milieu of multicellular organisms. They are known to mediate initial binding and recognition events of both immune cells and pathogens with their target cells or tissues. The host target tissues are hidden under a layer of secreted glycosylated decoy targets. In addition, pathogens can utilize and display host glycans to prevent identification as foreign by the host’s immune system (molecular mimicry). Both the host and pathogens continually evolve. The host evolves to prevent infection and the pathogens evolve to evade host defenses. Many pathogens express both glycan-binding proteins and glycosidases. Interestingly, these proteins are often located at the tip of elongated protrusions in bacteria, or in the leading edge of the cell. Glycan-protein interactions have low affinity and, as a result, multivalent interactions are often required to achieve biologically relevant binding. These enable dynamic forms of adhesion mechanisms, reviewed here, and include rolling (cells), stick and roll (bacteria) or surfacing (viruses).

  5. CellMap visualizes protein-protein interactions and subcellular localization

    Science.gov (United States)

    Dallago, Christian; Goldberg, Tatyana; Andrade-Navarro, Miguel Angel; Alanis-Lobato, Gregorio; Rost, Burkhard

    2018-01-01

    Many tools visualize protein-protein interaction (PPI) networks. The tool introduced here, CellMap, adds one crucial novelty by visualizing PPI networks in the context of subcellular localization, i.e. the location in the cell or cellular component in which a PPI happens. Users can upload images of cells and define areas of interest against which PPIs for selected proteins are displayed (by default on a cartoon of a cell). Annotations of localization are provided by the user or through our in-house database. The visualizer and server are written in JavaScript, making CellMap easy to customize and to extend by researchers and developers. PMID:29497493

  6. Detection of protein complex from protein-protein interaction network using Markov clustering

    International Nuclear Information System (INIS)

    Ochieng, P J; Kusuma, W A; Haryanto, T

    2017-01-01

    Detection of complexes, or groups of functionally related proteins, is an important challenge when analysing biological networks. However, existing algorithms to identify protein complexes are insufficient when applied to dense networks of experimentally derived interaction data. Therefore, we introduced a graph clustering method based on the Markov clustering (MCL) algorithm to identify protein complexes within highly interconnected protein-protein interaction networks. A protein-protein interaction network was first constructed to develop a geometrical network; the network was then partitioned using Markov clustering to detect protein complexes. The interest of the proposed method was illustrated by its application to human proteins associated with type II diabetes mellitus. Flow simulation of the MCL algorithm was initially performed and topological properties of the resultant network were analysed for detection of the protein complexes. The results indicated that the proposed method successfully detected a total of 34 complexes, with 11 complexes consisting of overlapping modules and 20 non-overlapping modules. The major complex consisted of 102 proteins and 521 interactions with cluster modularity and density of 0.745 and 0.101 respectively. The comparison analysis revealed that MCL outperformed the AP, MCODE and SCPS algorithms, with a high clustering coefficient (0.751), network density and modularity index (0.630). This demonstrated that MCL was the most reliable and efficient graph clustering algorithm for detection of protein complexes from PPI networks. (paper)
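
    The flow simulation behind Markov clustering alternates two steps on a column-stochastic matrix: expansion (matrix squaring, which spreads flow along paths) and inflation (elementwise powering followed by renormalization, which strengthens strong flows and weakens weak ones). The sketch below is a minimal dense-matrix illustration of that generic loop, not the implementation used in the record above; the function name `mcl`, the default inflation of 2.0 and the added self-loops are assumptions made for the example.

```python
def mcl(edges, n, inflation=2.0, iterations=50, tol=1e-9):
    """Cluster an undirected graph on nodes 0..n-1 with a basic MCL loop."""
    # Adjacency matrix with self-loops (a common MCL convention, assumed here).
    m = [[0.0] * n for _ in range(n)]
    for a, b in edges:
        m[a][b] = m[b][a] = 1.0
    for i in range(n):
        m[i][i] = 1.0

    def normalize(mat):
        # Rescale every column so that it sums to 1.
        for j in range(n):
            total = sum(mat[i][j] for i in range(n))
            if total > 0:
                for i in range(n):
                    mat[i][j] /= total
        return mat

    m = normalize(m)
    for _ in range(iterations):
        # Expansion: square the matrix.
        expanded = [[sum(m[i][k] * m[k][j] for k in range(n)) for j in range(n)]
                    for i in range(n)]
        # Inflation: elementwise power, then renormalize the columns.
        inflated = normalize([[v ** inflation for v in row] for row in expanded])
        delta = max(abs(inflated[i][j] - m[i][j])
                    for i in range(n) for j in range(n))
        m = inflated
        if delta < tol:
            break

    # Each row that still carries flow lists the members of one cluster.
    clusters = set()
    for i in range(n):
        members = frozenset(j for j in range(n) if m[i][j] > 1e-6)
        if members:
            clusters.add(members)
    return sorted(sorted(c) for c in clusters)
```

    Raising `inflation` generally yields more, smaller clusters; real PPI networks would call for a sparse-matrix implementation rather than these nested lists.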

  7. ARAMEMNON, a novel database for Arabidopsis integral membrane proteins

    DEFF Research Database (Denmark)

    Schwacke, Rainer; Schneider, Anja; van der Graaff, Eric

    2003-01-01

    spans and are possibly linked to transport functions. The ARAMEMNON DB enables direct comparison of the predictions of seven different TM span computation programs and the predictions of subcellular localization by eight signal peptide recognition programs. A special function displays the proteins...

  8. Quantifying the molecular origins of opposite solvent effects on protein-protein interactions.

    Directory of Open Access Journals (Sweden)

    Vincent Vagenende

    Full Text Available Although the nature of solvent-protein interactions is generally weak and non-specific, addition of cosolvents such as denaturants and osmolytes strengthens protein-protein interactions for some proteins, whereas it weakens protein-protein interactions for others. This is exemplified by the puzzling observation that addition of glycerol oppositely affects the association constants of two antibodies, D1.3 and D44.1, with lysozyme. To resolve this conundrum, we develop a methodology based on the thermodynamic principles of preferential interaction theory and the quantitative characterization of local protein solvation from molecular dynamics simulations. We find that changes of preferential solvent interactions at the protein-protein interface quantitatively account for the opposite effects of glycerol on the antibody-antigen association constants. Detailed characterization of local protein solvation in the free and associated protein states reveals how opposite solvent effects on protein-protein interactions depend on the extent of dewetting of the protein-protein contact region and on structural changes that alter cooperative solvent-protein interactions at the periphery of the protein-protein interface. These results demonstrate the direct relationship between macroscopic solvent effects on protein-protein interactions and atom-scale solvent-protein interactions, and establish a general methodology for predicting and understanding solvent effects on protein-protein interactions in diverse biological environments.

  9. Visualization of protein interaction networks: problems and solutions

    Directory of Open Access Journals (Sweden)

    Agapito Giuseppe

    2013-01-01

    Full Text Available Abstract Background Visualization concerns the representation of data visually and is an important task in scientific research. Protein-protein interactions (PPI) are discovered using either wet lab techniques, such as mass spectrometry, or in silico prediction tools, resulting in large collections of interactions stored in specialized databases. The set of all interactions of an organism forms a protein-protein interaction network (PIN) and is an important tool for studying the behaviour of the cell machinery. Since graphic representation of PINs may highlight important substructures, e.g. protein complexes, visualization is more and more used to study the underlying graph structure of PINs. Although graphs are well known data structures, there are different open problems regarding PINs visualization: the high number of nodes and connections, the heterogeneity of nodes (proteins) and edges (interactions), the possibility to annotate proteins and interactions with biological information extracted by ontologies (e.g. Gene Ontology) that enriches the PINs with semantic information, but complicates their visualization. Methods In these last years many software tools for the visualization of PINs have been developed. Initially thought for visualization only, some of them have been successively enriched with new functions for PPI data management and PIN analysis. The paper analyzes the main software tools for PINs visualization considering four main criteria: (i) technology, i.e. availability/license of the software and supported OS (Operating System) platforms; (ii) interoperability, i.e. ability to import/export networks in various formats, ability to export data in a graphic format, extensibility of the system, e.g. through plug-ins; (iii) visualization, i.e. supported layout and rendering algorithms and availability of parallel implementation; (iv) analysis, i.e. availability of network analysis functions, such as clustering or mining of the graph, and the

  10. Protein interaction networks by proteome peptide scanning.

    Directory of Open Access Journals (Sweden)

    Christiane Landgraf

    2004-01-01

    Full Text Available A substantial proportion of protein interactions relies on small domains binding to short peptides in the partner proteins. Many of these interactions are relatively low affinity and transient, and they impact on signal transduction. However, neither the number of potential interactions mediated by each domain nor the degree of promiscuity at a whole proteome level has been investigated. We have used a combination of phage display and SPOT synthesis to discover all the peptides in the yeast proteome that have the potential to bind to eight SH3 domains. We first identified the peptides that match a relaxed consensus, as deduced from peptides selected by phage display experiments. Next, we synthesized all the matching peptides at high density on a cellulose membrane, and we probed them directly with the SH3 domains. The domains that we have studied were grouped by this approach into five classes with partially overlapping specificity. Within the classes, however, the domains display a high promiscuity and bind to a large number of common targets with comparable affinity. We estimate that the yeast proteome contains as few as six peptides that bind to the Abp1 SH3 domain with a dissociation constant lower than 100 microM, while it contains as many as 50-80 peptides with corresponding affinity for the SH3 domain of Yfr024c. All the targets of the Abp1 SH3 domain, identified by this approach, bind to the native protein in vivo, as shown by coimmunoprecipitation experiments. Finally, we demonstrate that this strategy can be extended to the analysis of the entire human proteome. We have developed an approach, named WISE (whole interactome scanning experiment), that permits rapid and reliable identification of the partners of any peptide recognition module by peptide scanning of a proteome. Since the SPOT synthesis approach is semiquantitative and provides an approximation of the dissociation constants of the several thousands of interactions that are

  11. Computational analysis of RNA-protein interaction interfaces via the Voronoi diagram.

    Science.gov (United States)

    Mahdavi, Sedigheh; Mohades, Ali; Salehzadeh Yazdi, Ali; Jahandideh, Samad; Masoudi-Nejad, Ali

    2012-01-21

    Cellular functions are mediated by various biological processes including biomolecular interactions, such as protein-protein, DNA-protein and RNA-protein interactions, of which RNA-Protein interactions are indispensable for many biological processes like cell development and viral replication. Unlike the protein-protein and protein-DNA interactions, accurate mechanisms and structures of the RNA-Protein complexes are not fully understood. A large amount of theoretical evidence has shown during the past several years that computational geometry is the first step in understanding the binding profiles and plays a key role in the study of intricate biological structures, interactions and complexes. In this paper, the RNA-Protein interaction interface surface is computed via the weighted Voronoi diagram of atoms. Using two filter operations provides a natural definition for interface atoms, as in classic methods. Unbounded parts of Voronoi facets that are far from the complex are trimmed using a modified convex hull of atom centers. This algorithm is applied to a database of different RNA-Protein complexes extracted from the Protein Data Bank (PDB). Afterward, the features of interfaces have been computed and compared with the classic method. The results show high correlation coefficients between interface size in the Voronoi model and the classical model based on solvent accessibility, as well as high accuracy and precision in comparison to the classical model. Copyright © 2011 Elsevier Ltd. All rights reserved.

  12. Prediction of host - pathogen protein interactions between Mycobacterium tuberculosis and Homo sapiens using sequence motifs.

    Science.gov (United States)

    Huo, Tong; Liu, Wei; Guo, Yu; Yang, Cheng; Lin, Jianping; Rao, Zihe

    2015-03-26

    Emergence of multiple drug resistant strains of M. tuberculosis (MDR-TB) threatens to derail global efforts aimed at reining in the pathogen. Co-infections of M. tuberculosis with HIV are difficult to treat. To counter these new challenges, it is essential to study the interactions between M. tuberculosis and the host to learn how these bacteria cause disease. We report a systematic flow to predict the host pathogen interactions (HPIs) between M. tuberculosis and Homo sapiens based on sequence motifs. First, protein sequences were used as initial input for identifying the HPIs by 'interolog' method. HPIs were further filtered by prediction of domain-domain interactions (DDIs). Functional annotations of protein and publicly available experimental results were applied to filter the remaining HPIs. Using such a strategy, 118 pairs of HPIs were identified, which involve 43 proteins from M. tuberculosis and 48 proteins from Homo sapiens. A biological interaction network between M. tuberculosis and Homo sapiens was then constructed using the predicted inter- and intra-species interactions based on the 118 pairs of HPIs. Finally, a web accessible database named PATH (Protein interactions of M. tuberculosis and Human) was constructed to store these predicted interactions and proteins. This interaction network will facilitate the research on host-pathogen protein-protein interactions, and may throw light on how M. tuberculosis interacts with its host.

  13. Parallel force assay for protein-protein interactions.

    Science.gov (United States)

    Aschenbrenner, Daniela; Pippig, Diana A; Klamecka, Kamila; Limmer, Katja; Leonhardt, Heinrich; Gaub, Hermann E

    2014-01-01

    Quantitative proteome research is greatly promoted by high-resolution parallel format assays. A characterization of protein complexes based on binding forces offers an unparalleled dynamic range and allows for the effective discrimination of non-specific interactions. Here we present a DNA-based Molecular Force Assay to quantify protein-protein interactions, namely the bond between different variants of GFP and GFP-binding nanobodies. We present different strategies to adjust the maximum sensitivity window of the assay by influencing the binding strength of the DNA reference duplexes. The binding of the nanobody Enhancer to the different GFP constructs is compared at high sensitivity of the assay. Whereas the binding strength to wild type and enhanced GFP are equal within experimental error, stronger binding to superfolder GFP is observed. This difference in binding strength is attributed to alterations in the amino acids that form contacts according to the crystal structure of the initial wild type GFP-Enhancer complex. Moreover, we outline the potential for large-scale parallelization of the assay.

  14. Parallel force assay for protein-protein interactions.

    Directory of Open Access Journals (Sweden)

    Daniela Aschenbrenner

    Full Text Available Quantitative proteome research is greatly promoted by high-resolution parallel format assays. A characterization of protein complexes based on binding forces offers an unparalleled dynamic range and allows for the effective discrimination of non-specific interactions. Here we present a DNA-based Molecular Force Assay to quantify protein-protein interactions, namely the bond between different variants of GFP and GFP-binding nanobodies. We present different strategies to adjust the maximum sensitivity window of the assay by influencing the binding strength of the DNA reference duplexes. The binding of the nanobody Enhancer to the different GFP constructs is compared at high sensitivity of the assay. Whereas the binding strength to wild type and enhanced GFP are equal within experimental error, stronger binding to superfolder GFP is observed. This difference in binding strength is attributed to alterations in the amino acids that form contacts according to the crystal structure of the initial wild type GFP-Enhancer complex. Moreover, we outline the potential for large-scale parallelization of the assay.

  15. Interrogating the architecture of protein assemblies and protein interaction networks by cross-linking mass spectrometry

    NARCIS (Netherlands)

    Liu, Fan; Heck, Albert J R

    2015-01-01

    Proteins are involved in almost all processes of the living cell. They are organized through extensive networks of interaction, by tightly bound macromolecular assemblies or more transiently via signaling nodes. Therefore, revealing the architecture of protein complexes and protein interaction

  16. Rationalizing the chemical space of protein-protein interaction inhibitors.

    Science.gov (United States)

    Sperandio, Olivier; Reynès, Christelle H; Camproux, Anne-Claude; Villoutreix, Bruno O

    2010-03-01

    Protein-protein interactions (PPIs) are one of the next major classes of therapeutic targets, although they are too intricate to tackle with standard approaches. This is due, in part, to the inadequacy of today's chemical libraries. However, the emergence of a growing number of experimentally validated inhibitors of PPIs (i-PPIs) allows drug designers to use chemoinformatics and machine learning technologies to unravel the nature of the chemical space covered by the reported compounds. Key characteristics of i-PPIs can then be revealed and highlight the importance of specific shapes and/or aromatic bonds, enabling the design of i-PPI-enriched focused libraries and, therefore, of cost-effective screening strategies. 2009 Elsevier Ltd. All rights reserved.

  17. The reactive metabolite target protein database (TPDB)--a web-accessible resource.

    Science.gov (United States)

    Hanzlik, Robert P; Koen, Yakov M; Theertham, Bhargav; Dong, Yinghua; Fang, Jianwen

    2007-03-16

    The toxic effects of many simple organic compounds stem from their biotransformation to chemically reactive metabolites which bind covalently to cellular proteins. To understand the mechanisms of cytotoxic responses it may be important to know which proteins become adducted and whether some may be common targets of multiple toxins. The literature of this field is widely scattered but expanding rapidly, suggesting the need for a comprehensive, searchable database of reactive metabolite target proteins. The Reactive Metabolite Target Protein Database (TPDB) is a comprehensive, curated, searchable, documented compilation of publicly available information on the protein targets of reactive metabolites of 18 well-studied chemicals and drugs of known toxicity. TPDB software enables i) string searches for author names and protein names/synonyms, ii) more complex searches by selecting chemical compound, animal species, target tissue and protein names/synonyms from pull-down menus, and iii) commonality searches over multiple chemicals. Tabulated search results provide information, references and links to other databases. The TPDB is a unique on-line compilation of information on the covalent modification of cellular proteins by reactive metabolites of chemicals and drugs. Its comprehensiveness and searchability should facilitate the elucidation of mechanisms of reactive metabolite toxicity. The database is freely available at http://tpdb.medchem.ku.edu/tpdb.html.

  18. The reactive metabolite target protein database (TPDB) – a web-accessible resource

    Directory of Open Access Journals (Sweden)

    Dong Yinghua

    2007-03-01

    Full Text Available Abstract Background The toxic effects of many simple organic compounds stem from their biotransformation to chemically reactive metabolites which bind covalently to cellular proteins. To understand the mechanisms of cytotoxic responses it may be important to know which proteins become adducted and whether some may be common targets of multiple toxins. The literature of this field is widely scattered but expanding rapidly, suggesting the need for a comprehensive, searchable database of reactive metabolite target proteins. Description The Reactive Metabolite Target Protein Database (TPDB) is a comprehensive, curated, searchable, documented compilation of publicly available information on the protein targets of reactive metabolites of 18 well-studied chemicals and drugs of known toxicity. TPDB software enables i) string searches for author names and protein names/synonyms, ii) more complex searches by selecting chemical compound, animal species, target tissue and protein names/synonyms from pull-down menus, and iii) commonality searches over multiple chemicals. Tabulated search results provide information, references and links to other databases. Conclusion The TPDB is a unique on-line compilation of information on the covalent modification of cellular proteins by reactive metabolites of chemicals and drugs. Its comprehensiveness and searchability should facilitate the elucidation of mechanisms of reactive metabolite toxicity. The database is freely available at http://tpdb.medchem.ku.edu/tpdb.html

  19. Protein complex prediction based on k-connected subgraphs in protein interaction network

    OpenAIRE

    Habibi, Mahnaz; Eslahchi, Changiz; Wong, Limsoon

    2010-01-01

    Abstract Background Protein complexes play an important role in cellular mechanisms. Recently, several methods have been presented to predict protein complexes in a protein interaction network. In these methods, a protein complex is predicted as a dense subgraph of protein interactions. However, interactions data are incomplete and a protein complex does not have to be a complete or dense subgraph. Results We propose a more appropriate protein complex prediction method, CFA, that is based on ...

  20. Protein function prediction using neighbor relativity in protein-protein interaction network.

    Science.gov (United States)

    Moosavi, Sobhan; Rahgozar, Masoud; Rahimi, Amir

    2013-04-01

    There is a large gap between the number of discovered proteins and the number of functionally annotated ones. Due to the high cost of determining protein function by wet-lab research, function prediction has become a major task for computational biology and bioinformatics. Some studies use protein interaction information to predict function for un-annotated proteins. In this paper, we propose a novel approach called "Neighbor Relativity Coefficient" (NRC), based on interaction network topology, which estimates the functional similarity between two proteins. NRC is calculated for each pair of proteins based on their graph-based features including distance, common neighbors and the number of paths between them. In order to ascribe function to an un-annotated protein, NRC estimates a weight for each neighbor to transfer its annotation to the unknown protein. Finally, the unknown protein will be annotated by the top-scoring transferred functions. We also investigate the effect of using different coefficients for various types of functions. The proposed method has been evaluated on Saccharomyces cerevisiae and Homo sapiens interaction networks. The performance analysis demonstrates that NRC yields better results in comparison with previous protein function prediction approaches that utilize interaction networks. Copyright © 2012 Elsevier Ltd. All rights reserved.
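
    The general idea of weighting each neighbor and transferring its annotations can be sketched as follows. This is not the NRC formula, which the abstract does not give; the weight used here is a simple common-neighbor ratio chosen only for illustration, and the function name `predict_functions` and its arguments are hypothetical.

```python
from collections import defaultdict


def predict_functions(adj, annotations, protein, top_k=1):
    """Rank candidate functions for `protein` from its annotated neighbors.

    adj: dict mapping each protein to the set of its interaction partners.
    annotations: dict mapping annotated proteins to sets of function labels.
    Assumption: the neighbor weight is (1 + shared neighbors) / size of the
    combined neighborhood, a stand-in for a real similarity coefficient.
    """
    scores = defaultdict(float)
    nbrs = adj.get(protein, set())
    for neighbor in nbrs:
        neighbor_nbrs = adj.get(neighbor, set())
        common = len(nbrs & neighbor_nbrs)
        union = len(nbrs | neighbor_nbrs)
        weight = (1 + common) / union
        for function in annotations.get(neighbor, ()):
            scores[function] += weight
    # Highest score first; ties broken alphabetically for determinism.
    ranked = sorted(scores, key=lambda f: (-scores[f], f))
    return ranked[:top_k]
```

    In the toy case of a protein X linked to A, B and C, where A and B also interact with each other and share one label while C carries another, the shared label of A and B outranks that of C.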

  1. Receptor-interacting protein (RIP) kinase family

    Science.gov (United States)

    Zhang, Duanwu; Lin, Juan; Han, Jiahuai

    2010-01-01

    Receptor-interacting protein (RIP) kinases are a group of threonine/serine protein kinases with a relatively conserved kinase domain but distinct non-kinase regions. A number of different domain structures, such as death and caspase activation and recruitment domain (CARD) domains, were found in different RIP family members, and these domains should be keys in determining the specific function of each RIP kinase. It is known that RIP kinases participate in different biological processes, including those in innate immunity, but their downstream substrates are largely unknown. This review will give an overview of the structures and functions of RIP family members, and an update of recent progress in RIP kinase research. PMID:20383176

  2. HPIminer: A text mining system for building and visualizing human protein interaction networks and pathways.

    Science.gov (United States)

    Subramani, Suresh; Kalpana, Raja; Monickaraj, Pankaj Moses; Natarajan, Jeyakumar

    2015-04-01

    Knowledge of protein-protein interactions (PPIs) and of their related pathways is equally important for understanding the biological functions of the living cell. Such information on human proteins is highly desirable for understanding the mechanisms of several diseases such as cancer, diabetes, and Alzheimer's disease. Because much of that information is buried in biomedical literature, an automated text mining system for visualizing human PPIs and pathways is highly desirable. In this paper, we present HPIminer, a text mining system for visualizing human protein interactions and pathways from biomedical literature. HPIminer extracts human PPI information and PPI pairs from biomedical literature, and visualizes their associated interactions, networks and pathways using two curated databases, HPRD and KEGG. To our knowledge, HPIminer is the first system to build interaction networks from literature as well as curated databases. Further, the new interactions mined only from literature and not reported earlier in databases are highlighted as new. A comparative study with other similar tools shows that the resultant network is more informative and provides additional information on interacting proteins and their associated networks. Copyright © 2015 Elsevier Inc. All rights reserved.

  3. BIPS: BIANA Interolog Prediction Server. A tool for protein-protein interaction inference.

    Science.gov (United States)

    Garcia-Garcia, Javier; Schleker, Sylvia; Klein-Seetharaman, Judith; Oliva, Baldo

    2012-07-01

    Protein-protein interactions (PPIs) play a crucial role in biology, and high-throughput experiments have greatly increased the coverage of known interactions. Still, identification of complete inter- and intraspecies interactomes is far from being complete. Experimental data can be complemented by the prediction of PPIs within an organism or between two organisms based on the known interactions of the orthologous genes of other organisms (interologs). Here, we present the BIANA (Biologic Interactions and Network Analysis) Interolog Prediction Server (BIPS), which offers a web-based interface to facilitate PPI predictions based on interolog information. BIPS benefits from the capabilities of the framework BIANA to integrate the several PPI-related databases. Additional metadata can be used to improve the reliability of the predicted interactions. Sensitivity and specificity of the server have been calculated using known PPIs from different interactomes using a leave-one-out approach. The specificity is between 72 and 98%, whereas sensitivity varies between 1 and 59%, depending on the sequence identity cut-off used to calculate similarities between sequences. BIPS is freely accessible at http://sbi.imim.es/BIPS.php.
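    As a rough illustration of interolog-based prediction and of how sensitivity and specificity can be computed for it, the sketch below transfers known interactions from a source organism to a target organism through an ortholog map. It is not the BIPS implementation: the function names, the ortholog table and the reference sets are hypothetical, and the sequence identity cut-off and leave-one-out procedure mentioned in the abstract are omitted.

```python
def predict_interologs(known_ppis, orthologs):
    """Transfer each known interaction (a, b) to every ortholog pair (a', b')."""
    predicted = set()
    for a, b in known_ppis:
        for a2 in orthologs.get(a, ()):
            for b2 in orthologs.get(b, ()):
                # frozenset makes the pair unordered: (x, y) equals (y, x).
                predicted.add(frozenset((a2, b2)))
    return predicted


def sensitivity_specificity(predicted, positives, negatives):
    """Sensitivity = TP / (TP + FN); specificity = TN / (TN + FP)."""
    tp = len(predicted & positives)
    fn = len(positives - predicted)
    fp = len(predicted & negatives)
    tn = len(negatives - predicted)
    return tp / (tp + fn), tn / (tn + fp)


def pair(x, y):
    return frozenset((x, y))


# Invented toy data: interactions in a source species and an ortholog map.
known_ppis = [("yA", "yB"), ("yB", "yC")]
orthologs = {"yA": ["hA"], "yB": ["hB"], "yC": ["hC1", "hC2"]}

predicted = predict_interologs(known_ppis, orthologs)

# Hypothetical reference sets of true and false interactions in the target species.
positives = {pair("hA", "hB"), pair("hB", "hC1"), pair("hA", "hC1")}
negatives = {pair("hB", "hC2"), pair("hA", "hC2")}

sens, spec = sensitivity_specificity(predicted, positives, negatives)
```

    Tightening the ortholog map (for example with a stricter identity cut-off) would typically shrink the predicted set, which is one way to read the trade-off between the sensitivity and specificity ranges reported above.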

  4. Fragment molecular orbital method for studying lanthanide interactions with proteins

    Energy Technology Data Exchange (ETDEWEB)

    Tsushima, Satoru [Helmholtz-Zentrum Dresden-Rossendorf e.V., Dresden (Germany). Biophysics]; Komeiji, Y. [National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba (Japan)]; Mochizuki, Y. [Rikkyo Univ., Tokyo (Japan)]

    2017-06-01

    The binding affinity of the calcium-binding protein calmodulin towards Eu³⁺ was studied as a model for lanthanide protein interactions in the large family of "EF-hand" calcium-binding proteins.

  5. A domain-based approach to predict protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Resat Haluk

    2007-06-01

    Background: Knowing which proteins exist in a certain organism or cell type and how these proteins interact with each other is necessary for the understanding of biological processes at the whole cell level. The determination of protein-protein interaction (PPI) networks has been the subject of extensive research. Despite the development of reasonably successful methods, serious technical difficulties still exist. In this paper we present DomainGA, a quantitative computational approach that uses the information about the domain-domain interactions to predict the interactions between proteins. Results: DomainGA is a multi-parameter optimization method in which the available PPI information is used to derive a quantitative scoring scheme for the domain-domain pairs. The resulting domain interaction scores are then used to predict whether a pair of proteins interacts. Using the yeast PPI data and a series of tests, we show the robustness and insensitivity of the DomainGA method to the selection of the parameter sets, score ranges, and detection rules. Our DomainGA method achieves very high explanation ratios for the positive and negative PPIs in yeast. Based on our cross-verification tests on human PPIs, comparison of the optimized scores with the structurally observed domain interactions obtained from the iPFAM database, and sensitivity and specificity analysis, we conclude that our DomainGA method shows great promise to be applicable across multiple organisms. Conclusion: We envision DomainGA as a first step of a multiple-tier approach to constructing organism-specific PPIs. As it is based on fundamental structural information, the DomainGA approach can be used to create potential PPIs, and the accuracy of the constructed interaction template can be further improved using complementary methods. Explanation ratios obtained in the reported test case studies clearly show that the false prediction rates of the template networks constructed

  6. Protein-Protein Interaction Network and Gene Ontology

    Science.gov (United States)

    Choi, Yunkyu; Kim, Seok; Yi, Gwan-Su; Park, Jinah

    The evolution of computer technologies makes it possible to access large amounts and various kinds of biological data via the internet, such as DNA sequences, proteomics data and the information discovered about them. It is expected that combining various data could help researchers find further knowledge about them. The roles of a visualization system are to invoke human abilities to integrate information and to recognize certain patterns in the data. Thus, when the various kinds of data are examined and analyzed manually, an effective visualization system is an essential part. One instance of these integrated visualizations is the combination of protein-protein interaction (PPI) data and Gene Ontology (GO), which could help enhance the analysis of PPI networks. We introduce a simple but comprehensive visualization system that integrates GO and PPI data, in which GO and PPI graphs are visualized side by side, and that supports quick reference functions between them. Furthermore, the proposed system provides several interactive visualization methods for efficiently analyzing the PPI network and the GO directed acyclic graph, such as context-based browsing and common ancestor finding.

  7. A Conventional Liner Acoustic/Drag Interaction Benchmark Database

    Science.gov (United States)

    Howerton, Brian M.; Jones, Michael G.

    2017-01-01

    The aerodynamic drag of acoustic liners has become a significant topic in the design of such liners for aircraft noise applications. In order to evaluate the benefits of concepts designed to reduce liner drag, it is necessary to establish the baseline performance of liners employing the typical design features of conventional configurations. This paper details a set of experiments in the NASA Langley Grazing Flow Impedance Tube to quantify the relative drag of a number of perforate-over-honeycomb liner configurations at flow speeds of M=0.3 and 0.5. These conventional liners are investigated to determine their resistance factors using a static pressure drop approach. Comparison of the resistance factors gives a relative measurement of liner drag. For these same flow conditions, acoustic measurements are performed with tonal excitation from 400 to 3000 Hz at source sound pressure levels of 140 and 150 dB. Educed impedance and attenuation spectra are used to determine the interaction between acoustic performance and drag.

  8. Unified Alignment of Protein-Protein Interaction Networks.

    Science.gov (United States)

    Malod-Dognin, Noël; Ban, Kristina; Pržulj, Nataša

    2017-04-19

    Paralleling the increasing availability of protein-protein interaction (PPI) network data, several network alignment methods have been proposed. Network alignments have been used to uncover functionally conserved network parts and to transfer annotations. However, due to the computational intractability of the network alignment problem, aligners are heuristics providing divergent solutions and no consensus exists on a gold standard, or which scoring scheme should be used to evaluate them. We comprehensively evaluate the alignment scoring schemes and global network aligners on large scale PPI data and observe that three methods, HUBALIGN, L-GRAAL and NATALIE, regularly produce the most topologically and biologically coherent alignments. We study the collective behaviour of network aligners and observe that PPI networks are almost entirely aligned with a handful of aligners that we unify into a new tool, Ulign. Ulign enables complete alignment of two networks, which traditional global and local aligners fail to do. Also, multiple mappings of Ulign define biologically relevant soft clusterings of proteins in PPI networks, which may be used for refining the transfer of annotations across networks. Hence, PPI networks are already well investigated by current aligners, so to gain additional biological insights, a paradigm shift is needed. We propose such a shift come from aligning all available data types collectively rather than any particular data type in isolation from others.

  9. ProDis-ContSHC: learning protein dissimilarity measures and hierarchical context coherently for protein-protein comparison in protein database retrieval.

    Science.gov (United States)

    Wang, Jingyan; Gao, Xin; Wang, Quanquan; Li, Yongping

    2012-05-08

    The need to retrieve or classify protein molecules using structure or sequence-based similarity measures underlies a wide range of biomedical applications. Traditional protein search methods rely on a pairwise dissimilarity/similarity measure for comparing a pair of proteins. Such pairwise measures suffer from the limitation of neglecting the distribution of other proteins and thus cannot satisfy the need for high accuracy of the retrieval systems. Recent work in the machine learning community has shown that exploiting the global structure of the database and learning the contextual dissimilarity/similarity measures can improve the retrieval performance significantly. However, most existing contextual dissimilarity/similarity learning algorithms work in an unsupervised manner, which does not utilize the information of the known class labels of proteins in the database. In this paper, we propose a novel protein-protein dissimilarity learning algorithm, ProDis-ContSHC. ProDis-ContSHC regularizes an existing dissimilarity measure dij by considering the contextual information of the proteins. The context of a protein is defined by its neighboring proteins. The basic idea is that, for a pair of proteins (i, j), if their contexts N(i) and N(j) are similar to each other, the two proteins should also have a high similarity. We implement this idea by regularizing dij by a factor learned from the contexts N(i) and N(j). Moreover, we divide the context into hierarchical sub-contexts and get the contextual dissimilarity vector for each protein pair. Using the class label information of the proteins, we select the relevant (a pair of proteins that has the same class labels) and irrelevant (with different labels) protein pairs, and train an SVM model to distinguish between their contextual dissimilarity vectors. The SVM model is further used to learn a supervised regularizing factor. Finally, with the new Supervised learned Dissimilarity measure, we update the Protein Hierarchical

  10. ProDis-ContSHC: Learning protein dissimilarity measures and hierarchical context coherently for protein-protein comparison in protein database retrieval

    KAUST Repository

    Wang, Jim Jing-Yan

    2012-05-08

    Background: The need to retrieve or classify protein molecules using structure or sequence-based similarity measures underlies a wide range of biomedical applications. Traditional protein search methods rely on a pairwise dissimilarity/similarity measure for comparing a pair of proteins. Such pairwise measures suffer from the limitation of neglecting the distribution of other proteins and thus cannot satisfy the need for high accuracy of the retrieval systems. Recent work in the machine learning community has shown that exploiting the global structure of the database and learning the contextual dissimilarity/similarity measures can improve the retrieval performance significantly. However, most existing contextual dissimilarity/similarity learning algorithms work in an unsupervised manner, which does not utilize the information of the known class labels of proteins in the database. Results: In this paper, we propose a novel protein-protein dissimilarity learning algorithm, ProDis-ContSHC. ProDis-ContSHC regularizes an existing dissimilarity measure dij by considering the contextual information of the proteins. The context of a protein is defined by its neighboring proteins. The basic idea is that, for a pair of proteins (i, j), if their contexts N(i) and N(j) are similar to each other, the two proteins should also have a high similarity. We implement this idea by regularizing dij by a factor learned from the contexts N(i) and N(j). Moreover, we divide the context into hierarchical sub-contexts and get the contextual dissimilarity vector for each protein pair. Using the class label information of the proteins, we select the relevant (a pair of proteins that has the same class labels) and irrelevant (with different labels) protein pairs, and train an SVM model to distinguish between their contextual dissimilarity vectors. The SVM model is further used to learn a supervised regularizing factor. Finally, with the new Supervised learned Dissimilarity measure, we update

  11. Protein-protein interaction network-based detection of functionally similar proteins within species.

    Science.gov (United States)

    Song, Baoxing; Wang, Fen; Guo, Yang; Sang, Qing; Liu, Min; Li, Dengyun; Fang, Wei; Zhang, Deli

    2012-07-01

    Although functionally similar proteins across species have been widely studied, functionally similar proteins within species showing low sequence similarity have not been examined in detail. Identification of these proteins is of significant importance for understanding biological functions, evolution of protein families, progression of co-evolution, convergent evolution and other aspects that cannot be obtained by detection of functionally similar proteins across species. Here, we explored a method of detecting functionally similar proteins within species based on graph theory. After representing protein-protein interaction networks as graphs, we split the graphs into subgraphs using the 1-hop method. Proteins with functional similarities in a species were detected using a modified shortest path method to compare these subgraphs and to find the eligible optimal results. Using seven protein-protein interaction networks and this method, some functionally similar proteins with low sequence similarity that cannot be detected by sequence alignment were identified. By analyzing the results, we found that it is sometimes difficult to separate homologous from convergent evolution. Evaluation of the performance of our method by gene ontology term overlap showed that the precision of our method was excellent. Copyright © 2012 Wiley Periodicals, Inc.
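    The 1-hop splitting step described above can be sketched as extracting, for each protein, the induced subgraph on that protein and its direct neighbors. The abstract's modified shortest path comparison is not specified, so this sketch swaps in a plain Jaccard overlap of neighbor sets as a simple stand-in; the function names and the toy network are assumptions made for illustration.

```python
def one_hop_subgraph(graph, center):
    """Return the induced subgraph on `center` and its direct neighbors."""
    nodes = {center} | graph[center]
    return {n: graph[n] & nodes for n in nodes}


def jaccard(a, b):
    """Jaccard overlap of two sets; defined as 0.0 when both are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def neighborhood_similarity(graph, u, v):
    """Stand-in comparison: overlap of the neighbor sets, ignoring u and v themselves."""
    return jaccard(graph[u] - {v}, graph[v] - {u})


# Invented toy network as adjacency sets.
graph = {
    "A": {"B", "C"},
    "B": {"A", "C"},
    "C": {"A", "B", "D"},
    "D": {"C", "E"},
    "E": {"D"},
}

sub_a = one_hop_subgraph(graph, "A")
sim_ab = neighborhood_similarity(graph, "A", "B")
sim_ae = neighborhood_similarity(graph, "A", "E")
```

    Here A and B share their only other partner (C), so they score as maximally similar, while A and E share none. A faithful reimplementation would replace the Jaccard step with the subgraph comparison used by the authors.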

  12. Radioresistance related genes screened by protein-protein interaction network analysis in nasopharyngeal carcinoma

    International Nuclear Information System (INIS)

    Zhu Xiaodong; Guo Ya; Qu Song; Li Ling; Huang Shiting; Li Danrong; Zhang Wei

    2012-01-01

    Objective: To discover radioresistance-associated molecular biomarkers and their mechanism in nasopharyngeal carcinoma by protein-protein interaction network analysis. Methods: Whole genome expression microarray was applied to screen out differentially expressed genes in two cell lines, CNE-2R and CNE-2, with different radiosensitivity. Four differentially expressed genes were randomly selected for further verification by semi-quantitative RT-PCR analysis with self-designed primers. The common differentially expressed genes from two experiments were analyzed with the SNOW online database in order to find the central nodes related to the biomarkers of nasopharyngeal carcinoma radioresistance. The expression of STAT1 in CNE-2R and CNE-2 cells was measured by Western blot. Results: Compared with CNE-2 cells, 374 genes in CNE-2R cells were differentially expressed while 197 genes showed significant differences. Four randomly selected differentially expressed genes were verified by RT-PCR and showed the same direction of change as in the chip assay. Analysis with the SNOW database demonstrated that those 197 genes could form a complicated interaction network in which STAT1 and JUN might be two key nodes. Indeed, the STAT1-α expression in CNE-2R was higher than that in CNE-2 (t=4.96, P<0.05). Conclusions: The key nodes STAT1 and JUN may be the molecular biomarkers leading to radioresistance in nasopharyngeal carcinoma, and STAT1-α might have a close relationship with radioresistance. (authors)

  13. DGIdb 3.0: a redesign and expansion of the drug-gene interaction database.

    Science.gov (United States)

    Cotto, Kelsy C; Wagner, Alex H; Feng, Yang-Yang; Kiwala, Susanna; Coffman, Adam C; Spies, Gregory; Wollam, Alex; Spies, Nicholas C; Griffith, Obi L; Griffith, Malachi

    2018-01-04

    The drug-gene interaction database (DGIdb, www.dgidb.org) consolidates, organizes and presents drug-gene interactions and gene druggability information from papers, databases and web resources. DGIdb normalizes content from 30 disparate sources and allows for user-friendly advanced browsing, searching and filtering for ease of access through an intuitive web user interface, application programming interface (API) and public cloud-based server image. DGIdb v3.0 represents a major update of the database. Nine of the previously included 24 sources were updated. Six new resources were added, bringing the total number of sources to 30. These updates and additions of sources have cumulatively resulted in 56 309 interaction claims. This has also substantially expanded the comprehensive catalogue of druggable genes and anti-neoplastic drug-gene interactions included in the DGIdb. Along with these content updates, v3.0 has received a major overhaul of its codebase, including an updated user interface, preset interaction search filters, consolidation of interaction information into interaction groups, greatly improved search response times and upgrading the underlying web application framework. In addition, the expanded API features new endpoints which allow users to extract more detailed information about queried drugs, genes and drug-gene interactions, including listings of PubMed IDs, interaction type and other interaction metadata.

  14. Protein-lipid interactions at interfaces

    Directory of Open Access Journals (Sweden)

    Wilde, P.

    2000-04-01

    Foams and emulsions are both types of multiphase foods and are a dispersion of one immiscible phase (e.g. air or oil) in another (e.g. water). Amphiphilic molecules (either proteins or chemical compounds) are able to stabilise the interface between these phases and are termed emulsifiers. The ability of protein emulsifiers to bind lipid is reviewed, and the mechanisms underlying the behaviour of these and low molecular weight surfactants (LMWS) at the interface are summarised. New research, exploiting atomic force microscopy, has given fresh insights into the mechanisms by which proteins and LMWS interact when both are present at the interface, compromising the stability of foams and emulsions stabilised by these mixtures. The understanding of component interactions at the interfacial level is essential if advances are to be made in the control and manipulation of multiphase foods during production and storage.

  15. Small sets of interacting proteins suggest functional linkage mechanisms via Bayesian analogical reasoning.

    Science.gov (United States)

    Airoldi, Edoardo M; Heller, Katherine A; Silva, Ricardo

    2011-07-01

    Proteins and protein complexes coordinate their activity to execute cellular functions. In a number of experimental settings, including synthetic genetic arrays, genetic perturbations and RNAi screens, scientists identify a small set of protein interactions of interest. A working hypothesis is often that these interactions are the observable phenotypes of some functional process, which is not directly observable. Confirmatory analysis requires finding other pairs of proteins whose interaction may be additional phenotypical evidence about the same functional process. Extant methods for finding additional protein interactions rely heavily on the information in the newly identified set of interactions. For instance, these methods leverage the attributes of the individual proteins directly, in a supervised setting, in order to find relevant protein pairs. A small set of protein interactions provides a small sample to train parameters of prediction methods, thus leading to low confidence. We develop RBSets, a computational approach to ranking protein interactions rooted in analogical reasoning; that is, the ability to learn and generalize relations between objects. Our approach is tailored to situations where the training set of protein interactions is small, and leverages the attributes of the individual proteins indirectly, in a Bayesian ranking setting that is perhaps closest to propensity scoring in mathematical psychology. We find that RBSets leads to good performance in identifying additional interactions starting from a small evidence set of interacting proteins, for which an underlying biological logic in terms of functional processes and signaling pathways can be established with some confidence. Our approach is scalable and can be applied to large databases with minimal computational overhead. Our results suggest that analogical reasoning within a Bayesian ranking problem is a promising new approach for real-time biological discovery. 
Java code is available at

  16. RAID: a comprehensive resource for human RNA-associated (RNA–RNA/RNA–protein) interaction

    Science.gov (United States)

    Zhang, Xiaomeng; Wu, Deng; Chen, Liqun; Li, Xiang; Yang, Jinxurong; Fan, Dandan; Dong, Tingting; Liu, Mingyue; Tan, Puwen; Xu, Jintian; Yi, Ying; Wang, Yuting; Zou, Hua; Hu, Yongfei; Fan, Kaili; Kang, Juanjuan; Huang, Yan; Miao, Zhengqiang; Bi, Miaoman; Jin, Nana; Li, Kongning; Li, Xia; Xu, Jianzhen; Wang, Dong

    2014-01-01

    Transcriptomic analyses have revealed an unexpected complexity in the eukaryote transcriptome, which includes not only protein-coding transcripts but also an expanding catalog of noncoding RNAs (ncRNAs). Diverse coding and noncoding RNAs (ncRNAs) perform functions through interaction with each other in various cellular processes. In this project, we have developed RAID (http://www.rna-society.org/raid), an RNA-associated (RNA–RNA/RNA–protein) interaction database. RAID intends to provide the scientific community with all-in-one resources for efficient browsing and extraction of the RNA-associated interactions in human. This version of RAID contains more than 6100 RNA-associated interactions obtained by manually reviewing more than 2100 published papers, including 4493 RNA–RNA interactions and 1619 RNA–protein interactions. Each entry contains detailed information on an RNA-associated interaction, including RAID ID, RNA/protein symbol, RNA/protein categories, validated method, expressing tissue, literature references (Pubmed IDs), and detailed functional description. Users can query, browse, analyze, and manipulate RNA-associated (RNA–RNA/RNA–protein) interaction. RAID provides a comprehensive resource of human RNA-associated (RNA–RNA/RNA–protein) interaction network. Furthermore, this resource will help in uncovering the generic organizing principles of cellular function network. PMID:24803509

  17. An update of the DEF database of protein fold class predictions

    DEFF Research Database (Denmark)

    Reczko, Martin; Karras, Dimitris; Bohr, Henrik

    1997-01-01

    An update is given on the Database of Expected Fold classes (DEF) that contains a collection of fold-class predictions made from protein sequences and a mail server that provides new predictions for new sequences. To any given sequence one of 49 fold-classes is chosen to classify the structure related to the sequence with high accuracy. The updated prediction system is developed using data from the new version of the 3D-ALI database of aligned protein structures and thus gives more reliable and more detailed predictions than the previous DEF system.

  18. Interaction between actinides and protein: the calmodulin

    International Nuclear Information System (INIS)

    Brulfert, Florian

    2016-01-01

    Considering the environmental impact of the Fukushima nuclear accident, it is fundamental to study the mechanisms governing the effects of the released radionuclides on the biosphere and thus identify the molecular processes generating the transport and deposition of actinides, such as neptunium and uranium. However, information about the microscopic aspect of the interaction between actinides and biological molecules (peptides, proteins...) is scarce. The data being mostly reported from a physiological point of view, the structure of the coordination sites remains largely unknown. These microscopic data are essential for understanding the interdependency between structural aspect, function and affinity. Calmodulin (CaM, an abbreviation for Calcium-Modulated protein), also known for its affinity towards actinides, acts as a metabolic regulator of calcium. This protein, a Ca carrier that is present ubiquitously in the human body, may also bind other metals such as actinides. Thus, in case of contamination, actinides that bind to CaM could prevent the protein from functioning properly and lead to repercussions on a large range of vital functions. The complexation of Np and U was studied by EXAFS spectroscopy, which showed that actinides were incorporated in a calcium coordination site. Once the thermodynamic and structural aspects had been studied, the impact of the coordination site distortion on the biological efficiency was analyzed. In order to evaluate these consequences, a calorimetric method based on enzyme kinetics was developed. This experiment, which was conducted with both uranium (50 - 500 nM) and neptunium (30 - 250 nM), showed a decrease of the heat produced by the enzymatic reaction with an increasing concentration of actinides in the medium. Our findings showed that the Calmodulin actinide complex works as an enzymatic inhibitor. Furthermore, at higher neptunium (250 nM) and uranium (500 nM) concentration the metals seem to have a poison

  19. Medicago PhosphoProtein Database: a repository for Medicago truncatula phosphoprotein data

    Directory of Open Access Journals (Sweden)

    Christopher M. Rose

    2012-06-01

    The ability of legume crops to fix atmospheric nitrogen via a symbiotic association with soil rhizobia makes them an essential component of many agricultural systems. Initiation of this symbiosis requires protein phosphorylation-mediated signaling in response to rhizobial signals named Nod factors. Medicago truncatula (Medicago) is the model system for studying legume biology, making the study of its phosphoproteome essential. Here, we describe the Medicago Phosphoprotein Database (http://phospho.medicago.wisc.edu), a repository built to house phosphoprotein, phosphopeptide, and phosphosite data specific to Medicago. Currently, the Medicago Phosphoprotein Database holds 3,457 unique phosphopeptides that contain 3,404 non-redundant sites of phosphorylation on 829 proteins. Through the web-based interface, users are allowed to browse identified proteins or search for proteins of interest. Furthermore, we allow users to conduct BLAST searches of the database using both peptide sequences and phosphorylation motifs as queries. The data contained within the database are available for download to be investigated at the user’s discretion. The Medicago Phosphoprotein Database will be updated continually with novel phosphoprotein and phosphopeptide identifications, with the intent of constructing an unparalleled compendium of large-scale Medicago phosphorylation data.

  20. A scored human protein-protein interaction network to catalyze genomic interpretation

    DEFF Research Database (Denmark)

    Li, Taibo; Wernersson, Rasmus; Hansen, Rasmus B

    2017-01-01

    Genome-scale human protein-protein interaction networks are critical to understanding cell biology and interpreting genomic data, but challenging to produce experimentally. Through data integration and quality control, we provide a scored human protein-protein interaction network (InWeb_InBioMap,...

  1. HIP2: An online database of human plasma proteins from healthy individuals

    Directory of Open Access Journals (Sweden)

    Shen Changyu

    2008-04-01

    Background: With the introduction of increasingly powerful mass spectrometry (MS) techniques for clinical research, several recent large-scale MS proteomics studies have sought to characterize the entire human plasma proteome with the general objective of identifying thousands of proteins leaked from tissues into the circulating blood. Understanding the basic constituents, diversity, and variability of the human plasma proteome is essential to the development of sensitive molecular diagnosis and treatment monitoring solutions for future biomedical applications. Biomedical researchers today, however, do not have an integrated online resource in which they can search for plasma proteins collected from different mass spectrometry platforms, experimental protocols, and search software for healthy individuals. The lack of such a resource for comparisons has made it difficult to interpret proteomics profile changes in patients' plasma and to design protein biomarker discovery experiments. Description: To aid future protein biomarker studies of disease and health from human plasma, we developed an online database, HIP2 (Healthy Human Individual's Integrated Plasma Proteome). The current version contains 12,787 protein entries linked to 86,831 peptide entries identified using different MS platforms. Conclusion: This web-based database will be useful to biomedical researchers involved in biomarker discovery research. This database has been developed to be a comprehensive collection of healthy human plasma proteins, and has protein data captured in a relational database schema built to contain mappings of supporting peptide evidence from several high-quality and high-throughput mass spectrometry (MS) experimental data sets. Users can search for plasma protein/peptide annotations, peptide/protein alignments, and experimental/sample conditions with options for filter-based retrieval to achieve greater analytical power for discovery and validation.

  2. Categorizing Biases in High-Confidence High-Throughput Protein-Protein Interaction Data Sets

    Science.gov (United States)

    Yu, Xueping; Ivanic, Joseph; Memišević, Vesna; Wallqvist, Anders; Reifman, Jaques

    2011-01-01

    We characterized and evaluated the functional attributes of three yeast high-confidence protein-protein interaction data sets derived from affinity purification/mass spectrometry, protein-fragment complementation assay, and yeast two-hybrid experiments. The interacting proteins retrieved from these data sets formed distinct, partially overlapping sets with different protein-protein interaction characteristics. These differences were primarily a function of the deployed experimental technologies used to recover these interactions. This affected the total coverage of interactions and was especially evident in the recovery of interactions among different functional classes of proteins. We found that the interaction data obtained by the yeast two-hybrid method was the least biased toward any particular functional characterization. In contrast, interacting proteins in the affinity purification/mass spectrometry and protein-fragment complementation assay data sets were over- and under-represented among distinct and different functional categories. We delineated how these differences affected protein complex organization in the network of interactions, in particular for strongly interacting complexes (e.g. RNA and protein synthesis) versus weak and transient interacting complexes (e.g. protein transport). We quantified methodological differences in detecting protein interactions from larger protein complexes, in the correlation of protein abundance among interacting proteins, and in their connectivity of essential proteins. In the latter case, we showed that minimizing inherent methodology biases removed many of the ambiguous conclusions about protein essentiality and protein connectivity. We used these findings to rationalize how biological insights obtained by analyzing data sets originating from different sources sometimes do not agree or may even contradict each other. An important corollary of this work was that discrepancies in biological insights did not

  3. Protein complex prediction in large ontology attributed protein-protein interaction networks.

    Science.gov (United States)

    Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Li, Yanpeng; Xu, Bo

    2013-01-01

    Protein complexes are important for unraveling the secrets of cellular organization and function. Many computational approaches have been developed to predict protein complexes in protein-protein interaction (PPI) networks. However, most existing approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology (GO) annotation information. In this paper, we constructed ontology attributed PPI networks with PPI data and GO resource. After constructing ontology attributed networks, we proposed a novel approach called CSO (clustering based on network structure and ontology attribute similarity). Structural information and GO attribute information are complementary in ontology attributed networks. CSO can effectively take advantage of the correlation between frequent GO annotation sets and the dense subgraph for protein complex prediction. Our proposed CSO approach was applied to four different yeast PPI data sets and predicted many well-known protein complexes. The experimental results showed that CSO was valuable in predicting protein complexes and achieved state-of-the-art performance.

  4. Interaction of Proteins Identified in Human Thyroid Cells

    Science.gov (United States)

    Pietsch, Jessica; Riwaldt, Stefan; Bauer, Johann; Sickmann, Albert; Weber, Gerhard; Grosse, Jirka; Infanger, Manfred; Eilles, Christoph; Grimm, Daniela

    2013-01-01

    Influence of gravity forces on the regulation of protein expression by healthy and malignant thyroid cells was studied with the aim to identify protein interactions. Western blot analyses of a limited number of proteins suggested a time-dependent regulation of protein expression by simulated microgravity. After applying free flow isoelectric focusing and mass spectrometry to search for differently expressed proteins by thyroid cells exposed to simulated microgravity for three days, a considerable number of candidates for gravi-sensitive proteins were detected. In order to show how proteins sensitive to microgravity could directly influence other proteins, we investigated all polypeptide chains identified with Mascot scores above 100, looking for groups of interacting proteins. Hence, UniProtKB entry numbers of all detected proteins were entered into the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and processed. The program indicated that we had detected various groups of interacting proteins in each of the three cell lines studied. The major groups of interacting proteins play a role in pathways of carbohydrate and protein metabolism, regulation of cell growth and cell membrane structuring. Analyzing these groups, networks of interaction could be established which show how a punctual influence of simulated microgravity may propagate via various members of interaction chains. PMID:23303277

  5. Interaction of Proteins Identified in Human Thyroid Cells

    Directory of Open Access Journals (Sweden)

    Jessica Pietsch

    2013-01-01

    Influence of gravity forces on the regulation of protein expression by healthy and malignant thyroid cells was studied with the aim to identify protein interactions. Western blot analyses of a limited number of proteins suggested a time-dependent regulation of protein expression by simulated microgravity. After applying free flow isoelectric focusing and mass spectrometry to search for differently expressed proteins by thyroid cells exposed to simulated microgravity for three days, a considerable number of candidates for gravi-sensitive proteins were detected. In order to show how proteins sensitive to microgravity could directly influence other proteins, we investigated all polypeptide chains identified with Mascot scores above 100, looking for groups of interacting proteins. Hence, UniProtKB entry numbers of all detected proteins were entered into the Search Tool for the Retrieval of Interacting Genes/Proteins (STRING) and processed. The program indicated that we had detected various groups of interacting proteins in each of the three cell lines studied. The major groups of interacting proteins play a role in pathways of carbohydrate and protein metabolism, regulation of cell growth and cell membrane structuring. Analyzing these groups, networks of interaction could be established which show how a punctual influence of simulated microgravity may propagate via various members of interaction chains.

  6. CPAD, Curated Protein Aggregation Database: A Repository of Manually Curated Experimental Data on Protein and Peptide Aggregation.

    Science.gov (United States)

    Thangakani, A Mary; Nagarajan, R; Kumar, Sandeep; Sakthivel, R; Velmurugan, D; Gromiha, M Michael

    2016-01-01

    Accurate distinction between peptide sequences that can form amyloid-fibrils or amorphous β-aggregates, identification of potential aggregation prone regions in proteins, and prediction of change in aggregation rate of a protein upon mutation(s) are critical to research on protein misfolding diseases, such as Alzheimer's and Parkinson's, as well as biotechnological production of protein based therapeutics. We have developed a Curated Protein Aggregation Database (CPAD), which has collected results from experimental studies performed by the scientific community aimed at understanding protein/peptide aggregation. CPAD contains more than 2300 experimentally observed aggregation rates upon mutations in known amyloidogenic proteins. Each entry includes numerical values for the following parameters: change in rate of aggregation as measured by fluorescence intensity or turbidity, name and source of the protein, UniProt and Protein Data Bank codes, single point as well as multiple mutations, and literature citation. The data in CPAD have been supplemented with five different types of additional information: (i) amyloid fibril forming hexa-peptides, (ii) amorphous β-aggregating hexa-peptides, (iii) amyloid fibril forming peptides of different lengths, (iv) amyloid fibril forming hexa-peptides whose crystal structures are available in the Protein Data Bank (PDB) and (v) experimentally validated aggregation prone regions found in amyloidogenic proteins. Furthermore, CPAD is linked to other related databases and resources, such as UniProt, Protein Data Bank, PubMed, GAP, TANGO, WALTZ etc. We have set up a web interface with different search and display options so that users can retrieve the data in multiple ways. CPAD is freely available at http://www.iitm.ac.in/bioinfo/CPAD/. The potential applications of CPAD have also been discussed.

  7. Predicting Protein-Protein Interaction Sites with a Novel Membership Based Fuzzy SVM Classifier.

    Science.gov (United States)

    Sriwastava, Brijesh K; Basu, Subhadip; Maulik, Ujjwal

    2015-01-01

    Predicting residues that participate in protein-protein interactions (PPI) helps to identify which amino acids are located at the interface. In this paper, we show that the performance of the classical support vector machine (SVM) algorithm can be further improved with the use of a custom-designed fuzzy membership function for the partner-specific PPI interface prediction problem. We evaluated the performance of both classical SVM and fuzzy SVM (F-SVM) on the PPI databases of three different model proteomes, Homo sapiens, Escherichia coli and Saccharomyces cerevisiae, and calculated the statistical significance of the developed F-SVM over the classical SVM algorithm. We also compared our performance with the available state-of-the-art fuzzy methods in this domain and observed significant performance improvements. To predict interaction sites in protein complexes, the local composition of amino acids together with their physico-chemical characteristics is used, where the F-SVM based prediction method exploits the membership function for each pair of sequence fragments. The average F-SVM performance (area under ROC curve) on the test samples in a 10-fold cross-validation experiment is 77.07, 78.39, and 74.91 percent for the aforementioned organisms, respectively. Performance on independent test sets is 72.09, 73.24 and 82.74 percent, respectively. The software is available for free download from http://code.google.com/p/cmater-bioinfo.

  8. Identifying potential survival strategies of HIV-1 through virus-host protein interaction networks

    Directory of Open Access Journals (Sweden)

    Boucher Charles AB

    2010-07-01

    Background: The National Institute of Allergy and Infectious Diseases has launched the HIV-1 Human Protein Interaction Database in an effort to catalogue all published interactions between HIV-1 and human proteins. In order to systematically investigate these interactions functionally and dynamically, we have constructed an HIV-1 human protein interaction network. This network was analyzed for important proteins and processes that are specific for the HIV life-cycle. In order to expose viral strategies, network motif analysis was carried out showing reoccurring patterns in virus-host dynamics. Results: Our analyses show that human proteins interacting with HIV form a densely connected and central sub-network within the total human protein interaction network. The evaluation of this sub-network for connectivity and centrality resulted in a set of proteins essential for the HIV life-cycle. Remarkably, we were able to associate proteins involved in RNA polymerase II transcription with hubs and proteasome formation with bottlenecks. Inferred network motifs show significant over-representation of positive and negative feedback patterns between virus and host. Strikingly, such patterns have never been reported in combined virus-host systems. Conclusions: HIV infection results in a reprioritization of cellular processes reflected by an increase in the relative importance of transcriptional machinery and proteasome formation. We conclude that during the evolution of HIV, some patterns of interaction have been selected for resulting in a system where virus proteins preferably interact with central human proteins for direct control and with proteasomal proteins for indirect control over the cellular processes. Finally, the patterns described by network motifs illustrate how virus and host interact with one another.
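
    The hub and bottleneck terminology in this record can be illustrated with a small sketch. It assumes the common convention that hubs are high-degree nodes and bottlenecks are nodes with high betweenness centrality; the abstract does not state which measures the authors used. The toy edge list and the function names are hypothetical, and betweenness is computed with Brandes' algorithm for unweighted, undirected graphs.

```python
from collections import defaultdict, deque

def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbours)}."""
    adj = defaultdict(set)
    for u, v in edges:
        adj[u].add(v)
        adj[v].add(u)
    return dict(adj)

def betweenness(adj):
    """Unnormalized betweenness centrality (Brandes' algorithm)."""
    score = {v: 0.0 for v in adj}
    for s in adj:
        order = []                      # nodes in order of non-decreasing distance from s
        preds = {v: [] for v in adj}    # predecessors on shortest paths from s
        sigma = {v: 0 for v in adj}     # number of shortest paths from s
        dist = {v: -1 for v in adj}
        sigma[s], dist[s] = 1, 0
        queue = deque([s])
        while queue:                    # breadth-first search from s
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if dist[w] < 0:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):       # accumulate dependencies back towards s
            for v in preds[w]:
                delta[v] += sigma[v] / sigma[w] * (1.0 + delta[w])
            if w != s:
                score[w] += delta[w]
    # Every unordered pair was counted from both endpoints.
    return {v: b / 2.0 for v, b in score.items()}

# Hypothetical toy network: two star-like modules joined by a bridge protein "B".
toy_edges = [("H1", "a"), ("H1", "b"), ("H1", "c"),
             ("H2", "d"), ("H2", "e"), ("H2", "f"),
             ("H1", "B"), ("B", "H2")]
adj = build_adjacency(toy_edges)
degree = {v: len(nbrs) for v, nbrs in adj.items()}
bottleneck_score = betweenness(adj)
```

    In this toy network the bridge node has only two partners, yet it lies on every shortest path between the two modules, which is the sense in which a low-degree protein can still act as a bottleneck.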

  9. BtoxDB: a comprehensive database of protein structural data on toxin-antitoxin systems.

    Science.gov (United States)

    Barbosa, Luiz Carlos Bertucci; Garrido, Saulo Santesso; Marchetto, Reinaldo

    2015-03-01

    Toxin-antitoxin (TA) systems are diverse and abundant genetic modules in prokaryotic cells that are typically formed by two genes encoding a stable toxin and a labile antitoxin. Because TA systems are able to repress growth or kill cells and are considered to be important actors in cell persistence (multidrug resistance without genetic change), these modules are considered potential targets for alternative drug design. In this scenario, structural information for the proteins in these systems is highly valuable. In this report, we describe the development of a web-based system, named BtoxDB, that stores all protein structural data on TA systems. The BtoxDB database was implemented as a MySQL relational database using PHP scripting language. Web interfaces were developed using HTML, CSS and JavaScript. The data were collected from the PDB, UniProt and Entrez databases. These data were appropriately filtered using specialized literature and our previous knowledge about toxin-antitoxin systems. The database provides three modules ("Search", "Browse" and "Statistics") that enable searches, acquisition of contents and access to statistical data. Direct links to matching external databases are also available. The compilation of all protein structural data on TA systems in one platform is highly useful for researchers interested in this content. BtoxDB is publicly available at http://www.gurupi.uft.edu.br/btoxdb. Copyright © 2015 Elsevier Ltd. All rights reserved.

  10. Evidence of probabilistic behaviour in protein interaction networks

    Directory of Open Access Journals (Sweden)

    Reifman Jaques

    2008-01-01

    Background: Data from high-throughput experiments of protein-protein interactions are commonly used to probe the nature of biological organization and extract functional relationships between sets of proteins. What has not been appreciated is that the underlying mechanisms involved in assembling these networks may exhibit considerable probabilistic behaviour. Results: We find that the probability of an interaction between two proteins is generally proportional to the numerical product of their individual interacting partners, or degrees. The degree-weighted behaviour is manifested throughout the protein-protein interaction networks studied here, except for the high-degree, or hub, interaction areas. However, we find that the probabilities of interaction between the hubs are still high. Further evidence is provided by path length analyses, which show that these hubs are separated by very few links. Conclusion: The results suggest that protein-protein interaction networks incorporate probabilistic elements that lead to scale-rich hierarchical architectures. These observations seem to be at odds with a biologically-guided organization. One interpretation of the findings is that we are witnessing the ability of proteins to indiscriminately bind rather than the protein-protein interactions that are actually utilized by the cell in biological processes. Therefore, the topological study of a degree-weighted network requires a more refined methodology to extract biological information about pathways, modules, or other inferred relationships among proteins.
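
    The degree-product relationship described in this record can be sketched as follows. The abstract only states proportionality; the normalization by twice the number of edges (as in a Chung-Lu style random graph) is an assumption made here for illustration, as are the toy edge list and the function name.

```python
from collections import Counter
from itertools import combinations

def degree_weighted_probabilities(edges):
    """Return {(u, v): min(1, k_u * k_v / (2 * m))} for every unordered protein pair.

    k_u is the degree of protein u and m the number of edges. The 2*m
    normalization is an illustrative assumption, not taken from the abstract.
    """
    degree = Counter()
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    m = len(edges)
    probs = {}
    for u, v in combinations(sorted(degree), 2):
        probs[(u, v)] = min(1.0, degree[u] * degree[v] / (2 * m))
    return probs

# Hypothetical toy interaction list.
toy_edges = [("A", "B"), ("A", "C"), ("A", "D"), ("B", "C")]
probs = degree_weighted_probabilities(toy_edges)
```

    Under this model, pairs of well-connected proteins receive a high expected interaction probability regardless of their biology, which is why the abstract cautions against reading pathway or module structure directly from a degree-weighted topology.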

  11. Annotating the protein-RNA interaction sites in proteins using evolutionary information and protein backbone structure.

    Science.gov (United States)

    Li, Tao; Li, Qian-Zhong

    2012-11-07

    RNA-protein interactions play important roles in various biological processes. The precise detection of RNA-protein interaction sites is very important for understanding essential biological processes and annotating the function of the proteins. In this study, based on various features from amino acid sequence and structure, including evolutionary information, solvent accessible surface area and torsion angles (φ, ψ) in the backbone structure of the polypeptide chain, a computational method for predicting RNA-binding sites in proteins is proposed. When the method is applied to predict RNA-binding sites in three datasets (RBP86 containing 86 protein chains, RBP107 containing 107 protein chains and RBP109 containing 109 protein chains), better sensitivities and specificities are obtained compared to previously published methods in five-fold cross-validation tests. To further examine the efficiency of our method, the RBP107 dataset is used as the training set, and the RBP86 and RBP109 datasets are used as independent test sets. In addition, as examples of our prediction, RNA-binding sites in a few proteins are presented. The annotated results are consistent with the PDB annotation. These results show that our method is useful for annotating RNA-binding sites of novel proteins.

  12. Computational Approaches for Prediction of Pathogen-Host Protein-Protein Interactions

    Directory of Open Access Journals (Sweden)

    Esmaeil Nourani

    2015-02-01

    Infectious diseases are still among the major and prevalent health problems, mostly because of the drug resistance of novel variants of pathogens. Molecular interactions between pathogens and their hosts are the key part of the infection mechanisms. Developing novel antimicrobial therapeutics to fight drug resistance is only possible with a thorough understanding of pathogen-host interaction (PHI) systems. Existing databases, which contain experimentally verified PHI data, suffer from a scarcity of reported interactions due to the technically challenging and time-consuming process of experiments. This has motivated many researchers to address the problem by proposing computational approaches for analysis and prediction of PHIs. The computational methods primarily utilize sequence information, protein structure and known interactions. Classic machine learning techniques are used when there are sufficient known interactions to be used as training data. Otherwise, transfer and multi-task learning methods are preferred. Here, we present an overview of these computational approaches for PHI prediction, discussing their weaknesses and strengths, with future directions.

  13. PDTD: a web-accessible protein database for drug target identification

    Directory of Open Access Journals (Sweden)

    Gao Zhenting

    2008-02-01

    Background: Target identification is important for modern drug discovery. With the advances in the development of molecular docking, potential binding proteins may be discovered by docking a small molecule to a repository of proteins with three-dimensional (3D) structures. To complete this task, a reverse docking program and a drug target database with 3D structures are necessary. To this end, we have developed a web server tool, TarFisDock (Target Fishing Docking; http://www.dddc.ac.cn/tarfisdock), which has been used widely by others. Recently, we have constructed a protein target database, Potential Drug Target Database (PDTD), and have integrated PDTD with TarFisDock. This combination aims to assist target identification and validation. Description: PDTD is a web-accessible protein database for in silico target identification. It currently contains >1100 protein entries with 3D structures presented in the Protein Data Bank. The data are extracted from the literature and several online databases such as TTD, DrugBank and Thomson Pharma. The database covers diverse information on >830 known or potential drug targets, including protein and active site structures in both PDB and mol2 formats, related diseases, biological functions as well as associated regulating (signaling) pathways. Each target is categorized by both nosology and biochemical function. PDTD supports keyword searches, such as by PDB ID, target name, and disease name. Data sets generated by PDTD can be viewed with the plug-in of molecular visualization tools and can also be downloaded freely. Remarkably, PDTD is specially designed for target identification. In conjunction with TarFisDock, PDTD can be used to identify binding proteins for small molecules. The results can be downloaded in the form of a mol2 file with the binding pose of the probe compound and a list of potential binding targets according to their ranking scores. Conclusion: PDTD serves as a comprehensive and

  14. Interleukin-1beta induced changes in the protein expression of rat islets: a computerized database

    DEFF Research Database (Denmark)

    Andersen, H U; Fey, S J; Larsen, Peter Mose

    1997-01-01

    as well as the intracellular mechanisms of action of interleukin 1-mediated beta-cell cytotoxicity are unknown. However, previous studies have found an association of beta-cell destruction with alterations in protein synthesis. Thus, two-dimensional (2-D) gel electrophoresis of pancreatic islet proteins...... may be an important tool facilitating studies of the molecular pathogenesis of insulin-dependent diabetes mellitus. 2-D gel electrophoresis of islet proteins may lead to (i) the determination of qualitative and quantitative changes in specific islet proteins induced by cytokines, (ii......) the determination of the effects of agents modulating cytokine action, and (iii) the identification of primary islet protein antigen(s) initiating the immune destruction of the beta-cells. Therefore, the aim of this study was to create databases (DB) of all reproducibly detectable protein spots on 10% and 15...

  15. Analysis of protein-protein interaction networks by means of annotated graph mining algorithms

    NARCIS (Netherlands)

    Rahmani, Hossein

    2012-01-01

    This thesis discusses solutions to several open problems in Protein-Protein Interaction (PPI) networks with the aid of Knowledge Discovery. PPI networks are usually represented as undirected graphs, with nodes corresponding to proteins and edges representing interactions among protein pairs. A large

  16. Wiki-pi: a web-server of annotated human protein-protein interactions to aid in discovery of protein function.

    Directory of Open Access Journals (Sweden)

    Naoki Orii

    Protein-protein interactions (PPIs) are the basis of biological functions. Knowledge of the interactions of a protein can help understand its molecular function and its association with different biological processes and pathways. Several publicly available databases provide comprehensive information about individual proteins, such as their sequence, structure, and function. There also exist databases that are built exclusively to provide PPIs by curating them from published literature. The information provided in these web resources is protein-centric, and not PPI-centric. The PPIs are typically provided as lists of interactions of a given gene with links to interacting partners; they do not present a comprehensive view of the nature of both the proteins involved in the interactions. A web database that allows search and retrieval based on biomedical characteristics of PPIs is lacking, and is needed. We present Wiki-Pi (read Wiki-π), a web-based interface to a database of human PPIs, which allows users to retrieve interactions by their biomedical attributes such as their association to diseases, pathways, drugs and biological functions. Each retrieved PPI is shown with annotations of both of the participant proteins side-by-side, creating a basis to hypothesize the biological function facilitated by the interaction. Conceptually, it is a search engine for PPIs analogous to PubMed for scientific literature. Its usefulness in generating novel scientific hypotheses is demonstrated through the study of IGSF21, a little-known gene that was recently identified to be associated with diabetic retinopathy. Using Wiki-Pi, we infer that its association to diabetic retinopathy may be mediated through its interactions with the genes HSPB1, KRAS, TMSB4X and DGKD, and that it may be involved in cellular response to external stimuli, cytoskeletal organization and regulation of molecular activity. The website also provides a wiki-like capability allowing users

  17. Functional structural motifs for protein-ligand, protein-protein, and protein-nucleic acid interactions and their connection to supersecondary structures.

    Science.gov (United States)

    Kinjo, Akira R; Nakamura, Haruki

    2013-01-01

    Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has a particularly high propensity to be found at interaction interfaces in general, indicating that any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.

  18. Regulation of PCNA-protein interactions for genome stability

    DEFF Research Database (Denmark)

    Mailand, Niels; Gibbs-Seymour, Ian; Bekker-Jensen, Simon

    2013-01-01

    Proliferating cell nuclear antigen (PCNA) has a central role in promoting faithful DNA replication, providing a molecular platform that facilitates the myriad protein-protein and protein-DNA interactions that occur at the replication fork. Numerous PCNA-associated proteins compete for binding...

  19. Gap junctions and connexin-interacting proteins

    NARCIS (Netherlands)

    Giepmans, Ben N G

    2004-01-01

    Gap junctions form channels between adjacent cells. The core proteins of these channels are the connexins. Regulation of gap junction communication (GJC) can be modulated by connexin-associating proteins, such as regulatory protein phosphatases and protein kinases, of which c-Src is the

  20. Protein backbone angle restraints from searching a database for chemical shift and sequence homology

    Energy Technology Data Exchange (ETDEWEB)

    Cornilescu, Gabriel; Delaglio, Frank; Bax, Ad [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    1999-03-15

    Chemical shifts of backbone atoms in proteins are exquisitely sensitive to local conformation, and homologous proteins show quite similar patterns of secondary chemical shifts. The inverse of this relation is used to search a database for triplets of adjacent residues with secondary chemical shifts and sequence similarity which provide the best match to the query triplet of interest. The database contains 13Cα, 13Cβ, 13C', 1Hα and 15N chemical shifts for 20 proteins for which a high resolution X-ray structure is available. The computer program TALOS was developed to search this database for strings of residues with chemical shift and residue type homology. The relative importance of the weighting factors attached to the secondary chemical shifts of the five types of resonances relative to that of sequence similarity was optimized empirically. TALOS yields the 10 triplets which have the closest similarity in secondary chemical shift and amino acid sequence to those of the query sequence. If the central residues in these 10 triplets exhibit similar φ and ψ backbone angles, their averages can reliably be used as angular restraints for the protein whose structure is being studied. Tests carried out for proteins of known structure indicate that the root-mean-square difference (rmsd) between the output of TALOS and the X-ray derived backbone angles is about 15°. Approximately 3% of the predictions made by TALOS are found to be in error.
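
    The final averaging step described in this record can be sketched as below. This is not the TALOS implementation: the consistency criterion (every match within a fixed deviation of the circular mean), the 30 degree threshold, the function names and the example angles are all assumptions chosen for illustration. Angles are averaged on the circle so that values near +180 and -180 degrees are treated as close.

```python
import math

def circular_mean_deg(angles):
    """Mean of angles in degrees, computed on the unit circle."""
    s = sum(math.sin(math.radians(a)) for a in angles)
    c = sum(math.cos(math.radians(a)) for a in angles)
    return math.degrees(math.atan2(s, c))

def angular_diff_deg(a, b):
    """Smallest absolute difference between two angles, in degrees (0 to 180)."""
    return abs((a - b + 180.0) % 360.0 - 180.0)

def consensus_restraint(matches, max_dev=30.0):
    """Return averaged (phi, psi) if the database matches agree, else None.

    matches: (phi, psi) pairs in degrees for the central residues of the
    best-matching triplets. max_dev is a hypothetical tolerance.
    """
    phis = [phi for phi, _ in matches]
    psis = [psi for _, psi in matches]
    phi_mean = circular_mean_deg(phis)
    psi_mean = circular_mean_deg(psis)
    consistent = all(
        angular_diff_deg(phi, phi_mean) <= max_dev
        and angular_diff_deg(psi, psi_mean) <= max_dev
        for phi, psi in matches
    )
    return (phi_mean, psi_mean) if consistent else None

# Hypothetical matches: a consistent helical cluster and a scattered set.
helical = [(-60.0, -45.0), (-65.0, -40.0), (-55.0, -50.0)]
scattered = [(-60.0, -45.0), (60.0, 45.0), (-120.0, 130.0)]
helical_restraint = consensus_restraint(helical)
scattered_restraint = consensus_restraint(scattered)
```

    Returning no restraint for the scattered set mirrors the condition in the abstract that averages are only used when the matched central residues show similar backbone angles.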

  1. The dynamic multisite interactions between two intrinsically disordered proteins

    KAUST Repository

    Wu, Shaowen; Wang, Dongdong; Liu, Jin; Feng, Yitao; Weng, Jingwei; Li, Yu; Gao, Xin; Liu, Jianwei; Wang, Wenning

    2017-01-01

    Protein interactions involving intrinsically disordered proteins (IDPs) comprise a variety of binding modes, from the well characterized folding upon binding to dynamic fuzzy complex. To date, most studies concern the binding of an IDP to a

  2. UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.

    Science.gov (United States)

    Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier

    2016-01-04

    The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  3. The Ser/Thr Protein Kinase Protein-Protein Interaction Map of M. tuberculosis.

    Science.gov (United States)

    Wu, Fan-Lin; Liu, Yin; Jiang, He-Wei; Luan, Yi-Zhao; Zhang, Hai-Nan; He, Xiang; Xu, Zhao-Wei; Hou, Jing-Li; Ji, Li-Yun; Xie, Zhi; Czajkowsky, Daniel M; Yan, Wei; Deng, Jiao-Yu; Bi, Li-Jun; Zhang, Xian-En; Tao, Sheng-Ce

    2017-08-01

    Mycobacterium tuberculosis (Mtb) is the causative agent of tuberculosis, the leading cause of death among all infectious diseases. There are 11 eukaryotic-like serine/threonine protein kinases (STPKs) in Mtb, which are thought to play pivotal roles in cell growth, signal transduction and pathogenesis. However, their underlying mechanisms of action remain largely uncharacterized. In this study, using a Mtb proteome microarray, we have globally identified the binding proteins in Mtb for all of the STPKs, and constructed the first STPK protein interaction (KPI) map that includes 492 binding proteins and 1,027 interactions. Bioinformatics analysis showed that the interacting proteins reflect diverse functions, including roles in two-component system, transcription, protein degradation, and cell wall integrity. Functional investigations confirmed that PknG regulates cell wall integrity through key components of peptidoglycan (PG) biosynthesis, e.g. MurC. The global STPK-KPIs network constructed here is expected to serve as a rich resource for understanding the key signaling pathways in Mtb, thus facilitating drug development and effective control of Mtb. © 2017 by The American Society for Biochemistry and Molecular Biology, Inc.

  4. Identification of proteins that may directly interact with human RPA.

    Science.gov (United States)

    Nakaya, Ryou; Takaya, Junichiro; Onuki, Takeshi; Moritani, Mariko; Nozaki, Naohito; Ishimi, Yukio

    2010-11-01

    RPA, which consists of three subunits (RPA1, 2 and 3), plays essential roles in DNA transactions. At the DNA replication forks, RPA binds to single-stranded DNA regions to stabilize the structure and to assemble other replication proteins. Interactions between RPA and several replication proteins have been reported but the analysis is not comprehensive. We systematically performed a qualitative analysis to identify RPA interaction partners to understand the protein-protein interactions at the replication forks. We expressed in insect cells the three subunits of human RPA, together with one replication protein, which is present at the forks under normal conditions and/or under the replication stress conditions, to examine the interaction. Among 30 proteins examined in total, it was found that at least 14 proteins interacted with RPA. RPA interacted with MCM3-7, MCM-BP and CDC45 proteins among the proteins that play roles in the initiation and the elongation of the DNA replication. RPA bound with TIPIN, CLASPIN and RAD17, which are involved in the DNA replication checkpoint functions. RPA also bound with cyclin-dependent kinases and an amino-terminal fragment of Rb protein that negatively regulates DNA replication. These results suggest that RPA interacts with the specific proteins among those that play roles in the regulation of the replication fork progression.

  5. quinolinium iodide in suppression of protein–protein interactions

    Indian Academy of Sciences (India)

    In searching for alternative ways to reduce protein–protein interactions or to inhibit the amyloid formation, the inhibitory effects ..... ing the exposure of hydrophobic surfaces mirrors the ... is well-supported by electrostatic interactions between.

  6. PATtyFams: Protein families for the microbial genomes in the PATRIC database

    Directory of Open Access Journals (Sweden)

    James J Davis

    2016-02-01

    The ability to build accurate protein families is a fundamental operation in bioinformatics that influences comparative analyses, genome annotation and metabolic modeling. For several years we have been maintaining protein families for all microbial genomes in the PATRIC database (Pathosystems Resource Integration Center, patricbrc.org) in order to drive many of the comparative analysis tools that are available through the PATRIC website. However, due to the burgeoning number of genomes, traditional approaches for generating protein families are becoming prohibitive. In this report, we describe a new approach for generating protein families, which we call PATtyFams. This method uses the k-mer-based function assignments available through RAST (Rapid Annotation using Subsystem Technology) to rapidly guide family formation, and then differentiates the function-based groups into families using a Markov Cluster algorithm (MCL). This new approach for generating protein families is rapid, scalable and has properties that are consistent with alignment-based methods.
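
    The Markov Cluster step mentioned above can be illustrated with a minimal textbook-style sketch. It is not the PATRIC pipeline: the input matrix, the inflation value of 2.0, the fixed iteration count and all function names are assumptions chosen for illustration. The sketch alternates expansion (matrix squaring) and inflation (element-wise powering followed by column normalization) and then reads clusters off the rows of "attractor" nodes.

```python
def normalize_columns(m):
    """Scale every column of a square matrix so that it sums to 1."""
    n = len(m)
    sums = [sum(m[i][j] for i in range(n)) for j in range(n)]
    return [[m[i][j] / sums[j] for j in range(n)] for i in range(n)]


def matmul(a, b):
    """Plain-Python product of two square matrices."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def mcl(adjacency, inflation=2.0, iterations=50, eps=1e-6):
    """Cluster an undirected graph given as a square similarity matrix.

    Hypothetical helper: parameters and defaults are illustrative only.
    """
    n = len(adjacency)
    # Add self-loops so that every column has a nonzero sum.
    m = [[adjacency[i][j] + (1.0 if i == j else 0.0) for j in range(n)]
         for i in range(n)]
    m = normalize_columns(m)
    for _ in range(iterations):
        m = matmul(m, m)  # expansion: let flow spread along longer paths
        # inflation: strengthen strong flows, weaken weak ones
        m = normalize_columns([[x ** inflation for x in row] for row in m])
    clusters = set()
    for i in range(n):
        if m[i][i] > eps:  # node i is an attractor
            clusters.add(frozenset(j for j in range(n) if m[i][j] > eps))
    return sorted(sorted(c) for c in clusters)


# Two disconnected triangles (nodes 0-2 and 3-5) as a toy similarity graph.
toy = [[0, 1, 1, 0, 0, 0],
       [1, 0, 1, 0, 0, 0],
       [1, 1, 0, 0, 0, 0],
       [0, 0, 0, 0, 1, 1],
       [0, 0, 0, 1, 0, 1],
       [0, 0, 0, 1, 1, 0]]
toy_clusters = mcl(toy)
```

    In this toy input the two triangles share no edges, so no flow can cross between them and each triangle ends up as its own family.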

  7. HKC: An Algorithm to Predict Protein Complexes in Protein-Protein Interaction Networks

    Directory of Open Access Journals (Sweden)

    Xiaomin Wang

    2011-01-01

    With the availability of more and more genome-scale protein-protein interaction (PPI) networks, research interests gradually shift to systematic analysis of these large data sets. A key topic is to predict protein complexes in PPI networks by identifying clusters that are densely connected within themselves but sparsely connected with the rest of the network. In this paper, we present a new topology-based algorithm, HKC, to detect protein complexes in genome-scale PPI networks. HKC mainly uses the concepts of highest k-core and cohesion to predict protein complexes by identifying overlapping clusters. The experiments on two data sets and two benchmarks show that our algorithm has relatively high F-measure and exhibits better performance compared with some other methods.
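
    The "highest k-core" concept named above can be sketched as follows. This covers only the standard k-core peeling procedure, not the cohesion measure or the overlapping-cluster logic of HKC, which the abstract does not specify. The graph representation (a dict mapping each node to its set of neighbours) and the function names are assumptions.

```python
def core_numbers(graph):
    """Return the core number of every node of an undirected graph.

    graph: dict mapping node -> set of neighbouring nodes.
    Repeatedly removes a node of minimum remaining degree (peeling).
    """
    degree = {v: len(nbrs) for v, nbrs in graph.items()}
    remaining = set(graph)
    core = {}
    k = 0
    while remaining:
        v = min(remaining, key=degree.get)
        k = max(k, degree[v])  # core numbers never decrease during peeling
        core[v] = k
        remaining.remove(v)
        for u in graph[v]:
            if u in remaining:
                degree[u] -= 1
    return core


def highest_k_core(graph):
    """Return (k_max, nodes in the k_max-core) for a non-empty graph."""
    core = core_numbers(graph)
    k_max = max(core.values())
    return k_max, {v for v, c in core.items() if c == k_max}


# Toy network: a 4-clique A-B-C-D with a two-node tail A-E-F.
toy_ppi = {
    "A": {"B", "C", "D", "E"},
    "B": {"A", "C", "D"},
    "C": {"A", "B", "D"},
    "D": {"A", "B", "C"},
    "E": {"A", "F"},
    "F": {"E"},
}
k_max, densest = highest_k_core(toy_ppi)
```

    Here the tail nodes are peeled away first, leaving the clique as the densest core, which is the kind of region a complex-prediction method would then examine further.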

  8. Targeting protein-protein interaction between MLL1 and reciprocal proteins for leukemia therapy.

    Science.gov (United States)

    Wang, Zhi-Hui; Li, Dong-Dong; Chen, Wei-Lin; You, Qi-Dong; Guo, Xiao-Ke

    2018-01-15

    The mixed lineage leukemia protein-1 (MLL1), as a lysine methyltransferase, predominantly regulates the methylation of histone H3 lysine 4 (H3K4) and functions in hematopoietic stem cell (HSC) self-renewal. The MLL1 gene fuses with partner genes, which results in the generation of MLL1 fusion proteins (MLL1-FPs) that are frequently detected in acute leukemia. During leukemogenesis, many proteins cooperate with MLL1 to form multiprotein complexes that drive the dysregulation of H3K4 methylation, the overexpression of homeobox (HOX) cluster genes, and the consequent generation of leukemia. Hence, disrupting the interactions between MLL1 and the reciprocal proteins has been considered to be a new treatment strategy for leukemia. Here, we reviewed potential protein-protein interactions (PPIs) between MLL1 and its reciprocal proteins, and summarized the inhibitors that target MLL1 PPIs. The druggability of MLL1 PPIs for leukemia was also discussed. Copyright © 2017. Published by Elsevier Ltd.

  9. Proteins interacting with the 26S proteasome

    DEFF Research Database (Denmark)

    Hartmann-Petersen, R; Gordon, C

    2004-01-01

    The 26S proteasome is the multi-protein protease that recognizes and degrades ubiquitinylated substrates targeted for destruction by the ubiquitin pathway. In addition to the well-documented subunit organization of the 26S holoenzyme, it is clear that a number of other proteins transiently associate with the 26S complex. These transiently associated proteins confer a number of different roles such as substrate presentation, cleavage of the multi-ubiquitin chain from the protein substrate and turnover of misfolded proteins. Such activities are essential for the 26S proteasome to efficiently fulfill its intracellular function in protein degradation.

  10. muBLASTP: database-indexed protein sequence search on multicore CPUs.

    Science.gov (United States)

    Zhang, Jing; Misra, Sanchit; Wang, Hao; Feng, Wu-Chun

    2016-11-04

    The Basic Local Alignment Search Tool (BLAST) is a fundamental program in the life sciences that searches databases for sequences that are most similar to a query sequence. Currently, the BLAST algorithm utilizes a query-indexed approach. Although many approaches suggest that sequence search with a database index can achieve much higher throughput (e.g., BLAT, SSAHA, and CAFE), they cannot deliver the same level of sensitivity as the query-indexed BLAST, i.e., NCBI BLAST, or they can only support nucleotide sequence search, e.g., MegaBLAST. Due to the different challenges and characteristics of query indexing and database indexing, the existing techniques for query-indexed search cannot be used in database-indexed search. muBLASTP, a novel database-indexed BLAST for protein sequence search, delivers hits identical to those returned by NCBI BLAST. On Intel Haswell multicore CPUs, for a single query, the single-threaded muBLASTP achieves up to a 4.41-fold speedup for alignment stages, and up to a 1.75-fold end-to-end speedup over single-threaded NCBI BLAST. For a batch of queries, the multithreaded muBLASTP achieves up to a 5.7-fold speedup for alignment stages, and up to a 4.56-fold end-to-end speedup over multithreaded NCBI BLAST. With a newly designed index structure for the protein database and associated optimizations in the BLASTP algorithm, we re-factored the BLASTP algorithm for modern multicore processors to achieve much higher throughput with an acceptable memory footprint for the database index.
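
    The difference between query indexing and database indexing can be made concrete with a small sketch of the latter. It assumes a word length of 3 and exact word matches only; BLASTP itself also considers similar "neighbouring" words, and the actual muBLASTP index structure is not described in the abstract. All names and sequences below are hypothetical.

```python
from collections import defaultdict


def build_database_index(database, k=3):
    """Index every length-k word of every database sequence once, up front.

    database: dict mapping sequence id -> protein sequence string.
    Returns a mapping word -> list of (sequence id, position).
    """
    index = defaultdict(list)
    for seq_id, seq in database.items():
        for pos in range(len(seq) - k + 1):
            index[seq[pos:pos + k]].append((seq_id, pos))
    return index


def find_seeds(query, index, k=3):
    """Look up each query word in the prebuilt index.

    Returns (query position, sequence id, database position) triples that
    a full search would then try to extend into alignments.
    """
    seeds = []
    for qpos in range(len(query) - k + 1):
        for seq_id, dpos in index.get(query[qpos:qpos + k], []):
            seeds.append((qpos, seq_id, dpos))
    return seeds


# Hypothetical toy database and query.
toy_db = {"p1": "MKVLAA", "p2": "GGKVLT"}
toy_index = build_database_index(toy_db)
toy_seeds = find_seeds("AKVLG", toy_index)
```

    The index is built once and reused for every query, which is where a database-indexed design can gain throughput on batches; a query-indexed design instead rebuilds its lookup table per query and scans the database each time.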

  11. In silico study of protein to protein interaction analysis of AMP-activated protein kinase and mitochondrial activity in three different farm animal species

    Science.gov (United States)

    Prastowo, S.; Widyas, N.

    2018-03-01

    AMP-activated protein kinase (AMPK) is a cellular energy sensor that responds to ATP and AMP concentrations. This protein interacts with mitochondria in determining its activity to generate energy for cell metabolism. This paper therefore aims to compare the protein-to-protein interactions of AMPK and mitochondrial activity genes in the metabolism of three domesticated farm animals: cattle (Bos taurus), pig (Sus scrofa) and chicken (Gallus gallus). The in silico study was done using STRING V.10 as a prominent protein interaction database, followed by a biological function comparison in the KEGG PATHWAY database. A set of 12 genes was used as input for the analysis: PRKAA1, PRKAA2, PRKAB1, PRKAB2, PRKAG1, PRKAG2, PRKAG3, PPARGC1, ACC, CPT1B, NRF2 and SOD. The first 7 genes belong to the AMPK family, while the last 5 are mitochondrial activity genes. The protein interaction results show 11, 8 and 5 metabolism pathways in Bos taurus, Sus scrofa and Gallus gallus, respectively. The top pathway in Bos taurus is the AMPK signaling pathway (10 genes), in Sus scrofa the Adipocytokine signaling pathway (8 genes) and in Gallus gallus the FoxO signaling pathway (5 genes). Moreover, the pathways common to all 3 species are the Adipocytokine signaling pathway, the Insulin signaling pathway and the FoxO signaling pathway. The genes clustered in the Adipocytokine and Insulin signaling pathways are PRKAA2, PPARGC1A, PRKAB1 and PRKAG2, while those in the FoxO signaling pathway are PRKAA2, PRKAB1 and PRKAG2. Accordingly, PRKAA2, PRKAB1 and PRKAG2 are the common genes. Based on the bioinformatics analysis, protein-to-protein interactions show distinct differences in metabolism between species. However, further validation is needed to give a clear explanation.
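
    The step from per-pathway gene lists to the three common genes is a set intersection. The short sketch below reproduces it using only the gene names listed in the abstract; the variable names are illustrative.

```python
# Genes reported in the abstract for the three pathways shared by all species.
pathway_genes = {
    "Adipocytokine signaling pathway": {"PRKAA2", "PPARGC1A", "PRKAB1", "PRKAG2"},
    "Insulin signaling pathway": {"PRKAA2", "PPARGC1A", "PRKAB1", "PRKAG2"},
    "FoxO signaling pathway": {"PRKAA2", "PRKAB1", "PRKAG2"},
}

# Genes present in every one of the shared pathways.
common_genes = set.intersection(*pathway_genes.values())
print(sorted(common_genes))
```

    PPARGC1A drops out because it is absent from the FoxO list, leaving PRKAA2, PRKAB1 and PRKAG2.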

  12. An online interactive geometric database including exact solutions of Einstein's field equations

    International Nuclear Information System (INIS)

    Ishak, Mustapha; Lake, Kayll

    2002-01-01

    We describe a new interactive database (GRDB) of geometric objects in the general area of differential geometry. Database objects include, but are not restricted to, exact solutions of Einstein's field equations. GRDB is designed for researchers (and teachers) in applied mathematics, physics and related fields. The flexible search environment allows the database to be useful over a wide spectrum of interests, for example, from practical considerations of neutron star models in astrophysics to abstract space-time classification schemes. The database is built using a modular and object-oriented design and uses several Java technologies (e.g. Applets, Servlets, JDBC). These are platform-independent and well adapted for applications developed for the World Wide Web. GRDB is accompanied by a virtual calculator (GRTensorJ), a graphical user interface to the computer algebra system GRTensorII, used to perform online coordinate, tetrad or basis calculations. The highly interactive nature of GRDB allows systematic internal self-checking and minimization of the required internal records. This new database is now available online at http://grdb.org

  13. Context-specific protein network miner - an online system for exploring context-specific protein interaction networks from the literature

    KAUST Repository

    Chowdhary, Rajesh

    2012-04-06

    Background: Protein interaction networks (PINs) specific within a particular context contain crucial information regarding many cellular biological processes. For example, PINs may include information on the type and directionality of interaction (e.g. phosphorylation), location of interaction (i.e. tissues, cells), and related diseases. Currently, very few tools are capable of deriving context-specific PINs for conducting exploratory analysis. Results: We developed a literature-based online system, Context-specific Protein Network Miner (CPNM), which derives context-specific PINs in real-time from the PubMed database based on a set of user-input keywords and enhanced PubMed query system. CPNM reports enriched information on protein interactions (with type and directionality), their network topology with summary statistics (e.g. most densely connected proteins in the network; most densely connected protein-pairs; and proteins connected by most inbound/outbound links) that can be explored via a user-friendly interface. Some of the novel features of the CPNM system include PIN generation, ontology-based PubMed query enhancement, real-time, user-queried, up-to-date PubMed document processing, and prediction of PIN directionality. Conclusions: CPNM provides a tool for biologists to explore PINs. It is freely accessible at http://www.biotextminer.com/CPNM/. © 2012 Chowdhary et al.
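
    The kind of summary statistics described above (most connected proteins, most inbound and outbound links) can be sketched for a small directed interaction list. This is not CPNM's implementation; the edge format, the function name and the protein names are all hypothetical placeholders.

```python
from collections import Counter


def degree_summary(interactions):
    """Summarize a directed interaction list.

    interactions: iterable of (source, target, interaction_type) triples.
    Returns the top protein and its link count for total, outbound and
    inbound links.
    """
    out_links = Counter(src for src, _dst, _kind in interactions)
    in_links = Counter(dst for _src, dst, _kind in interactions)
    total = out_links + in_links
    return {
        "most_connected": total.most_common(1)[0],
        "most_outbound": out_links.most_common(1)[0],
        "most_inbound": in_links.most_common(1)[0],
    }


# Hypothetical directed interactions (placeholder names, not real data).
toy_pin = [
    ("KinaseA", "P1", "phosphorylation"),
    ("KinaseA", "P2", "phosphorylation"),
    ("P3", "P1", "binding"),
    ("P2", "P1", "binding"),
]
toy_summary = degree_summary(toy_pin)
```

    Keeping direction in the edge list is what allows inbound and outbound hubs to be reported separately.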

  14. Context-specific protein network miner - an online system for exploring context-specific protein interaction networks from the literature

    KAUST Repository

    Chowdhary, Rajesh; Tan, Sin Lam; Zhang, Jinfeng; Karnik, Shreyas; Bajic, Vladimir B.; Liu, Jun S.

    2012-01-01

    Background: Protein interaction networks (PINs) specific within a particular context contain crucial information regarding many cellular biological processes. For example, PINs may include information on the type and directionality of interaction (e.g. phosphorylation), location of interaction (i.e. tissues, cells), and related diseases. Currently, very few tools are capable of deriving context-specific PINs for conducting exploratory analysis. Results: We developed a literature-based online system, Context-specific Protein Network Miner (CPNM), which derives context-specific PINs in real-time from the PubMed database based on a set of user-input keywords and enhanced PubMed query system. CPNM reports enriched information on protein interactions (with type and directionality), their network topology with summary statistics (e.g. most densely connected proteins in the network; most densely connected protein-pairs; and proteins connected by most inbound/outbound links) that can be explored via a user-friendly interface. Some of the novel features of the CPNM system include PIN generation, ontology-based PubMed query enhancement, real-time, user-queried, up-to-date PubMed document processing, and prediction of PIN directionality. Conclusions: CPNM provides a tool for biologists to explore PINs. It is freely accessible at http://www.biotextminer.com/CPNM/. © 2012 Chowdhary et al.

  15. Computational design of protein interactions: designing proteins that neutralize influenza by inhibiting its hemagglutinin surface protein

    Science.gov (United States)

    Fleishman, Sarel

    2012-02-01

    Molecular recognition underlies all life processes. Design of interactions not seen in nature is a test of our understanding of molecular recognition and could unlock the vast potential of subtle control over molecular interaction networks, allowing the design of novel diagnostics and therapeutics for basic and applied research. We developed the first general method for designing protein interactions. The method starts by computing a region of high affinity interactions between dismembered amino acid residues and the target surface and then identifying proteins that can harbor these residues. Designs are tested experimentally for binding the target surface and successful ones are affinity matured using yeast cell surface display. Applied to the conserved stem region of influenza hemagglutinin we designed two unrelated proteins that, following affinity maturation, bound hemagglutinin at subnanomolar dissociation constants. Co-crystal structures of hemagglutinin bound to the two designed binders were within 1 Å RMSD of their models, validating the accuracy of the design strategy. One of the designed proteins inhibits the conformational changes that underlie hemagglutinin's cell-invasion functions and blocks virus infectivity in cell culture, suggesting that such proteins may in future serve as diagnostics and antivirals against a wide range of pathogenic influenza strains. We have used this method to obtain experimentally validated binders of several other target proteins, demonstrating the generality of the approach. We discuss the combination of modeling and high-throughput characterization of design variants which has been key to the success of this approach, as well as how we have used the data obtained in this project to enhance our understanding of molecular recognition. References: Science 332:816; JMB, in press; Protein Sci 20:753.

  16. PPI-IRO: A two-stage method for protein-protein interaction extraction based on interaction relation ontology

    KAUST Repository

    Li, Chuanxi; Chen, Peng; Wang, Rujing; Wang, Xiujie; Su, Yaru; Li, Jinyan

    2014-01-01

    Mining Protein-Protein Interactions (PPIs) from the fast-growing biomedical literature resources has been proven as an effective approach for the identification of biological regulatory networks. This paper presents a novel method based on the idea

  17. Globular and disordered – the non-identical twins in protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Kaare Teilum

    2015-07-01

    In biology, proteins from different structural classes interact across and within classes in ways that are optimized to achieve balanced functional outputs. The interactions between intrinsically disordered proteins (IDPs) and other proteins rely on changes in flexibility and this is seen as a strong determinant for their function. This has fostered the notion that IDPs bind with low affinity but high specificity. Here we have analyzed available detailed thermodynamic data for protein-protein interactions to test whether the thermodynamic profiles of IDP interactions differ from those of other protein-protein interactions. We find that ordered and disordered proteins act as non-identical twins operating by similar principles, but that the disordered protein complexes are on average less stable by 2.5 kcal mol^-1.
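
    To put the 2.5 kcal/mol figure in perspective, a stability difference can be converted into a fold change in dissociation constant through the standard relation ΔΔG = RT ln(Kd,2 / Kd,1). The sketch below assumes that the reported value refers to binding free energy and that the temperature is 298.15 K; neither is stated in the abstract, and the function name is illustrative.

```python
import math

R_KCAL = 1.987204e-3  # gas constant in kcal/(mol*K)


def kd_fold_change(ddg_kcal_per_mol, temperature_k=298.15):
    """Fold change in dissociation constant for a given free energy gap.

    Uses ddG = R * T * ln(Kd_weaker / Kd_stronger); the default
    temperature of 298.15 K is an assumption.
    """
    return math.exp(ddg_kcal_per_mol / (R_KCAL * temperature_k))


fold = kd_fold_change(2.5)
print(f"{fold:.0f}-fold weaker binding")
```

    Under these assumptions a gap of 2.5 kcal/mol corresponds to a dissociation constant that is roughly 70 times higher on average, that is, less than two orders of magnitude.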

  18. The dynamic multisite interactions between two intrinsically disordered proteins

    KAUST Repository

    Wu, Shaowen

    2017-05-11

    Protein interactions involving intrinsically disordered proteins (IDPs) comprise a variety of binding modes, from the well characterized folding upon binding to dynamic fuzzy complex. To date, most studies concern the binding of an IDP to a structured protein, while the interaction between two IDPs is poorly understood. In this study, we combined NMR, smFRET, and molecular dynamics (MD) simulation to characterize the interaction between two IDPs, the C-terminal domain (CTD) of protein 4.1G and the nuclear mitotic apparatus (NuMA) protein. It is revealed that CTD and NuMA form a fuzzy complex with remaining structural disorder. Multiple binding sites on both proteins were identified by MD and mutagenesis studies. Our study provides an atomic scenario in which two IDPs bearing multiple binding sites interact with each other in dynamic equilibrium. The combined approach employed here could be widely applicable for investigating IDPs and their dynamic interactions.

  19. BRCA1 interacts directly with the Fanconi anemia protein FANCA.

    Science.gov (United States)

    Folias, Alexandra; Matkovic, Mara; Bruun, Donald; Reid, Sonja; Hejna, James; Grompe, Markus; D'Andrea, Alan; Moses, Robb

    2002-10-01

    Fanconi anemia (FA) is a rare autosomal recessive disease characterized by skeletal defects, anemia, chromosomal instability and increased risk of leukemia. At the cellular level FA is characterized by increased sensitivity to agents forming interstrand crosslinks (ICL) in DNA. Six FA genes have been cloned and interactions among individual FANC proteins have been found. The FANCD2 protein co-localizes in nuclear foci with the BRCA1 protein following DNA damage and during S-phase, requiring the FANCA, C, E and G proteins to do so. This finding may reflect a direct role for the BRCA1 protein in double strand break (DSB) repair and interaction with the FANC proteins. Therefore interactions between BRCA1 and the FANC proteins were investigated. Among the known FANC proteins, we find evidence for direct interaction only between the FANCA protein and BRCA1. The evidence rests on three different tests: yeast two-hybrid analysis, coimmunoprecipitation from in vitro synthesis, and coimmunoprecipitation from cell extracts. The amino terminal portion of FANCA and the central part (aa 740-1083) of BRCA1 contain the sites of interaction. The interaction does not depend on DNA damage, thus FANCA and BRCA1 are constitutively interacting. The demonstrated interaction directly connects BRCA1 to the FA pathway of DNA repair.

  20. MannDB – A microbial database of automated protein sequence analyses and evidence integration for protein characterization

    Directory of Open Access Journals (Sweden)

    Kuczmarski Thomas A

    2006-10-01

    Background: MannDB was created to meet a need for rapid, comprehensive automated protein sequence analyses to support selection of proteins suitable as targets for driving the development of reagents for pathogen or protein toxin detection. Because a large number of open-source tools were needed, it was necessary to produce a software system to scale the computations for whole-proteome analysis. Thus, we built a fully automated system for executing software tools and for storage, integration, and display of automated protein sequence analysis and annotation data. Description: MannDB is a relational database that organizes data resulting from fully automated, high-throughput protein-sequence analyses using open-source tools. Types of analyses provided include predictions of cleavage, chemical properties, classification, features, functional assignment, post-translational modifications, motifs, antigenicity, and secondary structure. Proteomes (lists of hypothetical and known proteins) are downloaded and parsed from Genbank and then inserted into MannDB, and annotations from SwissProt are downloaded when identifiers are found in the Genbank entry or when identical sequences are identified. Currently 36 open-source tools are run against MannDB protein sequences either on local systems or by means of batch submission to external servers. In addition, BLAST against protein entries in MvirDB, our database of microbial virulence factors, is performed. A web client browser enables viewing of computational results and downloaded annotations, and a query tool enables structured and free-text search capabilities. When available, links to external databases, including MvirDB, are provided. MannDB contains whole-proteome analyses for at least one representative organism from each category of biological threat organism listed by APHIS, CDC, HHS, NIAID, USDA, USFDA, and WHO. Conclusion: MannDB comprises a large number of genomes and comprehensive protein

  1. Protein interactions in genome maintenance as novel antibacterial targets.

    Directory of Open Access Journals (Sweden)

    Aimee H Marceau

    Antibacterial compounds typically act by directly inhibiting essential bacterial enzyme activities. Although this general mechanism of action has fueled traditional antibiotic discovery efforts for decades, new antibiotic development has not kept pace with the emergence of drug resistant bacterial strains. These limitations have severely restricted the therapeutic tools available for treating bacterial infections. Here we test an alternative antibacterial lead-compound identification strategy in which essential protein-protein interactions are targeted rather than enzymatic activities. Bacterial single-stranded DNA-binding proteins (SSBs) form conserved protein interaction "hubs" that are essential for recruiting many DNA replication, recombination, and repair proteins to SSB/DNA nucleoprotein substrates. Three small molecules that block SSB/protein interactions are shown to have antibacterial activity against diverse bacterial species. Consistent with a model in which the compounds target multiple SSB/protein interactions, treatment of Bacillus subtilis cultures with the compounds leads to rapid inhibition of DNA replication and recombination, and ultimately to cell death. The compounds also have unanticipated effects on protein synthesis that could be due to a previously unknown role for SSB/protein interactions in translation or to off-target effects. Our results highlight the potential of targeting protein-protein interactions, particularly those that mediate genome maintenance, as a powerful approach for identifying new antibacterial compounds.

  2. Two-dimensional gel human protein databases offer a systematic approach to the study of cell proliferation and differentiation

    DEFF Research Database (Denmark)

    Celis, Julio E.; Gesser, Borbala; Dejgaard, Kurt

    1989-01-01

    Human cellular protein databases have been established using computer-analyzed 2D gel electrophoresis. These databases, which include information on various properties of proteins, offer a global approach to the study of regulation of cell proliferation and differentiation. Furthermore, thanks...

  3. Two dimensional gel human protein databases offer a systematic approach to the study of cell proliferation and differentiation

    DEFF Research Database (Denmark)

    Celis, J E; Gesser, B; Dejgaard, K

    1989-01-01

    Human cellular protein databases have been established using computer-analyzed 2D gel electrophoresis. These databases, which include information on various properties of proteins, offer a global approach to the study of regulation of cell proliferation and differentiation. Furthermore, thanks to...

  4. The simulation approach to lipid-protein interactions.

    Science.gov (United States)

    Paramo, Teresa; Garzón, Diana; Holdbrook, Daniel A; Khalid, Syma; Bond, Peter J

    2013-01-01

    The interactions between lipids and proteins are crucial for a range of biological processes, from the folding and stability of membrane proteins to signaling and metabolism facilitated by lipid-binding proteins. However, high-resolution structural details concerning functional lipid/protein interactions are scarce due to barriers in both experimental isolation of native lipid-bound complexes and subsequent biophysical characterization. The molecular dynamics (MD) simulation approach provides a means to complement available structural data, yielding dynamic, structural, and thermodynamic data for a protein embedded within a physiologically realistic, modelled lipid environment. In this chapter, we provide a guide to current methods for setting up and running simulations of membrane proteins and soluble, lipid-binding proteins, using standard atomistically detailed representations, as well as simplified, coarse-grained models. In addition, we outline recent studies that illustrate the power of the simulation approach in the context of biologically relevant lipid/protein interactions.

  5. Prediction of protein–protein interactions: unifying evolution and structure at protein interfaces

    International Nuclear Information System (INIS)

    Tuncbag, Nurcan; Gursoy, Attila; Keskin, Ozlem

    2011-01-01

    The vast majority of the chores in the living cell involve protein–protein interactions. Providing details of protein interactions at the residue level and incorporating them into protein interaction networks are crucial toward the elucidation of a dynamic picture of cells. Despite the rapid increase in the number of structurally known protein complexes, we are still far away from a complete network. Given experimental limitations, computational modeling of protein interactions is a prerequisite to proceed on the way to complete structural networks. In this work, we focus on the question 'how do proteins interact?' rather than 'which proteins interact?' and we review structure-based protein–protein interaction prediction approaches. As a sample approach for modeling protein interactions, we detail PRISM, which combines structural similarity and evolutionary conservation in protein interfaces to infer structures of complexes in the protein interaction network. This will ultimately help us to understand the role of protein interfaces in predicting bound conformations.

  6. Mitochondrial nucleoid interacting proteins support mitochondrial protein synthesis.

    Science.gov (United States)

    He, J; Cooper, H M; Reyes, A; Di Re, M; Sembongi, H; Litwin, T R; Gao, J; Neuman, K C; Fearnley, I M; Spinazzola, A; Walker, J E; Holt, I J

    2012-07-01

    Mitochondrial ribosomes and translation factors co-purify with mitochondrial nucleoids of human cells, based on affinity protein purification of tagged mitochondrial DNA binding proteins. Among the most frequently identified proteins were ATAD3 and prohibitin, which have been identified previously as nucleoid components, using a variety of methods. Both proteins are demonstrated to be required for mitochondrial protein synthesis in human cultured cells, and the major binding partner of ATAD3 is the mitochondrial ribosome. Altered ATAD3 expression also perturbs mtDNA maintenance and replication. These findings suggest an intimate association between nucleoids and the machinery of protein synthesis in mitochondria. ATAD3 and prohibitin are tightly associated with the mitochondrial membranes and so we propose that they support nucleic acid complexes at the inner membrane of the mitochondrion.

  7. Hot-spot analysis for drug discovery targeting protein-protein interactions.

    Science.gov (United States)

    Rosell, Mireia; Fernández-Recio, Juan

    2018-04-01

    Protein-protein interactions are important for biological processes and pathological situations, and are attractive targets for drug discovery. However, rational drug design targeting protein-protein interactions is still highly challenging. Hot-spot residues are seen as the best option to target such interactions, but their identification requires detailed structural and energetic characterization, which is only available for a tiny fraction of protein interactions. Areas covered: In this review, the authors cover a variety of computational methods that have been reported for the energetic analysis of protein-protein interfaces in search of hot-spots, and the structural modeling of protein-protein complexes by docking. This can help to rationalize the discovery of small-molecule inhibitors of protein-protein interfaces of therapeutic interest. Computational analysis and docking can help to locate the interface, molecular dynamics can be used to find suitable cavities, and hot-spot predictions can focus the search for inhibitors of protein-protein interactions. Expert opinion: A major difficulty for applying rational drug design methods to protein-protein interactions is that in the majority of cases the complex structure is not available. Fortunately, computational docking can complement experimental data. An interesting aspect to explore in the future is the integration of these strategies for targeting PPIs with large-scale mutational analysis.

  8. Mouse IDGenes: a reference database for genetic interactions in the developing mouse brain.

    Science.gov (United States)

    Matthes, Michaela; Preusse, Martin; Zhang, Jingzhong; Schechter, Julia; Mayer, Daniela; Lentes, Bernd; Theis, Fabian; Prakash, Nilima; Wurst, Wolfgang; Trümbach, Dietrich

    2014-01-01

    The study of developmental processes in the mouse and other vertebrates includes the understanding of patterning along the anterior-posterior, dorsal-ventral and medial-lateral axes. Specifically, neural development is also of great clinical relevance because several human neuropsychiatric disorders such as schizophrenia, autism disorders or drug addiction and also brain malformations are thought to have neurodevelopmental origins, i.e. pathogenesis initiates during childhood and adolescence. Impacts during early neurodevelopment might also predispose to late-onset neurodegenerative disorders, such as Parkinson's disease. The neural tube develops from its precursor tissue, the neural plate, in a patterning process that is determined by compartmentalization into morphogenetic units, the action of local signaling centers and a well-defined and locally restricted expression of genes and their interactions. While public databases provide gene expression data with spatio-temporal resolution, they usually neglect the genetic interactions that govern neural development. Here, we introduce Mouse IDGenes, a reference database for genetic interactions in the developing mouse brain. The database is highly curated and offers detailed information about gene expressions and the genetic interactions at the developing mid-/hindbrain boundary. To showcase the predictive power of interaction data, we infer new Wnt/β-catenin target genes by machine learning and validate one of them experimentally. The database is updated regularly. Moreover, it can easily be extended by the research community. Mouse IDGenes will contribute as an important resource to the research on mouse brain development, not exclusively by offering data retrieval, but also by allowing data input. http://mouseidgenes.helmholtz-muenchen.de. © The Author(s) 2014. Published by Oxford University Press.

  9. A Physical Interaction Network of Dengue Virus and Human Proteins*

    Science.gov (United States)

    Khadka, Sudip; Vangeloff, Abbey D.; Zhang, Chaoying; Siddavatam, Prasad; Heaton, Nicholas S.; Wang, Ling; Sengupta, Ranjan; Sahasrabudhe, Sudhir; Randall, Glenn; Gribskov, Michael; Kuhn, Richard J.; Perera, Rushika; LaCount, Douglas J.

    2011-01-01

    Dengue virus (DENV), an emerging mosquito-transmitted pathogen capable of causing severe disease in humans, interacts with host cell factors to create a more favorable environment for replication. However, few interactions between DENV and human proteins have been reported to date. To identify DENV-human protein interactions, we used high-throughput yeast two-hybrid assays to screen the 10 DENV proteins against a human liver activation domain library. From 45 DNA-binding domain clones containing either full-length viral genes or partially overlapping gene fragments, we identified 139 interactions between DENV and human proteins, the vast majority of which are novel. These interactions involved 105 human proteins, including six previously implicated in DENV infection and 45 linked to the replication of other viruses. Human proteins with functions related to the complement and coagulation cascade, the centrosome, and the cytoskeleton were enriched among the DENV interaction partners. To determine if the cellular proteins were required for DENV infection, we used small interfering RNAs to inhibit their expression. Six of 12 proteins targeted (CALR, DDX3X, ERC1, GOLGA2, TRIP11, and UBE2I) caused a significant decrease in the replication of a DENV replicon. We further showed that calreticulin colocalized with viral dsRNA and with the viral NS3 and NS5 proteins in DENV-infected cells, consistent with a direct role for calreticulin in DENV replication. Human proteins that interacted with DENV had significantly higher average degree and betweenness than expected by chance, which provides additional support for the hypothesis that viruses preferentially target cellular proteins that occupy central position in the human protein interaction network. This study provides a valuable starting point for additional investigations into the roles of human proteins in DENV infection. PMID:21911577
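
The comparison of average degree "than expected by chance" in the abstract above can be illustrated with a simple permutation test. This is a minimal sketch, not the authors' procedure: the function names, the random-sampling null model, and the toy network are assumptions made for illustration, and betweenness is not covered.

```python
import random
from collections import defaultdict


def degrees(edges):
    """Return {node: degree} for an undirected edge list (self-loops ignored)."""
    neighbors = defaultdict(set)
    for a, b in edges:
        if a != b:
            neighbors[a].add(b)
            neighbors[b].add(a)
    return {node: len(nbrs) for node, nbrs in neighbors.items()}


def mean_degree_pvalue(edges, targets, n_perm=10000, seed=0):
    """Compare the mean degree of `targets` with random node sets of equal size.

    Returns (observed mean degree, one-sided empirical p-value).
    Hypothetical null model: nodes drawn uniformly without replacement.
    """
    deg = degrees(edges)
    nodes = sorted(deg)
    hits = [t for t in targets if t in deg]
    observed = sum(deg[t] for t in hits) / len(hits)
    rng = random.Random(seed)
    at_least = 0
    for _ in range(n_perm):
        sample = rng.sample(nodes, len(hits))
        if sum(deg[n] for n in sample) / len(hits) >= observed:
            at_least += 1
    # Add-one correction keeps the empirical p-value strictly positive.
    return observed, (at_least + 1) / (n_perm + 1)


# Toy network (illustrative only): one hub connected to five partners.
toy_edges = [("HUB", x) for x in "ABCDE"] + [("A", "B")]
observed, p_value = mean_degree_pvalue(toy_edges, ["HUB"])
```

On real data the edge list would come from a human protein interaction network and `targets` would be the virus-interacting proteins; a degree-preserving null model may be more appropriate than uniform sampling.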

  10. A physical interaction network of dengue virus and human proteins.

    Science.gov (United States)

    Khadka, Sudip; Vangeloff, Abbey D; Zhang, Chaoying; Siddavatam, Prasad; Heaton, Nicholas S; Wang, Ling; Sengupta, Ranjan; Sahasrabudhe, Sudhir; Randall, Glenn; Gribskov, Michael; Kuhn, Richard J; Perera, Rushika; LaCount, Douglas J

    2011-12-01

    Dengue virus (DENV), an emerging mosquito-transmitted pathogen capable of causing severe disease in humans, interacts with host cell factors to create a more favorable environment for replication. However, few interactions between DENV and human proteins have been reported to date. To identify DENV-human protein interactions, we used high-throughput yeast two-hybrid assays to screen the 10 DENV proteins against a human liver activation domain library. From 45 DNA-binding domain clones containing either full-length viral genes or partially overlapping gene fragments, we identified 139 interactions between DENV and human proteins, the vast majority of which are novel. These interactions involved 105 human proteins, including six previously implicated in DENV infection and 45 linked to the replication of other viruses. Human proteins with functions related to the complement and coagulation cascade, the centrosome, and the cytoskeleton were enriched among the DENV interaction partners. To determine if the cellular proteins were required for DENV infection, we used small interfering RNAs to inhibit their expression. Six of 12 proteins targeted (CALR, DDX3X, ERC1, GOLGA2, TRIP11, and UBE2I) caused a significant decrease in the replication of a DENV replicon. We further showed that calreticulin colocalized with viral dsRNA and with the viral NS3 and NS5 proteins in DENV-infected cells, consistent with a direct role for calreticulin in DENV replication. Human proteins that interacted with DENV had significantly higher average degree and betweenness than expected by chance, which provides additional support for the hypothesis that viruses preferentially target cellular proteins that occupy central position in the human protein interaction network. This study provides a valuable starting point for additional investigations into the roles of human proteins in DENV infection.

  11. Viral Genome DataBase: storing and analyzing genes and proteins from complete viral genomes.

    Science.gov (United States)

    Hiscock, D; Upton, C

    2000-05-01

    The Viral Genome DataBase (VGDB) contains detailed information of the genes and predicted protein sequences from 15 completely sequenced genomes of large (>100 kb) viruses (2847 genes). The data that is stored includes DNA sequence, protein sequence, GenBank and user-entered notes, molecular weight (MW), isoelectric point (pI), amino acid content, A + T%, nucleotide frequency, dinucleotide frequency and codon use. The VGDB is a MySQL database with a user-friendly Java GUI. Results of queries can be easily sorted by any of the individual parameters. The software and additional figures and information are available at http://athena.bioc.uvic.ca/genomes/index.html .
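
Two of the per-gene statistics listed above, A + T% and dinucleotide frequency, are simple to compute from a DNA sequence. The sketch below is illustrative only and is not taken from the VGDB software; the function names and the handling of ambiguous bases (skipped) are assumptions.

```python
from collections import Counter

VALID = set("ACGT")


def at_percent(seq):
    """Percentage of A and T among unambiguous bases (0.0 for an empty sequence)."""
    bases = [b for b in seq.upper() if b in VALID]
    if not bases:
        return 0.0
    return 100.0 * sum(b in "AT" for b in bases) / len(bases)


def dinucleotide_frequency(seq):
    """Relative frequency of each overlapping dinucleotide made of A/C/G/T only."""
    seq = seq.upper()
    pairs = [seq[i:i + 2] for i in range(len(seq) - 1)]
    pairs = [p for p in pairs if set(p) <= VALID]
    counts = Counter(pairs)
    total = len(pairs)
    return {p: c / total for p, c in counts.items()} if total else {}
```

For example, `at_percent("ATGC")` gives 50.0, and `dinucleotide_frequency("ATAT")` counts the overlapping pairs AT, TA, AT.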

  12. Interactions among tobacco sieve element occlusion (SEO) proteins.

    Science.gov (United States)

    Jekat, Stephan B; Ernst, Antonia M; Zielonka, Sascia; Noll, Gundula A; Prüfer, Dirk

    2012-12-01

    Angiosperms transport their photoassimilates through sieve tubes, which comprise longitudinally-connected sieve elements. In dicots and also some monocots, the sieve elements contain parietal structural proteins known as phloem proteins or P-proteins. Following injury, P-proteins disperse and accumulate as viscous plugs at the sieve plates to prevent the loss of valuable transport sugars. Tobacco (Nicotiana tabacum) P-proteins are multimeric complexes comprising subunits encoded by members of the SEO (sieve element occlusion) gene family. The existence of multiple subunits suggests that P-protein assembly involves interactions between SEO proteins, but this process is largely uncharacterized and it is unclear whether the different subunits perform unique roles or are redundant. We therefore extended our analysis of the tobacco P-proteins NtSEO1 and NtSEO2 to investigate potential interactions between them, and found that both proteins can form homomeric and heteromeric complexes in planta.

  13. dbPAF: an integrative database of protein phosphorylation in animals and fungi.

    Science.gov (United States)

    Ullah, Shahid; Lin, Shaofeng; Xu, Yang; Deng, Wankun; Ma, Lili; Zhang, Ying; Liu, Zexian; Xue, Yu

    2016-03-24

    Protein phosphorylation is one of the most important post-translational modifications (PTMs) and regulates a broad spectrum of biological processes. Recent progress in phosphoproteomic identification has generated a flood of phosphorylation sites, and the integration of these sites is an urgent need. In this work, we developed a curated database, dbPAF, containing known phosphorylation sites in H. sapiens, M. musculus, R. norvegicus, D. melanogaster, C. elegans, S. pombe and S. cerevisiae. From the scientific literature and public databases, we collected and integrated a total of 54,148 phosphoproteins with 483,001 phosphorylation sites. Multiple options are provided for accessing the data, and original references and other annotations are also present for each phosphoprotein. Based on the new data set, we computationally detected significantly over-represented sequence motifs around phosphorylation sites, predicted potential kinases that are responsible for the modification of collected phospho-sites, and evolutionarily analyzed phosphorylation conservation states across different species. Besides being largely consistent with previous reports, our results also suggest new features of phospho-regulation. Taken together, our database can be useful for further analyses of protein phosphorylation in human and other model organisms. The dbPAF database was implemented in PHP + MySQL and is freely available at http://dbpaf.biocuckoo.org.

  14. Globular and disordered-the non-identical twins in protein-protein interactions

    DEFF Research Database (Denmark)

    Teilum, Kaare; Olsen, Johan Gotthardt; Kragelund, Birthe Brandt

    2015-01-01

    as a strong determinant for their function. This has fostered the notion that IDPs bind with low affinity but high specificity. Here we have analyzed available detailed thermodynamic data for protein-protein interactions to put to the test if the thermodynamic profiles of IDP interactions differ from those...... of other protein-protein interactions. We find that ordered proteins and the disordered ones act as non-identical twins operating by similar principles but where the disordered protein complexes are on average less stable by 2.5 kcal mol(-1)....

  15. Mapping functional prion-prion protein interaction sites using prion protein based peptide-arrays

    NARCIS (Netherlands)

    Rigter, A.; Priem, J.; Timmers-Parohi, D.; Langeveld, J.; Bossers, A.

    2009-01-01

    Protein-protein interactions are at the basis of most if not all biological processes in living cells. Therefore, adapting existing techniques or developing new techniques to study interactions between proteins are of importance in elucidating which amino acid sequences contribute to these

  16. Imaging protein-protein interactions in living cells

    NARCIS (Netherlands)

    Hink, M.A.; Bisseling, T.; Visser, A.J.W.G.

    2002-01-01

    The complex organization of plant cells makes it likely that the molecular behaviour of proteins in the test tube and the cell is different. For this reason, it is essential though a challenge to study proteins in their natural environment. Several innovative microspectroscopic approaches provide

  17. Semi-supervised drug-protein interaction prediction from heterogeneous biological spaces.

    Science.gov (United States)

    Xia, Zheng; Wu, Ling-Yun; Zhou, Xiaobo; Wong, Stephen T C

    2010-09-13

    Predicting drug-protein interactions from heterogeneous biological data sources is a key step for in silico drug discovery. The difficulty of this prediction task lies in the rarity of known drug-protein interactions and myriad unknown interactions to be predicted. To meet this challenge, a manifold regularization semi-supervised learning method is presented to tackle this issue by using labeled and unlabeled information which often generates better results than using the labeled data alone. Furthermore, our semi-supervised learning method integrates known drug-protein interaction network information as well as chemical structure and genomic sequence data. Using the proposed method, we predicted certain drug-protein interactions on the enzyme, ion channel, GPCRs, and nuclear receptor data sets. Some of them are confirmed by the latest publicly available drug targets databases such as KEGG. We report encouraging results of using our method for drug-protein interaction network reconstruction which may shed light on the molecular interaction inference and new uses of marketed drugs.

  18. Potato leafroll virus structural proteins manipulate overlapping, yet distinct protein interaction networks during infection.

    Science.gov (United States)

    DeBlasio, Stacy L; Johnson, Richard; Sweeney, Michelle M; Karasev, Alexander; Gray, Stewart M; MacCoss, Michael J; Cilia, Michelle

    2015-06-01

    Potato leafroll virus (PLRV) produces a readthrough protein (RTP) via translational readthrough of the coat protein amber stop codon. The RTP functions as a structural component of the virion and as a nonincorporated protein in concert with numerous insect and plant proteins to regulate virus movement/transmission and tissue tropism. Affinity purification coupled to quantitative MS was used to generate protein interaction networks for a PLRV mutant that is unable to produce the read through domain (RTD) and compared to the known wild-type PLRV protein interaction network. By quantifying differences in the protein interaction networks, we identified four distinct classes of PLRV-plant interactions: those plant and nonstructural viral proteins interacting with assembled coat protein (category I); plant proteins in complex with both coat protein and RTD (category II); plant proteins in complex with the RTD (category III); and plant proteins that had higher affinity for virions lacking the RTD (category IV). Proteins identified as interacting with the RTD are potential candidates for regulating viral processes that are mediated by the RTP such as phloem retention and systemic movement and can potentially be useful targets for the development of strategies to prevent infection and/or viral transmission of Luteoviridae species that infect important crop species. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. Structural study of surfactant-dependent interaction with protein

    Energy Technology Data Exchange (ETDEWEB)

    Mehan, Sumit; Aswal, Vinod K., E-mail: vkaswal@barc.gov.in [Solid State Physics Division, Bhabha Atomic Research Centre, Mumbai 400 085 (India); Kohlbrecher, Joachim [Laboratory for Neutron Scattering, Paul Scherrer Institut, CH-5232 PSI Villigen (Switzerland)

    2015-06-24

    Small-angle neutron scattering (SANS) has been used to study the complex structure of anionic BSA protein with three different (cationic DTAB, anionic SDS and non-ionic C12E10) surfactants. These systems form very different surfactant-dependent complexes. We show that the structure of protein-surfactant complex is initiated by the site-specific electrostatic interaction between the components, followed by the hydrophobic interaction at high surfactant concentrations. It is also found that hydrophobic interaction is preferred over the electrostatic interaction in deciding the resultant structure of protein-surfactant complexes.

  20. Interaction between Vaccinium bracteatum Thunb. leaf pigment and rice proteins.

    Science.gov (United States)

    Wang, Li; Xu, Yuan; Zhou, Sumei; Qian, Haifeng; Zhang, Hui; Qi, Xiguang; Fan, Meihua

    2016-03-01

    In this study, we investigated the interaction of Vaccinium bracteatum Thunb. leaf (VBTL) pigment and rice proteins. In the presence of rice protein, VBTL pigment antioxidant activity and free polyphenol content decreased by 67.19% and 68.11%, respectively, and L* of the protein-pigment complex decreased significantly over time. L* values of albumin, globulin and glutelin during 60-min pigment exposure decreased by 55.00, 57.14, and 54.30%, respectively, indicating that these proteins had bound to the pigment. A significant difference in protein surface hydrophobicity was observed between rice proteins and pigment-protein complexes, indicating that hydrophobic interaction is a major binding mechanism between VBTL pigment and rice proteins. A significant difference in secondary structures between proteins and protein-pigment complexes was also uncovered, indicating that hydrogen bonding may be another mode of interaction between VBTL pigment and rice proteins. Our results indicate that VBTL pigment can stain rice proteins with hydrophobic and hydrogen interactions. Copyright © 2015 Elsevier Ltd. All rights reserved.

  1. Protein-lipid interactions: from membrane domains to cellular networks

    National Research Council Canada - National Science Library

    Tamm, Lukas K

    2005-01-01

    ... membranes is the lipid bilayer. Embedded in the fluid lipid bilayer are proteins of various shapes and traits. This volume illuminates from physical, chemical and biological angles the numerous - mostly quite weak - interactions between lipids, proteins, and proteins and lipids that define the delicate, highly dynamic and yet so stable fabri...

  2. Interaction between Protein, Phytate, and Microbial Phytase. In Vitro Studies

    NARCIS (Netherlands)

    Kies, A.K.; Jonge, de L.H.; Kemme, P.A.; Jongbloed, A.W.

    2006-01-01

    The interaction between protein and phytate was investigated in vitro using proteins extracted from five common feedstuffs and from casein. The appearance of naturally present soluble protein-phytate complexes in the feedstuffs, the formation of complexes at different pHs, and the degradation of

  3. NatalieQ: A web server for protein-protein interaction network querying

    NARCIS (Netherlands)

    El-Kebir, M.; Brandt, B.W.; Heringa, J.; Klau, G.W.

    2014-01-01

    Background Molecular interactions need to be taken into account to adequately model the complex behavior of biological systems. These interactions are captured by various types of biological networks, such as metabolic, gene-regulatory, signal transduction and protein-protein interaction networks.

  4. Casein - whey protein interactions in heated milk

    NARCIS (Netherlands)

    Vasbinder, Astrid Jolanda

    2002-01-01

    Heating of milk is an essential step in the processing of various dairy products, like for example yoghurt. A major consequence of the heat treatment is the denaturation of whey proteins, which either associate with the casein micelle or form soluble whey protein aggregates. By combination of

  5. The role of electrostatics in protein-protein interactions of a monoclonal antibody.

    Science.gov (United States)

    Roberts, D; Keeling, R; Tracka, M; van der Walle, C F; Uddin, S; Warwicker, J; Curtis, R

    2014-07-07

    Understanding how protein-protein interactions depend on the choice of buffer, salt, ionic strength, and pH is needed to have better control over protein solution behavior. Here, we have characterized the pH and ionic strength dependence of protein-protein interactions in terms of an interaction parameter kD obtained from dynamic light scattering and the osmotic second virial coefficient B22 measured by static light scattering. A simplified protein-protein interaction model based on a Baxter adhesive potential and an electric double layer force is used to separate out the contributions of longer-ranged electrostatic interactions from short-ranged attractive forces. The ionic strength dependence of protein-protein interactions for solutions at pH 6.5 and below can be accurately captured using a Deryaguin-Landau-Verwey-Overbeek (DLVO) potential to describe the double layer forces. In solutions at pH 9, attractive electrostatics occur over the ionic strength range of 5-275 mM. At intermediate pH values (7.25 to 8.5), there is a crossover effect characterized by a nonmonotonic ionic strength dependence of protein-protein interactions, which can be rationalized by the competing effects of long-ranged repulsive double layer forces at low ionic strength and a shorter ranged electrostatic attraction, which dominates above a critical ionic strength. The change of interactions from repulsive to attractive indicates a concomitant change in the angular dependence of protein-protein interaction from isotropic to anisotropic. In the second part of the paper, we show how the Baxter adhesive potential can be used to predict values of kD from fitting to B22 measurements, thus providing a molecular basis for the linear correlation between the two protein-protein interaction parameters.
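
For orientation, the short-ranged part of such a model can be written down in closed form. The expression below is the standard second virial coefficient of Baxter adhesive hard spheres on its own; it is a textbook result rather than a formula quoted from the paper, the symbols σ (hard-sphere diameter) and τ (stickiness parameter) are introduced here for illustration, and the electric double layer contribution used by the authors is not included.

```latex
\frac{B_{22}}{B_{22}^{\mathrm{HS}}} = 1 - \frac{1}{4\tau},
\qquad
B_{22}^{\mathrm{HS}} = \frac{2\pi\sigma^{3}}{3}
```

Smaller τ corresponds to stronger short-ranged attraction, and in this reduced picture B22 becomes negative (net attractive) once τ falls below 1/4.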

  6. Protein Charge and Mass Contribute to the Spatio-temporal Dynamics of Protein-Protein Interactions in a Minimal Proteome

    Science.gov (United States)

    Xu, Yu; Wang, Hong; Nussinov, Ruth; Ma, Buyong

    2013-01-01

    We constructed and simulated a ‘minimal proteome’ model using Langevin dynamics. It contains 206 essential protein types which were compiled from the literature. For comparison, we generated six proteomes with randomized concentrations. We found that the net charges and molecular weights of the proteins in the minimal genome are not random. The net charge of a protein decreases linearly with molecular weight, with small proteins being mostly positively charged and large proteins negatively charged. The protein copy numbers in the minimal genome have the tendency to maximize the number of protein-protein interactions in the network. Negatively charged proteins which tend to have larger sizes can provide large collision cross-section allowing them to interact with other proteins; on the other hand, the smaller positively charged proteins could have higher diffusion speed and are more likely to collide with other proteins. Proteomes with random charge/mass populations form less stable clusters than those with experimental protein copy numbers. Our study suggests that ‘proper’ populations of negatively and positively charged proteins are important for maintaining a protein-protein interaction network in a proteome. It is interesting to note that the minimal genome model based on the charge and mass of E. coli may have a larger protein-protein interaction network than that based on the lower organism M. pneumoniae. PMID:23420643

  7. The master two-dimensional gel database of human AMA cell proteins: towards linking protein and genome sequence and mapping information (update 1991)

    DEFF Research Database (Denmark)

    Celis, J E; Leffers, H; Rasmussen, H H

    1991-01-01

    autoantigens" and "cDNAs". For convenience we have included an alphabetical list of all known proteins recorded in this database. In the long run, the main goal of this database is to link protein and DNA sequencing and mapping information (Human Genome Program) and to provide an integrated picture......The master two-dimensional gel database of human AMA cells currently lists 3801 cellular and secreted proteins, of which 371 cellular polypeptides (306 IEF; 65 NEPHGE) were added to the master images during the last 10 months. These include: (i) very basic and acidic proteins that do not focus...

  8. Dendrimer-protein interactions versus dendrimer-based nanomedicine.

    Science.gov (United States)

    Shcharbin, Dzmitry; Shcharbina, Natallia; Dzmitruk, Volha; Pedziwiatr-Werbicka, Elzbieta; Ionov, Maksim; Mignani, Serge; de la Mata, F Javier; Gómez, Rafael; Muñoz-Fernández, Maria Angeles; Majoral, Jean-Pierre; Bryszewska, Maria

    2017-04-01

    Dendrimers are hyperbranched polymers belonging to the huge class of nanomedical devices. Their wide application in biology and medicine requires understanding of the fundamental mechanisms of their interactions with biological systems. Summarizing, electrostatic force plays the predominant role in dendrimer-protein interactions, especially with charged dendrimers. Other kinds of interactions have been proven, such as H-bonding, van der Waals forces, and even hydrophobic interactions. These interactions depend on the characteristics of both participants: flexibility and surface charge of a dendrimer, rigidity of protein structure and the localization of charged amino acids at its surface. pH and ionic strength of solutions can significantly modulate interactions. Ligands and cofactors attached to a protein can also change dendrimer-protein interactions. Binding of dendrimers to a protein can change its secondary structure, conformation, intramolecular mobility and functional activity. However, this strongly depends on rigidity versus flexibility of a protein's structure. In addition, the potential applications of dendrimers to nanomedicine are reviewed in relation to dendrimer-protein interactions. Copyright © 2017 Elsevier B.V. All rights reserved.

  9. Searching the protein structure database for ligand-binding site similarities using CPASS v.2

    Directory of Open Access Journals (Sweden)

    Caprez Adam

    2011-01-01

    Background: A recent analysis of protein sequences deposited in the NCBI RefSeq database indicates that ~8.5 million protein sequences are encoded in prokaryotic and eukaryotic genomes, where ~30% are explicitly annotated as "hypothetical" or "uncharacterized" protein. Our Comparison of Protein Active-Site Structures (CPASS v.2) database and software compares the sequence and structural characteristics of experimentally determined ligand binding sites to infer a functional relationship in the absence of global sequence or structure similarity. CPASS is an important component of our Functional Annotation Screening Technology by NMR (FAST-NMR) protocol and has been successfully applied to aid the annotation of a number of proteins of unknown function. Findings: We report a major upgrade to our CPASS software and database that significantly improves its broad utility. CPASS v.2 is designed with a layered architecture to increase flexibility and portability that also enables job distribution over the Open Science Grid (OSG) to increase speed. Similarly, the CPASS interface was enhanced to provide more user flexibility in submitting a CPASS query. CPASS v.2 now allows for both automatic and manual definition of ligand-binding sites and permits pair-wise, one versus all, one versus list, or list versus list comparisons. Solvent accessible surface area, ligand root-mean square difference, and Cβ distances have been incorporated into the CPASS similarity function to improve the quality of the results. The CPASS database has also been updated. Conclusions: CPASS v.2 is more than an order of magnitude faster than the original implementation, and allows for multiple simultaneous job submissions. Similarly, the CPASS database of ligand-defined binding sites has increased in size by ~38%, dramatically increasing the likelihood of a positive search result. The modification to the CPASS similarity function is effective in reducing CPASS similarity scores

  10. Interaction between α-Synuclein and Other Proteins in Neurodegenerative Disorders

    Directory of Open Access Journals (Sweden)

    Kurt A. Jellinger

    2011-01-01

    Protein aggregation is a common characteristic of many neurodegenerative disorders, and the interaction between pathological/toxic proteins to cause neurodegeneration is a hot topic of current neuroscience research. Despite clinical, genetic, and experimental differences, evidence increasingly indicates considerable overlap between synucleinopathies and tauopathies or other protein-misfolding diseases. Inclusions, characteristics of these disorders, also occurring in other neurodegenerative diseases, suggest interactions of pathological proteins engaging common downstream pathways. Novel findings that have shifted our understanding in the role of pathologic proteins in the pathogenesis of Parkinson and Alzheimer diseases have confirmed correlations/overlaps between these and other neurodegenerative disorders. The synergistic effects of α-synuclein, hyperphosphorylated tau, amyloid-β, and other pathologic proteins, and the underlying molecular pathogenic mechanisms, including induction and spread of protein aggregates, are critically reviewed, suggesting a dualism or triad of neurodegeneration in protein-misfolding disorders, although the etiology of most of these processes is still mysterious.

  11. Finding low-conductance sets with dense interactions (FLCD) for better protein complex prediction.

    Science.gov (United States)

    Wang, Yijie; Qian, Xiaoning

    2017-03-14

    Intuitively, proteins in the same protein complexes should highly interact with each other but rarely interact with the other proteins in protein-protein interaction (PPI) networks. Surprisingly, many existing computational algorithms do not directly detect protein complexes based on both of these topological properties. Most of them, depending on mathematical definitions of either "modularity" or "conductance", have their own limitations: modularity has an inherent resolution problem that causes it to ignore small protein complexes, and conductance characterizes the separability of complexes but fails to capture the interaction density within complexes. In this paper, we propose a two-step algorithm FLCD (Finding Low-Conductance sets with Dense interactions) to predict overlapping protein complexes with the desired topological structure, which is densely connected inside and well separated from the rest of the networks. First, FLCD detects well-separated subnetworks based on approximating a potential low-conductance set through a personalized PageRank vector from a protein and then solving a mixed integer programming (MIP) problem to find the minimum-conductance set within the identified low-conductance set. At the second step, the densely connected parts in those subnetworks are discovered as the protein complexes by solving another MIP problem that aims to find the dense subnetwork in the minimum-conductance set. Experiments on four large-scale yeast PPI networks from different public databases demonstrate that the complexes predicted by FLCD have better correspondence with the yeast protein complex gold standards than three other state-of-the-art algorithms (ClusterONE, LinkComm, and SR-MCL). Additionally, results of FLCD show higher biological relevance with respect to Gene Ontology (GO) terms by GO enrichment analysis.
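
The two topological quantities contrasted in this abstract, conductance (separation from the rest of the network) and internal interaction density, can be computed directly for any candidate node set. The sketch below uses the usual graph-theoretic definitions and is not the FLCD implementation; the function names and the toy network are assumptions, and the edge list is assumed to hold unique undirected edges without self-loops.

```python
def conductance(edges, subset):
    """Cut edges divided by the smaller of the two volumes (degree sums)."""
    subset = set(subset)
    cut = vol_in = vol_out = 0
    for a, b in edges:
        a_in, b_in = a in subset, b in subset
        if a_in != b_in:
            cut += 1
        # Each edge endpoint adds one unit of degree to its side's volume.
        vol_in += a_in + b_in
        vol_out += (not a_in) + (not b_in)
    smaller = min(vol_in, vol_out)
    return cut / smaller if smaller else float("inf")


def density(edges, subset):
    """Fraction of possible node pairs inside `subset` that are connected."""
    subset = set(subset)
    n = len(subset)
    if n < 2:
        return 0.0
    inside = sum(1 for a, b in edges if a in subset and b in subset)
    return inside / (n * (n - 1) / 2)


# Toy network (illustrative only): two triangles joined by a single bridge.
toy_edges = [(1, 2), (2, 3), (1, 3), (3, 4), (4, 5), (5, 6), (4, 6)]
phi = conductance(toy_edges, {1, 2, 3})
rho = density(toy_edges, {1, 2, 3})
```

A set such as `{1, 2, 3}` scores well on both criteria (low conductance, high density), which is the combination the abstract describes as the desired structure of a protein complex.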

  12. Protein-material interactions: From micro-to-nano scale

    International Nuclear Information System (INIS)

    Tsapikouni, Theodora S.; Missirlis, Yannis F.

    2008-01-01

    The article presents a survey on the significance of protein-material interactions, the mechanisms which control them and the techniques used for their study. Protein-surface interactions play a key role in regenerative medicine, drug delivery, biosensor technology and chromatography, and are also related to various undesired effects such as biofouling and bio-prosthetic malfunction. Although the effects of protein-surface interaction concern the micro-scale, sometimes being visible even to the naked eye, they derive from biophysical events at the nano-scale. The sequential steps for protein adsorption involve events at the single biomolecule level and the forces driving or inhibiting protein adsorption act at the molecular level too. Following the scaling of protein-surface interactions, various techniques have been developed for their study at both the micro- and nano-scale. Protein labelling with radioisotopes or fluorescent probes, colorimetric assays and the quartz crystal microbalance were the first techniques used to monitor protein adsorption isotherms, while the surface force apparatus was used to measure the interaction forces between protein layers at the micro-scale. Recently, more elaborate techniques like total internal reflection fluorescence (TIRF), Fourier transform infrared spectroscopy (FTIR), surface plasmon resonance, Raman spectroscopy, ellipsometry and time of flight secondary ion mass spectrometry (ToF-SIMS) have been applied for the investigation of protein density, structure or orientation at the interfaces. However, a turning point in the study of protein interactions with surfaces was the invention and widespread use of atomic force microscopy (AFM), which can both image single protein molecules on surfaces and directly measure the interaction force

  13. Protein backbone chemical shifts predicted from searching a database for torsion angle and sequence homology

    International Nuclear Information System (INIS)

    Shen Yang; Bax, Ad

    2007-01-01

    Chemical shifts of nuclei in or attached to a protein backbone are exquisitely sensitive to their local environment. A computer program, SPARTA, is described that uses this correlation with local structure to predict protein backbone chemical shifts, given an input three-dimensional structure, by searching a newly generated database for triplets of adjacent residues that provide the best match in φ/ψ/χ1 torsion angles and sequence similarity to the query triplet of interest. The database contains ¹⁵N, ¹HN, ¹Hα, ¹³Cα, ¹³Cβ and ¹³C' chemical shifts for 200 proteins for which a high resolution X-ray (≤2.4 Å) structure is available. The relative importance of the weighting factors for the φ/ψ/χ1 angles and sequence similarity was optimized empirically. The weighted, average secondary shifts of the central residues in the 20 best-matching triplets, after inclusion of nearest neighbor, ring current, and hydrogen bonding effects, are used to predict chemical shifts for the protein of known structure. Validation shows good agreement between the SPARTA-predicted and experimental shifts, with standard deviations of 2.52, 0.51, 0.27, 0.98, 1.07 and 1.08 ppm for ¹⁵N, ¹HN, ¹Hα, ¹³Cα, ¹³Cβ and ¹³C', respectively, including outliers.

  14. Predicting the binding patterns of hub proteins: a study using yeast protein interaction networks.

    Directory of Open Access Journals (Sweden)

    Carson M Andorf

    Full Text Available Protein-protein interactions are critical to elucidating the role played by individual proteins in important biological pathways. Of particular interest are hub proteins that can interact with large numbers of partners and often play essential roles in cellular control. Depending on the number of binding sites, protein hubs can be classified at a structural level as singlish-interface hubs (SIH) with one or two binding sites, or multiple-interface hubs (MIH) with three or more binding sites. In terms of kinetics, hub proteins can be classified as date hubs (i.e., interact with different partners at different times or locations) or party hubs (i.e., simultaneously interact with multiple partners). Our approach works in 3 phases: Phase I classifies if a protein is likely to bind with another protein. Phase II determines if a protein-binding (PB) protein is a hub. Phase III classifies PB proteins as singlish-interface versus multiple-interface hubs and date versus party hubs. At each stage, we use sequence-based predictors trained using several standard machine learning techniques. Our method is able to predict whether a protein is a protein-binding protein with an accuracy of 94% and a correlation coefficient of 0.87; identify hubs from non-hubs with 100% accuracy for 30% of the data; distinguish date hubs/party hubs with 69% accuracy and area under ROC curve of 0.68; and SIH/MIH with 89% accuracy and area under ROC curve of 0.84. Because our method is based on sequence information alone, it can be used even in settings where reliable protein-protein interaction data or structures of protein-protein complexes are unavailable to obtain useful insights into the functional and evolutionary characteristics of proteins and their interactions. We provide a web server for our three-phase approach: http://hybsvm.gdcb.iastate.edu.
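    The three-phase flow described in this abstract can be sketched as a simple cascade. This is only an illustration of the control flow, not the authors' implementation: the four predictor functions below are hypothetical placeholder rules standing in for trained sequence-based models, and running Phase III only on predicted hubs is an assumption.

```python
from dataclasses import dataclass
from typing import Callable, Optional

Predictor = Callable[[str], bool]  # sequence -> yes/no decision


@dataclass
class HubCall:
    protein_binding: bool
    hub: Optional[bool] = None       # None means the phase was not reached
    interface: Optional[str] = None  # "SIH" or "MIH"
    timing: Optional[str] = None     # "date" or "party"


def three_phase(seq: str, binds: Predictor, is_hub: Predictor,
                multi_interface: Predictor, party: Predictor) -> HubCall:
    # Phase I: is the protein likely to bind another protein?
    if not binds(seq):
        return HubCall(protein_binding=False)
    # Phase II: is the protein-binding protein a hub?
    if not is_hub(seq):
        return HubCall(protein_binding=True, hub=False)
    # Phase III (assumed to run on predicted hubs only):
    # structural class (SIH/MIH) and kinetic class (date/party).
    return HubCall(True, True,
                   "MIH" if multi_interface(seq) else "SIH",
                   "party" if party(seq) else "date")


# Hypothetical placeholder rules, NOT real predictors.
def toy_binds(s: str) -> bool: return len(s) > 5
def toy_hub(s: str) -> bool: return s.count("K") >= 2
def toy_multi(s: str) -> bool: return len(s) > 10
def toy_party(s: str) -> bool: return s.startswith("M")


call = three_phase("MKKLAVTGSSQE", toy_binds, toy_hub, toy_multi, toy_party)
print(call)
```

    Returning early after Phase I or II leaves the later fields as None, which keeps "not evaluated" distinct from a negative prediction.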

  15. Identification and correction of abnormal, incomplete and mispredicted proteins in public databases

    Directory of Open Access Journals (Sweden)

    Bányai László

    2008-08-01

    Full Text Available Background: Despite significant improvements in computational annotation of genomes, sequences of abnormal, incomplete or incorrectly predicted genes and proteins remain abundant in public databases. Since the majority of incomplete, abnormal or mispredicted entries are not annotated as such, these errors seriously affect the reliability of these databases. Here we describe the MisPred approach that may provide an efficient means for the quality control of databases. The current version of the MisPred approach uses five distinct routines for identifying abnormal, incomplete or mispredicted entries based on the principle that a sequence is likely to be incorrect if some of its features conflict with our current knowledge about protein-coding genes and proteins: (i) conflict between the predicted subcellular localization of proteins and the absence of the corresponding sequence signals; (ii) presence of extracellular and cytoplasmic domains and the absence of transmembrane segments; (iii) co-occurrence of extracellular and nuclear domains; (iv) violation of domain integrity; (v) chimeras encoded by two or more genes located on different chromosomes. Results: Analyses of predicted EnsEMBL protein sequences of nine deuterostome (Homo sapiens, Mus musculus, Rattus norvegicus, Monodelphis domestica, Gallus gallus, Xenopus tropicalis, Fugu rubripes, Danio rerio and Ciona intestinalis) and two protostome species (Caenorhabditis elegans and Drosophila melanogaster) have revealed that the absence of expected signal peptides and violation of domain integrity account for the majority of mispredictions. Analyses of sequences predicted by NCBI's GNOMON annotation pipeline show that the rates of mispredictions are comparable to those of EnsEMBL. Interestingly, even the manually curated UniProtKB/Swiss-Prot dataset is contaminated with mispredicted or abnormal proteins, although to a much lesser extent than UniProtKB/TrEMBL or the EnsEMBL or GNOMON

  16. Bam35 tectivirus intraviral interaction map unveils new function and localization of phage ORFan proteins.

    Science.gov (United States)

    Berjón-Otero, Mónica; Lechuga, Ana; Mehla, Jitender; Uetz, Peter; Salas, Margarita; Redrejo-Rodríguez, Modesto

    2017-07-26

    Tectiviridae comprises a group of tail-less, icosahedral, membrane-containing bacteriophages that can be divided into two groups by their hosts, either Gram-negative or Gram-positive bacteria. While the first group is composed of PRD1 and nearly identical well characterized lytic viruses, the second one includes more variable temperate phages, like GIL16 or Bam35, whose hosts are Bacillus cereus and related Gram-positive bacteria. In the genome of Bam35, nearly half of the 32 annotated open reading frames (ORFs) have no homologs in databases (ORFans), being putative proteins of unknown function, which hinders the understanding of their biology. With the aim of increasing the knowledge of the viral proteome, we carried out a comprehensive yeast two-hybrid analysis among all the putative proteins encoded by the Bam35 genome. The resulting protein interactome comprises 76 unique interactions among 24 proteins, of which 12 have an unknown function. These results suggested that the P17 protein is the minor capsid protein of Bam35 and P24 is the penton protein, the latter also being supported by iterative threading protein modeling. Moreover, the inner membrane transglycosylase protein P26 could have an additional structural role. We also detected interactions involving non-structural proteins, such as the DNA binding protein P1 and the genome terminal protein (P4), which was confirmed by co-immunoprecipitation of recombinant proteins. Altogether, our results provide a functional view of the Bam35 viral proteome, with a focus on the composition and organization of the viral particle. IMPORTANCE: Tail-less viruses of the family Tectiviridae can infect commensal and pathogenic Gram-positive and Gram-negative bacteria. Moreover, they have been proposed to be at the evolutionary origin of several groups of large eukaryotic DNA viruses and self-replicating plasmids. However, due to their ancient origin and complex diversity, many tectiviral proteins are ORFans of unknown

  17. A credit-card library approach for disrupting protein-protein interactions.

    Science.gov (United States)

    Xu, Yang; Shi, Jin; Yamamoto, Noboru; Moss, Jason A; Vogt, Peter K; Janda, Kim D

    2006-04-15

    Protein-protein interfaces are prominent in many therapeutically important targets. Using small organic molecules to disrupt protein-protein interactions is a current challenge in chemical biology. An important example of protein-protein interactions is provided by the Myc protein, which is frequently deregulated in human cancers. Myc belongs to the family of basic helix-loop-helix leucine zipper (bHLH-ZIP) transcription factors. It is biologically active only as a heterodimer with the bHLH-ZIP protein Max. Herein, we report a new strategy for the disruption of protein-protein interactions that has been corroborated through the design and synthesis of a small parallel library composed of 'credit-card' compounds. These compounds are derived from a planar, aromatic scaffold and functionalized with four points of diversity. From a 285-membered library, several hits were obtained that disrupted the c-Myc-Max interaction and cellular functions of c-Myc. The IC50 values determined for this small focused library for the disruption of Myc-Max dimerization are quite potent, especially since small molecule antagonists of protein-protein interactions are notoriously difficult to find. Furthermore, several of the compounds were active at the cellular level as shown by their biological effects on Myc action in chicken embryo fibroblast assays. In light of our findings, this approach is considered a valuable addition to the armamentarium of new molecules being developed to interact with protein-protein interfaces. Finally, this strategy for disrupting protein-protein interactions should prove applicable to other families of proteins.

  18. Affinity purification combined with mass spectrometry to identify herpes simplex virus protein-protein interactions.

    Science.gov (United States)

    Meckes, David G

    2014-01-01

    The identification and characterization of herpes simplex virus protein interaction complexes are fundamental to understanding the molecular mechanisms governing the replication and pathogenesis of the virus. Recent advances in affinity-based methods, mass spectrometry configurations, and bioinformatics tools have greatly increased the quantity and quality of protein-protein interaction datasets. In this chapter, detailed and reliable methods that can easily be implemented are presented for the identification of protein-protein interactions using cryogenic cell lysis, affinity purification, trypsin digestion, and mass spectrometry.

  19. Unveiling protein functions through the dynamics of the interaction network.

    Directory of Open Access Journals (Sweden)

    Irene Sendiña-Nadal

    Full Text Available Protein interaction networks have become a tool to study biological processes, either for predicting molecular functions or for designing proper new drugs to regulate the main biological interactions. Furthermore, such networks are known to be organized in sub-networks of proteins contributing to the same cellular function. However, protein function prediction is not accurate, and each protein has traditionally been assigned to only one function by the network formalism. By considering the network of the physical interactions between proteins of the yeast together with a manual and single functional classification scheme, we introduce a method able to reveal important information on protein function, at both micro- and macro-scale. In particular, the inspection of the properties of oscillatory dynamics on top of the protein interaction network leads to the identification of misclassification problems in protein function assignments, as well as to the correct identification of protein functions. We also demonstrate that our approach can give a network representation of the meta-organization of biological processes by unraveling the interactions between different functional classes.

  20. Characterization of interactions between inclusion membrane proteins from Chlamydia trachomatis

    Directory of Open Access Journals (Sweden)

    Emilie Gauliard

    2015-02-01

    Full Text Available Chlamydiae are obligate intracellular pathogens of eukaryotes. The bacteria grow in an intracellular vesicle called an inclusion, the membrane of which is heavily modified by chlamydial proteins called Incs (Inclusion membrane proteins). Incs represent 7-10% of the genomes of Chlamydia and, given their localization at the interface between the host and the pathogen, likely play a key role in the development and pathogenesis of the bacterium. However, their functions remain largely unknown. Here, we characterized the interaction properties between various Inc proteins of C. trachomatis, using a bacterial two-hybrid (BACTH) method suitable for detecting interactions between integral membrane proteins. To validate this approach, we first examined the oligomerization properties of the well-characterized IncA protein and showed that both the cytoplasmic domain and the transmembrane region independently contribute to IncA oligomerization. We then analyzed a set of Inc proteins and identified novel interactions between these components. Two small Incs, IncF and Ct222, were found here to interact with many other Inc proteins and may thus represent interaction nodes within the inclusion membrane. Our data suggest that the Inc proteins may assemble in the membrane of the inclusion to form specific multi-molecular complexes in a hierarchical and temporal manner. These studies will help to better define the putative functions of the Inc proteins in the infectious process of Chlamydia.

  1. Alignment of non-covalent interactions at protein-protein interfaces.

    Directory of Open Access Journals (Sweden)

    Hongbo Zhu

    Full Text Available BACKGROUND: The study and comparison of protein-protein interfaces is essential for the understanding of the mechanisms of interaction between proteins. While there are many methods for comparing protein structures and protein binding sites, so far no methods have been reported for comparing the geometry of non-covalent interactions occurring at protein-protein interfaces. METHODOLOGY/PRINCIPAL FINDINGS: Here we present a method for aligning non-covalent interactions between different protein-protein interfaces. The method aligns the vector representations of van der Waals interactions and hydrogen bonds based on their geometry. The method has been applied to a dataset which comprises a variety of protein-protein interfaces. The alignments are consistent to a large extent with the results obtained using two other complementary approaches. In addition, we apply the method to three examples of protein mimicry. The method successfully aligns respective interfaces and allows for recognizing conserved interface regions. CONCLUSIONS/SIGNIFICANCE: The Galinter method has been validated in the comparison of interfaces in which homologous subunits are involved, including cases of mimicry. The method is also applicable to comparing interfaces involving non-peptidic compounds. Galinter assists users in identifying local interface regions with similar patterns of non-covalent interactions. This is particularly relevant to the investigation of the molecular basis of interaction mimicry.

  2. Classification of protein-protein interaction full-text documents using text and citation network features.

    Science.gov (United States)

    Kolchinsky, Artemy; Abi-Haidar, Alaa; Kaur, Jasleen; Hamed, Ahmed Abdeen; Rocha, Luis M

    2010-01-01

    We participated (as Team 9) in the Article Classification Task of the BioCreative II.5 Challenge: binary classification of full-text documents relevant for protein-protein interaction. We used two distinct classifiers for the online and offline challenges: 1) the lightweight Variable Trigonometric Threshold (VTT) linear classifier we successfully introduced in BioCreative 2 for binary classification of abstracts and 2) a novel Naive Bayes classifier using features from the citation network of the relevant literature. We supplemented the supplied training data with full-text documents from the MIPS database. The lightweight VTT classifier was very competitive in this new full-text scenario: it was a top-performing submission in this task, taking into account the rank product of the Area Under the interpolated precision and recall Curve, Accuracy, Balanced F-Score, and Matthews Correlation Coefficient performance measures. The novel citation network classifier for the biomedical text mining domain, while not a top performing classifier in the challenge, performed above the central tendency of all submissions, and therefore indicates a promising new avenue to investigate further in bibliome informatics.
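    As an illustration of how a rank product can combine several performance measures into one ordering, here is a minimal sketch. It assumes higher is better for every measure and multiplies each submission's per-measure rank; the team names and scores are invented toy values, not results from the challenge, and the challenge's exact tie handling is not specified here.

```python
from math import prod


def rank_product(scores: dict[str, dict[str, float]]) -> dict[str, int]:
    """scores[team][measure], higher is better.

    Returns the product of each team's per-measure ranks (1 = best),
    so a lower rank product means a better overall standing.
    """
    teams = list(scores)
    measures = list(next(iter(scores.values())))
    ranks: dict[str, list[int]] = {t: [] for t in teams}
    for m in measures:
        # Stable sort: ties keep input order (a simplifying assumption).
        ordered = sorted(teams, key=lambda t: scores[t][m], reverse=True)
        for position, team in enumerate(ordered, start=1):
            ranks[team].append(position)
    return {t: prod(r) for t, r in ranks.items()}


# Invented toy scores for three hypothetical submissions.
toy = {
    "A": {"auc": 0.9, "acc": 0.8, "f": 0.7, "mcc": 0.6},
    "B": {"auc": 0.8, "acc": 0.9, "f": 0.6, "mcc": 0.5},
    "C": {"auc": 0.7, "acc": 0.7, "f": 0.5, "mcc": 0.4},
}
result = rank_product(toy)
print(result)
```

    Multiplying ranks rather than averaging raw scores makes the combination insensitive to the different scales of the four measures.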

  3. Construction and analysis of protein-protein interaction network correlated with ankylosing spondylitis.

    Science.gov (United States)

    Kanwal, Attiya; Fazal, Sahar

    2018-01-05

    Ankylosing spondylitis (AS), a systemic disease, is a form of progressive joint inflammation that mostly affects the spine. However, it frequently causes inflammation in other joints away from the spine, as well as in organs such as the eyes, heart, lungs, and kidneys. It is an autoimmune disease that may be triggered by certain types of bacterial or viral infections that initiate an immune response that does not shut off after the infection has resolved. The specific cause of ankylosing spondylitis is unknown, but genetics plays a large part in this condition. The emerging tools of network medicine offer a platform to investigate a complex disease at the systems level. In this study, we aimed to identify the key proteins and the biological regulatory pathways involved in AS and to further investigate the molecular connectivity between these pathways by topological analysis of the protein-protein interaction (PPI) network. The extended network, comprising 93 nodes and 199 interactions, was retrieved from the STRING database, along with some separate small networks. Twenty-four proteins with high betweenness centrality (BC) at a threshold of 0.01 and 55 proteins with large degree at a threshold of 1 were identified. CD4, with the highest BC and closeness centrality, is located at the centre of the network. The backbone network derived from high-BC proteins presents a clear visual overview showing all important regulatory pathways for AS and the crosstalk between them. The findings of this research suggest that AS variation is orchestrated by an integrated PPI network centered on CD4 out of 93 nodes.

  4. Visualization of Host-Polerovirus Interaction Topologies Using Protein Interaction Reporter Technology.

    Science.gov (United States)

    DeBlasio, Stacy L; Chavez, Juan D; Alexander, Mariko M; Ramsey, John; Eng, Jimmy K; Mahoney, Jaclyn; Gray, Stewart M; Bruce, James E; Cilia, Michelle

    2016-02-15

    Demonstrating direct interactions between host and virus proteins during infection is a major goal and challenge for the field of virology. Most protein interactions are not binary or easily amenable to structural determination. Using infectious preparations of a polerovirus (Potato leafroll virus [PLRV]) and protein interaction reporter (PIR), a revolutionary technology that couples a mass spectrometric-cleavable chemical cross-linker with high-resolution mass spectrometry, we provide the first report of a host-pathogen protein interaction network that includes data-derived, topological features for every cross-linked site that was identified. We show that PLRV virions have hot spots of protein interaction and multifunctional surface topologies, revealing how these plant viruses maximize their use of binding interfaces. Modeling data, guided by cross-linking constraints, suggest asymmetric packing of the major capsid protein in the virion, which supports previous epitope mapping studies. Protein interaction topologies are conserved with other species in the Luteoviridae and with unrelated viruses in the Herpesviridae and Adenoviridae. Functional analysis of three PLRV-interacting host proteins in planta using a reverse-genetics approach revealed a complex, molecular tug-of-war between host and virus. Structural mimicry and diversifying selection, hallmarks of host-pathogen interactions, were identified within host and viral binding interfaces predicted by our models. These results illuminate the functional diversity of the PLRV-host protein interaction network and demonstrate the usefulness of PIR technology for precision mapping of functional host-pathogen protein interaction topologies. The exterior shape of a plant virus and its interacting host and insect vector proteins determine whether a virus will be transmitted by an insect or infect a specific host. Gaining this information is difficult and requires years of experimentation. We used protein interaction

  5. Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database

    KAUST Repository

    Komatsu, Setsuko

    2017-05-10

    The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max ‘Enrei’). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. Biological significance: The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all

  6. Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database.

    Science.gov (United States)

    Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi

    2017-06-23

    The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max 'Enrei'). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all predicted proteins from

  7. Integration of gel-based and gel-free proteomic data for functional analysis of proteins through Soybean Proteome Database

    KAUST Repository

    Komatsu, Setsuko; Wang, Xin; Yin, Xiaojian; Nanjo, Yohei; Ohyanagi, Hajime; Sakata, Katsumi

    2017-01-01

    The Soybean Proteome Database (SPD) stores data on soybean proteins obtained with gel-based and gel-free proteomic techniques. The database was constructed to provide information on proteins for functional analyses. The majority of the data is focused on soybean (Glycine max ‘Enrei’). The growth and yield of soybean are strongly affected by environmental stresses such as flooding. The database was originally constructed using data on soybean proteins separated by two-dimensional polyacrylamide gel electrophoresis, which is a gel-based proteomic technique. Since 2015, the database has been expanded to incorporate data obtained by label-free mass spectrometry-based quantitative proteomics, which is a gel-free proteomic technique. Here, the portions of the database consisting of gel-free proteomic data are described. The gel-free proteomic database contains 39,212 proteins identified in 63 sample sets, such as temporal and organ-specific samples of soybean plants grown under flooding stress or non-stressed conditions. In addition, data on organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored. Furthermore, the database integrates multiple omics data such as genomics, transcriptomics, metabolomics, and proteomics. The SPD database is accessible at http://proteome.dc.affrc.go.jp/Soybean/. Biological significance: The Soybean Proteome Database stores data obtained from both gel-based and gel-free proteomic techniques. The gel-free proteomic database comprises 39,212 proteins identified in 63 sample sets, such as different organs of soybean plants grown under flooding stress or non-stressed conditions in a time-dependent manner. In addition, organellar proteins identified in mitochondria, nuclei, and endoplasmic reticulum are stored in the gel-free proteomics database. A total of 44,704 proteins, including 5490 proteins identified using a gel-based proteomic technique, are stored in the SPD. It accounts for approximately 80% of all

  8. Topological and functional properties of the small GTPases protein interaction network.

    Directory of Open Access Journals (Sweden)

    Anna Delprato

    Full Text Available Small GTP binding proteins of the Ras superfamily (Ras, Rho, Rab, Arf, and Ran) regulate key cellular processes such as signal transduction, cell proliferation, cell motility, and vesicle transport. A great deal of experimental evidence supports the existence of signaling cascades and feedback loops within and among the small GTPase subfamilies suggesting that these proteins function in a coordinated and cooperative manner. The interplay occurs largely through association with bi-partite regulatory and effector proteins but can also occur through the active form of the small GTPases themselves. In order to understand the connectivity of the small GTPases signaling routes, a systems-level approach that analyzes data describing direct and indirect interactions was used to construct the small GTPases protein interaction network. The data were curated from the Search Tool for the Retrieval of Interacting Genes (STRING) database and include only experimentally validated interactions. The network method enables the conceptualization of the overall structure as well as the underlying organization of the protein-protein interactions. The interaction network described here is comprised of 778 nodes and 1943 edges and has a scale-free topology. Rac1, Cdc42, RhoA, and HRas are identified as the hubs. Ten sub-network motifs are also identified in this study with themes in apoptosis, cell growth/proliferation, vesicle traffic, cell adhesion/junction dynamics, the nicotinamide adenine dinucleotide phosphate (NADPH) oxidase response, transcription regulation, receptor-mediated endocytosis, gene silencing, and growth factor signaling. Bottleneck proteins that bridge signaling paths and proteins that overlap in multiple small GTPase networks are described along with the functional annotation of all proteins in the network.
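    To make the network terms in this abstract concrete, the sketch below computes node degrees from an undirected edge list, flags high-degree nodes as hubs, and works out the mean degree implied by the reported 778 nodes and 1943 edges (2 x 1943 / 778, about 4.99). The edge list uses generic node labels and the degree cutoff is an arbitrary assumption; neither comes from the study.

```python
from collections import Counter


def degrees(edges: list[tuple[str, str]]) -> Counter:
    """Count how many edges touch each node of an undirected graph."""
    deg: Counter = Counter()
    for a, b in edges:
        deg[a] += 1
        deg[b] += 1
    return deg


def hubs(edges: list[tuple[str, str]], min_degree: int) -> list[str]:
    """Nodes whose degree reaches an (assumed) hub cutoff, sorted by name."""
    return sorted(n for n, d in degrees(edges).items() if d >= min_degree)


# Generic toy graph; labels are placeholders, not real proteins.
toy_edges = [("a", "b"), ("a", "c"), ("a", "d"), ("b", "c"), ("d", "e")]
toy_hubs = hubs(toy_edges, min_degree=3)

# Mean degree of an undirected graph is 2 * edges / nodes.
mean_degree = 2 * 1943 / 778
print(toy_hubs, round(mean_degree, 2))
```

    In a scale-free network most nodes sit near or below the mean degree while a few hubs carry far more connections, which is why a fixed cutoff or top-k ranking is a common way to list hubs.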

  9. Multi-level machine learning prediction of protein–protein interactions in Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Julian Zubek

    2015-07-01

    Full Text Available Accurate identification of protein–protein interactions (PPI) is the key step in understanding proteins’ biological functions, which are typically context-dependent. Many existing PPI predictors rely on aggregated features from protein sequences; however, only a few methods exploit local information about specific residue contacts. In this work we present a two-stage machine learning approach for prediction of protein–protein interactions. We start with the carefully filtered data on protein complexes available for Saccharomyces cerevisiae in the Protein Data Bank (PDB) database. First, we build linear descriptions of interacting and non-interacting sequence segment pairs based on their inter-residue distances. Secondly, we train machine learning classifiers to predict binary segment interactions for any two short sequence fragments. The final prediction of the protein–protein interaction is done using the 2D matrix representation of all-against-all possible interacting sequence segments of both analysed proteins. The level-I predictor achieves 0.88 AUC for micro-scale, i.e., residue-level prediction. The level-II predictor improves the results further by a more complex learning paradigm. We perform a 30-fold macro-scale, i.e., protein-level, cross-validation experiment. The level-II predictor using PSIPRED-predicted secondary structure reaches 0.70 precision, 0.68 recall, and 0.70 AUC, whereas other popular methods provide results below the 0.6 threshold (recall, precision, AUC). Our results demonstrate that the multi-scale sequence feature aggregation procedure is able to improve the machine learning results by more than 10% as compared to other sequence representations. Prepared datasets and source code for our experimental pipeline are freely available for download from: http://zubekj.github.io/mlppi/ (open source Python implementation, OS independent).
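    The all-against-all segment matrix mentioned in this abstract can be illustrated with a small sketch. Everything specific here is an assumption made for illustration: the sliding-window segmentation, the toy opposite-charge scoring rule (standing in for a trained level-I classifier), and the use of the matrix maximum as a protein-level summary. The published pipeline at the URL above should be consulted for the actual method.

```python
from typing import Callable

# Toy residue charges; unlisted residues count as neutral.
CHARGE = {"K": 1, "R": 1, "D": -1, "E": -1}


def segments(seq: str, width: int) -> list[str]:
    """All overlapping windows of the given width."""
    return [seq[i:i + width] for i in range(len(seq) - width + 1)]


def toy_score(a: str, b: str) -> float:
    """Hypothetical stand-in for a segment-pair classifier:
    fraction of aligned positions with opposite charges."""
    hits = sum(CHARGE.get(x, 0) * CHARGE.get(y, 0) == -1 for x, y in zip(a, b))
    return hits / len(a)


def segment_matrix(seq_a: str, seq_b: str, width: int,
                   score: Callable[[str, str], float]) -> list[list[float]]:
    """2D matrix of scores for every segment of A against every segment of B."""
    return [[score(a, b) for b in segments(seq_b, width)]
            for a in segments(seq_a, width)]


def protein_level_score(matrix: list[list[float]]) -> float:
    """Assumed aggregation: the best-scoring segment pair."""
    return max(max(row) for row in matrix)


matrix = segment_matrix("KKAD", "EEAK", width=2, score=toy_score)
print(matrix, protein_level_score(matrix))
```

    A second-stage learner could take features of the whole matrix rather than a single maximum; the maximum is used here only to keep the sketch short.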

  10. PHI-base: a new interface and further additions for the multi-species pathogen–host interactions database

    Science.gov (United States)

    Urban, Martin; Cuzick, Alayne; Rutherford, Kim; Irvine, Alistair; Pedro, Helder; Pant, Rashmi; Sadanadan, Vidyendra; Khamari, Lokanath; Billal, Santoshkumar; Mohanty, Sagar; Hammond-Kosack, Kim E.

    2017-01-01

    The pathogen–host interactions database (PHI-base) is available at www.phi-base.org. PHI-base contains expertly curated molecular and biological information on genes proven to affect the outcome of pathogen–host interactions reported in peer reviewed research articles. In addition, literature that indicates specific gene alterations that did not affect the disease interaction phenotype are curated to provide complete datasets for comparative purposes. Viruses are not included. Here we describe a revised PHI-base Version 4 data platform with improved search, filtering and extended data display functions. A PHIB-BLAST search function is provided and a link to PHI-Canto, a tool for authors to directly curate their own published data into PHI-base. The new release of PHI-base Version 4.2 (October 2016) has an increased data content containing information from 2219 manually curated references. The data provide information on 4460 genes from 264 pathogens tested on 176 hosts in 8046 interactions. Prokaryotic and eukaryotic pathogens are represented in almost equal numbers. Host species belong ∼70% to plants and 30% to other species of medical and/or environmental importance. Additional data types included into PHI-base 4 are the direct targets of pathogen effector proteins in experimental and natural host organisms. The curation problems encountered and the future directions of the PHI-base project are briefly discussed. PMID:27915230

  11. PHI-base: a new interface and further additions for the multi-species pathogen-host interactions database.

    Science.gov (United States)

    Urban, Martin; Cuzick, Alayne; Rutherford, Kim; Irvine, Alistair; Pedro, Helder; Pant, Rashmi; Sadanadan, Vidyendra; Khamari, Lokanath; Billal, Santoshkumar; Mohanty, Sagar; Hammond-Kosack, Kim E

    2017-01-04

    The pathogen-host interactions database (PHI-base) is available at www.phi-base.org. PHI-base contains expertly curated molecular and biological information on genes proven to affect the outcome of pathogen-host interactions reported in peer reviewed research articles. In addition, literature that indicates specific gene alterations that did not affect the disease interaction phenotype are curated to provide complete datasets for comparative purposes. Viruses are not included. Here we describe a revised PHI-base Version 4 data platform with improved search, filtering and extended data display functions. A PHIB-BLAST search function is provided and a link to PHI-Canto, a tool for authors to directly curate their own published data into PHI-base. The new release of PHI-base Version 4.2 (October 2016) has an increased data content containing information from 2219 manually curated references. The data provide information on 4460 genes from 264 pathogens tested on 176 hosts in 8046 interactions. Prokaryotic and eukaryotic pathogens are represented in almost equal numbers. Host species belong ∼70% to plants and 30% to other species of medical and/or environmental importance. Additional data types included into PHI-base 4 are the direct targets of pathogen effector proteins in experimental and natural host organisms. The curation problems encountered and the future directions of the PHI-base project are briefly discussed. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  12. An attempt to understand glioma stem cell biology through centrality analysis of a protein interaction network.

    Science.gov (United States)

    Mallik, Mrinmay Kumar

    2018-02-07

    is indicative of their strong influence in the protein protein interaction network. Similarly the newly proposed GEADCA helped identify the transcription factors with high centrality values indicative of their key roles in transcriptional regulation. The enrichment studies provided a list of molecular functions, biological processes and biochemical pathways associated with the constructed network. The study shows how pathway based databases may be used to create and analyze a relevant protein interaction network in glioma cancer stem cells and identify the essential elements within it to gather insights into the molecular interactions that regulate the properties of glioma stem cells. How these insights may be utilized to help the development of future research towards formulation of new management strategies have been discussed from a theoretical standpoint. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. From nonspecific DNA-protein encounter complexes to the prediction of DNA-protein interactions.

    Directory of Open Access Journals (Sweden)

    Mu Gao

    2009-03-01

    Full Text Available DNA-protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA-protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA-protein interaction modes without knowing the protein's specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA-protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA-protein interaction modes exhibit some similarity to specific DNA-protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests, it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that our method performs satisfactorily on protein models whose root-mean-square Cα deviation from their native structures is up to 5 Å. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence. The similarity between the specific DNA-protein interaction mode and nonspecific interaction modes may reflect an important sampling step in search of its specific DNA targets by a DNA-binding protein.

  14. Prediction of protein-protein interactions between viruses and human by an SVM model

    Directory of Open Access Journals (Sweden)

    Cui Guangyu

    2012-05-01

    Full Text Available Abstract Background: Several computational methods have been developed to predict protein-protein interactions from amino acid sequences, but most of those methods are intended for the interactions within a species rather than for interactions across different species. Methods for predicting interactions between homogeneous proteins are not appropriate for finding those between heterogeneous proteins since they do not distinguish the interactions between proteins of the same species from those of different species. Results: We developed a new method for representing a protein sequence of variable length in a frequency vector of fixed length, which encodes the relative frequency of three consecutive amino acids of a sequence. We built a support vector machine (SVM) model to predict human proteins that interact with virus proteins. In two types of viruses, human papillomaviruses (HPV) and hepatitis C virus (HCV), our SVM model achieved an average accuracy above 80%, which is higher than that of another SVM model with a different representation scheme. Using the SVM model and Gene Ontology (GO) annotations of proteins, we predicted new interactions between virus proteins and human proteins. Conclusions: Encoding the relative frequency of amino acid triplets of a protein sequence is a simple yet powerful representation method for predicting protein-protein interactions across different species. The representation method has several advantages: (1) it enables a prediction model to achieve a better performance than other representations, (2) it generates feature vectors of fixed length regardless of the sequence length, and (3) the same representation is applicable to different types of proteins.
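
    The fixed-length triplet encoding described in this abstract can be illustrated with a minimal sketch. It is not the authors' code: the abstract does not say whether amino acids are grouped into classes first, so the sketch assumes the plain 20-letter alphabet (8000 possible triplets), and the names AMINO_ACIDS, TRIPLET_INDEX and triplet_frequency_vector are hypothetical.

```python
from itertools import product

# Assumption: the standard 20-letter amino acid alphabet, with no grouping.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"
TRIPLETS = ["".join(p) for p in product(AMINO_ACIDS, repeat=3)]
TRIPLET_INDEX = {triplet: i for i, triplet in enumerate(TRIPLETS)}


def triplet_frequency_vector(sequence):
    """Return the relative frequency of every overlapping amino acid triplet.

    The result always has len(TRIPLETS) entries, whatever the sequence length.
    Windows containing a non-standard residue code are skipped.
    """
    seq = sequence.upper()
    counts = [0] * len(TRIPLETS)
    total = 0
    for i in range(len(seq) - 2):
        idx = TRIPLET_INDEX.get(seq[i:i + 3])
        if idx is not None:
            counts[idx] += 1
            total += 1
    if total == 0:
        # Sequence too short (or no valid triplet): return an all-zero vector.
        return [0.0] * len(TRIPLETS)
    return [c / total for c in counts]


vec = triplet_frequency_vector("ACDACD")  # windows: ACD, CDA, DAC, ACD
```

    A vector of this kind could then be passed, together with the partner protein's vector, to any SVM implementation; how the two vectors are combined is not stated in the abstract.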

  15. The establishment of a database of Italian feeds for the Cornell Net Carbohydrate and Protein System

    Directory of Open Access Journals (Sweden)

    Enzo Tartari

    2010-01-01

    Full Text Available A field application of the Cornell Net Carbohydrate and Protein System (CNCPS) in Italy has been limited because the feed bank is based on North American feedstuffs and still few laboratories are able to analyze feeds as requested by the CNCPS. Moreover, the standardization of analytical procedures is still not homogeneous among laboratories. This work was carried out to establish a first database for feeds commonly used in Italy, providing nutritionists and producers an accurate and current feed composition, also indicating methods and apparatus for analytical procedures potentially available for routine analysis. A total of 909 samples of hays, silages and raw materials (protein feeds, cereals and by-products) were analyzed between 1999 and 2002; analysis included protein solubility and degradability, protein fractions, structural carbohydrate fractions and the calculation of neutral detergent structural carbohydrates. When possible, average data were compared with those included in the feed bank of CNCPS ver. 3 and with those obtained by another Italian laboratory. The main differences were observed in chemical composition of forages and silages, whose composition largely depends on environmental conditions and physiological stage; protein feeds, cereals and by-products showed some differences in crude protein, soluble protein and protein fractions even in feeds of national origin. The intent to modify the feed bank values of CNCPS for establishing an Italian data base of feeds will require a collaborative study of many laboratories not only for forages, hays and silages samples - whose composition is greatly dependent on environmental factors and agronomic techniques - but also for protein fractions, whose values are largely influenced by even small changes in analytical techniques.

  16. The role of hydrophobic interactions in positioning of peripheral proteins in membranes

    Directory of Open Access Journals (Sweden)

    Lomize Mikhail A

    2007-06-01

    Full Text Available Abstract Background: Three-dimensional (3D) structures of numerous peripheral membrane proteins have been determined. Biological activity, stability, and conformations of these proteins depend on their spatial positions with respect to the lipid bilayer. However, these positions are usually undetermined. Results: We report the first large-scale computational study of monotopic/peripheral proteins with known 3D structures. The optimal translational and rotational positions of 476 proteins are determined by minimizing energy of protein transfer from water to the lipid bilayer, which is approximated by a hydrocarbon slab with a decadiene-like polarity and interfacial regions characterized by water-permeation profiles. Predicted membrane-binding sites, protein tilt angles and membrane penetration depths are consistent with spin-labeling, chemical modification, fluorescence, NMR, mutagenesis, and other experimental studies of 53 peripheral proteins and peptides. Experimental membrane binding affinities of peripheral proteins were reproduced in cases that did not involve a helix-coil transition, specific binding of lipids, or a predominantly electrostatic association. Coordinates of all examined peripheral proteins and peptides with the calculated hydrophobic membrane boundaries, subcellular localization, topology, structural classification, and experimental references are available through the Orientations of Proteins in Membranes (OPM) database. Conclusion: Positions of diverse peripheral proteins and peptides in the lipid bilayer can be accurately predicted using their 3D structures that represent a proper membrane-bound conformation and oligomeric state, and have membrane binding elements present. The success of the implicit solvation model suggests that hydrophobic interactions are usually sufficient to determine the spatial position of a protein in the membrane, even when electrostatic interactions or specific binding of lipids are substantial.

  17. PDBj Mine: design and implementation of relational database interface for Protein Data Bank Japan.

    Science.gov (United States)

    Kinjo, Akira R; Yamashita, Reiko; Nakamura, Haruki

    2010-08-25

    This article is a tutorial for PDBj Mine, a new database and its interface for Protein Data Bank Japan (PDBj). In PDBj Mine, data are loaded from files in the PDBMLplus format (an extension of PDBML, PDB's canonical XML format, enriched with annotations), which are then served for the user of PDBj via the worldwide web (WWW). We describe the basic design of the relational database (RDB) and web interfaces of PDBj Mine. The contents of PDBMLplus files are first broken into XPath entities, and these paths and data are indexed in the way that reflects the hierarchical structure of the XML files. The data for each XPath type are saved into the corresponding relational table that is named as the XPath itself. The generation of table definitions from the PDBMLplus XML schema is fully automated. For efficient search, frequently queried terms are compiled into a brief summary table. Casual users can perform simple keyword search, and 'Advanced Search' which can specify various conditions on the entries. More experienced users can query the database using SQL statements which can be constructed in a uniform manner. Thus, PDBj Mine achieves a combination of the flexibility of XML documents and the robustness of the RDB. Database URL: http://www.pdbj.org/
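
    The idea of breaking an XML file into XPath entries and storing each path in a table named after the path can be sketched in a few lines. This is only an illustration under stated assumptions, not the PDBj Mine schema: the sample XML, its element names, and the helpers walk and load are hypothetical, and SQLite stands in for whatever relational database the service actually uses.

```python
import sqlite3
import xml.etree.ElementTree as ET

# Hypothetical miniature document; real PDBMLplus files are far richer.
SAMPLE_XML = """<datablock>
  <entry id="1ABC"><title>Example protein</title></entry>
  <atom_site id="1"><label>CA</label></atom_site>
  <atom_site id="2"><label>CB</label></atom_site>
</datablock>"""


def walk(element, path=""):
    """Yield (xpath, text) for every element that carries non-blank text."""
    current = f"{path}/{element.tag}"
    text = (element.text or "").strip()
    if text:
        yield current, text
    for child in element:
        yield from walk(child, current)


def load(xml_text, conn):
    """Store each value in a table whose name is the element's path."""
    root = ET.fromstring(xml_text)
    for xpath, value in walk(root):
        table = '"' + xpath.replace('"', '""') + '"'  # quote the identifier
        conn.execute(f"CREATE TABLE IF NOT EXISTS {table} (value TEXT)")
        conn.execute(f"INSERT INTO {table} (value) VALUES (?)", (value,))
    conn.commit()


conn = sqlite3.connect(":memory:")
load(SAMPLE_XML, conn)
labels = [row[0] for row in conn.execute(
    'SELECT value FROM "/datablock/atom_site/label" ORDER BY rowid')]
title = conn.execute(
    'SELECT value FROM "/datablock/entry/title"').fetchone()[0]
```

    Because every table is named after its path, a query against any element follows the same pattern, which is one reading of the abstract's remark that SQL statements "can be constructed in a uniform manner".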

  18. Kin-Driver: a database of driver mutations in protein kinases.

    Science.gov (United States)

    Simonetti, Franco L; Tornador, Cristian; Nabau-Moretó, Nuria; Molina-Vila, Miguel A; Marino-Buslje, Cristina

    2014-01-01

    Somatic mutations in protein kinases (PKs) are frequent driver events in many human tumors, while germ-line mutations are associated with hereditary diseases. Here we present Kin-driver, the first database that compiles driver mutations in PKs with experimental evidence demonstrating their functional role. Kin-driver is a manual expert-curated database that pays special attention to activating mutations (AMs) and can serve as a validation set to develop new generation tools focused on the prediction of gain-of-function driver mutations. It also offers an easy and intuitive environment to facilitate the visualization and analysis of mutations in PKs. Because all mutations are mapped onto a multiple sequence alignment, analogue positions between kinases can be identified and tentative new mutations can be proposed for studying by transferring annotation. Finally, our database can also be of use to clinical and translational laboratories, helping them to identify uncommon AMs that can correlate with response to new antitumor drugs. The website was developed using PHP and JavaScript, which are supported by all major browsers; the database was built using MySQL server. Kin-driver is available at: http://kin-driver.leloir.org.ar/ © The Author(s) 2014. Published by Oxford University Press.

  19. Hydrophobic Interaction Chromatography for Bottom-Up Proteomics Analysis of Single Proteins and Protein Complexes.

    Science.gov (United States)

    Rackiewicz, Michal; Große-Hovest, Ludger; Alpert, Andrew J; Zarei, Mostafa; Dengjel, Jörn

    2017-06-02

    Hydrophobic interaction chromatography (HIC) is a robust standard analytical method to purify proteins while preserving their biological activity. It is widely used to study post-translational modifications of proteins and drug-protein interactions. In the current manuscript we employed HIC to separate proteins, followed by bottom-up LC-MS/MS experiments. We used this approach to fractionate antibody species followed by comprehensive peptide mapping as well as to study protein complexes in human cells. HIC-reversed-phase chromatography (RPC)-mass spectrometry (MS) is a powerful alternative to fractionate proteins for bottom-up proteomics experiments making use of their distinct hydrophobic properties.

  20. Novel Technology for Protein-Protein Interaction-based Targeted Drug Discovery

    Directory of Open Access Journals (Sweden)

    Jung Me Hwang

    2011-12-01

    Full Text Available We have developed a simple but highly efficient in-cell protein-protein interaction (PPI) discovery system based on the translocation properties of protein kinase C- and its C1a domain in live cells. This system allows the visual detection of trimeric and dimeric protein interactions including cytosolic, nuclear, and/or membrane proteins with their cognate ligands. In addition, this system can be used to identify pharmacological small compounds that inhibit specific PPIs. These properties make this PPI system an attractive tool for screening drug candidates and mapping the protein interactome.

  1. Protein prenylation: a new mode of host-pathogen interaction.

    Science.gov (United States)

    Amaya, Moushimi; Baranova, Ancha; van Hoek, Monique L

    2011-12-09

    Post translational modifications are required for proteins to be fully functional. The three step process, prenylation, leads to farnesylation or geranylgeranylation, which increase the hydrophobicity of the prenylated protein for efficient anchoring into plasma membranes and/or organellar membranes. Prenylated proteins function in a number of signaling and regulatory pathways that are responsible for basic cell operations. Well characterized prenylated proteins include Ras, Rac and Rho. Recently, pathogenic prokaryotic proteins, such as SifA and AnkB, have been shown to be prenylated by eukaryotic host cell machinery, but their functions remain elusive. The identification of other bacterial proteins undergoing this type of host-directed post-translational modification shows promise in elucidating host-pathogen interactions to develop new therapeutics. This review incorporates new advances in the study of protein prenylation into a broader aspect of biology with a focus on host-pathogen interaction. Copyright © 2011 Elsevier Inc. All rights reserved.

  2. Anomalous Protein-Protein Interactions in Multivalent Salt Solution

    Czech Academy of Sciences Publication Activity Database

    Pasquier, C.; Vazdar, M.; Forsman, J.; Jungwirth, Pavel; Lund, M.

    2017-01-01

    Roč. 121, č. 14 (2017), s. 3000-3006 ISSN 1520-6106 R&D Projects: GA ČR(CZ) GA16-01074S Institutional support: RVO:61388963 Keywords : Monte Carlo * molecular dynamics * membranes * proteins * multivalent salts Subject RIV: CF - Physical ; Theoretical Chemistry OBOR OECD: Physical chemistry Impact factor: 3.177, year: 2016

  3. GenderMedDB: an interactive database of sex and gender-specific medical literature.

    Science.gov (United States)

    Oertelt-Prigione, Sabine; Gohlke, Björn-Oliver; Dunkel, Mathias; Preissner, Robert; Regitz-Zagrosek, Vera

    2014-01-01

    Searches for sex and gender-specific publications are complicated by the absence of a specific algorithm within search engines and by the lack of adequate archives to collect the retrieved results. We previously addressed this issue by initiating the first systematic archive of medical literature containing sex and/or gender-specific analyses. This initial collection has now been greatly enlarged and re-organized as a free user-friendly database with multiple functions: GenderMedDB (http://gendermeddb.charite.de). GenderMedDB retrieves the included publications from the PubMed database. Manuscripts containing sex and/or gender-specific analysis are continuously screened and the relevant findings organized systematically into disciplines and diseases. Publications are furthermore classified by research type, subject and participant numbers. More than 11,000 abstracts are currently included in the database, after screening more than 40,000 publications. The main functions of the database include searches by publication data or content analysis based on pre-defined classifications. In addition, registrants are enabled to upload relevant publications, access descriptive publication statistics and interact in an open user forum. Overall, GenderMedDB offers the advantages of a discipline-specific search engine as well as the functions of a participative tool for the gender medicine community.

  4. Interacting with the National Database for Autism Research (NDAR) via the LONI Pipeline workflow environment.

    Science.gov (United States)

    Torgerson, Carinna M; Quinn, Catherine; Dinov, Ivo; Liu, Zhizhong; Petrosyan, Petros; Pelphrey, Kevin; Haselgrove, Christian; Kennedy, David N; Toga, Arthur W; Van Horn, John Darrell

    2015-03-01

    Under the umbrella of the National Database for Clinical Trials (NDCT) related to mental illnesses, the National Database for Autism Research (NDAR) seeks to gather, curate, and make openly available neuroimaging data from NIH-funded studies of autism spectrum disorder (ASD). NDAR has recently made its database accessible through the LONI Pipeline workflow design and execution environment to enable large-scale analyses of cortical architecture and function via local, cluster, or "cloud"-based computing resources. This presents a unique opportunity to overcome many of the customary limitations to fostering biomedical neuroimaging as a science of discovery. Providing open access to primary neuroimaging data, workflow methods, and high-performance computing will increase uniformity in data collection protocols, encourage greater reliability of published data, results replication, and broaden the range of researchers now able to perform larger studies than ever before. To illustrate the use of NDAR and LONI Pipeline for performing several commonly performed neuroimaging processing steps and analyses, this paper presents example workflows useful for ASD neuroimaging researchers seeking to begin using this valuable combination of online data and computational resources. We discuss the utility of such database and workflow processing interactivity as a motivation for the sharing of additional primary data in ASD research and elsewhere.

  5. A protein interaction map of the kalimantacin biosynthesis assembly line

    Directory of Open Access Journals (Sweden)

    Birgit Uytterhoeven

    2016-11-01

    Full Text Available The antimicrobial secondary metabolite kalimantacin is produced by a hybrid polyketide/non-ribosomal peptide system in Pseudomonas fluorescens BCCM_ID9359. In this study, the kalimantacin biosynthesis gene cluster is analyzed by yeast two-hybrid analysis, creating a protein-protein interaction map of the entire assembly line. In total, 28 potential interactions were identified, of which 13 could be confirmed further. These interactions include the dimerization of ketosynthase domains, a link between assembly line modules 9 and 10, and a specific interaction between the trans-acting enoyl reductase BatK and the carrier proteins of modules 8 and 10. These interactions reveal fundamental insight into the biosynthesis of secondary metabolites. This study is the first to reveal interactions in a complete biosynthetic pathway. Similar future studies could build a strong basis for engineering strategies in such clusters.

  6. A Global Protein Kinase and Phosphatase Interaction Network in Yeast

    Science.gov (United States)

    Breitkreutz, Ashton; Choi, Hyungwon; Sharom, Jeffrey R.; Boucher, Lorrie; Neduva, Victor; Larsen, Brett; Lin, Zhen-Yuan; Breitkreutz, Bobby-Joe; Stark, Chris; Liu, Guomin; Ahn, Jessica; Dewar-Darch, Danielle; Reguly, Teresa; Tang, Xiaojing; Almeida, Ricardo; Qin, Zhaohui Steve; Pawson, Tony; Gingras, Anne-Claude; Nesvizhskii, Alexey I.; Tyers, Mike

    2011-01-01

    The interactions of protein kinases and phosphatases with their regulatory subunits and substrates underpin cellular regulation. We identified a kinase and phosphatase interaction (KPI) network of 1844 interactions in budding yeast by mass spectrometric analysis of protein complexes. The KPI network contained many dense local regions of interactions that suggested new functions. Notably, the cell cycle phosphatase Cdc14 associated with multiple kinases that revealed roles for Cdc14 in mitogen-activated protein kinase signaling, the DNA damage response, and metabolism, whereas interactions of the target of rapamycin complex 1 (TORC1) uncovered new effector kinases in nitrogen and carbon metabolism. An extensive backbone of kinase-kinase interactions cross-connects the proteome and may serve to coordinate diverse cellular responses. PMID:20489023

  7. KLIFS : a knowledge-based structural database to navigate kinase-ligand interaction space

    NARCIS (Netherlands)

    van Linden, O.P.J.; Kooistra, A.J.; Leurs, R.; de Esch, I.J.P.; de Graaf, C.

    2013-01-01

    Protein kinases regulate the majority of signal transduction pathways in cells and have become important targets for the development of designer drugs. We present a systematic analysis of kinase-ligand interactions in all regions of the catalytic cleft of all 1252 human kinase-ligand cocrystal

  8. A method for investigating protein-protein interactions related to Salmonella typhimurium pathogenesis

    Energy Technology Data Exchange (ETDEWEB)

    Chowdhury, Saiful M. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Shi, Liang [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Yoon, Hyunjin [Dartmouth College, Hanover, NH (United States); Ansong, Charles [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Rommereim, Leah M. [Dartmouth College, Hanover, NH (United States); Norbeck, Angela D. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Auberry, Kenneth J. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Moore, R. J. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Adkins, Joshua N. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States); Heffron, Fred [Oregon Health and Science Univ., Portland, OR (United States); Smith, Richard D. [Pacific Northwest National Lab. (PNNL), Richland, WA (United States)

    2009-02-10

    We successfully modified an existing method to investigate protein-protein interactions in the pathogenic bacterium Salmonella typhimurium (STM). This method includes i) addition of a histidine-biotin-histidine tag to the bait proteins via recombinant DNA techniques; ii) in vivo cross-linking with formaldehyde; iii) tandem affinity purification of bait proteins under fully denaturing conditions; and iv) identification of the proteins cross-linked to the bait proteins by liquid chromatography in conjunction with tandem mass spectrometry. In vivo cross-linking stabilized protein interactions and permitted the subsequent two-step purification conducted under denaturing conditions. The two-step purification greatly reduced nonspecific binding of non-cross-linked proteins to bait proteins. Two different negative controls were employed to reduce false-positive identification. In an initial demonstration of this approach, we tagged three selected STM proteins (HimD, PduB and PhoP) with known binding partners that ranged from stable (e.g., HimD) to transient (i.e., PhoP). Distinct sets of interacting proteins were identified with each bait protein, including the known binding partners such as HimA for HimD, as well as anticipated and unexpected binding partners. Our results suggest that novel protein-protein interactions may be critical to pathogenesis by Salmonella typhimurium.

  9. ORFer--retrieval of protein sequences and open reading frames from GenBank and storage into relational databases or text files.

    Science.gov (United States)

    Büssow, Konrad; Hoffmann, Steve; Sievert, Volker

    2002-12-19

    Functional genomics involves the parallel experimentation with large sets of proteins. This requires management of large sets of open reading frames as a prerequisite of the cloning and recombinant expression of these proteins. A Java program was developed for retrieval of protein and nucleic acid sequences and annotations from NCBI GenBank, using the XML sequence format. Annotations retrieved by ORFer include sequence name, organism and also the completeness of the sequence. The program has a graphical user interface, although it can be used in a non-interactive mode. For protein sequences, the program also extracts the open reading frame sequence, if available, and checks its correct translation. ORFer accepts user input in the form of single or lists of GenBank GI identifiers or accession numbers. It can be used to extract complete sets of open reading frames and protein sequences from any kind of GenBank sequence entry, including complete genomes or chromosomes. Sequences are either stored with their features in a relational database or can be exported as text files in Fasta or tabulator delimited format. The ORFer program is freely available at http://www.proteinstrukturfabrik.de/orfer. The ORFer program allows for fast retrieval of DNA sequences, protein sequences and their open reading frames and sequence annotations from GenBank. Furthermore, storage of sequences and features in a relational database is supported. Such a database can supplement a laboratory information system (LIMS) with appropriate sequence information.

  10. Quality control methodology for high-throughput protein-protein interaction screening.

    Science.gov (United States)

    Vazquez, Alexei; Rual, Jean-François; Venkatesan, Kavitha

    2011-01-01

    Protein-protein interactions are key to many aspects of the cell, including its cytoskeletal structure, the signaling processes in which it is involved, or its metabolism. Failure to form protein complexes or signaling cascades may sometimes translate into pathologic conditions such as cancer or neurodegenerative diseases. The set of all protein interactions between the proteins encoded by an organism constitutes its protein interaction network, representing a scaffold for biological function. Knowing the protein interaction network of an organism, combined with other sources of biological information, can unravel fundamental biological circuits and may help better understand the molecular basics of human diseases. The protein interaction network of an organism can be mapped by combining data obtained from both low-throughput screens, i.e., "one gene at a time" experiments and high-throughput screens, i.e., screens designed to interrogate large sets of proteins at once. In either case, quality controls are required to deal with the inherent imperfect nature of experimental assays. In this chapter, we discuss experimental and statistical methodologies to quantify error rates in high-throughput protein-protein interactions screens.

  11. High Performance Protein Sequence Database Scanning on the Cell Broadband Engine

    Directory of Open Access Journals (Sweden)

    Adrianto Wirawan

    2009-01-01

    Full Text Available The enormous growth of biological sequence databases has caused bioinformatics to be rapidly moving towards a data-intensive, computational science. As a result, the computational power needed by bioinformatics applications is growing rapidly as well. The recent emergence of low cost parallel multicore accelerator technologies has made it possible to reduce execution times of many bioinformatics applications. In this paper, we demonstrate how the Cell Broadband Engine can be used as a computational platform to accelerate two approaches for protein sequence database scanning: exhaustive and heuristic. We present efficient parallelization techniques for two representative algorithms: the dynamic programming based Smith–Waterman algorithm and the popular BLASTP heuristic. Their implementation on a Playstation®3 leads to significant runtime savings compared to corresponding sequential implementations.
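
    For readers unfamiliar with the exhaustive approach named in this abstract, the following is a minimal sequential sketch of the Smith–Waterman local alignment score. It is not the paper's Cell Broadband Engine implementation: it assumes a simple match/mismatch score and a linear gap penalty, whereas protein database scanning normally uses a substitution matrix and affine gaps, and the function name and default parameters are hypothetical.

```python
def smith_waterman_score(a, b, match=2, mismatch=-1, gap=-2):
    """Return the best local alignment score between sequences a and b.

    Only two rows of the dynamic programming matrix are kept in memory,
    which is enough when the score alone (not the alignment) is needed.
    """
    prev = [0] * (len(b) + 1)
    best = 0
    for i in range(1, len(a) + 1):
        curr = [0] * (len(b) + 1)
        for j in range(1, len(b) + 1):
            diag = prev[j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            # A local alignment may restart anywhere, hence the floor at 0.
            curr[j] = max(0, diag, prev[j] + gap, curr[j - 1] + gap)
            best = max(best, curr[j])
        prev = curr
    return best


score = smith_waterman_score("XXACDEXX", "YYACDEYY")  # shared core "ACDE"
```

    Scanning a database amounts to computing this score for the query against every database sequence, which is why the inner loop is the natural target for the parallelization the abstract describes.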

  12. Surfing the Protein-Protein Interaction Surface Using Docking Methods: Application to the Design of PPI Inhibitors

    Directory of Open Access Journals (Sweden)

    Rushikesh Sable

    2015-06-01

    Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.

  13. Surfing the Protein-Protein Interaction Surface Using Docking Methods: Application to the Design of PPI Inhibitors.

    Science.gov (United States)

    Sable, Rushikesh; Jois, Seetharama

    2015-06-23

    Blocking protein-protein interactions (PPI) using small molecules or peptides modulates biochemical pathways and has therapeutic significance. PPI inhibition for designing drug-like molecules is a new area that has been explored extensively during the last decade. Considering the number of available PPI inhibitor databases and the limited number of 3D structures available for proteins, docking and scoring methods play a major role in designing PPI inhibitors as well as stabilizers. Docking methods are used in the design of PPI inhibitors at several stages of finding a lead compound, including modeling the protein complex, screening for hot spots on the protein-protein interaction interface and screening small molecules or peptides that bind to the PPI interface. There are three major challenges to the use of docking on the relatively flat surfaces of PPI. In this review we will provide some examples of the use of docking in PPI inhibitor design as well as its limitations. The combination of experimental and docking methods with improved scoring function has thus far resulted in few success stories of PPI inhibitors for therapeutic purposes. Docking algorithms used for PPI are in the early stages, however, and as more data are available docking will become a highly promising area in the design of PPI inhibitors or stabilizers.

  14. Specificity and evolvability in eukaryotic protein interaction networks.

    Directory of Open Access Journals (Sweden)

    Pedro Beltrao

    2007-02-01

    Progress in uncovering the protein interaction networks of several species has led to questions of what underlying principles might govern their organization. Few studies have tried to determine the impact of protein interaction network evolution on the observed physiological differences between species. Using comparative genomics and structural information, we show here that eukaryotic species have rewired their interactomes at a fast rate of approximately 10^-5 interactions changed per protein pair, per million years of divergence. For Homo sapiens this corresponds to 10^3 interactions changed per million years. Additionally we find that the specificity of binding strongly determines the interaction turnover and that different biological processes show significantly different link dynamics. In particular, human proteins involved in immune response, transport, and establishment of localization show signs of positive selection for change of interactions. Our analysis suggests that a small degree of molecular divergence can give rise to important changes at the network level. We propose that the power law distribution observed in protein interaction networks could be partly explained by the cell's requirement for different degrees of protein binding specificity.

  15. Evolutionary diversification of protein-protein interactions by interface add-ons.

    Science.gov (United States)

    Plach, Maximilian G; Semmelmann, Florian; Busch, Florian; Busch, Markus; Heizinger, Leonhard; Wysocki, Vicki H; Merkl, Rainer; Sterner, Reinhard

    2017-10-03

    Cells contain a multitude of protein complexes whose subunits interact with high specificity. However, the number of different protein folds and interface geometries found in nature is limited. This raises the question of how protein-protein interaction specificity is achieved on the structural level and how the formation of nonphysiological complexes is avoided. Here, we describe structural elements called interface add-ons that fulfill this function and elucidate their role for the diversification of protein-protein interactions during evolution. We identified interface add-ons in 10% of a representative set of bacterial, heteromeric protein complexes. The importance of interface add-ons for protein-protein interaction specificity is demonstrated by an exemplary experimental characterization of over 30 cognate and hybrid glutamine amidotransferase complexes in combination with comprehensive genetic profiling and protein design. Moreover, growth experiments showed that the lack of interface add-ons can lead to physiologically harmful cross-talk between essential biosynthetic pathways. In sum, our complementary in silico, in vitro, and in vivo analysis argues that interface add-ons are a practical and widespread evolutionary strategy to prevent the formation of nonphysiological complexes by specializing protein-protein interactions.

  16. Neutron cross-sections database for amino acids and proteins analysis

    Energy Technology Data Exchange (ETDEWEB)

    Voi, Dante L.; Ferreira, Francisco de O.; Nunes, Rogerio Chaffin, E-mail: dante@ien.gov.br, E-mail: fferreira@ien.gov.br, E-mail: Chaffin@ien.gov.br [Instituto de Engenharia Nuclear (IEN/CNEN-RJ), Rio de Janeiro, RJ (Brazil); Rocha, Helio F. da, E-mail: hrocha@gbl.com.br [Universidade Federal do Rio de Janeiro (IPPMG/UFRJ), Rio de Janeiro, RJ (Brazil). Instituto de Pediatria

    2015-07-01

    Biological materials may be studied using neutrons as an unconventional tool of analysis. Dynamic and structural data can be obtained for amino acids, proteins and other cellular components by neutron cross-section determinations, especially for applications in nuclear purity and conformation analysis. The instrument used for this is the crystal spectrometer of the Instituto de Engenharia Nuclear (IEN-CNEN-RJ), the only one in Latin America that uses neutrons for this type of analysis; it is installed in one of the irradiation channels of the Argonauta reactor. The experimentally obtained values are compared with values calculated from literature data, with a rigorous analysis of the chemical composition, conformation and molecular structure of the materials. A neutron cross-section database was constructed to assist in determining the molecular dynamics, structure and formulae of biological materials. The database contains neutron cross-section values of all amino acids, chemical elements, molecular groups and auxiliary radicals, as well as values of constants and parameters necessary for the analysis. An unprecedented analytical procedure was developed using the neutron cross-section parceling and grouping method for data manipulation. This database is the result of measurements on twenty amino acids that were provided by different manufacturers and are administered orally to hospital patients for nutritional purposes. A small data file was also constructed of compounds with different molecular groups including carbon, nitrogen, sulfur and oxygen, all linked to hydrogen atoms. A review of the global and national scene in the acquisition of neutron cross-section data, the formation of libraries and the application of neutrons for analyzing biological materials is presented. This database has further application in protein analysis, and the neutron cross-section of insulin was estimated. (author)

  17. Neutron cross-sections database for amino acids and proteins analysis

    International Nuclear Information System (INIS)

    Voi, Dante L.; Ferreira, Francisco de O.; Nunes, Rogerio Chaffin; Rocha, Helio F. da

    2015-01-01

    Biological materials may be studied using neutrons as an unconventional tool of analysis. Dynamic and structural data can be obtained for amino acids, proteins and other cellular components by neutron cross-section determinations, especially for applications in nuclear purity and conformation analysis. The instrument used for this is the crystal spectrometer of the Instituto de Engenharia Nuclear (IEN-CNEN-RJ), the only one in Latin America that uses neutrons for this type of analysis; it is installed in one of the irradiation channels of the Argonauta reactor. The experimentally obtained values are compared with values calculated from literature data, with a rigorous analysis of the chemical composition, conformation and molecular structure of the materials. A neutron cross-section database was constructed to assist in determining the molecular dynamics, structure and formulae of biological materials. The database contains neutron cross-section values of all amino acids, chemical elements, molecular groups and auxiliary radicals, as well as values of constants and parameters necessary for the analysis. An unprecedented analytical procedure was developed using the neutron cross-section parceling and grouping method for data manipulation. This database is the result of measurements on twenty amino acids that were provided by different manufacturers and are administered orally to hospital patients for nutritional purposes. A small data file was also constructed of compounds with different molecular groups including carbon, nitrogen, sulfur and oxygen, all linked to hydrogen atoms. A review of the global and national scene in the acquisition of neutron cross-section data, the formation of libraries and the application of neutrons for analyzing biological materials is presented. This database has further application in protein analysis, and the neutron cross-section of insulin was estimated. (author)

  18. Multiplex single-molecule interaction profiling of DNA barcoded proteins

    Science.gov (United States)

    Gu, Liangcai; Li, Chao; Aach, John; Hill, David E.; Vidal, Marc; Church, George M.

    2014-01-01

    In contrast with advances in massively parallel DNA sequencing, high-throughput protein analyses are often limited by ensemble measurements, individual analyte purification and hence compromised quality and cost-effectiveness. Single-molecule (SM) protein detection achieved using optical methods is limited by the number of spectrally nonoverlapping chromophores. Here, we introduce a single molecular interaction-sequencing (SMI-Seq) technology for parallel protein interaction profiling leveraging SM advantages. DNA barcodes are attached to proteins collectively via ribosome display or individually via enzymatic conjugation. Barcoded proteins are assayed en masse in aqueous solution and subsequently immobilized in a polyacrylamide (PAA) thin film to construct a random SM array, where barcoding DNAs are amplified into in situ polymerase colonies (polonies) and analyzed by DNA sequencing. This method allows precise quantification of various proteins with a theoretical maximum array density of over one million polonies per square millimeter. Furthermore, protein interactions can be measured based on the statistics of colocalized polonies arising from barcoding DNAs of interacting proteins. Two demanding applications, G-protein coupled receptor (GPCR) and antibody binding profiling, were demonstrated. SMI-Seq enables “library vs. library” screening in a one-pot assay, simultaneously interrogating molecular binding affinity and specificity. PMID:25252978

  19. Visualization and targeted disruption of protein interactions in living cells

    Science.gov (United States)

    Herce, Henry D.; Deng, Wen; Helma, Jonas; Leonhardt, Heinrich; Cardoso, M. Cristina

    2013-01-01

    Protein–protein interactions are the basis of all processes in living cells, but most studies of these interactions rely on biochemical in vitro assays. Here we present a simple and versatile fluorescent-three-hybrid (F3H) strategy to visualize and target protein–protein interactions. A high-affinity nanobody anchors a GFP-fusion protein of interest at a defined cellular structure and the enrichment of red-labelled interacting proteins is measured at these sites. With this approach, we visualize the p53–HDM2 interaction in living cells and directly monitor the disruption of this interaction by Nutlin 3, a drug developed to boost p53 activity in cancer therapy. We further use this approach to develop a cell-permeable vector that releases a highly specific peptide disrupting the p53 and HDM2 interaction. The availability of multiple anchor sites and the simple optical readout of this nanobody-based capture assay enable systematic and versatile analyses of protein–protein interactions in practically any cell type and species. PMID:24154492

  20. Identification of brain-specific angiogenesis inhibitor 2 as an interaction partner of glutaminase interacting protein

    International Nuclear Information System (INIS)

    Zencir, Sevil; Ovee, Mohiuddin; Dobson, Melanie J.; Banerjee, Monimoy; Topcu, Zeki; Mohanty, Smita

    2011-01-01

    Highlights: → Brain-specific angiogenesis inhibitor 2 (BAI2) is a new partner protein for GIP. → BAI2 interaction with GIP was revealed by yeast two-hybrid assay. → Binding of BAI2 to GIP was characterized by NMR, CD and fluorescence. → BAI2 and GIP binding was mediated through the C-terminus of BAI2. -- Abstract: The vast majority of physiological processes in living cells are mediated by protein-protein interactions often specified by particular protein sequence motifs. PDZ domains, composed of 80-100 amino acid residues, are an important class of interaction motif. Among the PDZ-containing proteins, glutaminase interacting protein (GIP), also known as Tax Interacting Protein TIP-1, is unique in being composed almost exclusively of a single PDZ domain. GIP has important roles in cellular signaling, protein scaffolding and modulation of tumor growth and interacts with a number of physiological partner proteins, including Glutaminase L, β-Catenin, FAS, HTLV-1 Tax, HPV16 E6, Rhotekin and Kir 2.3. To identify the network of proteins that interact with GIP, a human fetal brain cDNA library was screened using a yeast two-hybrid assay with GIP as bait. We identified brain-specific angiogenesis inhibitor 2 (BAI2), a member of the adhesion-G protein-coupled receptors (GPCRs), as a new partner of GIP. BAI2 is expressed primarily in neurons, further expanding GIP cellular functions. The interaction between GIP and the carboxy-terminus of BAI2 was characterized using fluorescence, circular dichroism (CD) and nuclear magnetic resonance (NMR) spectroscopy assays. These biophysical analyses support the interaction identified in the yeast two-hybrid assay. This is the first study reporting BAI2 as an interaction partner of GIP.