WorldWideScience

Sample records for cluster protein iscu

  1. Interaction of the iron–sulfur cluster assembly protein IscU with the Hsc66/Hsc20 molecular chaperone system of Escherichia coli

    Science.gov (United States)

    Hoff, Kevin G.; Silberg, Jonathan J.; Vickery, Larry E.

    2000-01-01

    The iscU gene in bacteria is located in a gene cluster encoding proteins implicated in iron–sulfur cluster assembly and an hsc70-type (heat shock cognate) molecular chaperone system, iscSUA-hscBA. To investigate possible interactions between these systems, we have overproduced and purified the IscU protein from Escherichia coli and have studied its interactions with the hscA and hscB gene products Hsc66 and Hsc20. IscU and its iron–sulfur complex (IscU–Fe/S) stimulated the basal steady-state ATPase activity of Hsc66 weakly in the absence of Hsc20 but, in the presence of Hsc20, increased the ATPase activity up to 480-fold. Hsc20 also decreased the apparent Km for IscU stimulation of Hsc66 ATPase activity, and surface plasmon resonance studies revealed that Hsc20 enhances binding of IscU to Hsc66. Surface plasmon resonance and isothermal titration calorimetry further showed that IscU and Hsc20 form a complex, and Hsc20 may thereby aid in the targeting of IscU to Hsc66. These results establish a direct and specific role for the Hsc66/Hsc20 chaperone system in functioning with isc gene components for the assembly of iron–sulfur cluster proteins. PMID:10869428

  2. Transfer of sulfur from IscS to IscU during Fe/S cluster assembly.

    Science.gov (United States)

    Urbina, H D; Silberg, J J; Hoff, K G; Vickery, L E

    2001-11-30

    The cysteine desulfurase enzymes NifS and IscS provide sulfur for the biosynthesis of Fe/S proteins. NifU and IscU have been proposed to serve as template or scaffold proteins in the initial Fe/S cluster assembly events, but the mechanism of sulfur transfer from NifS or IscS to NifU or IscU has not been elucidated. We have employed [(35)S]cysteine radiotracer studies to monitor sulfur transfer between IscS and IscU from Escherichia coli and have used direct binding measurements to investigate interactions between the proteins. IscS catalyzed transfer of (35)S from [(35)S]cysteine to IscU in the absence of additional thiol reagents, suggesting that transfer can occur directly and without involvement of an intermediate carrier. Surface plasmon resonance studies and isothermal titration calorimetry measurements further revealed that IscU binds to IscS with high affinity (K(d) approximately 2 microm) in support of a direct transfer mechanism. Transfer was inhibited by treatment of IscU with iodoacetamide, and (35)S was released by reducing reagents, suggesting that transfer of persulfide sulfur occurs to cysteinyl groups of IscU. A deletion mutant of IscS lacking C-terminal residues 376-413 (IscSDelta376-413) displayed cysteine desulfurase activity similar to the full-length protein but exhibited lower binding affinity for IscU, decreased ability to transfer (35)S to IscU, and reduced activity in assays of Fe/S cluster assembly on IscU. The findings with IscSDelta376-413 provide additional support for a mechanism of sulfur transfer involving a direct interaction between IscS and IscU and suggest that the C-terminal region of IscS may be important for binding IscU.

  3. Hsc66 substrate specificity is directed toward a discrete region of the iron-sulfur cluster template protein IscU.

    Science.gov (United States)

    Hoff, Kevin G; Ta, Dennis T; Tapley, Tim L; Silberg, Jonathan J; Vickery, Larry E

    2002-07-26

    Hsc66 and Hsc20 comprise a specialized chaperone system important for the assembly of iron-sulfur clusters in Escherchia coli. Only a single substrate, the Fe/S template protein IscU, has been identified for the Hsc66/Hsc20 system, but the mechanism by which Hsc66 selectively binds IscU is unknown. We have investigated Hsc66 substrate specificity using phage display and a peptide array of IscU. Screening of a heptameric peptide phage display library revealed that Hsc66 prefers peptides with a centrally located Pro-Pro motif. Using a cellulose-bound peptide array of IscU we determined that Hsc66 interacts specifically with a region (residues 99-103, LPPVK) that is invariant among all IscU family members. A synthetic peptide (ELPPVKIHC) corresponding to IscU residues 98-106 behaves in a similar manner to native IscU, stimulating the ATPase activity of Hsc66 with similar affinity as IscU, preventing Hsc66 suppression of bovine rhodanese aggregation, and interacting with the peptide-binding domain of Hsc66. Unlike native IscU, however, the synthetic peptide is not bound by Hsc20 and does not synergistically stimulate Hsc66 ATPase activity with Hsc20. Our results indicate that Hsc66 and Hsc20 recognize distinct regions of IscU and further suggest that Hsc66 will not bind LPPVK motifs with high affinity in vivo unless they are in the context of native IscU and can be directed to Hsc66 by Hsc20.

  4. MicroRNA-210 regulates mitochondrial free radical response to hypoxia and krebs cycle in cancer cells by targeting iron sulfur cluster protein ISCU.

    Directory of Open Access Journals (Sweden)

    Elena Favaro

    2010-04-01

    Full Text Available Hypoxia in cancers results in the upregulation of hypoxia inducible factor 1 (HIF-1 and a microRNA, hsa-miR-210 (miR-210 which is associated with a poor prognosis.In human cancer cell lines and tumours, we found that miR-210 targets the mitochondrial iron sulfur scaffold protein ISCU, required for assembly of iron-sulfur clusters, cofactors for key enzymes involved in the Krebs cycle, electron transport, and iron metabolism. Down regulation of ISCU was the major cause of induction of reactive oxygen species (ROS in hypoxia. ISCU suppression reduced mitochondrial complex 1 activity and aconitase activity, caused a shift to glycolysis in normoxia and enhanced cell survival. Cancers with low ISCU had a worse prognosis.Induction of these major hallmarks of cancer show that a single microRNA, miR-210, mediates a new mechanism of adaptation to hypoxia, by regulating mitochondrial function via iron-sulfur cluster metabolism and free radical generation.

  5. Regulation of the HscA ATPase reaction cycle by the co-chaperone HscB and the iron-sulfur cluster assembly protein IscU.

    Science.gov (United States)

    Silberg, Jonathan J; Tapley, Tim L; Hoff, Kevin G; Vickery, Larry E

    2004-12-24

    The ATPase activity of HscA, a specialized hsp70 molecular chaperone from Escherichia coli, is regulated by the iron-sulfur cluster assembly protein IscU and the J-type co-chaperone HscB. IscU behaves as a substrate for HscA, and HscB enhances the binding of IscU to HscA. To better understand the mechanism by which HscB and IscU regulate HscA, we examined binding of HscB to the different conformational states of HscA and the effects of HscB and IscU on the kinetics of the individual steps of the HscA ATPase reaction cycle. Affinity sensor studies revealed that whereas IscU binds both ADP (R-state) and ATP (T-state) HscA complexes, HscB interacts only with an ATP-bound state. Studies of ATPase activity under single-turnover and rapid mixing conditions showed that both IscU and HscB interact with the low peptide affinity T-state of HscA (HscA++.ATP) and that both modestly accelerate (3-10-fold) the rate-determining steps in the HscA reaction cycle, k(hyd) and k(T-->R). When present together, IscU and HscB synergistically stimulate both k(hyd) (approximately = 500-fold) and k(T-->R) (approximately = 60-fold), leading to enhanced formation of the HscA.ADP-IscU complex (substrate capture). Following ADP/ATP exchange, IscU also stimulates k(R-->T) (approximately = 50-fold) and thereby accelerates the rate at which the low peptide affinity HscA++.ATP T-state is regenerated. Because HscA nucleotide exchange is fast, the overall rate of the chaperone cycle in vivo will be determined by the availability of the IscU-HscB substrate-co-chaperone complex.

  6. Interactions of iron-bound frataxin with ISCU and ferredoxin on the cysteine desulfurase complex leading to Fe-S cluster assembly.

    Science.gov (United States)

    Cai, Kai; Frederick, Ronnie O; Tonelli, Marco; Markley, John L

    2018-06-01

    Frataxin (FXN) is involved in mitochondrial iron‑sulfur (Fe-S) cluster biogenesis and serves to accelerate Fe-S cluster formation. FXN deficiency is associated with Friedreich ataxia, a neurodegenerative disease. We have used a combination of isothermal titration calorimetry and multinuclear NMR spectroscopy to investigate interactions among the components of the biological machine that carries out the assembly of iron‑sulfur clusters in human mitochondria. Our results show that FXN tightly binds a single Fe 2+ but not Fe 3+ . While FXN (with or without bound Fe 2+ ) does not bind the scaffold protein ISCU directly, the two proteins interact mutually when each is bound to the cysteine desulfurase complex ([NFS1] 2 :[ISD11] 2 :[Acp] 2 ), abbreviated as (NIA) 2 , where "N" represents the cysteine desulfurase (NFS1), "I" represents the accessory protein (ISD11), and "A" represents acyl carrier protein (Acp). FXN binds (NIA) 2 weakly in the absence of ISCU but more strongly in its presence. Fe 2+ -FXN binds to the (NIA) 2 -ISCU 2 complex without release of iron. However, upon the addition of both l-cysteine and a reductant (either reduced FDX2 or DTT), Fe 2+ is released from FXN as consistent with Fe 2+ -FXN being the proximal source of iron for Fe-S cluster assembly. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

  7. pH-induced conformational change of IscU at low pH correlates with protonation/deprotonation of two conserved histidine residues.

    Science.gov (United States)

    Dai, Ziqi; Kim, Jin Hae; Tonelli, Marco; Ali, Ibrahim K; Markley, John L

    2014-08-19

    IscU, the scaffold protein for the major iron-sulfur cluster biosynthesis pathway in microorganisms and mitochondria (ISC pathway), plays important roles in the formation of [2Fe-2S] and [4Fe-4S] clusters and their delivery to acceptor apo-proteins. Our laboratory has shown that IscU populates two distinct, functionally relevant conformational states, a more structured state (S) and a more dynamic state (D), that differ by cis/trans isomerizations about two peptidyl-prolyl peptide bonds [Kim, J. H., Tonelli, M., and Markley, J. L. (2012) Proc. Natl. Acad. Sci. U.S.A., 109, 454-459. Dai Z., Tonelli, M., and Markley, J. L. (2012) Biochemistry, 51, 9595-9602. Cai, K., Frederick, R. O., Kim, J. H., Reinen, N. M., Tonelli, M., and Markley, J. L. (2013) J. Biol. Chem., 288, 28755-28770]. Here, we report our findings on the pH dependence of the D ⇄ S equilibrium for Escherichia coli IscU in which the D-state is stabilized at low and high pH values. We show that the lower limb of the pH dependence curve results from differences in the pKa values of two conserved histidine residues (His10 and His105) in the two states. The net proton affinity of His10 is about 50 times higher and that of His105 is 13 times higher in the D-state than in the S-state. The origin of the high limb of the D ⇄ S pH dependence remains to be determined. These results show that changes in proton inventory need to be taken into account in the steps in iron-sulfur cluster assembly and transfer that involve transitions of IscU between its S- and D-states.

  8. Architecture of the Human Mitochondrial Iron-Sulfur Cluster Assembly Machinery*

    Science.gov (United States)

    Gakh, Oleksandr; Ranatunga, Wasantha; Smith, Douglas Y.; Ahlgren, Eva-Christina; Al-Karadaghi, Salam; Thompson, James R.; Isaya, Grazia

    2016-01-01

    Fe-S clusters, essential cofactors needed for the activity of many different enzymes, are assembled by conserved protein machineries inside bacteria and mitochondria. As the architecture of the human machinery remains undefined, we co-expressed in Escherichia coli the following four proteins involved in the initial step of Fe-S cluster synthesis: FXN42–210 (iron donor); [NFS1]·[ISD11] (sulfur donor); and ISCU (scaffold upon which new clusters are assembled). We purified a stable, active complex consisting of all four proteins with 1:1:1:1 stoichiometry. Using negative staining transmission EM and single particle analysis, we obtained a three-dimensional model of the complex with ∼14 Å resolution. Molecular dynamics flexible fitting of protein structures docked into the EM map of the model revealed a [FXN42–210]24·[NFS1]24·[ISD11]24·[ISCU]24 complex, consistent with the measured 1:1:1:1 stoichiometry of its four components. The complex structure fulfills distance constraints obtained from chemical cross-linking of the complex at multiple recurring interfaces, involving hydrogen bonds, salt bridges, or hydrophobic interactions between conserved residues. The complex consists of a central roughly cubic [FXN42–210]24·[ISCU]24 sub-complex with one symmetric ISCU trimer bound on top of one symmetric FXN42–210 trimer at each of its eight vertices. Binding of 12 [NFS1]2·[ISD11]2 sub-complexes to the surface results in a globular macromolecule with a diameter of ∼15 nm and creates 24 Fe-S cluster assembly centers. The organization of each center recapitulates a previously proposed conserved mechanism for sulfur donation from NFS1 to ISCU and reveals, for the first time, a path for iron donation from FXN42–210 to ISCU. PMID:27519411

  9. Role of the HSPA9/HSC20 chaperone pair in promoting directional human iron-sulfur cluster exchange involving monothiol glutaredoxin 5.

    Science.gov (United States)

    Olive, Joshua A; Cowan, J A

    2018-07-01

    Iron‑sulfur clusters are essential cofactors found across all domains of life. Their assembly and transfer are accomplished by highly conserved protein complexes and partners. In eukaryotes a [2Fe-2S] cluster is first assembled in the mitochondria on the iron‑sulfur cluster scaffold protein ISCU in tandem with iron, sulfide, and electron donors. Current models suggest that a chaperone pair interacts with a cluster-bound ISCU to facilitate cluster transfer to a monothiol glutaredoxin. In humans this protein is glutaredoxin 5 (GLRX5) and the cluster can then be exchanged with a variety of target apo proteins. By use of circular dichroism spectroscopy, the kinetics of cluster exchange reactivity has been evaluated for human GLRX5 with a variety of cluster donor and acceptor partners, and the role of chaperones determined for several of these. In contrast to the prokaryotic model, where heat-shock type chaperone proteins HscA and HscB are required for successful and efficient transfer of a [2Fe-2S] cluster from the ISCU scaffold to a monothiol glutaredoxin. However, in the human system the chaperone homologs, HSPA9 and HSC20, are not necessary for human ISCU to promote cluster transfer to GLRX5, and appear to promote the reverse transfer. Cluster exchange with the human iron‑sulfur cluster carrier protein NFU1 and ferredoxins (FDX's), and the role of chaperones, has also been evaluated, demonstrating in certain cases control over the directionality of cluster transfer. In contrast to other prokaryotic and eukaryotic organisms, NFU1 is identified as a more likely physiological donor of [2Fe-2S] cluster to human GLRX5 than ISCU. Copyright © 2018 Elsevier Inc. All rights reserved.

  10. Structural studies of the Enterococcus faecalis SufU [Fe-S] cluster protein

    Directory of Open Access Journals (Sweden)

    Frazzon Jeverson

    2009-02-01

    Full Text Available Abstract Background Iron-sulfur clusters are ubiquitous and evolutionarily ancient inorganic prosthetic groups, the biosynthesis of which depends on complex protein machineries. Three distinct assembly systems involved in the maturation of cellular Fe-S proteins have been determined, designated the NIF, ISC and SUF systems. Although well described in several organisms, these machineries are poorly understood in Gram-positive bacteria. Within the Firmicutes phylum, the Enterococcus spp. genus have recently assumed importance in clinical microbiology being considered as emerging pathogens for humans, wherein Enterococcus faecalis represents the major species associated with nosocomial infections. The aim of this study was to carry out a phylogenetic analysis in Enterococcus faecalis V583 and a structural and conformational characterisation of it SufU protein. Results BLAST searches of the Enterococcus genome revealed a series of genes with sequence similarity to the Escherichia coli SUF machinery of [Fe-S] cluster biosynthesis, namely sufB, sufC, sufD and SufS. In addition, the E. coli IscU ortholog SufU was found to be the scaffold protein of Enterococcus spp., containing all features considered essential for its biological activity, including conserved amino acid residues involved in substrate and/or co-factor binding (Cys50,76,138 and Asp52 and, phylogenetic analyses showed a close relationship with orthologues from other Gram-positive bacteria. Molecular dynamics for structural determinations and molecular modeling using E. faecalis SufU primary sequence protein over the PDB:1su0 crystallographic model from Streptococcus pyogenes were carried out with a subsequent 50 ns molecular dynamic trajectory. This presented a stable model, showing secondary structure modifications near the active site and conserved cysteine residues. Molecular modeling using Haemophilus influenzae IscU primary sequence over the PDB:1su0 crystal followed by a MD

  11. Using an Integrated Script Control Unit (ISCU to Assist the Power Electronics Education

    Directory of Open Access Journals (Sweden)

    Zhigang Gao

    2017-11-01

    Full Text Available An integrated script control unit (ISCU is invented to work as the digital controller in power electronics educations. The ISCU mainly consists of two parts, a control board and computer software. The computer software enables college students to write specific scripts, which can be compiled and saved on the control board, to design the control flow and algorithms. The processor on the board will realize the algorithms that are designed by the user. All of the variables can be monitored by the computer software, which is helpful to find the bugs in the algorithms. ISCU can help the under-graduate students to design converters even if they are unfamiliar with the programming languages and developing environment. Users can write and validate algorithms for converters quickly without writing any tedious codes (such as initialization, dealing with the interrupts for specific processors with ISCU. The college students who lack the necessary skills to program the processor, can benefit when they are studying the power electronic techniques. Importantly, the ISCU is considered to be free for everyone. The details and the principles of ISCU are introduced, and a bi-directional DC-DC converter is built based on ISCU to validate the proposed characteristics.

  12. Knock-downs of mitochondrial iron-sulfur cluster assembly proteins IscS and IscU down-regulate the active mitochondrion of procyclic Trypanosoma brucei

    Czech Academy of Sciences Publication Activity Database

    Šmíd, O.; Horáková, Eva; Vilímová, V.; Hrdý, I.; Cammack, R.; Horváth, A.; Lukeš, Julius; Tachezy, J.

    2006-01-01

    Roč. 281, č. 39 (2006), s. 28679-28686 ISSN 0021-9258 R&D Projects: GA ČR GA204/04/0435; GA AV ČR IAA5022302 Institutional research plan: CEZ:AV0Z60220518 Keywords : IscS * IscU * FeS Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 5.808, year: 2006

  13. Role of IscX in Iron-Sulfur Cluster Biogenesis in Escherichia coli

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Jin Hae; Bothe, Jameson R.; Frederick, Ronnie O.; Holder, Johneisa C.; Markley, John L. [UW

    2014-08-20

    The Escherichia coli isc operon encodes key proteins involved in the biosynthesis of iron–sulfur (Fe–S) clusters. Whereas extensive studies of most ISC proteins have revealed their functional properties, the role of IscX (also dubbed YfhJ), a small acidic protein encoded by the last gene in the operon, has remained in question. Previous studies showed that IscX binds iron ions and interacts with the cysteine desulfurase (IscS) and the scaffold protein for cluster assembly (IscU), and it has been proposed that IscX functions either as an iron supplier or a regulator of Fe–S cluster biogenesis. We have used a combination of NMR spectroscopy, small-angle X-ray scattering (SAXS), chemical cross-linking, and enzymatic assays to enlarge our understanding of the interactions of IscX with iron ions, IscU, and IscS. We used chemical shift perturbation to identify the binding interfaces of IscX and IscU in their complex. NMR studies showed that Fe2+ from added ferrous ammonium sulfate binds IscX much more avidly than does Fe3+ from added ferric ammonium citrate and that Fe2+ strengthens the interaction between IscX and IscU. We found that the addition of IscX to the IscU–IscS binary complex led to the formation of a ternary complex with reduced cysteine desulfurase activity, and we determined a low-resolution model for that complex from a combination of NMR and SAXS data. We postulate that the inhibition of cysteine desulfurase activity by IscX serves to reduce unproductive conversion of cysteine to alanine. By incorporating these new findings with results from prior studies, we propose a detailed mechanism for Fe–S cluster assembly in which IscX serves both as a donor of Fe2+ and as a regulator of cysteine desulfurase activity.

  14. Controlled expression of nif and isc iron-sulfur protein maturation components reveals target specificity and limited functional replacement between the two systems.

    Science.gov (United States)

    Dos Santos, Patricia C; Johnson, Deborah C; Ragle, Brook E; Unciuleac, Mihaela-Carmen; Dean, Dennis R

    2007-04-01

    The nitrogen-fixing organism Azotobacter vinelandii contains at least two systems that catalyze formation of [Fe-S] clusters. One of these systems is encoded by nif genes, whose products supply [Fe-S] clusters required for maturation of nitrogenase. The other system is encoded by isc genes, whose products are required for maturation of [Fe-S] proteins that participate in general metabolic processes. The two systems are similar in that they include an enzyme for the mobilization of sulfur (NifS or IscS) and an assembly scaffold (NifU or IscU) upon which [Fe-S] clusters are formed. Normal cellular levels of the Nif system, which supplies [Fe-S] clusters for the maturation of nitrogenase, cannot also supply [Fe-S] clusters for the maturation of other cellular [Fe-S] proteins. Conversely, when produced at the normal physiological levels, the Isc system cannot supply [Fe-S] clusters for the maturation of nitrogenase. In the present work we found that such target specificity for IscU can be overcome by elevated production of NifU. We also found that NifU, when expressed at normal levels, is able to partially replace the function of IscU if cells are cultured under low-oxygen-availability conditions. In contrast to the situation with IscU, we could not establish conditions in which the function of IscS could be replaced by NifS. We also found that elevated expression of the Isc components, as a result of deletion of the regulatory iscR gene, improved the capacity for nitrogen-fixing growth of strains deficient in either NifU or NifS.

  15. A missed Fe-S cluster handoff causes a metabolic shakeup.

    Science.gov (United States)

    Berteau, Olivier

    2018-05-25

    The general framework of pathways by which iron-sulfur (Fe-S) clusters are assembled in cells is well-known, but the cellular consequences of disruptions to that framework are not fully understood. Crooks et al. report a novel cellular system that creates an acute Fe-S cluster deficiency, using mutants of ISCU, the main scaffold protein for Fe-S cluster assembly. Surprisingly, the resultant metabolic reprogramming leads to the accumulation of lipid droplets, a situation encountered in many poorly understood pathological conditions, highlighting unanticipated links between Fe-S assembly machinery and human disease. © 2018 Berteau.

  16. In vivo fluorescent detection of Fe-S clusters coordinated by human GRX2.

    Science.gov (United States)

    Hoff, Kevin G; Culler, Stephanie J; Nguyen, Peter Q; McGuire, Ryan M; Silberg, Jonathan J; Smolke, Christina D

    2009-12-24

    A major challenge to studying Fe-S cluster biosynthesis in higher eukaryotes is the lack of simple tools for imaging metallocluster binding to proteins. We describe the first fluorescent approach for in vivo detection of 2Fe2S clusters that is based upon the complementation of Venus fluorescent protein fragments via human glutaredoxin 2 (GRX2) coordination of a 2Fe2S cluster. We show that Escherichia coli and mammalian cells expressing Venus fragments fused to GRX2 exhibit greater fluorescence than cells expressing fragments fused to a C37A mutant that cannot coordinate a metallocluster. In addition, we find that maximal fluorescence in the cytosol of mammalian cells requires the iron-sulfur cluster assembly proteins ISCU and NFS1. These findings provide evidence that glutaredoxins can dimerize within mammalian cells through coordination of a 2Fe2S cluster as observed with purified recombinant proteins. Copyright 2009 Elsevier Ltd. All rights reserved.

  17. Defining the Architecture of the Core Machinery for the Assembly of Fe-S Clusters in Human Mitochondria.

    Science.gov (United States)

    Gakh, Oleksandr; Ranatunga, Wasantha; Galeano, Belinda K; Smith, Douglas S; Thompson, James R; Isaya, Grazia

    2017-01-01

    Although Fe-S clusters may assemble spontaneously from elemental iron and sulfur in protein-free systems, the potential toxicity of free Fe 2+ , Fe 3+ , and S 2- ions in aerobic environments underscores the requirement for specialized proteins to oversee the safe assembly of Fe-S clusters in living cells. Prokaryotes first developed multiprotein systems for Fe-S cluster assembly, from which mitochondria later derived their own system and became the main Fe-S cluster suppliers for eukaryotic cells. Early studies in yeast and human mitochondria indicated that Fe-S cluster assembly in eukaryotes is centered around highly conserved Fe-S proteins (human ISCU) that serve as scaffolds upon which new Fe-S clusters are assembled from (i) elemental sulfur, provided by a pyridoxal phosphate-dependent cysteine desulfurase (human NFS1) and its stabilizing-binding partner (human ISD11), and (ii) elemental iron, provided by an iron-binding protein of the frataxin family (human FXN). Further studies revealed that all of these proteins could form stable complexes that could reach molecular masses of megadaltons. However, the protein-protein interaction surfaces, catalytic mechanisms, and overall architecture of these macromolecular machines remained undefined for quite some time. The delay was due to difficulties inherent in reconstituting these very large multiprotein complexes in vitro or isolating them from cells in sufficient quantities to enable biochemical and structural studies. Here, we describe approaches we developed to reconstitute the human Fe-S cluster assembly machinery in Escherichia coli and to define its remarkable architecture. © 2017 Elsevier Inc. All rights reserved.

  18. Disease: H00898 [KEGG MEDICUS

    Lifescience Database Archive (English)

    Full Text Available ice mutation in the iron-sulfur cluster scaffold protein ISCU causes myopathy with exercise intolerance. ... JOURNAL ... Am J Hum Genet 82:652-60 (2008) DOI:10.1016/j.ajhg.2007.12.012

  19. Regulation of human Nfu activity in Fe-S cluster delivery-characterization of the interaction between Nfu and the HSPA9/Hsc20 chaperone complex.

    Science.gov (United States)

    Wachnowsky, Christine; Liu, Yushi; Yoon, Taejin; Cowan, J A

    2018-01-01

    Iron-sulfur cluster biogenesis is a complex, but highly regulated process that involves de novo cluster formation from iron and sulfide ions on a scaffold protein, and subsequent delivery to final targets via a series of Fe-S cluster-binding carrier proteins. The process of cluster release from the scaffold/carrier for transfer to the target proteins may be mediated by a dedicated Fe-S cluster chaperone system. In human cells, the chaperones include heat shock protein HSPA9 and the J-type chaperone Hsc20. While the role of chaperones has been somewhat clarified in yeast and bacterial systems, many questions remain over their functional roles in cluster delivery and interactions with a variety of human Fe-S cluster proteins. One such protein, Nfu, has recently been recognized as a potential interaction partner of the chaperone complex. Herein, we examined the ability of human Nfu to function as a carrier by interacting with the human chaperone complex. Human Nfu is shown to bind to both chaperone proteins with binding affinities similar to those observed for IscU binding to the homologous HSPA9 and Hsc20, while Nfu can also stimulate the ATPase activity of HSPA9. Additionally, the chaperone complex was able to promote Nfu function by enhancing the second-order rate constants for Fe-S cluster transfer to target proteins and providing directionality in cluster transfer from Nfu by eliminating promiscuous transfer reactions. Together, these data support a hypothesis in which Nfu can serve as an alternative carrier protein for chaperone-mediated cluster release and delivery in Fe-S cluster biogenesis and trafficking. © 2017 Federation of European Biochemical Societies.

  20. Anaerobic Copper Toxicity and Iron-Sulfur Cluster Biogenesis in Escherichia coli.

    Science.gov (United States)

    Tan, Guoqiang; Yang, Jing; Li, Tang; Zhao, Jin; Sun, Shujuan; Li, Xiaokang; Lin, Chuxian; Li, Jianghui; Zhou, Huaibin; Lyu, Jianxin; Ding, Huangen

    2017-08-15

    While copper is an essential trace element in biology, pollution of groundwater from copper has become a threat to all living organisms. Cellular mechanisms underlying copper toxicity, however, are still not fully understood. Previous studies have shown that iron-sulfur proteins are among the primary targets of copper toxicity in Escherichia coli under aerobic conditions. Here, we report that, under anaerobic conditions, iron-sulfur proteins in E. coli cells are even more susceptible to copper in medium. Whereas addition of 0.2 mM copper(II) chloride to LB (Luria-Bertani) medium has very little or no effect on iron-sulfur proteins in wild-type E. coli cells under aerobic conditions, the same copper treatment largely inactivates iron-sulfur proteins by blocking iron-sulfur cluster biogenesis in the cells under anaerobic conditions. Importantly, proteins that do not have iron-sulfur clusters (e.g., fumarase C and cysteine desulfurase) in E. coli cells are not significantly affected by copper treatment under aerobic or anaerobic conditions, indicating that copper may specifically target iron-sulfur proteins in cells. Additional studies revealed that E. coli cells accumulate more intracellular copper under anaerobic conditions than under aerobic conditions and that the elevated copper content binds to the iron-sulfur cluster assembly proteins IscU and IscA, which effectively inhibits iron-sulfur cluster biogenesis. The results suggest that the copper-mediated inhibition of iron-sulfur proteins does not require oxygen and that iron-sulfur cluster biogenesis is the primary target of anaerobic copper toxicity in cells. IMPORTANCE Copper contamination in groundwater has become a threat to all living organisms. However, cellular mechanisms underlying copper toxicity have not been fully understood up to now. The work described here reveals that iron-sulfur proteins in Escherichia coli cells are much more susceptible to copper in medium under anaerobic conditions than they

  1. Detection of protein complex from protein-protein interaction network using Markov clustering

    International Nuclear Information System (INIS)

    Ochieng, P J; Kusuma, W A; Haryanto, T

    2017-01-01

    Detection of complexes, or groups of functionally related proteins, is an important challenge while analysing biological networks. However, existing algorithms to identify protein complexes are insufficient when applied to dense networks of experimentally derived interaction data. Therefore, we introduced a graph clustering method based on Markov clustering algorithm to identify protein complex within highly interconnected protein-protein interaction networks. Protein-protein interaction network was first constructed to develop geometrical network, the network was then partitioned using Markov clustering to detect protein complexes. The interest of the proposed method was illustrated by its application to Human Proteins associated to type II diabetes mellitus. Flow simulation of MCL algorithm was initially performed and topological properties of the resultant network were analysed for detection of the protein complex. The results indicated the proposed method successfully detect an overall of 34 complexes with 11 complexes consisting of overlapping modules and 20 non-overlapping modules. The major complex consisted of 102 proteins and 521 interactions with cluster modularity and density of 0.745 and 0.101 respectively. The comparison analysis revealed MCL out perform AP, MCODE and SCPS algorithms with high clustering coefficient (0.751) network density and modularity index (0.630). This demonstrated MCL was the most reliable and efficient graph clustering algorithm for detection of protein complexes from PPI networks. (paper)

  2. Do protein crystals nucleate within dense liquid clusters?

    International Nuclear Information System (INIS)

    Maes, Dominique; Vorontsova, Maria A.; Potenza, Marco A. C.; Sanvito, Tiziano; Sleutel, Mike; Giglio, Marzio; Vekilov, Peter G.

    2015-01-01

    The evolution of protein-rich clusters and nucleating crystals were characterized by dynamic light scattering (DLS), confocal depolarized dynamic light scattering (cDDLS) and depolarized oblique illumination dark-field microscopy. Newly nucleated crystals within protein-rich clusters were detected directly. These observations indicate that the protein-rich clusters are locations for crystal nucleation. Protein-dense liquid clusters are regions of high protein concentration that have been observed in solutions of several proteins. The typical cluster size varies from several tens to several hundreds of nanometres and their volume fraction remains below 10 −3 of the solution. According to the two-step mechanism of nucleation, the protein-rich clusters serve as locations for and precursors to the nucleation of protein crystals. While the two-step mechanism explained several unusual features of protein crystal nucleation kinetics, a direct observation of its validity for protein crystals has been lacking. Here, two independent observations of crystal nucleation with the proteins lysozyme and glucose isomerase are discussed. Firstly, the evolutions of the protein-rich clusters and nucleating crystals were characterized simultaneously by dynamic light scattering (DLS) and confocal depolarized dynamic light scattering (cDDLS), respectively. It is demonstrated that protein crystals appear following a significant delay after cluster formation. The cDDLS correlation functions follow a Gaussian decay, indicative of nondiffusive motion. A possible explanation is that the crystals are contained inside large clusters and are driven by the elasticity of the cluster surface. Secondly, depolarized oblique illumination dark-field microscopy reveals the evolution from liquid clusters without crystals to newly nucleated crystals contained in the clusters to grown crystals freely diffusing in the solution. Collectively, the observations indicate that the protein-rich clusters in

  3. Structure based alignment and clustering of proteins (STRALCP)

    Science.gov (United States)

    Zemla, Adam T.; Zhou, Carol E.; Smith, Jason R.; Lam, Marisa W.

    2013-06-18

    Disclosed are computational methods of clustering a set of protein structures based on local and pair-wise global similarity values. Pair-wise local and global similarity values are generated based on pair-wise structural alignments for each protein in the set of protein structures. Initially, the protein structures are clustered based on pair-wise local similarity values. The protein structures are then clustered based on pair-wise global similarity values. For each given cluster both a representative structure and spans of conserved residues are identified. The representative protein structure is used to assign newly-solved protein structures to a group. The spans are used to characterize conservation and assign a "structural footprint" to the cluster.

  4. K-nearest uphill clustering in the protein structure space

    KAUST Repository

    Cui, Xuefeng

    2016-08-26

    The protein structure classification problem, which is to assign a protein structure to a cluster of similar proteins, is one of the most fundamental problems in the construction and application of the protein structure space. Early manually curated protein structure classifications (e.g., SCOP and CATH) are very successful, but recently suffer the slow updating problem because of the increased throughput of newly solved protein structures. Thus, fully automatic methods to cluster proteins in the protein structure space have been designed and developed. In this study, we observed that the SCOP superfamilies are highly consistent with clustering trees representing hierarchical clustering procedures, but the tree cutting is very challenging and becomes the bottleneck of clustering accuracy. To overcome this challenge, we proposed a novel density-based K-nearest uphill clustering method that effectively eliminates noisy pairwise protein structure similarities and identifies density peaks as cluster centers. Specifically, the density peaks are identified based on K-nearest uphills (i.e., proteins with higher densities) and K-nearest neighbors. To our knowledge, this is the first attempt to apply and develop density-based clustering methods in the protein structure space. Our results show that our density-based clustering method outperforms the state-of-the-art clustering methods previously applied to the problem. Moreover, we observed that computational methods and human experts could produce highly similar clusters at high precision values, while computational methods also suggest to split some large superfamilies into smaller clusters. © 2016 Elsevier B.V.

  5. Protein sequences clustering of herpes virus by using Tribe Markov clustering (Tribe-MCL)

    Science.gov (United States)

    Bustamam, A.; Siswantining, T.; Febriyani, N. L.; Novitasari, I. D.; Cahyaningrum, R. D.

    2017-07-01

    The herpes virus can be found anywhere and one of the important characteristics is its ability to cause acute and chronic infection at certain times so as a result of the infection allows severe complications occurred. The herpes virus is composed of DNA containing protein and wrapped by glycoproteins. In this work, the Herpes viruses family is classified and analyzed by clustering their protein-sequence using Tribe Markov Clustering (Tribe-MCL) algorithm. Tribe-MCL is an efficient clustering method based on the theory of Markov chains, to classify protein families from protein sequences using pre-computed sequence similarity information. We implement the Tribe-MCL algorithm using an open source program of R. We select 24 protein sequences of Herpes virus obtained from NCBI database. The dataset consists of three types of glycoprotein B, F, and H. Each type has eight herpes virus that infected humans. Based on our simulation using different inflation factor r=1.5, 2, 3 we find a various number of the clusters results. The greater the inflation factor the greater the number of their clusters. Each protein will grouped together in the same type of protein.

  6. Spectromicroscopy of self-assembled protein clusters

    Energy Technology Data Exchange (ETDEWEB)

    Schonschek, O.; Hormes, J.; Herzog, V. [Univ. of Bonn (Germany)

    1997-04-01

    The aim of this project is to use synchrotron radiation as a tool to study biomedical questions concerned with the thyroid glands. The biological background is outlined in a recent paper. In short, Thyroglobulin (TG), the precursor protein of the hormone thyroxine, forms large (20 - 500 microns in diameter) clusters in the extracellular lumen of thyrocytes. The process of the cluster formation is still not well understood but is thought to be a main storage mechanism of TG and therefore thyroxine inside the thyroid glands. For human thyroids, the interconnections of the proteins inside the clusters are mainly disulfide bondings. Normally, sulfur bridges are catalyzed by an enzyme called Protein Disulfide Bridge Isomerase (PDI). While this enzyme is supposed to be not present in any extracellular space, the cluster formation of TG takes place in the lumen between the thyrocytes. A possible explanation is the autocatalysis of TG.

  7. Characterization of an M-Cluster-Substituted Nitrogenase VFe Protein.

    Science.gov (United States)

    Rebelein, Johannes G; Lee, Chi Chung; Newcomb, Megan; Hu, Yilin; Ribbe, Markus W

    2018-03-13

    The Mo- and V-nitrogenases are two homologous members of the nitrogenase family that are distinguished mainly by the presence of different heterometals (Mo or V) at their respective cofactor sites (M- or V-cluster). However, the V-nitrogenase is ~600-fold more active than its Mo counterpart in reducing CO to hydrocarbons at ambient conditions. Here, we expressed an M-cluster-containing, hybrid V-nitrogenase in Azotobacter vinelandii and compared it to its native, V-cluster-containing counterpart in order to assess the impact of protein scaffold and cofactor species on the differential reactivities of Mo- and V-nitrogenases toward CO. Housed in the VFe protein component of V-nitrogenase, the M-cluster displayed electron paramagnetic resonance (EPR) features similar to those of the V-cluster and demonstrated an ~100-fold increase in hydrocarbon formation activity from CO reduction, suggesting a significant impact of protein environment on the overall CO-reducing activity of nitrogenase. On the other hand, the M-cluster was still ~6-fold less active than the V-cluster in the same protein scaffold, and it retained its inability to form detectable amounts of methane from CO reduction, illustrating a fine-tuning effect of the cofactor properties on this nitrogenase-catalyzed reaction. Together, these results provided important insights into the two major determinants for the enzymatic activity of CO reduction while establishing a useful framework for further elucidation of the essential catalytic elements for the CO reactivity of nitrogenase. IMPORTANCE This is the first report on the in vivo generation and in vitro characterization of an M-cluster-containing V-nitrogenase hybrid. The "normalization" of the protein scaffold to that of the V-nitrogenase permits a direct comparison between the cofactor species of the Mo- and V-nitrogenases (M- and V-clusters) in CO reduction, whereas the discrepancy between the protein scaffolds of the Mo- and V-nitrogenases (MoFe and VFe

  8. Supported silver clusters as nanoplasmonic transducers for protein sensing

    DEFF Research Database (Denmark)

    Fojan, Peter; Hanif, Muhammad; Bartling, Stephen

    2015-01-01

    Transducers for optical sensing of proteins are prepared using cluster beam deposition on quartz substrates. Surface plasmon resonance phenomenon of the supported silver clusters is used for the detection. It is shown that surface immobilisation procedure providing adhesion of the silver clusters...... stages and protein immobilisation scheme the sensing of protein of interest can be assured using a relatively simple optical spectroscopy method....... an enhancement of the plasmon absorption band used for the detection. Atomic force microscopy study allows to suggest that immobilisation of antibodies on silver clusters has been achieved, thus giving a possibility to incubate and detect an antigen of interest. Hence, by applying the developed preparation...

  9. Turning Escherichia coli into a Frataxin-Dependent Organism.

    Directory of Open Access Journals (Sweden)

    Béatrice Roche

    2015-05-01

    Full Text Available Fe-S bound proteins are ubiquitous and contribute to most basic cellular processes. A defect in the ISC components catalyzing Fe-S cluster biogenesis leads to drastic phenotypes in both eukaryotes and prokaryotes. In this context, the Frataxin protein (FXN stands out as an exception. In eukaryotes, a defect in FXN results in severe defects in Fe-S cluster biogenesis, and in humans, this is associated with Friedreich's ataxia, a neurodegenerative disease. In contrast, prokaryotes deficient in the FXN homolog CyaY are fully viable, despite the clear involvement of CyaY in ISC-catalyzed Fe-S cluster formation. The molecular basis of the differing importance in the contribution of FXN remains enigmatic. Here, we have demonstrated that a single mutation in the scaffold protein IscU rendered E. coli viability strictly dependent upon a functional CyaY. Remarkably, this mutation changed an Ile residue, conserved in prokaryotes at position 108, into a Met residue, conserved in eukaryotes. We found that in the double mutant IscUIM ΔcyaY, the ISC pathway was completely abolished, becoming equivalent to the ΔiscU deletion strain and recapitulating the drastic phenotype caused by FXN deletion in eukaryotes. Biochemical analyses of the "eukaryotic-like" IscUIM scaffold revealed that it exhibited a reduced capacity to form Fe-S clusters. Finally, bioinformatic studies of prokaryotic IscU proteins allowed us to trace back the source of FXN-dependency as it occurs in present-day eukaryotes. We propose an evolutionary scenario in which the current mitochondrial Isu proteins originated from the IscUIM version present in the ancestor of the Rickettsiae. Subsequent acquisition of SUF, the second Fe-S cluster biogenesis system, in bacteria, was accompanied by diminished contribution of CyaY in prokaryotic Fe-S cluster biogenesis, and increased tolerance to change in the amino acid present at the 108th position of the scaffold.

  10. Evaluation of clustering algorithms for protein-protein interaction networks

    Directory of Open Access Journals (Sweden)

    van Helden Jacques

    2006-11-01

    Full Text Available Abstract Background Protein interactions are crucial components of all cellular processes. Recently, high-throughput methods have been developed to obtain a global description of the interactome (the whole network of protein interactions for a given organism. In 2002, the yeast interactome was estimated to contain up to 80,000 potential interactions. This estimate is based on the integration of data sets obtained by various methods (mass spectrometry, two-hybrid methods, genetic studies. High-throughput methods are known, however, to yield a non-negligible rate of false positives, and to miss a fraction of existing interactions. The interactome can be represented as a graph where nodes correspond with proteins and edges with pairwise interactions. In recent years clustering methods have been developed and applied in order to extract relevant modules from such graphs. These algorithms require the specification of parameters that may drastically affect the results. In this paper we present a comparative assessment of four algorithms: Markov Clustering (MCL, Restricted Neighborhood Search Clustering (RNSC, Super Paramagnetic Clustering (SPC, and Molecular Complex Detection (MCODE. Results A test graph was built on the basis of 220 complexes annotated in the MIPS database. To evaluate the robustness to false positives and false negatives, we derived 41 altered graphs by randomly removing edges from or adding edges to the test graph in various proportions. Each clustering algorithm was applied to these graphs with various parameter settings, and the clusters were compared with the annotated complexes. We analyzed the sensitivity of the algorithms to the parameters and determined their optimal parameter values. We also evaluated their robustness to alterations of the test graph. We then applied the four algorithms to six graphs obtained from high-throughput experiments and compared the resulting clusters with the annotated complexes. Conclusion This

  11. The hybrid-cluster protein ('prismane protein') from Escherichia coli. Characterization of the hybrid-cluster protein, redox properties of the [2Fe-2S] and [4Fe-2S-2O] clusters and identification of an associated NADH oxidoreductase containing FAD and[2Fe-2S

    NARCIS (Netherlands)

    Berg, van den W.A.M.; Hagen, W.R.; Dongen, van W.M.A.M.

    2000-01-01

    Hybrid-cluster proteins ('prismane proteins') have previously been isolated and characterized from strictly anaerobic sulfate-reducing bacteria. These proteins contain two types of Fe/S clusters unique in biological systems: a [4Fe-4S] cubane cluster with spin-admixed S = 3/2 ground-state

  12. Detection of secondary structure elements in proteins by hydrophobic cluster analysis.

    Science.gov (United States)

    Woodcock, S; Mornon, J P; Henrissat, B

    1992-10-01

    Hydrophobic cluster analysis (HCA) is a protein sequence comparison method based on alpha-helical representations of the sequences where the size, shape and orientation of the clusters of hydrophobic residues are primarily compared. The effectiveness of HCA has been suggested to originate from its potential ability to focus on the residues forming the hydrophobic core of globular proteins. We have addressed the robustness of the bidimensional representation used for HCA in its ability to detect the regular secondary structure elements of proteins. Various parameters have been studied such as those governing cluster size and limits, the hydrophobic residues constituting the clusters as well as the potential shift of the cluster positions with respect to the position of the regular secondary structure elements. The following results have been found to support the alpha-helical bidimensional representation used in HCA: (i) there is a positive correlation (clearly above background noise) between the hydrophobic clusters and the regular secondary structure elements in proteins; (ii) the hydrophobic clusters are centred on the regular secondary structure elements; (iii) the pitch of the helical representation which gives the best correspondence is that of an alpha-helix. The correspondence between hydrophobic clusters and regular secondary structure elements suggests a way to implement variable gap penalties during the automatic alignment of protein sequences.

  13. Network based approaches reveal clustering in protein point patterns

    Science.gov (United States)

    Parker, Joshua; Barr, Valarie; Aldridge, Joshua; Samelson, Lawrence E.; Losert, Wolfgang

    2014-03-01

    Recent advances in super-resolution imaging have allowed for the sub-diffraction measurement of the spatial location of proteins on the surfaces of T-cells. The challenge is to connect these complex point patterns to the internal processes and interactions, both protein-protein and protein-membrane. We begin analyzing these patterns by forming a geometric network amongst the proteins and looking at network measures, such the degree distribution. This allows us to compare experimentally observed patterns to models. Specifically, we find that the experimental patterns differ from heterogeneous Poisson processes, highlighting an internal clustering structure. Further work will be to compare our results to simulated protein-protein interactions to determine clustering mechanisms.

  14. Clustering evolving proteins into homologous families.

    Science.gov (United States)

    Chan, Cheong Xin; Mahbob, Maisarah; Ragan, Mark A

    2013-04-08

    Clustering sequences into groups of putative homologs (families) is a critical first step in many areas of comparative biology and bioinformatics. The performance of clustering approaches in delineating biologically meaningful families depends strongly on characteristics of the data, including content bias and degree of divergence. New, highly scalable methods have recently been introduced to cluster the very large datasets being generated by next-generation sequencing technologies. However, there has been little systematic investigation of how characteristics of the data impact the performance of these approaches. Using clusters from a manually curated dataset as reference, we examined the performance of a widely used graph-based Markov clustering algorithm (MCL) and a greedy heuristic approach (UCLUST) in delineating protein families coded by three sets of bacterial genomes of different G+C content. Both MCL and UCLUST generated clusters that are comparable to the reference sets at specific parameter settings, although UCLUST tends to under-cluster compositionally biased sequences (G+C content 33% and 66%). Using simulated data, we sought to assess the individual effects of sequence divergence, rate heterogeneity, and underlying G+C content. Performance decreased with increasing sequence divergence, decreasing among-site rate variation, and increasing G+C bias. Two MCL-based methods recovered the simulated families more accurately than did UCLUST. MCL using local alignment distances is more robust across the investigated range of sequence features than are greedy heuristics using distances based on global alignment. Our results demonstrate that sequence divergence, rate heterogeneity and content bias can individually and in combination affect the accuracy with which MCL and UCLUST can recover homologous protein families. For application to data that are more divergent, and exhibit higher among-site rate variation and/or content bias, MCL may often be the better

  15. Effect of mitochondrial complex I inhibition on Fe-S cluster protein activity

    Energy Technology Data Exchange (ETDEWEB)

    Mena, Natalia P. [Department of Biology, Faculty of Sciences, Universidad de Chile, Las Palmeras 3425, Santiago (Chile); Millennium Institute of Cell Dynamics and Biotechnology, Santiago (Chile); Bulteau, Anne Laure [UPMC Univ Paris 06, UMRS 975 - UMR 7725, Centre de Recherche en Neurosciences, ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, F-75005 Paris (France); Inserm, U 975, Centre de Recherche en Neurosciences, ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, F-75005 Paris (France); CNRS, UMR 7225, Centre de Recherche en Neurosciences, ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, F-75005 Paris (France); ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, Paris 75013 (France); Salazar, Julio [Millennium Institute of Cell Dynamics and Biotechnology, Santiago (Chile); Hirsch, Etienne C. [UPMC Univ Paris 06, UMRS 975 - UMR 7725, Centre de Recherche en Neurosciences, ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, F-75005 Paris (France); Inserm, U 975, Centre de Recherche en Neurosciences, ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, F-75005 Paris (France); CNRS, UMR 7225, Centre de Recherche en Neurosciences, ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, F-75005 Paris (France); ICM, Therapeutique Experimentale de la Neurodegenerescence, Hopital de la Salpetriere, Paris 75013 (France); Nunez, Marco T., E-mail: mnunez@uchile.cl [Department of Biology, Faculty of Sciences, Universidad de Chile, Las Palmeras 3425, Santiago (Chile); Millennium Institute of Cell Dynamics and Biotechnology, Santiago (Chile)

    2011-06-03

    Highlights: {yields} Mitochondrial complex I inhibition resulted in decreased activity of Fe-S containing enzymes mitochondrial aconitase and cytoplasmic aconitase and xanthine oxidase. {yields} Complex I inhibition resulted in the loss of Fe-S clusters in cytoplasmic aconitase and of glutamine phosphoribosyl pyrophosphate amidotransferase. {yields} Consistent with loss of cytoplasmic aconitase activity, an increase in iron regulatory protein 1 activity was found. {yields} Complex I inhibition resulted in an increase in the labile cytoplasmic iron pool. -- Abstract: Iron-sulfur (Fe-S) clusters are small inorganic cofactors formed by tetrahedral coordination of iron atoms with sulfur groups. Present in numerous proteins, these clusters are involved in key biological processes such as electron transfer, metabolic and regulatory processes, DNA synthesis and repair and protein structure stabilization. Fe-S clusters are synthesized mainly in the mitochondrion, where they are directly incorporated into mitochondrial Fe-S cluster-containing proteins or exported for cytoplasmic and nuclear cluster-protein assembly. In this study, we tested the hypothesis that inhibition of mitochondrial complex I by rotenone decreases Fe-S cluster synthesis and cluster content and activity of Fe-S cluster-containing enzymes. Inhibition of complex I resulted in decreased activity of three Fe-S cluster-containing enzymes: mitochondrial and cytosolic aconitases and xanthine oxidase. In addition, the Fe-S cluster content of glutamine phosphoribosyl pyrophosphate amidotransferase and mitochondrial aconitase was dramatically decreased. The reduction in cytosolic aconitase activity was associated with an increase in iron regulatory protein (IRP) mRNA binding activity and with an increase in the cytoplasmic labile iron pool. Since IRP activity post-transcriptionally regulates the expression of iron import proteins, Fe-S cluster inhibition may result in a false iron deficiency signal. Given that

  16. Effect of mitochondrial complex I inhibition on Fe-S cluster protein activity

    International Nuclear Information System (INIS)

    Mena, Natalia P.; Bulteau, Anne Laure; Salazar, Julio; Hirsch, Etienne C.; Nunez, Marco T.

    2011-01-01

    Highlights: → Mitochondrial complex I inhibition resulted in decreased activity of Fe-S containing enzymes mitochondrial aconitase and cytoplasmic aconitase and xanthine oxidase. → Complex I inhibition resulted in the loss of Fe-S clusters in cytoplasmic aconitase and of glutamine phosphoribosyl pyrophosphate amidotransferase. → Consistent with loss of cytoplasmic aconitase activity, an increase in iron regulatory protein 1 activity was found. → Complex I inhibition resulted in an increase in the labile cytoplasmic iron pool. -- Abstract: Iron-sulfur (Fe-S) clusters are small inorganic cofactors formed by tetrahedral coordination of iron atoms with sulfur groups. Present in numerous proteins, these clusters are involved in key biological processes such as electron transfer, metabolic and regulatory processes, DNA synthesis and repair and protein structure stabilization. Fe-S clusters are synthesized mainly in the mitochondrion, where they are directly incorporated into mitochondrial Fe-S cluster-containing proteins or exported for cytoplasmic and nuclear cluster-protein assembly. In this study, we tested the hypothesis that inhibition of mitochondrial complex I by rotenone decreases Fe-S cluster synthesis and cluster content and activity of Fe-S cluster-containing enzymes. Inhibition of complex I resulted in decreased activity of three Fe-S cluster-containing enzymes: mitochondrial and cytosolic aconitases and xanthine oxidase. In addition, the Fe-S cluster content of glutamine phosphoribosyl pyrophosphate amidotransferase and mitochondrial aconitase was dramatically decreased. The reduction in cytosolic aconitase activity was associated with an increase in iron regulatory protein (IRP) mRNA binding activity and with an increase in the cytoplasmic labile iron pool. Since IRP activity post-transcriptionally regulates the expression of iron import proteins, Fe-S cluster inhibition may result in a false iron deficiency signal. Given that inhibition of complex

  17. K-nearest uphill clustering in the protein structure space

    KAUST Repository

    Cui, Xuefeng; Gao, Xin

    2016-01-01

    The protein structure classification problem, which is to assign a protein structure to a cluster of similar proteins, is one of the most fundamental problems in the construction and application of the protein structure space. Early manually curated

  18. Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

    Directory of Open Access Journals (Sweden)

    Li Weizhong

    2008-04-01

    Full Text Available Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net. Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.

  19. A hybrid clustering approach to recognition of protein families in 114 microbial genomes

    Directory of Open Access Journals (Sweden)

    Gogarten J Peter

    2004-04-01

    Full Text Available Abstract Background Grouping proteins into sequence-based clusters is a fundamental step in many bioinformatic analyses (e.g., homology-based prediction of structure or function. Standard clustering methods such as single-linkage clustering capture a history of cluster topologies as a function of threshold, but in practice their usefulness is limited because unrelated sequences join clusters before biologically meaningful families are fully constituted, e.g. as the result of matches to so-called promiscuous domains. Use of the Markov Cluster algorithm avoids this non-specificity, but does not preserve topological or threshold information about protein families. Results We describe a hybrid approach to sequence-based clustering of proteins that combines the advantages of standard and Markov clustering. We have implemented this hybrid approach over a relational database environment, and describe its application to clustering a large subset of PDB, and to 328577 proteins from 114 fully sequenced microbial genomes. To demonstrate utility with difficult problems, we show that hybrid clustering allows us to constitute the paralogous family of ATP synthase F1 rotary motor subunits into a single, biologically interpretable hierarchical grouping that was not accessible using either single-linkage or Markov clustering alone. We describe validation of this method by hybrid clustering of PDB and mapping SCOP families and domains onto the resulting clusters. Conclusion Hybrid (Markov followed by single-linkage clustering combines the advantages of the Markov Cluster algorithm (avoidance of non-specific clusters resulting from matches to promiscuous domains and single-linkage clustering (preservation of topological information as a function of threshold. Within the individual Markov clusters, single-linkage clustering is a more-precise instrument, discerning sub-clusters of biological relevance. Our hybrid approach thus provides a computationally efficient

  20. Site-directed mutagenesis of Azotobacter vinelandii ferredoxin I: [Fe-S] cluster-driven protein rearrangement

    International Nuclear Information System (INIS)

    Martin, A.E.; Burgess, B.K.; Stout, C.D.; Cash, V.L.; Dean, D.R.; Jensen, G.M.; Stephens, P.J.

    1990-01-01

    Azotobacter vinelandii ferredoxin I is a small protein that contains one [4Fe-4S] cluster and one [3Fe-4S] cluster. Recently the x-ray crystal structure has been redetermined and the fdxA gene, which encodes the protein, has been cloned and sequenced. Here the authors report the site-directed mutation of Cys-20, which is a ligand of the [4Fe-4S] cluster in the native protein, to alanine and the characterization of the protein product by x-ray crystallographic and spectroscopic methods. The data show that the mutant protein again contains one [4Fe-4S] cluster and one [3Fe-4S] cluster. The new [4Fe-4S] cluster obtains its fourth ligand from Cys-24, a free cysteine in the native structure. The formation of this [4Fe-4S] cluster drives rearrangement of the protein structure

  1. Predicting protein complexes from weighted protein-protein interaction graphs with a novel unsupervised methodology: Evolutionary enhanced Markov clustering.

    Science.gov (United States)

    Theofilatos, Konstantinos; Pavlopoulou, Niki; Papasavvas, Christoforos; Likothanassis, Spiros; Dimitrakopoulos, Christos; Georgopoulos, Efstratios; Moschopoulos, Charalampos; Mavroudi, Seferina

    2015-03-01

    Proteins are considered to be the most important individual components of biological systems and they combine to form physical protein complexes which are responsible for certain molecular functions. Despite the large availability of protein-protein interaction (PPI) information, not much information is available about protein complexes. Experimental methods are limited in terms of time, efficiency, cost and performance constraints. Existing computational methods have provided encouraging preliminary results, but they phase certain disadvantages as they require parameter tuning, some of them cannot handle weighted PPI data and others do not allow a protein to participate in more than one protein complex. In the present paper, we propose a new fully unsupervised methodology for predicting protein complexes from weighted PPI graphs. The proposed methodology is called evolutionary enhanced Markov clustering (EE-MC) and it is a hybrid combination of an adaptive evolutionary algorithm and a state-of-the-art clustering algorithm named enhanced Markov clustering. EE-MC was compared with state-of-the-art methodologies when applied to datasets from the human and the yeast Saccharomyces cerevisiae organisms. Using public available datasets, EE-MC outperformed existing methodologies (in some datasets the separation metric was increased by 10-20%). Moreover, when applied to new human datasets its performance was encouraging in the prediction of protein complexes which consist of proteins with high functional similarity. In specific, 5737 protein complexes were predicted and 72.58% of them are enriched for at least one gene ontology (GO) function term. EE-MC is by design able to overcome intrinsic limitations of existing methodologies such as their inability to handle weighted PPI networks, their constraint to assign every protein in exactly one cluster and the difficulties they face concerning the parameter tuning. This fact was experimentally validated and moreover, new

  2. RRW: repeated random walks on genome-scale protein networks for local cluster discovery

    Directory of Open Access Journals (Sweden)

    Can Tolga

    2009-09-01

    Full Text Available Abstract Background We propose an efficient and biologically sensitive algorithm based on repeated random walks (RRW for discovering functional modules, e.g., complexes and pathways, within large-scale protein networks. Compared to existing cluster identification techniques, RRW implicitly makes use of network topology, edge weights, and long range interactions between proteins. Results We apply the proposed technique on a functional network of yeast genes and accurately identify statistically significant clusters of proteins. We validate the biological significance of the results using known complexes in the MIPS complex catalogue database and well-characterized biological processes. We find that 90% of the created clusters have the majority of their catalogued proteins belonging to the same MIPS complex, and about 80% have the majority of their proteins involved in the same biological process. We compare our method to various other clustering techniques, such as the Markov Clustering Algorithm (MCL, and find a significant improvement in the RRW clusters' precision and accuracy values. Conclusion RRW, which is a technique that exploits the topology of the network, is more precise and robust in finding local clusters. In addition, it has the added flexibility of being able to find multi-functional proteins by allowing overlapping clusters.

  3. Lack of Dependence of the Sizes of the Mesoscopic Protein Clusters on Electrostatics.

    Science.gov (United States)

    Vorontsova, Maria A; Chan, Ho Yin; Lubchenko, Vassiliy; Vekilov, Peter G

    2015-11-03

    Protein-rich clusters of steady submicron size and narrow size distribution exist in protein solutions in apparent violation of the classical laws of phase equilibrium. Even though they contain a minor fraction of the total protein, evidence suggests that they may serve as essential precursors for the nucleation of ordered solids such as crystals, sickle-cell hemoglobin polymers, and amyloid fibrils. The cluster formation mechanism remains elusive. We use the highly basic protein lysozyme at nearly neutral and lower pH as a model and explore the response of the cluster population to the electrostatic forces, which govern numerous biophysical phenomena, including crystallization and fibrillization. We tune the strength of intermolecular electrostatic forces by varying the solution ionic strength I and pH and find that despite the weaker repulsion at higher I and pH, the cluster size remains constant. Cluster responses to the presence of urea and ethanol demonstrate that cluster formation is controlled by hydrophobic interactions between the peptide backbones, exposed to the solvent after partial protein unfolding that may lead to transient protein oligomers. These findings reveal that the mechanism of the mesoscopic clusters is fundamentally different from those underlying the two main classes of ordered protein solid phases, crystals and amyloid fibrils, and partial unfolding of the protein chain may play a significant role. Copyright © 2015 Biophysical Society. Published by Elsevier Inc. All rights reserved.

  4. Which clustering algorithm is better for predicting protein complexes?

    Directory of Open Access Journals (Sweden)

    Moschopoulos Charalampos N

    2011-12-01

    Full Text Available Abstract Background Protein-Protein interactions (PPI play a key role in determining the outcome of most cellular processes. The correct identification and characterization of protein interactions and the networks, which they comprise, is critical for understanding the molecular mechanisms within the cell. Large-scale techniques such as pull down assays and tandem affinity purification are used in order to detect protein interactions in an organism. Today, relatively new high-throughput methods like yeast two hybrid, mass spectrometry, microarrays, and phage display are also used to reveal protein interaction networks. Results In this paper we evaluated four different clustering algorithms using six different interaction datasets. We parameterized the MCL, Spectral, RNSC and Affinity Propagation algorithms and applied them to six PPI datasets produced experimentally by Yeast 2 Hybrid (Y2H and Tandem Affinity Purification (TAP methods. The predicted clusters, so called protein complexes, were then compared and benchmarked with already known complexes stored in published databases. Conclusions While results may differ upon parameterization, the MCL and RNSC algorithms seem to be more promising and more accurate at predicting PPI complexes. Moreover, they predict more complexes than other reviewed algorithms in absolute numbers. On the other hand the spectral clustering algorithm achieves the highest valid prediction rate in our experiments. However, it is nearly always outperformed by both RNSC and MCL in terms of the geometrical accuracy while it generates the fewest valid clusters than any other reviewed algorithm. This article demonstrates various metrics to evaluate the accuracy of such predictions as they are presented in the text below. Supplementary material can be found at: http://www.bioacademy.gr/bioinformatics/projects/ppireview.htm

  5. Clustering and visualizing similarity networks of membrane proteins.

    Science.gov (United States)

    Hu, Geng-Ming; Mai, Te-Lun; Chen, Chi-Ming

    2015-08-01

    We proposed a fast and unsupervised clustering method, minimum span clustering (MSC), for analyzing the sequence-structure-function relationship of biological networks, and demonstrated its validity in clustering the sequence/structure similarity networks (SSN) of 682 membrane protein (MP) chains. The MSC clustering of MPs based on their sequence information was found to be consistent with their tertiary structures and functions. For the largest seven clusters predicted by MSC, the consistency in chain function within the same cluster is found to be 100%. From analyzing the edge distribution of SSN for MPs, we found a characteristic threshold distance for the boundary between clusters, over which SSN of MPs could be properly clustered by an unsupervised sparsification of the network distance matrix. The clustering results of MPs from both MSC and the unsupervised sparsification methods are consistent with each other, and have high intracluster similarity and low intercluster similarity in sequence, structure, and function. Our study showed a strong sequence-structure-function relationship of MPs. We discussed evidence of convergent evolution of MPs and suggested applications in finding structural similarities and predicting biological functions of MP chains based on their sequence information. © 2015 Wiley Periodicals, Inc.

  6. Fast large-scale clustering of protein structures using Gauss integrals

    DEFF Research Database (Denmark)

    Harder, Tim; Borg, Mikael; Boomsma, Wouter

    2011-01-01

    trajectories. Results: We present Pleiades, a novel approach to clustering protein structures with a rigorous mathematical underpinning. The method approximates clustering based on the root mean square deviation by rst mapping structures to Gauss integral vectors – which were introduced by Røgen and co......-workers – and subsequently performing K-means clustering. Conclusions: Compared to current methods, Pleiades dramatically improves on the time needed to perform clustering, and can cluster a signicantly larger number of structures, while providing state-ofthe- art results. The number of low energy structures generated...

  7. Clusters of proteins in bio-membranes: insights into the roles of interaction potential shapes and of protein diversity

    OpenAIRE

    Meilhac, Nicolas; Destainville, Nicolas

    2011-01-01

    It has recently been proposed that proteins embedded in lipidic bio-membranes can spontaneously self-organize into stable small clusters, or membrane nano-domains, due to the competition between short-range attractive and longer-range repulsive forces between proteins, specific to these systems. In this paper, we carry on our investigation, by Monte Carlo simulations, of different aspects of cluster phases of proteins in bio-membranes. First, we compare different long-range potentials (includ...

  8. Proteins in similarity relationship with the cluster - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Gclust Server Proteins in similarity relationship with the cluster Data detail Data name Pro...teins in similarity relationship with the cluster DOI 10.18908/lsdba.nbdc00464-003 Description of data conte...s Proteins in similarity relationship with the cluster - Gclust Server | LSDB Archive ...

  9. Homo-FRET imaging as a tool to quantify protein and lipid clustering.

    Science.gov (United States)

    Bader, Arjen N; Hoetzl, Sandra; Hofman, Erik G; Voortman, Jarno; van Bergen en Henegouwen, Paul M P; van Meer, Gerrit; Gerritsen, Hans C

    2011-02-25

    Homo-FRET, Förster resonance energy transfer between identical fluorophores, can be conveniently measured by observing its effect on the fluorescence anisotropy. This review aims to summarize the possibilities of fluorescence anisotropy imaging techniques to investigate clustering of identical proteins and lipids. Homo-FRET imaging has the ability to determine distances between fluorophores. In addition it can be employed to quantify cluster sizes as well as cluster size distributions. The interpretation of homo-FRET signals is complicated by the fact that both the mutual orientations of the fluorophores and the number of fluorophores per cluster affect the fluorescence anisotropy in a similar way. The properties of the fluorescence probes are very important. Taking these properties into account is critical for the correct interpretation of homo-FRET signals in protein- and lipid-clustering studies. This is be exemplified by studies on the clustering of the lipid raft markers GPI and K-ras, as well as for EGF receptor clustering in the plasma membrane. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Self Organizing Maps to efficiently cluster and functionally interpret protein conformational ensembles

    Directory of Open Access Journals (Sweden)

    Fabio Stella

    2013-09-01

    Full Text Available An approach that combines Self-Organizing maps, hierarchical clustering and network components is presented, aimed at comparing protein conformational ensembles obtained from multiple Molecular Dynamic simulations. As a first result the original ensembles can be summarized by using only the representative conformations of the clusters obtained. In addition the network components analysis allows to discover and interpret the dynamic behavior of the conformations won by each neuron. The results showed the ability of this approach to efficiently derive a functional interpretation of the protein dynamics described by the original conformational ensemble, highlighting its potential as a support for protein engineering.

  11. Looping and clustering model for the organization of protein-DNA complexes on the bacterial genome

    Science.gov (United States)

    Walter, Jean-Charles; Walliser, Nils-Ole; David, Gabriel; Dorignac, Jérôme; Geniet, Frédéric; Palmeri, John; Parmeggiani, Andrea; Wingreen, Ned S.; Broedersz, Chase P.

    2018-03-01

    The bacterial genome is organized by a variety of associated proteins inside a structure called the nucleoid. These proteins can form complexes on DNA that play a central role in various biological processes, including chromosome segregation. A prominent example is the large ParB-DNA complex, which forms an essential component of the segregation machinery in many bacteria. ChIP-Seq experiments show that ParB proteins localize around centromere-like parS sites on the DNA to which ParB binds specifically, and spreads from there over large sections of the chromosome. Recent theoretical and experimental studies suggest that DNA-bound ParB proteins can interact with each other to condense into a coherent 3D complex on the DNA. However, the structural organization of this protein-DNA complex remains unclear, and a predictive quantitative theory for the distribution of ParB proteins on DNA is lacking. Here, we propose the looping and clustering model, which employs a statistical physics approach to describe protein-DNA complexes. The looping and clustering model accounts for the extrusion of DNA loops from a cluster of interacting DNA-bound proteins that is organized around a single high-affinity binding site. Conceptually, the structure of the protein-DNA complex is determined by a competition between attractive protein interactions and loop closure entropy of this protein-DNA cluster on the one hand, and the positional entropy for placing loops within the cluster on the other. Indeed, we show that the protein interaction strength determines the ‘tightness’ of the loopy protein-DNA complex. Thus, our model provides a theoretical framework for quantitatively computing the binding profiles of ParB-like proteins around a cognate (parS) binding site.

  12. Dynamic Change in p63 Protein Expression during Implantation of Urothelial Cancer Clusters

    Directory of Open Access Journals (Sweden)

    Takahiro Yoshida

    2015-07-01

    Full Text Available Although the dissemination of urothelial cancer cells is supposed to be a major cause of the multicentricity of urothelial tumors, the mechanism of implantation has not been well investigated. Here, we found that cancer cell clusters from the urine of patients with urothelial cancer retain the ability to survive, grow, and adhere. By using cell lines and primary cells collected from multiple patients, we demonstrate that △Np63α protein in cancer cell clusters was rapidly decreased through proteasomal degradation when clusters were attached to the matrix, leading to downregulation of E-cadherin and upregulation of N-cadherin. Decreased △Np63α protein level in urothelial cancer cell clusters was involved in the clearance of the urothelium. Our data provide the first evidence that clusters of urothelial cancer cells exhibit dynamic changes in △Np63α expression during attachment to the matrix, and decreased △Np63α protein plays a critical role in the interaction between cancer cell clusters and the urothelium. Thus, because △Np63α might be involved in the process of intraluminal dissemination of urothelial cancer cells, blocking the degradation of △Np63α could be a target of therapy to prevent the dissemination of urothelial cancer.

  13. Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space.

    Science.gov (United States)

    Loewenstein, Yaniv; Portugaly, Elon; Fromer, Menachem; Linial, Michal

    2008-07-01

    UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets. We present a novel class of memory-constrained UPGMA (MC-UPGMA) algorithms. Given any practical memory size constraint, this framework guarantees the correct clustering solution without explicitly requiring all dissimilarities in memory. The algorithms are general and are applicable to any dataset. We present a data-dependent characterization of hardness and clustering efficiency. The presented concepts are applicable to any agglomerative clustering formulation. We apply our algorithm to the entire collection of protein sequences, to automatically build a comprehensive evolutionary-driven hierarchy of proteins from sequence alone. The newly created tree captures protein families better than state-of-the-art large-scale methods such as CluSTr, ProtoNet4 or single-linkage clustering. We demonstrate that leveraging the entire mass embodied in all sequence similarities allows to significantly improve on current protein family clusterings which are unable to directly tackle the sheer mass of this data. Furthermore, we argue that non-metric constraints are an inherent complexity of the sequence space and should not be overlooked. The robustness of UPGMA allows significant improvement, especially for multidomain proteins, and for large or divergent families. A comprehensive tree built from all UniProt sequence similarities, together with navigation and classification tools will be made available as part of the ProtoNet service. A C++ implementation of the algorithm is available on request.

  14. Differential dynamic microscopy of weakly scattering and polydisperse protein-rich clusters

    Science.gov (United States)

    Safari, Mohammad S.; Vorontsova, Maria A.; Poling-Skutvik, Ryan; Vekilov, Peter G.; Conrad, Jacinta C.

    2015-10-01

    Nanoparticle dynamics impact a wide range of biological transport processes and applications in nanomedicine and natural resource engineering. Differential dynamic microscopy (DDM) was recently developed to quantify the dynamics of submicron particles in solutions from fluctuations of intensity in optical micrographs. Differential dynamic microscopy is well established for monodisperse particle populations, but has not been applied to solutions containing weakly scattering polydisperse biological nanoparticles. Here we use bright-field DDM (BDDM) to measure the dynamics of protein-rich liquid clusters, whose size ranges from tens to hundreds of nanometers and whose total volume fraction is less than 10-5. With solutions of two proteins, hemoglobin A and lysozyme, we evaluate the cluster diffusion coefficients from the dependence of the diffusive relaxation time on the scattering wave vector. We establish that for weakly scattering populations, an optimal thickness of the sample chamber exists at which the BDDM signal is maximized at the smallest sample volume. The average cluster diffusion coefficient measured using BDDM is consistently lower than that obtained from dynamic light scattering at a scattering angle of 90∘. This apparent discrepancy is due to Mie scattering from the polydisperse cluster population, in which larger clusters preferentially scatter more light in the forward direction.

  15. BiP clustering facilitates protein folding in the endoplasmic reticulum.

    Directory of Open Access Journals (Sweden)

    Marc Griesemer

    2014-07-01

    Full Text Available The chaperone BiP participates in several regulatory processes within the endoplasmic reticulum (ER: translocation, protein folding, and ER-associated degradation. To facilitate protein folding, a cooperative mechanism known as entropic pulling has been proposed to demonstrate the molecular-level understanding of how multiple BiP molecules bind to nascent and unfolded proteins. Recently, experimental evidence revealed the spatial heterogeneity of BiP within the nuclear and peripheral ER of S. cerevisiae (commonly referred to as 'clusters'. Here, we developed a model to evaluate the potential advantages of accounting for multiple BiP molecules binding to peptides, while proposing that BiP's spatial heterogeneity may enhance protein folding and maturation. Scenarios were simulated to gauge the effectiveness of binding multiple chaperone molecules to peptides. Using two metrics: folding efficiency and chaperone cost, we determined that the single binding site model achieves a higher efficiency than models characterized by multiple binding sites, in the absence of cooperativity. Due to entropic pulling, however, multiple chaperones perform in concert to facilitate the resolubilization and ultimate yield of folded proteins. As a result of cooperativity, multiple binding site models used fewer BiP molecules and maintained a higher folding efficiency than the single binding site model. These insilico investigations reveal that clusters of BiP molecules bound to unfolded proteins may enhance folding efficiency through cooperative action via entropic pulling.

  16. Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models

    KAUST Repository

    Chermak, Edrisse; De Donato, Renato; Lensink, Marc F.; Petta, Andrea; Serra, Luigi; Scarano, Vittorio; Cavallo, Luigi; Oliva, Romina

    2016-01-01

    Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind docking experiment. We introduced in the field the first pure consensus method, CONSRANK, which ranks models based on their ability to match the most conserved contacts in the ensemble they belong to. In CAPRI, scorers are asked to evaluate a set of available models and select the top ten ones, based on their own scoring approach. Scorers' performance is ranked based on the number of targets/interfaces for which they could provide at least one correct solution. In such terms, blind testing in CAPRI Round 30 (a joint prediction round with CASP11) has shown that critical cases for CONSRANK are represented by targets showing multiple interfaces or for which only a very small number of correct solutions are available. To address these challenging cases, CONSRANK has now been modified to include a contact-based clustering of the models as a preliminary step of the scoring process. We used an agglomerative hierarchical clustering based on the number of common inter-residue contacts within the models. Two criteria, with different thresholds, were explored in the cluster generation, setting either the number of common contacts or of total clusters. For each clustering approach, after selecting the top (most populated) ten clusters, CONSRANK was run on these clusters and the top-ranked model for each cluster was selected, in the limit of 10 models per target. We have applied our modified scoring approach, Clust-CONSRANK, to SCORE_SET, a set of CAPRI scoring models made recently available by CAPRI assessors, and to the subset of homodimeric targets in CAPRI Round 30 for which CONSRANK failed to include a correct solution within the ten selected models. Results show that, for the challenging cases, the clustering step typically enriches the ten top ranked

  17. Introducing a Clustering Step in a Consensus Approach for the Scoring of Protein-Protein Docking Models

    KAUST Repository

    Chermak, Edrisse

    2016-11-15

    Correctly scoring protein-protein docking models to single out native-like ones is an open challenge. It is also an object of assessment in CAPRI (Critical Assessment of PRedicted Interactions), the community-wide blind docking experiment. We introduced in the field the first pure consensus method, CONSRANK, which ranks models based on their ability to match the most conserved contacts in the ensemble they belong to. In CAPRI, scorers are asked to evaluate a set of available models and select the top ten ones, based on their own scoring approach. Scorers\\' performance is ranked based on the number of targets/interfaces for which they could provide at least one correct solution. In such terms, blind testing in CAPRI Round 30 (a joint prediction round with CASP11) has shown that critical cases for CONSRANK are represented by targets showing multiple interfaces or for which only a very small number of correct solutions are available. To address these challenging cases, CONSRANK has now been modified to include a contact-based clustering of the models as a preliminary step of the scoring process. We used an agglomerative hierarchical clustering based on the number of common inter-residue contacts within the models. Two criteria, with different thresholds, were explored in the cluster generation, setting either the number of common contacts or of total clusters. For each clustering approach, after selecting the top (most populated) ten clusters, CONSRANK was run on these clusters and the top-ranked model for each cluster was selected, in the limit of 10 models per target. We have applied our modified scoring approach, Clust-CONSRANK, to SCORE_SET, a set of CAPRI scoring models made recently available by CAPRI assessors, and to the subset of homodimeric targets in CAPRI Round 30 for which CONSRANK failed to include a correct solution within the ten selected models. Results show that, for the challenging cases, the clustering step typically enriches the ten top ranked

  18. Zinc fingers, zinc clusters, and zinc twists in DNA-binding protein domains

    International Nuclear Information System (INIS)

    Vallee, B.L.; Auld, D.S.; Coleman, J.E.

    1991-01-01

    The authors recognize three distinct motifs of DNA-binding zinc proteins: (i) zinc fingers, (ii) zinc clusters, and (iii) zinc twists. Until very recently, x-ray crystallographic or NMR three-dimensional structure analyses of DNA-binding zinc proteins have not been available to serve as standards of reference for the zinc binding sites of these families of proteins. Those of the DNA-binding domains of the fungal transcription factor GAL4 and the rat glucocorticoid receptor are the first to have been determined. Both proteins contain two zinc binding sites, and in both, cysteine residues are the sole zinc ligands. In GAL4, two zinc atoms are bound to six cysteine residues which form a zinc cluster akin to that of metallothionein; the distance between the two zinc atoms of GAL4 is ∼3.5 angstrom. In the glucocorticoid receptor, each zinc atom is bound to four cysteine residues; the interatomic zinc-zinc distance is ∼13 angstrom, and in this instance, a zinc twist is represented by a helical DNA recognition site located between the two zinc atoms. Zinc clusters and zinc twists are here recognized as two distinctive motifs in DNA-binding proteins containing multiple zinc atoms. For native zinc fingers, structural data do not exist as yet; consequently, the interatomic distances between zinc atoms are not known. As further structural data become available, the structural and functional significance of these different motifs in their binding to DNA and other proteins participating in the transmission of the genetic message will become apparent

  19. Using the clustered circular layout as an informative method for visualizing protein-protein interaction networks.

    Science.gov (United States)

    Fung, David C Y; Wilkins, Marc R; Hart, David; Hong, Seok-Hee

    2010-07-01

    The force-directed layout is commonly used in computer-generated visualizations of protein-protein interaction networks. While it is good for providing a visual outline of the protein complexes and their interactions, it has two limitations when used as a visual analysis method. The first is poor reproducibility. Repeated running of the algorithm does not necessarily generate the same layout, therefore, demanding cognitive readaptation on the investigator's part. The second limitation is that it does not explicitly display complementary biological information, e.g. Gene Ontology, other than the protein names or gene symbols. Here, we present an alternative layout called the clustered circular layout. Using the human DNA replication protein-protein interaction network as a case study, we compared the two network layouts for their merits and limitations in supporting visual analysis.

  20. Phenotype Clustering of Breast Epithelial Cells in Confocal Imagesbased on Nuclear Protein Distribution Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Long, Fuhui; Peng, Hanchuan; Sudar, Damir; Levievre, Sophie A.; Knowles, David W.

    2006-09-05

    Background: The distribution of the chromatin-associatedproteins plays a key role in directing nuclear function. Previously, wedeveloped an image-based method to quantify the nuclear distributions ofproteins and showed that these distributions depended on the phenotype ofhuman mammary epithelial cells. Here we describe a method that creates ahierarchical tree of the given cell phenotypes and calculates thestatistical significance between them, based on the clustering analysisof nuclear protein distributions. Results: Nuclear distributions ofnuclear mitotic apparatus protein were previously obtained fornon-neoplastic S1 and malignant T4-2 human mammary epithelial cellscultured for up to 12 days. Cell phenotype was defined as S1 or T4-2 andthe number of days in cultured. A probabilistic ensemble approach wasused to define a set of consensus clusters from the results of multipletraditional cluster analysis techniques applied to the nucleardistribution data. Cluster histograms were constructed to show how cellsin any one phenotype were distributed across the consensus clusters.Grouping various phenotypes allowed us to build phenotype trees andcalculate the statistical difference between each group. The resultsshowed that non-neoplastic S1 cells could be distinguished from malignantT4-2 cells with 94.19 percent accuracy; that proliferating S1 cells couldbe distinguished from differentiated S1 cells with 92.86 percentaccuracy; and showed no significant difference between the variousphenotypes of T4-2 cells corresponding to increasing tumor sizes.Conclusion: This work presents a cluster analysis method that canidentify significant cell phenotypes, based on the nuclear distributionof specific proteins, with high accuracy.

  1. Protein-protein association and cellular localization of four essential gene products encoded by tellurite resistance-conferring cluster "ter" from pathogenic Escherichia coli.

    Science.gov (United States)

    Valkovicova, Lenka; Vavrova, Silvia Minarikova; Mravec, Jozef; Grones, Jozef; Turna, Jan

    2013-12-01

    Gene cluster "ter" conferring high tellurite resistance has been identified in various pathogenic bacteria including Escherichia coli O157:H7. However, the precise mechanism as well as the molecular function of the respective gene products is unclear. Here we describe protein-protein association and localization analyses of four essential Ter proteins encoded by minimal resistance-conferring fragment (terBCDE) by means of recombinant expression. By using a two-plasmid complementation system we show that the overproduced single Ter proteins are not able to mediate tellurite resistance, but all Ter members play an irreplaceable role within the cluster. We identified several types of homotypic and heterotypic protein-protein associations among the Ter proteins by in vitro and in vivo pull-down assays and determined their cellular localization by cytosol/membrane fractionation. Our results strongly suggest that Ter proteins function involves their mutual association, which probably happens at the interface of the inner plasma membrane and the cytosol.

  2. Automatic extraction of gene ontology annotation and its correlation with clusters in protein networks

    Directory of Open Access Journals (Sweden)

    Mazo Ilya

    2007-07-01

    Full Text Available Abstract Background Uncovering cellular roles of a protein is a task of tremendous importance and complexity that requires dedicated experimental work as well as often sophisticated data mining and processing tools. Protein functions, often referred to as its annotations, are believed to manifest themselves through topology of the networks of inter-proteins interactions. In particular, there is a growing body of evidence that proteins performing the same function are more likely to interact with each other than with proteins with other functions. However, since functional annotation and protein network topology are often studied separately, the direct relationship between them has not been comprehensively demonstrated. In addition to having the general biological significance, such demonstration would further validate the data extraction and processing methods used to compose protein annotation and protein-protein interactions datasets. Results We developed a method for automatic extraction of protein functional annotation from scientific text based on the Natural Language Processing (NLP technology. For the protein annotation extracted from the entire PubMed, we evaluated the precision and recall rates, and compared the performance of the automatic extraction technology to that of manual curation used in public Gene Ontology (GO annotation. In the second part of our presentation, we reported a large-scale investigation into the correspondence between communities in the literature-based protein networks and GO annotation groups of functionally related proteins. We found a comprehensive two-way match: proteins within biological annotation groups form significantly denser linked network clusters than expected by chance and, conversely, densely linked network communities exhibit a pronounced non-random overlap with GO groups. We also expanded the publicly available GO biological process annotation using the relations extracted by our NLP technology

  3. Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource.

    Science.gov (United States)

    Sharpton, Thomas J; Jospin, Guillaume; Wu, Dongying; Langille, Morgan G I; Pollard, Katherine S; Eisen, Jonathan A

    2012-10-13

    New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences. We implemented fully automated and high-throughput procedures to de novo cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as "Sifting Families," or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology-based analyses. We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (http://edhar.genomecenter.ucdavis.edu/sifting_families/).

  4. Comprehensive cluster analysis with Transitivity Clustering.

    Science.gov (United States)

    Wittkop, Tobias; Emig, Dorothea; Truss, Anke; Albrecht, Mario; Böcker, Sebastian; Baumbach, Jan

    2011-03-01

    Transitivity Clustering is a method for the partitioning of biological data into groups of similar objects, such as genes, for instance. It provides integrated access to various functions addressing each step of a typical cluster analysis. To facilitate this, Transitivity Clustering is accessible online and offers three user-friendly interfaces: a powerful stand-alone version, a web interface, and a collection of Cytoscape plug-ins. In this paper, we describe three major workflows: (i) protein (super)family detection with Cytoscape, (ii) protein homology detection with incomplete gold standards and (iii) clustering of gene expression data. This protocol guides the user through the most important features of Transitivity Clustering and takes ∼1 h to complete.

  5. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94% appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1 a dimer interface motif found in voltage-gated chloride channels, (2 a proton transfer motif found in heme-copper oxidases, and (3 a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  6. On the Power and Limits of Sequence Similarity Based Clustering of Proteins Into Families

    DEFF Research Database (Denmark)

    Wiwie, Christian; Röttger, Richard

    2017-01-01

    Over the last decades, we have observed an ongoing tremendous growth of available sequencing data fueled by the advancements in wet-lab technology. The sequencing information is only the beginning of the actual understanding of how organisms survive and prosper. It is, for instance, equally...... important to also unravel the proteomic repertoire of an organism. A classical computational approach for detecting protein families is a sequence-based similarity calculation coupled with a subsequent cluster analysis. In this work we have intensively analyzed various clustering tools on a large scale. We...... used the data to investigate the behavior of the tools' parameters underlining the diversity of the protein families. Furthermore, we trained regression models for predicting the expected performance of a clustering tool for an unknown data set and aimed to also suggest optimal parameters...

  7. Molecular comparison of the structural proteins encoding gene clusters of two related Lactobacillus delbrueckii bacteriophages.

    Science.gov (United States)

    Vasala, A; Dupont, L; Baumann, M; Ritzenthaler, P; Alatossava, T

    1993-01-01

    Virulent phage LL-H and temperate phage mv4 are two related bacteriophages of Lactobacillus delbrueckii. The gene clusters encoding structural proteins of these two phages have been sequenced and further analyzed. Six open reading frames (ORF-1 to ORF-6) were detected. Protein sequencing and Western immunoblotting experiments confirmed that ORF-3 (g34) encoded the main capsid protein Gp34. The presence of a putative late promoter in front of the phage LL-H g34 gene was suggested by primer extension experiments. Comparative sequence analysis between phage LL-H and phage mv4 revealed striking similarities in the structure and organization of this gene cluster, suggesting that the genes encoding phage structural proteins belong to a highly conservative module. Images PMID:8497043

  8. Sifting through genomes with iterative-sequence clustering produces a large, phylogenetically diverse protein-family resource

    Directory of Open Access Journals (Sweden)

    Sharpton Thomas J

    2012-10-01

    Full Text Available Abstract Background New computational resources are needed to manage the increasing volume of biological data from genome sequencing projects. One fundamental challenge is the ability to maintain a complete and current catalog of protein diversity. We developed a new approach for the identification of protein families that focuses on the rapid discovery of homologous protein sequences. Results We implemented fully automated and high-throughput procedures to de novo cluster proteins into families based upon global alignment similarity. Our approach employs an iterative clustering strategy in which homologs of known families are sifted out of the search for new families. The resulting reduction in computational complexity enables us to rapidly identify novel protein families found in new genomes and to perform efficient, automated updates that keep pace with genome sequencing. We refer to protein families identified through this approach as “Sifting Families,” or SFams. Our analysis of ~10.5 million protein sequences from 2,928 genomes identified 436,360 SFams, many of which are not represented in other protein family databases. We validated the quality of SFam clustering through statistical as well as network topology–based analyses. Conclusions We describe the rapid identification of SFams and demonstrate how they can be used to annotate genomes and metagenomes. The SFam database catalogs protein-family quality metrics, multiple sequence alignments, hidden Markov models, and phylogenetic trees. Our source code and database are publicly available and will be subject to frequent updates (http://edhar.genomecenter.ucdavis.edu/sifting_families/.

  9. The transcriptional repressor protein NsrR senses nitric oxide directly via a [2Fe-2S] cluster.

    Directory of Open Access Journals (Sweden)

    Nicholas P Tucker

    Full Text Available The regulatory protein NsrR, a member of the Rrf2 family of transcription repressors, is specifically dedicated to sensing nitric oxide (NO in a variety of pathogenic and non-pathogenic bacteria. It has been proposed that NO directly modulates NsrR activity by interacting with a predicted [Fe-S] cluster in the NsrR protein, but no experimental evidence has been published to support this hypothesis. Here we report the purification of NsrR from the obligate aerobe Streptomyces coelicolor. We demonstrate using UV-visible, near UV CD and EPR spectroscopy that the protein contains an NO-sensitive [2Fe-2S] cluster when purified from E. coli. Upon exposure of NsrR to NO, the cluster is nitrosylated, which results in the loss of DNA binding activity as detected by bandshift assays. Removal of the [2Fe-2S] cluster to generate apo-NsrR also resulted in loss of DNA binding activity. This is the first demonstration that NsrR contains an NO-sensitive [2Fe-2S] cluster that is required for DNA binding activity.

  10. CytoCluster: A Cytoscape Plugin for Cluster Analysis and Visualization of Biological Networks.

    Science.gov (United States)

    Li, Min; Li, Dongyan; Tang, Yu; Wu, Fangxiang; Wang, Jianxin

    2017-08-31

    Nowadays, cluster analysis of biological networks has become one of the most important approaches to identifying functional modules as well as predicting protein complexes and network biomarkers. Furthermore, the visualization of clustering results is crucial to display the structure of biological networks. Here we present CytoCluster, a cytoscape plugin integrating six clustering algorithms, HC-PIN (Hierarchical Clustering algorithm in Protein Interaction Networks), OH-PIN (identifying Overlapping and Hierarchical modules in Protein Interaction Networks), IPCA (Identifying Protein Complex Algorithm), ClusterONE (Clustering with Overlapping Neighborhood Expansion), DCU (Detecting Complexes based on Uncertain graph model), IPC-MCE (Identifying Protein Complexes based on Maximal Complex Extension), and BinGO (the Biological networks Gene Ontology) function. Users can select different clustering algorithms according to their requirements. The main function of these six clustering algorithms is to detect protein complexes or functional modules. In addition, BinGO is used to determine which Gene Ontology (GO) categories are statistically overrepresented in a set of genes or a subgraph of a biological network. CytoCluster can be easily expanded, so that more clustering algorithms and functions can be added to this plugin. Since it was created in July 2013, CytoCluster has been downloaded more than 9700 times in the Cytoscape App store and has already been applied to the analysis of different biological networks. CytoCluster is available from http://apps.cytoscape.org/apps/cytocluster.

  11. Cluster protein structures using recurrence quantification analysis on coordinates of alpha-carbon atoms of proteins

    International Nuclear Information System (INIS)

    Zhou Yu; Yu Zuguo; Anh, Vo

    2007-01-01

    The 3-dimensional coordinates of alpha-carbon atoms of proteins are used to distinguish the protein structural classes based on recurrence quantification analysis (RQA). We consider two independent variables from RQA of coordinates of alpha-carbon atoms, %determ1 and %determ2, which were defined by Webber et al. [C.L. Webber Jr., A. Giuliani, J.P. Zbilut, A. Colosimo, Proteins Struct. Funct. Genet. 44 (2001) 292]. The variable %determ2 is used to define two new variables, %determ2 1 and %determ2 2 . Then three variables %determ1, %determ2 1 and %determ2 2 are used to construct a 3-dimensional variable space. Each protein is represented by a point in this variable space. The points corresponding to proteins from the α, β, α+β and α/β structural classes position into different areas in this variable space. In order to give a quantitative assessment of our clustering on the selected proteins, Fisher's discriminant algorithm is used. Numerical results indicate that the discriminant accuracies are very high and satisfactory

  12. Analysis of ligand-protein exchange by Clustering of Ligand Diffusion Coefficient Pairs (CoLD-CoP)

    Science.gov (United States)

    Snyder, David A.; Chantova, Mihaela; Chaudhry, Saadia

    2015-06-01

    NMR spectroscopy is a powerful tool in describing protein structures and protein activity for pharmaceutical and biochemical development. This study describes a method to determine weak binding ligands in biological systems by using hierarchic diffusion coefficient clustering of multidimensional data obtained with a 400 MHz Bruker NMR. Comparison of DOSY spectrums of ligands of the chemical library in the presence and absence of target proteins show translational diffusion rates for small molecules upon interaction with macromolecules. For weak binders such as compounds found in fragment libraries, changes in diffusion rates upon macromolecular binding are on the order of the precision of DOSY diffusion measurements, and identifying such subtle shifts in diffusion requires careful statistical analysis. The "CoLD-CoP" (Clustering of Ligand Diffusion Coefficient Pairs) method presented here uses SAHN clustering to identify protein-binders in a chemical library or even a not fully characterized metabolite mixture. We will show how DOSY NMR and the "CoLD-CoP" method complement each other in identifying the most suitable candidates for lysozyme and wheat germ acid phosphatase.

  13. Cluster based on sequence comparison of homologous proteins of 95 organism species - Gclust Server | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Gclust Server Cluster based on sequence comparison of homologous proteins of 95 organism spe...cies Data detail Data name Cluster based on sequence comparison of homologous proteins of 95 organism specie...istory of This Database Site Policy | Contact Us Cluster based on sequence compariso

  14. Clustering on Membranes

    DEFF Research Database (Denmark)

    Johannes, Ludger; Pezeshkian, Weria; Ipsen, John H

    2018-01-01

    Clustering of extracellular ligands and proteins on the plasma membrane is required to perform specific cellular functions, such as signaling and endocytosis. Attractive forces that originate in perturbations of the membrane's physical properties contribute to this clustering, in addition to direct...... protein-protein interactions. However, these membrane-mediated forces have not all been equally considered, despite their importance. In this review, we describe how line tension, lipid depletion, and membrane curvature contribute to membrane-mediated clustering. Additional attractive forces that arise...... from protein-induced perturbation of a membrane's fluctuations are also described. This review aims to provide a survey of the current understanding of membrane-mediated clustering and how this supports precise biological functions....

  15. Application of clustering methods: Regularized Markov clustering (R-MCL) for analyzing dengue virus similarity

    Science.gov (United States)

    Lestari, D.; Raharjo, D.; Bustamam, A.; Abdillah, B.; Widhianto, W.

    2017-07-01

    Dengue virus consists of 10 different constituent proteins and are classified into 4 major serotypes (DEN 1 - DEN 4). This study was designed to perform clustering against 30 protein sequences of dengue virus taken from Virus Pathogen Database and Analysis Resource (VIPR) using Regularized Markov Clustering (R-MCL) algorithm and then we analyze the result. By using Python program 3.4, R-MCL algorithm produces 8 clusters with more than one centroid in several clusters. The number of centroid shows the density level of interaction. Protein interactions that are connected in a tissue, form a complex protein that serves as a specific biological process unit. The analysis of result shows the R-MCL clustering produces clusters of dengue virus family based on the similarity role of their constituent protein, regardless of serotypes.

  16. clusterMaker: a multi-algorithm clustering plugin for Cytoscape

    Directory of Open Access Journals (Sweden)

    Morris John H

    2011-11-01

    Full Text Available Abstract Background In the post-genomic era, the rapid increase in high-throughput data calls for computational tools capable of integrating data of diverse types and facilitating recognition of biologically meaningful patterns within them. For example, protein-protein interaction data sets have been clustered to identify stable complexes, but scientists lack easily accessible tools to facilitate combined analyses of multiple data sets from different types of experiments. Here we present clusterMaker, a Cytoscape plugin that implements several clustering algorithms and provides network, dendrogram, and heat map views of the results. The Cytoscape network is linked to all of the other views, so that a selection in one is immediately reflected in the others. clusterMaker is the first Cytoscape plugin to implement such a wide variety of clustering algorithms and visualizations, including the only implementations of hierarchical clustering, dendrogram plus heat map visualization (tree view, k-means, k-medoid, SCPS, AutoSOME, and native (Java MCL. Results Results are presented in the form of three scenarios of use: analysis of protein expression data using a recently published mouse interactome and a mouse microarray data set of nearly one hundred diverse cell/tissue types; the identification of protein complexes in the yeast Saccharomyces cerevisiae; and the cluster analysis of the vicinal oxygen chelate (VOC enzyme superfamily. For scenario one, we explore functionally enriched mouse interactomes specific to particular cellular phenotypes and apply fuzzy clustering. For scenario two, we explore the prefoldin complex in detail using both physical and genetic interaction clusters. For scenario three, we explore the possible annotation of a protein as a methylmalonyl-CoA epimerase within the VOC superfamily. Cytoscape session files for all three scenarios are provided in the Additional Files section. Conclusions The Cytoscape plugin cluster

  17. Role of protein-glutathione contacts in defining glutaredoxin-3 [2Fe-2S] cluster chirality, ligand exchange and transfer chemistry.

    Science.gov (United States)

    Sen, Sambuddha; Cowan, J A

    2017-10-01

    Monothiol glutaredoxins (Grx) serve as intermediate cluster carriers in iron-sulfur cluster trafficking. The [2Fe-2S]-bound holo forms of Grx proteins display cysteinyl coordination from exogenous glutathione (GSH), in addition to contact from protein-derived Cys. Herein, we report mechanistic studies that investigate the role of exogenous glutathione in defining cluster chirality, ligand exchange, and the cluster transfer chemistry of Saccharomyces cerevisiae Grx3. Systematic perturbations were introduced to the glutathione-binding site by substitution of conserved charged amino acids that form crucial electrostatic contacts with the glutathione molecule. Native Grx3 could also be reconstituted in the absence of glutathione, with either DTT, BME or free L-cysteine as the source of the exogenous Fe-S ligand contact, while retaining full functional reactivity. The delivery of the [2Fe-2S] cluster to Grx3 from cluster donor proteins such as Isa, Nfu, and a [2Fe-2S](GS) 4 complex, revealed that electrostatic contacts are of key importance for positioning the exogenous glutathione that in turn influences the chiral environment of the cluster. All Grx3 derivatives were reconstituted by standard chemical reconstitution protocols and found to transfer cluster to apo ferredoxin 1 (Fdx1) at rates comparable to native protein, even when using DTT, BME or free L-cysteine as a thiol source in place of GSH during reconstitution. Kinetic analysis of cluster transfer from holo derivatives to apo Fdx1 has led to a mechanistic model for cluster transfer chemistry of native holo Grx3, and identification of the likely rate-limiting step for the reaction.

  18. Analysis of NFU-1 metallocofactor binding-site substitutions-impacts on iron-sulfur cluster coordination and protein structure and function.

    Science.gov (United States)

    Wesley, Nathaniel A; Wachnowsky, Christine; Fidai, Insiya; Cowan, J A

    2017-11-01

    Iron-sulfur (Fe/S) clusters are ancient prosthetic groups found in numerous metalloproteins and are conserved across all kingdoms of life due to their diverse, yet essential functional roles. Genetic mutations to a specific subset of mitochondrial Fe/S cluster delivery proteins are broadly categorized as disease-related under multiple mitochondrial dysfunction syndrome (MMDS), with symptoms indicative of a general failure of the metabolic system. Multiple mitochondrial dysfunction syndrome 1 (MMDS1) arises as a result of the missense mutation in NFU1, an Fe/S cluster scaffold protein, which substitutes a glycine near the Fe/S cluster-binding pocket to a cysteine (p.Gly208Cys). This substitution has been shown to promote protein dimerization such that cluster delivery to NFU1 is blocked, preventing downstream cluster trafficking. However, the possibility of this additional cysteine, located adjacent to the cluster-binding site, serving as an Fe/S cluster ligand has not yet been explored. To fully understand the consequences of this Gly208Cys replacement, complementary substitutions at the Fe/S cluster-binding pocket for native and Gly208Cys NFU1 were made, along with six other variants. Herein, we report the results of an investigation on the effect of these substitutions on both cluster coordination and NFU1 structure and function. The data suggest that the G208C substitution does not contribute to cluster binding. Rather, replacement of the glycine at position 208 changes the oligomerization state as a result of global structural alterations that result in the downstream effects manifest as MMDS1, but does not perturb the coordination chemistry of the Fe-S cluster. © 2017 Federation of European Biochemical Societies.

  19. The Pacific Ocean virome (POV: a marine viral metagenomic dataset and associated protein clusters for quantitative viral ecology.

    Directory of Open Access Journals (Sweden)

    Bonnie L Hurwitz

    Full Text Available Bacteria and their viruses (phage are fundamental drivers of many ecosystem processes including global biogeochemistry and horizontal gene transfer. While databases and resources for studying function in uncultured bacterial communities are relatively advanced, many fewer exist for their viral counterparts. The issue is largely technical in that the majority (often 90% of viral sequences are functionally 'unknown' making viruses a virtually untapped resource of functional and physiological information. Here, we provide a community resource that organizes this unknown sequence space into 27 K high confidence protein clusters using 32 viral metagenomes from four biogeographic regions in the Pacific Ocean that vary by season, depth, and proximity to land, and include some of the first deep pelagic ocean viral metagenomes. These protein clusters more than double currently available viral protein clusters, including those from environmental datasets. Further, a protein cluster guided analysis of functional diversity revealed that richness decreased (i from deep to surface waters, (ii from winter to summer, (iii and with distance from shore in surface waters only. These data provide a framework from which to draw on for future metadata-enabled functional inquiries of the vast viral unknown.

  20. The Pacific Ocean virome (POV): a marine viral metagenomic dataset and associated protein clusters for quantitative viral ecology.

    Science.gov (United States)

    Hurwitz, Bonnie L; Sullivan, Matthew B

    2013-01-01

    Bacteria and their viruses (phage) are fundamental drivers of many ecosystem processes including global biogeochemistry and horizontal gene transfer. While databases and resources for studying function in uncultured bacterial communities are relatively advanced, many fewer exist for their viral counterparts. The issue is largely technical in that the majority (often 90%) of viral sequences are functionally 'unknown' making viruses a virtually untapped resource of functional and physiological information. Here, we provide a community resource that organizes this unknown sequence space into 27 K high confidence protein clusters using 32 viral metagenomes from four biogeographic regions in the Pacific Ocean that vary by season, depth, and proximity to land, and include some of the first deep pelagic ocean viral metagenomes. These protein clusters more than double currently available viral protein clusters, including those from environmental datasets. Further, a protein cluster guided analysis of functional diversity revealed that richness decreased (i) from deep to surface waters, (ii) from winter to summer, (iii) and with distance from shore in surface waters only. These data provide a framework from which to draw on for future metadata-enabled functional inquiries of the vast viral unknown.

  1. Ligand cluster-based protein network and ePlatton, a multi-target ligand finder.

    Science.gov (United States)

    Du, Yu; Shi, Tieliu

    2016-01-01

    Small molecules are information carriers that make cells aware of external changes and couple internal metabolic and signalling pathway systems with each other. In some specific physiological status, natural or artificial molecules are used to interact with selective biological targets to activate or inhibit their functions to achieve expected biological and physiological output. Millions of years of evolution have optimized biological processes and pathways and now the endocrine and immune system cannot work properly without some key small molecules. In the past thousands of years, the human race has managed to find many medicines against diseases by trail-and-error experience. In the recent decades, with the deepening understanding of life and the progress of molecular biology, researchers spare no effort to design molecules targeting one or two key enzymes and receptors related to corresponding diseases. But recent studies in pharmacogenomics have shown that polypharmacology may be necessary for the effects of drugs, which challenge the paradigm, 'one drug, one target, one disease'. Nowadays, cheminformatics and structural biology can help us reasonably take advantage of the polypharmacology to design next-generation promiscuous drugs and drug combination therapies. 234,591 protein-ligand interactions were extracted from ChEMBL. By the 2D structure similarity, 13,769 ligand emerged from 156,151 distinct ligands which were recognized by 1477 proteins. Ligand cluster- and sequence-based protein networks (LCBN, SBN) were constructed, compared and analysed. For assisting compound designing, exploring polypharmacology and finding possible drug combination, we integrated the pathway, disease, drug adverse reaction and the relationship of targets and ligand clusters into the web platform, ePlatton, which is available at http://www.megabionet.org/eplatton. Although there were some disagreements between the LCBN and SBN, communities in both networks were largely the same

  2. Diblock-copolymer-mediated self-assembly of protein-stabilized iron oxide nanoparticle clusters for magnetic resonance imaging.

    Science.gov (United States)

    Tähkä, Sari; Laiho, Ari; Kostiainen, Mauri A

    2014-03-03

    Superparamagnetic iron oxide nanoparticles (SPIONs) can be used as efficient transverse relaxivity (T2 ) contrast agents in magnetic resonance imaging (MRI). Organizing small (Doxide) diblock copolymer (P2QVP-b-PEO) to mediate the self-assembly of protein-cage-encapsulated iron oxide (γ-Fe2 O3 ) nanoparticles (magnetoferritin) into stable PEO-coated clusters. This approach relies on electrostatic interactions between the cationic N-methyl-2-vinylpyridinium iodide block and magnetoferritin protein cage surface (pI≈4.5) to form a dense core, whereas the neutral ethylene oxide block provides a stabilizing biocompatible shell. Formation of the complexes was studied in aqueous solvent medium with dynamic light scattering (DLS) and cryogenic transmission electron microcopy (cryo-TEM). DLS results indicated that the hydrodynamic diameter (Dh ) of the clusters is approximately 200 nm, and cryo-TEM showed that the clusters have an anisotropic stringlike morphology. MRI studies showed that in the clusters the longitudinal relaxivity (r1 ) is decreased and the transverse relaxivity (r2 ) is increased relative to free magnetoferritin (MF), thus indicating that clusters can provide considerable contrast enhancement. © 2014 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  3. GPI-anchored proteins are confined in subdiffraction clusters at the apical surface of polarized epithelial cells.

    Science.gov (United States)

    Paladino, Simona; Lebreton, Stéphanie; Lelek, Mickaël; Riccio, Patrizia; De Nicola, Sergio; Zimmer, Christophe; Zurzolo, Chiara

    2017-12-01

    Spatio-temporal compartmentalization of membrane proteins is critical for the regulation of diverse vital functions in eukaryotic cells. It was previously shown that, at the apical surface of polarized MDCK cells, glycosylphosphatidylinositol (GPI)-anchored proteins (GPI-APs) are organized in small cholesterol-independent clusters of single GPI-AP species (homoclusters), which are required for the formation of larger cholesterol-dependent clusters formed by multiple GPI-AP species (heteroclusters). This clustered organization is crucial for the biological activities of GPI-APs; hence, understanding the spatio-temporal properties of their membrane organization is of fundamental importance. Here, by using direct stochastic optical reconstruction microscopy coupled to pair correlation analysis (pc-STORM), we were able to visualize and measure the size of these clusters. Specifically, we show that they are non-randomly distributed and have an average size of 67 nm. We also demonstrated that polarized MDCK and non-polarized CHO cells have similar cluster distribution and size, but different sensitivity to cholesterol depletion. Finally, we derived a model that allowed a quantitative characterization of the cluster organization of GPI-APs at the apical surface of polarized MDCK cells for the first time. Experimental FRET (fluorescence resonance energy transfer)/FLIM (fluorescence-lifetime imaging microscopy) data were correlated to the theoretical predictions of the model. © 2017 The Author(s).

  4. Density parameter estimation for finding clusters of homologous proteins-tracing actinobacterial pathogenicity lifestyles

    DEFF Research Database (Denmark)

    Röttger, Richard; Kalaghatgi, Prabhav; Sun, Peng

    2013-01-01

    Homology detection is a long-standing challenge in computational biology. To tackle this problem, typically all-versus-all BLAST results are coupled with data partitioning approaches resulting in clusters of putative homologous proteins. One of the main problems, however, has been widely neglecte...

  5. Lactobacillus plantarum gene clusters encoding putative cell-surface protein complexes for carbohydrate utilization are conserved in specific gram-positive bacteria

    Directory of Open Access Journals (Sweden)

    Muscariello Lidia

    2006-05-01

    Full Text Available Abstract Background Genomes of gram-positive bacteria encode many putative cell-surface proteins, of which the majority has no known function. From the rapidly increasing number of available genome sequences it has become apparent that many cell-surface proteins are conserved, and frequently encoded in gene clusters or operons, suggesting common functions, and interactions of multiple components. Results A novel gene cluster encoding exclusively cell-surface proteins was identified, which is conserved in a subgroup of gram-positive bacteria. Each gene cluster generally has one copy of four new gene families called cscA, cscB, cscC and cscD. Clusters encoding these cell-surface proteins were found only in complete genomes of Lactobacillus plantarum, Lactobacillus sakei, Enterococcus faecalis, Listeria innocua, Listeria monocytogenes, Lactococcus lactis ssp lactis and Bacillus cereus and in incomplete genomes of L. lactis ssp cremoris, Lactobacillus casei, Enterococcus faecium, Pediococcus pentosaceus, Lactobacillius brevis, Oenococcus oeni, Leuconostoc mesenteroides, and Bacillus thuringiensis. These genes are neither present in the genomes of streptococci, staphylococci and clostridia, nor in the Lactobacillus acidophilus group, suggesting a niche-specific distribution, possibly relating to association with plants. All encoded proteins have a signal peptide for secretion by the Sec-dependent pathway, while some have cell-surface anchors, novel WxL domains, and putative domains for sugar binding and degradation. Transcriptome analysis in L. plantarum shows that the cscA-D genes are co-expressed, supporting their operon organization. Many gene clusters are significantly up-regulated in a glucose-grown, ccpA-mutant derivative of L. plantarum, suggesting catabolite control. This is supported by the presence of predicted CRE-sites upstream or inside the up-regulated cscA-D gene clusters. Conclusion We propose that the CscA, CscB, CscC and Csc

  6. Distinct functional domains within the acidic cluster of tegument protein pp28 required for trafficking and cytoplasmic envelopment of human cytomegalovirus.

    Science.gov (United States)

    Seo, Jun-Young; Jeon, Hyejin; Hong, Sookyung; Britt, William J

    2016-10-01

    Human cytomegalovirus UL99-encoded tegument protein pp28 contains a 16 aa acidic cluster that is required for pp28 trafficking to the assembly compartment (AC) and the virus assembly. However, functional signals within the acidic cluster of pp28 remain undefined. Here, we demonstrated that an acidic cluster rather than specific sorting signals was required for trafficking to the AC. Recombinant viruses with chimeric pp28 proteins expressing non-native acidic clusters exhibited delayed viral growth kinetics and decreased production of infectious virus, indicating that the native acidic cluster of pp28 was essential for wild-type virus assembly. These results suggested that the acidic cluster of pp28 has distinct functional domains required for trafficking and for efficient virus assembly. The first half (aa 44-50) of the acidic cluster was sufficient for pp28 trafficking, whereas the native acidic cluster consisting of aa 51-59 was required for the assembly of wild-type levels of infectious virus.

  7. Searching remote homology with spectral clustering with symmetry in neighborhood cluster kernels.

    Directory of Open Access Journals (Sweden)

    Ujjwal Maulik

    Full Text Available Remote homology detection among proteins utilizing only the unlabelled sequences is a central problem in comparative genomics. The existing cluster kernel methods based on neighborhoods and profiles and the Markov clustering algorithms are currently the most popular methods for protein family recognition. The deviation from random walks with inflation or dependency on hard threshold in similarity measure in those methods requires an enhancement for homology detection among multi-domain proteins. We propose to combine spectral clustering with neighborhood kernels in Markov similarity for enhancing sensitivity in detecting homology independent of "recent" paralogs. The spectral clustering approach with new combined local alignment kernels more effectively exploits the unsupervised protein sequences globally reducing inter-cluster walks. When combined with the corrections based on modified symmetry based proximity norm deemphasizing outliers, the technique proposed in this article outperforms other state-of-the-art cluster kernels among all twelve implemented kernels. The comparison with the state-of-the-art string and mismatch kernels also show the superior performance scores provided by the proposed kernels. Similar performance improvement also is found over an existing large dataset. Therefore the proposed spectral clustering framework over combined local alignment kernels with modified symmetry based correction achieves superior performance for unsupervised remote homolog detection even in multi-domain and promiscuous domain proteins from Genolevures database families with better biological relevance. Source code available upon request.sarkar@labri.fr.

  8. Distinct cell clusters touching islet cells induce islet cell replication in association with over-expression of Regenerating Gene (REG protein in fulminant type 1 diabetes.

    Directory of Open Access Journals (Sweden)

    Kaoru Aida

    Full Text Available BACKGROUND: Pancreatic islet endocrine cell-supporting architectures, including islet encapsulating basement membranes (BMs, extracellular matrix (ECM, and possible cell clusters, are unclear. PROCEDURES: The architectures around islet cell clusters, including BMs, ECM, and pancreatic acinar-like cell clusters, were studied in the non-diabetic state and in the inflamed milieu of fulminant type 1 diabetes in humans. RESULT: Immunohistochemical and electron microscopy analyses demonstrated that human islet cell clusters and acinar-like cell clusters adhere directly to each other with desmosomal structures and coated-pit-like structures between the two cell clusters. The two cell-clusters are encapsulated by a continuous capsule composed of common BMs/ECM. The acinar-like cell clusters have vesicles containing regenerating (REG Iα protein. The vesicles containing REG Iα protein are directly secreted to islet cells. In the inflamed milieu of fulminant type 1 diabetes, the acinar-like cell clusters over-expressed REG Iα protein. Islet endocrine cells, including beta-cells and non-beta cells, which were packed with the acinar-like cell clusters, show self-replication with a markedly increased number of Ki67-positive cells. CONCLUSION: The acinar-like cell clusters touching islet endocrine cells are distinct, because the cell clusters are packed with pancreatic islet clusters and surrounded by common BMs/ECM. Furthermore, the acinar-like cell clusters express REG Iα protein and secrete directly to neighboring islet endocrine cells in the non-diabetic state, and the cell clusters over-express REG Iα in the inflamed milieu of fulminant type 1 diabetes with marked self-replication of islet cells.

  9. The unique fold and lability of the [2Fe-2S] clusters of NEET proteins mediate their key functions in health and disease.

    Science.gov (United States)

    Karmi, Ola; Marjault, Henri-Baptiste; Pesce, Luca; Carloni, Paolo; Onuchic, Jose' N; Jennings, Patricia A; Mittler, Ron; Nechushtai, Rachel

    2018-02-12

    NEET proteins comprise a new class of [2Fe-2S] cluster proteins. In human, three genes encode for NEET proteins: cisd1 encodes mitoNEET (mNT), cisd2 encodes the Nutrient-deprivation autophagy factor-1 (NAF-1) and cisd3 encodes MiNT (Miner2). These recently discovered proteins play key roles in many processes related to normal metabolism and disease. Indeed, NEET proteins are involved in iron, Fe-S, and reactive oxygen homeostasis in cells and play an important role in regulating apoptosis and autophagy. mNT and NAF-1 are homodimeric and reside on the outer mitochondrial membrane. NAF-1 also resides in the membranes of the ER associated mitochondrial membranes (MAM) and the ER. MiNT is a monomer with distinct asymmetry in the molecular surfaces surrounding the clusters. Unlike its paralogs mNT and NAF-1, it resides within the mitochondria. NAF-1 and mNT share similar backbone folds to the plant homodimeric NEET protein (At-NEET), while MiNT's backbone fold resembles a bacterial MiNT protein. Despite the variation of amino acid composition among these proteins, all NEET proteins retained their unique CDGSH domain harboring their unique 3Cys:1His [2Fe-2S] cluster coordination through evolution. The coordinating exposed His was shown to convey the lability to the NEET proteins' [2Fe-2S] clusters. In this minireview, we discuss the NEET fold and its structural elements. Special attention is given to the unique lability of the NEETs' [2Fe-2S] cluster and the implication of the latter to the NEET proteins' cellular and systemic function in health and disease.

  10. Classification of protein profiles using fuzzy clustering techniques

    DEFF Research Database (Denmark)

    Karemore, Gopal; Mullick, Jhinuk B.; Sujatha, R.

    2010-01-01

     Present  study  has  brought  out  a  comparison  of PCA  and  fuzzy  clustering  techniques  in  classifying  protein profiles  (chromatogram)  of  homogenates  of  different  tissue origins:  Ovarian,  Cervix,  Oral  cancers,  which  were  acquired using HPLC–LIF (High Performance Liquid...... Chromatography- Laser   Induced   Fluorescence)   method   developed   in   our laboratory. Study includes 11 chromatogram spectra each from oral,  cervical,  ovarian  cancers  as  well  as  healthy  volunteers. Generally  multivariate  analysis  like  PCA  demands  clear  data that   is   devoid   of   day......   PCA   mapping   in   classifying   various cancers from healthy spectra with classification rate up to 95 % from  60%.  Methods  are  validated  using  various  clustering indexes   and   shows   promising   improvement   in   developing optical pathology like HPLC-LIF for early detection of various...

  11. PSPP: a protein structure prediction pipeline for computing clusters.

    Directory of Open Access Journals (Sweden)

    Michael S Lee

    2009-07-01

    Full Text Available Protein structures are critical for understanding the mechanisms of biological systems and, subsequently, for drug and vaccine design. Unfortunately, protein sequence data exceed structural data by a factor of more than 200 to 1. This gap can be partially filled by using computational protein structure prediction. While structure prediction Web servers are a notable option, they often restrict the number of sequence queries and/or provide a limited set of prediction methodologies. Therefore, we present a standalone protein structure prediction software package suitable for high-throughput structural genomic applications that performs all three classes of prediction methodologies: comparative modeling, fold recognition, and ab initio. This software can be deployed on a user's own high-performance computing cluster.The pipeline consists of a Perl core that integrates more than 20 individual software packages and databases, most of which are freely available from other research laboratories. The query protein sequences are first divided into domains either by domain boundary recognition or Bayesian statistics. The structures of the individual domains are then predicted using template-based modeling or ab initio modeling. The predicted models are scored with a statistical potential and an all-atom force field. The top-scoring ab initio models are annotated by structural comparison against the Structural Classification of Proteins (SCOP fold database. Furthermore, secondary structure, solvent accessibility, transmembrane helices, and structural disorder are predicted. The results are generated in text, tab-delimited, and hypertext markup language (HTML formats. So far, the pipeline has been used to study viral and bacterial proteomes.The standalone pipeline that we introduce here, unlike protein structure prediction Web servers, allows users to devote their own computing assets to process a potentially unlimited number of queries as well as perform

  12. Identification of multiply charged proteins and amino acid clusters by liquid nitrogen assisted spray ionization mass spectrometry.

    Science.gov (United States)

    Kumar Kailasa, Suresh; Hasan, Nazim; Wu, Hui-Fen

    2012-08-15

    The development of liquid nitrogen assisted spray ionization mass spectrometry (LNASI MS) for the analysis of multiply charged proteins (insulin, ubiquitin, cytochrome c, α-lactalbumin, myoglobin and BSA), peptides (glutathione, HW6, angiotensin-II and valinomycin) and amino acid (arginine) clusters is described. The charged droplets are formed by liquid nitrogen assisted sample spray through a stainless steel nebulizer and transported into mass analyzer for the identification of multiply charged protein ions. The effects of acids and modifier volumes for the efficient ionization of the above analytes in LNASI MS were carefully investigated. Multiply charged proteins and amino acid clusters were effectively identified by LNASI MS. The present approach can effectively detect the multiply charged states of cytochrome c at 400 nM. A comparison between LNASI and ESI, CSI, SSI and V-EASI methods on instrumental conditions, applied temperature and observed charge states for the multiply charged proteins, shows that the LNASI method produces the good quality spectra of amino acid clusters at ambient conditions without applied any electric field and heat. To date, we believe that the LNASI method is the most simple, low cost and provided an alternative paradigm for production of multiply charged ions by LNASI MS, just as ESI-like ions yet no need for applying any electrical field and it could be operated at low temperature for generation of highly charged protein/peptide ions. Copyright © 2012 Elsevier B.V. All rights reserved.

  13. Evidence for the additions of clustered interacting nodes during the evolution of protein interaction networks from network motifs

    Directory of Open Access Journals (Sweden)

    Guo Hao

    2011-05-01

    Full Text Available Abstract Background High-throughput screens have revealed large-scale protein interaction networks defining most cellular functions. How the proteins were added to the protein interaction network during its growth is a basic and important issue. Network motifs represent the simplest building blocks of cellular machines and are of biological significance. Results Here we study the evolution of protein interaction networks from the perspective of network motifs. We find that in current protein interaction networks, proteins of the same age class tend to form motifs and such co-origins of motif constituents are affected by their topologies and biological functions. Further, we find that the proteins within motifs whose constituents are of the same age class tend to be densely interconnected, co-evolve and share the same biological functions, and these motifs tend to be within protein complexes. Conclusions Our findings provide novel evidence for the hypothesis of the additions of clustered interacting nodes and point out network motifs, especially the motifs with the dense topology and specific function may play important roles during this process. Our results suggest functional constraints may be the underlying driving force for such additions of clustered interacting nodes.

  14. Transcriptional analysis of the jamaicamide gene cluster from the marine cyanobacterium Lyngbya majuscula and identification of possible regulatory proteins

    Directory of Open Access Journals (Sweden)

    Dorrestein Pieter C

    2009-12-01

    Full Text Available Abstract Background The marine cyanobacterium Lyngbya majuscula is a prolific producer of bioactive secondary metabolites. Although biosynthetic gene clusters encoding several of these compounds have been identified, little is known about how these clusters of genes are transcribed or regulated, and techniques targeting genetic manipulation in Lyngbya strains have not yet been developed. We conducted transcriptional analyses of the jamaicamide gene cluster from a Jamaican strain of Lyngbya majuscula, and isolated proteins that could be involved in jamaicamide regulation. Results An unusually long untranslated leader region of approximately 840 bp is located between the jamaicamide transcription start site (TSS and gene cluster start codon. All of the intergenic regions between the pathway ORFs were transcribed into RNA in RT-PCR experiments; however, a promoter prediction program indicated the possible presence of promoters in multiple intergenic regions. Because the functionality of these promoters could not be verified in vivo, we used a reporter gene assay in E. coli to show that several of these intergenic regions, as well as the primary promoter preceding the TSS, are capable of driving β-galactosidase production. A protein pulldown assay was also used to isolate proteins that may regulate the jamaicamide pathway. Pulldown experiments using the intergenic region upstream of jamA as a DNA probe isolated two proteins that were identified by LC-MS/MS. By BLAST analysis, one of these had close sequence identity to a regulatory protein in another cyanobacterial species. Protein comparisons suggest a possible correlation between secondary metabolism regulation and light dependent complementary chromatic adaptation. Electromobility shift assays were used to evaluate binding of the recombinant proteins to the jamaicamide promoter region. Conclusion Insights into natural product regulation in cyanobacteria are of significant value to drug discovery

  15. A point mutation in the [2Fe–2S] cluster binding region of the NAF-1 protein (H114C) dramatically hinders the cluster donor properties

    Energy Technology Data Exchange (ETDEWEB)

    Tamir, Sagi; Eisenberg-Domovich, Yael [The Hebrew University of Jerusalem, Edmond J. Safra Campus at Givat Ram, Jerusalem 91904 (Israel); Conlan, Andrea R.; Stofleth, Jason T.; Lipper, Colin H.; Paddock, Mark L. [University of California at San Diego, La Jolla, CA 92093 (United States); Mittler, Ron [University of North Texas, Denton, TX 76203 (United States); Jennings, Patricia A. [University of California at San Diego, La Jolla, CA 92093 (United States); Livnah, Oded, E-mail: oded.livnah@huji.ac.il; Nechushtai, Rachel, E-mail: oded.livnah@huji.ac.il [The Hebrew University of Jerusalem, Edmond J. Safra Campus at Givat Ram, Jerusalem 91904 (Israel)

    2014-06-01

    NAF-1 has been shown to be related with human health and disease, is upregulated in epithelial breast cancer and suppression of its expression significantly suppresses tumor growth. It is shown that replacement of the single His ligand with Cys resulted in dramatic changes to the properties of its 2Fe-2S clusters without any global crystal structural changes. NAF-1 is an important [2Fe–2S] NEET protein associated with human health and disease. A mis-splicing mutation in NAF-1 results in Wolfram Syndrome type 2, a lethal childhood disease. Upregulation of NAF-1 is found in epithelial breast cancer cells, and suppression of NAF-1 expression by knockdown significantly suppresses tumor growth. Key to NAF-1 function is the NEET fold with its [2Fe–2S] cluster. In this work, the high-resolution structure of native NAF-1 was determined to 1.65 Å resolution (R factor = 13.5%) together with that of a mutant in which the single His ligand of its [2Fe–2S] cluster, His114, was replaced by Cys. The NAF-1 H114C mutant structure was determined to 1.58 Å resolution (R factor = 16.0%). All structural differences were localized to the cluster binding site. Compared with native NAF-1, the [2Fe–2S] clusters of the H114C mutant were found to (i) be 25-fold more stable, (ii) have a redox potential that is 300 mV more negative and (iii) have their cluster donation/transfer function abolished. Because no global structural differences were found between the mutant and the native (wild-type) NAF-1 proteins, yet significant functional differences exist between them, the NAF-1 H114C mutant is an excellent tool to decipher the underlying biological importance of the [2Fe–2S] cluster of NAF-1 in vivo.

  16. A point mutation in the [2Fe–2S] cluster binding region of the NAF-1 protein (H114C) dramatically hinders the cluster donor properties

    International Nuclear Information System (INIS)

    Tamir, Sagi; Eisenberg-Domovich, Yael; Conlan, Andrea R.; Stofleth, Jason T.; Lipper, Colin H.; Paddock, Mark L.; Mittler, Ron; Jennings, Patricia A.; Livnah, Oded; Nechushtai, Rachel

    2014-01-01

    NAF-1 has been shown to be related with human health and disease, is upregulated in epithelial breast cancer and suppression of its expression significantly suppresses tumor growth. It is shown that replacement of the single His ligand with Cys resulted in dramatic changes to the properties of its 2Fe-2S clusters without any global crystal structural changes. NAF-1 is an important [2Fe–2S] NEET protein associated with human health and disease. A mis-splicing mutation in NAF-1 results in Wolfram Syndrome type 2, a lethal childhood disease. Upregulation of NAF-1 is found in epithelial breast cancer cells, and suppression of NAF-1 expression by knockdown significantly suppresses tumor growth. Key to NAF-1 function is the NEET fold with its [2Fe–2S] cluster. In this work, the high-resolution structure of native NAF-1 was determined to 1.65 Å resolution (R factor = 13.5%) together with that of a mutant in which the single His ligand of its [2Fe–2S] cluster, His114, was replaced by Cys. The NAF-1 H114C mutant structure was determined to 1.58 Å resolution (R factor = 16.0%). All structural differences were localized to the cluster binding site. Compared with native NAF-1, the [2Fe–2S] clusters of the H114C mutant were found to (i) be 25-fold more stable, (ii) have a redox potential that is 300 mV more negative and (iii) have their cluster donation/transfer function abolished. Because no global structural differences were found between the mutant and the native (wild-type) NAF-1 proteins, yet significant functional differences exist between them, the NAF-1 H114C mutant is an excellent tool to decipher the underlying biological importance of the [2Fe–2S] cluster of NAF-1 in vivo

  17. Interaction between Nbp35 and Cfd1 proteins of cytosolic Fe-S cluster assembly reveals a stable complex formation in Entamoeba histolytica.

    Directory of Open Access Journals (Sweden)

    Shadab Anwar

    Full Text Available Iron-Sulfur (Fe-S proteins are involved in many biological functions such as electron transport, photosynthesis, regulation of gene expression and enzymatic activities. Biosynthesis and transfer of Fe-S clusters depend on Fe-S clusters assembly processes such as ISC, SUF, NIF, and CIA systems. Unlike other eukaryotes which possess ISC and CIA systems, amitochondriate Entamoeba histolytica has retained NIF & CIA systems for Fe-S cluster assembly in the cytosol. In the present study, we have elucidated interaction between two proteins of E. histolytica CIA system, Cytosolic Fe-S cluster deficient 1 (Cfd1 protein and Nucleotide binding protein 35 (Nbp35. In-silico analysis showed that structural regions ranging from amino acid residues (P33-K35, G131-V135 and I147-E151 of Nbp35 and (G5-V6, M34-D39 and G46-A52 of Cfd1 are involved in the formation of protein-protein complex. Furthermore, Molecular dynamic (MD simulations study suggested that hydrophobic forces surpass over hydrophilic forces between Nbp35 and Cfd1 and Van-der-Waal interaction plays crucial role in the formation of stable complex. Both proteins were separately cloned, expressed as recombinant fusion proteins in E. coli and purified to homogeneity by affinity column chromatography. Physical interaction between Nbp35 and Cfd1 proteins was confirmed in vitro by co-purification of recombinant Nbp35 with thrombin digested Cfd1 and in vivo by pull down assay and immunoprecipitation. The insilico, in vitro as well as in vivo results prove a stable interaction between these two proteins, supporting the possibility of its involvement in Fe-S cluster transfer to target apo-proteins through CIA machinery in E. histolytica. Our study indicates that initial synthesis of a Fe-S precursor in mitochondria is not necessary for the formation of Cfd1-Nbp35 complex. Thus, Cfd1 and Nbp35 with the help of cytosolic NifS and NifU proteins can participate in the maturation of non-mitosomal Fe-S proteins

  18. Association of papillomavirus E6 proteins with either MAML1 or E6AP clusters E6 proteins by structure, function, and evolutionary relatedness.

    Directory of Open Access Journals (Sweden)

    Nicole Brimer

    2017-12-01

    Full Text Available Papillomavirus E6 proteins bind to LXXLL peptide motifs displayed on targeted cellular proteins. Alpha genus HPV E6 proteins associate with the cellular ubiquitin ligase E6AP (UBE3A, by binding to an LXXLL peptide (ELTLQELLGEE displayed by E6AP, thereby stimulating E6AP ubiquitin ligase activity. Beta, Gamma, and Delta genera E6 proteins bind a similar LXXLL peptide (WMSDLDDLLGS on the cellular transcriptional co-activator MAML1 and thereby repress Notch signaling. We expressed 45 different animal and human E6 proteins from diverse papillomavirus genera to ascertain the overall preference of E6 proteins for E6AP or MAML1. E6 proteins from all HPV genera except Alpha preferentially interacted with MAML1 over E6AP. Among animal papillomaviruses, E6 proteins from certain ungulate (SsPV1 from pigs and cetacean (porpoises and dolphins hosts functionally resembled Alpha genus HPV by binding and targeting the degradation of E6AP. Beta genus HPV E6 proteins functionally clustered with Delta, Pi, Tau, Gamma, Chi, Mu, Lambda, Iota, Dyokappa, Rho, and Dyolambda E6 proteins to bind and repress MAML1. None of the tested E6 proteins physically and functionally interacted with both MAML1 and E6AP, indicating an evolutionary split. Further, interaction of an E6 protein was insufficient to activate degradation of E6AP, indicating that E6 proteins that target E6AP co-evolved to separately acquire both binding and triggering of ubiquitin ligase activation. E6 proteins with similar biological function clustered together in phylogenetic trees and shared structural features. This suggests that the divergence of E6 proteins from either MAML1 or E6AP binding preference is a major event in papillomavirus evolution.

  19. Fluorescence detection of a protein-bound 2Fe2S cluster.

    Science.gov (United States)

    Hoff, Kevin G; Goodlitt, Rochelle; Li, Rui; Smolke, Christina D; Silberg, Jonathan J

    2009-03-02

    A fluorescent biosensor is described for 2Fe2S clusters that is composed of green fluorescent protein (GFP) fused to glutaredoxin 2 (Grx2), as illustrated here. 2Fe2S detection is based on the reduction of GFP fluorescence upon the 2Fe2S-induced dimerization of GFP-Grx2. This assay is sufficiently sensitive to detect submicromolar changes in 2Fe2S levels, thus making it suitable for high-throughput measurements of metallocluster degradation and synthesis reactions.

  20. From Lipid Homeostasis to Differentiation: Old and New Functions of the Zinc Cluster Proteins Ecm22, Upc2, Sut1 and Sut2

    Directory of Open Access Journals (Sweden)

    Ifeoluwapo Matthew Joshua

    2017-04-01

    Full Text Available Zinc cluster proteins are a large family of transcriptional regulators with a wide range of biological functions. The zinc cluster proteins Ecm22, Upc2, Sut1 and Sut2 have initially been identified as regulators of sterol import in the budding yeast Saccharomyces cerevisiae. These proteins also control adaptations to anaerobic growth, sterol biosynthesis as well as filamentation and mating. Orthologs of these zinc cluster proteins have been identified in several species of Candida. Upc2 plays a critical role in antifungal resistance in these important human fungal pathogens. Upc2 is therefore an interesting potential target for novel antifungals. In this review we discuss the functions, mode of actions and regulation of Ecm22, Upc2, Sut1 and Sut2 in budding yeast and Candida.

  1. Architecture of the Yeast Mitochondrial Iron-Sulfur Cluster Assembly Machinery: THE SUB-COMPLEX FORMED BY THE IRON DONOR, Yfh1 PROTEIN, AND THE SCAFFOLD, Isu1 PROTEIN.

    Science.gov (United States)

    Ranatunga, Wasantha; Gakh, Oleksandr; Galeano, Belinda K; Smith, Douglas Y; Söderberg, Christopher A G; Al-Karadaghi, Salam; Thompson, James R; Isaya, Grazia

    2016-05-06

    The biosynthesis of Fe-S clusters is a vital process involving the delivery of elemental iron and sulfur to scaffold proteins via molecular interactions that are still poorly defined. We reconstituted a stable, functional complex consisting of the iron donor, Yfh1 (yeast frataxin homologue 1), and the Fe-S cluster scaffold, Isu1, with 1:1 stoichiometry, [Yfh1]24·[Isu1]24 Using negative staining transmission EM and single particle analysis, we obtained a three-dimensional reconstruction of this complex at a resolution of ∼17 Å. In addition, via chemical cross-linking, limited proteolysis, and mass spectrometry, we identified protein-protein interaction surfaces within the complex. The data together reveal that [Yfh1]24·[Isu1]24 is a roughly cubic macromolecule consisting of one symmetric Isu1 trimer binding on top of one symmetric Yfh1 trimer at each of its eight vertices. Furthermore, molecular modeling suggests that two subunits of the cysteine desulfurase, Nfs1, may bind symmetrically on top of two adjacent Isu1 trimers in a manner that creates two putative [2Fe-2S] cluster assembly centers. In each center, conserved amino acids known to be involved in sulfur and iron donation by Nfs1 and Yfh1, respectively, are in close proximity to the Fe-S cluster-coordinating residues of Isu1. We suggest that this architecture is suitable to ensure concerted and protected transfer of potentially toxic iron and sulfur atoms to Isu1 during Fe-S cluster assembly. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  2. Conformational Clusters of Phosphorylated Tyrosine.

    Science.gov (United States)

    Abdelrasoul, Maha; Ponniah, Komala; Mao, Alice; Warden, Meghan S; Elhefnawy, Wessam; Li, Yaohang; Pascal, Steven M

    2017-12-06

    Tyrosine phosphorylation plays an important role in many cellular and intercellular processes including signal transduction, subcellular localization, and regulation of enzymatic activity. In 1999, Blom et al., using the limited number of protein data bank (PDB) structures available at that time, reported that the side chain structures of phosphorylated tyrosine (pY) are partitioned into two conserved conformational clusters ( Blom, N.; Gammeltoft, S.; Brunak, S. J. Mol. Biol. 1999 , 294 , 1351 - 1362 ). We have used the spectral clustering algorithm to cluster the increasingly growing number of protein structures with pY sites, and have found that the pY residues cluster into three distinct side chain conformations. Two of these pY conformational clusters associate strongly with a narrow range of tyrosine backbone conformation. The novel cluster also highly correlates with the identity of the n + 1 residue, and is strongly associated with a sequential pYpY conformation which places two adjacent pY side chains in a specific relative orientation. Further analysis shows that the three pY clusters are associated with distinct distributions of cognate protein kinases.

  3. The quantitative assessment of the role played by basic amino acid clusters in the nuclear uptake of human ribosomal protein L7

    International Nuclear Information System (INIS)

    Tai, Lin-Ru; Chou, Chang-Wei; Lee, I-Fang; Kirby, Ralph; Lin, Alan

    2013-01-01

    In this study, we used a multiple copy (EGFP) 3 reporter system to establish a numeric nuclear index system to assess the degree of nuclear import. The system was first validated by a FRAP assay, and then was applied to evaluate the essential and multifaceted nature of basic amino acid clusters during the nuclear import of ribosomal protein L7. The results indicate that the sequence context of the basic cluster determines the degree of nuclear import, and that the number of basic residues in the cluster is irrelevant; rather the position of the pertinent basic residues is crucial. Moreover, it also found that the type of carrier protein used by basic cluster has a great impact on the degree of nuclear import. In case of L7, importin β2 or importin β3 are preferentially used by clusters with a high import efficiency, notwithstanding that other importins are also used by clusters with a weaker level of nuclear import. Such a preferential usage of multiple basic clusters and importins to gain nuclear entry would seem to be a common practice among ribosomal proteins in order to ensure their full participation in high rate ribosome synthesis. - Highlights: ► We introduce a numeric index system that represents the degree of nuclear import. ► The rate of nuclear import is dictated by the sequence context of the basic cluster. ► Importin β2 and β3 were mainly responsible for the N4 mediated nuclear import

  4. The quantitative assessment of the role played by basic amino acid clusters in the nuclear uptake of human ribosomal protein L7

    Energy Technology Data Exchange (ETDEWEB)

    Tai, Lin-Ru [Institute of Genome Sciences, National Yang-Ming University, Taipei, Taiwan, ROC (China); Chou, Chang-Wei [Institute of Clinical Dentistry Science, National Yang-Ming University, Taipei, Taiwan, ROC (China); Lee, I-Fang; Kirby, Ralph [Institute of Genome Sciences, National Yang-Ming University, Taipei, Taiwan, ROC (China); Lin, Alan, E-mail: alin@ym.edu.tw [Institute of Genome Sciences, National Yang-Ming University, Taipei, Taiwan, ROC (China); Institute of Clinical Dentistry Science, National Yang-Ming University, Taipei, Taiwan, ROC (China)

    2013-02-15

    In this study, we used a multiple copy (EGFP){sub 3} reporter system to establish a numeric nuclear index system to assess the degree of nuclear import. The system was first validated by a FRAP assay, and then was applied to evaluate the essential and multifaceted nature of basic amino acid clusters during the nuclear import of ribosomal protein L7. The results indicate that the sequence context of the basic cluster determines the degree of nuclear import, and that the number of basic residues in the cluster is irrelevant; rather the position of the pertinent basic residues is crucial. Moreover, it also found that the type of carrier protein used by basic cluster has a great impact on the degree of nuclear import. In case of L7, importin β2 or importin β3 are preferentially used by clusters with a high import efficiency, notwithstanding that other importins are also used by clusters with a weaker level of nuclear import. Such a preferential usage of multiple basic clusters and importins to gain nuclear entry would seem to be a common practice among ribosomal proteins in order to ensure their full participation in high rate ribosome synthesis. - Highlights: ► We introduce a numeric index system that represents the degree of nuclear import. ► The rate of nuclear import is dictated by the sequence context of the basic cluster. ► Importin β2 and β3 were mainly responsible for the N4 mediated nuclear import.

  5. An Atlas of Peroxiredoxins Created Using an Active Site Profile-Based Approach to Functionally Relevant Clustering of Proteins.

    Directory of Open Access Journals (Sweden)

    Angela F Harper

    2017-02-01

    Full Text Available Peroxiredoxins (Prxs or Prdxs are a large protein superfamily of antioxidant enzymes that rapidly detoxify damaging peroxides and/or affect signal transduction and, thus, have roles in proliferation, differentiation, and apoptosis. Prx superfamily members are widespread across phylogeny and multiple methods have been developed to classify them. Here we present an updated atlas of the Prx superfamily identified using a novel method called MISST (Multi-level Iterative Sequence Searching Technique. MISST is an iterative search process developed to be both agglomerative, to add sequences containing similar functional site features, and divisive, to split groups when functional site features suggest distinct functionally-relevant clusters. Superfamily members need not be identified initially-MISST begins with a minimal representative set of known structures and searches GenBank iteratively. Further, the method's novelty lies in the manner in which isofunctional groups are selected; rather than use a single or shifting threshold to identify clusters, the groups are deemed isofunctional when they pass a self-identification criterion, such that the group identifies itself and nothing else in a search of GenBank. The method was preliminarily validated on the Prxs, as the Prxs presented challenges of both agglomeration and division. For example, previous sequence analysis clustered the Prx functional families Prx1 and Prx6 into one group. Subsequent expert analysis clearly identified Prx6 as a distinct functionally relevant group. The MISST process distinguishes these two closely related, though functionally distinct, families. Through MISST search iterations, over 38,000 Prx sequences were identified, which the method divided into six isofunctional clusters, consistent with previous expert analysis. The results represent the most complete computational functional analysis of proteins comprising the Prx superfamily. The feasibility of this novel method is

  6. Iron-sulfur cluster biogenesis in mammalian cells: new insights into the molecular mechanisms of cluster delivery

    Science.gov (United States)

    Maio, Nunziata; Rouault, Tracey. A.

    2014-01-01

    Iron-sulfur (Fe-S) clusters are ancient, ubiquitous cofactors composed of iron and inorganic sulfur. The combination of the chemical reactivity of iron and sulfur, together with many variations of cluster composition, oxidation states and protein environments, enables Fe-S clusters to participate in numerous biological processes. Fe-S clusters are essential to redox catalysis in nitrogen fixation, mitochondrial respiration and photosynthesis, to regulatory sensing in key metabolic pathways (i. e. cellular iron homeostasis and oxidative stress response), and to the replication and maintenance of the nuclear genome. Fe-S cluster biogenesis is a multistep process that involves a complex sequence of catalyzed protein- protein interactions and coupled conformational changes between the components of several dedicated multimeric complexes. Intensive studies of the assembly process have clarified key points in the biogenesis of Fe-S proteins. However several critical questions still remain, such as: what is the role of frataxin? Why do some defects of Fe-S cluster biogenesis cause mitochondrial iron overload? How are specific Fe-S recipient proteins recognized in the process of Fe-S transfer? This review focuses on the basic steps of Fe-S cluster biogenesis, drawing attention to recent advances achieved on the identification of molecular features that guide selection of specific subsets of nascent Fe-S recipients by the cochaperone HSC20. Additionally, it outlines the distinctive phenotypes of human diseases due to mutations in the components of the basic pathway. PMID:25245479

  7. bcl::Cluster : A method for clustering biological molecules coupled with visualization in the Pymol Molecular Graphics System.

    Science.gov (United States)

    Alexander, Nathan; Woetzel, Nils; Meiler, Jens

    2011-02-01

    Clustering algorithms are used as data analysis tools in a wide variety of applications in Biology. Clustering has become especially important in protein structure prediction and virtual high throughput screening methods. In protein structure prediction, clustering is used to structure the conformational space of thousands of protein models. In virtual high throughput screening, databases with millions of drug-like molecules are organized by structural similarity, e.g. common scaffolds. The tree-like dendrogram structure obtained from hierarchical clustering can provide a qualitative overview of the results, which is important for focusing detailed analysis. However, in practice it is difficult to relate specific components of the dendrogram directly back to the objects of which it is comprised and to display all desired information within the two dimensions of the dendrogram. The current work presents a hierarchical agglomerative clustering method termed bcl::Cluster. bcl::Cluster utilizes the Pymol Molecular Graphics System to graphically depict dendrograms in three dimensions. This allows simultaneous display of relevant biological molecules as well as additional information about the clusters and the members comprising them.

  8. Nitrosylation of Nitric-Oxide-Sensing Regulatory Proteins Containing [4Fe-4S] Clusters Gives Rise to Multiple Iron-Nitrosyl Complexes

    Energy Technology Data Exchange (ETDEWEB)

    Serrano, Pauline N. [Department of Chemistry, University of California, Davis CA 95616 USA; Wang, Hongxin [Department of Chemistry, University of California, Davis CA 95616 USA; Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley CA 94720 USA; Crack, Jason C. [Centre for Molecular and Structural Biochemistry, School of Chemistry, University of East Anglia, Norwich Research Park Norwich NR4 7TJ UK; Prior, Christopher [Centre for Molecular and Structural Biochemistry, School of Chemistry, University of East Anglia, Norwich Research Park Norwich NR4 7TJ UK; Hutchings, Matthew I. [School of Biological Sciences, University of East Anglia, Norwich NR4 7TJ UK; Thomson, Andrew J. [Centre for Molecular and Structural Biochemistry, School of Chemistry, University of East Anglia, Norwich Research Park Norwich NR4 7TJ UK; Kamali, Saeed [University of Tennessee Space Institute, Tullahome TN 37388-9700 USA; Yoda, Yoshitaka [Research and Utilization Division, SPring-8/JASRI, 1-1-1 Kouto, Sayo Hyogo 679-5198 Japan; Zhao, Jiyong [Advanced Photon Source, Argonne National Laboratory, Argonne IL 60439 USA; Hu, Michael Y. [Advanced Photon Source, Argonne National Laboratory, Argonne IL 60439 USA; Alp, Ercan E. [Advanced Photon Source, Argonne National Laboratory, Argonne IL 60439 USA; Oganesyan, Vasily S. [Centre for Molecular and Structural Biochemistry, School of Chemistry, University of East Anglia, Norwich Research Park Norwich NR4 7TJ UK; Le Brun, Nick E. [Centre for Molecular and Structural Biochemistry, School of Chemistry, University of East Anglia, Norwich Research Park Norwich NR4 7TJ UK; Cramer, Stephen P. [Department of Chemistry, University of California, Davis CA 95616 USA; Physical Biosciences Division, Lawrence Berkeley National Laboratory, Berkeley CA 94720 USA

    2016-10-25

    The reaction of protein-bound iron–sulfur (Fe-S) clusters with nitric oxide (NO) plays key roles in NO-mediated toxicity and signaling. Elucidation of the mechanism of the reaction of NO with DNA regulatory proteins that contain Fe-S clusters has been hampered by a lack of information about the nature of the iron-nitrosyl products formed. Herein, we report nuclear resonance vibrational spectroscopy (NRVS) and density functional theory (DFT) calculations that identify NO reaction products in WhiD and NsrR, regulatory proteins that use a [4Fe-4S] cluster to sense NO. This work reveals that nitrosylation yields multiple products structurally related to Roussin's Red Ester (RRE, [Fe2(NO)4(Cys)2]) and Roussin's Black Salt (RBS, [Fe4(NO)7S3]. In the latter case, the absence of 32S/34S shifts in the Fe-S region of the NRVS spectra suggest that a new species, Roussin's Black Ester (RBE), may be formed, in which one or more of the sulfide ligands is replaced by Cys thiolates.

  9. UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.

    Science.gov (United States)

    Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier

    2016-01-04

    The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  10. Proteomic-based detection of a protein cluster dysregulated during cardiovascular development identifies biomarkers of congenital heart defects.

    Directory of Open Access Journals (Sweden)

    Anjali K Nath

    Full Text Available Cardiovascular development is vital for embryonic survival and growth. Early gestation embryo loss or malformation has been linked to yolk sac vasculopathy and congenital heart defects (CHDs. However, the molecular pathways that underlie these structural defects in humans remain largely unknown hindering the development of molecular-based diagnostic tools and novel therapies.Murine embryos were exposed to high glucose, a condition known to induce cardiovascular defects in both animal models and humans. We further employed a mass spectrometry-based proteomics approach to identify proteins differentially expressed in embryos with defects from those with normal cardiovascular development. The proteins detected by mass spectrometry (WNT16, ST14, Pcsk1, Jumonji, Morca2a, TRPC5, and others were validated by Western blotting and immunoflorescent staining of the yolk sac and heart. The proteins within the proteomic dataset clustered to adhesion/migration, differentiation, transport, and insulin signaling pathways. A functional role for several proteins (WNT16, ADAM15 and NOGO-A/B was demonstrated in an ex vivo model of heart development. Additionally, a successful application of a cluster of protein biomarkers (WNT16, ST14 and Pcsk1 as a prenatal screen for CHDs was confirmed in a study of human amniotic fluid (AF samples from women carrying normal fetuses and those with CHDs.The novel finding that WNT16, ST14 and Pcsk1 protein levels increase in fetuses with CHDs suggests that these proteins may play a role in the etiology of human CHDs. The information gained through this bed-side to bench translational approach contributes to a more complete understanding of the protein pathways dysregulated during cardiovascular development and provides novel avenues for diagnostic and therapeutic interventions, beneficial to fetuses at risk for CHDs.

  11. Cluster editing

    DEFF Research Database (Denmark)

    Böcker, S.; Baumbach, Jan

    2013-01-01

    . The problem has been the inspiration for numerous algorithms in bioinformatics, aiming at clustering entities such as genes, proteins, phenotypes, or patients. In this paper, we review exact and heuristic methods that have been proposed for the Cluster Editing problem, and also applications......The Cluster Editing problem asks to transform a graph into a disjoint union of cliques using a minimum number of edge modifications. Although the problem has been proven NP-complete several times, it has nevertheless attracted much research both from the theoretical and the applied side...

  12. Live-cell FRET imaging reveals clustering of the prion protein at the cell surface induced by infectious prions.

    Science.gov (United States)

    Tavares, Evandro; Macedo, Joana A; Paulo, Pedro M R; Tavares, Catarina; Lopes, Carlos; Melo, Eduardo P

    2014-07-01

    Prion diseases are associated to the conversion of the prion protein into a misfolded pathological isoform. The mechanism of propagation of protein misfolding by protein templating remains largely unknown. Neuroblastoma cells were transfected with constructs of the prion protein fused to both CFP-GPI-anchored and to YFP-GPI-anchored and directed to its cell membrane location. Live-cell FRET imaging between the prion protein fused to CFP or YFP was measured giving consistent values of 10±2%. This result was confirmed by fluorescence lifetime imaging microscopy and indicates intermolecular interactions between neighbor prion proteins. In particular, considering that a maximum FRET efficiency of 17±2% was determined from a positive control consisting of a fusion CFP-YFP-GPI-anchored. A stable cell clone expressing the two fusions containing the prion protein was also selected to minimize cell-to-cell variability. In both, stable and transiently transfected cells, the FRET efficiency consistently increased in the presence of infectious prions - from 4±1% to 7±1% in the stable clone and from 10±2% to 16±1% in transiently transfected cells. These results clearly reflect an increased clustering of the prion protein on the membrane in the presence of infectious prions, which was not observed in negative control using constructs without the prion protein and upon addition of non-infected brain. Our data corroborates the recent view that the primary site for prion conversion is the cell membrane. Since our fluorescent cell clone is not susceptible to propagate infectivity, we hypothesize that the initial event of prion infectivity might be the clustering of the GPI-anchored prion protein. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. Functional Complementation Studies Reveal Different Interaction Partners of Escherichia coli IscS and Human NFS1.

    Science.gov (United States)

    Bühning, Martin; Friemel, Martin; Leimkühler, Silke

    2017-08-29

    The trafficking and delivery of sulfur to cofactors and nucleosides is a highly regulated and conserved process among all organisms. All sulfur transfer pathways generally have an l-cysteine desulfurase as an initial sulfur-mobilizing enzyme in common, which serves as a sulfur donor for the biosynthesis of sulfur-containing biomolecules like iron-sulfur (Fe-S) clusters, thiamine, biotin, lipoic acid, the molybdenum cofactor (Moco), and thiolated nucleosides in tRNA. The human l-cysteine desulfurase NFS1 and the Escherichia coli homologue IscS share a level of amino acid sequence identity of ∼60%. While E. coli IscS has a versatile role in the cell and was shown to have numerous interaction partners, NFS1 is mainly localized in mitochondria with a crucial role in the biosynthesis of Fe-S clusters. Additionally, NFS1 is also located in smaller amounts in the cytosol with a role in Moco biosynthesis and mcm 5 s 2 U34 thio modifications of nucleosides in tRNA. NFS1 and IscS were conclusively shown to have different interaction partners in their respective organisms. Here, we used functional complementation studies of an E. coli iscS deletion strain with human NFS1 to dissect their conserved roles in the transfer of sulfur to a specific target protein. Our results show that human NFS1 and E. coli IscS share conserved binding sites for proteins involved in Fe-S cluster assembly like IscU, but not with proteins for tRNA thio modifications or Moco biosynthesis. In addition, we show that human NFS1 was almost fully able to complement the role of IscS in Moco biosynthesis when its specific interaction partner protein MOCS3 from humans was also present.

  14. Clustering of double strand break-containing chromosome domains is not inhibited by inactivation of major repair proteins

    International Nuclear Information System (INIS)

    Krawczyk, P. M.; Stap, C.; Van Oven, C.; Hoebe, R.; Aten, J. A.

    2006-01-01

    For efficient repair of DNA double strand breaks (DSBs) cells rely on a process that involves the Mre11/Rad50/Nbs1 complex, which may help to protect non-repaired DNA ends from separating until they can be rejoined by DNA repair proteins. It has been observed that as a secondary effect, this process can lead to unintended clustering of multiple, initially separate, DSB-containing chromosome domains. This work demonstrates that neither inactivation of the major repair proteins XRCC3 and the DNA-dependent protein kinase (DNA-PK) nor inhibition of DNA-PK by vanillin influences the aggregation of DSB-containing chromosome domains. (authors)

  15. Cluster analysis of historical and modern hard red spring wheat cultivars based on parentage and HPLC analysis of gluten forming proteins

    Science.gov (United States)

    In this study, 30 hard red spring (HRS) wheat cultivars released between 1910 and 2013 were analyzed to determine how they cluster in terms of parentage and protein data, analyzed by reverse-phase HPLC (RP-HPLC) of gliadins, and size-exclusion HPLC (SE-HPLC) of unreduced proteins. Dwarfing genes in...

  16. Using hierarchical clustering of secreted protein families to classify and rank candidate effectors of rust fungi.

    Directory of Open Access Journals (Sweden)

    Diane G O Saunders

    Full Text Available Rust fungi are obligate biotrophic pathogens that cause considerable damage on crop plants. Puccinia graminis f. sp. tritici, the causal agent of wheat stem rust, and Melampsora larici-populina, the poplar leaf rust pathogen, have strong deleterious impacts on wheat and poplar wood production, respectively. Filamentous pathogens such as rust fungi secrete molecules called disease effectors that act as modulators of host cell physiology and can suppress or trigger host immunity. Current knowledge on effectors from other filamentous plant pathogens can be exploited for the characterisation of effectors in the genome of recently sequenced rust fungi. We designed a comprehensive in silico analysis pipeline to identify the putative effector repertoire from the genome of two plant pathogenic rust fungi. The pipeline is based on the observation that known effector proteins from filamentous pathogens have at least one of the following properties: (i contain a secretion signal, (ii are encoded by in planta induced genes, (iii have similarity to haustorial proteins, (iv are small and cysteine rich, (v contain a known effector motif or a nuclear localization signal, (vi are encoded by genes with long intergenic regions, (vii contain internal repeats, and (viii do not contain PFAM domains, except those associated with pathogenicity. We used Markov clustering and hierarchical clustering to classify protein families of rust pathogens and rank them according to their likelihood of being effectors. Using this approach, we identified eight families of candidate effectors that we consider of high value for functional characterization. This study revealed a diverse set of candidate effectors, including families of haustorial expressed secreted proteins and small cysteine-rich proteins. This comprehensive classification of candidate effectors from these devastating rust pathogens is an initial step towards probing plant germplasm for novel resistance components.

  17. Cofactor-binding sites in proteins of deviating sequence: comparative analysis and clustering in torsion angle, cavity, and fold space.

    Science.gov (United States)

    Stegemann, Björn; Klebe, Gerhard

    2012-02-01

    Small molecules are recognized in protein-binding pockets through surface-exposed physicochemical properties. To optimize binding, they have to adopt a conformation corresponding to a local energy minimum within the formed protein-ligand complex. However, their conformational flexibility makes them competent to bind not only to homologous proteins of the same family but also to proteins of remote similarity with respect to the shape of the binding pockets and folding pattern. Considering drug action, such observations can give rise to unexpected and undesired cross reactivity. In this study, datasets of six different cofactors (ADP, ATP, NAD(P)(H), FAD, and acetyl CoA, sharing an adenosine diphosphate moiety as common substructure), observed in multiple crystal structures of protein-cofactor complexes exhibiting sequence identity below 25%, have been analyzed for the conformational properties of the bound ligands, the distribution of physicochemical properties in the accommodating protein-binding pockets, and the local folding patterns next to the cofactor-binding site. State-of-the-art clustering techniques have been applied to group the different protein-cofactor complexes in the different spaces. Interestingly, clustering in cavity (Cavbase) and fold space (DALI) reveals virtually the same data structuring. Remarkable relationships can be found among the different spaces. They provide information on how conformations are conserved across the host proteins and which distinct local cavity and fold motifs recognize the different portions of the cofactors. In those cases, where different cofactors are found to be accommodated in a similar fashion to the same fold motifs, only a commonly shared substructure of the cofactors is used for the recognition process. Copyright © 2011 Wiley Periodicals, Inc.

  18. Oligomeric protein structure networks: insights into protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Brinda KV

    2005-12-01

    Full Text Available Abstract Background Protein-protein association is essential for a variety of cellular processes and hence a large number of investigations are being carried out to understand the principles of protein-protein interactions. In this study, oligomeric protein structures are viewed from a network perspective to obtain new insights into protein association. Structure graphs of proteins have been constructed from a non-redundant set of protein oligomer crystal structures by considering amino acid residues as nodes and the edges are based on the strength of the non-covalent interactions between the residues. The analysis of such networks has been carried out in terms of amino acid clusters and hubs (highly connected residues with special emphasis to protein interfaces. Results A variety of interactions such as hydrogen bond, salt bridges, aromatic and hydrophobic interactions, which occur at the interfaces are identified in a consolidated manner as amino acid clusters at the interface, from this study. Moreover, the characterization of the highly connected hub-forming residues at the interfaces and their comparison with the hubs from the non-interface regions and the non-hubs in the interface regions show that there is a predominance of charged interactions at the interfaces. Further, strong and weak interfaces are identified on the basis of the interaction strength between amino acid residues and the sizes of the interface clusters, which also show that many protein interfaces are stronger than their monomeric protein cores. The interface strengths evaluated based on the interface clusters and hubs also correlate well with experimentally determined dissociation constants for known complexes. Finally, the interface hubs identified using the present method correlate very well with experimentally determined hotspots in the interfaces of protein complexes obtained from the Alanine Scanning Energetics database (ASEdb. A few predictions of interface hot

  19. Fe-S Cluster Biogenesis in Isolated Mammalian Mitochondria

    Science.gov (United States)

    Pandey, Alok; Pain, Jayashree; Ghosh, Arnab K.; Dancis, Andrew; Pain, Debkumar

    2015-01-01

    Iron-sulfur (Fe-S) clusters are essential cofactors, and mitochondria contain several Fe-S proteins, including the [4Fe-4S] protein aconitase and the [2Fe-2S] protein ferredoxin. Fe-S cluster assembly of these proteins occurs within mitochondria. Although considerable data exist for yeast mitochondria, this biosynthetic process has never been directly demonstrated in mammalian mitochondria. Using [35S]cysteine as the source of sulfur, here we show that mitochondria isolated from Cath.A-derived cells, a murine neuronal cell line, can synthesize and insert new Fe-35S clusters into aconitase and ferredoxins. The process requires GTP, NADH, ATP, and iron, and hydrolysis of both GTP and ATP is necessary. Importantly, we have identified the 35S-labeled persulfide on the NFS1 cysteine desulfurase as a genuine intermediate en route to Fe-S cluster synthesis. In physiological settings, the persulfide sulfur is released from NFS1 and transferred to a scaffold protein, where it combines with iron to form an Fe-S cluster intermediate. We found that the release of persulfide sulfur from NFS1 requires iron, showing that the use of iron and sulfur for the synthesis of Fe-S cluster intermediates is a highly coordinated process. The release of persulfide sulfur also requires GTP and NADH, probably mediated by a GTPase and a reductase, respectively. ATP, a cofactor for a multifunctional Hsp70 chaperone, is not required at this step. The experimental system described here may help to define the biochemical basis of diseases that are associated with impaired Fe-S cluster biogenesis in mitochondria, such as Friedreich ataxia. PMID:25398879

  20. Integrative cluster analysis in bioinformatics

    CERN Document Server

    Abu-Jamous, Basel; Nandi, Asoke K

    2015-01-01

    Clustering techniques are increasingly being put to use in the analysis of high-throughput biological datasets. Novel computational techniques to analyse high throughput data in the form of sequences, gene and protein expressions, pathways, and images are becoming vital for understanding diseases and future drug discovery. This book details the complete pathway of cluster analysis, from the basics of molecular biology to the generation of biological knowledge. The book also presents the latest clustering methods and clustering validation, thereby offering the reader a comprehensive review o

  1. HKC: An Algorithm to Predict Protein Complexes in Protein-Protein Interaction Networks

    Directory of Open Access Journals (Sweden)

    Xiaomin Wang

    2011-01-01

    Full Text Available With the availability of more and more genome-scale protein-protein interaction (PPI networks, research interests gradually shift to Systematic Analysis on these large data sets. A key topic is to predict protein complexes in PPI networks by identifying clusters that are densely connected within themselves but sparsely connected with the rest of the network. In this paper, we present a new topology-based algorithm, HKC, to detect protein complexes in genome-scale PPI networks. HKC mainly uses the concepts of highest k-core and cohesion to predict protein complexes by identifying overlapping clusters. The experiments on two data sets and two benchmarks show that our algorithm has relatively high F-measure and exhibits better performance compared with some other methods.

  2. Validating clustering of molecular dynamics simulations using polymer models

    Directory of Open Access Journals (Sweden)

    Phillips Joshua L

    2011-11-01

    Full Text Available Abstract Background Molecular dynamics (MD simulation is a powerful technique for sampling the meta-stable and transitional conformations of proteins and other biomolecules. Computational data clustering has emerged as a useful, automated technique for extracting conformational states from MD simulation data. Despite extensive application, relatively little work has been done to determine if the clustering algorithms are actually extracting useful information. A primary goal of this paper therefore is to provide such an understanding through a detailed analysis of data clustering applied to a series of increasingly complex biopolymer models. Results We develop a novel series of models using basic polymer theory that have intuitive, clearly-defined dynamics and exhibit the essential properties that we are seeking to identify in MD simulations of real biomolecules. We then apply spectral clustering, an algorithm particularly well-suited for clustering polymer structures, to our models and MD simulations of several intrinsically disordered proteins. Clustering results for the polymer models provide clear evidence that the meta-stable and transitional conformations are detected by the algorithm. The results for the polymer models also help guide the analysis of the disordered protein simulations by comparing and contrasting the statistical properties of the extracted clusters. Conclusions We have developed a framework for validating the performance and utility of clustering algorithms for studying molecular biopolymer simulations that utilizes several analytic and dynamic polymer models which exhibit well-behaved dynamics including: meta-stable states, transition states, helical structures, and stochastic dynamics. We show that spectral clustering is robust to anomalies introduced by structural alignment and that different structural classes of intrinsically disordered proteins can be reliably discriminated from the clustering results. To our

  3. Spectroscopic and functional characterization of iron-sulfur cluster-bound forms of Azotobacter vinelandii (Nif)IscA.

    Science.gov (United States)

    Mapolelo, Daphne T; Zhang, Bo; Naik, Sunil G; Huynh, Boi Hanh; Johnson, Michael K

    2012-10-16

    The mechanism of [4Fe-4S] cluster assembly on A-type Fe-S cluster assembly proteins, in general, and the specific role of (Nif)IscA in the maturation of nitrogen fixation proteins are currently unknown. To address these questions, in vitro spectroscopic studies (UV-visible absorption/CD, resonance Raman and Mössbauer) have been used to investigate the mechanism of [4Fe-4S] cluster assembly on Azotobacter vinelandii(Nif)IscA, and the ability of (Nif)IscA to accept clusters from NifU and to donate clusters to the apo form of the nitrogenase Fe-protein. The results show that (Nif)IscA can rapidly and reversibly cycle between forms containing one [2Fe-2S](2+) and one [4Fe-4S](2+) cluster per homodimer via DTT-induced two-electron reductive coupling of two [2Fe-2S](2+) clusters and O(2)-induced [4Fe-4S](2+) oxidative cleavage. This unique type of cluster interconversion in response to cellular redox status and oxygen levels is likely to be important for the specific role of A-type proteins in the maturation of [4Fe-4S] cluster-containing proteins under aerobic growth or oxidative stress conditions. Only the [4Fe-4S](2+)-(Nif)IscA was competent for rapid activation of apo-nitrogenase Fe protein under anaerobic conditions. Apo-(Nif)IscA was shown to accept clusters from [4Fe-4S] cluster-bound NifU via rapid intact cluster transfer, indicating a potential role as a cluster carrier for delivery of clusters assembled on NifU. Overall the results support the proposal that A-type proteins can function as carrier proteins for clusters assembled on U-type proteins and suggest that they are likely to supply [2Fe-2S] clusters rather than [4Fe-4S] for the maturation of [4Fe-4S] cluster-containing proteins under aerobic or oxidative stress growth conditions.

  4. Robust continuous clustering.

    Science.gov (United States)

    Shah, Sohil Atul; Koltun, Vladlen

    2017-09-12

    Clustering is a fundamental procedure in the analysis of scientific data. It is used ubiquitously across the sciences. Despite decades of research, existing clustering algorithms have limited effectiveness in high dimensions and often require tuning parameters for different domains and datasets. We present a clustering algorithm that achieves high accuracy across multiple domains and scales efficiently to high dimensions and large datasets. The presented algorithm optimizes a smooth continuous objective, which is based on robust statistics and allows heavily mixed clusters to be untangled. The continuous nature of the objective also allows clustering to be integrated as a module in end-to-end feature learning pipelines. We demonstrate this by extending the algorithm to perform joint clustering and dimensionality reduction by efficiently optimizing a continuous global objective. The presented approach is evaluated on large datasets of faces, hand-written digits, objects, newswire articles, sensor readings from the Space Shuttle, and protein expression levels. Our method achieves high accuracy across all datasets, outperforming the best prior algorithm by a factor of 3 in average rank.

  5. Comparing Residue Clusters from Thermophilic and Mesophilic Enzymes Reveals Adaptive Mechanisms.

    Science.gov (United States)

    Sammond, Deanne W; Kastelowitz, Noah; Himmel, Michael E; Yin, Hang; Crowley, Michael F; Bomble, Yannick J

    2016-01-01

    Understanding how proteins adapt to function at high temperatures is important for deciphering the energetics that dictate protein stability and folding. While multiple principles important for thermostability have been identified, we lack a unified understanding of how internal protein structural and chemical environment determine qualitative or quantitative impact of evolutionary mutations. In this work we compare equivalent clusters of spatially neighboring residues between paired thermophilic and mesophilic homologues to evaluate adaptations under the selective pressure of high temperature. We find the residue clusters in thermophilic enzymes generally display improved atomic packing compared to mesophilic enzymes, in agreement with previous research. Unlike residue clusters from mesophilic enzymes, however, thermophilic residue clusters do not have significant cavities. In addition, anchor residues found in many clusters are highly conserved with respect to atomic packing between both thermophilic and mesophilic enzymes. Thus the improvements in atomic packing observed in thermophilic homologues are not derived from these anchor residues but from neighboring positions, which may serve to expand optimized protein core regions.

  6. Finding local communities in protein networks.

    Science.gov (United States)

    Voevodski, Konstantin; Teng, Shang-Hua; Xia, Yu

    2009-09-18

    Protein-protein interactions (PPIs) play fundamental roles in nearly all biological processes, and provide major insights into the inner workings of cells. A vast amount of PPI data for various organisms is available from BioGRID and other sources. The identification of communities in PPI networks is of great interest because they often reveal previously unknown functional ties between proteins. A large number of global clustering algorithms have been applied to protein networks, where the entire network is partitioned into clusters. Here we take a different approach by looking for local communities in PPI networks. We develop a tool, named Local Protein Community Finder, which quickly finds a community close to a queried protein in any network available from BioGRID or specified by the user. Our tool uses two new local clustering algorithms Nibble and PageRank-Nibble, which look for a good cluster among the most popular destinations of a short random walk from the queried vertex. The quality of a cluster is determined by proportion of outgoing edges, known as conductance, which is a relative measure particularly useful in undersampled networks. We show that the two local clustering algorithms find communities that not only form excellent clusters, but are also likely to be biologically relevant functional components. We compare the performance of Nibble and PageRank-Nibble to other popular and effective graph partitioning algorithms, and show that they find better clusters in the graph. Moreover, Nibble and PageRank-Nibble find communities that are more functionally coherent. The Local Protein Community Finder, accessible at http://xialab.bu.edu/resources/lpcf, allows the user to quickly find a high-quality community close to a queried protein in any network available from BioGRID or specified by the user. We show that the communities found by our tool form good clusters and are functionally coherent, making our application useful for biologists who wish to

  7. Finding local communities in protein networks

    Directory of Open Access Journals (Sweden)

    Teng Shang-Hua

    2009-09-01

    Full Text Available Abstract Background Protein-protein interactions (PPIs play fundamental roles in nearly all biological processes, and provide major insights into the inner workings of cells. A vast amount of PPI data for various organisms is available from BioGRID and other sources. The identification of communities in PPI networks is of great interest because they often reveal previously unknown functional ties between proteins. A large number of global clustering algorithms have been applied to protein networks, where the entire network is partitioned into clusters. Here we take a different approach by looking for local communities in PPI networks. Results We develop a tool, named Local Protein Community Finder, which quickly finds a community close to a queried protein in any network available from BioGRID or specified by the user. Our tool uses two new local clustering algorithms Nibble and PageRank-Nibble, which look for a good cluster among the most popular destinations of a short random walk from the queried vertex. The quality of a cluster is determined by proportion of outgoing edges, known as conductance, which is a relative measure particularly useful in undersampled networks. We show that the two local clustering algorithms find communities that not only form excellent clusters, but are also likely to be biologically relevant functional components. We compare the performance of Nibble and PageRank-Nibble to other popular and effective graph partitioning algorithms, and show that they find better clusters in the graph. Moreover, Nibble and PageRank-Nibble find communities that are more functionally coherent. Conclusion The Local Protein Community Finder, accessible at http://xialab.bu.edu/resources/lpcf, allows the user to quickly find a high-quality community close to a queried protein in any network available from BioGRID or specified by the user. We show that the communities found by our tool form good clusters and are functionally coherent

  8. [Nitrogen oxide is involved in the regulation of the Fe-S cluster assembly in proteins and the formation of biofilms by Escherichia coli cells].

    Science.gov (United States)

    Vasil'eva, S V; Streltsova, D A; Starostina, I A; Sanina, N A

    2013-01-01

    The functions of nitrogen oxide (NO) in the regulation of the reversible processes of Fe-S cluster assembly in proteins and the formation of Escherichia coli biofilms have been investigated. S-nitrosoglutathione (GSNO) and crystalline nitrosyl complexes of iron with sulfur-containing aliphatic ligands cisaconite (CisA) and penaconite have been used as NO donors for the first time. Wild-type E. coli cells of the strain MC4100, mutants deltaiscA and deltasufA, and the double paralog mutant deltaiscA/sufA with deletions in the alternative pathways of Fe2+ supply for cluster assembly (all derived from the above-named strain) were used in this study. Plankton growth of bacterial cultures, the mass of mature biofilms, and the expression of the SoxRS[2Fe-2S] regulon have been investigated and shown to depend on strain genotype, the process of Fe-S cluster assembly in iron-sulfur proteins, NO donor structure, and the presence of Fe2+ chelator ferene in the incubation medium. The antibiotic ciprofloxacine (CF) was used as an inhibitor of E. coli biofilm formation in the positive control. NO donors regulating Fe-S cluster assembly in E. coli have been shown to control plankton growth of the cultures and the process of mature biofilm formation; toxic doses of NO caused a dramatic (3- to 4-fold) stimulation of cell entry into biofilms as a response to nitrosative stress; NO donors CisA and GSNO in physiological concentrations suppressed the formation of mature biofilms, and the activity of these compounds was comparable to that of CE Regulation of both Fe-S cluster assembly in iron-sulfur proteins and biofilm formation by NO is indicative of the connection between these processes in E. coli.

  9. Clustering structures of large proteins using multifractal analyses based on a 6-letter model and hydrophobicity scale of amino acids

    International Nuclear Information System (INIS)

    Yang Jianyi; Yu Zuguo; Anh, Vo

    2009-01-01

    The Schneider and Wrede hydrophobicity scale of amino acids and the 6-letter model of protein are proposed to study the relationship between the primary structure and the secondary structural classification of proteins. Two kinds of multifractal analyses are performed on the two measures obtained from these two kinds of data on large proteins. Nine parameters from the multifractal analyses are considered to construct the parameter spaces. Each protein is represented by one point in these spaces. A procedure is proposed to separate large proteins in the α, β, α + β and α/β structural classes in these parameter spaces. Fisher's linear discriminant algorithm is used to assess our clustering accuracy on the 49 selected large proteins. Numerical results indicate that the discriminant accuracies are satisfactory. In particular, they reach 100.00% and 84.21% in separating the α proteins from the {β, α + β, α/β} proteins in a parameter space; 92.86% and 86.96% in separating the β proteins from the {α + β, α/β} proteins in another parameter space; 91.67% and 83.33% in separating the α/β proteins from the α + β proteins in the last parameter space.

  10. Zinc and the iron donor frataxin regulate oligomerization of the scaffold protein to form new Fe-S cluster assembly centers.

    Science.gov (United States)

    Galeano, B K; Ranatunga, W; Gakh, O; Smith, D Y; Thompson, J R; Isaya, G

    2017-06-21

    Early studies of the bacterial Fe-S cluster assembly system provided structural details for how the scaffold protein and the cysteine desulfurase interact. This work and additional work on the yeast and human systems elucidated a conserved mechanism for sulfur donation but did not provide any conclusive insights into the mechanism for iron delivery from the iron donor, frataxin, to the scaffold. We previously showed that oligomerization is a mechanism by which yeast frataxin (Yfh1) can promote assembly of the core machinery for Fe-S cluster synthesis both in vitro and in cells, in such a manner that the scaffold protein, Isu1, can bind to Yfh1 independent of the presence of the cysteine desulfurase, Nfs1. Here, in the absence of Yfh1, Isu1 was found to exist in two forms, one mostly monomeric with limited tendency to dimerize, and one with a strong propensity to oligomerize. Whereas the monomeric form is stabilized by zinc, the loss of zinc promotes formation of dimer and higher order oligomers. However, upon binding to oligomeric Yfh1, both forms take on a similar symmetrical trimeric configuration that places the Fe-S cluster coordinating residues of Isu1 in close proximity of iron-binding residues of Yfh1. This configuration is suitable for docking of Nfs1 in a manner that provides a structural context for coordinate iron and sulfur donation to the scaffold. Moreover, distinct structural features suggest that in physiological conditions the zinc-regulated abundance of monomeric vs. oligomeric Isu1 yields [Yfh1]·[Isu1] complexes with different Isu1 configurations that afford unique functional properties for Fe-S cluster assembly and delivery.

  11. Comparing Residue Clusters from Thermophilic and Mesophilic Enzymes Reveals Adaptive Mechanisms.

    Directory of Open Access Journals (Sweden)

    Deanne W Sammond

    Full Text Available Understanding how proteins adapt to function at high temperatures is important for deciphering the energetics that dictate protein stability and folding. While multiple principles important for thermostability have been identified, we lack a unified understanding of how internal protein structural and chemical environment determine qualitative or quantitative impact of evolutionary mutations. In this work we compare equivalent clusters of spatially neighboring residues between paired thermophilic and mesophilic homologues to evaluate adaptations under the selective pressure of high temperature. We find the residue clusters in thermophilic enzymes generally display improved atomic packing compared to mesophilic enzymes, in agreement with previous research. Unlike residue clusters from mesophilic enzymes, however, thermophilic residue clusters do not have significant cavities. In addition, anchor residues found in many clusters are highly conserved with respect to atomic packing between both thermophilic and mesophilic enzymes. Thus the improvements in atomic packing observed in thermophilic homologues are not derived from these anchor residues but from neighboring positions, which may serve to expand optimized protein core regions.

  12. A VTVH MCD and EPR Spectroscopic Study of the Maturation of the "Second" Nitrogenase P-Cluster.

    Science.gov (United States)

    Rupnik, Kresimir; Lee, Chi Chung; Hu, Yilin; Ribbe, Markus W; Hales, Brian J

    2018-04-16

    The P-cluster of the nitrogenase MoFe protein is a [ Fe 8 S 7 ] cluster that mediates efficient transfer of electrons to the active site for substrate reduction. Arguably the most complex homometallic FeS cluster found in nature, the biosynthetic mechanism of the P-cluster is of considerable theoretical and synthetic interest to chemists and biochemists alike. Previous studies have revealed a biphasic assembly mechanism of the two P-clusters in the MoFe protein upon incubation with Fe protein and ATP, in which the first P-cluster is formed through fast fusion of a pair of [ Fe 4 S 4 ] + clusters within 5 min and the second P-cluster is formed through slow fusion of the second pair of [ Fe 4 S 4 ] + clusters in a period of 2 h. Here we report a VTVH MCD and EPR spectroscopic study of the biosynthesis of the slow-forming, second P-cluster within the MoFe protein. Our results show that the first major step in the formation of the second P-cluster is the conversion of one of the precursor [ Fe 4 S 4 ] + clusters into the integer spin cluster [ Fe 4 S 3-4 ] α , a process aided by the assembly protein NifZ, whereas the second major biosynthetic step appears to be the formation of a diamagnetic cluster with a possible structure of [ Fe 8 S 7-8 ] β , which is eventually converted into the P-cluster.

  13. Site-Specific Biomolecule Labeling with Gold Clusters

    Science.gov (United States)

    Ackerson, Christopher J.; Powell, Richard D.; Hainfeld, James F.

    2013-01-01

    Site-specific labeling of biomolecules in vitro with gold clusters can enhance the information content of electron cryomicroscopy experiments. This chapter provides a practical overview of well-established techniques for forming biomolecule/gold cluster conjugates. Three bioconjugation chemistries are covered: Linker-mediated bioconjugation, direct gold–biomolecule bonding, and coordination-mediated bonding of nickel(II) nitrilotriacetic acid (NTA)-derivatized gold clusters to polyhistidine (His)-tagged proteins. PMID:20887859

  14. Noise reduction in protein-protein interaction graphs by the implementation of a novel weighting scheme

    Directory of Open Access Journals (Sweden)

    Moschopoulos Charalampos

    2011-06-01

    Full Text Available Abstract Background Recent technological advances applied to biology such as yeast-two-hybrid, phage display and mass spectrometry have enabled us to create a detailed map of protein interaction networks. These interaction networks represent a rich, yet noisy, source of data that could be used to extract meaningful information, such as protein complexes. Several interaction network weighting schemes have been proposed so far in the literature in order to eliminate the noise inherent in interactome data. In this paper, we propose a novel weighting scheme and apply it to the S. cerevisiae interactome. Complex prediction rates are improved by up to 39%, depending on the clustering algorithm applied. Results We adopt a two step procedure. During the first step, by applying both novel and well established protein-protein interaction (PPI weighting methods, weights are introduced to the original interactome graph based on the confidence level that a given interaction is a true-positive one. The second step applies clustering using established algorithms in the field of graph theory, as well as two variations of Spectral clustering. The clustered interactome networks are also cross-validated against the confirmed protein complexes present in the MIPS database. Conclusions The results of our experimental work demonstrate that interactome graph weighting methods clearly improve the clustering results of several clustering algorithms. Moreover, our proposed weighting scheme outperforms other approaches of PPI graph weighting.

  15. True Molecular Scale Visualization of Variable Clustering Properties of Ryanodine Receptors

    Directory of Open Access Journals (Sweden)

    Isuru Jayasinghe

    2018-01-01

    Full Text Available Summary: Signaling nanodomains rely on spatial organization of proteins to allow controlled intracellular signaling. Examples include calcium release sites of cardiomyocytes where ryanodine receptors (RyRs are clustered with their molecular partners. Localization microscopy has been crucial to visualizing these nanodomains but has been limited by brightness of markers, restricting the resolution and quantification of individual proteins clustered within. Harnessing the remarkable localization precision of DNA-PAINT (<10 nm, we visualized punctate labeling within these nanodomains, confirmed as single RyRs. RyR positions within sub-plasmalemmal nanodomains revealed how they are organized randomly into irregular clustering patterns leaving significant gaps occupied by accessory or regulatory proteins. RyR-inhibiting protein junctophilin-2 appeared highly concentrated adjacent to RyR channels. Analyzing these molecular maps showed significant variations in the co-clustering stoichiometry between junctophilin-2 and RyR, even between nearby nanodomains. This constitutes an additional level of complexity in RyR arrangement and regulation of calcium signaling, intrinsically built into the nanodomains. : Jayasinghe et al. resolve the distribution of single ryanodine receptors (RyRs within intracellular signaling domains in cardiac myocytes with DNA-PAINT, a super-resolution microscopy approach. Individual RyRs are resolved within irregular cluster arrays. Quantitative imaging reveals significant variation in the co-clustering stoichiometry between RyRs and the regulatory protein junctophilin-2. Keywords: nanodomains, DNA-PAINT, single-molecule localization microscopy, ryanodine receptor, super-resolution imaging, junctophilin, heart

  16. Characterization of Glutaredoxin Fe-S Cluster-Binding Interactions Using Circular Dichroism Spectroscopy.

    Science.gov (United States)

    Albetel, Angela-Nadia; Outten, Caryn E

    2018-01-01

    Monothiol glutaredoxins (Grxs) with a conserved Cys-Gly-Phe-Ser (CGFS) active site are iron-sulfur (Fe-S) cluster-binding proteins that interact with a variety of partner proteins and perform crucial roles in iron metabolism including Fe-S cluster transfer, Fe-S cluster repair, and iron signaling. Various analytical and spectroscopic methods are currently being used to monitor and characterize glutaredoxin Fe-S cluster-dependent interactions at the molecular level. The electronic, magnetic, and vibrational properties of the protein-bound Fe-S cluster provide a convenient handle to probe the structure, function, and coordination chemistry of Grx complexes. However, some limitations arise from sample preparation requirements, complexity of individual techniques, or the necessity for combining multiple methods in order to achieve a complete investigation. In this chapter, we focus on the use of UV-visible circular dichroism spectroscopy as a fast and simple initial approach for investigating glutaredoxin Fe-S cluster-dependent interactions. © 2018 Elsevier Inc. All rights reserved.

  17. AutoSOME: a clustering method for identifying gene expression modules without prior knowledge of cluster number

    Directory of Open Access Journals (Sweden)

    Cooper James B

    2010-03-01

    Full Text Available Abstract Background Clustering the information content of large high-dimensional gene expression datasets has widespread application in "omics" biology. Unfortunately, the underlying structure of these natural datasets is often fuzzy, and the computational identification of data clusters generally requires knowledge about cluster number and geometry. Results We integrated strategies from machine learning, cartography, and graph theory into a new informatics method for automatically clustering self-organizing map ensembles of high-dimensional data. Our new method, called AutoSOME, readily identifies discrete and fuzzy data clusters without prior knowledge of cluster number or structure in diverse datasets including whole genome microarray data. Visualization of AutoSOME output using network diagrams and differential heat maps reveals unexpected variation among well-characterized cancer cell lines. Co-expression analysis of data from human embryonic and induced pluripotent stem cells using AutoSOME identifies >3400 up-regulated genes associated with pluripotency, and indicates that a recently identified protein-protein interaction network characterizing pluripotency was underestimated by a factor of four. Conclusions By effectively extracting important information from high-dimensional microarray data without prior knowledge or the need for data filtration, AutoSOME can yield systems-level insights from whole genome microarray expression studies. Due to its generality, this new method should also have practical utility for a variety of data-intensive applications, including the results of deep sequencing experiments. AutoSOME is available for download at http://jimcooperlab.mcdb.ucsb.edu/autosome.

  18. Analysis of hepatocellular carcinoma and metastatic hepatic carcinoma via functional modules in a protein-protein interaction network

    Directory of Open Access Journals (Sweden)

    Jun Pan

    2014-01-01

    Full Text Available Introduction: This study aims to identify protein clusters with potential functional relevance in the pathogenesis of hepatocellular carcinoma (HCC and metastatic hepatic carcinoma using network analysis. Materials and Methods: We used human protein interaction data to build a protein-protein interaction network with Cytoscape and then derived functional clusters using MCODE. Combining the gene expression profiles, we calculated the functional scores for the clusters and selected statistically significant clusters. Meanwhile, Gene Ontology was used to assess the functionality of these clusters. Finally, a support vector machine was trained on the gold standard data sets. Results: The differentially expressed genes of HCC were mainly involved in metabolic and signaling processes. We acquired 13 significant modules from the gene expression profiles. The area under the curve value based on the differentially expressed modules were 98.31%, which outweighed the classification with DEGs. Conclusions: Differentially expressed modules are valuable to screen biomarkers combined with functional modules.

  19. Phylogenetic continuum indicates "galaxies" in the protein universe: preliminary results on the natural group structures of proteins.

    Science.gov (United States)

    Ladunga, I

    1992-04-01

    The markedly nonuniform, even systematic distribution of sequences in the protein "universe" has been analyzed by methods of protein taxonomy. Mapping of the natural hierarchical system of proteins has revealed some dense cores, i.e., well-defined clusterings of proteins that seem to be natural structural groupings, possibly seeds for a future protein taxonomy. The aim was not to force proteins into more or less man-made categories by discriminant analysis, but to find structurally similar groups, possibly of common evolutionary origin. Single-valued distance measures between pairs of superfamilies from the Protein Identification Resource were defined by two chi 2-like methods on tripeptide frequencies and the variable-length subsequence identity method derived from dot-matrix comparisons. Distance matrices were processed by several methods of cluster analysis to detect phylogenetic continuum between highly divergent proteins. Only well-defined clusters characterized by relatively unique structural, intracellular environmental, organismal, and functional attribute states were selected as major protein groups, including subsets of viral and Escherichia coli proteins, hormones, inhibitors, plant, ribosomal, serum and structural proteins, amino acid synthases, and clusters dominated by certain oxidoreductases and apolar and DNA-associated enzymes. The limited repertoire of functional patterns due to small genome size, the high rate of recombination, specific features of the bacterial membranes, or of the virus cycle canalize certain proteins of viruses and Gram-negative bacteria, respectively, to organismal groups.

  20. Genome cluster database. A sequence family analysis platform for Arabidopsis and rice.

    Science.gov (United States)

    Horan, Kevin; Lauricha, Josh; Bailey-Serres, Julia; Raikhel, Natasha; Girke, Thomas

    2005-05-01

    The genome-wide protein sequences from Arabidopsis (Arabidopsis thaliana) and rice (Oryza sativa) spp. japonica were clustered into families using sequence similarity and domain-based clustering. The two fundamentally different methods resulted in separate cluster sets with complementary properties to compensate the limitations for accurate family analysis. Functional names for the identified families were assigned with an efficient computational approach that uses the description of the most common molecular function gene ontology node within each cluster. Subsequently, multiple alignments and phylogenetic trees were calculated for the assembled families. All clustering results and their underlying sequences were organized in the Web-accessible Genome Cluster Database (http://bioinfo.ucr.edu/projects/GCD) with rich interactive and user-friendly sequence family mining tools to facilitate the analysis of any given family of interest for the plant science community. An automated clustering pipeline ensures current information for future updates in the annotations of the two genomes and clustering improvements. The analysis allowed the first systematic identification of family and singlet proteins present in both organisms as well as those restricted to one of them. In addition, the established Web resources for mining these data provide a road map for future studies of the composition and structure of protein families between the two species.

  1. Crystal structure of Mycobacterium tuberculosis O6-methylguanine-DNA methyltransferase protein clusters assembled on to damaged DNA.

    Science.gov (United States)

    Miggiano, Riccardo; Perugino, Giuseppe; Ciaramella, Maria; Serpe, Mario; Rejman, Dominik; Páv, Ondřej; Pohl, Radek; Garavaglia, Silvia; Lahiri, Samarpita; Rizzi, Menico; Rossi, Franca

    2016-01-15

    Mycobacterium tuberculosis O(6)-methylguanine-DNA methyltransferase (MtOGT) contributes to protect the bacterial GC-rich genome against the pro-mutagenic potential of O(6)-methylated guanine in DNA. Several strains of M. tuberculosis found worldwide encode a point-mutated O(6)-methylguanine-DNA methyltransferase (OGT) variant (MtOGT-R37L), which displays an arginine-to-leucine substitution at position 37 of the poorly functionally characterized N-terminal domain of the protein. Although the impact of this mutation on the MtOGT activity has not yet been proved in vivo, we previously demonstrated that a recombinant MtOGT-R37L variant performs a suboptimal alkylated-DNA repair in vitro, suggesting a direct role for the Arg(37)-bearing region in catalysis. The crystal structure of MtOGT complexed with modified DNA solved in the present study reveals details of the protein-protein and protein-DNA interactions occurring during alkylated-DNA binding, and the protein capability also to host unmodified bases inside the active site, in a fully extrahelical conformation. Our data provide the first experimental picture at the atomic level of a possible mode of assembling three adjacent MtOGT monomers on the same monoalkylated dsDNA molecule, and disclose the conformational flexibility of discrete regions of MtOGT, including the Arg(37)-bearing random coil. This peculiar structural plasticity of MtOGT could be instrumental to proper protein clustering at damaged DNA sites, as well as to protein-DNA complexes disassembling on repair. © 2016 Authors; published by Portland Press Limited.

  2. Clustering of near clusters versus cluster compactness

    International Nuclear Information System (INIS)

    Yu Gao; Yipeng Jing

    1989-01-01

    The clustering properties of near Zwicky clusters are studied by using the two-point angular correlation function. The angular correlation functions for compact and medium compact clusters, for open clusters, and for all near Zwicky clusters are estimated. The results show much stronger clustering for compact and medium compact clusters than for open clusters, and that open clusters have nearly the same clustering strength as galaxies. A detailed study of the compactness-dependence of correlation function strength is worth investigating. (author)

  3. Towards understanding the first genome sequence of a crenarchaeon by genome annotation using clusters of orthologous groups of proteins (COGs).

    Science.gov (United States)

    Natale, D A; Shankavaram, U T; Galperin, M Y; Wolf, Y I; Aravind, L; Koonin, E V

    2000-01-01

    Standard archival sequence databases have not been designed as tools for genome annotation and are far from being optimal for this purpose. We used the database of Clusters of Orthologous Groups of proteins (COGs) to reannotate the genomes of two archaea, Aeropyrum pernix, the first member of the Crenarchaea to be sequenced, and Pyrococcus abyssi. A. pernix and P. abyssi proteins were assigned to COGs using the COGNITOR program; the results were verified on a case-by-case basis and augmented by additional database searches using the PSI-BLAST and TBLASTN programs. Functions were predicted for over 300 proteins from A. pernix, which could not be assigned a function using conventional methods with a conservative sequence similarity threshold, an approximately 50% increase compared to the original annotation. A. pernix shares most of the conserved core of proteins that were previously identified in the Euryarchaeota. Cluster analysis or distance matrix tree construction based on the co-occurrence of genomes in COGs showed that A. pernix forms a distinct group within the archaea, although grouping with the two species of Pyrococci, indicative of similar repertoires of conserved genes, was observed. No indication of a specific relationship between Crenarchaeota and eukaryotes was obtained in these analyses. Several proteins that are conserved in Euryarchaeota and most bacteria are unexpectedly missing in A. pernix, including the entire set of de novo purine biosynthesis enzymes, the GTPase FtsZ (a key component of the bacterial and euryarchaeal cell-division machinery), and the tRNA-specific pseudouridine synthase, previously considered universal. A. pernix is represented in 48 COGs that do not contain any euryarchaeal members. Many of these proteins are TCA cycle and electron transport chain enzymes, reflecting the aerobic lifestyle of A. pernix. Special-purpose databases organized on the basis of phylogenetic analysis and carefully curated with respect to known and

  4. Cluster-cluster clustering

    International Nuclear Information System (INIS)

    Barnes, J.; Dekel, A.; Efstathiou, G.; Frenk, C.S.; Yale Univ., New Haven, CT; California Univ., Santa Barbara; Cambridge Univ., England; Sussex Univ., Brighton, England)

    1985-01-01

    The cluster correlation function xi sub c(r) is compared with the particle correlation function, xi(r) in cosmological N-body simulations with a wide range of initial conditions. The experiments include scale-free initial conditions, pancake models with a coherence length in the initial density field, and hybrid models. Three N-body techniques and two cluster-finding algorithms are used. In scale-free models with white noise initial conditions, xi sub c and xi are essentially identical. In scale-free models with more power on large scales, it is found that the amplitude of xi sub c increases with cluster richness; in this case the clusters give a biased estimate of the particle correlations. In the pancake and hybrid models (with n = 0 or 1), xi sub c is steeper than xi, but the cluster correlation length exceeds that of the points by less than a factor of 2, independent of cluster richness. Thus the high amplitude of xi sub c found in studies of rich clusters of galaxies is inconsistent with white noise and pancake models and may indicate a primordial fluctuation spectrum with substantial power on large scales. 30 references

  5. The non-random clustering of non-synonymous substitutions and its relationship to evolutionary rate

    Directory of Open Access Journals (Sweden)

    Stone Eric A

    2011-08-01

    Full Text Available Abstract Background Protein sequences are subject to a mosaic of constraint. Changes to functional domains and buried residues, for example, are more apt to disrupt protein structure and function than are changes to residues participating in loops or exposed to solvent. Regions of constraint on the tertiary structure of a protein often result in loose segmentation of its primary structure into stretches of slowly- and rapidly-evolving amino acids. This clustering can be exploited, and existing methods have done so by relying on local sequence conservation as a signature of selection to help identify functionally important regions within proteins. We invert this paradigm by leveraging the regional nature of protein structure and function to both illuminate and make use of genome-wide patterns of local sequence conservation. Results Our hypothesis is that the regional nature of structural and functional constraints will assert a positive autocorrelation on the evolutionary rates of neighboring sites, which, in a pairwise comparison of orthologous proteins, will manifest itself as the clustering of non-synonymous changes across the amino acid sequence. We introduce a dispersion ratio statistic to test this and related hypotheses. Using genome-wide interspecific comparisons of orthologous protein pairs, we reveal a strong log-linear relationship between the degree of clustering and the intensity of constraint. We further demonstrate how this relationship varies with the evolutionary distance between the species being compared. We provide some evidence that proteins with a history of positive selection deviate from genome-wide trends. Conclusions We find a significant association between the evolutionary rate of a protein and the degree to which non-synonymous changes cluster along its primary sequence. We show that clustering is a non-redundant predictor of evolutionary rate, and we speculate that conflicting signals of clustering and constraint may

  6. Proteomic properties reveal phyloecological clusters of Archaea.

    Directory of Open Access Journals (Sweden)

    Nela Nikolic

    Full Text Available In this study, we propose a novel way to describe the variety of environmental adaptations of Archaea. We have clustered 57 Archaea by using a non-redundant set of proteomic features, and verified that the clusters correspond to environmental adaptations to the archaeal habitats. The first cluster consists dominantly of hyperthermophiles and hyperthermoacidophilic aerobes. The second cluster joins together halophilic and extremely halophilic Archaea, while the third cluster contains mesophilic (mostly methanogenic Archaea together with thermoacidophiles. The non-redundant subset of proteomic features was found to consist of five features: the ratio of charged residues to uncharged, average protein size, normalized frequency of beta-sheet, normalized frequency of extended structure and number of hydrogen bond donors. We propose this clustering to be termed phyloecological clustering. This approach could give additional insights into relationships among archaeal species that may be hidden by sole phylogenetic analysis.

  7. Chinese mitten crab (Eriocheir sinensis) iron-sulphur cluster assembly protein 2 (EsIscA2) is differentially regulated after immune and oxidative stress challenges.

    Science.gov (United States)

    Zhang, Peng; Liu, Yu; Wang, Min; Dong, Miren; Liu, Zhaoqun; Jia, Zhihao; Wang, Weilin; Zhang, Anguo; Wang, Lingling; Song, Linsheng

    2018-07-01

    Iron-sulphur clusters (ISCs), one of the oldest and most versatile cofactors of proteins, are involved in catalysis reactions, electron transport reactions, regulation processes as well as sensing of ambient conditions. Iron-sulphur cluster assembly protein (IscA) is a scaffold protein member of ISC formation system, which plays a significant role in the assembly and maturation process of ISC proteins. In the present study, the cDNA sequence of iron-sulphur cluster assembly protein 2 (designated as EsIscA2) was cloned from Eriocheir sinensis. The open reading frame (ORF) of EsIscA2 was of 507 bp, encoding a peptide of 168 amino acids with a typically conserved Fe-S domain. A tetrameric form was predicated by the SWISS-MODEL prediction algorithm, and three conserved cysteine residues (Cys-93, Cys-158, Cys-160) from each IscA monomer were predicted to form a 'cysteine pocket'. The deduced amino acid sequence of EsIscA2 shared over 50% similarity with that of other IscAs. EsIscA2 was clustered with IscA2 proteins from invertebrates and vertebrates, indicating that the protein was highly conservative in the evolution. rEsIscA2 exhibited a high iron binding affinity in the concentration ranging from 2 to 200 μM. EsIscA2 transcripts were detected in all the tested tissues including gonad, hemocytes, gill, muscle, heart, hepatopancreas and eyestalk, and EsIscA2 protein was detected in the mitochondria of hemocytes. The highest mRNA expression level of EsIscA2 was detected in muscle and hepatopancreas, which was about 34.66-fold (p < 0.05) and 27.07-fold (p < 0.05) of that in hemocytes, respectively. After Aeromonas hydrophila and lipopolysaccharide (LPS) stimulations, the mRNA expression of EsIscA2 in hemocytes was down-regulated and reached the lowest level at 24 h (0.31-fold, p < 0.05) and 48 h (0.29-fold, p < 0.05) compared to control group, respectively. And the expression of EsIscA2 mRNA in hepatopancreas was repressed from 6 h to 48 h post

  8. Interaction of proteins with ionic liquid, alcohol and DMSO and in situ generation of gold nano-clusters in a cell.

    Science.gov (United States)

    Nandi, Somen; Parui, Sridip; Halder, Ritaban; Jana, Biman; Bhattacharyya, Kankan

    2018-06-01

    In this review, we give a brief overview on how the interaction of proteins with ionic liquids, alcohols and dimethyl sulfoxide (DMSO) influences the stability, conformational dynamics and function of proteins/enzymes. We present experimental results obtained from fluorescence correlation spectroscopy on the effect of ionic liquid or alcohol or DMSO on the size (more precisely, the diffusion constant) and conformational dynamics of lysozyme, cytochrome c and human serum albumin in aqueous solution. The interaction of ionic liquid with biomolecules (e.g. protein, DNA etc.) has emerged as a current frontier. We demonstrate that ionic liquids are excellent stabilizers of protein and DNA and, in some cases, cause refolding of a protein already denatured by chemical denaturing agents. We show that in ethanol-water binary mixture, proteins undergo non-monotonic changes in size and dynamics with increasing ethanol content. We also discuss the effect of water-DMSO mixture on the stability of proteins. We demonstrate how large-scale molecular dynamics simulations have revealed the molecular origin of this observed phenomenon and provide a microscopic picture of the immediate environment of the biomolecules. Finally, we describe how favorable interactions of ionic liquids may be utilized for in situ generation of fluorescent gold nano-clusters for imaging a live cell.

  9. Network Analysis Tools: from biological networks to clusters and pathways.

    Science.gov (United States)

    Brohée, Sylvain; Faust, Karoline; Lima-Mendez, Gipsi; Vanderstocken, Gilles; van Helden, Jacques

    2008-01-01

    Network Analysis Tools (NeAT) is a suite of computer tools that integrate various algorithms for the analysis of biological networks: comparison between graphs, between clusters, or between graphs and clusters; network randomization; analysis of degree distribution; network-based clustering and path finding. The tools are interconnected to enable a stepwise analysis of the network through a complete analytical workflow. In this protocol, we present a typical case of utilization, where the tasks above are combined to decipher a protein-protein interaction network retrieved from the STRING database. The results returned by NeAT are typically subnetworks, networks enriched with additional information (i.e., clusters or paths) or tables displaying statistics. Typical networks comprising several thousands of nodes and arcs can be analyzed within a few minutes. The complete protocol can be read and executed in approximately 1 h.

  10. High-temperature protein G is essential for activity of the Escherichia coli clustered regularly interspaced short palindromic repeats (CRISPR)/Cas system.

    Science.gov (United States)

    Yosef, Ido; Goren, Moran G; Kiro, Ruth; Edgar, Rotem; Qimron, Udi

    2011-12-13

    Prokaryotic DNA arrays arranged as clustered regularly interspaced short palindromic repeats (CRISPR), along with their associated proteins, provide prokaryotes with adaptive immunity by RNA-mediated targeting of alien DNA or RNA matching the sequences between the repeats. Here, we present a thorough screening system for the identification of bacterial proteins participating in immunity conferred by the Escherichia coli CRISPR system. We describe the identification of one such protein, high-temperature protein G (HtpG), a homolog of the eukaryotic chaperone heat-shock protein 90. We demonstrate that in the absence of htpG, the E. coli CRISPR system loses its suicidal activity against λ prophage and its ability to provide immunity from lysogenization. Transcomplementation of htpG restores CRISPR activity. We further show that inactivity of the CRISPR system attributable to htpG deficiency can be suppressed by expression of Cas3, a protein that is essential for its activity. Accordingly, we also find that the steady-state level of overexpressed Cas3 is significantly enhanced following HtpG expression. We conclude that HtpG is a newly identified positive modulator of the CRISPR system that is essential for maintaining functional levels of Cas3.

  11. Mössbauer spectroscopy and DFT calculations on all protonation states of the 2Fe-2S cluster of the Rieske protein

    Science.gov (United States)

    Müller, C. S.; Auerbach, H.; Stegmaier, K.; Wolny, J. A.; Schünemann, V.; Pierik, A. J.

    2017-11-01

    The Thermus thermophilus Rieske protein ( TtRP) contains a 2Fe-2S cluster with one iron (Fe-Cys) coordinated by four sulfur atoms (2xS2- and 2xCys) and one iron (Fe-His) by two sulfur and two nitrogen atoms (2xS2-, His134 and His154). Here, the protein is investigated at three pH values (6.0, 8.5 and 10.5) in order to elucidate the protonation states of the His-ligands. Examination of the effect of protonation on the electronic structure of the cluster via Mössbauer spectroscopy gives a deeper understanding of the coupling of electron transfer to the protonation state of the His-ligands. Two components (1 referring to Fe-Cys and 2 to Fe-His) with parameters typical for a diamagnetic [2Fe-2S]2+ cluster are detected. The Mössbauer parameters and the protonation state clearly correlate: while δ remains almost pH-independent with δ 1 (pH6.0) = 0.23 (± 0.01) mms- 1 and δ 1 (pH10.5) = 0.24 (± 0.01) mms- 1 for Fe-Cys, it decreases for Fe-His from δ 2 (pH6.0) = 0.34 (± 0.01) mms- 1 to δ 2 (pH10.5) = 0.28 (± 0.01) mms- 1. Δ E Q changes from Δ E Q1 (pH6.0) = 0.57 (± 0.01) mms- 1 to Δ E Q1 (pH10.5) = 0.45 (± 0.01) mms- 1 and from Δ E Q2 (pH6.0) = 1.05 (± 0.01) mms- 1 to Δ E Q2 (pH10.5) = 0.71 (± 0.01) mms- 1. Density functional theory (DFT)-calculations based on the crystal structure (pdb 1NYK) (Hunsicker-Wang et al. Biochemistry 42, 7303, 2003) have been performed for the Rieske-cluster with different His-ligand protonation states, reproducing the experimentally observed trend.

  12. Statistical survey of the buried waters in the Protein Data Bank.

    Science.gov (United States)

    Carugo, Oliviero

    2016-01-01

    The structures of buried water molecules were studied in an ensemble of high-quality and non-redundant protein crystal structures. Buried water molecules were clustered and classified in lake-like clusters, which are completely isolated from the bulk solvent, and bay-like clusters, which are in contact with the bulk solvent through a surface water molecule. Buried water molecules are extremely common: lake-like clusters are found in 89 % of the protein crystal structures and bay-like clusters in 93 %. Clusters with only one water molecule are much more common than larger clusters. Both cluster types incline to be surrounded by loop residues, and to a minor extent by residues in extended secondary structure. Helical residues on the contrary do not tend to surround clusters of buried water molecules. One buried water molecule is found every 30-50 amino acid residues, depending on the secondary structures that are more abundant in the protein. Both main- and side-chain atoms are in contact with buried waters; they form four hydrogen bonds with the first water and 1-1.5 additional hydrogen bond for each additional water in the cluster. Consequently, buried water molecules appear to be firmly packed and rigid like the protein atoms. In this regard, it is remarkable to observe that prolines often surround water molecules buried in the protein interior. Interestingly, clusters of buried water molecules tend to be just beneath the protein surface. Moreover, water molecules tend to form a one-dimensional wire rather than more compact arrangements. This agrees with recent evidence of the mechanisms of solvent exchange between internal cavities and bulk solvent.

  13. Roles of Fe-S proteins: from cofactor synthesis to iron homeostasis to protein synthesis.

    Science.gov (United States)

    Pain, Debkumar; Dancis, Andrew

    2016-06-01

    Fe-S cluster assembly is an essential process for all cells. Impairment of Fe-S cluster assembly creates diseases in diverse and surprising ways. In one scenario, the loss of function of lipoic acid synthase, an enzyme with Fe-S cluster cofactor in mitochondria, impairs activity of various lipoamide-dependent enzymes with drastic consequences for metabolism. In a second scenario, the heme biosynthetic pathway in red cell precursors is specifically targeted, and iron homeostasis is perturbed, but lipoic acid synthesis is unaffected. In a third scenario, tRNA modifications arising from action of the cysteine desulfurase and/or Fe-S cluster proteins are lost, which may lead to impaired protein synthesis. These defects can then result in cancer, neurologic dysfunction or type 2 diabetes. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces

    Directory of Open Access Journals (Sweden)

    Gorin Andrey A

    2008-05-01

    Full Text Available Abstract Background Protein-protein interactions are ubiquitous and essential for all cellular processes. High-resolution X-ray crystallographic structures of protein complexes can reveal the details of their function and provide a basis for many computational and experimental approaches. Differentiation between biological and non-biological contacts and reconstruction of the intact complex is a challenging computational problem. A successful solution can provide additional insights into the fundamental principles of biological recognition and reduce errors in many algorithms and databases utilizing interaction information extracted from the Protein Data Bank (PDB. Results We have developed a method for identifying protein complexes in the PDB X-ray structures by a four step procedure: (1 comprehensively collecting all protein-protein interfaces; (2 clustering similar protein-protein interfaces together; (3 estimating the probability that each cluster is relevant based on a diverse set of properties; and (4 combining these scores for each PDB entry in order to predict the complex structure. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionary conserved protein-protein interactions. These interfaces, as well as the predicted protein complexes, are available from the Protein Interface Server (PInS website (see Availability and requirements section. Conclusion Our method demonstrates an almost two-fold reduction of the annotation error rate as evaluated on a large benchmark set of complexes validated from the literature. We also estimate relative contributions of each interface property to the accurate discrimination of biologically relevant interfaces and discuss possible directions for further improving the prediction method.

  15. Cluster (Viridiplantae) - PGDBj - Ortholog DB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available 0”. This cluster ID is uniquely-assigned by the PGDBj Ortholog Database. Cluster size Number of proteins aff...r About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Cluster (Viridiplantae) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Cluster (Viridiplantae) Data detail Data name Cluster (Viridiplantae) DO...switchLanguage; BLAST Search Image Search Home About Archive Update History Data

  16. Cluster (Cyanobacteria) - PGDBj - Ortholog DB | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available 3090”. This cluster ID is uniquely-assigned by the PGDBj Ortholog Database. Cluster size Number of proteins ...ster About This Database Database Description Download License Update History of This Database Site Policy | Contact Us Cluster (Cyanobacteria) - PGDBj - Ortholog DB | LSDB Archive ... ...List Contact us PGDBj - Ortholog DB Cluster (Cyanobacteria) Data detail Data name Cluster (Cyanobacteria) DO...switchLanguage; BLAST Search Image Search Home About Archive Update History Data

  17. Localization and orientation of heavy-atom cluster compounds in protein crystals using molecular replacement.

    Science.gov (United States)

    Dahms, Sven O; Kuester, Miriam; Streb, Carsten; Roth, Christian; Sträter, Norbert; Than, Manuel E

    2013-02-01

    Heavy-atom clusters (HA clusters) containing a large number of specifically arranged electron-dense scatterers are especially useful for experimental phase determination of large complex structures, weakly diffracting crystals or structures with large unit cells. Often, the determination of the exact orientation of the HA cluster and hence of the individual heavy-atom positions proves to be the critical step in successful phasing and subsequent structure solution. Here, it is demonstrated that molecular replacement (MR) with either anomalous or isomorphous differences is a useful strategy for the correct placement of HA cluster compounds. The polyoxometallate cluster hexasodium α-metatungstate (HMT) was applied in phasing the structure of death receptor 6. Even though the HA cluster is bound in alternate partially occupied orientations and is located at a special position, its correct localization and orientation could be determined at resolutions as low as 4.9 Å. The broad applicability of this approach was demonstrated for five different derivative crystals that included the compounds tantalum tetradecabromide and trisodium phosphotungstate in addition to HMT. The correct placement of the HA cluster depends on the length of the intramolecular vectors chosen for MR, such that both a larger cluster size and the optimal choice of the wavelength used for anomalous data collection strongly affect the outcome.

  18. 25. Steenbock symposium -- Biosynthesis and function of metal clusters for enzymes: Proceedings

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-12-31

    This symposium was held June 10--14, 1997 in Madison, Wisconsin. The purpose of this conference was to provide a multidisciplinary forum for exchange of state-of-the-art information on biochemistry of enzymes that have an affinity for metal clusters. Attention is focused on the following: metal clusters involved in energy conservation and remediation; tungsten, molybdenum, and cobalt-containing enzymes; Fe proteins, and Mo-binding proteins; nickel enzymes; and nitrogenase.

  19. Localization and orientation of heavy-atom cluster compounds in protein crystals using molecular replacement

    International Nuclear Information System (INIS)

    Dahms, Sven O.; Kuester, Miriam; Streb, Carsten; Roth, Christian; Sträter, Norbert; Than, Manuel E.

    2013-01-01

    A new approach is presented that allows the efficient localization and orientation of heavy-atom cluster compounds used in experimental phasing by a molecular replacement procedure. This permits the calculation of meaningful phases up to the highest resolution of the diffraction data. Heavy-atom clusters (HA clusters) containing a large number of specifically arranged electron-dense scatterers are especially useful for experimental phase determination of large complex structures, weakly diffracting crystals or structures with large unit cells. Often, the determination of the exact orientation of the HA cluster and hence of the individual heavy-atom positions proves to be the critical step in successful phasing and subsequent structure solution. Here, it is demonstrated that molecular replacement (MR) with either anomalous or isomorphous differences is a useful strategy for the correct placement of HA cluster compounds. The polyoxometallate cluster hexasodium α-metatungstate (HMT) was applied in phasing the structure of death receptor 6. Even though the HA cluster is bound in alternate partially occupied orientations and is located at a special position, its correct localization and orientation could be determined at resolutions as low as 4.9 Å. The broad applicability of this approach was demonstrated for five different derivative crystals that included the compounds tantalum tetradecabromide and trisodium phosphotungstate in addition to HMT. The correct placement of the HA cluster depends on the length of the intramolecular vectors chosen for MR, such that both a larger cluster size and the optimal choice of the wavelength used for anomalous data collection strongly affect the outcome

  20. Integration of Phenotypic Metadata and Protein Similarity in Archaea Using a Spectral Bipartitioning Approach

    Energy Technology Data Exchange (ETDEWEB)

    Hooper, Sean D.; Anderson, Iain J; Pati, Amrita; Dalevi, Daniel; Mavromatis, Konstantinos; Kyrpides, Nikos C

    2009-01-01

    In order to simplify and meaningfully categorize large sets of protein sequence data, it is commonplace to cluster proteins based on the similarity of those sequences. However, it quickly becomes clear that the sequence flexibility allowed a given protein varies significantly among different protein families. The degree to which sequences are conserved not only differs for each protein family, but also is affected by the phylogenetic divergence of the source organisms. Clustering techniques that use similarity thresholds for protein families do not always allow for these variations and thus cannot be confidently used for applications such as automated annotation and phylogenetic profiling. In this work, we applied a spectral bipartitioning technique to all proteins from 53 archaeal genomes. Comparisons between different taxonomic levels allowed us to study the effects of phylogenetic distances on cluster structure. Likewise, by associating functional annotations and phenotypic metadata with each protein, we could compare our protein similarity clusters with both protein function and associated phenotype. Our clusters can be analyzed graphically and interactively online.

  1. A Proteomic Approach to Investigating Gene Cluster Expression and Secondary Metabolite Functionality in Aspergillus fumigatus

    Science.gov (United States)

    Owens, Rebecca A.; Hammel, Stephen; Sheridan, Kevin J.; Jones, Gary W.; Doyle, Sean

    2014-01-01

    A combined proteomics and metabolomics approach was utilised to advance the identification and characterisation of secondary metabolites in Aspergillus fumigatus. Here, implementation of a shotgun proteomic strategy led to the identification of non-redundant mycelial proteins (n = 414) from A. fumigatus including proteins typically under-represented in 2-D proteome maps: proteins with multiple transmembrane regions, hydrophobic proteins and proteins with extremes of molecular mass and pI. Indirect identification of secondary metabolite cluster expression was also achieved, with proteins (n = 18) from LaeA-regulated clusters detected, including GliT encoded within the gliotoxin biosynthetic cluster. Biochemical analysis then revealed that gliotoxin significantly attenuates H2O2-induced oxidative stress in A. fumigatus (p>0.0001), confirming observations from proteomics data. A complementary 2-D/LC-MS/MS approach further elucidated significantly increased abundance (pproteome and experimental strategies, plus mechanistic data pertaining to gliotoxin functionality in the organism. PMID:25198175

  2. HotRegion: a database of predicted hot spot clusters.

    Science.gov (United States)

    Cukuroglu, Engin; Gursoy, Attila; Keskin, Ozlem

    2012-01-01

    Hot spots are energetically important residues at protein interfaces and they are not randomly distributed across the interface but rather clustered. These clustered hot spots form hot regions. Hot regions are important for the stability of protein complexes, as well as providing specificity to binding sites. We propose a database called HotRegion, which provides the hot region information of the interfaces by using predicted hot spot residues, and structural properties of these interface residues such as pair potentials of interface residues, accessible surface area (ASA) and relative ASA values of interface residues of both monomer and complex forms of proteins. Also, the 3D visualization of the interface and interactions among hot spot residues are provided. HotRegion is accessible at http://prism.ccbb.ku.edu.tr/hotregion.

  3. Investigating the Correspondence Between Transcriptomic and Proteomic Expression Profiles Using Coupled Cluster Models

    International Nuclear Information System (INIS)

    Rogers, Simon; Girolami, Mark; Kolch, Walter; Waters, Katrina M.; Liu, Tao; Thrall, Brian D.; Wiley, H. S.

    2008-01-01

    Modern transcriptomics and proteomics enable us to survey the expression of RNAs and proteins at large scales. While these data are usually generated and analyzed separately, there is an increasing interest in comparing and co-analyzing transcriptome and proteome expression data. A major open question is whether transcriptome and proteome expression is linked and how it is coordinated. Results: Here we have developed a probabilistic clustering model that permits analysis of the links between transcriptomic and proteomic profiles in a sensible and flexible manner. Our coupled mixture model defines a prior probability distribution over the component to which a protein profile should be assigned conditioned on which component the associated mRNA profile belongs to. By providing probabilistic assignments this approach sits between the two extremes of concatenating the data on the assumption that mRNA and protein clusters would have a one-to-one relationship, and independent clustering where the mRNA profile provides no information on the protein profile and vice-versa. We apply this approach to a large dataset of quantitative transcriptomic and proteomic expression data obtained from a human breast epithelial cell line (HMEC) stimulated by epidermal growth factor (EGF) over a series of timepoints corresponding to one cell cycle. The results reveal a complex relationship between transcriptome and proteome with most mRNA clusters linked to at least two protein clusters, and vice versa. A more detailed analysis incorporating information on gene function from the gene ontology database shows that a high correlation of mRNA and protein expression is limited to the components of some molecular machines, such as the ribosome, cell adhesion complexes and the TCP-1 chaperonin involved in protein folding. Conclusions: The dynamic regulation of the transcriptome and proteome in mammalian cells in response to an acute mitogenic stimulus appears largely independent with very little

  4. Structure-sequence based analysis for identification of conserved regions in proteins

    Science.gov (United States)

    Zemla, Adam T; Zhou, Carol E; Lam, Marisa W; Smith, Jason R; Pardes, Elizabeth

    2013-05-28

    Disclosed are computational methods, and associated hardware and software products for scoring conservation in a protein structure based on a computationally identified family or cluster of protein structures. A method of computationally identifying a family or cluster of protein structures in also disclosed herein.

  5. Yeast Interacting Proteins Database: YOL069W, YIL144W [Yeast Interacting Proteins Database

    Lifescience Database Archive (English)

    Full Text Available complex (Ndc80p-Nuf2p-Spc24p-Spc25p); involved in chromosome segregation, spindle checkpoint activity and kinetochore clustering...vity, kinetochore assembly and clustering Rows with this prey as prey (2) Rows with this prey as bait (0) 12...-Nuf2p-Spc24p-Spc25p); involved in chromosome segregation, spindle checkpoint activity and kinetochore clustering...d coiled-coil protein involved in chromosome segregation, spindle checkpoint activity, kinetochore assembly and clustering

  6. Dynamics and molecular determinants of cytoplasmic lipid droplet clustering and dispersion.

    Directory of Open Access Journals (Sweden)

    David J Orlicky

    Full Text Available Perilipin-1 (Plin1, a prominent cytoplasmic lipid droplet (CLD binding phosphoprotein and key physiological regulator of triglyceride storage and lipolysis in adipocytes, is thought to regulate the fragmentation and dispersion of CLD that occurs in response to β-adrenergic activation of adenylate cyclase. Here we investigate the dynamics and molecular determinants of these processes using cell lines stably expressing recombinant forms of Plin1 and/or other members of the perilipin family. Plin1 and a C-terminal CLD-binding fragment of Plin1 (Plin1CT induced formation of single dense CLD clusters near the microtubule organizing center, whereas neither an N-terminal CLD-binding fragment of Plin1, nor Plin2 or Plin3 induced clustering. Clustered CLD coated by Plin1, or Plin1CT, dispersed in response to isoproterenol, or other agents that activate adenylate cyclase, in a process inhibited by the protein kinase A inhibitor, H89, and blocked by microtubule disruption. Isoproterenol-stimulated phosphorylation of CLD-associated Plin1 on serine 492 preceded their dispersion, and live cell imaging showed that cluster dispersion involved initial fragmentation of tight clusters into multiple smaller clusters, which then fragmented into well-dispersed individual CLD. siRNA knockdown of the cortical actin binding protein, moesin, induced disaggregation of tight clusters into multiple smaller clusters, and inhibited the reaggregation of dispersed CLD into tight clusters. Together these data suggest that the clustering and dispersion processes involve a complex orchestration of phosphorylation-dependent, microtubule-dependent and independent, and microfilament dependent steps.

  7. Dynamics and molecular determinants of cytoplasmic lipid droplet clustering and dispersion.

    Science.gov (United States)

    Orlicky, David J; Monks, Jenifer; Stefanski, Adrianne L; McManaman, James L

    2013-01-01

    Perilipin-1 (Plin1), a prominent cytoplasmic lipid droplet (CLD) binding phosphoprotein and key physiological regulator of triglyceride storage and lipolysis in adipocytes, is thought to regulate the fragmentation and dispersion of CLD that occurs in response to β-adrenergic activation of adenylate cyclase. Here we investigate the dynamics and molecular determinants of these processes using cell lines stably expressing recombinant forms of Plin1 and/or other members of the perilipin family. Plin1 and a C-terminal CLD-binding fragment of Plin1 (Plin1CT) induced formation of single dense CLD clusters near the microtubule organizing center, whereas neither an N-terminal CLD-binding fragment of Plin1, nor Plin2 or Plin3 induced clustering. Clustered CLD coated by Plin1, or Plin1CT, dispersed in response to isoproterenol, or other agents that activate adenylate cyclase, in a process inhibited by the protein kinase A inhibitor, H89, and blocked by microtubule disruption. Isoproterenol-stimulated phosphorylation of CLD-associated Plin1 on serine 492 preceded their dispersion, and live cell imaging showed that cluster dispersion involved initial fragmentation of tight clusters into multiple smaller clusters, which then fragmented into well-dispersed individual CLD. siRNA knockdown of the cortical actin binding protein, moesin, induced disaggregation of tight clusters into multiple smaller clusters, and inhibited the reaggregation of dispersed CLD into tight clusters. Together these data suggest that the clustering and dispersion processes involve a complex orchestration of phosphorylation-dependent, microtubule-dependent and independent, and microfilament dependent steps.

  8. Distinct profile of vascular progenitor attachment to extracellular matrix proteins in cancer patients.

    Science.gov (United States)

    Labonté, Laura; Li, Yuhua; Addison, Christina L; Brand, Marjorie; Javidnia, Hedyeh; Corsten, Martin; Burns, Kevin; Allan, David S

    2012-04-01

    Vascular progenitor cells (VPCs) facilitate angiogenesis and initiate vascular repair by homing in on sites of damage and adhering to extracellular matrix (ECM) proteins. VPCs also contribute to tumor angiogenesis and induce angiogenic switching in sites of metastatic cancer. In this study, the binding of attaching cells in VPC clusters that form in vitro on specific ECM proteins was investigated. VPC cluster assays were performed in vitro on ECM proteins enriched in cancer cells and in remodelling tissue. Profiles of VPC clusters from patients with cancer were compared to healthy controls. The role of VEGF and integrin-specific binding of angiogenic attaching cells was addressed. VPC clusters from cancer patients were markedly increased on fibronectin relative to other ECM proteins tested, in contrast to VPC clusters from control subjects, which formed preferentially on laminin. Specific integrin-mediated binding of attaching cells in VPC clusters was matrix protein-dependent. Furthermore, cancer patients had elevated plasma VEGF levels compared to healthy controls and VEGF facilitated preferential VPC cluster formation on fibronectin. Incubating cells from healthy controls with VEGF induced a switch from the 'healthy' VPC binding profile to the profile observed in cancer patients with a marked increase in VPC cluster formation on fibronectin. The ECM proteins laminin and fibronectin support VPC cluster formation via specific integrins on attaching cells and can facilitate patterns of VPC cluster formation that are distinct in cancer patients. Larger studies, however, are needed to gain insight on how tumor angiogenesis may differ from normal repair processes.

  9. Correlation of mRNA and protein levels: Cell type-specific gene expression of cluster designation antigens in the prostate

    Directory of Open Access Journals (Sweden)

    Deutsch Eric W

    2008-05-01

    Full Text Available Abstract Background: Expression levels of mRNA and protein by cell types exhibit a range of correlations for different genes. In this study, we compared levels of mRNA abundance for several cluster designation (CD genes determined by gene arrays using magnetic sorted and laser-capture microdissected human prostate cells with levels of expression of the respective CD proteins determined by immunohistochemical staining in the major cell types of the prostate – basal epithelial, luminal epithelial, stromal fibromuscular, and endothelial – and for prostate precursor/stem cells and prostate carcinoma cells. Immunohistochemical stains of prostate tissues from more than 50 patients were scored for informative CD antigen expression and compared with cell-type specific transcriptomes. Results: Concordance between gene and protein expression findings based on 'present' vs. 'absent' calls ranged from 46 to 68%. Correlation of expression levels was poor to moderate (Pearson correlations ranged from 0 to 0.63. Divergence between the two data types was most frequently seen for genes whose array signals exceeded background (> 50 but lacked immunoreactivity by immunostaining. This could be due to multiple factors, e.g. low levels of protein expression, technological sensitivities, sample processing, probe set definition or anatomical origin of tissue and actual biological differences between transcript and protein abundance. Conclusion: Agreement between these two very different methodologies has great implications for their respective use in both molecular studies and clinical trials employing molecular biomarkers.

  10. VTVH-MCD study of the Delta nifB Delta nifZ MoFe protein from Azotobacter vinelandii.

    Science.gov (United States)

    Cotton, Marcia S; Rupnik, Kresimir; Broach, Robyn B; Hu, Yilin; Fay, Aaron W; Ribbe, Markus W; Hales, Brian J

    2009-04-08

    NifZ is a member of a series of proteins associated with the maturation of the nitrogenase MoFe protein. An MCD spectroscopic study was undertaken on the Delta nifB Delta nifZ MoFe protein generated in the absence of both NifZ and NifB (deletion of NifB generates an apo-MoFe protein lacking the FeMo cofactor). Results presented here show that, in the absence of NifZ, only one of the two P-clusters of the MoFe protein is matured to the ultimate [8Fe-7S] structure. The other P-cluster site in the protein contains a [4Fe-4S] cluster pair, representing a P-cluster precursor that is electronically identical to the analogous clusters observed in the Delta nifH MoFe protein. These results suggest that the MoFe protein is synthesized in a stepwise fashion where NifZ is specifically required for the formation of the second P-cluster.

  11. Architecture of the Yeast Mitochondrial Iron-Sulfur Cluster Assembly Machinery

    Science.gov (United States)

    Ranatunga, Wasantha; Gakh, Oleksandr; Galeano, Belinda K.; Smith, Douglas Y.; Söderberg, Christopher A. G.; Al-Karadaghi, Salam; Thompson, James R.; Isaya, Grazia

    2016-01-01

    The biosynthesis of Fe-S clusters is a vital process involving the delivery of elemental iron and sulfur to scaffold proteins via molecular interactions that are still poorly defined. We reconstituted a stable, functional complex consisting of the iron donor, Yfh1 (yeast frataxin homologue 1), and the Fe-S cluster scaffold, Isu1, with 1:1 stoichiometry, [Yfh1]24·[Isu1]24. Using negative staining transmission EM and single particle analysis, we obtained a three-dimensional reconstruction of this complex at a resolution of ∼17 Å. In addition, via chemical cross-linking, limited proteolysis, and mass spectrometry, we identified protein-protein interaction surfaces within the complex. The data together reveal that [Yfh1]24·[Isu1]24 is a roughly cubic macromolecule consisting of one symmetric Isu1 trimer binding on top of one symmetric Yfh1 trimer at each of its eight vertices. Furthermore, molecular modeling suggests that two subunits of the cysteine desulfurase, Nfs1, may bind symmetrically on top of two adjacent Isu1 trimers in a manner that creates two putative [2Fe-2S] cluster assembly centers. In each center, conserved amino acids known to be involved in sulfur and iron donation by Nfs1 and Yfh1, respectively, are in close proximity to the Fe-S cluster-coordinating residues of Isu1. We suggest that this architecture is suitable to ensure concerted and protected transfer of potentially toxic iron and sulfur atoms to Isu1 during Fe-S cluster assembly. PMID:26941001

  12. Analysis of protein-protein docking decoys using interaction fingerprints: application to the reconstruction of CaM-ligand complexes

    Directory of Open Access Journals (Sweden)

    Uchikoga Nobuyuki

    2010-05-01

    Full Text Available Abstract Background Protein-protein docking for proteins with large conformational changes was analyzed by using interaction fingerprints, one of the scales for measuring similarities among complex structures, utilized especially for searching near-native protein-ligand or protein-protein complex structures. Here, we have proposed a combined method for analyzing protein-protein docking by taking large conformational changes into consideration. This combined method consists of ensemble soft docking with multiple protein structures, refinement of complexes, and cluster analysis using interaction fingerprints and energy profiles. Results To test for the applicability of this combined method, various CaM-ligand complexes were reconstructed from the NMR structures of unbound CaM. For the purpose of reconstruction, we used three known CaM-ligands, namely, the CaM-binding peptides of cyclic nucleotide gateway (CNG, CaM kinase kinase (CaMKK and the plasma membrane Ca2+ ATPase pump (PMCA, and thirty-one structurally diverse CaM conformations. For each ligand, 62000 CaM-ligand complexes were generated in the docking step and the relationship between their energy profiles and structural similarities to the native complex were analyzed using interaction fingerprint and RMSD. Near-native clusters were obtained in the case of CNG and CaMKK. Conclusions The interaction fingerprint method discriminated near-native structures better than the RMSD method in cluster analysis. We showed that a combined method that includes the interaction fingerprint is very useful for protein-protein docking analysis of certain cases.

  13. Iron-sulfur clusters as biological sensors: the chemistry of reactions with molecular oxygen and nitric oxide.

    Science.gov (United States)

    Crack, Jason C; Green, Jeffrey; Thomson, Andrew J; Le Brun, Nick E

    2014-10-21

    Iron-sulfur cluster proteins exhibit a range of physicochemical properties that underpin their functional diversity in biology, which includes roles in electron transfer, catalysis, and gene regulation. Transcriptional regulators that utilize iron-sulfur clusters are a growing group that exploit the redox and coordination properties of the clusters to act as sensors of environmental conditions including O2, oxidative and nitrosative stress, and metabolic nutritional status. To understand the mechanism by which a cluster detects such analytes and then generates modulation of DNA-binding affinity, we have undertaken a combined strategy of in vivo and in vitro studies of a range of regulators. In vitro studies of iron-sulfur cluster proteins are particularly challenging because of the inherent reactivity and fragility of the cluster, often necessitating strict anaerobic conditions for all manipulations. Nevertheless, and as discussed in this Account, significant progress has been made over the past decade in studies of O2-sensing by the fumarate and nitrate reduction (FNR) regulator and, more recently, nitric oxide (NO)-sensing by WhiB-like (Wbl) and FNR proteins. Escherichia coli FNR binds a [4Fe-4S] cluster under anaerobic conditions leading to a DNA-binding dimeric form. Exposure to O2 converts the cluster to a [2Fe-2S] form, leading to protein monomerization and hence loss of DNA binding ability. Spectroscopic and kinetic studies have shown that the conversion proceeds via at least two steps and involves a [3Fe-4S](1+) intermediate. The second step involves the release of two bridging sulfide ions from the cluster that, unusually, are not released into solution but rather undergo oxidation to sulfane (S(0)) subsequently forming cysteine persulfides that then coordinate the [2Fe-2S] cluster. Studies of other [4Fe-4S] cluster proteins that undergo oxidative cluster conversion indicate that persulfide formation and coordination may be more common than previously

  14. Progeny Clustering: A Method to Identify Biological Phenotypes

    Science.gov (United States)

    Hu, Chenyue W.; Kornblau, Steven M.; Slater, John H.; Qutub, Amina A.

    2015-01-01

    Estimating the optimal number of clusters is a major challenge in applying cluster analysis to any type of dataset, especially to biomedical datasets, which are high-dimensional and complex. Here, we introduce an improved method, Progeny Clustering, which is stability-based and exceptionally efficient in computing, to find the ideal number of clusters. The algorithm employs a novel Progeny Sampling method to reconstruct cluster identity, a co-occurrence probability matrix to assess the clustering stability, and a set of reference datasets to overcome inherent biases in the algorithm and data space. Our method was shown successful and robust when applied to two synthetic datasets (datasets of two-dimensions and ten-dimensions containing eight dimensions of pure noise), two standard biological datasets (the Iris dataset and Rat CNS dataset) and two biological datasets (a cell phenotype dataset and an acute myeloid leukemia (AML) reverse phase protein array (RPPA) dataset). Progeny Clustering outperformed some popular clustering evaluation methods in the ten-dimensional synthetic dataset as well as in the cell phenotype dataset, and it was the only method that successfully discovered clinically meaningful patient groupings in the AML RPPA dataset. PMID:26267476

  15. Spatial cluster analysis of nanoscopically mapped serotonin receptors for classification of fixed brain tissue

    Science.gov (United States)

    Sams, Michael; Silye, Rene; Göhring, Janett; Muresan, Leila; Schilcher, Kurt; Jacak, Jaroslaw

    2014-01-01

    We present a cluster spatial analysis method using nanoscopic dSTORM images to determine changes in protein cluster distributions within brain tissue. Such methods are suitable to investigate human brain tissue and will help to achieve a deeper understanding of brain disease along with aiding drug development. Human brain tissue samples are usually treated postmortem via standard fixation protocols, which are established in clinical laboratories. Therefore, our localization microscopy-based method was adapted to characterize protein density and protein cluster localization in samples fixed using different protocols followed by common fluorescent immunohistochemistry techniques. The localization microscopy allows nanoscopic mapping of serotonin 5-HT1A receptor groups within a two-dimensional image of a brain tissue slice. These nanoscopically mapped proteins can be confined to clusters by applying the proposed statistical spatial analysis. Selected features of such clusters were subsequently used to characterize and classify the tissue. Samples were obtained from different types of patients, fixed with different preparation methods, and finally stored in a human tissue bank. To verify the proposed method, samples of a cryopreserved healthy brain have been compared with epitope-retrieved and paraffin-fixed tissues. Furthermore, samples of healthy brain tissues were compared with data obtained from patients suffering from mental illnesses (e.g., major depressive disorder). Our work demonstrates the applicability of localization microscopy and image analysis methods for comparison and classification of human brain tissues at a nanoscopic level. Furthermore, the presented workflow marks a unique technological advance in the characterization of protein distributions in brain tissue sections.

  16. Identification and phylogeny of the tomato receptor-like proteins family

    Directory of Open Access Journals (Sweden)

    Ermis Yanes-Paz

    2017-03-01

    Full Text Available The receptor-like proteins (RLPs play multiple roles in development and defense. In the current work 75 RLPs were identified in tomato (Solanum lycopersicum L. using iterative BLAST searches and domain prediction. A phylogenetic tree including all the identified RLPs from tomato and some functionally characterized RLPs from other species was built to identify their putative homologues in tomato. We first tested whether C3-F-based phylogeny was a good indicator of functional relation between related proteins of different species. Indeed, the functionally characterized CLAVATA2 (CLV2, the maize ortholog FASCIATED EAR2 (FEA2 and a putative tomato CLV2 described in Uniprot clustered together, which validates the approach. Using this approach Solyc12g042760.1.1 was identified as the putative tomato homologue of TOO MANY MOUTHS (TMM. It was shown that proteins in the same cluster of the phylogenetic tree share functional relations since several clusters of functionally related proteins i.e. the Ve cluster, the Cf cluster, and the Eix clade were formed.   Keywords: phylogeny, receptors, RLP, tomato

  17. Super Resolution Fluorescence Microscopy and Tracking of Bacterial Flotillin (Reggie Paralogs Provide Evidence for Defined-Sized Protein Microdomains within the Bacterial Membrane but Absence of Clusters Containing Detergent-Resistant Proteins.

    Directory of Open Access Journals (Sweden)

    Felix Dempwolff

    2016-06-01

    Full Text Available Biological membranes have been proposed to contain microdomains of a specific lipid composition, in which distinct groups of proteins are clustered. Flotillin-like proteins are conserved between pro-and eukaryotes, play an important function in several eukaryotic and bacterial cells, and define in vertebrates a type of so-called detergent-resistant microdomains. Using STED microscopy, we show that two bacterial flotillins, FloA and FloT, form defined assemblies with an average diameter of 85 to 110 nm in the model bacterium Bacillus subtilis. Interestingly, flotillin microdomains are of similar size in eukaryotic cells. The soluble domains of FloA form higher order oligomers of up to several hundred kDa in vitro, showing that like eukaryotic flotillins, bacterial assemblies are based in part on their ability to self-oligomerize. However, B. subtilis paralogs show significantly different diffusion rates, and consequently do not colocalize into a common microdomain. Dual colour time lapse experiments of flotillins together with other detergent-resistant proteins in bacteria show that proteins colocalize for no longer than a few hundred milliseconds, and do not move together. Our data reveal that the bacterial membrane contains defined-sized protein domains rather than functional microdomains dependent on flotillins. Based on their distinct dynamics, FloA and FloT confer spatially distinguishable activities, but do not serve as molecular scaffolds.

  18. Protein complex prediction in large ontology attributed protein-protein interaction networks.

    Science.gov (United States)

    Zhang, Yijia; Lin, Hongfei; Yang, Zhihao; Wang, Jian; Li, Yanpeng; Xu, Bo

    2013-01-01

    Protein complexes are important for unraveling the secrets of cellular organization and function. Many computational approaches have been developed to predict protein complexes in protein-protein interaction (PPI) networks. However, most existing approaches focus mainly on the topological structure of PPI networks, and largely ignore the gene ontology (GO) annotation information. In this paper, we constructed ontology attributed PPI networks with PPI data and GO resource. After constructing ontology attributed networks, we proposed a novel approach called CSO (clustering based on network structure and ontology attribute similarity). Structural information and GO attribute information are complementary in ontology attributed networks. CSO can effectively take advantage of the correlation between frequent GO annotation sets and the dense subgraph for protein complex prediction. Our proposed CSO approach was applied to four different yeast PPI data sets and predicted many well-known protein complexes. The experimental results showed that CSO was valuable in predicting protein complexes and achieved state-of-the-art performance.

  19. Analysis of protein profiles using fuzzy clustering methods

    DEFF Research Database (Denmark)

    Karemore, Gopal Raghunath; Ukendt, Sujatha; Rai, Lavanya

    The tissue protein profiles of healthy volunteers and volunteers with cervical cancer were recorded using High Performance Liquid Chromatography combined with Laser Induced Fluorescence  technique  (HPLC-LIF)  developed  in  our  lab.      We analyzed      the protein profile data using different...

  20. Pulse laser-induced generation of cluster codes from metal nanoparticles for immunoassay applications

    Directory of Open Access Journals (Sweden)

    Chia-Yin Chang

    2017-05-01

    Full Text Available In this work, we have developed an assay for the detection of proteins by functionalized nanomaterials coupled with laser-induced desorption/ionization mass spectrometry (LDI-MS by monitoring the generation of metal cluster ions. We achieved selective detection of three proteins [thrombin, vascular endothelial growth factor-A165 (VEGF-A165, and platelet-derived growth factor-BB (PDGF-BB] by modifying nanoparticles (NPs of three different metals (Au, Ag, and Pt with the corresponding aptamer or antibody in one assay. The Au, Ag, and Pt acted as metal bio-codes for the analysis of thrombin, VEGF-A165, and PDGF-BB, respectively, and a microporous cellulose acetate membrane (CAM served as a medium for an in situ separation of target protein-bound and -unbound NPs. The functionalized metal nanoparticles bound to their specific proteins were subjected to LDI-MS on the CAM. The functional nanoparticles/CAM system can function as a signal transducer and amplifier by transforming the protein concentration into an intense metal cluster ion signal during LDI-MS analysis. This system can selectively detect proteins at picomolar concentrations. Most importantly, the system has great potential for the detection of multiple proteins without any pre-concentration, separation, or purification process because LDI-MS coupled with CAM effectively removes all signals except for those from the metal cluster ions.

  1. Comparing the performance of biomedical clustering methods

    DEFF Research Database (Denmark)

    Wiwie, Christian; Baumbach, Jan; Röttger, Richard

    2015-01-01

    expression to protein domains. Performance was judged on the basis of 13 common cluster validity indices. We developed a clustering analysis platform, ClustEval (http://clusteval.mpi-inf.mpg.de), to promote streamlined evaluation, comparison and reproducibility of clustering results in the future......Identifying groups of similar objects is a popular first step in biomedical data analysis, but it is error-prone and impossible to perform manually. Many computational methods have been developed to tackle this problem. Here we assessed 13 well-known methods using 24 data sets ranging from gene....... This allowed us to objectively evaluate the performance of all tools on all data sets with up to 1,000 different parameter sets each, resulting in a total of more than 4 million calculated cluster validity indices. We observed that there was no universal best performer, but on the basis of this wide...

  2. UQlust: combining profile hashing with linear-time ranking for efficient clustering and analysis of big macromolecular data.

    Science.gov (United States)

    Adamczak, Rafal; Meller, Jarek

    2016-12-28

    Advances in computing have enabled current protein and RNA structure prediction and molecular simulation methods to dramatically increase their sampling of conformational spaces. The quickly growing number of experimentally resolved structures, and databases such as the Protein Data Bank, also implies large scale structural similarity analyses to retrieve and classify macromolecular data. Consequently, the computational cost of structure comparison and clustering for large sets of macromolecular structures has become a bottleneck that necessitates further algorithmic improvements and development of efficient software solutions. uQlust is a versatile and easy-to-use tool for ultrafast ranking and clustering of macromolecular structures. uQlust makes use of structural profiles of proteins and nucleic acids, while combining a linear-time algorithm for implicit comparison of all pairs of models with profile hashing to enable efficient clustering of large data sets with a low memory footprint. In addition to ranking and clustering of large sets of models of the same protein or RNA molecule, uQlust can also be used in conjunction with fragment-based profiles in order to cluster structures of arbitrary length. For example, hierarchical clustering of the entire PDB using profile hashing can be performed on a typical laptop, thus opening an avenue for structural explorations previously limited to dedicated resources. The uQlust package is freely available under the GNU General Public License at https://github.com/uQlust . uQlust represents a drastic reduction in the computational complexity and memory requirements with respect to existing clustering and model quality assessment methods for macromolecular structure analysis, while yielding results on par with traditional approaches for both proteins and RNAs.

  3. Lipid reorganization induced by Shiga toxin clustering on planar membranes.

    Directory of Open Access Journals (Sweden)

    Barbara Windschiegl

    Full Text Available The homopentameric B-subunit of bacterial protein Shiga toxin (STxB binds to the glycolipid Gb(3 in plasma membranes, which is the initial step for entering cells by a clathrin-independent mechanism. It has been suggested that protein clustering and lipid reorganization determine toxin uptake into cells. Here, we elucidated the molecular requirements for STxB induced Gb(3 clustering and for the proposed lipid reorganization in planar membranes. The influence of binding site III of the B-subunit as well as the Gb(3 lipid structure was investigated by means of high resolution methods such as fluorescence and scanning force microscopy. STxB was found to form protein clusters on homogenous 1,2-dioleoyl-sn-glycero-3-phosphocholine (DOPC/cholesterol/Gb(3 (65:30:5 bilayers. In contrast, membranes composed of DOPC/cholesterol/sphingomyelin/Gb(3 (40:35:20:5 phase separate into a liquid ordered and liquid disordered phase. Dependent on the fatty acid composition of Gb(3, STxB-Gb(3 complexes organize within the liquid ordered phase upon protein binding. Our findings suggest that STxB is capable of forming a new membrane phase that is characterized by lipid compaction. The significance of this finding is discussed in the context of Shiga toxin-induced formation of endocytic membrane invaginations.

  4. Comparative genomics of Cluster O mycobacteriophages.

    Science.gov (United States)

    Cresawn, Steven G; Pope, Welkin H; Jacobs-Sera, Deborah; Bowman, Charles A; Russell, Daniel A; Dedrick, Rebekah M; Adair, Tamarah; Anders, Kirk R; Ball, Sarah; Bollivar, David; Breitenberger, Caroline; Burnett, Sandra H; Butela, Kristen; Byrnes, Deanna; Carzo, Sarah; Cornely, Kathleen A; Cross, Trevor; Daniels, Richard L; Dunbar, David; Findley, Ann M; Gissendanner, Chris R; Golebiewska, Urszula P; Hartzog, Grant A; Hatherill, J Robert; Hughes, Lee E; Jalloh, Chernoh S; De Los Santos, Carla; Ekanem, Kevin; Khambule, Sphindile L; King, Rodney A; King-Smith, Christina; Klyczek, Karen; Krukonis, Greg P; Laing, Christian; Lapin, Jonathan S; Lopez, A Javier; Mkhwanazi, Sipho M; Molloy, Sally D; Moran, Deborah; Munsamy, Vanisha; Pacey, Eddie; Plymale, Ruth; Poxleitner, Marianne; Reyna, Nathan; Schildbach, Joel F; Stukey, Joseph; Taylor, Sarah E; Ware, Vassie C; Wellmann, Amanda L; Westholm, Daniel; Wodarski, Donna; Zajko, Michelle; Zikalala, Thabiso S; Hendrix, Roger W; Hatfull, Graham F

    2015-01-01

    Mycobacteriophages--viruses of mycobacterial hosts--are genetically diverse but morphologically are all classified in the Caudovirales with double-stranded DNA and tails. We describe here a group of five closely related mycobacteriophages--Corndog, Catdawg, Dylan, Firecracker, and YungJamal--designated as Cluster O with long flexible tails but with unusual prolate capsids. Proteomic analysis of phage Corndog particles, Catdawg particles, and Corndog-infected cells confirms expression of half of the predicted gene products and indicates a non-canonical mechanism for translation of the Corndog tape measure protein. Bioinformatic analysis identifies 8-9 strongly predicted SigA promoters and all five Cluster O genomes contain more than 30 copies of a 17 bp repeat sequence with dyad symmetry located throughout the genomes. Comparison of the Cluster O phages provides insights into phage genome evolution including the processes of gene flux by horizontal genetic exchange.

  5. Transporter’s evolution and carbohydrate metabolic clusters

    NARCIS (Netherlands)

    Plantinga, Titia H.; Does, Chris van der; Driessen, Arnold J.M.

    2004-01-01

    The yiaQRS genes of Escherichia coli K-12 are involved in carbohydrate metabolism. Clustering of homologous genes was found throughout several unrelated bacteria. Strikingly, all four bacterial transport protein classes were found, conserving transport function but not mechanism. It appears that

  6. Comparative analysis of clustering methods for gene expression time course data

    Directory of Open Access Journals (Sweden)

    Ivan G. Costa

    2004-01-01

    Full Text Available This work performs a data driven comparative study of clustering methods used in the analysis of gene expression time courses (or time series. Five clustering methods found in the literature of gene expression analysis are compared: agglomerative hierarchical clustering, CLICK, dynamical clustering, k-means and self-organizing maps. In order to evaluate the methods, a k-fold cross-validation procedure adapted to unsupervised methods is applied. The accuracy of the results is assessed by the comparison of the partitions obtained in these experiments with gene annotation, such as protein function and series classification.

  7. Alignment and integration of complex networks by hypergraph-based spectral clustering

    Science.gov (United States)

    Michoel, Tom; Nachtergaele, Bruno

    2012-11-01

    Complex networks possess a rich, multiscale structure reflecting the dynamical and functional organization of the systems they model. Often there is a need to analyze multiple networks simultaneously, to model a system by more than one type of interaction, or to go beyond simple pairwise interactions, but currently there is a lack of theoretical and computational methods to address these problems. Here we introduce a framework for clustering and community detection in such systems using hypergraph representations. Our main result is a generalization of the Perron-Frobenius theorem from which we derive spectral clustering algorithms for directed and undirected hypergraphs. We illustrate our approach with applications for local and global alignment of protein-protein interaction networks between multiple species, for tripartite community detection in folksonomies, and for detecting clusters of overlapping regulatory pathways in directed networks.

  8. Protein-protein interface detection using the energy centrality relationship (ECR characteristic of proteins.

    Directory of Open Access Journals (Sweden)

    Sanjana Sudarshan

    Full Text Available Specific protein interactions are responsible for most biological functions. Distinguishing Functionally Linked Interfaces of Proteins (FLIPs, from Functionally uncorrelated Contacts (FunCs, is therefore important to characterizing these interactions. To achieve this goal, we have created a database of protein structures called FLIPdb, containing proteins belonging to various functional sub-categories. Here, we use geometric features coupled with Kortemme and Baker's computational alanine scanning method to calculate the energetic sensitivity of each amino acid at the interface to substitution, identify hotspots, and identify other factors that may contribute towards an interface being FLIP or FunC. Using Principal Component Analysis and K-means clustering on a training set of 160 interfaces, we could distinguish FLIPs from FunCs with an accuracy of 76%. When these methods were applied to two test sets of 18 and 170 interfaces, we achieved similar accuracies of 78% and 80%. We have identified that FLIP interfaces have a stronger central organizing tendency than FunCs, due, we suggest, to greater specificity. We also observe that certain functional sub-categories, such as enzymes, antibody-heavy-light, antibody-antigen, and enzyme-inhibitors form distinct sub-clusters. The antibody-antigen and enzyme-inhibitors interfaces have patterns of physical characteristics similar to those of FunCs, which is in agreement with the fact that the selection pressures of these interfaces is differently evolutionarily driven. As such, our ECR model also successfully describes the impact of evolution and natural selection on protein-protein interfaces. Finally, we indicate how our ECR method may be of use in reducing the false positive rate of docking calculations.

  9. Identification, characterization and metagenome analysis of oocyte-specific genes organized in clusters in the mouse genome

    Directory of Open Access Journals (Sweden)

    Vaiman Daniel

    2005-05-01

    Full Text Available Abstract Background Genes specifically expressed in the oocyte play key roles in oogenesis, ovarian folliculogenesis, fertilization and/or early embryonic development. In an attempt to identify novel oocyte-specific genes in the mouse, we have used an in silico subtraction methodology, and we have focused our attention on genes that are organized in genomic clusters. Results In the present work, five clusters have been studied: a cluster of thirteen genes characterized by an F-box domain localized on chromosome 9, a cluster of six genes related to T-cell leukaemia/lymphoma protein 1 (Tcl1 on chromosome 12, a cluster composed of a SPErm-associated glutamate (E-Rich (Speer protein expressed in the oocyte in the vicinity of four unknown genes specifically expressed in the testis on chromosome 14, a cluster composed of the oocyte secreted protein-1 (Oosp-1 gene and two Oosp-related genes on chromosome 19, all three being characterized by a partial N-terminal zona pellucida-like domain, and another small cluster of two genes on chromosome 19 as well, composed of a TWIK-Related spinal cord K+ channel encoding-gene, and an unknown gene predicted in silico to be testis-specific. The specificity of expression was confirmed by RT-PCR and in situ hybridization for eight and five of them, respectively. Finally, we showed by comparing all of the isolated and clustered oocyte-specific genes identified so far in the mouse genome, that the oocyte-specific clusters are significantly closer to telomeres than isolated oocyte-specific genes are. Conclusion We have studied five clusters of genes specifically expressed in female, some of them being also expressed in male germ-cells. Moreover, contrarily to non-clustered oocyte-specific genes, those that are organized in clusters tend to map near chromosome ends, suggesting that this specific near-telomere position of oocyte-clusters in rodents could constitute an evolutionary advantage. Understanding the biological

  10. Identifying technical aliases in SELDI mass spectra of complex mixtures of proteins

    Science.gov (United States)

    2013-01-01

    Background Biomarker discovery datasets created using mass spectrum protein profiling of complex mixtures of proteins contain many peaks that represent the same protein with different charge states. Correlated variables such as these can confound the statistical analyses of proteomic data. Previously we developed an algorithm that clustered mass spectrum peaks that were biologically or technically correlated. Here we demonstrate an algorithm that clusters correlated technical aliases only. Results In this paper, we propose a preprocessing algorithm that can be used for grouping technical aliases in mass spectrometry protein profiling data. The stringency of the variance allowed for clustering is customizable, thereby affecting the number of peaks that are clustered. Subsequent analysis of the clusters, instead of individual peaks, helps reduce difficulties associated with technically-correlated data, and can aid more efficient biomarker identification. Conclusions This software can be used to pre-process and thereby decrease the complexity of protein profiling proteomics data, thus simplifying the subsequent analysis of biomarkers by decreasing the number of tests. The software is also a practical tool for identifying which features to investigate further by purification, identification and confirmation. PMID:24010718

  11. Comparative genomics of Cluster O mycobacteriophages.

    Directory of Open Access Journals (Sweden)

    Steven G Cresawn

    Full Text Available Mycobacteriophages--viruses of mycobacterial hosts--are genetically diverse but morphologically are all classified in the Caudovirales with double-stranded DNA and tails. We describe here a group of five closely related mycobacteriophages--Corndog, Catdawg, Dylan, Firecracker, and YungJamal--designated as Cluster O with long flexible tails but with unusual prolate capsids. Proteomic analysis of phage Corndog particles, Catdawg particles, and Corndog-infected cells confirms expression of half of the predicted gene products and indicates a non-canonical mechanism for translation of the Corndog tape measure protein. Bioinformatic analysis identifies 8-9 strongly predicted SigA promoters and all five Cluster O genomes contain more than 30 copies of a 17 bp repeat sequence with dyad symmetry located throughout the genomes. Comparison of the Cluster O phages provides insights into phage genome evolution including the processes of gene flux by horizontal genetic exchange.

  12. Targeting protein-protein interaction between MLL1 and reciprocal proteins for leukemia therapy.

    Science.gov (United States)

    Wang, Zhi-Hui; Li, Dong-Dong; Chen, Wei-Lin; You, Qi-Dong; Guo, Xiao-Ke

    2018-01-15

    The mixed lineage leukemia protein-1 (MLL1), as a lysine methyltransferase, predominantly regulates the methylation of histone H3 lysine 4 (H3K4) and functions in hematopoietic stem cell (HSC) self-renewal. MLL1 gene fuses with partner genes that results in the generation of MLL1 fusion proteins (MLL1-FPs), which are frequently detected in acute leukemia. In the progress of leukemogenesis, a great deal of proteins cooperate with MLL1 to form multiprotein complexes serving for the dysregulation of H3K4 methylation, the overexpression of homeobox (HOX) cluster genes, and the consequent generation of leukemia. Hence, disrupting the interactions between MLL1 and the reciprocal proteins has been considered to be a new treatment strategy for leukemia. Here, we reviewed potential protein-protein interactions (PPIs) between MLL1 and its reciprocal proteins, and summarized the inhibitors to target MLL1 PPIs. The druggability of MLL1 PPIs for leukemia were also discussed. Copyright © 2017. Published by Elsevier Ltd.

  13. Toxicological relationships between proteins obtained from protein target predictions of large toxicity databases

    International Nuclear Information System (INIS)

    Nigsch, Florian; Mitchell, John B.O.

    2008-01-01

    The combination of models for protein target prediction with large databases containing toxicological information for individual molecules allows the derivation of 'toxiclogical' profiles, i.e., to what extent are molecules of known toxicity predicted to interact with a set of protein targets. To predict protein targets of drug-like and toxic molecules, we built a computational multiclass model using the Winnow algorithm based on a dataset of protein targets derived from the MDL Drug Data Report. A 15-fold Monte Carlo cross-validation using 50% of each class for training, and the remaining 50% for testing, provided an assessment of the accuracy of that model. We retained the 3 top-ranking predictions and found that in 82% of all cases the correct target was predicted within these three predictions. The first prediction was the correct one in almost 70% of cases. A model built on the whole protein target dataset was then used to predict the protein targets for 150 000 molecules from the MDL Toxicity Database. We analysed the frequency of the predictions across the panel of protein targets for experimentally determined toxicity classes of all molecules. This allowed us to identify clusters of proteins related by their toxicological profiles, as well as toxicities that are related. Literature-based evidence is provided for some specific clusters to show the relevance of the relationships identified

  14. Influence of dietary tender cluster beans (Cyamopsis tetragonoloba) on biliary proteins, bile acid synthesis and cholesterol crystal growth in rat bile.

    Science.gov (United States)

    Raghavendra, Chikkanna K; Srinivasan, Krishnapura

    2015-02-01

    Tender cluster beans (CBs; Cyamopsis tetragonoloba) are observed to possess anti-lithogenic potential in experimental mice. Formation of cholesterol gallstones in gallbladder is controlled by procrystallizing and anticrystallizing factors present in bile in addition to supersaturation of cholesterol. This study aimed at evaluating the influence of CB on biliary glycoproteins, low molecular weight (LMW) and high molecular weight (HMW) proteins, cholesterol nucleation time, and cholesterol crystal growth in rat hepatic bile. Groups of rats were fed for 10 weeks with 0.5% cholesterol to render the bile lithogenic. Experimental dietary interventions were: 10% freeze-dried CB, 1% garlic powder or their combination. Incorporation of CB into HCD decreased the cholesterol saturation index in bile, increased bile flow and biliary glycoproteins. Dietary CB prolonged cholesterol nucleation time in bile. Electrophoresis of biliary proteins showed the presence of high concentration of 27 kDa protein which might be responsible for the prolongation of cholesterol nucleation time in the CB fed group. Proteins of 20 kDa and 18 kDa were higher in CB treated animals, while the same were less expressed in HCD group. Biliary proteins from CB fed animals reduced cholesterol crystal growth index which was elevated in the presence of proteins from HCD group. Cholesterol-7α-hydroxylase and cholesterol-27-hydroxylase mRNA expression was increased in CB treated animals contributing to the bile acid synthesis. Thus, the beneficial anti-lithogenic effect of dietary CB which primarily is due to reduced cholesterol saturation index was additionally affected through a modulation of the nucleating and anti-nucleating proteins that affect cholesterol crystallization. Copyright © 2014 Elsevier Inc. All rights reserved.

  15. Double-stranded endonuclease activity in Bacillus halodurans clustered regularly interspaced short palindromic repeats (CRISPR)-associated Cas2 protein.

    Science.gov (United States)

    Nam, Ki Hyun; Ding, Fran; Haitjema, Charles; Huang, Qingqiu; DeLisa, Matthew P; Ke, Ailong

    2012-10-19

    The CRISPR (clustered regularly interspaced short palindromic repeats) system is a prokaryotic RNA-based adaptive immune system against extrachromosomal genetic elements. Cas2 is a universally conserved core CRISPR-associated protein required for the acquisition of new spacers for CRISPR adaptation. It was previously characterized as an endoribonuclease with preference for single-stranded (ss)RNA. Here, we show using crystallography, mutagenesis, and isothermal titration calorimetry that the Bacillus halodurans Cas2 (Bha_Cas2) from the subtype I-C/Dvulg CRISPR instead possesses metal-dependent endonuclease activity against double-stranded (ds)DNA. This activity is consistent with its putative function in producing new spacers for insertion into the 5'-end of the CRISPR locus. Mutagenesis and isothermal titration calorimetry studies revealed that a single divalent metal ion (Mg(2+) or Mn(2+)), coordinated by a symmetric Asp pair in the Bha_Cas2 dimer, is involved in the catalysis. We envision that a pH-dependent conformational change switches Cas2 into a metal-binding competent conformation for catalysis. We further propose that the distinct substrate preferences among Cas2 proteins may be determined by the sequence and structure in the β1-α1 loop.

  16. Conserved syntenic clusters of protein coding genes are missing in birds.

    Science.gov (United States)

    Lovell, Peter V; Wirthlin, Morgan; Wilhelm, Larry; Minx, Patrick; Lazar, Nathan H; Carbone, Lucia; Warren, Wesley C; Mello, Claudio V

    2014-01-01

    Birds are one of the most highly successful and diverse groups of vertebrates, having evolved a number of distinct characteristics, including feathers and wings, a sturdy lightweight skeleton and unique respiratory and urinary/excretion systems. However, the genetic basis of these traits is poorly understood. Using comparative genomics based on extensive searches of 60 avian genomes, we have found that birds lack approximately 274 protein coding genes that are present in the genomes of most vertebrate lineages and are for the most part organized in conserved syntenic clusters in non-avian sauropsids and in humans. These genes are located in regions associated with chromosomal rearrangements, and are largely present in crocodiles, suggesting that their loss occurred subsequent to the split of dinosaurs/birds from crocodilians. Many of these genes are associated with lethality in rodents, human genetic disorders, or biological functions targeting various tissues. Functional enrichment analysis combined with orthogroup analysis and paralog searches revealed enrichments that were shared by non-avian species, present only in birds, or shared between all species. Together these results provide a clearer definition of the genetic background of extant birds, extend the findings of previous studies on missing avian genes, and provide clues about molecular events that shaped avian evolution. They also have implications for fields that largely benefit from avian studies, including development, immune system, oncogenesis, and brain function and cognition. With regards to the missing genes, birds can be considered ‘natural knockouts’ that may become invaluable model organisms for several human diseases.

  17. Extracellular cyclophilin levels associate with parameters of asthma in phenotypic clusters.

    Science.gov (United States)

    Stemmy, Erik J; Benton, Angela S; Lerner, Jennifer; Alcala, Sarah; Constant, Stephanie L; Freishtat, Robert J

    2011-12-01

    Leukocyte persistence during chronic (quiescent) phases of asthma is a major hallmark of the disease. The mechanisms regulating these persistent leukocyte populations are not clearly understood. An alternative family of chemoattracting proteins, cyclophilins (Cyps), has recently been shown to contribute to leukocyte recruitment in animal models of allergic asthma. The goals of this study were to determine whether Cyps are present in asthma patients during the chronic phase of the disease and to investigate whether levels of Cyps associate with clinical parameters of disease severity. Nasal wash samples from an urban cohort of 137 patients of age 6-20 years with physician-diagnosed asthma were examined for the presence of cyclophilin A (CypA), cyclophilin B (CypB), as well as several other classical chemokines. Linear, logistic, or ordinal regressions were performed to identify associations between Cyps, chemokines, and clinical parameters of asthma. The asthma cohort was further divided into previously established phenotypic clusters (cluster 1: n = 55; cluster 2: n = 31; and cluster 3: n = 51) and examined for associations. Levels of CypB in the asthma group were highly elevated compared to nonasthmatic controls, while a slight increase in Monocyte Chemotactic Protein-1 (MCP-1) was also observed. CypA and MCP-1 were associated with levels of eosinophil cationic protein (ECP; a marker of eosinophil activation). Cluster-specific associations were found for CypA and CypB and clinical asthma parameters [e.g. forced expiratory volume in 1 second (FEV(1)) and ECP]. Cyps are present in nasal wash samples of asthma patients and may be a novel biomarker for clinical parameters of asthma severity.

  18. Investigation of glutathione-derived electrostatic and hydrogen-bonding interactions and their role in defining Grx5 [2Fe-2S] cluster optical spectra and transfer chemistry.

    Science.gov (United States)

    Sen, Sambuddha; Bonfio, Claudia; Mansy, Sheref S; Cowan, J A

    2018-03-01

    Human glutaredoxin 5 (Grx5) is one of the core components of the Isc (iron-sulfur cluster) assembly and trafficking machinery, and serves as an intermediary cluster carrier, putatively delivering cluster from the Isu scaffold protein to target proteins. The tripeptide glutathione is intimately involved in this role, providing cysteinyl coordination to the iron center of the Grx5-bound [2Fe-2S] cluster. Grx5 has a well-defined glutathione-binding pocket with protein amino acid residues providing many ionic and hydrogen binding contacts to the bound glutathione. In this report, we investigated the importance of these interactions in cluster chirality and exchange reactivity by systematically perturbing the crucial contacts by use of natural and non-natural amino acid substitutions to disrupt the binding contacts from both the protein and glutathione. Native Grx5 could be reconstituted with all of the glutathione analogs used, as well as other thiol ligands, such as DTT or L-cysteine, by in vitro chemical reconstitution, and the holo proteins were found to transfer [2Fe-2S] cluster to apo ferredoxin 1 at comparable rates. However, the circular dichroism spectra of these derivatives displayed prominent differences that reflect perturbations in local cluster chirality. These studies provided a detailed molecular understanding of glutathione-protein interactions in holo Grx5 that define both cluster spectroscopy and exchange chemistry.

  19. Statistical discovery of site inter-dependencies in sub-molecular hierarchical protein structuring.

    Science.gov (United States)

    Durston, Kirk K; Chiu, David Ky; Wong, Andrew Kc; Li, Gary Cl

    2012-07-13

    Much progress has been made in understanding the 3D structure of proteins using methods such as NMR and X-ray crystallography. The resulting 3D structures are extremely informative, but do not always reveal which sites and residues within the structure are of special importance. Recently, there are indications that multiple-residue, sub-domain structural relationships within the larger 3D consensus structure of a protein can be inferred from the analysis of the multiple sequence alignment data of a protein family. These intra-dependent clusters of associated sites are used to indicate hierarchical inter-residue relationships within the 3D structure. To reveal the patterns of associations among individual amino acids or sub-domain components within the structure, we apply a k-modes attribute (aligned site) clustering algorithm to the ubiquitin and transthyretin families in order to discover associations among groups of sites within the multiple sequence alignment. We then observe what these associations imply within the 3D structure of these two protein families. The k-modes site clustering algorithm we developed maximizes the intra-group interdependencies based on a normalized mutual information measure. The clusters formed correspond to sub-structural components or binding and interface locations. Applying this data-directed method to the ubiquitin and transthyretin protein family multiple sequence alignments as a test bed, we located numerous interesting associations of interdependent sites. These clusters were then arranged into cluster tree diagrams which revealed four structural sub-domains within the single domain structure of ubiquitin and a single large sub-domain within transthyretin associated with the interface among transthyretin monomers. In addition, several clusters of mutually interdependent sites were discovered for each protein family, each of which appear to play an important role in the molecular structure and/or function. Our results

  20. The Yeast Nbp35-Cfd1 Cytosolic Iron-Sulfur Cluster Scaffold Is an ATPase.

    Science.gov (United States)

    Camire, Eric J; Grossman, John D; Thole, Grace J; Fleischman, Nicholas M; Perlstein, Deborah L

    2015-09-25

    Nbp35 and Cfd1 are prototypical members of the MRP/Nbp35 class of iron-sulfur (FeS) cluster scaffolds that function to assemble nascent FeS clusters for transfer to FeS-requiring enzymes. Both proteins contain a conserved NTPase domain that genetic studies have demonstrated is essential for their cluster assembly activity inside the cell. It was recently reported that these proteins possess no or very low nucleotide hydrolysis activity in vitro, and thus the role of the NTPase domain in cluster biogenesis has remained uncertain. We have reexamined the NTPase activity of Nbp35, Cfd1, and their complex. Using in vitro assays and site-directed mutagenesis, we demonstrate that the Nbp35 homodimer and the Nbp35-Cfd1 heterodimer are ATPases, whereas the Cfd1 homodimer exhibited no or very low ATPase activity. We ruled out the possibility that the observed ATP hydrolysis activity might result from a contaminating ATPase by showing that mutation of key active site residues reduced activity to background levels. Finally, we demonstrate that the fluorescent ATP analog 2'/3'-O-(N'-methylanthraniloyl)-ATP (mantATP) binds stoichiometrically to Nbp35 with a KD = 15.6 μM and that an Nbp35 mutant deficient in ATP hydrolysis activity also displays an increased KD for mantATP. Together, our results demonstrate that the cytosolic iron-sulfur cluster assembly scaffold is an ATPase and pave the way for interrogating the role of nucleotide hydrolysis in cluster biogenesis by this large family of cluster scaffolding proteins found across all domains of life. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  1. Formation of a homocitrate-free iron-molybdenum cluster on NifEN: implications for the role of homocitrate in nitrogenase assembly.

    Science.gov (United States)

    Fay, Aaron Wolfe; Blank, Michael Aaron; Yoshizawa, Janice Mariko; Lee, Chi Chung; Wiig, Jared Andrew; Hu, Yilin; Hodgson, Keith Owen; Hedman, Britt; Ribbe, Markus Walter

    2010-03-28

    Molybdenum (Mo)-dependent nitrogenase is a complex metalloprotein that catalyzes the biological reduction of dinitrogen (N(2)) to ammonia (NH(3)) at the molybdenum-iron cofactor (FeMoco) site of its molybdenum-iron (MoFe) protein component. Here we report the formation of a homocitrate-free, iron-molybdenum ("FeMo") cluster on the biosynthetic scaffold of FeMoco, NifEN. Such a NifEN-associated "FeMo" cluster exhibits EPR features similar to those of the NifEN-associated, fully-complemented "FeMoco", which originate from the presence of Mo in both cluster species; however, "FeMo" cluster and "FeMoco" display different temperature-dependent changes in the line shape and the signal intensity of their respective EPR features, which reflect the impact of homocitrate on the redox properties of these clusters. XAS/EXAFS analysis reveals that the Mo centers in both "FeMo" cluster and "FeMoco" are present in a similar coordination environment, although Mo in "FeMo" cluster is more loosely coordinated as compared to that in "FeMoco" with respect to the Mo-O distances in the cluster, likely due to the absence of homocitrate that normally serves as an additional ligand for the Mo in the cluster. Subsequent biochemical investigation of the "FeMo" cluster not only facilitates the determination of the sequence of events in the mobilization of Mo and homocitrate during FeMoco maturation, but also permits the examination of the role of homocitrate in the transfer of FeMoco between NifEN and MoFe protein. Combined outcome of these studies establishes a platform for future structural analysis of the interactions between NifEN and MoFe protein, which will provide useful insights into the mechanism of cluster transfer between the two proteins.

  2. Effects of clustering structure on volumetric properties of amino acids in (DMSO + water) mixtures

    International Nuclear Information System (INIS)

    Huang Aimin; Liu Chunli; Ma Lin; Tong Zhangfa; Lin Ruisen

    2012-01-01

    Graphical abstract: Together with static light scattering measurement, volumetric properties of glycine, L-alanine and L-serine were determined and utilized to reveal the microscopic solvent structure of (DMSO + water) mixtures and its influence on the interaction between DMSO and amino acids from a clustering point of view. The results demonstrated that the interaction between amino acids and DMSO was greatly related to the clustering structure of the mixed solvent and that amino acids interacted with already established solvent clusters. Hydrophobic aggregating of DMSO lead to a decrease in the hydrophobic effect of DMSO and the hydrophobic–hydrophilic and hydrophobic–hydrophobic interaction with amino acids, which was reflected by the solvation of proteins. Highlights: ► Determine volumetric properties of three amino acids in aqueous DMSO in details. ► Static light scattering measurement for clustering structure of aqueous DMSO. ► Volumetric behaviour of amino acids depends on clustering structure of aqueous DMSO. ► Clustering structure of aqueous DMSO influences solvation of protein and cellulose. - Abstract: For a better understanding on the functions of DMSO in biological systems at a relatively lower concentration, apparent molar volumes of three typical amino acids, glycine, L-alanine and L-serine in (DMSO + water) mixtures were determined and the transfer volumes from water to the mixtures were evaluated. Together with static light scattering measurement, the results were utilised to reveal the microscopic solvent structure of (DMSO + water) mixtures and its influence on the interaction between DMSO and amino acids from a clustering point of view. The results demonstrate that the interaction between amino acids and DMSO is greatly related to the clustering structure of the mixed solvent and that amino acids interacted with already established solvent clusters. The linear dependence of transfer volume of amino acids on DMSO concentration up to 2

  3. Analysis and comparison of very large metagenomes with fast clustering and functional annotation

    Directory of Open Access Journals (Sweden)

    Li Weizhong

    2009-10-01

    Full Text Available Abstract Background The remarkable advance of metagenomics presents significant new challenges in data analysis. Metagenomic datasets (metagenomes are large collections of sequencing reads from anonymous species within particular environments. Computational analyses for very large metagenomes are extremely time-consuming, and there are often many novel sequences in these metagenomes that are not fully utilized. The number of available metagenomes is rapidly increasing, so fast and efficient metagenome comparison methods are in great demand. Results The new metagenomic data analysis method Rapid Analysis of Multiple Metagenomes with a Clustering and Annotation Pipeline (RAMMCAP was developed using an ultra-fast sequence clustering algorithm, fast protein family annotation tools, and a novel statistical metagenome comparison method that employs a unique graphic interface. RAMMCAP processes extremely large datasets with only moderate computational effort. It identifies raw read clusters and protein clusters that may include novel gene families, and compares metagenomes using clusters or functional annotations calculated by RAMMCAP. In this study, RAMMCAP was applied to the two largest available metagenomic collections, the "Global Ocean Sampling" and the "Metagenomic Profiling of Nine Biomes". Conclusion RAMMCAP is a very fast method that can cluster and annotate one million metagenomic reads in only hundreds of CPU hours. It is available from http://tools.camera.calit2.net/camera/rammcap/.

  4. Detection of Locally Over-Represented GO Terms in Protein-Protein Interaction Networks

    Science.gov (United States)

    LAVALLÉE-ADAM, MATHIEU; COULOMBE, BENOIT; BLANCHETTE, MATHIEU

    2015-01-01

    High-throughput methods for identifying protein-protein interactions produce increasingly complex and intricate interaction networks. These networks are extremely rich in information, but extracting biologically meaningful hypotheses from them and representing them in a human-readable manner is challenging. We propose a method to identify Gene Ontology terms that are locally over-represented in a subnetwork of a given biological network. Specifically, we propose several methods to evaluate the degree of clustering of proteins associated to a particular GO term in both weighted and unweighted PPI networks, and describe efficient methods to estimate the statistical significance of the observed clustering. We show, using Monte Carlo simulations, that our best approximation methods accurately estimate the true p-value, for random scale-free graphs as well as for actual yeast and human networks. When applied to these two biological networks, our approach recovers many known complexes and pathways, but also suggests potential functions for many subnetworks. Online Supplementary Material is available at www.liebertonline.com. PMID:20377456

  5. Brightest Cluster Galaxies in REXCESS Clusters

    Science.gov (United States)

    Haarsma, Deborah B.; Leisman, L.; Bruch, S.; Donahue, M.

    2009-01-01

    Most galaxy clusters contain a Brightest Cluster Galaxy (BCG) which is larger than the other cluster ellipticals and has a more extended profile. In the hierarchical model, the BCG forms through many galaxy mergers in the crowded center of the cluster, and thus its properties give insight into the assembly of the cluster as a whole. In this project, we are working with the Representative XMM-Newton Cluster Structure Survey (REXCESS) team (Boehringer et al 2007) to study BCGs in 33 X-ray luminous galaxy clusters, 0.055 < z < 0.183. We are imaging the BCGs in R band at the Southern Observatory for Astrophysical Research (SOAR) in Chile. In this poster, we discuss our methods and give preliminary measurements of the BCG magnitudes, morphology, and stellar mass. We compare these BCG properties with the properties of their host clusters, particularly of the X-ray emitting gas.

  6. Characterization of the largest effector gene cluster of Ustilago maydis.

    Directory of Open Access Journals (Sweden)

    Thomas Brefort

    2014-07-01

    Full Text Available In the genome of the biotrophic plant pathogen Ustilago maydis, many of the genes coding for secreted protein effectors modulating virulence are arranged in gene clusters. The vast majority of these genes encode novel proteins whose expression is coupled to plant colonization. The largest of these gene clusters, cluster 19A, encodes 24 secreted effectors. Deletion of the entire cluster results in severe attenuation of virulence. Here we present the functional analysis of this genomic region. We show that a 19A deletion mutant behaves like an endophyte, i.e. is still able to colonize plants and complete the infection cycle. However, tumors, the most conspicuous symptoms of maize smut disease, are only rarely formed and fungal biomass in infected tissue is significantly reduced. The generation and analysis of strains carrying sub-deletions identified several genes significantly contributing to tumor formation after seedling infection. Another of the effectors could be linked specifically to anthocyanin induction in the infected tissue. As the individual contributions of these genes to tumor formation were small, we studied the response of maize plants to the whole cluster mutant as well as to several individual mutants by array analysis. This revealed distinct plant responses, demonstrating that the respective effectors have discrete plant targets. We propose that the analysis of plant responses to effector mutant strains that lack a strong virulence phenotype may be a general way to visualize differences in effector function.

  7. The OGCleaner: filtering false-positive homology clusters.

    Science.gov (United States)

    Fujimoto, M Stanley; Suvorov, Anton; Jensen, Nicholas O; Clement, Mark J; Snell, Quinn; Bybee, Seth M

    2017-01-01

    Detecting homologous sequences in organisms is an essential step in protein structure and function prediction, gene annotation and phylogenetic tree construction. Heuristic methods are often employed for quality control of putative homology clusters. These heuristics, however, usually only apply to pairwise sequence comparison and do not examine clusters as a whole. We present the Orthology Group Cleaner (the OGCleaner), a tool designed for filtering putative orthology groups as homology or non-homology clusters by considering all sequences in a cluster. The OGCleaner relies on high-quality orthologous groups identified in OrthoDB to train machine learning algorithms that are able to distinguish between true-positive and false-positive homology groups. This package aims to improve the quality of phylogenetic tree construction especially in instances of lower-quality transcriptome assemblies. https://github.com/byucsl/ogcleaner CONTACT: sfujimoto@gmail.comSupplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  8. SCPS: a fast implementation of a spectral method for detecting protein families on a genome-wide scale

    Directory of Open Access Journals (Sweden)

    Paccanaro Alberto

    2010-03-01

    Full Text Available Abstract Background An important problem in genomics is the automatic inference of groups of homologous proteins from pairwise sequence similarities. Several approaches have been proposed for this task which are "local" in the sense that they assign a protein to a cluster based only on the distances between that protein and the other proteins in the set. It was shown recently that global methods such as spectral clustering have better performance on a wide variety of datasets. However, currently available implementations of spectral clustering methods mostly consist of a few loosely coupled Matlab scripts that assume a fair amount of familiarity with Matlab programming and hence they are inaccessible for large parts of the research community. Results SCPS (Spectral Clustering of Protein Sequences is an efficient and user-friendly implementation of a spectral method for inferring protein families. The method uses only pairwise sequence similarities, and is therefore practical when only sequence information is available. SCPS was tested on difficult sets of proteins whose relationships were extracted from the SCOP database, and its results were extensively compared with those obtained using other popular protein clustering algorithms such as TribeMCL, hierarchical clustering and connected component analysis. We show that SCPS is able to identify many of the family/superfamily relationships correctly and that the quality of the obtained clusters as indicated by their F-scores is consistently better than all the other methods we compared it with. We also demonstrate the scalability of SCPS by clustering the entire SCOP database (14,183 sequences and the complete genome of the yeast Saccharomyces cerevisiae (6,690 sequences. Conclusions Besides the spectral method, SCPS also implements connected component analysis and hierarchical clustering, it integrates TribeMCL, it provides different cluster quality tools, it can extract human-readable protein

  9. SCPS: a fast implementation of a spectral method for detecting protein families on a genome-wide scale.

    Science.gov (United States)

    Nepusz, Tamás; Sasidharan, Rajkumar; Paccanaro, Alberto

    2010-03-09

    An important problem in genomics is the automatic inference of groups of homologous proteins from pairwise sequence similarities. Several approaches have been proposed for this task which are "local" in the sense that they assign a protein to a cluster based only on the distances between that protein and the other proteins in the set. It was shown recently that global methods such as spectral clustering have better performance on a wide variety of datasets. However, currently available implementations of spectral clustering methods mostly consist of a few loosely coupled Matlab scripts that assume a fair amount of familiarity with Matlab programming and hence they are inaccessible for large parts of the research community. SCPS (Spectral Clustering of Protein Sequences) is an efficient and user-friendly implementation of a spectral method for inferring protein families. The method uses only pairwise sequence similarities, and is therefore practical when only sequence information is available. SCPS was tested on difficult sets of proteins whose relationships were extracted from the SCOP database, and its results were extensively compared with those obtained using other popular protein clustering algorithms such as TribeMCL, hierarchical clustering and connected component analysis. We show that SCPS is able to identify many of the family/superfamily relationships correctly and that the quality of the obtained clusters as indicated by their F-scores is consistently better than all the other methods we compared it with. We also demonstrate the scalability of SCPS by clustering the entire SCOP database (14,183 sequences) and the complete genome of the yeast Saccharomyces cerevisiae (6,690 sequences). Besides the spectral method, SCPS also implements connected component analysis and hierarchical clustering, it integrates TribeMCL, it provides different cluster quality tools, it can extract human-readable protein descriptions using GI numbers from NCBI, it interfaces with

  10. Assembling Fe/S-clusters and modifying tRNAs: ancient co-factors meet ancient adaptors.

    Science.gov (United States)

    Alfonzo, Juan D; Lukeš, Julius

    2011-06-01

    Trypanosoma brucei undergoes two clearly distinct develomental stages: in the insect vector (procyclic stage) the cells generate the bulk of their energy through respiration, whereas in the bloodstream of the mammalian host (bloodstream stage) they grow mostly glycolytically. Several mitochondrial respiratory proteins require iron-sulfur clusters for activity, and their activation coincides with developmental changes. Likewise some tRNA modification enzymes either require iron-sulfur clusters or use components of the iron-sulfur cluster assembly pathway for activity. These enzymes affect the anticodon loop of various tRNAs and can impact protein synthesis. Herein, the possibility of these pathways being integrated and exploited by T. brucei to carefully coordinate energy demands to translational rates in response to enviromental changes is examined.

  11. Metal cluster compounds - chemistry and importance; clusters containing isolated main group element atoms, large metal cluster compounds, cluster fluxionality

    International Nuclear Information System (INIS)

    Walther, B.

    1988-01-01

    This part of the review on metal cluster compounds deals with clusters containing isolated main group element atoms, with high nuclearity clusters and metal cluster fluxionality. It will be obvious that main group element atoms strongly influence the geometry, stability and reactivity of the clusters. High nuclearity clusters are of interest in there own due to the diversity of the structures adopted, but their intermediate position between molecules and the metallic state makes them a fascinating research object too. These both sites of the metal cluster chemistry as well as the frequently observed ligand and core fluxionality are related to the cluster metal and surface analogy. (author)

  12. PREFACE: Nuclear Cluster Conference; Cluster'07

    Science.gov (United States)

    Freer, Martin

    2008-05-01

    The Cluster Conference is a long-running conference series dating back to the 1960's, the first being initiated by Wildermuth in Bochum, Germany, in 1969. The most recent meeting was held in Nara, Japan, in 2003, and in 2007 the 9th Cluster Conference was held in Stratford-upon-Avon, UK. As the name suggests the town of Stratford lies upon the River Avon, and shortly before the conference, due to unprecedented rainfall in the area (approximately 10 cm within half a day), lay in the River Avon! Stratford is the birthplace of the `Bard of Avon' William Shakespeare, and this formed an intriguing conference backdrop. The meeting was attended by some 90 delegates and the programme contained 65 70 oral presentations, and was opened by a historical perspective presented by Professor Brink (Oxford) and closed by Professor Horiuchi (RCNP) with an overview of the conference and future perspectives. In between, the conference covered aspects of clustering in exotic nuclei (both neutron and proton-rich), molecular structures in which valence neutrons are exchanged between cluster cores, condensates in nuclei, neutron-clusters, superheavy nuclei, clusters in nuclear astrophysical processes and exotic cluster decays such as 2p and ternary cluster decay. The field of nuclear clustering has become strongly influenced by the physics of radioactive beam facilities (reflected in the programme), and by the excitement that clustering may have an important impact on the structure of nuclei at the neutron drip-line. It was clear that since Nara the field had progressed substantially and that new themes had emerged and others had crystallized. Two particular topics resonated strongly condensates and nuclear molecules. These topics are thus likely to be central in the next cluster conference which will be held in 2011 in the Hungarian city of Debrechen. Martin Freer Participants and Cluster'07

  13. Cross-over between discrete and continuous protein structure space: insights into automatic classification and networks of protein structures.

    Directory of Open Access Journals (Sweden)

    Alberto Pascual-García

    2009-03-01

    Full Text Available Structural classifications of proteins assume the existence of the fold, which is an intrinsic equivalence class of protein domains. Here, we test in which conditions such an equivalence class is compatible with objective similarity measures. We base our analysis on the transitive property of the equivalence relationship, requiring that similarity of A with B and B with C implies that A and C are also similar. Divergent gene evolution leads us to expect that the transitive property should approximately hold. However, if protein domains are a combination of recurrent short polypeptide fragments, as proposed by several authors, then similarity of partial fragments may violate the transitive property, favouring the continuous view of the protein structure space. We propose a measure to quantify the violations of the transitive property when a clustering algorithm joins elements into clusters, and we find out that such violations present a well defined and detectable cross-over point, from an approximately transitive regime at high structure similarity to a regime with large transitivity violations and large differences in length at low similarity. We argue that protein structure space is discrete and hierarchic classification is justified up to this cross-over point, whereas at lower similarities the structure space is continuous and it should be represented as a network. We have tested the qualitative behaviour of this measure, varying all the choices involved in the automatic classification procedure, i.e., domain decomposition, alignment algorithm, similarity score, and clustering algorithm, and we have found out that this behaviour is quite robust. The final classification depends on the chosen algorithms. We used the values of the clustering coefficient and the transitivity violations to select the optimal choices among those that we tested. Interestingly, this criterion also favours the agreement between automatic and expert classifications

  14. Identify High-Quality Protein Structural Models by Enhanced K-Means.

    Science.gov (United States)

    Wu, Hongjie; Li, Haiou; Jiang, Min; Chen, Cheng; Lv, Qiang; Wu, Chuang

    2017-01-01

    Background. One critical issue in protein three-dimensional structure prediction using either ab initio or comparative modeling involves identification of high-quality protein structural models from generated decoys. Currently, clustering algorithms are widely used to identify near-native models; however, their performance is dependent upon different conformational decoys, and, for some algorithms, the accuracy declines when the decoy population increases. Results. Here, we proposed two enhanced K -means clustering algorithms capable of robustly identifying high-quality protein structural models. The first one employs the clustering algorithm SPICKER to determine the initial centroids for basic K -means clustering ( SK -means), whereas the other employs squared distance to optimize the initial centroids ( K -means++). Our results showed that SK -means and K -means++ were more robust as compared with SPICKER alone, detecting 33 (59%) and 42 (75%) of 56 targets, respectively, with template modeling scores better than or equal to those of SPICKER. Conclusions. We observed that the classic K -means algorithm showed a similar performance to that of SPICKER, which is a widely used algorithm for protein-structure identification. Both SK -means and K -means++ demonstrated substantial improvements relative to results from SPICKER and classical K -means.

  15. Application of the AMPLE cluster-and-truncate approach to NMR structures for molecular replacement

    Energy Technology Data Exchange (ETDEWEB)

    Bibby, Jaclyn [University of Liverpool, Liverpool L69 7ZB (United Kingdom); Keegan, Ronan M. [Research Complex at Harwell, STFC Rutherford Appleton Laboratory, Didcot OX11 0FA (United Kingdom); Mayans, Olga [University of Liverpool, Liverpool L69 7ZB (United Kingdom); Winn, Martyn D. [Science and Technology Facilities Council Daresbury Laboratory, Warrington WA4 4AD (United Kingdom); Rigden, Daniel J., E-mail: drigden@liv.ac.uk [University of Liverpool, Liverpool L69 7ZB (United Kingdom)

    2013-11-01

    Processing of NMR structures for molecular replacement by AMPLE works well. AMPLE is a program developed for clustering and truncating ab initio protein structure predictions into search models for molecular replacement. Here, it is shown that its core cluster-and-truncate methods also work well for processing NMR ensembles into search models. Rosetta remodelling helps to extend success to NMR structures bearing low sequence identity or high structural divergence from the target protein. Potential future routes to improved performance are considered and practical, general guidelines on using AMPLE are provided.

  16. Improvement of activity and stability of Chondroitinase ABC I by introducing an aromatic cluster at the surface of protein.

    Science.gov (United States)

    Shahaboddin, Mohammad Esmaeil; Khajeh, Khosro; Maleki, Monireh; Golestani, Abolfazl

    2017-10-01

    Chondroitinase ABC I (ChABC I) has been shown to depolymerize a variety of glycosaminoglycan substrates and promote regeneration of damaged spinal cord. However, to date, intrathecal delivery methods have been suboptimal largely due to enzyme instability which necessitates repeated administration to the injured loci. Among the aromatic amino acids, tyrosine has been shown to be more effective in creation of stable clusters and further stabilize of the proteins. Bioinformatics approaches have been used to examine the effect of an extra aromatic cluster at the surface of ChABC I. In this study two amino acids i.e., Asn 806 and Gln 810 were mutated to tyrosine and to alanine as negative control. In this way, four variants i.e., N806Y/Q810Y, N806A/Q810Y, N806Y/Q810A and N806A/Q810A were created. The results showed that N806Y/Q810Y mutation improved both activity and thermal stability of the enzyme while Ala substitution reduced the enzyme activity and destabilized it. Structural analysis of mutants showed an increase in intrinsic fluorescence intensity and secondary structure content of N806Y/Q810Y mutant when compared to the wild type enzyme indicating a more rigid structure of this variant. Moreover, the N806Y/Q810Y enzyme displayed a remarkable resistance against trypsin degradation with a half-life (t 1/2 ) of 45.0min versus 32.5min of wild-type. In conclusion, the data revealed that structural features and activity of ChABC I can be improved by introducing appropriate aromatic clusters at the surface of the enzyme. Copyright © 2017 Elsevier Inc. All rights reserved.

  17. Crystal structure of clustered regularly interspaced short palindromic repeats (CRISPR)-associated Csn2 protein revealed Ca2+-dependent double-stranded DNA binding activity.

    Science.gov (United States)

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong

    2011-09-02

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 Å tetrameric ring structure. The inner circle of the Csn2 tetrameric ring is ∼26 Å wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an α/β domain and an α-helical domain; significant hinge motion was observed between these two domains. Ca(2+) was located at strategic positions in the oligomerization interface. We further showed that removal of Ca(2+) ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca(2+) ions.

  18. Dynamics of water clusters confined in proteins: a molecular dynamics simulation study of interfacial waters in a dimeric hemoglobin.

    Science.gov (United States)

    Gnanasekaran, Ramachandran; Xu, Yao; Leitner, David M

    2010-12-23

    Water confined in proteins exhibits dynamics distinct from the dynamics of water in the bulk or near the surface of a biomolecule. We examine the water dynamics at the interface of the two globules of the homodimeric hemoglobin from Scapharca inaequivalvis (HbI) by molecular dynamics (MD) simulations, with focus on water-protein hydrogen bond lifetimes and rotational anisotropy of the interfacial waters. We find that relaxation of the waters at the interface of both deoxy- and oxy-HbI, which contain a cluster of 17 and 11 interfacial waters, respectively, is well described by stretched exponentials with exponents from 0.1 to 0.6 and relaxation times of tens to thousands of picoseconds. The interfacial water molecules of oxy-HbI exhibit slower rotational relaxation and hydrogen bond rearrangement than those of deoxy-HbI, consistent with an allosteric transition from unliganded to liganded conformers involving the expulsion of several water molecules from the interface. Though the interfacial waters are translationally and rotationally static on the picosecond time scale, they contribute to fast communication between the globules via vibrations. We find that the interfacial waters enhance vibrational energy transport across the interface by ≈10%.

  19. Identification of biofilm-associated cluster (bac in Pseudomonas aeruginosa involved in biofilm formation and virulence.

    Directory of Open Access Journals (Sweden)

    Camille Macé

    Full Text Available Biofilms are prevalent in diseases caused by Pseudomonas aeruginosa, an opportunistic and nosocomial pathogen. By a proteomic approach, we previously identified a hypothetical protein of P. aeruginosa (coded by the gene pA3731 that was accumulated by biofilm cells. We report here that a Delta pA3731 mutant is highly biofilm-defective as compared with the wild-type strain. Using a mouse model of lung infection, we show that the mutation also induces a defect in bacterial growth during the acute phase of infection and an attenuation of the virulence. The pA3731 gene is found to control positively the ability to swarm and to produce extracellular rhamnolipids, and belongs to a cluster of 4 genes (pA3729-pA3732 not previously described in P. aeruginosa. Though the protein PA3731 has a predicted secondary structure similar to that of the Phage Shock Protein, some obvious differences are observed compared to already described psp systems, e.g., this unknown cluster is monocistronic and no homology is found between the other proteins constituting this locus and psp proteins. As E. coli PspA, the amount of the protein PA3731 is enlarged by an osmotic shock, however, not affected by a heat shock. We consequently named this locus bac for biofilm-associated cluster.

  20. Formation of stable products from cluster-cluster collisions

    International Nuclear Information System (INIS)

    Alamanova, Denitsa; Grigoryan, Valeri G; Springborg, Michael

    2007-01-01

    The formation of stable products from copper cluster-cluster collisions is investigated by using classical molecular-dynamics simulations in combination with an embedded-atom potential. The dependence of the product clusters on impact energy, relative orientation of the clusters, and size of the clusters is studied. The structures and total energies of the product clusters are analysed and compared with those of the colliding clusters before impact. These results, together with the internal temperature, are used in obtaining an increased understanding of cluster fusion processes

  1. Statistical method on nonrandom clustering with application to somatic mutations in cancer

    Directory of Open Access Journals (Sweden)

    Rejto Paul A

    2010-01-01

    Full Text Available Abstract Background Human cancer is caused by the accumulation of tumor-specific mutations in oncogenes and tumor suppressors that confer a selective growth advantage to cells. As a consequence of genomic instability and high levels of proliferation, many passenger mutations that do not contribute to the cancer phenotype arise alongside mutations that drive oncogenesis. While several approaches have been developed to separate driver mutations from passengers, few approaches can specifically identify activating driver mutations in oncogenes, which are more amenable for pharmacological intervention. Results We propose a new statistical method for detecting activating mutations in cancer by identifying nonrandom clusters of amino acid mutations in protein sequences. A probability model is derived using order statistics assuming that the location of amino acid mutations on a protein follows a uniform distribution. Our statistical measure is the differences between pair-wise order statistics, which is equivalent to the size of an amino acid mutation cluster, and the probabilities are derived from exact and approximate distributions of the statistical measure. Using data in the Catalog of Somatic Mutations in Cancer (COSMIC database, we have demonstrated that our method detects well-known clusters of activating mutations in KRAS, BRAF, PI3K, and β-catenin. The method can also identify new cancer targets as well as gain-of-function mutations in tumor suppressors. Conclusions Our proposed method is useful to discover activating driver mutations in cancer by identifying nonrandom clusters of somatic amino acid mutations in protein sequences.

  2. Effects of punctal occlusion on global tear proteins in patients with dry eye.

    Science.gov (United States)

    Tong, Louis; Zhou, Lei; Beuerman, Roger; Simonyi, Susan; Hollander, David A; Stern, Michael E

    2017-10-01

    To investigate effects of punctal occlusion on global tear protein levels in patients with dry eye. In this prospective, longitudinal, single-center study, nonabsorbable punctal plugs were inserted bilaterally into the lower punctum of 30 patients with moderate dry eye. Dry eye symptoms, fluorescein corneal staining, Schirmer I test, tear film break-up time, and safety were assessed in the more severely affected eye. Tear proteins at weeks 1 and 3 were quantified by iTRAQ relative to baseline preocclusion levels. Of 29 patients who completed the study, 23 (mean age 49.8 years) had sufficient tear samples for analysis. After 3 weeks, punctal occlusion significantly upregulated tear proteins, including glutathione synthase (mean of 1.6-fold, P = 0.01) and interleukin-1 receptor antagonist (1.7-fold, P = 0.032) and downregulated cholinergic receptor (neuronal) alpha-7 (0.79-fold, P = 0.039) and lymphocyte cytosolic protein-1 (0.66-fold, P = 0.012). Clustering analysis of global tear proteins revealed two clear profile changes; the first group of patients (cluster 1, n = 10) had a reduction in the inflammatory proteins (e.g., S100A8) and rise in lacrimal proteins supporting the ocular surface (e.g., lysozyme), whereas the second group (cluster 2, n = 13) had an increase in inflammatory proteins and a decrease in lacrimal proteins. Logistic regression analysis revealed that cluster 1 patients had significantly (P = 0.006) lower Schirmer scores at baseline (mean [standard deviation]: 4.3 [4.3] mm) than cluster 2 (6.8 [2.6] mm). Punctal plugs produced a beneficial pattern of tear protein change in patients with relatively low Schirmer scores within 3 weeks of punctal occlusion. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  3. Document clustering methods, document cluster label disambiguation methods, document clustering apparatuses, and articles of manufacture

    Science.gov (United States)

    Sanfilippo, Antonio [Richland, WA; Calapristi, Augustin J [West Richland, WA; Crow, Vernon L [Richland, WA; Hetzler, Elizabeth G [Kennewick, WA; Turner, Alan E [Kennewick, WA

    2009-12-22

    Document clustering methods, document cluster label disambiguation methods, document clustering apparatuses, and articles of manufacture are described. In one aspect, a document clustering method includes providing a document set comprising a plurality of documents, providing a cluster comprising a subset of the documents of the document set, using a plurality of terms of the documents, providing a cluster label indicative of subject matter content of the documents of the cluster, wherein the cluster label comprises a plurality of word senses, and selecting one of the word senses of the cluster label.

  4. Fe-S Clusters Emerging as Targets of Therapeutic Drugs

    Directory of Open Access Journals (Sweden)

    Laurence Vernis

    2017-01-01

    Full Text Available Fe-S centers exhibit strong electronic plasticity, which is of importance for insuring fine redox tuning of protein biological properties. In accordance, Fe-S clusters are also highly sensitive to oxidation and can be very easily altered in vivo by different drugs, either directly or indirectly due to catabolic by-products, such as nitric oxide species (NOS or reactive oxygen species (ROS. In case of metal ions, Fe-S cluster alteration might be the result of metal liganding to the coordinating sulfur atoms, as suggested for copper. Several drugs presented through this review are either capable of direct interaction with Fe-S clusters or of secondary Fe-S clusters alteration following ROS or NOS production. Reactions leading to Fe-S cluster disruption are also reported. Due to the recent interest and progress in Fe-S biology, it is very likely that an increasing number of drugs already used in clinics will emerge as molecules interfering with Fe-S centers in the near future. Targeting Fe-S centers could also become a promising strategy for drug development.

  5. The [Mo₆Cl14]2- Cluster is Biologically Secure and Has Anti-Rotavirus Activity In Vitro.

    Science.gov (United States)

    Rojas-Mancilla, Edgardo; Oyarce, Alexis; Verdugo, Viviana; Morales-Verdejo, Cesar; Echeverria, Cesar; Velásquez, Felipe; Chnaiderman, Jonas; Valiente-Echeverría, Fernando; Ramirez-Tagle, Rodrigo

    2017-07-05

    The molybdenum cluster [Mo₆Cl 14 ] 2- is a fluorescent component with potential for use in cell labelling and pharmacology. Biological safety and antiviral properties of the cluster are as yet unknown. Here, we show the effect of acute exposition of human cells and red blood cells to the molybdenum cluster and its interaction with proteins and antiviral activity in vitro. We measured cell viability of HepG2 and EA.hy926 cell lines exposed to increasing concentrations of the cluster (0.1 to 250 µM), by 3-(4,5-dimethylthiazol-2-yl)-2,5-diphenyltetrazolium bromide (MTT) colorimetric assay. Hemolysis and morphological alterations of red blood cells, obtained from healthy donors, exposed to the cluster (10 to 200 µM) at 37 °C were analyzed. Furthermore, quenching of tryptophan residues of albumin was performed. Finally, plaque formation by rotavirus SA11 in MA104 cells treated with the cluster (100 to 300 µM) were analyzed. We found that all doses of the cluster showed similar cell viability, hemolysis, and morphology values, compared to control. Quenching of tryptophan residues of albumin suggests a protein-cluster complex formation. Finally, the cluster showed antiviral activity at 300 µM. These results indicate that the cluster [Mo₆Cl 14 ] 2- could be intravenously administered in animals at therapeutic doses for further in vivo studies and might be studied as an antiviral agent.

  6. Interspecific variation of total seed protein in wild rice germplasm using SDS-Page

    International Nuclear Information System (INIS)

    Shah, S.M.A.; Hidayat-ur-Rahman; Abbasi, F.M.; Ashiq, M.; Rabbani, A.M.; Khan, I.A.; Shinwari, Z.K.; Shah, Z.

    2011-01-01

    Variation in seed protein of 14 wild rice species (Oryza spp.) along with cultivated rice species (O. sativa) was studied using sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) to assess genetic diversity in the rice germplasm. SDS bands were scored as present (1) or absent (0) for protein sample of each genotype. On the basis of cluster analysis, four clusters were identified at a similarity level of 0.85. O. nivara, O. rufipogon and O. sativa with AA genomes constituted the first cluster. The second cluster comprised O. punctata of BB genome and wild rice species of CC genome i.e., O. rhizomatis and O. officinalis. However, it also contained O. barthii and O. glumaepatula of AA genome. O. australiensis with EE genome, and O. latifolia, O. alta and O. grandiglumis having CCDD genomes comprised the third cluster. The fourth cluster consisted of wild rice species, O. brachyantha with EE genome along with two other wild rice species, O. longistaminata and O. meridionalis of AA genome. Overall, on the basis of total seed protein, the grouping pattern of rice genotypes was mostly compatible with their genome status. The results of the present work depicted considerable interspecific genetic variation in the investigated germplasm for total seed protein. Moreover, the results obtained in this study also suggest that analysis of seed protein can also provide a better understanding of genetic affinity of the germplasm. (author)

  7. The E. coli monothiol glutaredoxin GrxD forms homodimeric and heterodimeric FeS cluster containing complexes.

    Science.gov (United States)

    Yeung, N; Gold, B; Liu, N L; Prathapam, R; Sterling, H J; Willams, E R; Butland, G

    2011-10-18

    Monothiol glutaredoxins (mono-Grx) represent a highly evolutionarily conserved class of proteins present in organisms ranging from prokaryotes to humans. Mono-Grxs have been implicated in iron sulfur (FeS) cluster biosynthesis as potential scaffold proteins and in iron homeostasis via an FeS-containing complex with Fra2p (homologue of E. coli BolA) in yeast and are linked to signal transduction in mammalian systems. However, the function of the mono-Grx in prokaryotes and the nature of an interaction with BolA-like proteins have not been established. Recent genome-wide screens for E. coli genetic interactions reported the synthetic lethality (combination of mutations leading to cell death; mutation of only one of these genes does not) of a grxD mutation when combined with strains defective in FeS cluster biosynthesis (isc operon) functions [Butland, G., et al. (2008) Nature Methods 5, 789-795]. These data connected the only E. coli mono-Grx, GrxD to a potential role in FeS cluster biosynthesis. We investigated GrxD to uncover the molecular basis of this synthetic lethality and observed that GrxD can form FeS-bound homodimeric and BolA containing heterodimeric complexes. These complexes display substantially different spectroscopic and functional properties, including the ability to act as scaffold proteins for intact FeS cluster transfer to the model [2Fe-2S] acceptor protein E. coli apo-ferredoxin (Fdx), with the homodimer being significantly more efficient. In this work, we functionally dissect the potential cellular roles of GrxD as a component of both homodimeric and heterodimeric complexes to ultimately uncover if either of these complexes performs functions linked to FeS cluster biosynthesis. © 2011 American Chemical Society

  8. A complex of Cas proteins 5, 6, and 7 is required for the biogenesis and stability of clustered regularly interspaced short palindromic repeats (crispr)-derived rnas (crrnas) in Haloferax volcanii.

    Science.gov (United States)

    Brendel, Jutta; Stoll, Britta; Lange, Sita J; Sharma, Kundan; Lenz, Christof; Stachler, Aris-Edda; Maier, Lisa-Katharina; Richter, Hagen; Nickel, Lisa; Schmitz, Ruth A; Randau, Lennart; Allers, Thorsten; Urlaub, Henning; Backofen, Rolf; Marchfelder, Anita

    2014-03-07

    The clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR-Cas) system is a prokaryotic defense mechanism against foreign genetic elements. A plethora of CRISPR-Cas versions exist, with more than 40 different Cas protein families and several different molecular approaches to fight the invading DNA. One of the key players in the system is the CRISPR-derived RNA (crRNA), which directs the invader-degrading Cas protein complex to the invader. The CRISPR-Cas types I and III use the Cas6 protein to generate mature crRNAs. Here, we show that the Cas6 protein is necessary for crRNA production but that additional Cas proteins that form a CRISPR-associated complex for antiviral defense (Cascade)-like complex are needed for crRNA stability in the CRISPR-Cas type I-B system in Haloferax volcanii in vivo. Deletion of the cas6 gene results in the loss of mature crRNAs and interference. However, cells that have the complete cas gene cluster (cas1-8b) removed and are transformed with the cas6 gene are not able to produce and stably maintain mature crRNAs. crRNA production and stability is rescued only if cas5, -6, and -7 are present. Mutational analysis of the cas6 gene reveals three amino acids (His-41, Gly-256, and Gly-258) that are essential for pre-crRNA cleavage, whereas the mutation of two amino acids (Ser-115 and Ser-224) leads to an increase of crRNA amounts. This is the first systematic in vivo analysis of Cas6 protein variants. In addition, we show that the H. volcanii I-B system contains a Cascade-like complex with a Cas7, Cas5, and Cas6 core that protects the crRNA.

  9. Nuclear clustering - a cluster core model study

    International Nuclear Information System (INIS)

    Paul Selvi, G.; Nandhini, N.; Balasubramaniam, M.

    2015-01-01

    Nuclear clustering, similar to other clustering phenomenon in nature is a much warranted study, since it would help us in understanding the nature of binding of the nucleons inside the nucleus, closed shell behaviour when the system is highly deformed, dynamics and structure at extremes. Several models account for the clustering phenomenon of nuclei. We present in this work, a cluster core model study of nuclear clustering in light mass nuclei

  10. Protein complex prediction via dense subgraphs and false positive analysis.

    Directory of Open Access Journals (Sweden)

    Cecilia Hernandez

    Full Text Available Many proteins work together with others in groups called complexes in order to achieve a specific function. Discovering protein complexes is important for understanding biological processes and predict protein functions in living organisms. Large-scale and throughput techniques have made possible to compile protein-protein interaction networks (PPI networks, which have been used in several computational approaches for detecting protein complexes. Those predictions might guide future biologic experimental research. Some approaches are topology-based, where highly connected proteins are predicted to be complexes; some propose different clustering algorithms using partitioning, overlaps among clusters for networks modeled with unweighted or weighted graphs; and others use density of clusters and information based on protein functionality. However, some schemes still require much processing time or the quality of their results can be improved. Furthermore, most of the results obtained with computational tools are not accompanied by an analysis of false positives. We propose an effective and efficient mining algorithm for discovering highly connected subgraphs, which is our base for defining protein complexes. Our representation is based on transforming the PPI network into a directed acyclic graph that reduces the number of represented edges and the search space for discovering subgraphs. Our approach considers weighted and unweighted PPI networks. We compare our best alternative using PPI networks from Saccharomyces cerevisiae (yeast and Homo sapiens (human with state-of-the-art approaches in terms of clustering, biological metrics and execution times, as well as three gold standards for yeast and two for human. Furthermore, we analyze false positive predicted complexes searching the PDBe (Protein Data Bank in Europe database in order to identify matching protein complexes that have been purified and structurally characterized. Our analysis shows

  11. Efficient algorithms for accurate hierarchical clustering of huge datasets: tackling the entire protein space

    OpenAIRE

    Loewenstein, Yaniv; Portugaly, Elon; Fromer, Menachem; Linial, Michal

    2008-01-01

    Motivation: UPGMA (average linking) is probably the most popular algorithm for hierarchical data clustering, especially in computational biology. However, UPGMA requires the entire dissimilarity matrix in memory. Due to this prohibitive requirement, UPGMA is not scalable to very large datasets. Application: We present a novel class of memory-constrained UPGMA (MC-UPGMA) algorithms. Given any practical memory size constraint, this framework guarantees the correct clustering solution without ex...

  12. Predicting and validating protein interactions using network structure.

    Directory of Open Access Journals (Sweden)

    Pao-Yang Chen

    2008-07-01

    Full Text Available Protein interactions play a vital part in the function of a cell. As experimental techniques for detection and validation of protein interactions are time consuming, there is a need for computational methods for this task. Protein interactions appear to form a network with a relatively high degree of local clustering. In this paper we exploit this clustering by suggesting a score based on triplets of observed protein interactions. The score utilises both protein characteristics and network properties. Our score based on triplets is shown to complement existing techniques for predicting protein interactions, outperforming them on data sets which display a high degree of clustering. The predicted interactions score highly against test measures for accuracy. Compared to a similar score derived from pairwise interactions only, the triplet score displays higher sensitivity and specificity. By looking at specific examples, we show how an experimental set of interactions can be enriched and validated. As part of this work we also examine the effect of different prior databases upon the accuracy of prediction and find that the interactions from the same kingdom give better results than from across kingdoms, suggesting that there may be fundamental differences between the networks. These results all emphasize that network structure is important and helps in the accurate prediction of protein interactions. The protein interaction data set and the program used in our analysis, and a list of predictions and validations, are available at http://www.stats.ox.ac.uk/bioinfo/resources/PredictingInteractions.

  13. A proteomic approach to investigating gene cluster expression and secondary metabolite functionality in Aspergillus fumigatus.

    Directory of Open Access Journals (Sweden)

    Rebecca A Owens

    Full Text Available A combined proteomics and metabolomics approach was utilised to advance the identification and characterisation of secondary metabolites in Aspergillus fumigatus. Here, implementation of a shotgun proteomic strategy led to the identification of non-redundant mycelial proteins (n = 414 from A. fumigatus including proteins typically under-represented in 2-D proteome maps: proteins with multiple transmembrane regions, hydrophobic proteins and proteins with extremes of molecular mass and pI. Indirect identification of secondary metabolite cluster expression was also achieved, with proteins (n = 18 from LaeA-regulated clusters detected, including GliT encoded within the gliotoxin biosynthetic cluster. Biochemical analysis then revealed that gliotoxin significantly attenuates H2O2-induced oxidative stress in A. fumigatus (p>0.0001, confirming observations from proteomics data. A complementary 2-D/LC-MS/MS approach further elucidated significantly increased abundance (p<0.05 of proliferating cell nuclear antigen (PCNA, NADH-quinone oxidoreductase and the gliotoxin oxidoreductase GliT, along with significantly attenuated abundance (p<0.05 of a heat shock protein, an oxidative stress protein and an autolysis-associated chitinase, when gliotoxin and H2O2 were present, compared to H2O2 alone. Moreover, gliotoxin exposure significantly reduced the abundance of selected proteins (p<0.05 involved in de novo purine biosynthesis. Significantly elevated abundance (p<0.05 of a key enzyme, xanthine-guanine phosphoribosyl transferase Xpt1, utilised in purine salvage, was observed in the presence of H2O2 and gliotoxin. This work provides new insights into the A. fumigatus proteome and experimental strategies, plus mechanistic data pertaining to gliotoxin functionality in the organism.

  14. Phasing and structure of bestrophin-1: a case study in the use of heavy-atom cluster compounds with multi-subunit transmembrane proteins

    Energy Technology Data Exchange (ETDEWEB)

    Dickson, Veronica Kane (Cambridge)

    2016-03-01

    The purification and three-dimensional crystallization of membrane proteins are commonly affected by a cumulation of pathologies that are less prevalent in their soluble counterparts. This may include severe anisotropy, poor spot shape, poor to moderate-resolution diffraction, crystal twinning, translational pseudo-symmetry and poor uptake of heavy atoms for derivatization. Such challenges must be circumvented by adaptations in the approach to crystallization and/or phasing. Here, an example of a protein that exhibited all of the above-mentioned complications is presented. Bestrophin-1 is a eukaryotic calcium-activated chloride channel, the structure of which was recently determined in complex with monoclonal antibody fragments using SAD phasing with tantalum bromide clusters (Ta6Br12·Br2). Some of the obstacles to obtaining improved diffraction and phasing for this particular channel are discussed, as well as the approach and adaptations that were key to determining the structure.

  15. Cluster fusion algorithm: application to Lennard-Jones clusters

    DEFF Research Database (Denmark)

    Solov'yov, Ilia; Solov'yov, Andrey V.; Greiner, Walter

    2006-01-01

    paths up to the cluster size of 150 atoms. We demonstrate that in this way all known global minima structures of the Lennard-Jones clusters can be found. Our method provides an efficient tool for the calculation and analysis of atomic cluster structure. With its use we justify the magic number sequence......We present a new general theoretical framework for modelling the cluster structure and apply it to description of the Lennard-Jones clusters. Starting from the initial tetrahedral cluster configuration, adding new atoms to the system and absorbing its energy at each step, we find cluster growing...... for the clusters of noble gas atoms and compare it with experimental observations. We report the striking correspondence of the peaks in the dependence of the second derivative of the binding energy per atom on cluster size calculated for the chain of the Lennard-Jones clusters based on the icosahedral symmetry...

  16. Cluster fusion algorithm: application to Lennard-Jones clusters

    DEFF Research Database (Denmark)

    Solov'yov, Ilia; Solov'yov, Andrey V.; Greiner, Walter

    2008-01-01

    paths up to the cluster size of 150 atoms. We demonstrate that in this way all known global minima structures of the Lennard-Jones clusters can be found. Our method provides an efficient tool for the calculation and analysis of atomic cluster structure. With its use we justify the magic number sequence......We present a new general theoretical framework for modelling the cluster structure and apply it to description of the Lennard-Jones clusters. Starting from the initial tetrahedral cluster configuration, adding new atoms to the system and absorbing its energy at each step, we find cluster growing...... for the clusters of noble gas atoms and compare it with experimental observations. We report the striking correspondence of the peaks in the dependence of the second derivative of the binding energy per atom on cluster size calculated for the chain of the Lennard-Jones clusters based on the icosahedral symmetry...

  17. NMR of proteins (4Fe-4S): structural properties and intramolecular electron transfer; RMN de proteines (4Fe-4S): proprietes structurales et transfert electronique intramoleculaire

    Energy Technology Data Exchange (ETDEWEB)

    Huber, J G

    1996-10-17

    NMR started to be applied to Fe-S proteins in the seventies. Its use has recently been enlarged as the problems arising from the paramagnetic polymetallic clusters ware overcome. Applications to [4Fe-4S] are presented herein. The information derived thereof deepens the understanding of the redox properties of these proteins which play a central role in the metabolism of bacterial cells. The secondary structure elements and the overall folding of Chromatium vinosum ferredoxin (Cv Fd) in solution have been established by NMR. The unique features of this sequence have been shown to fold as an {alpha} helix at the C-terminus and as a loop between two cysteines ligand of one cluster: these two parts localize in close proximity from one another. The interaction between nuclear and electronic spins is a source of additional structural information for (4Fe-AS] proteins. The conformation of the cysteine-ligands, as revealed by the Fe-(S{sub {gamma}}-C{sub {beta}}-H{sub {beta}})Cys dihedral angles, is related to the chemical shifts of the signals associated with the protons of these residues. The longitudinal relaxation times of the protons depend on their distance to the cluster. A quantitative relationship has been established and used to show that the solution structure of the high-potential ferredoxin from Cv differs significantly from the crystal structure around Phe-48. Both parameters (chemical shifts and longitudinal relaxation times) give also insight into the electronic and magnetic properties of the [4Fe-4S] clusters. The rate of intramolecular electron transfer between the two [4FE-4S] clusters of ferredoxins has been measured by NMR. It is far slower in the case of Cv Fd than for shorter ferredoxins. The difference may be associated with changes in the magnetic and/or electronic properties of one cluster. The strong paramagnetism of the [4Fe-4S] clusters, which originally limited the applicability of NMR to proteins containing these cofactors, has been proven

  18. Ligand-protected gold clusters: the structure, synthesis and applications

    Science.gov (United States)

    Pichugina, D. A.; Kuz'menko, N. E.; Shestakov, A. F.

    2015-11-01

    Modern concepts of the structure and properties of atomic gold clusters protected by thiolate, selenolate, phosphine and phenylacetylene ligands are analyzed. Within the framework of the superatom theory, the 'divide and protect' approach and the structure rule, the stability and composition of a cluster are determined by the structure of the cluster core, the type of ligands and the total number of valence electrons. Methods of selective synthesis of gold clusters in solution and on the surface of inorganic composites based, in particular, on the reaction of Aun with RS, RSe, PhC≡C, Hal ligands or functional groups of proteins, on stabilization of clusters in cavities of the α-, β and γ-cyclodextrin molecules (Au15 and Au25) and on anchorage to a support surface (Au25/SiO2, Au20/C, Au10/FeOx) are reviewed. Problems in this field are also discussed. Among the methods for cluster structure prediction, particular attention is given to the theoretical approaches based on the density functional theory (DFT). The structures of a number of synthesized clusters are described using the results obtained by X-ray diffraction analysis and DFT calculations. A possible mechanism of formation of the SR(AuSR)n 'staple' units in the cluster shell is proposed. The structure and properties of bimetallic clusters MxAunLm (M=Pd, Pt, Ag, Cu) are discussed. The Pd or Pt atom is located at the centre of the cluster, whereas Ag and Cu atoms form bimetallic compounds in which the heteroatom is located on the surface of the cluster core or in the 'staple' units. The optical properties, fluorescence and luminescence of ligand-protected gold clusters originate from the quantum effects of the Au atoms in the cluster core and in the oligomeric SR(AuSR)x units in the cluster shell. Homogeneous and heterogeneous reactions catalyzed by atomic gold clusters are discussed in the context of the reaction mechanism and the nature of the active sites. The bibliography includes 345 references.

  19. Identification of Protein Complexes Using Weighted PageRank-Nibble Algorithm and Core-Attachment Structure.

    Science.gov (United States)

    Peng, Wei; Wang, Jianxin; Zhao, Bihai; Wang, Lusheng

    2015-01-01

    Protein complexes play a significant role in understanding the underlying mechanism of most cellular functions. Recently, many researchers have explored computational methods to identify protein complexes from protein-protein interaction (PPI) networks. One group of researchers focus on detecting local dense subgraphs which correspond to protein complexes by considering local neighbors. The drawback of this kind of approach is that the global information of the networks is ignored. Some methods such as Markov Clustering algorithm (MCL), PageRank-Nibble are proposed to find protein complexes based on random walk technique which can exploit the global structure of networks. However, these methods ignore the inherent core-attachment structure of protein complexes and treat adjacent node equally. In this paper, we design a weighted PageRank-Nibble algorithm which assigns each adjacent node with different probability, and propose a novel method named WPNCA to detect protein complex from PPI networks by using weighted PageRank-Nibble algorithm and core-attachment structure. Firstly, WPNCA partitions the PPI networks into multiple dense clusters by using weighted PageRank-Nibble algorithm. Then the cores of these clusters are detected and the rest of proteins in the clusters will be selected as attachments to form the final predicted protein complexes. The experiments on yeast data show that WPNCA outperforms the existing methods in terms of both accuracy and p-value. The software for WPNCA is available at "http://netlab.csu.edu.cn/bioinfomatics/weipeng/WPNCA/download.html".

  20. Cluster-cluster correlations and constraints on the correlation hierarchy

    Science.gov (United States)

    Hamilton, A. J. S.; Gott, J. R., III

    1988-01-01

    The hypothesis that galaxies cluster around clusters at least as strongly as they cluster around galaxies imposes constraints on the hierarchy of correlation amplitudes in hierachical clustering models. The distributions which saturate these constraints are the Rayleigh-Levy random walk fractals proposed by Mandelbrot; for these fractal distributions cluster-cluster correlations are all identically equal to galaxy-galaxy correlations. If correlation amplitudes exceed the constraints, as is observed, then cluster-cluster correlations must exceed galaxy-galaxy correlations, as is observed.

  1. A framework for classification of prokaryotic protein kinases.

    Directory of Open Access Journals (Sweden)

    Nidhi Tyagi

    Full Text Available BACKGROUND: Overwhelming majority of the Serine/Threonine protein kinases identified by gleaning archaeal and eubacterial genomes could not be classified into any of the well known Hanks and Hunter subfamilies of protein kinases. This is owing to the development of Hanks and Hunter classification scheme based on eukaryotic protein kinases which are highly divergent from their prokaryotic homologues. A large dataset of prokaryotic Serine/Threonine protein kinases recognized from genomes of prokaryotes have been used to develop a classification framework for prokaryotic Ser/Thr protein kinases. METHODOLOGY/PRINCIPAL FINDINGS: We have used traditional sequence alignment and phylogenetic approaches and clustered the prokaryotic kinases which represent 72 subfamilies with at least 4 members in each. Such a clustering enables classification of prokaryotic Ser/Thr kinases and it can be used as a framework to classify newly identified prokaryotic Ser/Thr kinases. After series of searches in a comprehensive sequence database we recognized that 38 subfamilies of prokaryotic protein kinases are associated to a specific taxonomic level. For example 4, 6 and 3 subfamilies have been identified that are currently specific to phylum proteobacteria, cyanobacteria and actinobacteria respectively. Similarly subfamilies which are specific to an order, sub-order, class, family and genus have also been identified. In addition to these, we also identify organism-diverse subfamilies. Members of these clusters are from organisms of different taxonomic levels, such as archaea, bacteria, eukaryotes and viruses. CONCLUSION/SIGNIFICANCE: Interestingly, occurrence of several taxonomic level specific subfamilies of prokaryotic kinases contrasts with classification of eukaryotic protein kinases in which most of the popular subfamilies of eukaryotic protein kinases occur diversely in several eukaryotes. Many prokaryotic Ser/Thr kinases exhibit a wide variety of modular

  2. CONSTRAINING CLUSTER PHYSICS WITH THE SHAPE OF X-RAY CLUSTERS: COMPARISON OF LOCAL X-RAY CLUSTERS VERSUS ΛCDM CLUSTERS

    International Nuclear Information System (INIS)

    Lau, Erwin T.; Nagai, Daisuke; Kravtsov, Andrey V.; Vikhlinin, Alexey; Zentner, Andrew R.

    2012-01-01

    Recent simulations of cluster formation have demonstrated that condensation of baryons into central galaxies during cluster formation can drive the shape of the gas distribution in galaxy clusters significantly rounder out to their virial radius. These simulations generally predict stellar fractions within cluster virial radii that are ∼2-3 times larger than the stellar masses deduced from observations. In this paper, we compare ellipticity profiles of simulated clusters performed with varying input physics (radiative cooling, star formation, and supernova feedback) to the cluster ellipticity profiles derived from Chandra and ROSAT observations, in an effort to constrain the fraction of gas that cools and condenses into the central galaxies within clusters. We find that local relaxed clusters have an average ellipticity of ε = 0.18 ± 0.05 in the radial range of 0.04 ≤ r/r 500 ≤ 1. At larger radii r > 0.1r 500 , the observed ellipticity profiles agree well with the predictions of non-radiative simulations. In contrast, the ellipticity profiles of simulated clusters that include dissipative gas physics deviate significantly from the observed ellipticity profiles at all radii. The dissipative simulations overpredict (underpredict) ellipticity in the inner (outer) regions of galaxy clusters. By comparing simulations with and without dissipative gas physics, we show that gas cooling causes the gas distribution to be more oblate in the central regions, but makes the outer gas distribution more spherical. We find that late-time gas cooling and star formation are responsible for the significantly oblate gas distributions in cluster cores, but the gas shapes outside of cluster cores are set primarily by baryon dissipation at high redshift (z ≥ 2). Our results indicate that the shapes of X-ray emitting gas in galaxy clusters, especially at large radii, can be used to place constraints on cluster gas physics, making it potential probes of the history of baryonic

  3. CHIPMUNK: A Virtual Synthesizable Small-Molecule Library for Medicinal Chemistry, Exploitable for Protein-Protein Interaction Modulators.

    Science.gov (United States)

    Humbeck, Lina; Weigang, Sebastian; Schäfer, Till; Mutzel, Petra; Koch, Oliver

    2018-03-20

    A common issue during drug design and development is the discovery of novel scaffolds for protein targets. On the one hand the chemical space of purchasable compounds is rather limited; on the other hand artificially generated molecules suffer from a grave lack of accessibility in practice. Therefore, we generated a novel virtual library of small molecules which are synthesizable from purchasable educts, called CHIPMUNK (CHemically feasible In silico Public Molecular UNiverse Knowledge base). Altogether, CHIPMUNK covers over 95 million compounds and encompasses regions of the chemical space that are not covered by existing databases. The coverage of CHIPMUNK exceeds the chemical space spanned by the Lipinski rule of five to foster the exploration of novel and difficult target classes. The analysis of the generated property space reveals that CHIPMUNK is well suited for the design of protein-protein interaction inhibitors (PPIIs). Furthermore, a recently developed structural clustering algorithm (StruClus) for big data was used to partition the sub-libraries into meaningful subsets and assist scientists to process the large amount of data. These clustered subsets also contain the target space based on ChEMBL data which was included during clustering. © 2018 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

  4. A Complex of Cas Proteins 5, 6, and 7 Is Required for the Biogenesis and Stability of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-derived RNAs (crRNAs) in Haloferax volcanii*

    Science.gov (United States)

    Brendel, Jutta; Stoll, Britta; Lange, Sita J.; Sharma, Kundan; Lenz, Christof; Stachler, Aris-Edda; Maier, Lisa-Katharina; Richter, Hagen; Nickel, Lisa; Schmitz, Ruth A.; Randau, Lennart; Allers, Thorsten; Urlaub, Henning; Backofen, Rolf; Marchfelder, Anita

    2014-01-01

    The clustered regularly interspaced short palindromic repeats/CRISPR-associated (CRISPR-Cas) system is a prokaryotic defense mechanism against foreign genetic elements. A plethora of CRISPR-Cas versions exist, with more than 40 different Cas protein families and several different molecular approaches to fight the invading DNA. One of the key players in the system is the CRISPR-derived RNA (crRNA), which directs the invader-degrading Cas protein complex to the invader. The CRISPR-Cas types I and III use the Cas6 protein to generate mature crRNAs. Here, we show that the Cas6 protein is necessary for crRNA production but that additional Cas proteins that form a CRISPR-associated complex for antiviral defense (Cascade)-like complex are needed for crRNA stability in the CRISPR-Cas type I-B system in Haloferax volcanii in vivo. Deletion of the cas6 gene results in the loss of mature crRNAs and interference. However, cells that have the complete cas gene cluster (cas1–8b) removed and are transformed with the cas6 gene are not able to produce and stably maintain mature crRNAs. crRNA production and stability is rescued only if cas5, -6, and -7 are present. Mutational analysis of the cas6 gene reveals three amino acids (His-41, Gly-256, and Gly-258) that are essential for pre-crRNA cleavage, whereas the mutation of two amino acids (Ser-115 and Ser-224) leads to an increase of crRNA amounts. This is the first systematic in vivo analysis of Cas6 protein variants. In addition, we show that the H. volcanii I-B system contains a Cascade-like complex with a Cas7, Cas5, and Cas6 core that protects the crRNA. PMID:24459147

  5. Molecular basis of the γ-aminobutyric acid A receptor α3 subunit interaction with the clustering protein gephyrin

    DEFF Research Database (Denmark)

    Tretter, Verena; Kerschner, Bernd; Milenkovic, Ivan

    2011-01-01

    The multifunctional scaffolding protein gephyrin is a key player in the formation of the postsynaptic scaffold at inhibitory synapses, clustering both inhibitory glycine receptors (GlyRs) and selected GABA(A) receptor (GABA(A)R) subtypes. We report a direct interaction between the GABA(A)R α3...... subunit and gephyrin, mapping reciprocal binding sites using mutagenesis, overlay, and yeast two-hybrid assays. This analysis reveals that critical determinants of this interaction are located in the motif FNIVGTTYPI in the GABA(A)R α3 M3-M4 domain and the motif SMDKAFITVL at the N terminus...... of the gephyrin E domain. GABA(A)R α3 gephyrin binding-site mutants were unable to co-localize with endogenous gephyrin in transfected hippocampal neurons, despite being able to traffic to the cell membrane and form functional benzodiazepine-responsive GABA(A)Rs in recombinant systems. Interestingly, motifs...

  6. Three Pseudomonas putida FNR Family Proteins with Different Sensitivities to O2.

    Science.gov (United States)

    Ibrahim, Susan A; Crack, Jason C; Rolfe, Matthew D; Borrero-de Acuña, José Manuel; Thomson, Andrew J; Le Brun, Nick E; Schobert, Max; Stapleton, Melanie R; Green, Jeffrey

    2015-07-03

    The Escherichia coli fumarate-nitrate reduction regulator (FNR) protein is the paradigm for bacterial O2-sensing transcription factors. However, unlike E. coli, some bacterial species possess multiple FNR proteins that presumably have evolved to fulfill distinct roles. Here, three FNR proteins (ANR, PP_3233, and PP_3287) from a single bacterial species, Pseudomonas putida KT2440, have been analyzed. Under anaerobic conditions, all three proteins had spectral properties resembling those of [4Fe-4S] proteins. The reactivity of the ANR [4Fe-4S] cluster with O2 was similar to that of E. coli FNR, and during conversion to the apo-protein, via a [2Fe-2S] intermediate, cluster sulfur was retained. Like ANR, reconstituted PP_3233 and PP_3287 were converted to [2Fe-2S] forms when exposed to O2, but their [4Fe-4S] clusters reacted more slowly. Transcription from an FNR-dependent promoter with a consensus FNR-binding site in P. putida and E. coli strains expressing only one FNR protein was consistent with the in vitro responses to O2. Taken together, the experimental results suggest that the local environments of the iron-sulfur clusters in the different P. putida FNR proteins influence their reactivity with O2, such that ANR resembles E. coli FNR and is highly responsive to low concentrations of O2, whereas PP_3233 and PP_3287 have evolved to be less sensitive to O2. © 2015 by The American Society for Biochemistry and Molecular Biology, Inc.

  7. Cluster analysis of obesity and asthma phenotypes.

    Directory of Open Access Journals (Sweden)

    E Rand Sutherland

    Full Text Available Asthma is a heterogeneous disease with variability among patients in characteristics such as lung function, symptoms and control, body weight, markers of inflammation, and responsiveness to glucocorticoids (GC. Cluster analysis of well-characterized cohorts can advance understanding of disease subgroups in asthma and point to unsuspected disease mechanisms. We utilized an hypothesis-free cluster analytical approach to define the contribution of obesity and related variables to asthma phenotype.In a cohort of clinical trial participants (n = 250, minimum-variance hierarchical clustering was used to identify clinical and inflammatory biomarkers important in determining disease cluster membership in mild and moderate persistent asthmatics. In a subset of participants, GC sensitivity was assessed via expression of GC receptor alpha (GCRα and induction of MAP kinase phosphatase-1 (MKP-1 expression by dexamethasone. Four asthma clusters were identified, with body mass index (BMI, kg/m(2 and severity of asthma symptoms (AEQ score the most significant determinants of cluster membership (F = 57.1, p<0.0001 and F = 44.8, p<0.0001, respectively. Two clusters were composed of predominantly obese individuals; these two obese asthma clusters differed from one another with regard to age of asthma onset, measures of asthma symptoms (AEQ and control (ACQ, exhaled nitric oxide concentration (F(ENO and airway hyperresponsiveness (methacholine PC(20 but were similar with regard to measures of lung function (FEV(1 (% and FEV(1/FVC, airway eosinophilia, IgE, leptin, adiponectin and C-reactive protein (hsCRP. Members of obese clusters demonstrated evidence of reduced expression of GCRα, a finding which was correlated with a reduced induction of MKP-1 expression by dexamethasoneObesity is an important determinant of asthma phenotype in adults. There is heterogeneity in expression of clinical and inflammatory biomarkers of asthma across obese individuals

  8. Platelet apoptosis by cld-induced glycoportein Ibα clustering

    DEFF Research Database (Denmark)

    van der Wal, Dianne E; Du, V X; Lo, KS

    2010-01-01

    Summary Background:  Cold-storage of platelets followed by rewarming induces changes in Glycoprotein (GP) Ibα-distribution indicative of receptor clustering and initiates thromboxane A2-formation. GPIbα is associated with 14-3-3 proteins, which contribute to GPIbα-signaling and in nucleated cells...

  9. Convex Clustering: An Attractive Alternative to Hierarchical Clustering

    Science.gov (United States)

    Chen, Gary K.; Chi, Eric C.; Ranola, John Michael O.; Lange, Kenneth

    2015-01-01

    The primary goal in cluster analysis is to discover natural groupings of objects. The field of cluster analysis is crowded with diverse methods that make special assumptions about data and address different scientific aims. Despite its shortcomings in accuracy, hierarchical clustering is the dominant clustering method in bioinformatics. Biologists find the trees constructed by hierarchical clustering visually appealing and in tune with their evolutionary perspective. Hierarchical clustering operates on multiple scales simultaneously. This is essential, for instance, in transcriptome data, where one may be interested in making qualitative inferences about how lower-order relationships like gene modules lead to higher-order relationships like pathways or biological processes. The recently developed method of convex clustering preserves the visual appeal of hierarchical clustering while ameliorating its propensity to make false inferences in the presence of outliers and noise. The solution paths generated by convex clustering reveal relationships between clusters that are hidden by static methods such as k-means clustering. The current paper derives and tests a novel proximal distance algorithm for minimizing the objective function of convex clustering. The algorithm separates parameters, accommodates missing data, and supports prior information on relationships. Our program CONVEXCLUSTER incorporating the algorithm is implemented on ATI and nVidia graphics processing units (GPUs) for maximal speed. Several biological examples illustrate the strengths of convex clustering and the ability of the proximal distance algorithm to handle high-dimensional problems. CONVEXCLUSTER can be freely downloaded from the UCLA Human Genetics web site at http://www.genetics.ucla.edu/software/ PMID:25965340

  10. The role of mitochondria in cellular iron-sulfur protein biogenesis and iron metabolism.

    Science.gov (United States)

    Lill, Roland; Hoffmann, Bastian; Molik, Sabine; Pierik, Antonio J; Rietzschel, Nicole; Stehling, Oliver; Uzarska, Marta A; Webert, Holger; Wilbrecht, Claudia; Mühlenhoff, Ulrich

    2012-09-01

    Mitochondria play a key role in iron metabolism in that they synthesize heme, assemble iron-sulfur (Fe/S) proteins, and participate in cellular iron regulation. Here, we review the latter two topics and their intimate connection. The mitochondrial Fe/S cluster (ISC) assembly machinery consists of 17 proteins that operate in three major steps of the maturation process. First, the cysteine desulfurase complex Nfs1-Isd11 as the sulfur donor cooperates with ferredoxin-ferredoxin reductase acting as an electron transfer chain, and frataxin to synthesize an [2Fe-2S] cluster on the scaffold protein Isu1. Second, the cluster is released from Isu1 and transferred toward apoproteins with the help of a dedicated Hsp70 chaperone system and the glutaredoxin Grx5. Finally, various specialized ISC components assist in the generation of [4Fe-4S] clusters and cluster insertion into specific target apoproteins. Functional defects of the core ISC assembly machinery are signaled to cytosolic or nuclear iron regulatory systems resulting in increased cellular iron acquisition and mitochondrial iron accumulation. In fungi, regulation is achieved by iron-responsive transcription factors controlling the expression of genes involved in iron uptake and intracellular distribution. They are assisted by cytosolic multidomain glutaredoxins which use a bound Fe/S cluster as iron sensor and additionally perform an essential role in intracellular iron delivery to target metalloproteins. In mammalian cells, the iron regulatory proteins IRP1, an Fe/S protein, and IRP2 act in a post-transcriptional fashion to adjust the cellular needs for iron. Thus, Fe/S protein biogenesis and cellular iron metabolism are tightly linked to coordinate iron supply and utilization. This article is part of a Special Issue entitled: Cell Biology of Metals. Copyright © 2012 Elsevier B.V. All rights reserved.

  11. Homo-FRET Imaging as a tool to quantify protein and lipid clustering

    NARCIS (Netherlands)

    Bader, A.N.; Hoetzl, S.; Hofman, E.G.; Voortman, J.; van Bergen en Henegouwen, P.M.P.; van Meer, G.; Gerritsen, H.C.

    2010-01-01

    Homo-FRET, Förster resonance energy transfer between identical fluorophores, can be conveniently measured by observing its effect on the fluorescence anisotropy. This review aims to summarize the possibilities of fluorescence anisotropy imaging techniques to investigate clustering of identical

  12. Cluster management.

    Science.gov (United States)

    Katz, R

    1992-11-01

    Cluster management is a management model that fosters decentralization of management, develops leadership potential of staff, and creates ownership of unit-based goals. Unlike shared governance models, there is no formal structure created by committees and it is less threatening for managers. There are two parts to the cluster management model. One is the formation of cluster groups, consisting of all staff and facilitated by a cluster leader. The cluster groups function for communication and problem-solving. The second part of the cluster management model is the creation of task forces. These task forces are designed to work on short-term goals, usually in response to solving one of the unit's goals. Sometimes the task forces are used for quality improvement or system problems. Clusters are groups of not more than five or six staff members, facilitated by a cluster leader. A cluster is made up of individuals who work the same shift. For example, people with job titles who work days would be in a cluster. There would be registered nurses, licensed practical nurses, nursing assistants, and unit clerks in the cluster. The cluster leader is chosen by the manager based on certain criteria and is trained for this specialized role. The concept of cluster management, criteria for choosing leaders, training for leaders, using cluster groups to solve quality improvement issues, and the learning process necessary for manager support are described.

  13. Protein domain organisation: adding order.

    Science.gov (United States)

    Kummerfeld, Sarah K; Teichmann, Sarah A

    2009-01-29

    Domains are the building blocks of proteins. During evolution, they have been duplicated, fused and recombined, to produce proteins with novel structures and functions. Structural and genome-scale studies have shown that pairs or groups of domains observed together in a protein are almost always found in only one N to C terminal order and are the result of a single recombination event that has been propagated by duplication of the multi-domain unit. Previous studies of domain organisation have used graph theory to represent the co-occurrence of domains within proteins. We build on this approach by adding directionality to the graphs and connecting nodes based on their relative order in the protein. Most of the time, the linear order of domains is conserved. However, using the directed graph representation we have identified non-linear features of domain organization that are over-represented in genomes. Recognising these patterns and unravelling how they have arisen may allow us to understand the functional relationships between domains and understand how the protein repertoire has evolved. We identify groups of domains that are not linearly conserved, but instead have been shuffled during evolution so that they occur in multiple different orders. We consider 192 genomes across all three kingdoms of life and use domain and protein annotation to understand their functional significance. To identify these features and assess their statistical significance, we represent the linear order of domains in proteins as a directed graph and apply graph theoretical methods. We describe two higher-order patterns of domain organisation: clusters and bi-directionally associated domain pairs and explore their functional importance and phylogenetic conservation. Taking into account the order of domains, we have derived a novel picture of global protein organization. We found that all genomes have a higher than expected degree of clustering and more domain pairs in forward and

  14. Protein domain organisation: adding order

    Directory of Open Access Journals (Sweden)

    Kummerfeld Sarah K

    2009-01-01

    Full Text Available Abstract Background Domains are the building blocks of proteins. During evolution, they have been duplicated, fused and recombined, to produce proteins with novel structures and functions. Structural and genome-scale studies have shown that pairs or groups of domains observed together in a protein are almost always found in only one N to C terminal order and are the result of a single recombination event that has been propagated by duplication of the multi-domain unit. Previous studies of domain organisation have used graph theory to represent the co-occurrence of domains within proteins. We build on this approach by adding directionality to the graphs and connecting nodes based on their relative order in the protein. Most of the time, the linear order of domains is conserved. However, using the directed graph representation we have identified non-linear features of domain organization that are over-represented in genomes. Recognising these patterns and unravelling how they have arisen may allow us to understand the functional relationships between domains and understand how the protein repertoire has evolved. Results We identify groups of domains that are not linearly conserved, but instead have been shuffled during evolution so that they occur in multiple different orders. We consider 192 genomes across all three kingdoms of life and use domain and protein annotation to understand their functional significance. To identify these features and assess their statistical significance, we represent the linear order of domains in proteins as a directed graph and apply graph theoretical methods. We describe two higher-order patterns of domain organisation: clusters and bi-directionally associated domain pairs and explore their functional importance and phylogenetic conservation. Conclusion Taking into account the order of domains, we have derived a novel picture of global protein organization. We found that all genomes have a higher than expected

  15. Protein-gold clusters-capped mesoporous silica nanoparticles for high drug loading, autonomous gemcitabine/doxorubicin co-delivery, and in-vivo tumor imaging

    KAUST Repository

    Croissant, Jonas G.; Zhang, Dingyuan; Alsaiari, Shahad K.; Lu, Jie; Deng, Lin; Tamanoi, Fuyuhiko; Zink, Jeffrey I.; Khashab, Niveen M.

    2016-01-01

    Functional nanocarriers capable of transporting high drug contents without premature leakage and to controllably deliver several drugs are needed for better cancer treatments. To address this clinical need, gold cluster bovine serum albumin (AuNC@BSA) nanogates were engineered on mesoporous silica nanoparticles (MSN) for high drug loadings and co-delivery of two different anticancer drugs. The first drug, gemcitabine (GEM, 40 wt%), was loaded in positively-charged ammonium-functionalized MSN (MSN-NH3+). The second drug, doxorubicin (DOX, 32 wt%), was bound with negatively-charged AuNC@BSA electrostatically-attached onto MSN-NH3+, affording highly loaded pH-responsive MSN-AuNC@BSA nanocarriers. The co-delivery of DOX and GEM was achieved for the first time via an inorganic nanocarrier, possessing a zero-premature leakage behavior as well as drug loading capacities seven times higher than polymersome NPs. Besides, unlike the majority of strategies used to cap the pores of MSN, AuNC@BSA nanogates are biotools and were applied for targeted red nuclear staining and in-vivo tumor imaging. The straightforward non-covalent combination of MSN and gold-protein cluster bioconjugates thus leads to a simple, yet multifunctional nanotheranostic for the next generation of cancer treatments.

  16. Protein-gold clusters-capped mesoporous silica nanoparticles for high drug loading, autonomous gemcitabine/doxorubicin co-delivery, and in-vivo tumor imaging

    KAUST Repository

    Croissant, Jonas G.

    2016-03-23

    Functional nanocarriers capable of transporting high drug contents without premature leakage and to controllably deliver several drugs are needed for better cancer treatments. To address this clinical need, gold cluster bovine serum albumin (AuNC@BSA) nanogates were engineered on mesoporous silica nanoparticles (MSN) for high drug loadings and co-delivery of two different anticancer drugs. The first drug, gemcitabine (GEM, 40 wt%), was loaded in positively-charged ammonium-functionalized MSN (MSN-NH3+). The second drug, doxorubicin (DOX, 32 wt%), was bound with negatively-charged AuNC@BSA electrostatically-attached onto MSN-NH3+, affording highly loaded pH-responsive MSN-AuNC@BSA nanocarriers. The co-delivery of DOX and GEM was achieved for the first time via an inorganic nanocarrier, possessing a zero-premature leakage behavior as well as drug loading capacities seven times higher than polymersome NPs. Besides, unlike the majority of strategies used to cap the pores of MSN, AuNC@BSA nanogates are biotools and were applied for targeted red nuclear staining and in-vivo tumor imaging. The straightforward non-covalent combination of MSN and gold-protein cluster bioconjugates thus leads to a simple, yet multifunctional nanotheranostic for the next generation of cancer treatments.

  17. Lipid-protein interaction induced domains: Kinetics and conformational changes in multicomponent vesicles

    Science.gov (United States)

    Sreeja, K. K.; Sunil Kumar, P. B.

    2018-04-01

    The spatio-temporal organization of proteins and the associated morphological changes in membranes are of importance in cell signaling. Several mechanisms that promote the aggregation of proteins at low cell surface concentrations have been investigated in the past. We show, using Monte Carlo simulations, that the affinity of proteins for specific lipids can hasten their aggregation kinetics. The lipid membrane is modeled as a dynamically triangulated surface with the proteins defined as in-plane fields at the vertices. We show that, even at low protein concentrations, strong lipid-protein interactions can result in large protein clusters indicating a route to lipid mediated signal amplification. At high protein concentrations, the domains form buds similar to that seen in lipid-lipid interaction induced phase separation. Protein interaction induced domain budding is suppressed when proteins act as anisotropic inclusions and exhibit nematic orientational order. The kinetics of protein clustering and resulting conformational changes are shown to be significantly different for the isotropic and anisotropic curvature inducing proteins.

  18. Nonclassical nucleation pathways in protein crystallization.

    Science.gov (United States)

    Zhang, Fajun

    2017-11-08

    Classical nucleation theory (CNT), which was established about 90 years ago, has been very successful in many research fields, and continues to be the most commonly used theory in describing the nucleation process. For a fluid-to-solid phase transition, CNT states that the solute molecules in a supersaturated solution reversibly form small clusters. Once the cluster size reaches a critical value, it becomes thermodynamically stable and favored for further growth. One of the most important assumptions of CNT is that the nucleation process is described by one reaction coordinate and all order parameters proceed simultaneously. Recent studies in experiments, computer simulations and theory have revealed nonclassical features in the early stage of nucleation. In particular, the decoupling of order parameters involved during a fluid-to-solid transition leads to the so-called two-step nucleation mechanism, in which a metastable intermediate phase (MIP) exists between the initial supersaturated solution and the final crystals. Depending on the exact free energy landscapes, the MIPs can be a high density liquid phase, mesoscopic clusters, or a pre-ordered state. In this review, we focus on the studies of nonclassical pathways in protein crystallization and discuss the applications of the various scenarios of two-step nucleation theory. In particular, we focus on protein solutions in the presence of multivalent salts, which serve as a model protein system to study the nucleation pathways. We wish to point out the unique features of proteins as model systems for further studies.

  19. Nonclassical nucleation pathways in protein crystallization

    Science.gov (United States)

    Zhang, Fajun

    2017-11-01

    Classical nucleation theory (CNT), which was established about 90 years ago, has been very successful in many research fields, and continues to be the most commonly used theory in describing the nucleation process. For a fluid-to-solid phase transition, CNT states that the solute molecules in a supersaturated solution reversibly form small clusters. Once the cluster size reaches a critical value, it becomes thermodynamically stable and favored for further growth. One of the most important assumptions of CNT is that the nucleation process is described by one reaction coordinate and all order parameters proceed simultaneously. Recent studies in experiments, computer simulations and theory have revealed nonclassical features in the early stage of nucleation. In particular, the decoupling of order parameters involved during a fluid-to-solid transition leads to the so-called two-step nucleation mechanism, in which a metastable intermediate phase (MIP) exists between the initial supersaturated solution and the final crystals. Depending on the exact free energy landscapes, the MIPs can be a high density liquid phase, mesoscopic clusters, or a pre-ordered state. In this review, we focus on the studies of nonclassical pathways in protein crystallization and discuss the applications of the various scenarios of two-step nucleation theory. In particular, we focus on protein solutions in the presence of multivalent salts, which serve as a model protein system to study the nucleation pathways. We wish to point out the unique features of proteins as model systems for further studies.

  20. Electrospray Ionization Mass Spectrometry: From Cluster Ions to Toxic metal Ions in Biology

    Energy Technology Data Exchange (ETDEWEB)

    Lentz, Nicholas B. [Iowa State Univ., Ames, IA (United States)

    2007-01-01

    This dissertation focused on using electrospray ionization mass spectrometry to study cluster ions and toxic metal ions in biology. In Chapter 2, it was shown that primary, secondary and quarternary amines exhibit different clustering characteristics under identical instrument conditions. Carbon chain length also played a role in cluster ion formation. In Chapters 3 and 4, the effects of solvent types/ratios and various instrumental parameters on cluster ion formation were examined. It was found that instrument interface design also plays a critical role in the cluster ion distribution seen in the mass spectrum. In Chapter 5, ESI-MS was used to investigate toxic metal binding to the [Gln11]-amyloid β-protein fragment (1-16). Pb and Cd bound stronger than Zn, even in the presence of excess Zn. Hg bound weaker than Zn. There are endless options for future work on cluster ions. Any molecule that is poorly ionized in positive ion mode can potentially show an increase in ionization efficiency if an appropriate anion is used to produce a net negative charge. It is possible that drug protein or drug/DNA complexes can also be stabilized by adding counter-ions. This would preserve the solution characteristics of the complex in the gas phase. Once in the gas phase, CID could determine the drug binding location on the biomolecule. There are many research projects regarding toxic metals in biology that have yet to be investigated or even discovered. This is an area of research with an almost endless future because of the changing dynamics of biological systems. What is deemed safe today may show toxic effects in the future. Evolutionary changes in protein structures may render them more susceptible to toxic metal binding. As the understanding of toxicity evolves, so does the demand for new toxic metal research. New instrumentation designs and software make it possible to perform research that could not be done in the past. What was undetectable yesterday will

  1. Lifting to cluster-tilting objects in higher cluster categories

    OpenAIRE

    Liu, Pin

    2008-01-01

    In this note, we consider the $d$-cluster-tilted algebras, the endomorphism algebras of $d$-cluster-tilting objects in $d$-cluster categories. We show that a tilting module over such an algebra lifts to a $d$-cluster-tilting object in this $d$-cluster category.

  2. Domain organizations of modular extracellular matrix proteins and their evolution.

    Science.gov (United States)

    Engel, J

    1996-11-01

    Multidomain proteins which are composed of modular units are a rather recent invention of evolution. Domains are defined as autonomously folding regions of a protein, and many of them are similar in sequence and structure, indicating common ancestry. Their modular nature is emphasized by frequent repetitions in identical or in different proteins and by a large number of different combinations with other domains. The extracellular matrix is perhaps the largest biological system composed of modular mosaic proteins, and its astonishing complexity and diversity are based on them. A cluster of minireviews on modular proteins is being published in Matrix Biology. These deal with the evolution of modular proteins, the three-dimensional structure of domains and the ways in which these interact in a multidomain protein. They discuss structure-function relationships in calcium binding domains, collagen helices, alpha-helical coiled-coil domains and C-lectins. The present minireview is focused on some general aspects and serves as an introduction to the cluster.

  3. Data Clustering

    Science.gov (United States)

    Wagstaff, Kiri L.

    2012-03-01

    On obtaining a new data set, the researcher is immediately faced with the challenge of obtaining a high-level understanding from the observations. What does a typical item look like? What are the dominant trends? How many distinct groups are included in the data set, and how is each one characterized? Which observable values are common, and which rarely occur? Which items stand out as anomalies or outliers from the rest of the data? This challenge is exacerbated by the steady growth in data set size [11] as new instruments push into new frontiers of parameter space, via improvements in temporal, spatial, and spectral resolution, or by the desire to "fuse" observations from different modalities and instruments into a larger-picture understanding of the same underlying phenomenon. Data clustering algorithms provide a variety of solutions for this task. They can generate summaries, locate outliers, compress data, identify dense or sparse regions of feature space, and build data models. It is useful to note up front that "clusters" in this context refer to groups of items within some descriptive feature space, not (necessarily) to "galaxy clusters" which are dense regions in physical space. The goal of this chapter is to survey a variety of data clustering methods, with an eye toward their applicability to astronomical data analysis. In addition to improving the individual researcher’s understanding of a given data set, clustering has led directly to scientific advances, such as the discovery of new subclasses of stars [14] and gamma-ray bursts (GRBs) [38]. All clustering algorithms seek to identify groups within a data set that reflect some observed, quantifiable structure. Clustering is traditionally an unsupervised approach to data analysis, in the sense that it operates without any direct guidance about which items should be assigned to which clusters. There has been a recent trend in the clustering literature toward supporting semisupervised or constrained

  4. Bacillus sp.CDB3 isolated from cattle dip-sites possesses two ars gene clusters

    Institute of Scientific and Technical Information of China (English)

    Somanath Bhat; Xi Luo; Zhiqiang Xu; Lixia Liu; Ren Zhang

    2011-01-01

    Contamination of soil and water by arsenic is a global problem.In Australia, the dipping of cattle in arsenic-containing solution to control cattle ticks in last centenary has left many sites heavily contaminated with arsenic and other toxicants.We had previously isolated five soil bacterial strains (CDB1-5) highly resistant to arsenic.To understand the resistance mechanism, molecular studies have been carried out.Two chromosome-encoded arsenic resistance (ars) gene clusters have been cloned from CDB3 (Bacillus sp.).They both function in Escherichia coli and cluster 1 exerts a much higher resistance to the toxic metalloid.Cluster 2 is smaller possessing four open reading frames (ORFs) arsRorf2BC, similar to that identified in Bacillus subtilis Skin element.Among the eight ORFs in cluster 1 five are analogs of common ars genes found in other bacteria, however, organized in a unique order arsRBCDA instead of arsRDABC.Three other putative genes are located directly downstream and designated as arsTIP based on the homologies of their theoretical translation sequences respectively to thioredoxin reductases, iron-sulphur cluster proteins and protein phosphatases.The latter two are novel of any known ars operons.The arsD gene from Bacillus species was cloned for the first time and the predict protein differs from the well studied E.coli ArsD by lacking two pairs of C-terrninal cysteine residues.Its functional involvement in arsenic resistance has been confirmed by a deletion experiment.There exists also an inverted repeat in the intergenic region between arsC and arsD implying some unknown transcription regulation.

  5. Cell-free H-cluster synthesis and [FeFe] hydrogenase activation: all five CO and CN⁻ ligands derive from tyrosine.

    Directory of Open Access Journals (Sweden)

    Jon M Kuchenreuther

    Full Text Available [FeFe] hydrogenases are promising catalysts for producing hydrogen as a sustainable fuel and chemical feedstock, and they also serve as paradigms for biomimetic hydrogen-evolving compounds. Hydrogen formation is catalyzed by the H-cluster, a unique iron-based cofactor requiring three carbon monoxide (CO and two cyanide (CN⁻ ligands as well as a dithiolate bridge. Three accessory proteins (HydE, HydF, and HydG are presumably responsible for assembling and installing the H-cluster, yet their precise roles and the biosynthetic pathway have yet to be fully defined. In this report, we describe effective cell-free methods for investigating H-cluster synthesis and [FeFe] hydrogenase activation. Combining isotopic labeling with FTIR spectroscopy, we conclusively show that each of the CO and CN⁻ ligands derive respectively from the carboxylate and amino substituents of tyrosine. Such in vitro systems with reconstituted pathways comprise a versatile approach for studying biosynthetic mechanisms, and this work marks a significant step towards an understanding of both the protein-protein interactions and complex reactions required for H-cluster assembly and hydrogenase maturation.

  6. Genomic organization, tissue distribution and functional characterization of the rat Pate gene cluster.

    Directory of Open Access Journals (Sweden)

    Angireddy Rajesh

    Full Text Available The cysteine rich prostate and testis expressed (Pate proteins identified till date are thought to resemble the three fingered protein/urokinase-type plasminogen activator receptor proteins. In this study, for the first time, we report the identification, cloning and characterization of rat Pate gene cluster and also determine the expression pattern. The rat Pate genes are clustered on chromosome 8 and their predicted proteins retained the ten cysteine signature characteristic to TFP/Ly-6 protein family. PATE and PATE-F three dimensional protein structure was found to be similar to that of the toxin bucandin. Though Pate gene expression is thought to be prostate and testis specific, we observed that rat Pate genes are also expressed in seminal vesicle and epididymis and in tissues beyond the male reproductive tract. In the developing rats (20-60 day old, expression of Pate genes seem to be androgen dependent in the epididymis and testis. In the adult rat, androgen ablation resulted in down regulation of the majority of Pate genes in the epididymides. PATE and PATE-F proteins were found to be expressed abundantly in the male reproductive tract of rats and on the sperm. Recombinant PATE protein exhibited potent antibacterial activity, whereas PATE-F did not exhibit any antibacterial activity. Pate expression was induced in the epididymides when challenged with LPS. Based on our results, we conclude that rat PATE proteins may contribute to the reproductive and defense functions.

  7. Cell surface clustering of Cadherin adhesion complex induced by antibody coated beads

    Institute of Scientific and Technical Information of China (English)

    2000-01-01

    Cadherin receptors mediate cell-cell adhesion, signal transduction and assembly of cytoskeletons. How a single transmembrane molecule Cadherin can be involved in multiple functions through modulating its binding activities with many membrane adhesion molecules and cytoskeletal components is an unanswered question which can be elucidated by clues from bead experiments. Human lung cells expressing N-Cadherin were examined. After co-incubation with anti-N-Cadherin monoclonal antibody coated beads, cell surface clustering of N-Cadherin was induced. Immunofluorescent detection demonstrated that in addition to Cadherin, β-Catenin, α-Catenin, α-Actinin and Actin fluorescence also aggregated respectively at the membrane site of bead attachment. Myosin heavy chain (MHC), another major component of Actin cytoskeleton, did not aggregate at the membrane site of bead attachment. Adhesion unrelated protein Con A and polylysine conjugated beads did not induce the clustering of adhesion molecules. It is indicated that the Cadherin/Catenins/α-Actinin/Actin complex is formed at Cadherin mediated cell adherens junction; occupancy and cell surface clustering of Cadherin is crucial for the formation of Cadherin adhesion protein complexes.

  8. Inhibition of protein aggregation: supramolecular assemblies of arginine hold the key.

    Directory of Open Access Journals (Sweden)

    Utpal Das

    Full Text Available BACKGROUND: Aggregation of unfolded proteins occurs mainly through the exposed hydrophobic surfaces. Any mechanism of inhibition of this aggregation should explain the prevention of these hydrophobic interactions. Though arginine is prevalently used as an aggregation suppressor, its mechanism of action is not clearly understood. We propose a mechanism based on the hydrophobic interactions of arginine. METHODOLOGY: We have analyzed arginine solution for its hydrotropic effect by pyrene solubility and the presence of hydrophobic environment by 1-anilino-8-naphthalene sulfonic acid fluorescence. Mass spectroscopic analyses show that arginine forms molecular clusters in the gas phase and the cluster composition is dependent on the solution conditions. Light scattering studies indicate that arginine exists as clusters in solution. In the presence of arginine, the reverse phase chromatographic elution profile of Alzheimer's amyloid beta 1-42 (Abeta(1-42 peptide is modified. Changes in the hydrodynamic volume of Abeta(1-42 in the presence of arginine measured by size exclusion chromatography show that arginine binds to Abeta(1-42. Arginine increases the solubility of Abeta(1-42 peptide in aqueous medium. It decreases the aggregation of Abeta(1-42 as observed by atomic force microscopy. CONCLUSIONS: Based on our experimental results we propose that molecular clusters of arginine in aqueous solutions display a hydrophobic surface by the alignment of its three methylene groups. The hydrophobic surfaces present on the proteins interact with the hydrophobic surface presented by the arginine clusters. The masking of hydrophobic surface inhibits protein-protein aggregation. This mechanism is also responsible for the hydrotropic effect of arginine on various compounds. It is also explained why other amino acids fail to inhibit the protein aggregation.

  9. Activation of protein kinase A and clustering of cell surface receptors by N-methyl-N'-nitro-N-nitrosoguanidine are independent of genomic DNA damage

    Energy Technology Data Exchange (ETDEWEB)

    Wang Zheng; Wang Guliang; Yang Jun; Guo Lei; Yu Yingnian

    2003-07-25

    Alkylating agent N-methyl-N'-nitro-N-nitrosoguanidine (MNNG) induces cellular stress leading to chromosomal aberrations, mutations and cell death. Previous reports from our laboratory have shown that low concentration of MNNG induces untargeted mutation (UTM), which occurs on intact DNA in mammalian cells through changes in gene expression profile. It also causes the activation of cAMP-protein kinase A (PKA) and up-regulation of POL-{beta}, which is demonstrated to play a role in DNA repair system. In order to find out the possible initial signal involved in UTM, we try to investigate whether the activation of PKA-CREB signal pathway is closely related to DNA damage. Our data shows that the treatment of low concentration MNNG (0.2 {mu}M) activates PKA-CREB pathway in a comparable level both in a nuclear and enucleated cell system. And similar to the cell response caused by UV, the clustering of cell surface receptors of epidermal growth factor (EGF) and tumor necrosis factor {alpha} (TNF{alpha}) was also observed in cells exposed to MNNG. It was further demonstrated that the clustering of the surface receptors is independent of the genomic DNA damage, as this phenomenon was also observed in enucleated cells. These observations indicate that the initiation of signal cascades induced by low concentration of MNNG might be associated with its interaction with cell surface receptors and/or direct activation of related signal proteins but not its DNA damaging property.

  10. Dense Fe cluster-assembled films by energetic cluster deposition

    International Nuclear Information System (INIS)

    Peng, D.L.; Yamada, H.; Hihara, T.; Uchida, T.; Sumiyama, K.

    2004-01-01

    High-density Fe cluster-assembled films were produced at room temperature by an energetic cluster deposition. Though cluster-assemblies are usually sooty and porous, the present Fe cluster-assembled films are lustrous and dense, revealing a soft magnetic behavior. Size-monodispersed Fe clusters with the mean cluster size d=9 nm were synthesized using a plasma-gas-condensation technique. Ionized clusters are accelerated electrically and deposited onto the substrate together with neutral clusters from the same cluster source. Packing fraction and saturation magnetic flux density increase rapidly and magnetic coercivity decreases remarkably with increasing acceleration voltage. The Fe cluster-assembled film obtained at the acceleration voltage of -20 kV has a packing fraction of 0.86±0.03, saturation magnetic flux density of 1.78±0.05 Wb/m 2 , and coercivity value smaller than 80 A/m. The resistivity at room temperature is ten times larger than that of bulk Fe metal

  11. Nutrient Shielding in Clusters of Cells

    Science.gov (United States)

    Lavrentovich, Maxim O.; Koschwanez, John H.; Nelson, David R.

    2014-01-01

    Cellular nutrient consumption is influenced by both the nutrient uptake kinetics of an individual cell and the cells’ spatial arrangement. Large cell clusters or colonies have inhibited growth at the cluster's center due to the shielding of nutrients by the cells closer to the surface. We develop an effective medium theory that predicts a thickness ℓ of the outer shell of cells in the cluster that receives enough nutrient to grow. The cells are treated as partially absorbing identical spherical nutrient sinks, and we identify a dimensionless parameter ν that characterizes the absorption strength of each cell. The parameter ν can vary over many orders of magnitude between different cell types, ranging from bacteria and yeast to human tissue. The thickness ℓ decreases with increasing ν, increasing cell volume fraction ϕ, and decreasing ambient nutrient concentration ψ∞. The theoretical results are compared with numerical simulations and experiments. In the latter studies, colonies of budding yeast, Saccharomyces cerevisiae, are grown on glucose media and imaged under a confocal microscope. We measure the growth inside the colonies via a fluorescent protein reporter and compare the experimental and theoretical results for the thickness ℓ. PMID:23848711

  12. Cluster Physics with Merging Galaxy Clusters

    Directory of Open Access Journals (Sweden)

    Sandor M. Molnar

    2016-02-01

    Full Text Available Collisions between galaxy clusters provide a unique opportunity to study matter in a parameter space which cannot be explored in our laboratories on Earth. In the standard LCDM model, where the total density is dominated by the cosmological constant ($Lambda$ and the matter density by cold dark matter (CDM, structure formation is hierarchical, and clusters grow mostly by merging.Mergers of two massive clusters are the most energetic events in the universe after the Big Bang,hence they provide a unique laboratory to study cluster physics.The two main mass components in clusters behave differently during collisions:the dark matter is nearly collisionless, responding only to gravity, while the gas is subject to pressure forces and dissipation, and shocks and turbulenceare developed during collisions. In the present contribution we review the different methods used to derive the physical properties of merging clusters. Different physical processes leave their signatures on different wavelengths, thusour review is based on a multifrequency analysis. In principle, the best way to analyze multifrequency observations of merging clustersis to model them using N-body/HYDRO numerical simulations. We discuss the results of such detailed analyses.New high spatial and spectral resolution ground and space based telescopeswill come online in the near future. Motivated by these new opportunities,we briefly discuss methods which will be feasible in the near future in studying merging clusters.

  13. Transcriptional analysis of ESAT-6 cluster 3 in Mycobacterium smegmatis

    Directory of Open Access Journals (Sweden)

    Riccardi Giovanna

    2009-03-01

    Full Text Available Abstract Background The ESAT-6 (early secreted antigenic target, 6 kDa family collects small mycobacterial proteins secreted by Mycobacterium tuberculosis, particularly in the early phase of growth. There are 23 ESAT-6 family members in M. tuberculosis H37Rv. In a previous work, we identified the Zur- dependent regulation of five proteins of the ESAT-6/CFP-10 family (esxG, esxH, esxQ, esxR, and esxS. esxG and esxH are part of ESAT-6 cluster 3, whose expression was already known to be induced by iron starvation. Results In this research, we performed EMSA experiments and transcriptional analysis of ESAT-6 cluster 3 in Mycobacterium smegmatis (msmeg0615-msmeg0625 and M. tuberculosis. In contrast to what we had observed in M. tuberculosis, we found that in M. smegmatis ESAT-6 cluster 3 responds only to iron and not to zinc. In both organisms we identified an internal promoter, a finding which suggests the presence of two transcriptional units and, by consequence, a differential expression of cluster 3 genes. We compared the expression of msmeg0615 and msmeg0620 in different growth and stress conditions by means of relative quantitative PCR. The expression of msmeg0615 and msmeg0620 genes was essentially similar; they appeared to be repressed in most of the tested conditions, with the exception of acid stress (pH 4.2 where msmeg0615 was about 4-fold induced, while msmeg0620 was repressed. Analysis revealed that in acid stress conditions M. tuberculosis rv0282 gene was 3-fold induced too, while rv0287 induction was almost insignificant. Conclusion In contrast with what has been reported for M. tuberculosis, our results suggest that in M. smegmatis only IdeR-dependent regulation is retained, while zinc has no effect on gene expression. The role of cluster 3 in M. tuberculosis virulence is still to be defined; however, iron- and zinc-dependent expression strongly suggests that cluster 3 is highly expressed in the infective process, and that the cluster

  14. Ligand-protected gold clusters: the structure, synthesis and applications

    International Nuclear Information System (INIS)

    Pichugina, D A; Kuz'menko, N E; Shestakov, A F

    2015-01-01

    Modern concepts of the structure and properties of atomic gold clusters protected by thiolate, selenolate, phosphine and phenylacetylene ligands are analyzed. Within the framework of the superatom theory, the 'divide and protect' approach and the structure rule, the stability and composition of a cluster are determined by the structure of the cluster core, the type of ligands and the total number of valence electrons. Methods of selective synthesis of gold clusters in solution and on the surface of inorganic composites based, in particular, on the reaction of Au n with RS, RSe, PhC≡C, Hal ligands or functional groups of proteins, on stabilization of clusters in cavities of the α-, β and γ-cyclodextrin molecules (Au 15 and Au 25 ) and on anchorage to a support surface (Au 25 /SiO 2 , Au 20 /C, Au 10 /FeO x ) are reviewed. Problems in this field are also discussed. Among the methods for cluster structure prediction, particular attention is given to the theoretical approaches based on the density functional theory (DFT). The structures of a number of synthesized clusters are described using the results obtained by X-ray diffraction analysis and DFT calculations. A possible mechanism of formation of the SR(AuSR) n 'staple' units in the cluster shell is proposed. The structure and properties of bimetallic clusters M x Au n L m (M=Pd, Pt, Ag, Cu) are discussed. The Pd or Pt atom is located at the centre of the cluster, whereas Ag and Cu atoms form bimetallic compounds in which the heteroatom is located on the surface of the cluster core or in the 'staple' units. The optical properties, fluorescence and luminescence of ligand-protected gold clusters originate from the quantum effects of the Au atoms in the cluster core and in the oligomeric SR(AuSR) x units in the cluster shell. Homogeneous and heterogeneous reactions catalyzed by atomic gold clusters are discussed in the context of the reaction mechanism and the nature of the active

  15. OrthoVenn: a web server for genome wide comparison and annotation of orthologous clusters across multiple species

    Science.gov (United States)

    Genome wide analysis of orthologous clusters is an important component of comparative genomics studies. Identifying the overlap among orthologous clusters can enable us to elucidate the function and evolution of proteins across multiple species. Here, we report a web platform named OrthoVenn that i...

  16. Essential multimeric enzymes in kinetoplastid parasites: A host of potentially druggable protein-protein interactions.

    Science.gov (United States)

    Wachsmuth, Leah M; Johnson, Meredith G; Gavenonis, Jason

    2017-06-01

    Parasitic diseases caused by kinetoplastid parasites of the genera Trypanosoma and Leishmania are an urgent public health crisis in the developing world. These closely related species possess a number of multimeric enzymes in highly conserved pathways involved in vital functions, such as redox homeostasis and nucleotide synthesis. Computational alanine scanning of these protein-protein interfaces has revealed a host of potentially ligandable sites on several established and emerging anti-parasitic drug targets. Analysis of interfaces with multiple clustered hotspots has suggested several potentially inhibitable protein-protein interactions that may have been overlooked by previous large-scale analyses focusing solely on secondary structure. These protein-protein interactions provide a promising lead for the development of new peptide and macrocycle inhibitors of these enzymes.

  17. Hemoglobin–Albumin Cluster Incorporating a Pt Nanoparticle: Artificial O2 Carrier with Antioxidant Activities

    Science.gov (United States)

    Hosaka, Hitomi; Haruki, Risa; Yamada, Kana; Böttcher, Christoph; Komatsu, Teruyuki

    2014-01-01

    A covalent core–shell structured protein cluster composed of hemoglobin (Hb) at the center and human serum albumins (HSA) at the periphery, Hb-HSAm, is an artificial O2 carrier that can function as a red blood cell substitute. Here we described the preparation of a novel Hb-HSA3 cluster with antioxidant activities and its O2 complex stable in aqueous H2O2 solution. We used an approach of incorporating a Pt nanoparticle (PtNP) into the exterior HSA unit of the cluster. A citrate reduced PtNP (1.8 nm diameter) was bound tightly within the cleft of free HSA with a binding constant (K) of 1.1×107 M−1, generating a stable HSA-PtNP complex. This platinated protein showed high catalytic activities for dismutations of superoxide radical anions (O2 •–) and hydrogen peroxide (H2O2), i.e., superoxide dismutase and catalase activities. Also, Hb-HSA3 captured PtNP into the external albumin unit (K = 1.1×107 M−1), yielding an Hb-HSA3(PtNP) cluster. The association of PtNP caused no alteration of the protein surface net charge and O2 binding affinity. The peripheral HSA-PtNP shell prevents oxidation of the core Hb, which enables the formation of an extremely stable O2 complex, even in H2O2 solution. PMID:25310133

  18. Enhanced conformational sampling to visualize a free-energy landscape of protein complex formation.

    Science.gov (United States)

    Iida, Shinji; Nakamura, Haruki; Higo, Junichi

    2016-06-15

    We introduce various, recently developed, generalized ensemble methods, which are useful to sample various molecular configurations emerging in the process of protein-protein or protein-ligand binding. The methods introduced here are those that have been or will be applied to biomolecular binding, where the biomolecules are treated as flexible molecules expressed by an all-atom model in an explicit solvent. Sampling produces an ensemble of conformations (snapshots) that are thermodynamically probable at room temperature. Then, projection of those conformations to an abstract low-dimensional space generates a free-energy landscape. As an example, we show a landscape of homo-dimer formation of an endothelin-1-like molecule computed using a generalized ensemble method. The lowest free-energy cluster at room temperature coincided precisely with the experimentally determined complex structure. Two minor clusters were also found in the landscape, which were largely different from the native complex form. Although those clusters were isolated at room temperature, with rising temperature a pathway emerged linking the lowest and second-lowest free-energy clusters, and a further temperature increment connected all the clusters. This exemplifies that the generalized ensemble method is a powerful tool for computing the free-energy landscape, by which one can discuss the thermodynamic stability of clusters and the temperature dependence of the cluster networks. © 2016 The Author(s).

  19. Crystal Structure of Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR)-associated Csn2 Protein Revealed Ca[superscript 2+]-dependent Double-stranded DNA Binding Activity

    Energy Technology Data Exchange (ETDEWEB)

    Nam, Ki Hyun; Kurinov, Igor; Ke, Ailong (Cornell); (NWU)

    2012-05-22

    Clustered regularly interspaced short palindromic repeats (CRISPR) and their associated protein genes (cas genes) are widespread in bacteria and archaea. They form a line of RNA-based immunity to eradicate invading bacteriophages and malicious plasmids. A key molecular event during this process is the acquisition of new spacers into the CRISPR loci to guide the selective degradation of the matching foreign genetic elements. Csn2 is a Nmeni subtype-specific cas gene required for new spacer acquisition. Here we characterize the Enterococcus faecalis Csn2 protein as a double-stranded (ds-) DNA-binding protein and report its 2.7 {angstrom} tetrameric ring structure. The inner circle of the Csn2 tetrameric ring is {approx}26 {angstrom} wide and populated with conserved lysine residues poised for nonspecific interactions with ds-DNA. Each Csn2 protomer contains an {alpha}/{beta} domain and an {alpha}-helical domain; significant hinge motion was observed between these two domains. Ca{sup 2+} was located at strategic positions in the oligomerization interface. We further showed that removal of Ca{sup 2+} ions altered the oligomerization state of Csn2, which in turn severely decreased its affinity for ds-DNA. In summary, our results provided the first insight into the function of the Csn2 protein in CRISPR adaptation by revealing that it is a ds-DNA-binding protein functioning at the quaternary structure level and regulated by Ca{sup 2+} ions.

  20. Are clusters of dietary patterns and cluster membership stable over time? Results of a longitudinal cluster analysis study.

    Science.gov (United States)

    Walthouwer, Michel Jean Louis; Oenema, Anke; Soetens, Katja; Lechner, Lilian; de Vries, Hein

    2014-11-01

    Developing nutrition education interventions based on clusters of dietary patterns can only be done adequately when it is clear if distinctive clusters of dietary patterns can be derived and reproduced over time, if cluster membership is stable, and if it is predictable which type of people belong to a certain cluster. Hence, this study aimed to: (1) identify clusters of dietary patterns among Dutch adults, (2) test the reproducibility of these clusters and stability of cluster membership over time, and (3) identify sociodemographic predictors of cluster membership and cluster transition. This study had a longitudinal design with online measurements at baseline (N=483) and 6 months follow-up (N=379). Dietary intake was assessed with a validated food frequency questionnaire. A hierarchical cluster analysis was performed, followed by a K-means cluster analysis. Multinomial logistic regression analyses were conducted to identify the sociodemographic predictors of cluster membership and cluster transition. At baseline and follow-up, a comparable three-cluster solution was derived, distinguishing a healthy, moderately healthy, and unhealthy dietary pattern. Male and lower educated participants were significantly more likely to have a less healthy dietary pattern. Further, 251 (66.2%) participants remained in the same cluster, 45 (11.9%) participants changed to an unhealthier cluster, and 83 (21.9%) participants shifted to a healthier cluster. Men and people living alone were significantly more likely to shift toward a less healthy dietary pattern. Distinctive clusters of dietary patterns can be derived. Yet, cluster membership is unstable and only few sociodemographic factors were associated with cluster membership and cluster transition. These findings imply that clusters based on dietary intake may not be suitable as a basis for nutrition education interventions. Copyright © 2014 Elsevier Ltd. All rights reserved.

  1. Protein clustering and RNA phylogenetic reconstruction of the influenza A [corrected] virus NS1 protein allow an update in classification and identification of motif conservation.

    Science.gov (United States)

    Sevilla-Reyes, Edgar E; Chavaro-Pérez, David A; Piten-Isidro, Elvira; Gutiérrez-González, Luis H; Santos-Mendoza, Teresa

    2013-01-01

    The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis.

  2. Clusters and how to make it work : Cluster Strategy Toolkit

    NARCIS (Netherlands)

    Manickam, Anu; van Berkel, Karel

    2014-01-01

    Clusters are the magic answer to regional economic development. Firms in clusters are more innovative; cluster policy dominates EU policy; ‘top-sectors’ and excellence are the choice of national policy makers; clusters are ‘in’. But, clusters are complex, clusters are ‘messy’; there is no clear

  3. Cluster dynamics at different cluster size and incident laser wavelengths

    International Nuclear Information System (INIS)

    Desai, Tara; Bernardinello, Andrea

    2002-01-01

    X-ray emission spectra from aluminum clusters of diameter -0.4 μm and gold clusters of dia. ∼1.25 μm are experimentally studied by irradiating the cluster foil targets with 1.06 μm laser, 10 ns (FWHM) at an intensity ∼10 12 W/cm 2 . Aluminum clusters show a different spectra compared to bulk material whereas gold cluster evolve towards bulk gold. Experimental data are analyzed on the basis of cluster dimension, laser wavelength and pulse duration. PIC simulations are performed to study the behavior of clusters at higher intensity I≥10 17 W/cm 2 for different size of the clusters irradiated at different laser wavelengths. Results indicate the dependence of cluster dynamics on cluster size and incident laser wavelength

  4. Text Clustering Algorithm Based on Random Cluster Core

    Directory of Open Access Journals (Sweden)

    Huang Long-Jun

    2016-01-01

    Full Text Available Nowadays clustering has become a popular text mining algorithm, but the huge data can put forward higher requirements for the accuracy and performance of text mining. In view of the performance bottleneck of traditional text clustering algorithm, this paper proposes a text clustering algorithm with random features. This is a kind of clustering algorithm based on text density, at the same time using the neighboring heuristic rules, the concept of random cluster is introduced, which effectively reduces the complexity of the distance calculation.

  5. Protein complex finding and ranking: An application to Alzheimer's ...

    Indian Academy of Sciences (India)

    Pooja Sharma

    2017-07-07

    Jul 7, 2017 ... and a few other model organisms. .... form proteins) affect the protein formation process. Muta- ..... We implemented the ComFiR method in MATLAB run- ning on ..... Van Dongen SM 2001 Graph clustering by flow simulation.

  6. Mutation of the His ligand in mitoNEET stabilizes the 2Fe–2S cluster despite conformational heterogeneity in the ligand environment

    Energy Technology Data Exchange (ETDEWEB)

    Conlan, Andrea R.; Paddock, Mark L.; Homer, Christina [University of California at San Diego, La Jolla, CA 92093 (United States); Axelrod, Herbert L.; Cohen, Aina E. [Stanford Synchrotron Radiation Laboratory, 2575 Sand Hill Road, Menlo Park, CA 94025 (United States); Abresch, Edward C.; Zuris, John A. [University of California at San Diego, La Jolla, CA 92093 (United States); Nechushtai, Rachel [Hebrew University of Jerusalem, Edmond J. Safra Campus, Givat Ram, Jerusalem 91904 (Israel); Jennings, Patricia A., E-mail: pajennin@ucsd.edu [University of California at San Diego, La Jolla, CA 92093 (United States)

    2011-06-01

    The spectroscopic and stability properties and X-ray crystal structure of the H87C mutant of the 2Fe–2S ligand mitoNEET are reported. Strikingly, the single point mutation leads to changes in its absorbance and CD spectra and an increase of around sixfold in the stability of the 2Fe–2S clusters over the pH range 5–7. However, the crystal structure of the H87C mutant displays heterogeneity in a few key residues, including the introduced cysteine ligand. Nonetheless, the cluster is highly stabilized from release. MitoNEET is the only identified Fe–S protein localized to the outer mitochondrial membrane and a 1.5 Å resolution X-ray analysis has revealed a unique structure [Paddock et al. (2007 ▶), Proc. Natl Acad. Sci. USA, 104, 14342–14347]. The 2Fe–2S cluster is bound with a 3Cys–1His coordination which defines a new class of 2Fe–2S proteins. The hallmark feature of this class is the single noncysteine ligand His87, which when replaced by Cys decreases the redox potential (E{sub m}) by ∼300 mV and increases the stability of the cluster by around sixfold. Unexpectedly, the pH dependence of the lifetime of the 2Fe–2S cluster remains the same as in the wild-type protein. Here, the crystal structure of H87C mitoNEET was determined to 1.7 Å resolution (R factor = 18%) to investigate the structural basis of the changes in the properties of the 2Fe–2S cluster. In comparison to the wild type, structural changes are localized to the immediate vicinity of the cluster-binding region. Despite the increased stability, Cys87 displays two distinct conformations, with distances of 2.3 and 3.2 Å between the S{sup γ} and the outer Fe of the 2Fe–2S cluster. In addition, Lys55 exhibits multiple conformations in the H87C mutant protein. The structure and distinct characteristics of the H87C mutant provide a framework for further studies investigating the effects of mutation on the properties of the 2Fe–2S cluster in this new class of proteins.

  7. The CoxD protein, a novel AAA+ ATPase involved in metal cluster assembly: hydrolysis of nucleotide-triphosphates and oligomerization.

    Directory of Open Access Journals (Sweden)

    Tobias Maisel

    Full Text Available CoxD of the α-proteobacterium Oligotropha carboxidovorans is a membrane protein which is involved in the posttranslational biosynthesis of the [CuSMoO₂] cluster in the active site of the enzyme CO dehydrogenase. The bacteria synthesize CoxD only in the presence of CO. Recombinant CoxD produced in E. coli K38 pGP1-2/pETMW2 appeared in inclusion bodies from where it was solubilized by urea and refolded by stepwise dilution. Circular dichroism spectroscopy revealed the presence of secondary structural elements in refolded CoxD. CoxD is a P-loop ATPase of the AAA-protein family. Refolded CoxD catalyzed the hydrolysis of MgATP yielding MgADP and inorganic phosphate at a 1∶1∶1 molar ratio. The reaction was inhibited by the slow hydrolysable MgATP-γ-S. GTPase activity of CoxD did not exceed 2% of the ATPase activity. Employing different methods (non linear regression, Hanes and Woolf, Lineweaver-Burk, preparations of CoxD revealed a mean K(M value of 0.69±0.14 mM ATP and an apparent V(max value of 19.3±2.3 nmol ATP hydrolyzed min⁻¹ mg⁻¹. Sucrose density gradient centrifugation and gel filtration showed that refolded CoxD can exist in various multimeric states (2-mer, 4-mer or 6-mer, preferentially as hexamer or dimer. Within weeks the hexamer dissociates into the dimer, a process which can be reversed by MgATP or MgATP-γ-S within hours. Only the hexamers and the dimers exhibited MgATPase activity. Transmission electron microscopy of negatively stained CoxD preparations revealed distinct particles within a size range of 10-16 nm, which further corroborates the oligomeric organization. The 3D structure of CoxD was modeled with the 3D structure of BchI from Rhodobacter capsulatus as template. It has the key elements of an AAA+ domain in the same arrangement and at same positions as in BchI and displays the characteristic inserts of the PS-II-insert clade. Possible functions of CoxD in [CuSMoO₂] cluster assembly are discussed.

  8. Architectures and Functional Coverage of Protein-Protein Interfaces

    Science.gov (United States)

    Tuncbag, Nurcan; Gursoy, Attila; Guney, Emre; Nussinov, Ruth; Keskin, Ozlem

    2008-01-01

    The diverse range of cellular functions is performed by a limited number of protein folds existing in nature. One may similarly expect that cellular functional diversity would be covered by a limited number of protein-protein interface architectures. Here, we present 8205 interface clusters, each representing unique interface architecture. This dataset of protein-protein interfaces is analyzed and compared with older datasets. We observe that the number of both biological and crystal interfaces increase significantly compared to the number of PDB entries. Further, we find that the number of distinct interface architectures grows at a much faster rate than the number of folds and is yet to level off. We further analyze the growth trend of the functional coverage by constructing functional interaction networks from interfaces. The functional coverage is also found to steadily increase. Interestingly, we also observe that despite the diversity of interface architectures, some are more favorable and frequently used, and of particular interest, those are the ones which are also preferred in single chains. PMID:18620705

  9. Insulin regulates Glut4 confinement in plasma membrane clusters in adipose cells.

    Science.gov (United States)

    Lizunov, Vladimir A; Stenkula, Karin; Troy, Aaron; Cushman, Samuel W; Zimmerberg, Joshua

    2013-01-01

    Insulin-stimulated delivery of glucose transporter-4 (GLUT4) to the plasma membrane (PM) is the hallmark of glucose metabolism. In this study we examined insulin's effects on GLUT4 organization in PM of adipose cells by direct microscopic observation of single monomers tagged with photoswitchable fluorescent protein. In the basal state, after exocytotic delivery only a fraction of GLUT4 is dispersed into the PM as monomers, while most of the GLUT4 stays at the site of fusion and forms elongated clusters (60-240 nm). GLUT4 monomers outside clusters diffuse freely and do not aggregate with other monomers. In contrast, GLUT4 molecule collision with an existing cluster can lead to immediate confinement and association with that cluster. Insulin has three effects: it shifts the fraction of dispersed GLUT4 upon delivery, it augments the dissociation of GLUT4 monomers from clusters ∼3-fold and it decreases the rate of endocytic uptake. All together these three effects of insulin shift most of the PM GLUT4 from clustered to dispersed states. GLUT4 confinement in clusters represents a novel kinetic mechanism for insulin regulation of glucose homeostasis.

  10. Insulin regulates Glut4 confinement in plasma membrane clusters in adipose cells.

    Directory of Open Access Journals (Sweden)

    Vladimir A Lizunov

    Full Text Available Insulin-stimulated delivery of glucose transporter-4 (GLUT4 to the plasma membrane (PM is the hallmark of glucose metabolism. In this study we examined insulin's effects on GLUT4 organization in PM of adipose cells by direct microscopic observation of single monomers tagged with photoswitchable fluorescent protein. In the basal state, after exocytotic delivery only a fraction of GLUT4 is dispersed into the PM as monomers, while most of the GLUT4 stays at the site of fusion and forms elongated clusters (60-240 nm. GLUT4 monomers outside clusters diffuse freely and do not aggregate with other monomers. In contrast, GLUT4 molecule collision with an existing cluster can lead to immediate confinement and association with that cluster. Insulin has three effects: it shifts the fraction of dispersed GLUT4 upon delivery, it augments the dissociation of GLUT4 monomers from clusters ∼3-fold and it decreases the rate of endocytic uptake. All together these three effects of insulin shift most of the PM GLUT4 from clustered to dispersed states. GLUT4 confinement in clusters represents a novel kinetic mechanism for insulin regulation of glucose homeostasis.

  11. On clusters and clustering from atoms to fractals

    CERN Document Server

    Reynolds, PJ

    1993-01-01

    This book attempts to answer why there is so much interest in clusters. Clusters occur on all length scales, and as a result occur in a variety of fields. Clusters are interesting scientifically, but they also have important consequences technologically. The division of the book into three parts roughly separates the field into small, intermediate, and large-scale clusters. Small clusters are the regime of atomic and molecular physics and chemistry. The intermediate regime is the transitional regime, with its characteristics including the onset of bulk-like behavior, growth and aggregation, a

  12. Protein Charge and Mass Contribute to the Spatio-temporal Dynamics of Protein-Protein Interactions in a Minimal Proteome

    Science.gov (United States)

    Xu, Yu; Wang, Hong; Nussinov, Ruth; Ma, Buyong

    2013-01-01

    We constructed and simulated a ‘minimal proteome’ model using Langevin dynamics. It contains 206 essential protein types which were compiled from the literature. For comparison, we generated six proteomes with randomized concentrations. We found that the net charges and molecular weights of the proteins in the minimal genome are not random. The net charge of a protein decreases linearly with molecular weight, with small proteins being mostly positively charged and large proteins negatively charged. The protein copy numbers in the minimal genome have the tendency to maximize the number of protein-protein interactions in the network. Negatively charged proteins which tend to have larger sizes can provide large collision cross-section allowing them to interact with other proteins; on the other hand, the smaller positively charged proteins could have higher diffusion speed and are more likely to collide with other proteins. Proteomes with random charge/mass populations form less stable clusters than those with experimental protein copy numbers. Our study suggests that ‘proper’ populations of negatively and positively charged proteins are important for maintaining a protein-protein interaction network in a proteome. It is interesting to note that the minimal genome model based on the charge and mass of E. Coli may have a larger protein-protein interaction network than that based on the lower organism M. pneumoniae. PMID:23420643

  13. How Escherichia coli and Saccharomyces cerevisiae build Fe/S proteins.

    Science.gov (United States)

    Barras, Frédéric; Loiseau, Laurent; Py, Béatrice

    2005-01-01

    Owing to the versatile electronic properties of iron and sulfur, iron sulfur (Fe/S) clusters are perfectly suited for sensing changes in environmental conditions and regulating protein properties accordingly. Fe/S proteins have been recruited in a wide array of diverse biological processes, including electron transfer chains, metabolic pathways and gene regulatory circuits. Chemistry has revealed the great diversity of Fe/S clusters occurring in proteins. The question now is to understand how iron and sulfur come together to form Fe/S clusters and how these clusters are subsequently inserted into apoproteins. Iron, sulfide and reducing conditions were found to be sufficient for successful maturation of many apoproteins in vitro, opening the possibility that insertion might be a spontaneous event. However, as in many other biological pathways such as protein folding, genetic analyses revealed that Fe/S cluster biogenesis and insertion depend in vivo upon auxiliary proteins. This was brought to light by studies on Azotobacter vinelandii nitrogenase, which, in particular, led to the concept of scaffold proteins, the role of which would be to allow transient assembly of Fe/S cluster. These studies paved the way toward the identification of the ISC and SUF systems, subjects of the present review that allow Fe/S cluster assembly into apoproteins of most organisms. Despite the recent discovery of the SUF and ISC systems, remarkable progress has been made in our understanding of their molecular composition and biochemical mechanisms. Such a rapid increase in our knowledge arose from a convergent interest from researchers engaged in unrelated fields and whose complementary expertise covered most experimental approaches used in biology. Also, the high conservation of ISC and SUF systems throughout a wide array of organisms helped cross-feeding between studies. The ISC system is conserved in eubacteria and most eukaryotes, while the SUF system arises in eubacteria, archaea

  14. The near-symmetry of proteins.

    Science.gov (United States)

    Bonjack-Shterengartz, Maayan; Avnir, David

    2015-04-01

    The majority of protein oligomers form clusters which are nearly symmetric. Understanding of that imperfection, its origins, and perhaps also its advantages requires the conversion of the currently used vague qualitative descriptive language of the near-symmetry into an accurate quantitative measure that will allow to answer questions such as: "What is the degree of symmetry deviation of the protein?," "how do these deviations compare within a family of proteins?," and so on. We developed quantitative methods to answer this type of questions, which are capable of analyzing the whole protein, its backbone or selected portions of it, down to comparison of symmetry-related specific amino-acids, and which are capable of visualizing the various levels of symmetry deviations in the form of symmetry maps. We have applied these methods on an extensive list of homomers and heteromers and found that apparently all proteins never reach perfect symmetry. Strikingly, even homomeric protein clusters are never ideally symmetric. We also found that the main burden of symmetry distortion is on the amino-acids near the symmetry axis; that it is mainly the more hydrophilic amino-acids that take place in symmetry-distortive interactions; and more. The remarkable ability of heteromers to preserve near-symmetry, despite the different sequences, was also shown and analyzed. The comprehensive literature on the suggested advantages symmetric oligomerizations raises a yet-unsolved key question: If symmetry is so advantageous, why do proteins stop shy of perfect symmetry? Some tentative answers to be tested in further studies are suggested in a concluding outlook. © 2014 Wiley Periodicals, Inc.

  15. Fe-S cluster coordination of the chromokinesin KIF4A alters its sub-cellular localization during mitosis.

    Science.gov (United States)

    Ben-Shimon, Lilach; Paul, Viktoria D; David-Kadoch, Galit; Volpe, Marina; Stümpfig, Martin; Bill, Eckhard; Mühlenhoff, Ulrich; Lill, Roland; Ben-Aroya, Shay

    2018-05-30

    Fe-S clusters act as co-factors of proteins with diverse functions, e.g. in DNA repair. Down-regulation of the cytosolic iron-sulfur protein assembly (CIA) machinery promotes genomic instability by the inactivation of multiple DNA repair pathways. Furthermore, CIA deficiencies are associated with so far unexplained mitotic defects. Here, we show that CIA2B and MMS19, constituents of the CIA targeting complex involved in facilitating Fe-S cluster insertion into cytosolic and nuclear target proteins, co-localize with components of the mitotic machinery. Down-regulation of CIA2B and MMS19 impairs the mitotic cycle. We identify the chromokinesin KIF4A as a mitotic component involved in these effects. KIF4A binds a Fe-S cluster in vitro through its conserved cysteine-rich domain. We demonstrate in vivo that this domain is required for the mitosis-related KIF4A localization and for the mitotic defects associated with KIF4A knockout. KIF4A is the first identified mitotic component carrying such a post-translational modification. These findings suggest that the lack of Fe-S clusters in KIF4A upon down-regulation of the CIA targeting complex contributes to the mitotic defects. © 2018. Published by The Company of Biologists Ltd.

  16. GibbsCluster: unsupervised clustering and alignment of peptide sequences

    DEFF Research Database (Denmark)

    Andreatta, Massimo; Alvarez, Bruno; Nielsen, Morten

    2017-01-01

    motif characterizing each cluster. Several parameters are available to customize cluster analysis, including adjustable penalties for small clusters and overlapping groups and a trash cluster to remove outliers. As an example application, we used the server to deconvolute multiple specificities in large......-scale peptidome data generated by mass spectrometry. The server is available at http://www.cbs.dtu.dk/services/GibbsCluster-2.0....

  17. Hierarchical partitioning of metazoan protein conservation profiles provides new functional insights.

    Directory of Open Access Journals (Sweden)

    Jonathan Witztum

    Full Text Available The availability of many complete, annotated proteomes enables the systematic study of the relationships between protein conservation and functionality. We explore this question based solely on the presence or absence of protein homologues (a.k.a. conservation profiles. We study 18 metazoans, from two distinct points of view: the human's and the fly's. Using the GOrilla gene ontology (GO analysis tool, we explore functional enrichment of the "universal proteins", those with homologues in all 17 other species, and of the "non-universal proteins". A large number of GO terms are strongly enriched in both human and fly universal proteins. Most of these functions are known to be essential. A smaller number of GO terms, exhibiting markedly different properties, are enriched in both human and fly non-universal proteins. We further explore the non-universal proteins, whose conservation profiles are consistent with the "tree of life" (TOL consistent, as well as the TOL inconsistent proteins. Finally, we applied Quantum Clustering to the conservation profiles of the TOL consistent proteins. Each cluster is strongly associated with one or a small number of specific monophyletic clades in the tree of life. The proteins in many of these clusters exhibit strong functional enrichment associated with the "life style" of the related clades. Most previous approaches for studying function and conservation are "bottom up", studying protein families one by one, and separately assessing the conservation of each. By way of contrast, our approach is "top down". We globally partition the set of all proteins hierarchically, as described above, and then identify protein families enriched within different subdivisions. While supporting previous findings, our approach also provides a tool for discovering novel relations between protein conservation profiles, functionality, and evolutionary history as represented by the tree of life.

  18. Diametrical clustering for identifying anti-correlated gene clusters.

    Science.gov (United States)

    Dhillon, Inderjit S; Marcotte, Edward M; Roshan, Usman

    2003-09-01

    Clustering genes based upon their expression patterns allows us to predict gene function. Most existing clustering algorithms cluster genes together when their expression patterns show high positive correlation. However, it has been observed that genes whose expression patterns are strongly anti-correlated can also be functionally similar. Biologically, this is not unintuitive-genes responding to the same stimuli, regardless of the nature of the response, are more likely to operate in the same pathways. We present a new diametrical clustering algorithm that explicitly identifies anti-correlated clusters of genes. Our algorithm proceeds by iteratively (i). re-partitioning the genes and (ii). computing the dominant singular vector of each gene cluster; each singular vector serving as the prototype of a 'diametric' cluster. We empirically show the effectiveness of the algorithm in identifying diametrical or anti-correlated clusters. Testing the algorithm on yeast cell cycle data, fibroblast gene expression data, and DNA microarray data from yeast mutants reveals that opposed cellular pathways can be discovered with this method. We present systems whose mRNA expression patterns, and likely their functions, oppose the yeast ribosome and proteosome, along with evidence for the inverse transcriptional regulation of a number of cellular systems.

  19. Partitional clustering algorithms

    CERN Document Server

    2015-01-01

    This book summarizes the state-of-the-art in partitional clustering. Clustering, the unsupervised classification of patterns into groups, is one of the most important tasks in exploratory data analysis. Primary goals of clustering include gaining insight into, classifying, and compressing data. Clustering has a long and rich history that spans a variety of scientific disciplines including anthropology, biology, medicine, psychology, statistics, mathematics, engineering, and computer science. As a result, numerous clustering algorithms have been proposed since the early 1950s. Among these algorithms, partitional (nonhierarchical) ones have found many applications, especially in engineering and computer science. This book provides coverage of consensus clustering, constrained clustering, large scale and/or high dimensional clustering, cluster validity, cluster visualization, and applications of clustering. Examines clustering as it applies to large and/or high-dimensional data sets commonly encountered in reali...

  20. NMR of proteins (4Fe-4S): structural properties and intramolecular electron transfer

    International Nuclear Information System (INIS)

    Huber, J.G.

    1996-01-01

    NMR started to be applied to Fe-S proteins in the seventies. Its use has recently been enlarged as the problems arising from the paramagnetic polymetallic clusters ware overcome. Applications to [4Fe-4S] are presented herein. The information derived thereof deepens the understanding of the redox properties of these proteins which play a central role in the metabolism of bacterial cells. The secondary structure elements and the overall folding of Chromatium vinosum ferredoxin (Cv Fd) in solution have been established by NMR. The unique features of this sequence have been shown to fold as an α helix at the C-terminus and as a loop between two cysteines ligand of one cluster: these two parts localize in close proximity from one another. The interaction between nuclear and electronic spins is a source of additional structural information for (4Fe-AS] proteins. The conformation of the cysteine-ligands, as revealed by the Fe-(S γ -C β -H β )Cys dihedral angles, is related to the chemical shifts of the signals associated with the protons of these residues. The longitudinal relaxation times of the protons depend on their distance to the cluster. A quantitative relationship has been established and used to show that the solution structure of the high-potential ferredoxin from Cv differs significantly from the crystal structure around Phe-48. Both parameters (chemical shifts and longitudinal relaxation times) give also insight into the electronic and magnetic properties of the [4Fe-4S] clusters. The rate of intramolecular electron transfer between the two [4FE-4S] clusters of ferredoxins has been measured by NMR. It is far slower in the case of Cv Fd than for shorter ferredoxins. The difference may be associated with changes in the magnetic and/or electronic properties of one cluster. The strong paramagnetism of the [4Fe-4S] clusters, which originally limited the applicability of NMR to proteins containing these cofactors, has been proven instrumental in affording new

  1. SOS System Induction Inhibits the Assembly of Chemoreceptor Signaling Clusters in Salmonella enterica.

    Science.gov (United States)

    Irazoki, Oihane; Mayola, Albert; Campoy, Susana; Barbé, Jordi

    2016-01-01

    Swarming, a flagellar-driven multicellular form of motility, is associated with bacterial virulence and increased antibiotic resistance. In this work we demonstrate that activation of the SOS response reversibly inhibits swarming motility by preventing the assembly of chemoreceptor-signaling polar arrays. We also show that an increase in the concentration of the RecA protein, generated by SOS system activation, rather than another function of this genetic network impairs chemoreceptor polar cluster formation. Our data provide evidence that the molecular balance between RecA and CheW proteins is crucial to allow polar cluster formation in Salmonella enterica cells. Thus, activation of the SOS response by the presence of a DNA-injuring compound increases the RecA concentration, thereby disturbing the equilibrium between RecA and CheW and resulting in the cessation of swarming. Nevertheless, when the DNA-damage decreases and the SOS response is no longer activated, basal RecA levels and thus polar cluster assembly are reestablished. These results clearly show that bacterial populations moving over surfaces make use of specific mechanisms to avoid contact with DNA-damaging compounds.

  2. Cluster Matters

    DEFF Research Database (Denmark)

    Gulati, Mukesh; Lund-Thomsen, Peter; Suresh, Sangeetha

    2018-01-01

    sell their products successfully in international markets, but there is also an increasingly large consumer base within India. Indeed, Indian industrial clusters have contributed to a substantial part of this growth process, and there are several hundred registered clusters within the country...... of this handbook, which focuses on the role of CSR in MSMEs. Hence we contribute to the literature on CSR in industrial clusters and specifically CSR in Indian industrial clusters by investigating the drivers of CSR in India’s industrial clusters....

  3. Weighted Clustering

    DEFF Research Database (Denmark)

    Ackerman, Margareta; Ben-David, Shai; Branzei, Simina

    2012-01-01

    We investigate a natural generalization of the classical clustering problem, considering clustering tasks in which different instances may have different weights.We conduct the first extensive theoretical analysis on the influence of weighted data on standard clustering algorithms in both...... the partitional and hierarchical settings, characterizing the conditions under which algorithms react to weights. Extending a recent framework for clustering algorithm selection, we propose intuitive properties that would allow users to choose between clustering algorithms in the weighted setting and classify...

  4. MOCASSIN-prot: A multi-objective clustering approach for protein similarity networks

    Science.gov (United States)

    Motivation: Proteins often include multiple conserved domains. Various evolutionary events including duplication and loss of domains, domain shuffling, as well as sequence divergence contribute to generating complexities in protein structures, and consequently, in their functions. The evolutionary h...

  5. Conditions for the Evolution of Gene Clusters in Bacterial Genomes

    Science.gov (United States)

    Ballouz, Sara; Francis, Andrew R.; Lan, Ruiting; Tanaka, Mark M.

    2010-01-01

    Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model), genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters. PMID:20168992

  6. Support Policies in Clusters: Prioritization of Support Needs by Cluster Members According to Cluster Life Cycle

    Directory of Open Access Journals (Sweden)

    Gulcin Salıngan

    2012-07-01

    Full Text Available Economic development has always been a moving target. Both the national and local governments have been facing the challenge of implementing the effective and efficient economic policy and program in order to best utilize their limited resources. One of the recent approaches in this area is called cluster-based economic analysis and strategy development. This study reviews key literature and some of the cluster based economic policies adopted by different governments. Based on this review, it proposes “the cluster life cycle” as a determining factor to identify the support requirements of clusters. A survey, designed based on literature review of International Cluster support programs, was conducted with 30 participants from 3 clusters with different maturity stage. This paper discusses the results of this study conducted among the cluster members in Eskişehir- Bilecik-Kütahya Region in Turkey on the requirement of the support to foster the development of related clusters.

  7. Entry and exit of bacterial outer membrane proteins.

    Science.gov (United States)

    Misra, Rajeev

    2015-08-01

    The sites of new outer membrane protein (OMP) deposition and the fate of pre-existing OMPs are still enigmatic despite numerous concerted efforts. Rassam et al. identified mid-cell regions as the primary entry points for new OMP insertion in clusters, driving the pre-existing OMP clusters towards cell poles for long-term storage. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. Clusters and how to make it work : toolkit for cluster strategy

    NARCIS (Netherlands)

    Manickam, Anu; van Berkel, Karel

    2013-01-01

    Clusters are the magic answer to regional economic development. Firms in clusters are more innovative; cluster policy dominates EU policy; ‘top-sectors’ and excellence are the choice of national policy makers; clusters are ‘in’. But, clusters are complex, clusters are ‘messy’; there is no clear

  9. Arabidopsis thaliana mTERF proteins: evolution and functional classification

    Directory of Open Access Journals (Sweden)

    Tatjana eKleine

    2012-10-01

    Full Text Available Organellar gene expression (OGE is crucial for plant development, photosynthesis and respiration, but our understanding of the mechanisms that control it is still relatively poor. Thus, OGE requires various nucleus-encoded proteins that promote transcription, splicing, trimming and editing of organellar RNAs, and regulate translation. In metazoans, proteins of the mitochondrial Transcription tERmination Factor (mTERF family interact with the mitochondrial chromosome and regulate transcriptional initiation and termination. Sequencing of the Arabidopsis thaliana genome led to the identification of a diversified MTERF gene family but, in contrast to mammalian mTERFs, knowledge about the function of these proteins in photosynthetic organisms is scarce. In this hypothesis article, I show that tandem duplications and one block duplication contributed to the large number of MTERF genes in A. thaliana, and propose that the expansion of the family is related to the evolution of land plants. The MTERF genes - especially the duplicated genes - display a number of distinct mRNA accumulation patterns, suggesting functional diversification of mTERF proteins to increase adaptability to environmental changes. Indeed, hypothetical functions for the different mTERF proteins can be predicted using co-expression analysis and gene ontology annotations. On this basis, mTERF proteins can be sorted into five groups. Members of the chloroplast and chloroplast-associated clusters are principally involved in chloroplast gene expression, embryogenesis and protein catabolism, while representatives of the mitochondrial cluster seem to participate in DNA and RNA metabolism in that organelle. Moreover, members of the mitochondrion-associated cluster and the low expression group may act in the nucleus and/or the cytosol. As proteins involved in OGE and presumably nuclear gene expression, mTERFs are ideal candidates for the coordination of the expression of organelle and nuclear

  10. A PQL (protein quantity loci) analysis of mature pea seed proteins identifies loci determining seed protein composition.

    Science.gov (United States)

    Bourgeois, Michael; Jacquin, Françoise; Cassecuelle, Florence; Savois, Vincent; Belghazi, Maya; Aubert, Grégoire; Quillien, Laurence; Huart, Myriam; Marget, Pascal; Burstin, Judith

    2011-05-01

    Legume seeds are a major source of dietary proteins for humans and animals. Deciphering the genetic control of their accumulation is thus of primary significance towards their improvement. At first, we analysed the genetic variability of the pea seed proteome of three genotypes over 3 years of cultivation. This revealed that seed protein composition variability was under predominant genetic control, with as much as 60% of the spots varying quantitatively among the three genotypes. Then, by combining proteomic and quantitative trait loci (QTL) mapping approaches, we uncovered the genetic architecture of seed proteome variability. Protein quantity loci (PQL) were searched for 525 spots detected on 2-D gels obtained for 157 recombinant inbred lines. Most protein quantity loci mapped in clusters, suggesting that the accumulation of the major storage protein families was under the control of a limited number of loci. While convicilin accumulation was mainly under the control of cis-regulatory regions, vicilins and legumins were controlled by both cis- and trans-regulatory regions. Some loci controlled both seed protein composition and protein content and a locus on LGIIa appears to be a major regulator of protein composition and of protein in vitro digestibility. Copyright © 2011 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  11. Ensemble attribute profile clustering: discovering and characterizing groups of genes with similar patterns of biological features

    Directory of Open Access Journals (Sweden)

    Bissell MJ

    2006-03-01

    Full Text Available Abstract Background Ensemble attribute profile clustering is a novel, text-based strategy for analyzing a user-defined list of genes and/or proteins. The strategy exploits annotation data present in gene-centered corpora and utilizes ideas from statistical information retrieval to discover and characterize properties shared by subsets of the list. The practical utility of this method is demonstrated by employing it in a retrospective study of two non-overlapping sets of genes defined by a published investigation as markers for normal human breast luminal epithelial cells and myoepithelial cells. Results Each genetic locus was characterized using a finite set of biological properties and represented as a vector of features indicating attributes associated with the locus (a gene attribute profile. In this study, the vector space models for a pre-defined list of genes were constructed from the Gene Ontology (GO terms and the Conserved Domain Database (CDD protein domain terms assigned to the loci by the gene-centered corpus LocusLink. This data set of GO- and CDD-based gene attribute profiles, vectors of binary random variables, was used to estimate multiple finite mixture models and each ensuing model utilized to partition the profiles into clusters. The resultant partitionings were combined using a unanimous voting scheme to produce consensus clusters, sets of profiles that co-occured consistently in the same cluster. Attributes that were important in defining the genes assigned to a consensus cluster were identified. The clusters and their attributes were inspected to ascertain the GO and CDD terms most associated with subsets of genes and in conjunction with external knowledge such as chromosomal location, used to gain functional insights into human breast biology. The 52 luminal epithelial cell markers and 89 myoepithelial cell markers are disjoint sets of genes. Ensemble attribute profile clustering-based analysis indicated that both lists

  12. Rescuing the Rescuer: On the Protein Complex between the Human Mitochondrial Acyl Carrier Protein and ISD11.

    Science.gov (United States)

    Herrera, María Georgina; Pignataro, María Florencia; Noguera, Martín Ezequiel; Cruz, Karen Magalí; Santos, Javier

    2018-05-16

    Iron-sulfur clusters are essential cofactors in many biochemical processes. ISD11, one of the subunits of the protein complex that carries out the cluster assembly in mitochondria, is necessary for cysteine desulfurase NFS1 stability and function. Several authors have recently provided evidence showing that ISD11 interacts with the acyl carrier protein (ACP). We carried out the coexpression of human mitochondrial ACP and ISD11 in E. coli. This work shows that ACP and ISD11 form a soluble, structured, and stable complex able to bind to the human NFS1 subunit modulating its activity. Results suggest that ACP plays a key-role in ISD11 folding and stability in vitro. These findings offer the opportunity to study the mechanism of interaction between ISD11 and NFS1.

  13. Determination of atomic cluster structure with cluster fusion algorithm

    DEFF Research Database (Denmark)

    Obolensky, Oleg I.; Solov'yov, Ilia; Solov'yov, Andrey V.

    2005-01-01

    We report an efficient scheme of global optimization, called cluster fusion algorithm, which has proved its reliability and high efficiency in determination of the structure of various atomic clusters.......We report an efficient scheme of global optimization, called cluster fusion algorithm, which has proved its reliability and high efficiency in determination of the structure of various atomic clusters....

  14. Large-Scale Multi-Dimensional Document Clustering on GPU Clusters

    Energy Technology Data Exchange (ETDEWEB)

    Cui, Xiaohui [ORNL; Mueller, Frank [North Carolina State University; Zhang, Yongpeng [ORNL; Potok, Thomas E [ORNL

    2010-01-01

    Document clustering plays an important role in data mining systems. Recently, a flocking-based document clustering algorithm has been proposed to solve the problem through simulation resembling the flocking behavior of birds in nature. This method is superior to other clustering algorithms, including k-means, in the sense that the outcome is not sensitive to the initial state. One limitation of this approach is that the algorithmic complexity is inherently quadratic in the number of documents. As a result, execution time becomes a bottleneck with large number of documents. In this paper, we assess the benefits of exploiting the computational power of Beowulf-like clusters equipped with contemporary Graphics Processing Units (GPUs) as a means to significantly reduce the runtime of flocking-based document clustering. Our framework scales up to over one million documents processed simultaneously in a sixteennode GPU cluster. Results are also compared to a four-node cluster with higher-end GPUs. On these clusters, we observe 30X-50X speedups, which demonstrates the potential of GPU clusters to efficiently solve massive data mining problems. Such speedups combined with the scalability potential and accelerator-based parallelization are unique in the domain of document-based data mining, to the best of our knowledge.

  15. Membership determination of open clusters based on a spectral clustering method

    Science.gov (United States)

    Gao, Xin-Hua

    2018-06-01

    We present a spectral clustering (SC) method aimed at segregating reliable members of open clusters in multi-dimensional space. The SC method is a non-parametric clustering technique that performs cluster division using eigenvectors of the similarity matrix; no prior knowledge of the clusters is required. This method is more flexible in dealing with multi-dimensional data compared to other methods of membership determination. We use this method to segregate the cluster members of five open clusters (Hyades, Coma Ber, Pleiades, Praesepe, and NGC 188) in five-dimensional space; fairly clean cluster members are obtained. We find that the SC method can capture a small number of cluster members (weak signal) from a large number of field stars (heavy noise). Based on these cluster members, we compute the mean proper motions and distances for the Hyades, Coma Ber, Pleiades, and Praesepe clusters, and our results are in general quite consistent with the results derived by other authors. The test results indicate that the SC method is highly suitable for segregating cluster members of open clusters based on high-precision multi-dimensional astrometric data such as Gaia data.

  16. Supervised maximum-likelihood weighting of composite protein networks for complex prediction

    Directory of Open Access Journals (Sweden)

    Yong Chern Han

    2012-12-01

    Full Text Available Abstract Background Protein complexes participate in many important cellular functions, so finding the set of existent complexes is essential for understanding the organization and regulation of processes in the cell. With the availability of large amounts of high-throughput protein-protein interaction (PPI data, many algorithms have been proposed to discover protein complexes from PPI networks. However, such approaches are hindered by the high rate of noise in high-throughput PPI data, including spurious and missing interactions. Furthermore, many transient interactions are detected between proteins that are not from the same complex, while not all proteins from the same complex may actually interact. As a result, predicted complexes often do not match true complexes well, and many true complexes go undetected. Results We address these challenges by integrating PPI data with other heterogeneous data sources to construct a composite protein network, and using a supervised maximum-likelihood approach to weight each edge based on its posterior probability of belonging to a complex. We then use six different clustering algorithms, and an aggregative clustering strategy, to discover complexes in the weighted network. We test our method on Saccharomyces cerevisiae and Homo sapiens, and show that complex discovery is improved: compared to previously proposed supervised and unsupervised weighting approaches, our method recalls more known complexes, achieves higher precision at all recall levels, and generates novel complexes of greater functional similarity. Furthermore, our maximum-likelihood approach allows learned parameters to be used to visualize and evaluate the evidence of novel predictions, aiding human judgment of their credibility. Conclusions Our approach integrates multiple data sources with supervised learning to create a weighted composite protein network, and uses six clustering algorithms with an aggregative clustering strategy to

  17. Mitochondrial nucleoid clusters protect newly synthesized mtDNA during Doxorubicin- and Ethidium Bromide-induced mitochondrial stress

    Energy Technology Data Exchange (ETDEWEB)

    Alán, Lukáš, E-mail: lukas.alan@fgu.cas.cz; Špaček, Tomáš; Pajuelo Reguera, David; Jabůrek, Martin; Ježek, Petr

    2016-07-01

    Mitochondrial DNA (mtDNA) is compacted in ribonucleoprotein complexes called nucleoids, which can divide or move within the mitochondrial network. Mitochondrial nucleoids are able to aggregate into clusters upon reaction with intercalators such as the mtDNA depletion agent Ethidium Bromide (EB) or anticancer drug Doxorobicin (DXR). However, the exact mechanism of nucleoid clusters formation remains unknown. Resolving these processes may help to elucidate the mechanisms of DXR-induced cardiotoxicity. Therefore, we addressed the role of two key nucleoid proteins; mitochondrial transcription factor A (TFAM) and mitochondrial single-stranded binding protein (mtSSB); in the formation of mitochondrial nucleoid clusters during the action of intercalators. We found that both intercalators cause numerous aberrations due to perturbing their native status. By blocking mtDNA replication, both agents also prevented mtDNA association with TFAM, consequently causing nucleoid aggregation into large nucleoid clusters enriched with TFAM, co-existing with the normal nucleoid population. In the later stages of intercalation (> 48 h), TFAM levels were reduced to 25%. In contrast, mtSSB was released from mtDNA and freely distributed within the mitochondrial network. Nucleoid clusters mostly contained nucleoids with newly replicated mtDNA, however the nucleoid population which was not in replication mode remained outside the clusters. Moreover, the nucleoid clusters were enriched with p53, an anti-oncogenic gatekeeper. We suggest that mitochondrial nucleoid clustering is a mechanism for protecting nucleoids with newly replicated DNA against intercalators mediating genotoxic stress. These results provide new insight into the common mitochondrial response to mtDNA stress and can be implied also on DXR-induced mitochondrial cytotoxicity. - Highlights: • The mechanism for mitochondrial nucleoid clustering is proposed. • DNA intercalators (Doxorubicin or Ethidium Bromide) prevent TFAM

  18. Dietary protein is associated with musculoskeletal health independently of dietary pattern: the Framingham Third Generation Study.

    Science.gov (United States)

    Mangano, Kelsey M; Sahni, Shivani; Kiel, Douglas P; Tucker, Katherine L; Dufour, Alyssa B; Hannan, Marian T

    2017-03-01

    Background: Above-average dietary protein, as a single nutrient, improves musculoskeletal health. Evaluating the link between dietary protein and musculoskeletal health from a whole-diet perspective is important, as dietary guidelines focus on dietary patterns. Objective: We examined the prospective association of novel dietary protein food clusters (derived from established dietary pattern techniques) with appendicular lean mass (ALM), quadriceps strength (QS), and bone mineral density (BMD) in 2986 men and women, aged 19-72 y, from the Framingham Third Generation Study. Design: Total protein intake was estimated by food-frequency questionnaire in 2002-2005. A cluster analysis was used to classify participants into mutually exclusive groups, which were determined by using the percentage of contribution of food intake to overall protein intake. General linear modeling was used to 1 ) estimate the association between protein intake (grams per day) and BMD, ALM, appendicular lean mass normalized for height (ALM/ht 2 ), and QS (2008-2011) and to 2 ) calculate adjusted least-squares mean outcomes across quartiles of protein (grams per day) and protein food clusters. Results: The mean ± SD age of subjects was 40 ± 9 y; 82% of participants met the Recommended Daily Allowance (0.8 g · kg body weight -1 · d -1 ). The following 6 dietary protein food clusters were identified: fast food and full-fat dairy, fish, red meat, chicken, low-fat milk, and legumes. BMD was not different across quartiles of protein intake ( P -trend range = 0.32-0.82); but significant positive trends were observed for ALM, ALM/ht 2 ( P dietary protein is associated with ALM and QS but not with BMD. In this study, dietary protein food patterns do not provide further insight into beneficial protein effects on muscle outcomes. © 2017 American Society for Nutrition.

  19. Cluster headache

    Science.gov (United States)

    Histamine headache; Headache - histamine; Migrainous neuralgia; Headache - cluster; Horton's headache; Vascular headache - cluster ... Doctors do not know exactly what causes cluster headaches. They ... (chemical in the body released during an allergic response) or ...

  20. Phrase Mining of Textual Data to Analyze Extracellular Matrix Protein Patterns Across Cardiovascular Disease.

    Science.gov (United States)

    Liem, David Alexandre; Murali, Sanjana; Sigdel, Dibakar; Shi, Yu; Wang, Xuan; Shen, Jiaming; Choi, Howard; Caufield, J Harry; Wang, Wei; Ping, Peipei; Han, Jiawei

    2018-05-18

    Extracellular matrix (ECM) proteins have been shown to play important roles regulating multiple biological processes in an array of organ systems, including the cardiovascular system. By using a novel bioinformatics text-mining tool, we studied six categories of cardiovascular disease (CVD), namely ischemic heart disease (IHD), cardiomyopathies (CM), cerebrovascular accident (CVA), congenital heart disease (CHD), arrhythmias (ARR), and valve disease (VD), anticipating novel ECM protein-disease and protein-protein relationships hidden within vast quantities of textual data. We conducted a phrase-mining analysis, delineating the relationships of 709 ECM proteins with the six groups of CVDs reported in 1,099,254 abstracts. The technology pipeline known as Context-aware Semantic Online Analytical Processing (CaseOLAP) was applied to semantically rank the association of proteins to each and all six CVDs, performing analyses to quantify each protein-disease relationship. We performed principal component analysis and hierarchical clustering of the data, where each protein is visualized as a six dimensional vector. We found that ECM proteins display variable degrees of association with the six CVDs; certain CVDs share groups of associated proteins whereas others have divergent protein associations. We identified 82 ECM proteins sharing associations with all six CVDs. Our bioinformatics analysis ascribed distinct ECM pathways (via Reactome) from this subset of proteins, namely insulin-like growth factor regulation and interleukin-4 and interleukin-13 signaling, suggesting their contribution to the pathogenesis of all six CVDs. Finally, we performed hierarchical clustering analysis and identified protein clusters associated with a targeted CVD; analyses revealed unexpected insights underlying ECM-pathogenesis of CVDs.

  1. Histone and ribosomal RNA repetitive gene clusters of the boll weevil are linked in a tandem array.

    Science.gov (United States)

    Roehrdanz, R; Heilmann, L; Senechal, P; Sears, S; Evenson, P

    2010-08-01

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and the clusters are tandemly repeated. Ribosomal DNA contains a cluster of the rRNA sequences 18S, 5.8S and 28S. The rRNA genes are separated by the spacers ITS1, ITS2 and IGS. This cluster is also tandemly repeated. We found that the ribosomal RNA repeat unit of at least two species of Anthonomine weevils, Anthonomus grandis and Anthonomus texanus (Coleoptera: Curculionidae), is interspersed with a block containing the histone gene quintet. The histone genes are situated between the rRNA 18S and 28S genes in what is known as the intergenic spacer region (IGS). The complete reiterated Anthonomus grandis histone-ribosomal sequence is 16,248 bp.

  2. Maximum-likelihood model averaging to profile clustering of site types across discrete linear sequences.

    Directory of Open Access Journals (Sweden)

    Zhang Zhang

    2009-06-01

    Full Text Available A major analytical challenge in computational biology is the detection and description of clusters of specified site types, such as polymorphic or substituted sites within DNA or protein sequences. Progress has been stymied by a lack of suitable methods to detect clusters and to estimate the extent of clustering in discrete linear sequences, particularly when there is no a priori specification of cluster size or cluster count. Here we derive and demonstrate a maximum likelihood method of hierarchical clustering. Our method incorporates a tripartite divide-and-conquer strategy that models sequence heterogeneity, delineates clusters, and yields a profile of the level of clustering associated with each site. The clustering model may be evaluated via model selection using the Akaike Information Criterion, the corrected Akaike Information Criterion, and the Bayesian Information Criterion. Furthermore, model averaging using weighted model likelihoods may be applied to incorporate model uncertainty into the profile of heterogeneity across sites. We evaluated our method by examining its performance on a number of simulated datasets as well as on empirical polymorphism data from diverse natural alleles of the Drosophila alcohol dehydrogenase gene. Our method yielded greater power for the detection of clustered sites across a breadth of parameter ranges, and achieved better accuracy and precision of estimation of clusters, than did the existing empirical cumulative distribution function statistics.

  3. Single-cluster dynamics for the random-cluster model

    NARCIS (Netherlands)

    Deng, Y.; Qian, X.; Blöte, H.W.J.

    2009-01-01

    We formulate a single-cluster Monte Carlo algorithm for the simulation of the random-cluster model. This algorithm is a generalization of the Wolff single-cluster method for the q-state Potts model to noninteger values q>1. Its results for static quantities are in a satisfactory agreement with those

  4. Epidemiology Analysis of Streptococcus pyogenes in a Hospital in Southern Taiwan by Use of the Updated emm Cluster Typing System.

    Science.gov (United States)

    Chiang-Ni, Chuan; Zheng, Po-Xing; Wang, Shu-Ying; Tsai, Pei-Jane; Chuang, Woei-Jer; Lin, Yee-Shin; Liu, Ching-Chuan; Wu, Jiunn-Jong

    2016-01-01

    emm typing is the most widely used molecular typing method for the human pathogen Streptococcus pyogenes (group A streptococcus [GAS]). emm typing is based on a small variable region of the emm gene; however, the emm cluster typing system defines GAS types according to the nearly complete sequence of the emm gene. Therefore, emm cluster typing is considered to provide more information regarding the functional and structural properties of M proteins in different emm types of GAS. In the present study, 677 isolates collected between 1994 and 2008 in a hospital in southern Taiwan were analyzed by the emm cluster typing system. emm clusters A-C4, E1, E6, and A-C3 were the most prevalent emm cluster types and accounted for 67.4% of total isolates. emm clusters A-C4 and E1 were associated with noninvasive diseases, whereas E6 was significantly associated with both invasive and noninvasive manifestations. In addition, emm clusters D4, E2, and E3 were significantly associated with invasive manifestations. Furthermore, we found that the functional properties of M protein, including low fibrinogen-binding and high IgG-binding activities, were correlated significantly with invasive manifestations. In summary, the present study provides updated epidemiological information on GAS emm cluster types in southern Taiwan. Copyright © 2015, American Society for Microbiology. All Rights Reserved.

  5. Conditions for the evolution of gene clusters in bacterial genomes.

    Directory of Open Access Journals (Sweden)

    Sara Ballouz

    2010-02-01

    Full Text Available Genes encoding proteins in a common pathway are often found near each other along bacterial chromosomes. Several explanations have been proposed to account for the evolution of these structures. For instance, natural selection may directly favour gene clusters through a variety of mechanisms, such as increased efficiency of coregulation. An alternative and controversial hypothesis is the selfish operon model, which asserts that clustered arrangements of genes are more easily transferred to other species, thus improving the prospects for survival of the cluster. According to another hypothesis (the persistence model, genes that are in close proximity are less likely to be disrupted by deletions. Here we develop computational models to study the conditions under which gene clusters can evolve and persist. First, we examine the selfish operon model by re-implementing the simulation and running it under a wide range of conditions. Second, we introduce and study a Moran process in which there is natural selection for gene clustering and rearrangement occurs by genome inversion events. Finally, we develop and study a model that includes selection and inversion, which tracks the occurrence and fixation of rearrangements. Surprisingly, gene clusters fail to evolve under a wide range of conditions. Factors that promote the evolution of gene clusters include a low number of genes in the pathway, a high population size, and in the case of the selfish operon model, a high horizontal transfer rate. The computational analysis here has shown that the evolution of gene clusters can occur under both direct and indirect selection as long as certain conditions hold. Under these conditions the selfish operon model is still viable as an explanation for the evolution of gene clusters.

  6. Electrostatically induced recruitment of membrane peptides into clusters requires ligand binding at both interfaces.

    Directory of Open Access Journals (Sweden)

    Yuri N Antonenko

    Full Text Available Protein recruitment to specific membrane locations may be governed or facilitated by electrostatic attraction, which originates from a multivalent ligand. Here we explored the energetics of a model system in which this simple electrostatic recruitment mechanism failed. That is, basic poly-L-lysine binding to one leaflet of a planar lipid bilayer did not recruit the triply-charged peptide (O-Pyromellitylgramicidin. Clustering was only observed in cases where PLL was bound to both channel ends. Clustering was indicated (i by the decreased diffusional PLL mobility D(PLL and (ii by an increased lifetime τ(PLL of the clustered channels. In contrast, if PLL was bound to only one leaflet, neither D(PLL nor τ(P changed. Simple calculations suggest that electrostatic repulsion of the unbound ends prevented neighboring OPg dimers from approaching each other. We believe that a similar mechanism may also operate in cell signaling and that it may e.g. contribute to the controversial results obtained for the ligand driven dimerization of G protein-coupled receptors.

  7. Anisotropy of the Coulomb Interaction between Folded Proteins: Consequences for Mesoscopic Aggregation of Lysozyme

    Science.gov (United States)

    Chan, Ho Yin; Lankevich, Vladimir; Vekilov, Peter G.; Lubchenko, Vassiliy

    2012-01-01

    Toward quantitative description of protein aggregation, we develop a computationally efficient method to evaluate the potential of mean force between two folded protein molecules that allows for complete sampling of their mutual orientation. Our model is valid at moderate ionic strengths and accounts for the actual charge distribution on the surface of the molecules, the dielectric discontinuity at the protein-solvent interface, and the possibility of protonation or deprotonation of surface residues induced by the electric field due to the other protein molecule. We apply the model to the protein lysozyme, whose solutions exhibit both mesoscopic clusters of protein-rich liquid and liquid-liquid separation; the former requires that protein form complexes with typical lifetimes of approximately milliseconds. We find the electrostatic repulsion is typically lower than the prediction of the Derjaguin-Landau-Verwey-Overbeek theory. The Coulomb interaction in the lowest-energy docking configuration is nonrepulsive, despite the high positive charge on the molecules. Typical docking configurations barely involve protonation or deprotonation of surface residues. The obtained potential of mean force between folded lysozyme molecules is consistent with the location of the liquid-liquid coexistence, but produces dimers that are too short-lived for clusters to exist, suggesting lysozyme undergoes conformational changes during cluster formation. PMID:22768950

  8. Relevant Subspace Clustering

    DEFF Research Database (Denmark)

    Müller, Emmanuel; Assent, Ira; Günnemann, Stephan

    2009-01-01

    Subspace clustering aims at detecting clusters in any subspace projection of a high dimensional space. As the number of possible subspace projections is exponential in the number of dimensions, the result is often tremendously large. Recent approaches fail to reduce results to relevant subspace...... clusters. Their results are typically highly redundant, i.e. many clusters are detected multiple times in several projections. In this work, we propose a novel model for relevant subspace clustering (RESCU). We present a global optimization which detects the most interesting non-redundant subspace clusters...... achieves top clustering quality while competing approaches show greatly varying performance....

  9. Is protein structure prediction still an enigma?

    African Journals Online (AJOL)

    STORAGESEVER

    2008-12-29

    Dec 29, 2008 ... Computer methods for protein analysis address this problem since they study the .... neighbor methods, molecular dynamic simulation, and approaches .... fuzzy clustering, neural net works, logistic regression, decision tree ...

  10. Horticultural cluster

    OpenAIRE

    SHERSTIUK S.V.; POSYLAYEVA K.I.

    2013-01-01

    In the article there are the theoretical and methodological approaches to the nature and existence of the cluster. The cluster differences from other kinds of cooperative and integration associations. Was develop by scientific-practical recommendations for forming a competitive horticultur cluster.

  11. Genetic clustering and polymorphism of the merozoite surface protein-3 of Plasmodium knowlesi clinical isolates from Peninsular Malaysia.

    Science.gov (United States)

    De Silva, Jeremy Ryan; Lau, Yee Ling; Fong, Mun Yik

    2017-01-03

    The simian malaria parasite Plasmodium knowlesi has been reported to cause significant numbers of human infection in South East Asia. Its merozoite surface protein-3 (MSP3) is a protein that belongs to a multi-gene family of proteins first found in Plasmodium falciparum. Several studies have evaluated the potential of P. falciparum MSP3 as a potential vaccine candidate. However, to date no detailed studies have been carried out on P. knowlesi MSP3 gene (pkmsp3). The present study investigates the genetic diversity, and haplotypes groups of pkmsp3 in P. knowlesi clinical samples from Peninsular Malaysia. Blood samples were collected from P. knowlesi malaria patients within a period of 4 years (2008-2012). The pkmsp3 gene of the isolates was amplified via PCR, and subsequently cloned and sequenced. The full length pkmsp3 sequence was divided into Domain A and Domain B. Natural selection, genetic diversity, and haplotypes of pkmsp3 were analysed using MEGA6 and DnaSP ver. 5.10.00 programmes. From 23 samples, 48 pkmsp3 sequences were successfully obtained. At the nucleotide level, 101 synonymous and 238 non-synonymous mutations were observed. Tests of neutrality were not significant for the full length, Domain A or Domain B sequences. However, the dN/dS ratio of Domain B indicates purifying selection for this domain. Analysis of the deduced amino acid sequences revealed 42 different haplotypes. Neighbour Joining phylogenetic tree and haplotype network analyses revealed that the haplotypes clustered into two distinct groups. A moderate level of genetic diversity was observed in the pkmsp3 and only the C-terminal region (Domain B) appeared to be under purifying selection. The separation of the pkmsp3 into two haplotype groups provides further evidence of the existence of two distinct P. knowlesi types or lineages. Future studies should investigate the diversity of pkmsp3 among P. knowlesi isolates in North Borneo, where large numbers of human knowlesi malaria infection

  12. TreeCluster: Massively scalable transmission clustering using phylogenetic trees

    OpenAIRE

    Moshiri, Alexander

    2018-01-01

    Background: The ability to infer transmission clusters from molecular data is critical to designing and evaluating viral control strategies. Viral sequencing datasets are growing rapidly, but standard methods of transmission cluster inference do not scale well beyond thousands of sequences. Results: I present TreeCluster, a cross-platform tool that performs transmission cluster inference on a given phylogenetic tree orders of magnitude faster than existing inference methods and supports multi...

  13. Voting-based consensus clustering for combining multiple clusterings of chemical structures

    Directory of Open Access Journals (Sweden)

    Saeed Faisal

    2012-12-01

    Full Text Available Abstract Background Although many consensus clustering methods have been successfully used for combining multiple classifiers in many areas such as machine learning, applied statistics, pattern recognition and bioinformatics, few consensus clustering methods have been applied for combining multiple clusterings of chemical structures. It is known that any individual clustering method will not always give the best results for all types of applications. So, in this paper, three voting and graph-based consensus clusterings were used for combining multiple clusterings of chemical structures to enhance the ability of separating biologically active molecules from inactive ones in each cluster. Results The cumulative voting-based aggregation algorithm (CVAA, cluster-based similarity partitioning algorithm (CSPA and hyper-graph partitioning algorithm (HGPA were examined. The F-measure and Quality Partition Index method (QPI were used to evaluate the clusterings and the results were compared to the Ward’s clustering method. The MDL Drug Data Report (MDDR dataset was used for experiments and was represented by two 2D fingerprints, ALOGP and ECFP_4. The performance of voting-based consensus clustering method outperformed the Ward’s method using F-measure and QPI method for both ALOGP and ECFP_4 fingerprints, while the graph-based consensus clustering methods outperformed the Ward’s method only for ALOGP using QPI. The Jaccard and Euclidean distance measures were the methods of choice to generate the ensembles, which give the highest values for both criteria. Conclusions The results of the experiments show that consensus clustering methods can improve the effectiveness of chemical structures clusterings. The cumulative voting-based aggregation algorithm (CVAA was the method of choice among consensus clustering methods.

  14. Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction

    Directory of Open Access Journals (Sweden)

    Fofanov Viacheslav Y

    2010-05-01

    Full Text Available Abstract Background Structural variations caused by a wide range of physico-chemical and biological sources directly influence the function of a protein. For enzymatic proteins, the structure and chemistry of the catalytic binding site residues can be loosely defined as a substructure of the protein. Comparative analysis of drug-receptor substructures across and within species has been used for lead evaluation. Substructure-level similarity between the binding sites of functionally similar proteins has also been used to identify instances of convergent evolution among proteins. In functionally homologous protein families, shared chemistry and geometry at catalytic sites provide a common, local point of comparison among proteins that may differ significantly at the sequence, fold, or domain topology levels. Results This paper describes two key results that can be used separately or in combination for protein function analysis. The Family-wise Analysis of SubStructural Templates (FASST method uses all-against-all substructure comparison to determine Substructural Clusters (SCs. SCs characterize the binding site substructural variation within a protein family. In this paper we focus on examples of automatically determined SCs that can be linked to phylogenetic distance between family members, segregation by conformation, and organization by homology among convergent protein lineages. The Motif Ensemble Statistical Hypothesis (MESH framework constructs a representative motif for each protein cluster among the SCs determined by FASST to build motif ensembles that are shown through a series of function prediction experiments to improve the function prediction power of existing motifs. Conclusions FASST contributes a critical feedback and assessment step to existing binding site substructure identification methods and can be used for the thorough investigation of structure-function relationships. The application of MESH allows for an automated

  15. OBSERVED SCALING RELATIONS FOR STRONG LENSING CLUSTERS: CONSEQUENCES FOR COSMOLOGY AND CLUSTER ASSEMBLY

    International Nuclear Information System (INIS)

    Comerford, Julia M.; Moustakas, Leonidas A.; Natarajan, Priyamvada

    2010-01-01

    Scaling relations of observed galaxy cluster properties are useful tools for constraining cosmological parameters as well as cluster formation histories. One of the key cosmological parameters, σ 8 , is constrained using observed clusters of galaxies, although current estimates of σ 8 from the scaling relations of dynamically relaxed galaxy clusters are limited by the large scatter in the observed cluster mass-temperature (M-T) relation. With a sample of eight strong lensing clusters at 0.3 8 , but combining the cluster concentration-mass relation with the M-T relation enables the inclusion of unrelaxed clusters as well. Thus, the resultant gains in the accuracy of σ 8 measurements from clusters are twofold: the errors on σ 8 are reduced and the cluster sample size is increased. Therefore, the statistics on σ 8 determination from clusters are greatly improved by the inclusion of unrelaxed clusters. Exploring cluster scaling relations further, we find that the correlation between brightest cluster galaxy (BCG) luminosity and cluster mass offers insight into the assembly histories of clusters. We find preliminary evidence for a steeper BCG luminosity-cluster mass relation for strong lensing clusters than the general cluster population, hinting that strong lensing clusters may have had more active merging histories.

  16. From Extraction of Local Structures of Protein Energy Landscapes to Improved Decoy Selection in Template-Free Protein Structure Prediction.

    Science.gov (United States)

    Akhter, Nasrin; Shehu, Amarda

    2018-01-19

    Due to the essential role that the three-dimensional conformation of a protein plays in regulating interactions with molecular partners, wet and dry laboratories seek biologically-active conformations of a protein to decode its function. Computational approaches are gaining prominence due to the labor and cost demands of wet laboratory investigations. Template-free methods can now compute thousands of conformations known as decoys, but selecting native conformations from the generated decoys remains challenging. Repeatedly, research has shown that the protein energy functions whose minima are sought in the generation of decoys are unreliable indicators of nativeness. The prevalent approach ignores energy altogether and clusters decoys by conformational similarity. Complementary recent efforts design protein-specific scoring functions or train machine learning models on labeled decoys. In this paper, we show that an informative consideration of energy can be carried out under the energy landscape view. Specifically, we leverage local structures known as basins in the energy landscape probed by a template-free method. We propose and compare various strategies of basin-based decoy selection that we demonstrate are superior to clustering-based strategies. The presented results point to further directions of research for improving decoy selection, including the ability to properly consider the multiplicity of native conformations of proteins.

  17. From Extraction of Local Structures of Protein Energy Landscapes to Improved Decoy Selection in Template-Free Protein Structure Prediction

    Directory of Open Access Journals (Sweden)

    Nasrin Akhter

    2018-01-01

    Full Text Available Due to the essential role that the three-dimensional conformation of a protein plays in regulating interactions with molecular partners, wet and dry laboratories seek biologically-active conformations of a protein to decode its function. Computational approaches are gaining prominence due to the labor and cost demands of wet laboratory investigations. Template-free methods can now compute thousands of conformations known as decoys, but selecting native conformations from the generated decoys remains challenging. Repeatedly, research has shown that the protein energy functions whose minima are sought in the generation of decoys are unreliable indicators of nativeness. The prevalent approach ignores energy altogether and clusters decoys by conformational similarity. Complementary recent efforts design protein-specific scoring functions or train machine learning models on labeled decoys. In this paper, we show that an informative consideration of energy can be carried out under the energy landscape view. Specifically, we leverage local structures known as basins in the energy landscape probed by a template-free method. We propose and compare various strategies of basin-based decoy selection that we demonstrate are superior to clustering-based strategies. The presented results point to further directions of research for improving decoy selection, including the ability to properly consider the multiplicity of native conformations of proteins.

  18. Cluster Headache

    OpenAIRE

    Pearce, Iris

    1985-01-01

    Cluster headache is the most severe primary headache with recurrent pain attacks described as worse than giving birth. The aim of this paper was to make an overview of current knowledge on cluster headache with a focus on pathophysiology and treatment. This paper presents hypotheses of cluster headache pathophysiology, current treatment options and possible future therapy approaches. For years, the hypothalamus was regarded as the key structure in cluster headache, but is now thought to be pa...

  19. Versatile microsphere attachment of GFP-labeled motors and other tagged proteins with preserved functionality

    Directory of Open Access Journals (Sweden)

    Michael Bugiel

    2015-11-01

    Full Text Available Microspheres are often used as handles for protein purification or force spectroscopy. For example, optical tweezers apply forces on trapped particles to which motor proteins are attached. However, even though many attachment strategies exist, procedures are often limited to a particular biomolecule and prone to non-specific protein or surface attachment. Such interactions may lead to loss of protein functionality or microsphere clustering. Here, we describe a versatile coupling procedure for GFP-tagged proteins via a polyethylene glycol linker preserving the functionality of the coupled proteins. The procedure combines well-established protocols, is highly reproducible, reliable, and can be used for a large variety of proteins. The coupling is efficient and can be tuned to the desired microsphere-to-protein ratio. Moreover, microspheres hardly cluster or adhere to surfaces. Furthermore, the procedure can be adapted to different tags providing flexibility and a promising attachment strategy for any tagged protein.

  20. Sizing protein-templated gold nanoclusters by time resolved fluorescence anisotropy decay measurements

    Science.gov (United States)

    Soleilhac, Antonin; Bertorelle, Franck; Antoine, Rodolphe

    2018-03-01

    Protein-templated gold nanoclusters (AuNCs) are very attractive due to their unique fluorescence properties. A major problem however may arise due to protein structure changes upon the nucleation of an AuNC within the protein for any future use as in vivo probes, for instance. In this work, we propose a simple and reliable fluorescence based technique measuring the hydrodynamic size of protein-templated gold nanoclusters. This technique uses the relation between the time resolved fluorescence anisotropy decay and the hydrodynamic volume, through the rotational correlation time. We determine the molecular size of protein-directed AuNCs, with protein templates of increasing sizes, e.g. insulin, lysozyme, and bovine serum albumin (BSA). The comparison of sizes obtained by other techniques (e.g. dynamic light scattering and small-angle X-ray scattering) between bare and gold clusters containing proteins allows us to address the volume changes induced either by conformational changes (for BSA) or the formation of protein dimers (for insulin and lysozyme) during cluster formation and incorporation.

  1. Sizing protein-templated gold nanoclusters by time resolved fluorescence anisotropy decay measurements.

    Science.gov (United States)

    Soleilhac, Antonin; Bertorelle, Franck; Antoine, Rodolphe

    2018-03-15

    Protein-templated gold nanoclusters (AuNCs) are very attractive due to their unique fluorescence properties. A major problem however may arise due to protein structure changes upon the nucleation of an AuNC within the protein for any future use as in vivo probes, for instance. In this work, we propose a simple and reliable fluorescence based technique measuring the hydrodynamic size of protein-templated gold nanoclusters. This technique uses the relation between the time resolved fluorescence anisotropy decay and the hydrodynamic volume, through the rotational correlation time. We determine the molecular size of protein-directed AuNCs, with protein templates of increasing sizes, e.g. insulin, lysozyme, and bovine serum albumin (BSA). The comparison of sizes obtained by other techniques (e.g. dynamic light scattering and small-angle X-ray scattering) between bare and gold clusters containing proteins allows us to address the volume changes induced either by conformational changes (for BSA) or the formation of protein dimers (for insulin and lysozyme) during cluster formation and incorporation. Copyright © 2017 Elsevier B.V. All rights reserved.

  2. Particle Simulation of Oxidation Induced Band 3 Clustering in Human Erythrocytes.

    Directory of Open Access Journals (Sweden)

    Hanae Shimo

    2015-06-01

    Full Text Available Oxidative stress mediated clustering of membrane protein band 3 plays an essential role in the clearance of damaged and aged red blood cells (RBCs from the circulation. While a number of previous experimental studies have observed changes in band 3 distribution after oxidative treatment, the details of how these clusters are formed and how their properties change under different conditions have remained poorly understood. To address these issues, a framework that enables the simultaneous monitoring of the temporal and spatial changes following oxidation is needed. In this study, we established a novel simulation strategy that incorporates deterministic and stochastic reactions with particle reaction-diffusion processes, to model band 3 cluster formation at single molecule resolution. By integrating a kinetic model of RBC antioxidant metabolism with a model of band 3 diffusion, we developed a model that reproduces the time-dependent changes of glutathione and clustered band 3 levels, as well as band 3 distribution during oxidative treatment, observed in prior studies. We predicted that cluster formation is largely dependent on fast reverse reaction rates, strong affinity between clustering molecules, and irreversible hemichrome binding. We further predicted that under repeated oxidative perturbations, clusters tended to progressively grow and shift towards an irreversible state. Application of our model to simulate oxidation in RBCs with cytoskeletal deficiency also suggested that oxidation leads to more enhanced clustering compared to healthy RBCs. Taken together, our model enables the prediction of band 3 spatio-temporal profiles under various situations, thus providing valuable insights to potentially aid understanding mechanisms for removing senescent and premature RBCs.

  3. Particle Simulation of Oxidation Induced Band 3 Clustering in Human Erythrocytes.

    Science.gov (United States)

    Shimo, Hanae; Arjunan, Satya Nanda Vel; Machiyama, Hiroaki; Nishino, Taiko; Suematsu, Makoto; Fujita, Hideaki; Tomita, Masaru; Takahashi, Koichi

    2015-06-01

    Oxidative stress mediated clustering of membrane protein band 3 plays an essential role in the clearance of damaged and aged red blood cells (RBCs) from the circulation. While a number of previous experimental studies have observed changes in band 3 distribution after oxidative treatment, the details of how these clusters are formed and how their properties change under different conditions have remained poorly understood. To address these issues, a framework that enables the simultaneous monitoring of the temporal and spatial changes following oxidation is needed. In this study, we established a novel simulation strategy that incorporates deterministic and stochastic reactions with particle reaction-diffusion processes, to model band 3 cluster formation at single molecule resolution. By integrating a kinetic model of RBC antioxidant metabolism with a model of band 3 diffusion, we developed a model that reproduces the time-dependent changes of glutathione and clustered band 3 levels, as well as band 3 distribution during oxidative treatment, observed in prior studies. We predicted that cluster formation is largely dependent on fast reverse reaction rates, strong affinity between clustering molecules, and irreversible hemichrome binding. We further predicted that under repeated oxidative perturbations, clusters tended to progressively grow and shift towards an irreversible state. Application of our model to simulate oxidation in RBCs with cytoskeletal deficiency also suggested that oxidation leads to more enhanced clustering compared to healthy RBCs. Taken together, our model enables the prediction of band 3 spatio-temporal profiles under various situations, thus providing valuable insights to potentially aid understanding mechanisms for removing senescent and premature RBCs.

  4. Emerging critical roles of Fe-S clusters in DNA replication and repair

    Science.gov (United States)

    Fuss, Jill O.; Tsai, Chi-Lin; Ishida, Justin P.; Tainer, John A.

    2015-01-01

    Fe-S clusters are partners in the origin of life that predate cells, acetyl-CoA metabolism, DNA, and the RNA world. The double helix solved the mystery of DNA replication by base pairing for accurate copying. Yet, for genome stability necessary to life, the double helix has equally important implications for damage repair. Here we examine striking advances that uncover Fe-S cluster roles both in copying the genetic sequence by DNA polymerases and in crucial repair processes for genome maintenance, as mutational defects cause cancer and degenerative disease. Moreover, we examine an exciting, controversial role for Fe-S clusters in a third element required for life – the long-range coordination and regulation of replication and repair events. By their ability to delocalize electrons over both Fe and S centers, Fe-S clusters have unbeatable features for protein conformational control and charge transfer via double-stranded DNA that may fundamentally transform our understanding of life, replication, and repair. PMID:25655665

  5. Properties of an ionised-cluster beam from a vaporised-cluster ion source

    International Nuclear Information System (INIS)

    Takagi, T.; Yamada, I.; Sasaki, A.

    1978-01-01

    A new type of ion source vaporised-metal cluster ion source, has been developed for deposition and epitaxy. A cluster consisting of 10 2 to 10 3 atoms coupled loosely together is formed by adiabatic expansion ejecting the vapour of materials into a high-vacuum region through the nozzle of a heated crucible. The clusters are ionised by electron bombardment and accelerated with neutral clusters toward a substrate. In this paper, mechanisms of cluster formation experimental results of the cluster size (atoms/cluster) and its distribution, and characteristics of the cluster ion beams are reported. The size is calculated from the kinetic equation E = (1/2)mNVsub(ej) 2 , where E is the cluster beam energy, Vsub(ej) is the ejection velocity, m is the mass of atom and N is the cluster size. The energy and the velocity of the cluster are measured by an electrostatic 127 0 energy analyser and a rotating disc system, respectively. The cluster size obtained for Ag is about 5 x 10 2 to 2 x 10 3 atoms. The retarding potential method is used to confirm the results for Ag. The same dependence on cluster size for metals such as Ag, Cu and Pb has been obtained in previous experiments. In the cluster state the cluster ion beam is easily produced by electron bombardment. About 50% of ionised clusters are obtained under typical operation conditions, because of the large ionisation cross sections of the clusters. To obtain a uniform spatial distribution, the ionising electrode system is also discussed. The new techniques are termed ionised-cluster beam deposition (ICBD) and epitaxy (ICBE). (author)

  6. Feasibility Study of Parallel Finite Element Analysis on Cluster-of-Clusters

    Science.gov (United States)

    Muraoka, Masae; Okuda, Hiroshi

    With the rapid growth of WAN infrastructure and development of Grid middleware, it's become a realistic and attractive methodology to connect cluster machines on wide-area network for the execution of computation-demanding applications. Many existing parallel finite element (FE) applications have been, however, designed and developed with a single computing resource in mind, since such applications require frequent synchronization and communication among processes. There have been few FE applications that can exploit the distributed environment so far. In this study, we explore the feasibility of FE applications on the cluster-of-clusters. First, we classify FE applications into two types, tightly coupled applications (TCA) and loosely coupled applications (LCA) based on their communication pattern. A prototype of each application is implemented on the cluster-of-clusters. We perform numerical experiments executing TCA and LCA on both the cluster-of-clusters and a single cluster. Thorough these experiments, by comparing the performances and communication cost in each case, we evaluate the feasibility of FEA on the cluster-of-clusters.

  7. Interplay between experiments and calculations for organometallic clusters and caged clusters

    International Nuclear Information System (INIS)

    Nakajima, Atsushi

    2015-01-01

    Clusters consisting of 10-1000 atoms exhibit size-dependent electronic and geometric properties. In particular, composite clusters consisting of several elements and/or components provide a promising way for a bottom-up approach for designing functional advanced materials, because the functionality of the composite clusters can be optimized not only by the cluster size but also by their compositions. In the formation of composite clusters, their geometric symmetry and dimensionality are emphasized to control the physical and chemical properties, because selective and anisotropic enhancements for optical, chemical, and magnetic properties can be expected. Organometallic clusters and caged clusters are demonstrated as a representative example of designing the functionality of the composite clusters. Organometallic vanadium-benzene forms a one dimensional sandwich structure showing ferromagnetic behaviors and anomalously large HOMO-LUMO gap differences of two spin orbitals, which can be regarded as spin-filter components for cluster-based spintronic devices. Caged clusters of aluminum (Al) are well stabilized both geometrically and electronically at Al 12 X, behaving as a “superatom”

  8. Characterization of the photolyase-like iron sulfur protein PhrB from Agrobacterium tumefaciens by Mössbauer spectroscopy

    Science.gov (United States)

    Bauer, T. O.; Graf, D.; Lamparter, T.; Schünemann, V.

    2014-04-01

    High field Mössbauer spectroscopy has been used to characterize the [4Fe-4S] 2 +cluster of the protein PhrB from Agrobacterium tumefaciens which belongs to the cryptochrome/photolyase family (CPF) and which biological function has previously been shown to be DNA repair. Mössbauer spectra taken of the as prepared protein reveal δ = 0. 42 mms - 1, and Δ E Q = 1. 26 mms - 1as well as an asymmetry parameter of η = 0. 8. These parameters are characteristic for a ferredoxin-type [4Fe-4S] 2 +cluster. In order to investigate whether this cluster is involved in DNA-repair the protein has also been studied in its photoactivated state during DNA binding. The so obtained data sets exhibit essentially the same Mössbauer parameters as those of the non-activated PhrB. This indicates that during DNA repair the [4Fe-4S] 2 +cluster of PhrB has no significant amounts of transition states which have conformational changes compared to the resting state of the protein and which have life times of several seconds or longer.

  9. Categorias Cluster

    OpenAIRE

    Queiroz, Dayane Andrade

    2015-01-01

    Neste trabalho apresentamos as categorias cluster, que foram introduzidas por Aslak Bakke Buan, Robert Marsh, Markus Reineke, Idun Reiten e Gordana Todorov, com o objetivo de categoriíicar as algebras cluster criadas em 2002 por Sergey Fomin e Andrei Zelevinsky. Os autores acima, em [4], mostraram que existe uma estreita relação entre algebras cluster e categorias cluster para quivers cujo grafo subjacente é um diagrama de Dynkin. Para isto desenvolveram uma teoria tilting na estrutura triang...

  10. BRIGHTEST CLUSTER GALAXIES AND CORE GAS DENSITY IN REXCESS CLUSTERS

    International Nuclear Information System (INIS)

    Haarsma, Deborah B.; Leisman, Luke; Donahue, Megan; Bruch, Seth; Voit, G. Mark; Boehringer, Hans; Pratt, Gabriel W.; Pierini, Daniele; Croston, Judith H.; Arnaud, Monique

    2010-01-01

    We investigate the relationship between brightest cluster galaxies (BCGs) and their host clusters using a sample of nearby galaxy clusters from the Representative XMM-Newton Cluster Structure Survey. The sample was imaged with the Southern Observatory for Astrophysical Research in R band to investigate the mass of the old stellar population. Using a metric radius of 12 h -1 kpc, we found that the BCG luminosity depends weakly on overall cluster mass as L BCG ∝ M 0.18±0.07 cl , consistent with previous work. We found that 90% of the BCGs are located within 0.035 r 500 of the peak of the X-ray emission, including all of the cool core (CC) clusters. We also found an unexpected correlation between the BCG metric luminosity and the core gas density for non-cool-core (non-CC) clusters, following a power law of n e ∝ L 2.7±0.4 BCG (where n e is measured at 0.008 r 500 ). The correlation is not easily explained by star formation (which is weak in non-CC clusters) or overall cluster mass (which is not correlated with core gas density). The trend persists even when the BCG is not located near the peak of the X-ray emission, so proximity is not necessary. We suggest that, for non-CC clusters, this correlation implies that the same process that sets the central entropy of the cluster gas also determines the central stellar density of the BCG, and that this underlying physical process is likely to be mergers.

  11. Speed Controls in Translating Secretory Proteins in Eukaryotes - an Evolutionary Perspective

    Science.gov (United States)

    Mahlab, Shelly; Linial, Michal

    2014-01-01

    Protein translation is the most expensive operation in dividing cells from bacteria to humans. Therefore, managing the speed and allocation of resources is subject to tight control. From bacteria to humans, clusters of relatively rare tRNA codons at the N′-terminal of mRNAs have been implicated in attenuating the process of ribosome allocation, and consequently the translation rate in a broad range of organisms. The current interpretation of “slow” tRNA codons does not distinguish between protein translations mediated by free- or endoplasmic reticulum (ER)-bound ribosomes. We demonstrate that proteins translated by free- or ER-bound ribosomes exhibit different overall properties in terms of their translation efficiency and speed in yeast, fly, plant, worm, bovine and human. We note that only secreted or membranous proteins with a Signal peptide (SP) are specified by segments of “slow” tRNA at the N′-terminal, followed by abundant codons that are considered “fast.” Such profiles apply to 3100 proteins of the human proteome that are composed of secreted and signal peptide (SP)-assisted membranous proteins. Remarkably, the bulks of the proteins (12,000), or membranous proteins lacking SP (3400), do not have such a pattern. Alternation of “fast” and “slow” codons was found also in proteins that translocate to mitochondria through transit peptides (TP). The differential clusters of tRNA adapted codons is not restricted to the N′-terminal of transcripts. Specifically, Glycosylphosphatidylinositol (GPI)-anchored proteins are unified by clusters of low adapted tRNAs codons at the C′-termini. Furthermore, selection of amino acids types and specific codons was shown as the driving force which establishes the translation demands for the secretory proteome. We postulate that “hard-coded” signals within the secretory proteome assist the steps of protein maturation and folding. Specifically, “speed control” signals for delaying the translation

  12. Scientific Cluster Deployment and Recovery - Using puppet to simplify cluster management

    Science.gov (United States)

    Hendrix, Val; Benjamin, Doug; Yao, Yushu

    2012-12-01

    Deployment, maintenance and recovery of a scientific cluster, which has complex, specialized services, can be a time consuming task requiring the assistance of Linux system administrators, network engineers as well as domain experts. Universities and small institutions that have a part-time FTE with limited time for and knowledge of the administration of such clusters can be strained by such maintenance tasks. This current work is the result of an effort to maintain a data analysis cluster (DAC) with minimal effort by a local system administrator. The realized benefit is the scientist, who is the local system administrator, is able to focus on the data analysis instead of the intricacies of managing a cluster. Our work provides a cluster deployment and recovery process (CDRP) based on the puppet configuration engine allowing a part-time FTE to easily deploy and recover entire clusters with minimal effort. Puppet is a configuration management system (CMS) used widely in computing centers for the automatic management of resources. Domain experts use Puppet's declarative language to define reusable modules for service configuration and deployment. Our CDRP has three actors: domain experts, a cluster designer and a cluster manager. The domain experts first write the puppet modules for the cluster services. A cluster designer would then define a cluster. This includes the creation of cluster roles, mapping the services to those roles and determining the relationships between the services. Finally, a cluster manager would acquire the resources (machines, networking), enter the cluster input parameters (hostnames, IP addresses) and automatically generate deployment scripts used by puppet to configure it to act as a designated role. In the event of a machine failure, the originally generated deployment scripts along with puppet can be used to easily reconfigure a new machine. The cluster definition produced in our CDRP is an integral part of automating cluster deployment

  13. TLR-activated repression of Fe-S cluster biogenesis drives a metabolic shift and alters histone and tubulin acetylation.

    Science.gov (United States)

    Tong, Wing-Hang; Maio, Nunziata; Zhang, De-Liang; Palmieri, Erika M; Ollivierre, Hayden; Ghosh, Manik C; McVicar, Daniel W; Rouault, Tracey A

    2018-05-22

    Given the essential roles of iron-sulfur (Fe-S) cofactors in mediating electron transfer in the mitochondrial respiratory chain and supporting heme biosynthesis, mitochondrial dysfunction is a common feature in a growing list of human Fe-S cluster biogenesis disorders, including Friedreich ataxia and GLRX5-related sideroblastic anemia. Here, our studies showed that restriction of Fe-S cluster biogenesis not only compromised mitochondrial oxidative metabolism but also resulted in decreased overall histone acetylation and increased H3K9me3 levels in the nucleus and increased acetylation of α-tubulin in the cytosol by decreasing the lipoylation of the pyruvate dehydrogenase complex, decreasing levels of succinate dehydrogenase and the histone acetyltransferase ELP3, and increasing levels of the tubulin acetyltransferase MEC17. Previous studies have shown that the metabolic shift in Toll-like receptor (TLR)-activated myeloid cells involves rapid activation of glycolysis and subsequent mitochondrial respiratory failure due to nitric oxide (NO)-mediated damage to Fe-S proteins. Our studies indicated that TLR activation also actively suppresses many components of the Fe-S cluster biogenesis machinery, which exacerbates NO-mediated damage to Fe-S proteins by interfering with cluster recovery. These results reveal new regulatory pathways and novel roles of the Fe-S cluster biogenesis machinery in modifying the epigenome and acetylome and provide new insights into the etiology of Fe-S cluster biogenesis disorders.

  14. Crystal structure of the FeS cluster-containing nucleotide excision repair helicase XPD.

    Directory of Open Access Journals (Sweden)

    Stefanie C Wolski

    2008-06-01

    Full Text Available DNA damage recognition by the nucleotide excision repair pathway requires an initial step identifying helical distortions in the DNA and a proofreading step verifying the presence of a lesion. This proofreading step is accomplished in eukaryotes by the TFIIH complex. The critical damage recognition component of TFIIH is the XPD protein, a DNA helicase that unwinds DNA and identifies the damage. Here, we describe the crystal structure of an archaeal XPD protein with high sequence identity to the human XPD protein that reveals how the structural helicase framework is combined with additional elements for strand separation and DNA scanning. Two RecA-like helicase domains are complemented by a 4Fe4S cluster domain, which has been implicated in damage recognition, and an alpha-helical domain. The first helicase domain together with the helical and 4Fe4S-cluster-containing domains form a central hole with a diameter sufficient in size to allow passage of a single stranded DNA. Based on our results, we suggest a model of how DNA is bound to the XPD protein, and can rationalize several of the mutations in the human XPD gene that lead to one of three severe diseases, xeroderma pigmentosum, Cockayne syndrome, and trichothiodystrophy.

  15. Cluster-cluster correlations in the two-dimensional stationary Ising-model

    International Nuclear Information System (INIS)

    Klassmann, A.

    1997-01-01

    In numerical integration of the Cahn-Hillard equation, which describes Oswald rising in a two-phase matrix, N. Masbaum showed that spatial correlations between clusters scale with respect to the mean cluster size (itself a function of time). T. B. Liverpool showed by Monte Carlo simulations for the Ising model that the analogous correlations have a similar form. Both demonstrated that immediately around each cluster there is some depletion area followed by something like a ring of clusters of the same size as the original one. More precisely, it has been shown that the distribution of clusters around a given cluster looks like a sinus-curve decaying exponentially with respect to the distance to a constant value

  16. GraphCrunch 2: Software tool for network modeling, alignment and clustering.

    Science.gov (United States)

    Kuchaiev, Oleksii; Stevanović, Aleksandar; Hayes, Wayne; Pržulj, Nataša

    2011-01-19

    Recent advancements in experimental biotechnology have produced large amounts of protein-protein interaction (PPI) data. The topology of PPI networks is believed to have a strong link to their function. Hence, the abundance of PPI data for many organisms stimulates the development of computational techniques for the modeling, comparison, alignment, and clustering of networks. In addition, finding representative models for PPI networks will improve our understanding of the cell just as a model of gravity has helped us understand planetary motion. To decide if a model is representative, we need quantitative comparisons of model networks to real ones. However, exact network comparison is computationally intractable and therefore several heuristics have been used instead. Some of these heuristics are easily computable "network properties," such as the degree distribution, or the clustering coefficient. An important special case of network comparison is the network alignment problem. Analogous to sequence alignment, this problem asks to find the "best" mapping between regions in two networks. It is expected that network alignment might have as strong an impact on our understanding of biology as sequence alignment has had. Topology-based clustering of nodes in PPI networks is another example of an important network analysis problem that can uncover relationships between interaction patterns and phenotype. We introduce the GraphCrunch 2 software tool, which addresses these problems. It is a significant extension of GraphCrunch which implements the most popular random network models and compares them with the data networks with respect to many network properties. Also, GraphCrunch 2 implements the GRAph ALigner algorithm ("GRAAL") for purely topological network alignment. GRAAL can align any pair of networks and exposes large, dense, contiguous regions of topological and functional similarities far larger than any other existing tool. Finally, GraphCruch 2 implements an

  17. GraphCrunch 2: Software tool for network modeling, alignment and clustering

    Directory of Open Access Journals (Sweden)

    Hayes Wayne

    2011-01-01

    Full Text Available Abstract Background Recent advancements in experimental biotechnology have produced large amounts of protein-protein interaction (PPI data. The topology of PPI networks is believed to have a strong link to their function. Hence, the abundance of PPI data for many organisms stimulates the development of computational techniques for the modeling, comparison, alignment, and clustering of networks. In addition, finding representative models for PPI networks will improve our understanding of the cell just as a model of gravity has helped us understand planetary motion. To decide if a model is representative, we need quantitative comparisons of model networks to real ones. However, exact network comparison is computationally intractable and therefore several heuristics have been used instead. Some of these heuristics are easily computable "network properties," such as the degree distribution, or the clustering coefficient. An important special case of network comparison is the network alignment problem. Analogous to sequence alignment, this problem asks to find the "best" mapping between regions in two networks. It is expected that network alignment might have as strong an impact on our understanding of biology as sequence alignment has had. Topology-based clustering of nodes in PPI networks is another example of an important network analysis problem that can uncover relationships between interaction patterns and phenotype. Results We introduce the GraphCrunch 2 software tool, which addresses these problems. It is a significant extension of GraphCrunch which implements the most popular random network models and compares them with the data networks with respect to many network properties. Also, GraphCrunch 2 implements the GRAph ALigner algorithm ("GRAAL" for purely topological network alignment. GRAAL can align any pair of networks and exposes large, dense, contiguous regions of topological and functional similarities far larger than any other

  18. Structure and Sequence Analyses of Clustered Protocadherins Reveal Antiparallel Interactions that Mediate Homophilic Specificity.

    Science.gov (United States)

    Nicoludis, John M; Lau, Sze-Yi; Schärfe, Charlotta P I; Marks, Debora S; Weihofen, Wilhelm A; Gaudet, Rachelle

    2015-11-03

    Clustered protocadherin (Pcdh) proteins mediate dendritic self-avoidance in neurons via specific homophilic interactions in their extracellular cadherin (EC) domains. We determined crystal structures of EC1-EC3, containing the homophilic specificity-determining region, of two mouse clustered Pcdh isoforms (PcdhγA1 and PcdhγC3) to investigate the nature of the homophilic interaction. Within the crystal lattices, we observe antiparallel interfaces consistent with a role in trans cell-cell contact. Antiparallel dimerization is supported by evolutionary correlations. Two interfaces, located primarily on EC2-EC3, involve distinctive clustered Pcdh structure and sequence motifs, lack predicted glycosylation sites, and contain residues highly conserved in orthologs but not paralogs, pointing toward their biological significance as homophilic interaction interfaces. These two interfaces are similar yet distinct, reflecting a possible difference in interaction architecture between clustered Pcdh subfamilies. These structures initiate a molecular understanding of clustered Pcdh assemblies that are required to produce functional neuronal networks. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Identifying Hierarchical and Overlapping Protein Complexes Based on Essential Protein-Protein Interactions and “Seed-Expanding” Method

    Directory of Open Access Journals (Sweden)

    Jun Ren

    2014-01-01

    Full Text Available Many evidences have demonstrated that protein complexes are overlapping and hierarchically organized in PPI networks. Meanwhile, the large size of PPI network wants complex detection methods have low time complexity. Up to now, few methods can identify overlapping and hierarchical protein complexes in a PPI network quickly. In this paper, a novel method, called MCSE, is proposed based on λ-module and “seed-expanding.” First, it chooses seeds as essential PPIs or edges with high edge clustering values. Then, it identifies protein complexes by expanding each seed to a λ-module. MCSE is suitable for large PPI networks because of its low time complexity. MCSE can identify overlapping protein complexes naturally because a protein can be visited by different seeds. MCSE uses the parameter λ_th to control the range of seed expanding and can detect a hierarchical organization of protein complexes by tuning the value of λ_th. Experimental results of S. cerevisiae show that this hierarchical organization is similar to that of known complexes in MIPS database. The experimental results also show that MCSE outperforms other previous competing algorithms, such as CPM, CMC, Core-Attachment, Dpclus, HC-PIN, MCL, and NFC, in terms of the functional enrichment and matching with known protein complexes.

  20. Meaningful Clusters

    Energy Technology Data Exchange (ETDEWEB)

    Sanfilippo, Antonio P.; Calapristi, Augustin J.; Crow, Vernon L.; Hetzler, Elizabeth G.; Turner, Alan E.

    2004-05-26

    We present an approach to the disambiguation of cluster labels that capitalizes on the notion of semantic similarity to assign WordNet senses to cluster labels. The approach provides interesting insights on how document clustering can provide the basis for developing a novel approach to word sense disambiguation.

  1. Macroeconomic Dimensions in the Clusterization Processes: Lithuanian Biomass Cluster Case

    Directory of Open Access Journals (Sweden)

    Navickas Valentinas

    2017-03-01

    Full Text Available The Future production systems’ increasing significance will impose work, which maintains not a competitive, but a collaboration basis, with concentrated resources and expertise, which can help to reach the general purpose. One form of collaboration among medium-size business organizations is work in clusters. Clusterization as a phenomenon has been known from quite a long time, but it offers simple benefits to researches at micro and medium levels. The clusterization process evaluation in macroeconomic dimensions has been comparatively little investigated. Thereby, in this article, the clusterization processes is analysed by concentrating our attention on macroeconomic factor researches. The authors analyse clusterization’s influence on country’s macroeconomic growth; they apply a structure research methodology for clusterization’s macroeconomic influence evaluation and propose that clusterization processes benefit macroeconomic analysis. The theoretical model of clusterization processes was validated by referring to a biomass cluster case. Because biomass cluster case is a new phenomenon, currently there are no other scientific approaches to them. The authors’ accomplished researches show that clusterization allows the achievement of a large positive slip in macroeconomics, which proves to lead to a high value added to creation, a faster country economic growth, and social situation amelioration.

  2. Clustering Dycom

    KAUST Repository

    Minku, Leandro L.

    2017-10-06

    Background: Software Effort Estimation (SEE) can be formulated as an online learning problem, where new projects are completed over time and may become available for training. In this scenario, a Cross-Company (CC) SEE approach called Dycom can drastically reduce the number of Within-Company (WC) projects needed for training, saving the high cost of collecting such training projects. However, Dycom relies on splitting CC projects into different subsets in order to create its CC models. Such splitting can have a significant impact on Dycom\\'s predictive performance. Aims: This paper investigates whether clustering methods can be used to help finding good CC splits for Dycom. Method: Dycom is extended to use clustering methods for creating the CC subsets. Three different clustering methods are investigated, namely Hierarchical Clustering, K-Means, and Expectation-Maximisation. Clustering Dycom is compared against the original Dycom with CC subsets of different sizes, based on four SEE databases. A baseline WC model is also included in the analysis. Results: Clustering Dycom with K-Means can potentially help to split the CC projects, managing to achieve similar or better predictive performance than Dycom. However, K-Means still requires the number of CC subsets to be pre-defined, and a poor choice can negatively affect predictive performance. EM enables Dycom to automatically set the number of CC subsets while still maintaining or improving predictive performance with respect to the baseline WC model. Clustering Dycom with Hierarchical Clustering did not offer significant advantage in terms of predictive performance. Conclusion: Clustering methods can be an effective way to automatically generate Dycom\\'s CC subsets.

  3. HIV protein sequence hotspots for crosstalk with host hub proteins.

    Directory of Open Access Journals (Sweden)

    Mahdi Sarmady

    Full Text Available HIV proteins target host hub proteins for transient binding interactions. The presence of viral proteins in the infected cell results in out-competition of host proteins in their interaction with hub proteins, drastically affecting cell physiology. Functional genomics and interactome datasets can be used to quantify the sequence hotspots on the HIV proteome mediating interactions with host hub proteins. In this study, we used the HIV and human interactome databases to identify HIV targeted host hub proteins and their host binding partners (H2. We developed a high throughput computational procedure utilizing motif discovery algorithms on sets of protein sequences, including sequences of HIV and H2 proteins. We identified as HIV sequence hotspots those linear motifs that are highly conserved on HIV sequences and at the same time have a statistically enriched presence on the sequences of H2 proteins. The HIV protein motifs discovered in this study are expressed by subsets of H2 host proteins potentially outcompeted by HIV proteins. A large subset of these motifs is involved in cleavage, nuclear localization, phosphorylation, and transcription factor binding events. Many such motifs are clustered on an HIV sequence in the form of hotspots. The sequential positions of these hotspots are consistent with the curated literature on phenotype altering residue mutations, as well as with existing binding site data. The hotspot map produced in this study is the first global portrayal of HIV motifs involved in altering the host protein network at highly connected hub nodes.

  4. LMC clusters: young

    International Nuclear Information System (INIS)

    Freeman, K.C.

    1980-01-01

    The young globular clusters of the LMC have ages of 10 7 -10 8 y. Their masses and structure are similar to those of the smaller galactic globular clusters. Their stellar mass functions (in the mass range 6 solar masses to 1.2 solar masses) vary greatly from cluster to cluster, although the clusters are similar in total mass, age, structure and chemical composition. It would be very interesting to know why these clusters are forming now in the LMC and not in the Galaxy. The author considers the 'young globular' or 'blue populous' clusters of the LMC. The ages of these objects are 10 7 to 10 8 y, and their masses are 10 4 to 10 5 solar masses, so they are populous enough to be really useful for studying the evolution of massive stars. The author concentrates on the structure and stellar content of these young clusters. (Auth.)

  5. Major cluster mergers and the location of the brightest cluster galaxy

    International Nuclear Information System (INIS)

    Martel, Hugo; Robichaud, Fidèle; Barai, Paramita

    2014-01-01

    Using a large N-body cosmological simulation combined with a subgrid treatment of galaxy formation, merging, and tidal destruction, we study the formation and evolution of the galaxy and cluster population in a comoving volume (100 Mpc) 3 in a ΛCDM universe. At z = 0, our computational volume contains 1788 clusters with mass M cl > 1.1 × 10 12 M ☉ , including 18 massive clusters with M cl > 10 14 M ☉ . It also contains 1, 088, 797 galaxies with mass M gal ≥ 2 × 10 9 M ☉ and luminosity L > 9.5 × 10 5 L ☉ . For each cluster, we identified the brightest cluster galaxy (BCG). We then computed two separate statistics: the fraction f BNC of clusters in which the BCG is not the closest galaxy to the center of the cluster in projection, and the ratio Δv/σ, where Δv is the difference in radial velocity between the BCG and the whole cluster and σ is the radial velocity dispersion of the cluster. We found that f BNC increases from 0.05 for low-mass clusters (M cl ∼ 10 12 M ☉ ) to 0.5 for high-mass clusters (M cl > 10 14 M ☉ ) with very little dependence on cluster redshift. Most of this result turns out to be a projection effect and when we consider three-dimensional distances instead of projected distances, f BNC increases only to 0.2 at high-cluster mass. The values of Δv/σ vary from 0 to 1.8, with median values in the range 0.03-0.15 when considering all clusters, and 0.12-0.31 when considering only massive clusters. These results are consistent with previous observational studies and indicate that the central galaxy paradigm, which states that the BCG should be at rest at the center of the cluster, is usually valid, but exceptions are too common to be ignored. We built merger trees for the 18 most massive clusters in the simulation. Analysis of these trees reveal that 16 of these clusters have experienced 1 or several major or semi-major mergers in the past. These mergers leave each cluster in a non-equilibrium state, but eventually the cluster

  6. The Sorcerer II Global Ocean Sampling Expedition: Expanding theUniverse of Protein Families

    Energy Technology Data Exchange (ETDEWEB)

    Yooseph, Shibu; Sutton, Granger; Rusch, Douglas B.; Halpern,Aaron L.; Williamson, Shannon J.; Remington, Karin; Eisen, Jonathan A.; Heidelberg, Karla B.; Manning, Gerard; Li, Weizhong; Jaroszewski, Lukasz; Cieplak, Piotr; Miller, Christopher S.; Li, Huiying; Mashiyama, Susan T.; Joachimiak, Marcin P.; van Belle, Christopher; Chandonia, John-Marc; Soergel, David A.; Zhai, Yufeng; Natarajan, Kannan; Lee, Shaun; Raphael,Benjamin J.; Bafna, Vineet; Friedman, Robert; Brenner, Steven E.; Godzik,Adam; Eisenberg, David; Dixon, Jack E.; Taylor, Susan S.; Strausberg,Robert L.; Frazier, Marvin; Venter, J.Craig

    2006-03-23

    Metagenomics projects based on shotgun sequencing of populations of micro-organisms yield insight into protein families. We used sequence similarity clustering to explore proteins with a comprehensive dataset consisting of sequences from available databases together with 6.12 million proteins predicted from an assembly of 7.7 million Global Ocean Sampling (GOS) sequences. The GOS dataset covers nearly all known prokaryotic protein families. A total of 3,995 medium- and large-sized clusters consisting of only GOS sequences are identified, out of which 1,700 have no detectable homology to known families. The GOS-only clusters contain a higher than expected proportion of sequences of viral origin, thus reflecting a poor sampling of viral diversity until now. Protein domain distributions in the GOS dataset and current protein databases show distinct biases. Several protein domains that were previously categorized as kingdom specific are shown to have GOS examples in other kingdoms. About 6,000 sequences (ORFans) from the literature that heretofore lacked similarity to known proteins have matches in the GOS data. The GOS dataset is also used to improve remote homology detection. Overall, besides nearly doubling the number of current proteins, the predicted GOS proteins also add a great deal of diversity to known protein families and shed light on their evolution. These observations are illustrated using several protein families, including phosphatases, proteases, ultraviolet-irradiation DNA damage repair enzymes, glutamine synthetase, and RuBisCO. The diversity added by GOS data has implications for choosing targets for experimental structure characterization as part of structural genomics efforts. Our analysis indicates that new families are being discovered at a rate that is linear or almost linear with the addition of new sequences, implying that we are still far from discovering all protein families in nature.

  7. The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families.

    Science.gov (United States)

    Yooseph, Shibu; Sutton, Granger; Rusch, Douglas B; Halpern, Aaron L; Williamson, Shannon J; Remington, Karin; Eisen, Jonathan A; Heidelberg, Karla B; Manning, Gerard; Li, Weizhong; Jaroszewski, Lukasz; Cieplak, Piotr; Miller, Christopher S; Li, Huiying; Mashiyama, Susan T; Joachimiak, Marcin P; van Belle, Christopher; Chandonia, John-Marc; Soergel, David A; Zhai, Yufeng; Natarajan, Kannan; Lee, Shaun; Raphael, Benjamin J; Bafna, Vineet; Friedman, Robert; Brenner, Steven E; Godzik, Adam; Eisenberg, David; Dixon, Jack E; Taylor, Susan S; Strausberg, Robert L; Frazier, Marvin; Venter, J Craig

    2007-03-01

    Metagenomics projects based on shotgun sequencing of populations of micro-organisms yield insight into protein families. We used sequence similarity clustering to explore proteins with a comprehensive dataset consisting of sequences from available databases together with 6.12 million proteins predicted from an assembly of 7.7 million Global Ocean Sampling (GOS) sequences. The GOS dataset covers nearly all known prokaryotic protein families. A total of 3,995 medium- and large-sized clusters consisting of only GOS sequences are identified, out of which 1,700 have no detectable homology to known families. The GOS-only clusters contain a higher than expected proportion of sequences of viral origin, thus reflecting a poor sampling of viral diversity until now. Protein domain distributions in the GOS dataset and current protein databases show distinct biases. Several protein domains that were previously categorized as kingdom specific are shown to have GOS examples in other kingdoms. About 6,000 sequences (ORFans) from the literature that heretofore lacked similarity to known proteins have matches in the GOS data. The GOS dataset is also used to improve remote homology detection. Overall, besides nearly doubling the number of current proteins, the predicted GOS proteins also add a great deal of diversity to known protein families and shed light on their evolution. These observations are illustrated using several protein families, including phosphatases, proteases, ultraviolet-irradiation DNA damage repair enzymes, glutamine synthetase, and RuBisCO. The diversity added by GOS data has implications for choosing targets for experimental structure characterization as part of structural genomics efforts. Our analysis indicates that new families are being discovered at a rate that is linear or almost linear with the addition of new sequences, implying that we are still far from discovering all protein families in nature.

  8. The Sorcerer II Global Ocean Sampling expedition: expanding the universe of protein families.

    Directory of Open Access Journals (Sweden)

    Shibu Yooseph

    2007-03-01

    Full Text Available Metagenomics projects based on shotgun sequencing of populations of micro-organisms yield insight into protein families. We used sequence similarity clustering to explore proteins with a comprehensive dataset consisting of sequences from available databases together with 6.12 million proteins predicted from an assembly of 7.7 million Global Ocean Sampling (GOS sequences. The GOS dataset covers nearly all known prokaryotic protein families. A total of 3,995 medium- and large-sized clusters consisting of only GOS sequences are identified, out of which 1,700 have no detectable homology to known families. The GOS-only clusters contain a higher than expected proportion of sequences of viral origin, thus reflecting a poor sampling of viral diversity until now. Protein domain distributions in the GOS dataset and current protein databases show distinct biases. Several protein domains that were previously categorized as kingdom specific are shown to have GOS examples in other kingdoms. About 6,000 sequences (ORFans from the literature that heretofore lacked similarity to known proteins have matches in the GOS data. The GOS dataset is also used to improve remote homology detection. Overall, besides nearly doubling the number of current proteins, the predicted GOS proteins also add a great deal of diversity to known protein families and shed light on their evolution. These observations are illustrated using several protein families, including phosphatases, proteases, ultraviolet-irradiation DNA damage repair enzymes, glutamine synthetase, and RuBisCO. The diversity added by GOS data has implications for choosing targets for experimental structure characterization as part of structural genomics efforts. Our analysis indicates that new families are being discovered at a rate that is linear or almost linear with the addition of new sequences, implying that we are still far from discovering all protein families in nature.

  9. Changing cluster composition in cluster randomised controlled trials: design and analysis considerations

    Science.gov (United States)

    2014-01-01

    Background There are many methodological challenges in the conduct and analysis of cluster randomised controlled trials, but one that has received little attention is that of post-randomisation changes to cluster composition. To illustrate this, we focus on the issue of cluster merging, considering the impact on the design, analysis and interpretation of trial outcomes. Methods We explored the effects of merging clusters on study power using standard methods of power calculation. We assessed the potential impacts on study findings of both homogeneous cluster merges (involving clusters randomised to the same arm of a trial) and heterogeneous merges (involving clusters randomised to different arms of a trial) by simulation. To determine the impact on bias and precision of treatment effect estimates, we applied standard methods of analysis to different populations under analysis. Results Cluster merging produced a systematic reduction in study power. This effect depended on the number of merges and was most pronounced when variability in cluster size was at its greatest. Simulations demonstrate that the impact on analysis was minimal when cluster merges were homogeneous, with impact on study power being balanced by a change in observed intracluster correlation coefficient (ICC). We found a decrease in study power when cluster merges were heterogeneous, and the estimate of treatment effect was attenuated. Conclusions Examples of cluster merges found in previously published reports of cluster randomised trials were typically homogeneous rather than heterogeneous. Simulations demonstrated that trial findings in such cases would be unbiased. However, simulations also showed that any heterogeneous cluster merges would introduce bias that would be hard to quantify, as well as having negative impacts on the precision of estimates obtained. Further methodological development is warranted to better determine how to analyse such trials appropriately. Interim recommendations

  10. Genomic organization of the rat alpha 2u-globulin gene cluster.

    Science.gov (United States)

    McFadyen, D A; Addison, W; Locke, J

    1999-05-01

    The alpha 2u-globulin are a group of similar proteins, belonging to the lipocalin superfamily of proteins, that are synthesized in a subset of secretory tissues in rats. The many alpha 2u-globulin isoforms are encoded by a multigene family that exhibits extensive homology. Despite a high degree of sequence identity, individual family members show diverse expression patterns involving complex hormonal, tissue-specific, and developmental regulation. Analysis suggests that there are approximately 20 alpha 2u-globulin genes in the rat genome. We have used fluorescence in situ hybridization (FISH) to show that the alpha 2u-globulin genes are clustered at a single site on rat Chromosome (Chr) 5 (5q22-24). Southern blots of rat genomic DNA separated by pulsed field gel electrophoresis indicated that the alpha 2u-globulin genes are contained on two NruI fragments with a total size of 880 kbp. Analysis of three P1 clones containing alpha 2u-globulin genes indicated that the alpha 2u-globulin genes are tandemly arranged in a head-to-tail fashion. The organization of the alpha 2u-globulin genes in the rat as a tandem array of single genes differs from the homologous major urinary protein genes in the mouse, which are organized as tandem arrays of divergently oriented gene pairs. The structure of these gene clusters may have consequences for the proposed function, as a pheromone transporter, for the protein products encoded by these genes.

  11. Parallel clustering algorithm for large-scale biological data sets.

    Science.gov (United States)

    Wang, Minchao; Zhang, Wu; Ding, Wang; Dai, Dongbo; Zhang, Huiran; Xie, Hao; Chen, Luonan; Guo, Yike; Xie, Jiang

    2014-01-01

    Recent explosion of biological data brings a great challenge for the traditional clustering algorithms. With increasing scale of data sets, much larger memory and longer runtime are required for the cluster identification problems. The affinity propagation algorithm outperforms many other classical clustering algorithms and is widely applied into the biological researches. However, the time and space complexity become a great bottleneck when handling the large-scale data sets. Moreover, the similarity matrix, whose constructing procedure takes long runtime, is required before running the affinity propagation algorithm, since the algorithm clusters data sets based on the similarities between data pairs. Two types of parallel architectures are proposed in this paper to accelerate the similarity matrix constructing procedure and the affinity propagation algorithm. The memory-shared architecture is used to construct the similarity matrix, and the distributed system is taken for the affinity propagation algorithm, because of its large memory size and great computing capacity. An appropriate way of data partition and reduction is designed in our method, in order to minimize the global communication cost among processes. A speedup of 100 is gained with 128 cores. The runtime is reduced from serval hours to a few seconds, which indicates that parallel algorithm is capable of handling large-scale data sets effectively. The parallel affinity propagation also achieves a good performance when clustering large-scale gene data (microarray) and detecting families in large protein superfamilies.

  12. Peptide Pattern Recognition for high-throughput protein sequence analysis and clustering

    DEFF Research Database (Denmark)

    Busk, Peter Kamp

    2017-01-01

    Large collections of protein sequences with divergent sequences are tedious to analyze for understanding their phylogenetic or structure-function relation. Peptide Pattern Recognition is an algorithm that was developed to facilitate this task but the previous version does only allow a limited...... number of sequences as input. I implemented Peptide Pattern Recognition as a multithread software designed to handle large numbers of sequences and perform analysis in a reasonable time frame. Benchmarking showed that the new implementation of Peptide Pattern Recognition is twenty times faster than...... the previous implementation on a small protein collection with 673 MAP kinase sequences. In addition, the new implementation could analyze a large protein collection with 48,570 Glycosyl Transferase family 20 sequences without reaching its upper limit on a desktop computer. Peptide Pattern Recognition...

  13. Structural and biochemical analysis of nuclease domain of clustered regularly interspaced short palindromic repeat (CRISPR)-associated protein 3 (Cas3).

    Science.gov (United States)

    Mulepati, Sabin; Bailey, Scott

    2011-09-09

    RNA transcribed from clustered regularly interspaced short palindromic repeats (CRISPRs) protects many prokaryotes from invasion by foreign DNA such as viruses, conjugative plasmids, and transposable elements. Cas3 (CRISPR-associated protein 3) is essential for this CRISPR protection and is thought to mediate cleavage of the foreign DNA through its N-terminal histidine-aspartate (HD) domain. We report here the 1.8 Å crystal structure of the HD domain of Cas3 from Thermus thermophilus HB8. Structural and biochemical studies predict that this enzyme binds two metal ions at its active site. We also demonstrate that the single-stranded DNA endonuclease activity of this T. thermophilus domain is activated not by magnesium but by transition metal ions such as manganese and nickel. Structure-guided mutagenesis confirms the importance of the metal-binding residues for the nuclease activity and identifies other active site residues. Overall, these results provide a framework for understanding the role of Cas3 in the CRISPR system.

  14. Cluster evolution

    International Nuclear Information System (INIS)

    Schaeffer, R.

    1987-01-01

    The galaxy and cluster luminosity functions are constructed from a model of the mass distribution based on hierarchical clustering at an epoch where the matter distribution is non-linear. These luminosity functions are seen to reproduce the present distribution of objects as can be inferred from the observations. They can be used to deduce the redshift dependence of the cluster distribution and to extrapolate the observations towards the past. The predicted evolution of the cluster distribution is quite strong, although somewhat less rapid than predicted by the linear theory

  15. Strategies to regulate transcription factor-mediated gene positioning and interchromosomal clustering at the nuclear periphery.

    Science.gov (United States)

    Randise-Hinchliff, Carlo; Coukos, Robert; Sood, Varun; Sumner, Michael Chas; Zdraljevic, Stefan; Meldi Sholl, Lauren; Garvey Brickner, Donna; Ahmed, Sara; Watchmaker, Lauren; Brickner, Jason H

    2016-03-14

    In budding yeast, targeting of active genes to the nuclear pore complex (NPC) and interchromosomal clustering is mediated by transcription factor (TF) binding sites in the gene promoters. For example, the binding sites for the TFs Put3, Ste12, and Gcn4 are necessary and sufficient to promote positioning at the nuclear periphery and interchromosomal clustering. However, in all three cases, gene positioning and interchromosomal clustering are regulated. Under uninducing conditions, local recruitment of the Rpd3(L) histone deacetylase by transcriptional repressors blocks Put3 DNA binding. This is a general function of yeast repressors: 16 of 21 repressors blocked Put3-mediated subnuclear positioning; 11 of these required Rpd3. In contrast, Ste12-mediated gene positioning is regulated independently of DNA binding by mitogen-activated protein kinase phosphorylation of the Dig2 inhibitor, and Gcn4-dependent targeting is up-regulated by increasing Gcn4 protein levels. These different regulatory strategies provide either qualitative switch-like control or quantitative control of gene positioning over different time scales. © 2016 Randise-Hinchliff et al.

  16. Pattern of occurrence and occupancy of carbonylation sites in proteins

    DEFF Research Database (Denmark)

    Rao, R Shyama Prasad; Møller, Ian Max

    2011-01-01

    sites. Comparison of metal-catalyzed oxidation of two closely related proteins indicates that this type of carbonylation might not be very specific in proteins. Interestingly, carbonylated sites show a very strong tendency to cluster together in the protein primary sequence hinting at some sort......Proteins are targets for modification by reactive oxygen species, and carbonylation is an important irreversible modification that increases during oxidative stress. While information on protein carbonylation is accumulating, its pattern is not yet understood. We have made a meta...

  17. Cytokines and clustered cardiovascular risk factors in children

    DEFF Research Database (Denmark)

    Andersen, Lars Bo; Müller, Klaus; Eiberg, Stig

    2010-01-01

    pronounced in fat and unfit children based on the association with CRP levels. The association between fitness and fatness variables, insulin resistance, and clustered risk could be caused by other mechanisms related to these exposures. The role of IL-6 remains unclear.......The aim was to evaluate the possible role of tumor necrosis factor alpha (TNF-alpha), interleukin-6 (IL-6), C-reactive protein (CRP), low fitness, and fatness in the early development of clustering of cardiovascular disease (CVD) risk factors and insulin resistance. Subjects for this cross......-sectional study were obtained from 18 schools near Copenhagen, Denmark. Two hundred ten 9-year-old children were selected for cytokine analysis from 434 third-grade children with complete CVD risk profiles. The subgroup was selected according to the CVD risk factor profile (upper and lower quartile of a composite...

  18. Cluster-cluster aggregation of Ising dipolar particles under thermal noise

    KAUST Repository

    Suzuki, Masaru

    2009-08-14

    The cluster-cluster aggregation processes of Ising dipolar particles under thermal noise are investigated in the dilute condition. As the temperature increases, changes in the typical structures of clusters are observed from chainlike (D1) to crystalline (D2) through fractal structures (D1.45), where D is the fractal dimension. By calculating the bending energy of the chainlike structure, it is found that the transition temperature is associated with the energy gap between the chainlike and crystalline configurations. The aggregation dynamics changes from being dominated by attraction to diffusion involving changes in the dynamic exponent z=0.2 to 0.5. In the region of temperature where the fractal clusters grow, different growth rates are observed between charged and neutral clusters. Using the Smoluchowski equation with a twofold kernel, this hetero-aggregation process is found to result from two types of dynamics: the diffusive motion of neutral clusters and the weak attractive motion between charged clusters. The fact that changes in structures and dynamics take place at the same time suggests that transitions in the structure of clusters involve marked changes in the dynamics of the aggregation processes. © 2009 The American Physical Society.

  19. Comprehensive identification and clustering of CLV3/ESR-related (CLE) genes in plants finds groups with potentially shared function.

    Science.gov (United States)

    Goad, David M; Zhu, Chuanmei; Kellogg, Elizabeth A

    2017-10-01

    CLV3/ESR (CLE) proteins are important signaling peptides in plants. The short CLE peptide (12-13 amino acids) is cleaved from a larger pre-propeptide and functions as an extracellular ligand. The CLE family is large and has resisted attempts at classification because the CLE domain is too short for reliable phylogenetic analysis and the pre-propeptide is too variable. We used a model-based search for CLE domains from 57 plant genomes and used the entire pre-propeptide for comprehensive clustering analysis. In total, 1628 CLE genes were identified in land plants, with none recognizable from green algae. These CLEs form 12 groups within which CLE domains are largely conserved and pre-propeptides can be aligned. Most clusters contain sequences from monocots, eudicots and Amborella trichopoda, with sequences from Picea abies, Selaginella moellendorffii and Physcomitrella patens scattered in some clusters. We easily identified previously known clusters involved in vascular differentiation and nodulation. In addition, we found a number of discrete groups whose function remains poorly characterized. Available data indicate that CLE proteins within a cluster are likely to share function, whereas those from different clusters play at least partially different roles. Our analysis provides a foundation for future evolutionary and functional studies. © 2016 The Authors. New Phytologist © 2016 New Phytologist Trust.

  20. A method for improved clustering and classification of microscopy images using quantitative co-localization coefficients

    LENUS (Irish Health Repository)

    Singan, Vasanth R

    2012-06-08

    AbstractBackgroundThe localization of proteins to specific subcellular structures in eukaryotic cells provides important information with respect to their function. Fluorescence microscopy approaches to determine localization distribution have proved to be an essential tool in the characterization of unknown proteins, and are now particularly pertinent as a result of the wide availability of fluorescently-tagged constructs and antibodies. However, there are currently very few image analysis options able to effectively discriminate proteins with apparently similar distributions in cells, despite this information being important for protein characterization.FindingsWe have developed a novel method for combining two existing image analysis approaches, which results in highly efficient and accurate discrimination of proteins with seemingly similar distributions. We have combined image texture-based analysis with quantitative co-localization coefficients, a method that has traditionally only been used to study the spatial overlap between two populations of molecules. Here we describe and present a novel application for quantitative co-localization, as applied to the study of Rab family small GTP binding proteins localizing to the endomembrane system of cultured cells.ConclusionsWe show how quantitative co-localization can be used alongside texture feature analysis, resulting in improved clustering of microscopy images. The use of co-localization as an additional clustering parameter is non-biased and highly applicable to high-throughput image data sets.

  1. NIF-type iron-sulfur cluster assembly system is duplicated and distributed in the mitochondria and cytosol of Mastigamoeba balamuthi.

    Science.gov (United States)

    Nývltová, Eva; Šuták, Robert; Harant, Karel; Šedinová, Miroslava; Hrdy, Ivan; Paces, Jan; Vlček, Čestmír; Tachezy, Jan

    2013-04-30

    In most eukaryotes, the mitochondrion is the main organelle for the formation of iron-sulfur (FeS) clusters. This function is mediated through the iron-sulfur cluster assembly machinery, which was inherited from the α-proteobacterial ancestor of mitochondria. In Archamoebae, including pathogenic Entamoeba histolytica and free-living Mastigamoeba balamuthi, the complex iron-sulfur cluster machinery has been replaced by an ε-proteobacterial nitrogen fixation (NIF) system consisting of two components: NifS (cysteine desulfurase) and NifU (scaffold protein). However, the cellular localization of the NIF system and the involvement of mitochondria in archamoebal FeS assembly are controversial. Here, we show that the genes for both NIF components are duplicated within the M. balamuthi genome. One paralog of each protein contains an amino-terminal extension that targets proteins to mitochondria (NifS-M and NifU-M), and the second paralog lacks a targeting signal, thereby reflecting the cytosolic form of the NIF machinery (NifS-C and NifU-C). The dual localization of the NIF system corresponds to the presence of FeS proteins in both cellular compartments, including detectable hydrogenase activity in Mastigamoeba cytosol and mitochondria. In contrast, E. histolytica possesses only single genes encoding NifS and NifU, respectively, and there is no evidence for the presence of the NIF machinery in its reduced mitochondria. Thus, M. balamuthi is unique among eukaryotes in that its FeS cluster formation is mediated through two most likely independent NIF machineries present in two cellular compartments.

  2. Clustering Molecular Dynamics Trajectories for Optimizing Docking Experiments

    Directory of Open Access Journals (Sweden)

    Renata De Paris

    2015-01-01

    Full Text Available Molecular dynamics simulations of protein receptors have become an attractive tool for rational drug discovery. However, the high computational cost of employing molecular dynamics trajectories in virtual screening of large repositories threats the feasibility of this task. Computational intelligence techniques have been applied in this context, with the ultimate goal of reducing the overall computational cost so the task can become feasible. Particularly, clustering algorithms have been widely used as a means to reduce the dimensionality of molecular dynamics trajectories. In this paper, we develop a novel methodology for clustering entire trajectories using structural features from the substrate-binding cavity of the receptor in order to optimize docking experiments on a cloud-based environment. The resulting partition was selected based on three clustering validity criteria, and it was further validated by analyzing the interactions between 20 ligands and a fully flexible receptor (FFR model containing a 20 ns molecular dynamics simulation trajectory. Our proposed methodology shows that taking into account features of the substrate-binding cavity as input for the k-means algorithm is a promising technique for accurately selecting ensembles of representative structures tailored to a specific ligand.

  3. Diversity among galaxy clusters

    International Nuclear Information System (INIS)

    Struble, M.F.; Rood, H.J.

    1988-01-01

    The classification of galaxy clusters is discussed. Consideration is given to the classification scheme of Abell (1950's), Zwicky (1950's), Morgan, Matthews, and Schmidt (1964), and Morgan-Bautz (1970). Galaxies can be classified based on morphology, chemical composition, spatial distribution, and motion. The correlation between a galaxy's environment and morphology is examined. The classification scheme of Rood-Sastry (1971), which is based on clusters's morphology and galaxy population, is described. The six types of clusters they define include: (1) a cD-cluster dominated by a single large galaxy, (2) a cluster dominated by a binary, (3) a core-halo cluster, (4) a cluster dominated by several bright galaxies, (5) a cluster appearing flattened, and (6) an irregularly shaped cluster. Attention is also given to the evolution of cluster structures, which is related to initial density and cluster motion

  4. Characterization of known protein complexes using k-connectivity and other topological measures

    Science.gov (United States)

    Gallagher, Suzanne R; Goldberg, Debra S

    2015-01-01

    Many protein complexes are densely packed, so proteins within complexes often interact with several other proteins in the complex. Steric constraints prevent most proteins from simultaneously binding more than a handful of other proteins, regardless of the number of proteins in the complex. Because of this, as complex size increases, several measures of the complex decrease within protein-protein interaction networks. However, k-connectivity, the number of vertices or edges that need to be removed in order to disconnect a graph, may be consistently high for protein complexes. The property of k-connectivity has been little used previously in the investigation of protein-protein interactions. To understand the discriminative power of k-connectivity and other topological measures for identifying unknown protein complexes, we characterized these properties in known Saccharomyces cerevisiae protein complexes in networks generated both from highly accurate X-ray crystallography experiments which give an accurate model of each complex, and also as the complexes appear in high-throughput yeast 2-hybrid studies in which new complexes may be discovered. We also computed these properties for appropriate random subgraphs.We found that clustering coefficient, mutual clustering coefficient, and k-connectivity are better indicators of known protein complexes than edge density, degree, or betweenness. This suggests new directions for future protein complex-finding algorithms. PMID:26913183

  5. Synthetic modeling chemistry of iron-sulfur clusters in nitric oxide signaling.

    Science.gov (United States)

    Fitzpatrick, Jessica; Kim, Eunsuk

    2015-08-18

    Nitric oxide (NO) is an important signaling molecule that is involved in many physiological and pathological functions. Iron-sulfur proteins are one of the main reaction targets for NO, and the [Fe-S] clusters within these proteins are converted to various iron nitrosyl species upon reaction with NO, of which dinitrosyl iron complexes (DNICs) are the most prevalent. Much progress has been made in identifying the origin of cellular DNIC generation. However, it is not well-understood which other products besides DNICs may form during [Fe-S] cluster degradation nor what effects DNICs and other degradation products can have once they are generated in cells. Even more elusive is an understanding of the manner by which cells cope with unwanted [Fe-S] modifications by NO. This Account describes our synthetic modeling efforts to identify cluster degradation products derived from the [2Fe-2S]/NO reaction in order to establish their chemical reactivity and repair chemistry. Our intent is to use the chemical knowledge that we generate to provide insight into the unknown biological consequences of cluster modification. Our recent advances in three different areas are described. First, new reaction conditions that lead to the formation of previously unrecognized products during the reaction of [Fe-S] clusters with NO are identified. Hydrogen sulfide (H2S), a gaseous signaling molecule, can be generated from the reaction between [2Fe-2S] clusters and NO in the presence of acid or formal H• (e(-)/H(+)) donors. In the presence of acid, a mononitrosyl iron complex (MNIC) can be produced as the major iron-containing product. Second, cysteine analogues can efficiently convert MNICs back to [2Fe-2S] clusters without the need for any other reagents. This reaction is possible for cysteine analogues because of their ability to labilize NO from MNICs and their capacity to undergo C-S bond cleavage, providing the necessary sulfide for [2Fe-2S] cluster formation. Lastly, unique dioxygen

  6. Cluster-cluster aggregation of Ising dipolar particles under thermal noise

    KAUST Repository

    Suzuki, Masaru; Kun, Ferenc; Ito, Nobuyasu

    2009-01-01

    The cluster-cluster aggregation processes of Ising dipolar particles under thermal noise are investigated in the dilute condition. As the temperature increases, changes in the typical structures of clusters are observed from chainlike (D1

  7. Re-estimating sample size in cluster randomized trials with active recruitment within clusters

    NARCIS (Netherlands)

    van Schie, Sander; Moerbeek, Mirjam

    2014-01-01

    Often only a limited number of clusters can be obtained in cluster randomised trials, although many potential participants can be recruited within each cluster. Thus, active recruitment is feasible within the clusters. To obtain an efficient sample size in a cluster randomised trial, the cluster

  8. DiffSLC: A graph centrality method to detect essential proteins of a protein-protein interaction network.

    Science.gov (United States)

    Mistry, Divya; Wise, Roger P; Dickerson, Julie A

    2017-01-01

    Identification of central genes and proteins in biomolecular networks provides credible candidates for pathway analysis, functional analysis, and essentiality prediction. The DiffSLC centrality measure predicts central and essential genes and proteins using a protein-protein interaction network. Network centrality measures prioritize nodes and edges based on their importance to the network topology. These measures helped identify critical genes and proteins in biomolecular networks. The proposed centrality measure, DiffSLC, combines the number of interactions of a protein and the gene coexpression values of genes from which those proteins were translated, as a weighting factor to bias the identification of essential proteins in a protein interaction network. Potentially essential proteins with low node degree are promoted through eigenvector centrality. Thus, the gene coexpression values are used in conjunction with the eigenvector of the network's adjacency matrix and edge clustering coefficient to improve essentiality prediction. The outcome of this prediction is shown using three variations: (1) inclusion or exclusion of gene co-expression data, (2) impact of different coexpression measures, and (3) impact of different gene expression data sets. For a total of seven networks, DiffSLC is compared to other centrality measures using Saccharomyces cerevisiae protein interaction networks and gene expression data. Comparisons are also performed for the top ranked proteins against the known essential genes from the Saccharomyces Gene Deletion Project, which show that DiffSLC detects more essential proteins and has a higher area under the ROC curve than other compared methods. This makes DiffSLC a stronger alternative to other centrality methods for detecting essential genes using a protein-protein interaction network that obeys centrality-lethality principle. DiffSLC is implemented using the igraph package in R, and networkx package in Python. The python package can be

  9. Multi-Optimisation Consensus Clustering

    Science.gov (United States)

    Li, Jian; Swift, Stephen; Liu, Xiaohui

    Ensemble Clustering has been developed to provide an alternative way of obtaining more stable and accurate clustering results. It aims to avoid the biases of individual clustering algorithms. However, it is still a challenge to develop an efficient and robust method for Ensemble Clustering. Based on an existing ensemble clustering method, Consensus Clustering (CC), this paper introduces an advanced Consensus Clustering algorithm called Multi-Optimisation Consensus Clustering (MOCC), which utilises an optimised Agreement Separation criterion and a Multi-Optimisation framework to improve the performance of CC. Fifteen different data sets are used for evaluating the performance of MOCC. The results reveal that MOCC can generate more accurate clustering results than the original CC algorithm.

  10. Electron: Cluster interactions

    International Nuclear Information System (INIS)

    Scheidemann, A.A.; Knight, W.D.

    1994-02-01

    Beam depletion spectroscopy has been used to measure absolute total inelastic electron-sodium cluster collision cross sections in the energy range from E ∼ 0.1 to E ∼ 6 eV. The investigation focused on the closed shell clusters Na 8 , Na 20 , Na 40 . The measured cross sections show an increase for the lowest collision energies where electron attachment is the primary scattering channel. The electron attachment cross section can be understood in terms of Langevin scattering, connecting this measurement with the polarizability of the cluster. For energies above the dissociation energy the measured electron-cluster cross section is energy independent, thus defining an electron-cluster interaction range. This interaction range increases with the cluster size

  11. Semantic based cluster content discovery in description first clustering algorithm

    International Nuclear Information System (INIS)

    Khan, M.W.; Asif, H.M.S.

    2017-01-01

    In the field of data analytics grouping of like documents in textual data is a serious problem. A lot of work has been done in this field and many algorithms have purposed. One of them is a category of algorithms which firstly group the documents on the basis of similarity and then assign the meaningful labels to those groups. Description first clustering algorithm belong to the category in which the meaningful description is deduced first and then relevant documents are assigned to that description. LINGO (Label Induction Grouping Algorithm) is the algorithm of description first clustering category which is used for the automatic grouping of documents obtained from search results. It uses LSI (Latent Semantic Indexing); an IR (Information Retrieval) technique for induction of meaningful labels for clusters and VSM (Vector Space Model) for cluster content discovery. In this paper we present the LINGO while it is using LSI during cluster label induction and cluster content discovery phase. Finally, we compare results obtained from the said algorithm while it uses VSM and Latent semantic analysis during cluster content discovery phase. (author)

  12. The clustered nucleus-cluster structures in stable and unstable nuclei

    International Nuclear Information System (INIS)

    Freer, Martin

    2007-01-01

    The subject of clustering has a lineage which runs throughout the history of nuclear physics. Its attraction is the simplification of the often uncorrelated behaviour of independent particles to organized and coherent quasi-crystalline structures. In this review the ideas behind the development of clustering in light nuclei are investigated, mostly from the stand-point of the harmonic oscillator framework. This allows a unifying description of alpha-conjugate and neutron-rich nuclei, alike. More sophisticated models of clusters are explored, such as antisymmetrized molecular dynamics. A number of contemporary topics in clustering are touched upon; the 3α-cluster state in 12 C, nuclear molecules and clustering at the drip-line. Finally, an understanding of the 12 C+ 12 C resonances in 24 Mg, within the framework of the theoretical ideas developed in the review, is presented

  13. Regional Innovation Clusters

    Data.gov (United States)

    Small Business Administration — The Regional Innovation Clusters serve a diverse group of sectors and geographies. Three of the initial pilot clusters, termed Advanced Defense Technology clusters,...

  14. Choosing the Number of Clusters in K-Means Clustering

    Science.gov (United States)

    Steinley, Douglas; Brusco, Michael J.

    2011-01-01

    Steinley (2007) provided a lower bound for the sum-of-squares error criterion function used in K-means clustering. In this article, on the basis of the lower bound, the authors propose a method to distinguish between 1 cluster (i.e., a single distribution) versus more than 1 cluster. Additionally, conditional on indicating there are multiple…

  15. Personalized PageRank Clustering: A graph clustering algorithm based on random walks

    Science.gov (United States)

    A. Tabrizi, Shayan; Shakery, Azadeh; Asadpour, Masoud; Abbasi, Maziar; Tavallaie, Mohammad Ali

    2013-11-01

    Graph clustering has been an essential part in many methods and thus its accuracy has a significant effect on many applications. In addition, exponential growth of real-world graphs such as social networks, biological networks and electrical circuits demands clustering algorithms with nearly-linear time and space complexity. In this paper we propose Personalized PageRank Clustering (PPC) that employs the inherent cluster exploratory property of random walks to reveal the clusters of a given graph. We combine random walks and modularity to precisely and efficiently reveal the clusters of a graph. PPC is a top-down algorithm so it can reveal inherent clusters of a graph more accurately than other nearly-linear approaches that are mainly bottom-up. It also gives a hierarchy of clusters that is useful in many applications. PPC has a linear time and space complexity and has been superior to most of the available clustering algorithms on many datasets. Furthermore, its top-down approach makes it a flexible solution for clustering problems with different requirements.

  16. A functional bikaverin biosynthesis gene cluster in rare strains of Botrytis cinerea is positively controlled by VELVET.

    Directory of Open Access Journals (Sweden)

    Julia Schumacher

    Full Text Available The gene cluster responsible for the biosynthesis of the red polyketidic pigment bikaverin has only been characterized in Fusarium ssp. so far. Recently, a highly homologous but incomplete and nonfunctional bikaverin cluster has been found in the genome of the unrelated phytopathogenic fungus Botrytis cinerea. In this study, we provided evidence that rare B. cinerea strains such as 1750 have a complete and functional cluster comprising the six genes orthologous to Fusarium fujikuroi ffbik1-ffbik6 and do produce bikaverin. Phylogenetic analysis confirmed that the whole cluster was acquired from Fusarium through a horizontal gene transfer (HGT. In the bikaverin-nonproducing strain B05.10, the genes encoding bikaverin biosynthesis enzymes are nonfunctional due to deleterious mutations (bcbik2-3 or missing (bcbik1 but interestingly, the genes encoding the regulatory proteins BcBIK4 and BcBIK5 do not harbor deleterious mutations which suggests that they may still be functional. Heterologous complementation of the F. fujikuroi Δffbik4 mutant confirmed that bcbik4 of strain B05.10 is indeed fully functional. Deletion of bcvel1 in the pink strain 1750 resulted in loss of bikaverin and overproduction of melanin indicating that the VELVET protein BcVEL1 regulates the biosynthesis of the two pigments in an opposite manner. Although strain 1750 itself expresses a truncated BcVEL1 protein (100 instead of 575 aa that is nonfunctional with regard to sclerotia formation, virulence and oxalic acid formation, it is sufficient to regulate pigment biosynthesis (bikaverin and melanin and fenhexamid HydR2 type of resistance. Finally, a genetic cross between strain 1750 and a bikaverin-nonproducing strain sensitive to fenhexamid revealed that the functional bikaverin cluster is genetically linked to the HydR2 locus.

  17. A protein interaction map of the kalimantacin biosynthesis assembly line

    Directory of Open Access Journals (Sweden)

    Birgit Uytterhoeven

    2016-11-01

    Full Text Available The antimicrobial secondary metabolite kalimantacin is produced by a hybrid polyketide/ non-ribosomal peptide system in Pseudomonas fluorescens BCCM_ID9359. In this study, the kalimantacin biosynthesis gene cluster is analyzed by yeast two-hybrid analysis, creating a protein-protein interaction map of the entire assembly line. In total, 28 potential interactions were identified, of which 13 could be confirmed further. These interactions include the dimerization of ketosynthase domains, a link between assembly line modules 9 and 10, and a specific interaction between the trans-acting enoyl reductase BatK and the carrier proteins of modules 8 and 10. These interactions reveal fundamental insight into the biosynthesis of secondary metabolites.This study is the first to reveal interactions in a complete biosynthetic pathway. Similar future studies could build a strong basis for engineering strategies in such clusters.

  18. Kinetics of proton transfer in a green fluorescent protein: A laser ...

    Indian Academy of Sciences (India)

    Unknown

    therefore implicates bulk solvent-controlled protein dynamics in the protonation process. ... recently to protein–protein interactions in the bacterial response regulator SpoOF. NMR ..... molecular mechanism for redox-driven proton transfer to a buried iron–sulphur cluster ... Dynamic simulations of proton transfer from bulk.

  19. Clusters in nuclei

    CERN Document Server

    Following the pioneering discovery of alpha clustering and of molecular resonances, the field of nuclear clustering is today one of those domains of heavy-ion nuclear physics that faces the greatest challenges, yet also contains the greatest opportunities. After many summer schools and workshops, in particular over the last decade, the community of nuclear molecular physicists has decided to collaborate in producing a comprehensive collection of lectures and tutorial reviews covering the field. This third volume follows the successful Lect. Notes Phys. 818 (Vol. 1) and 848 (Vol. 2), and comprises six extensive lectures covering the following topics:  - Gamma Rays and Molecular Structure - Faddeev Equation Approach for Three Cluster Nuclear Reactions - Tomography of the Cluster Structure of Light Nuclei Via Relativistic Dissociation - Clustering Effects Within the Dinuclear Model : From Light to Hyper-heavy Molecules in Dynamical Mean-field Approach - Clusterization in Ternary Fission - Clusters in Light N...

  20. Spatial cluster modelling

    CERN Document Server

    Lawson, Andrew B

    2002-01-01

    Research has generated a number of advances in methods for spatial cluster modelling in recent years, particularly in the area of Bayesian cluster modelling. Along with these advances has come an explosion of interest in the potential applications of this work, especially in epidemiology and genome research. In one integrated volume, this book reviews the state-of-the-art in spatial clustering and spatial cluster modelling, bringing together research and applications previously scattered throughout the literature. It begins with an overview of the field, then presents a series of chapters that illuminate the nature and purpose of cluster modelling within different application areas, including astrophysics, epidemiology, ecology, and imaging. The focus then shifts to methods, with discussions on point and object process modelling, perfect sampling of cluster processes, partitioning in space and space-time, spatial and spatio-temporal process modelling, nonparametric methods for clustering, and spatio-temporal ...

  1. Herd Clustering: A synergistic data clustering approach using collective intelligence

    KAUST Repository

    Wong, Kachun; Peng, Chengbin; Li, Yue; Chan, Takming

    2014-01-01

    , this principle is used to develop a new clustering algorithm. Inspired by herd behavior, the clustering method is a synergistic approach using collective intelligence called Herd Clustering (HC). The novel part is laid in its first stage where data instances

  2. Analysis of the selective advantage conferred by a C-E1 fusion protein synthesized by rubella virus DI RNAs

    International Nuclear Information System (INIS)

    Claus, Claudia; Tzeng, W.-P.; Liebert, Uwe Gerd; Frey, Teryl K.

    2007-01-01

    During serial passaging of rubella virus (RUB) in cell culture, the dominant species of defective-interfering RNA (DI) generated contains an in-frame deletion between the capsid protein (C) gene and E1 glycoprotein gene resulting in production of a C-E1 fusion protein that is necessary for the maintenance of the DI [Tzeng, W.P., Frey, T.K. (2006). C-E1 fusion protein synthesized by rubella virus DI RNAs maintained during serial passage. Virology 356 198-207.]. A BHK cell line stably expressing the RUB structural proteins was established which was used to package DIs into virus particles following transfection with in vitro transcripts from DI infectious cDNA constructs. Packaging of a DI encoding an in-frame C-GFP-E1 reporter fusion protein corresponding to the C-E1 fusion protein expressed in a native DI was only marginally more efficient than packaging of a DI encoding GFP, indicating that the C-E1 fusion protein did not function by enhancing packaging. However, infection with the DI encoding the C-GFP-E1 fusion protein (in the absence of wt RUB helper virus) resulted in formation of clusters of GFP-positive cells and the percentage of GFP-positive cells in the culture following infection remained relatively constant. In contrast, a DI encoding GFP did not form GFP-positive clusters and the percentage of GFP-positive cells declined by roughly half from 2 to 4 days post-infection. Cluster formation and sustaining the percentage of infected (GFP-positive) cells required the C part of the fusion protein, including the downstream but not the upstream of two arginine clusters (both of which are associated with RNA binding and association with mitochondrial p32 protein) and the E1 part through the transmembrane sequence, but not the C-terminal cytoplasmic tail. Among a collection of mutant DI constructs, cluster formation and sustaining infected cell percentage correlated with maintenance during serial passage with wt RUB. We hypothesize that cluster formation and

  3. THE SWIFT AGN AND CLUSTER SURVEY. II. CLUSTER CONFIRMATION WITH SDSS DATA

    International Nuclear Information System (INIS)

    Griffin, Rhiannon D.; Dai, Xinyu; Kochanek, Christopher S.; Bregman, Joel N.

    2016-01-01

    We study 203 (of 442) Swift AGN and Cluster Survey extended X-ray sources located in the SDSS DR8 footprint to search for galaxy over-densities in three-dimensional space using SDSS galaxy photometric redshifts and positions near the Swift cluster candidates. We find 104 Swift clusters with a >3σ galaxy over-density. The remaining targets are potentially located at higher redshifts and require deeper optical follow-up observations for confirmation as galaxy clusters. We present a series of cluster properties including the redshift, brightest cluster galaxy (BCG) magnitude, BCG-to-X-ray center offset, optical richness, and X-ray luminosity. We also detect red sequences in ∼85% of the 104 confirmed clusters. The X-ray luminosity and optical richness for the SDSS confirmed Swift clusters are correlated and follow previously established relations. The distribution of the separations between the X-ray centroids and the most likely BCG is also consistent with expectation. We compare the observed redshift distribution of the sample with a theoretical model, and find that our sample is complete for z ≲ 0.3 and is still 80% complete up to z ≃ 0.4, consistent with the SDSS survey depth. These analysis results suggest that our Swift cluster selection algorithm has yielded a statistically well-defined cluster sample for further study of cluster evolution and cosmology. We also match our SDSS confirmed Swift clusters to existing cluster catalogs, and find 42, 23, and 1 matches in optical, X-ray, and Sunyaev–Zel’dovich catalogs, respectively, and so the majority of these clusters are new detections

  4. Clustering and cellular distribution characteristics of virus particles of Tomato spotted wilt virus and Tomato zonate spot virus in different plant hosts.

    Science.gov (United States)

    Zhang, Zhongkai; Zheng, Kuanyu; Dong, Jiahong; Fang, Qi; Hong, Jian; Wang, Xifeng

    2016-01-19

    Tomato spotted wilt virus (TSWV) and Tomato zonate spot virus (TZSV) are the two dominant species of thrip-transmitted tospoviruses, cause significant losses in crop yield in Yunnan and its neighboring provinces in China. TSWV and TZSV belong to different serogroup of tospoviruses but induce similar symptoms in the same host plant species, which makes diagnostic difficult. We used different electron microscopy preparing methods to investigate clustering and cellular distribution of TSWV and TZSV in the host plant species. Negative staining of samples infected with TSWV and TZSV revealed that particles usually clustered in the vesicles, including single particle (SP), double particles clustering (DPC), triple particles clustering (TPC). In the immunogold labeling negative staining against proteins of TZSV, the antibodies against Gn protein were stained more strongly than the N protein. Ultrathin section and high pressure freeze (HPF)-electron microscopy preparations revealed that TSWV particles were distributed in the cisternae of endoplasmic reticulum (ER), filamentous inclusions (FI) and Golgi bodies in the mesophyll cells. The TSWV particles clustered as multiple particles clustering (MPC) and distributed in globular viroplasm or cisternae of ER in the top leaf cell. TZSV particles were distributed more abundantly in the swollen membrane of ER in the mesophyll cell than those in the phloem parenchyma cells and were not observed in the top leaf cell. However, TZSV virions were mainly present as single particle in the cytoplasm, with few clustering as MPC. In this study, we identified TSWV and TZSV particles had the distinct cellular distribution patterns in the cytoplasm from different tissues and host plants. This is the first report of specific clustering characteristics of tospoviruses particles as well as the cellular distribution of TSWV particles in the FI and globular viroplasm where as TZSV particles inside the membrane of ER. These results indicated that

  5. Scientific Cluster Deployment and Recovery – Using puppet to simplify cluster management

    International Nuclear Information System (INIS)

    Hendrix, Val; Yao Yushu; Benjamin, Doug

    2012-01-01

    Deployment, maintenance and recovery of a scientific cluster, which has complex, specialized services, can be a time consuming task requiring the assistance of Linux system administrators, network engineers as well as domain experts. Universities and small institutions that have a part-time FTE with limited time for and knowledge of the administration of such clusters can be strained by such maintenance tasks. This current work is the result of an effort to maintain a data analysis cluster (DAC) with minimal effort by a local system administrator. The realized benefit is the scientist, who is the local system administrator, is able to focus on the data analysis instead of the intricacies of managing a cluster. Our work provides a cluster deployment and recovery process (CDRP) based on the puppet configuration engine allowing a part-time FTE to easily deploy and recover entire clusters with minimal effort. Puppet is a configuration management system (CMS) used widely in computing centers for the automatic management of resources. Domain experts use Puppet's declarative language to define reusable modules for service configuration and deployment. Our CDRP has three actors: domain experts, a cluster designer and a cluster manager. The domain experts first write the puppet modules for the cluster services. A cluster designer would then define a cluster. This includes the creation of cluster roles, mapping the services to those roles and determining the relationships between the services. Finally, a cluster manager would acquire the resources (machines, networking), enter the cluster input parameters (hostnames, IP addresses) and automatically generate deployment scripts used by puppet to configure it to act as a designated role. In the event of a machine failure, the originally generated deployment scripts along with puppet can be used to easily reconfigure a new machine. The cluster definition produced in our CDRP is an integral part of automating cluster deployment

  6. Cluster consensus in discrete-time networks of multiagents with inter-cluster nonidentical inputs.

    Science.gov (United States)

    Han, Yujuan; Lu, Wenlian; Chen, Tianping

    2013-04-01

    In this paper, cluster consensus of multiagent systems is studied via inter-cluster nonidentical inputs. Here, we consider general graph topologies, which might be time-varying. The cluster consensus is defined by two aspects: intracluster synchronization, the state at which differences between each pair of agents in the same cluster converge to zero, and inter-cluster separation, the state at which agents in different clusters are separated. For intra-cluster synchronization, the concepts and theories of consensus, including the spanning trees, scramblingness, infinite stochastic matrix product, and Hajnal inequality, are extended. As a result, it is proved that if the graph has cluster spanning trees and all vertices self-linked, then the static linear system can realize intra-cluster synchronization. For the time-varying coupling cases, it is proved that if there exists T > 0 such that the union graph across any T-length time interval has cluster spanning trees and all graphs has all vertices self-linked, then the time-varying linear system can also realize intra-cluster synchronization. Under the assumption of common inter-cluster influence, a sort of inter-cluster nonidentical inputs are utilized to realize inter-cluster separation, such that each agent in the same cluster receives the same inputs and agents in different clusters have different inputs. In addition, the boundedness of the infinite sum of the inputs can guarantee the boundedness of the trajectory. As an application, we employ a modified non-Bayesian social learning model to illustrate the effectiveness of our results.

  7. Relative efficiency and sample size for cluster randomized trials with variable cluster sizes.

    Science.gov (United States)

    You, Zhiying; Williams, O Dale; Aban, Inmaculada; Kabagambe, Edmond Kato; Tiwari, Hemant K; Cutter, Gary

    2011-02-01

    The statistical power of cluster randomized trials depends on two sample size components, the number of clusters per group and the numbers of individuals within clusters (cluster size). Variable cluster sizes are common and this variation alone may have significant impact on study power. Previous approaches have taken this into account by either adjusting total sample size using a designated design effect or adjusting the number of clusters according to an assessment of the relative efficiency of unequal versus equal cluster sizes. This article defines a relative efficiency of unequal versus equal cluster sizes using noncentrality parameters, investigates properties of this measure, and proposes an approach for adjusting the required sample size accordingly. We focus on comparing two groups with normally distributed outcomes using t-test, and use the noncentrality parameter to define the relative efficiency of unequal versus equal cluster sizes and show that statistical power depends only on this parameter for a given number of clusters. We calculate the sample size required for an unequal cluster sizes trial to have the same power as one with equal cluster sizes. Relative efficiency based on the noncentrality parameter is straightforward to calculate and easy to interpret. It connects the required mean cluster size directly to the required sample size with equal cluster sizes. Consequently, our approach first determines the sample size requirements with equal cluster sizes for a pre-specified study power and then calculates the required mean cluster size while keeping the number of clusters unchanged. Our approach allows adjustment in mean cluster size alone or simultaneous adjustment in mean cluster size and number of clusters, and is a flexible alternative to and a useful complement to existing methods. Comparison indicated that we have defined a relative efficiency that is greater than the relative efficiency in the literature under some conditions. Our measure

  8. Cooperative protein structural dynamics of homodimeric hemoglobin linked to water cluster at subunit interface revealed by time-resolved X-ray solution scattering

    Directory of Open Access Journals (Sweden)

    Jong Goo Kim

    2016-03-01

    Full Text Available Homodimeric hemoglobin (HbI consisting of two subunits is a good model system for investigating the allosteric structural transition as it exhibits cooperativity in ligand binding. In this work, as an effort to extend our previous study on wild-type and F97Y mutant HbI, we investigate structural dynamics of a mutant HbI in solution to examine the role of well-organized interfacial water cluster, which has been known to mediate intersubunit communication in HbI. In the T72V mutant of HbI, the interfacial water cluster in the T state is perturbed due to the lack of Thr72, resulting in two less interfacial water molecules than in wild-type HbI. By performing picosecond time-resolved X-ray solution scattering experiment and kinetic analysis on the T72V mutant, we identify three structurally distinct intermediates (I1, I2, and I3 and show that the kinetics of the T72V mutant are well described by the same kinetic model used for wild-type and F97Y HbI, which involves biphasic kinetics, geminate recombination, and bimolecular CO recombination. The optimized kinetic model shows that the R-T transition and bimolecular CO recombination are faster in the T72V mutant than in the wild type. From structural analysis using species-associated difference scattering curves for the intermediates, we find that the T-like deoxy I3 intermediate in solution has a different structure from deoxy HbI in crystal. In addition, we extract detailed structural parameters of the intermediates such as E-F distance, intersubunit rotation angle, and heme-heme distance. By comparing the structures of protein intermediates in wild-type HbI and the T72V mutant, we reveal how the perturbation in the interfacial water cluster affects the kinetics and structures of reaction intermediates of HbI.

  9. The function of communities in protein interaction networks at multiple scales

    Directory of Open Access Journals (Sweden)

    Jones Nick S

    2010-07-01

    Full Text Available Abstract Background If biology is modular then clusters, or communities, of proteins derived using only protein interaction network structure should define protein modules with similar biological roles. We investigate the link between biological modules and network communities in yeast and its relationship to the scale at which we probe the network. Results Our results demonstrate that the functional homogeneity of communities depends on the scale selected, and that almost all proteins lie in a functionally homogeneous community at some scale. We judge functional homogeneity using a novel test and three independent characterizations of protein function, and find a high degree of overlap between these measures. We show that a high mean clustering coefficient of a community can be used to identify those that are functionally homogeneous. By tracing the community membership of a protein through multiple scales we demonstrate how our approach could be useful to biologists focusing on a particular protein. Conclusions We show that there is no one scale of interest in the community structure of the yeast protein interaction network, but we can identify the range of resolution parameters that yield the most functionally coherent communities, and predict which communities are most likely to be functionally homogeneous.

  10. Correlation between protein sequence similarity and x-ray diffraction quality in the protein data bank.

    Science.gov (United States)

    Lu, Hui-Meng; Yin, Da-Chuan; Ye, Ya-Jing; Luo, Hui-Min; Geng, Li-Qiang; Li, Hai-Sheng; Guo, Wei-Hong; Shang, Peng

    2009-01-01

    As the most widely utilized technique to determine the 3-dimensional structure of protein molecules, X-ray crystallography can provide structure of the highest resolution among the developed techniques. The resolution obtained via X-ray crystallography is known to be influenced by many factors, such as the crystal quality, diffraction techniques, and X-ray sources, etc. In this paper, the authors found that the protein sequence could also be one of the factors. We extracted information of the resolution and the sequence of proteins from the Protein Data Bank (PDB), classified the proteins into different clusters according to the sequence similarity, and statistically analyzed the relationship between the sequence similarity and the best resolution obtained. The results showed that there was a pronounced correlation between the sequence similarity and the obtained resolution. These results indicate that protein structure itself is one variable that may affect resolution when X-ray crystallography is used.

  11. GALAXY CLUSTERS AT HIGH REDSHIFT AND EVOLUTION OF BRIGHTEST CLUSTER GALAXIES

    International Nuclear Information System (INIS)

    Wen, Z. L.; Han, J. L.

    2011-01-01

    Identification of high-redshift clusters is important for studies of cosmology and cluster evolution. Using photometric redshifts of galaxies, we identify 631 clusters from the Canada-France-Hawaii Telescope (CFHT) wide field, 202 clusters from the CFHT deep field, 187 clusters from the Cosmic Evolution Survey (COSMOS) field, and 737 clusters from the Spitzer Wide-area InfraRed Extragalactic Survey (SWIRE) field. The redshifts of these clusters are in the range 0.1 ∼ + - m 3.6 μ m colors of the BCGs are consistent with a stellar population synthesis model in which the BCGs are formed at redshift z f ≥ 2 and evolved passively. The g' - z' and B - m 3.6μm colors of the BCGs at redshifts z > 0.8 are systematically bluer than the passive evolution model for galaxies formed at z f ∼ 2, indicating star formation in high-redshift BCGs.

  12. Unequal cluster sizes in stepped-wedge cluster randomised trials: a systematic review.

    Science.gov (United States)

    Kristunas, Caroline; Morris, Tom; Gray, Laura

    2017-11-15

    To investigate the extent to which cluster sizes vary in stepped-wedge cluster randomised trials (SW-CRT) and whether any variability is accounted for during the sample size calculation and analysis of these trials. Any, not limited to healthcare settings. Any taking part in an SW-CRT published up to March 2016. The primary outcome is the variability in cluster sizes, measured by the coefficient of variation (CV) in cluster size. Secondary outcomes include the difference between the cluster sizes assumed during the sample size calculation and those observed during the trial, any reported variability in cluster sizes and whether the methods of sample size calculation and methods of analysis accounted for any variability in cluster sizes. Of the 101 included SW-CRTs, 48% mentioned that the included clusters were known to vary in size, yet only 13% of these accounted for this during the calculation of the sample size. However, 69% of the trials did use a method of analysis appropriate for when clusters vary in size. Full trial reports were available for 53 trials. The CV was calculated for 23 of these: the median CV was 0.41 (IQR: 0.22-0.52). Actual cluster sizes could be compared with those assumed during the sample size calculation for 14 (26%) of the trial reports; the cluster sizes were between 29% and 480% of that which had been assumed. Cluster sizes often vary in SW-CRTs. Reporting of SW-CRTs also remains suboptimal. The effect of unequal cluster sizes on the statistical power of SW-CRTs needs further exploration and methods appropriate to studies with unequal cluster sizes need to be employed. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2017. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  13. Exploring the binding sites and binding mechanism for hydrotrope encapsulated griseofulvin drug on γ-tubulin protein.

    Directory of Open Access Journals (Sweden)

    Shubhadip Das

    Full Text Available The protein γ-tubulin plays an important role in centrosomal clustering and this makes it an attractive therapeutic target for treating cancers. Griseofulvin, an antifungal drug, has recently been used to inhibit proliferation of various types of cancer cells. It can also affect the microtubule dynamics by targeting the γ-tubulin protein. So far, the binding pockets of γ-tubulin protein are not properly identified and the exact mechanism by which the drug binds to it is an area of intense speculation and research. The aim of the present study is to investigate the binding mechanism and binding affinity of griseofulvin on γ-tubulin protein using classical molecular dynamics simulations. Since the drug griseofulvin is sparingly soluble in water, here we also present a promising approach for formulating and achieving delivery of hydrophobic griseofulvin drug via hydrotrope sodium cumene sulfonate (SCS cluster. We observe that the binding pockets of γ-tubulin protein are mainly formed by the H8, H9 helices and S7, S8, S14 strands and the hydrophobic interactions between the drug and γ-tubulin protein drive the binding process. The release of the drug griseofulvin from the SCS cluster is confirmed by the coordination number analysis. We also find hydrotrope-induced alteration of the binding sites of γ-tubulin protein and the weakening of the drug-protein interactions.

  14. Heterologous expression of the Halothiobacillus neapolitanus carboxysomal gene cluster in Corynebacterium glutamicum.

    Science.gov (United States)

    Baumgart, Meike; Huber, Isabel; Abdollahzadeh, Iman; Gensch, Thomas; Frunzke, Julia

    2017-09-20

    Compartmentalization represents a ubiquitous principle used by living organisms to optimize metabolic flux and to avoid detrimental interactions within the cytoplasm. Proteinaceous bacterial microcompartments (BMCs) have therefore created strong interest for the encapsulation of heterologous pathways in microbial model organisms. However, attempts were so far mostly restricted to Escherichia coli. Here, we introduced the carboxysomal gene cluster of Halothiobacillus neapolitanus into the biotechnological platform species Corynebacterium gluta-micum. Transmission electron microscopy, fluorescence microscopy and single molecule localization microscopy suggested the formation of BMC-like structures in cells expressing the complete carboxysome operon or only the shell proteins. Purified carboxysomes consisted of the expected protein components as verified by mass spectrometry. Enzymatic assays revealed the functional production of RuBisCO in C. glutamicum both in the presence and absence of carboxysomal shell proteins. Furthermore, we could show that eYFP is targeted to the carboxysomes by fusion to the large RuBisCO subunit. Overall, this study represents the first transfer of an α-carboxysomal gene cluster into a Gram-positive model species supporting the modularity and orthogonality of these microcompartments, but also identified important challenges which need to be addressed on the way towards biotechnological application. Copyright © 2017 Elsevier B.V. All rights reserved.

  15. Cluster synchronization induced by one-node clusters in networks with asymmetric negative couplings

    International Nuclear Information System (INIS)

    Zhang, Jianbao; Ma, Zhongjun; Zhang, Gang

    2013-01-01

    This paper deals with the problem of cluster synchronization in networks with asymmetric negative couplings. By decomposing the coupling matrix into three matrices, and employing Lyapunov function method, sufficient conditions are derived for cluster synchronization. The conditions show that the couplings of multi-node clusters from one-node clusters have beneficial effects on cluster synchronization. Based on the effects of the one-node clusters, an effective and universal control scheme is put forward for the first time. The obtained results may help us better understand the relation between cluster synchronization and cluster structures of the networks. The validity of the control scheme is confirmed through two numerical simulations, in a network with no cluster structure and in a scale-free network

  16. Cluster synchronization induced by one-node clusters in networks with asymmetric negative couplings

    Science.gov (United States)

    Zhang, Jianbao; Ma, Zhongjun; Zhang, Gang

    2013-12-01

    This paper deals with the problem of cluster synchronization in networks with asymmetric negative couplings. By decomposing the coupling matrix into three matrices, and employing Lyapunov function method, sufficient conditions are derived for cluster synchronization. The conditions show that the couplings of multi-node clusters from one-node clusters have beneficial effects on cluster synchronization. Based on the effects of the one-node clusters, an effective and universal control scheme is put forward for the first time. The obtained results may help us better understand the relation between cluster synchronization and cluster structures of the networks. The validity of the control scheme is confirmed through two numerical simulations, in a network with no cluster structure and in a scale-free network.

  17. Semi-supervised clustering methods.

    Science.gov (United States)

    Bair, Eric

    2013-01-01

    Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning that there is no outcome variable nor is anything known about the relationship between the observations in the data set. In many situations, however, information about the clusters is available in addition to the values of the features. For example, the cluster labels of some observations may be known, or certain observations may be known to belong to the same cluster. In other cases, one may wish to identify clusters that are associated with a particular outcome variable. This review describes several clustering algorithms (known as "semi-supervised clustering" methods) that can be applied in these situations. The majority of these methods are modifications of the popular k-means clustering method, and several of them will be described in detail. A brief description of some other semi-supervised clustering algorithms is also provided.

  18. Redefining the Breast Cancer Exosome Proteome by Tandem Mass Tag Quantitative Proteomics and Multivariate Cluster Analysis.

    Science.gov (United States)

    Clark, David J; Fondrie, William E; Liao, Zhongping; Hanson, Phyllis I; Fulton, Amy; Mao, Li; Yang, Austin J

    2015-10-20

    Exosomes are microvesicles of endocytic origin constitutively released by multiple cell types into the extracellular environment. With evidence that exosomes can be detected in the blood of patients with various malignancies, the development of a platform that uses exosomes as a diagnostic tool has been proposed. However, it has been difficult to truly define the exosome proteome due to the challenge of discerning contaminant proteins that may be identified via mass spectrometry using various exosome enrichment strategies. To better define the exosome proteome in breast cancer, we incorporated a combination of Tandem-Mass-Tag (TMT) quantitative proteomics approach and Support Vector Machine (SVM) cluster analysis of three conditioned media derived fractions corresponding to a 10 000g cellular debris pellet, a 100 000g crude exosome pellet, and an Optiprep enriched exosome pellet. The quantitative analysis identified 2 179 proteins in all three fractions, with known exosomal cargo proteins displaying at least a 2-fold enrichment in the exosome fraction based on the TMT protein ratios. Employing SVM cluster analysis allowed for the classification 251 proteins as "true" exosomal cargo proteins. This study provides a robust and vigorous framework for the future development of using exosomes as a potential multiprotein marker phenotyping tool that could be useful in breast cancer diagnosis and monitoring disease progression.

  19. Establishing homology between mitochondrial calcium uniporters, prokaryotic magnesium channels and chlamydial IncA proteins.

    Science.gov (United States)

    Lee, Andre; Vastermark, Ake; Saier, Milton H

    2014-08-01

    Mitochondrial calcium uniporters (MCUs) (TC no. 1.A.77) are oligomeric channel proteins found in the mitochondrial inner membrane. MCUs have two well-conserved transmembrane segments (TMSs), connected by a linker, similar to bacterial MCU homologues. These proteins and chlamydial IncA proteins (of unknown function; TC no. 9.B.159) are homologous to prokaryotic Mg(2+) transporters, AtpI and AtpZ, based on comparison scores of up to 14.5 sds. A phylogenetic tree containing all of these proteins showed that the AtpZ proteins cluster coherently as a subset within the large and diverse AtpI cluster, which branches separately from the MCUs and IncAs, both of which cluster coherently. The MCUs and AtpZs share the same two TMS topology, but the AtpIs have four TMSs, and IncAs can have either two (most frequent) or four (less frequent) TMSs. Binary alignments, comparison scores and motif analyses showed that TMSs 1 and 2 align with TMSs 3 and 4 of the AtpIs, suggesting that the four TMS AtpI proteins arose via an intragenic duplication event. These findings establish an evolutionary link interconnecting eukaryotic and prokaryotic Ca(2+) and Mg(2+) transporters with chlamydial IncAs, and lead us to suggest that all members of the MCU superfamily, including IncAs, function as divalent cation channels. © 2014 The Authors.

  20. Acetylcholine receptor (AChR) clustering is regulated both by glycogen synthase kinase 3β (GSK3β)-dependent phosphorylation and the level of CLIP-associated protein 2 (CLASP2) mediating the capture of microtubule plus-ends.

    Science.gov (United States)

    Basu, Sreya; Sladecek, Stefan; Pemble, Hayley; Wittmann, Torsten; Slotman, Johan A; van Cappellen, Wiggert; Brenner, Hans-Rudolf; Galjart, Niels

    2014-10-31

    The postsynaptic apparatus of the neuromuscular junction (NMJ) traps and anchors acetylcholine receptors (AChRs) at high density at the synapse. We have previously shown that microtubule (MT) capture by CLASP2, a MT plus-end-tracking protein (+TIP), increases the size and receptor density of AChR clusters at the NMJ through the delivery of AChRs and that this is regulated by a pathway involving neuronal agrin and several postsynaptic kinases, including GSK3. Phosphorylation by GSK3 has been shown to cause CLASP2 dissociation from MT ends, and nine potential phosphorylation sites for GSK3 have been mapped on CLASP2. How CLASP2 phosphorylation regulates MT capture at the NMJ and how this controls the size of AChR clusters are not yet understood. To examine this, we used myotubes cultured on agrin patches that induce AChR clustering in a two-dimensional manner. We show that expression of a CLASP2 mutant, in which the nine GSK3 target serines are mutated to alanine (CLASP2-9XS/9XA) and are resistant to GSK3β-dependent phosphorylation, promotes MT capture at clusters and increases AChR cluster size, compared with myotubes that express similar levels of wild type CLASP2 or that are noninfected. Conversely, myotubes expressing a phosphomimetic form of CLASP2 (CLASP2-8XS/D) show enrichment of immobile mutant CLASP2 in clusters, but MT capture and AChR cluster size are reduced. Taken together, our data suggest that both GSK3β-dependent phosphorylation and the level of CLASP2 play a role in the maintenance of AChR cluster size through the regulated capture and release of MT plus-ends. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.

  1. Comparing clustering models in bank customers: Based on Fuzzy relational clustering approach

    Directory of Open Access Journals (Sweden)

    Ayad Hendalianpour

    2016-11-01

    Full Text Available Clustering is absolutely useful information to explore data structures and has been employed in many places. It organizes a set of objects into similar groups called clusters, and the objects within one cluster are both highly similar and dissimilar with the objects in other clusters. The K-mean, C-mean, Fuzzy C-mean and Kernel K-mean algorithms are the most popular clustering algorithms for their easy implementation and fast work, but in some cases we cannot use these algorithms. Regarding this, in this paper, a hybrid model for customer clustering is presented that is applicable in five banks of Fars Province, Shiraz, Iran. In this way, the fuzzy relation among customers is defined by using their features described in linguistic and quantitative variables. As follows, the customers of banks are grouped according to K-mean, C-mean, Fuzzy C-mean and Kernel K-mean algorithms and the proposed Fuzzy Relation Clustering (FRC algorithm. The aim of this paper is to show how to choose the best clustering algorithms based on density-based clustering and present a new clustering algorithm for both crisp and fuzzy variables. Finally, we apply the proposed approach to five datasets of customer's segmentation in banks. The result of the FCR shows the accuracy and high performance of FRC compared other clustering methods.

  2. Management of cluster headache

    DEFF Research Database (Denmark)

    Tfelt-Hansen, Peer C; Jensen, Rigmor H

    2012-01-01

    The prevalence of cluster headache is 0.1% and cluster headache is often not diagnosed or misdiagnosed as migraine or sinusitis. In cluster headache there is often a considerable diagnostic delay - an average of 7 years in a population-based survey. Cluster headache is characterized by very severe...... or severe orbital or periorbital pain with a duration of 15-180 minutes. The cluster headache attacks are accompanied by characteristic associated unilateral symptoms such as tearing, nasal congestion and/or rhinorrhoea, eyelid oedema, miosis and/or ptosis. In addition, there is a sense of restlessness...... and agitation. Patients may have up to eight attacks per day. Episodic cluster headache (ECH) occurs in clusters of weeks to months duration, whereas chronic cluster headache (CCH) attacks occur for more than 1 year without remissions. Management of cluster headache is divided into acute attack treatment...

  3. Histone and Ribosomal RNA Repetitive Gene Clusters of the Boll Weevil are Linked in a Tandem Array

    Science.gov (United States)

    Histones are the major protein component of chromatin structure. The histone family is made up of a quintet of proteins, four core histones (H2A, H2B, H3 & H4) and the linker histones (H1). Spacers are found between the coding regions. Among insects this quintet of genes is usually clustered and ...

  4. Dual localized AtHscB involved in iron sulfur protein biogenesis in Arabidopsis.

    Directory of Open Access Journals (Sweden)

    Xiang Ming Xu

    2009-10-01

    Full Text Available Iron-sulfur clusters are ubiquitous structures which act as prosthetic groups for numerous proteins involved in several fundamental biological processes including respiration and photosynthesis. Although simple in structure both the assembly and insertion of clusters into apoproteins requires complex biochemical pathways involving a diverse set of proteins. In yeast, the J-type chaperone Jac1 plays a key role in the biogenesis of iron sulfur clusters in mitochondria.In this study we demonstrate that AtHscB from Arabidopsis can rescue the Jac1 yeast knockout mutant suggesting a role for AtHscB in iron sulfur protein biogenesis in plants. In contrast to mitochondrial Jac1, AtHscB localizes to both mitochondria and the cytosol. AtHscB interacts with AtIscU1, an Isu-like scaffold protein involved in iron-sulfur cluster biogenesis, and through this interaction AtIscU1 is most probably retained in the cytosol. The chaperone AtHscA can functionally complement the yeast Ssq1knockout mutant and its ATPase activity is enhanced by AtHscB and AtIscU1. Interestingly, AtHscA is also localized in both mitochondria and the cytosol. Furthermore, AtHscB is highly expressed in anthers and trichomes and an AtHscB T-DNA insertion mutant shows reduced seed set, a waxless phenotype and inappropriate trichome development as well as dramatically reduced activities of the iron-sulfur enzymes aconitase and succinate dehydrogenase.Our data suggest that AtHscB together with AtHscA and AtIscU1 plays an important role in the biogenesis of iron-sulfur proteins in both mitochondria and the cytosol.

  5. A scalable and accurate method for classifying protein-ligand binding geometries using a MapReduce approach.

    Science.gov (United States)

    Estrada, T; Zhang, B; Cicotti, P; Armen, R S; Taufer, M

    2012-07-01

    We present a scalable and accurate method for classifying protein-ligand binding geometries in molecular docking. Our method is a three-step process: the first step encodes the geometry of a three-dimensional (3D) ligand conformation into a single 3D point in the space; the second step builds an octree by assigning an octant identifier to every single point in the space under consideration; and the third step performs an octree-based clustering on the reduced conformation space and identifies the most dense octant. We adapt our method for MapReduce and implement it in Hadoop. The load-balancing, fault-tolerance, and scalability in MapReduce allow screening of very large conformation spaces not approachable with traditional clustering methods. We analyze results for docking trials for 23 protein-ligand complexes for HIV protease, 21 protein-ligand complexes for Trypsin, and 12 protein-ligand complexes for P38alpha kinase. We also analyze cross docking trials for 24 ligands, each docking into 24 protein conformations of the HIV protease, and receptor ensemble docking trials for 24 ligands, each docking in a pool of HIV protease receptors. Our method demonstrates significant improvement over energy-only scoring for the accurate identification of native ligand geometries in all these docking assessments. The advantages of our clustering approach make it attractive for complex applications in real-world drug design efforts. We demonstrate that our method is particularly useful for clustering docking results using a minimal ensemble of representative protein conformational states (receptor ensemble docking), which is now a common strategy to address protein flexibility in molecular docking. Copyright © 2012 Elsevier Ltd. All rights reserved.

  6. Cluster Dynamics: Laying the Foundation for Tailoring the Design of Cluster ASSE

    Science.gov (United States)

    2016-02-25

    AFRL-AFOSR-VA-TR-2016-0081 CLUSTER DYNAMICS: LAYING THE FOUNDATION FOR TAILORING THE DESIGN OF CLUSTER ASSE Albert Castleman PENNSYLVANIA STATE...15-10-2015 4. TITLE AND SUBTITLE CLUSTER DYNAMICS: LAYING THE FOUNDATION FOR TAILORING THE DESIGN OF CLUSTER ASSEMBLED NANOSCALE MATERIALS 5a... clusters as the building blocks of new materials with tailored properties that are beneficial to the AFOSR. Our continuing program is composed of two

  7. Analysis of initial changes in the proteins of soybean root tip under flooding stress using gel-free and gel-based proteomic techniques.

    Science.gov (United States)

    Yin, Xiaojian; Sakata, Katsumi; Nanjo, Yohei; Komatsu, Setsuko

    2014-06-25

    Flooding has a severe negative effect on soybean cultivation in the early stages of growth. To obtain a better understanding of the response mechanisms of soybean to flooding stress, initial changes in root tip proteins under flooding were analyzed using two proteomic techniques. Two-day-old soybeans were treated with flooding for 3, 6, 12, and 24h. The weight of soybeans increased during the first 3h of flooding, but root elongation was not observed. Using gel-based and gel-free proteomic techniques, 115 proteins were identified in root tips, of which 9 proteins were commonly detected by both methods. The 71 proteins identified by the gel-free proteomics were analyzed by a hierarchical clustering method based on induction levels during the flooding, and the proteins were divided into 5 clusters. Additional interaction analysis of the proteins revealed that ten proteins belonging to cluster I formed the center of a protein interaction network. mRNA expression analysis of these ten proteins showed that citrate lyase and heat shock protein 70 were down-regulated, whereas calreticulin was up-regulated in initial phase of flooding. These results suggest that flooding stress to soybean induces calcium-related signal transduction, which might play important roles in the early responses to flooding. Flooding has a severe negative effect on soybean cultivation, particularly in the early stages of growth. To better understand the response mechanisms of soybean to the early stages of flooding stress, two proteomic techniques were used. Two-day-old soybeans were treated without or with flooding for 3, 6, 12, and 24h. The fresh weight of soybeans increased during the first 3h of flooding stress, but the growth then slowed and no root elongation was observed. Using gel-based and gel-free proteomic techniques, 115 proteins were identified in root tips, of which 9 proteins were commonly detected by both methods. The 71 proteins identified by the gel-free proteomics were analyzed

  8. Determining characteristic principal clusters in the “cluster-plus-glue-atom” model

    International Nuclear Information System (INIS)

    Du, Jinglian; Wen, Bin; 2NeT Lab, Wilfrid Laurier University, Waterloo, 75 University Ave West, Ontario N2L 3C5 (Canada))" data-affiliation=" (M2NeT Lab, Wilfrid Laurier University, Waterloo, 75 University Ave West, Ontario N2L 3C5 (Canada))" >Melnik, Roderick; Kawazoe, Yoshiyuki

    2014-01-01

    The “cluster-plus-glue-atom” model can easily describe the structure of complex metallic alloy phases. However, the biggest obstacle limiting the application of this model is that it is difficult to determine the characteristic principal cluster. In the case when interatomic force constants (IFCs) inside the cluster lead to stronger interaction than the interaction between the clusters, a new rule for determining the characteristic principal cluster in the “cluster-plus-glue-atom” model has been proposed on the basis of IFCs. To verify this new rule, the alloy phases in Cu–Zr and Al–Ni–Zr systems have been tested, and our results indicate that the present new rule for determining characteristic principal clusters is effective and reliable

  9. OPEN CLUSTERS AS PROBES OF THE GALACTIC MAGNETIC FIELD. I. CLUSTER PROPERTIES

    Energy Technology Data Exchange (ETDEWEB)

    Hoq, Sadia; Clemens, D. P., E-mail: shoq@bu.edu, E-mail: clemens@bu.edu [Institute for Astrophysical Research, 725 Commonwealth Avenue, Boston University, Boston, MA 02215 (United States)

    2015-10-15

    Stars in open clusters are powerful probes of the intervening Galactic magnetic field via background starlight polarimetry because they provide constraints on the magnetic field distances. We use 2MASS photometric data for a sample of 31 clusters in the outer Galaxy for which near-IR polarimetric data were obtained to determine the cluster distances, ages, and reddenings via fitting theoretical isochrones to cluster color–magnitude diagrams. The fitting approach uses an objective χ{sup 2} minimization technique to derive the cluster properties and their uncertainties. We found the ages, distances, and reddenings for 24 of the clusters, and the distances and reddenings for 6 additional clusters that were either sparse or faint in the near-IR. The derived ranges of log(age), distance, and E(B−V) were 7.25–9.63, ∼670–6160 pc, and 0.02–1.46 mag, respectively. The distance uncertainties ranged from ∼8% to 20%. The derived parameters were compared to previous studies, and most cluster parameters agree within our uncertainties. To test the accuracy of the fitting technique, synthetic clusters with 50, 100, or 200 cluster members and a wide range of ages were fit. These tests recovered the input parameters within their uncertainties for more than 90% of the individual synthetic cluster parameters. These results indicate that the fitting technique likely provides reliable estimates of cluster properties. The distances derived will be used in an upcoming study of the Galactic magnetic field in the outer Galaxy.

  10. Identification of new candidate drugs for lung cancer using chemical-chemical interactions, chemical-protein interactions and a K-means clustering algorithm.

    Science.gov (United States)

    Lu, Jing; Chen, Lei; Yin, Jun; Huang, Tao; Bi, Yi; Kong, Xiangyin; Zheng, Mingyue; Cai, Yu-Dong

    2016-01-01

    Lung cancer, characterized by uncontrolled cell growth in the lung tissue, is the leading cause of global cancer deaths. Until now, effective treatment of this disease is limited. Many synthetic compounds have emerged with the advancement of combinatorial chemistry. Identification of effective lung cancer candidate drug compounds among them is a great challenge. Thus, it is necessary to build effective computational methods that can assist us in selecting for potential lung cancer drug compounds. In this study, a computational method was proposed to tackle this problem. The chemical-chemical interactions and chemical-protein interactions were utilized to select candidate drug compounds that have close associations with approved lung cancer drugs and lung cancer-related genes. A permutation test and K-means clustering algorithm were employed to exclude candidate drugs with low possibilities to treat lung cancer. The final analysis suggests that the remaining drug compounds have potential anti-lung cancer activities and most of them have structural dissimilarity with approved drugs for lung cancer.

  11. Symmetries of cluster configurations

    International Nuclear Information System (INIS)

    Kramer, P.

    1975-01-01

    A deeper understanding of clustering phenomena in nuclei must encompass at least two interrelated aspects of the subject: (A) Given a system of A nucleons with two-body interactions, what are the relevant and persistent modes of clustering involved. What is the nature of the correlated nucleon groups which form the clusters, and what is their mutual interaction. (B) Given the cluster modes and their interaction, what systematic patterns of nuclear structure and reactions emerge from it. Are there, for example, families of states which share the same ''cluster parents''. Which cluster modes are compatible or exclude each other. What quantum numbers could characterize cluster configurations. There is no doubt that we can learn a good deal from the experimentalists who have discovered many of the features relevant to aspect (B). Symmetries specific to cluster configurations which can throw some light on both aspects of clustering are discussed

  12. Atomic Insight into the Altered O6-Methylguanine-DNA Methyltransferase Protein Architecture in Gastric Cancer.

    Directory of Open Access Journals (Sweden)

    Naveed Anjum Chikan

    Full Text Available O6-methylguanine-DNA methyltransferase (MGMT is one of the major DNA repair protein that counteracts the alkalyting agent-induced DNA damage by replacing O6-methylguanine (mutagenic lesion back to guanine, eventually suppressing the mismatch errors and double strand crosslinks. Exonic alterations in the form of nucleotide polymorphism may result in altered protein structure that in turn can lead to the loss of function. In the present study, we focused on the population feared for high exposure to alkylating agents owing to their typical and specialized dietary habits. To this end, gastric cancer patients pooled out from the population were selected for the mutational screening of a specific error prone region of MGMT gene. We found that nearly 40% of the studied neoplastic samples harbored missense mutation at codon151 resulting into Serine to Isoleucine variation. This variation resulted in bringing about the structural disorder, subsequently ensuing into a major stoichiometric variance in recognition domain, substrate binding and selectivity loop of the active site of the MGMT protein, as observed under virtual microscope of molecular dynamics simulation (MDS. The atomic insight into MGMT protein by computational approach showed a significant change in the intra molecular hydrogen bond pattern, thus leading to the observed structural anomalies. To further examine the mutational implications on regulatory plugs of MGMT that holds the protein in a DNA-Binding position, a MDS based analysis was carried out on, all known physically interacting amino acids essentially clustered into groups based on their position and function. The results generated by physical-functional clustering of protein indicated that the identified mutation in the vicinity of the active site of MGMT protein causes the local and global destabilization of a protein by either eliminating the stabilizing salt bridges in cluster C3, C4, and C5 or by locally destabilizing the

  13. Open source clustering software.

    Science.gov (United States)

    de Hoon, M J L; Imoto, S; Nolan, J; Miyano, S

    2004-06-12

    We have implemented k-means clustering, hierarchical clustering and self-organizing maps in a single multipurpose open-source library of C routines, callable from other C and C++ programs. Using this library, we have created an improved version of Michael Eisen's well-known Cluster program for Windows, Mac OS X and Linux/Unix. In addition, we generated a Python and a Perl interface to the C Clustering Library, thereby combining the flexibility of a scripting language with the speed of C. The C Clustering Library and the corresponding Python C extension module Pycluster were released under the Python License, while the Perl module Algorithm::Cluster was released under the Artistic License. The GUI code Cluster 3.0 for Windows, Macintosh and Linux/Unix, as well as the corresponding command-line program, were released under the same license as the original Cluster code. The complete source code is available at http://bonsai.ims.u-tokyo.ac.jp/mdehoon/software/cluster. Alternatively, Algorithm::Cluster can be downloaded from CPAN, while Pycluster is also available as part of the Biopython distribution.

  14. Monomeric Yeast Frataxin is an Iron-Binding Protein

    International Nuclear Information System (INIS)

    Cook, J.; Bencze, K.; Jankovic, A.; Crater, A.; Busch, C.; Bradley, P.; Stemmler, A.; Spaller, M.; Stemmler, T.

    2006-01-01

    Friedreich's ataxia, an autosomal cardio- and neurodegenerative disorder that affects 1 in 50 000 humans, is caused by decreased levels of the protein frataxin. Although frataxin is nuclear-encoded, it is targeted to the mitochondrial matrix and necessary for proper regulation of cellular iron homeostasis. Frataxin is required for the cellular production of both heme and iron-sulfur (Fe-S) clusters. Monomeric frataxin binds with high affinity to ferrochelatase, the enzyme involved in iron insertion into porphyrin during heme production. Monomeric frataxin also binds to Isu, the scaffold protein required for assembly of Fe-S cluster intermediates. These processes (heme and Fe-S cluster assembly) share requirements for iron, suggesting that monomeric frataxin might function as the common iron donor. To provide a molecular basis to better understand frataxin's function, we have characterized the binding properties and metal-site structure of ferrous iron bound to monomeric yeast frataxin. Yeast frataxin is stable as an iron-loaded monomer, and the protein can bind two ferrous iron atoms with micromolar binding affinity. Frataxin amino acids affected by the presence of iron are localized within conserved acidic patches located on the surfaces of both helix-1 and strand-1. Under anaerobic conditions, bound metal is stable in the high-spin ferrous state. The metal-ligand coordination geometry of both metal-binding sites is consistent with a six-coordinate iron-(oxygen/nitrogen) based ligand geometry, surely constructed in part from carboxylate and possibly imidazole side chains coming from residues within these conserved acidic patches on the protein. On the basis of our results, we have developed a model for how we believe yeast frataxin interacts with iron

  15. The Serratia gene cluster encoding biosynthesis of the red antibiotic, prodigiosin, shows species- and strain-dependent genome context variation

    DEFF Research Database (Denmark)

    Harris, Abigail K P; Williamson, Neil R; Slater, Holly

    2004-01-01

    The prodigiosin biosynthesis gene cluster (pig cluster) from two strains of Serratia (S. marcescens ATCC 274 and Serratia sp. ATCC 39006) has been cloned, sequenced and expressed in heterologous hosts. Sequence analysis of the respective pig clusters revealed 14 ORFs in S. marcescens ATCC 274...... and 15 ORFs in Serratia sp. ATCC 39006. In each Serratia species, predicted gene products showed similarity to polyketide synthases (PKSs), non-ribosomal peptide synthases (NRPSs) and the Red proteins of Streptomyces coelicolor A3(2). Comparisons between the two Serratia pig clusters and the red cluster...... from Str. coelicolor A3(2) revealed some important differences. A modified scheme for the biosynthesis of prodigiosin, based on the pathway recently suggested for the synthesis of undecylprodigiosin, is proposed. The distribution of the pig cluster within several Serratia sp. isolates is demonstrated...

  16. Electronic structure and properties of designer clusters and cluster-assemblies

    International Nuclear Information System (INIS)

    Khanna, S.N.; Jena, P.

    1995-01-01

    Using self-consistent calculations based on density functional theory, we demonstrate that electronic shell filling and close atomic packing criteria can be used to design ultra-stable clusters. Interaction of these clusters with each other and with gas atoms is found to be weak confirming their chemical inertness. A crystal composed of these inert clusters is expected to have electronic properties that are markedly different from crystals where atoms are the building blocks. The recent observation of ferromagnetism in potassium clusters assembled in zeolite cages is discussed. (orig.)

  17. Exploring biological network structure with clustered random networks

    Directory of Open Access Journals (Sweden)

    Bansal Shweta

    2009-12-01

    Full Text Available Abstract Background Complex biological systems are often modeled as networks of interacting units. Networks of biochemical interactions among proteins, epidemiological contacts among hosts, and trophic interactions in ecosystems, to name a few, have provided useful insights into the dynamical processes that shape and traverse these systems. The degrees of nodes (numbers of interactions and the extent of clustering (the tendency for a set of three nodes to be interconnected are two of many well-studied network properties that can fundamentally shape a system. Disentangling the interdependent effects of the various network properties, however, can be difficult. Simple network models can help us quantify the structure of empirical networked systems and understand the impact of various topological properties on dynamics. Results Here we develop and implement a new Markov chain simulation algorithm to generate simple, connected random graphs that have a specified degree sequence and level of clustering, but are random in all other respects. The implementation of the algorithm (ClustRNet: Clustered Random Networks provides the generation of random graphs optimized according to a local or global, and relative or absolute measure of clustering. We compare our algorithm to other similar methods and show that ours more successfully produces desired network characteristics. Finding appropriate null models is crucial in bioinformatics research, and is often difficult, particularly for biological networks. As we demonstrate, the networks generated by ClustRNet can serve as random controls when investigating the impacts of complex network features beyond the byproduct of degree and clustering in empirical networks. Conclusion ClustRNet generates ensembles of graphs of specified edge structure and clustering. These graphs allow for systematic study of the impacts of connectivity and redundancies on network function and dynamics. This process is a key step in

  18. CASP10-BCL::Fold efficiently samples topologies of large proteins.

    Science.gov (United States)

    Heinze, Sten; Putnam, Daniel K; Fischer, Axel W; Kohlmann, Tim; Weiner, Brian E; Meiler, Jens

    2015-03-01

    During CASP10 in summer 2012, we tested BCL::Fold for prediction of free modeling (FM) and template-based modeling (TBM) targets. BCL::Fold assembles the tertiary structure of a protein from predicted secondary structure elements (SSEs) omitting more flexible loop regions early on. This approach enables the sampling of conformational space for larger proteins with more complex topologies. In preparation of CASP11, we analyzed the quality of CASP10 models throughout the prediction pipeline to understand BCL::Fold's ability to sample the native topology, identify native-like models by scoring and/or clustering approaches, and our ability to add loop regions and side chains to initial SSE-only models. The standout observation is that BCL::Fold sampled topologies with a GDT_TS score > 33% for 12 of 18 and with a topology score > 0.8 for 11 of 18 test cases de novo. Despite the sampling success of BCL::Fold, significant challenges still exist in clustering and loop generation stages of the pipeline. The clustering approach employed for model selection often failed to identify the most native-like assembly of SSEs for further refinement and submission. It was also observed that for some β-strand proteins model refinement failed as β-strands were not properly aligned to form hydrogen bonds removing otherwise accurate models from the pool. Further, BCL::Fold samples frequently non-natural topologies that require loop regions to pass through the center of the protein. © 2015 Wiley Periodicals, Inc.

  19. Cluster Headache

    Science.gov (United States)

    ... a role. Unlike migraine and tension headache, cluster headache generally isn't associated with triggers, such as foods, hormonal changes or stress. Once a cluster period begins, however, drinking alcohol ...

  20. Performance Evaluation of Spectral Clustering Algorithm using Various Clustering Validity Indices

    OpenAIRE

    M. T. Somashekara; D. Manjunatha

    2014-01-01

    In spite of the popularity of spectral clustering algorithm, the evaluation procedures are still in developmental stage. In this article, we have taken benchmarking IRIS dataset for performing comparative study of twelve indices for evaluating spectral clustering algorithm. The results of the spectral clustering technique were also compared with k-mean algorithm. The validity of the indices was also verified with accuracy and (Normalized Mutual Information) NMI score. Spectral clustering algo...

  1. Substructure in clusters of galaxies

    International Nuclear Information System (INIS)

    Fitchett, M.J.

    1988-01-01

    Optical observations suggesting the existence of substructure in clusters of galaxies are examined. Models of cluster formation and methods used to detect substructure in clusters are reviewed. Consideration is given to classification schemes based on a departure of bright cluster galaxies from a spherically symmetric distribution, evidence for statistically significant substructure, and various types of substructure, including velocity, spatial, and spatial-velocity substructure. The substructure observed in the galaxy distribution in clusters is discussed, focusing on observations from general cluster samples, the Virgo cluster, the Hydra cluster, Centaurus, the Coma cluster, and the Cancer cluster. 88 refs

  2. CC_TRS: Continuous Clustering of Trajectory Stream Data Based on Micro Cluster Life

    Directory of Open Access Journals (Sweden)

    Musaab Riyadh

    2017-01-01

    Full Text Available The rapid spreading of positioning devices leads to the generation of massive spatiotemporal trajectories data. In some scenarios, spatiotemporal data are received in stream manner. Clustering of stream data is beneficial for different applications such as traffic management and weather forecasting. In this article, an algorithm for Continuous Clustering of Trajectory Stream Data Based on Micro Cluster Life is proposed. The algorithm consists of two phases. There is the online phase where temporal micro clusters are used to store summarized spatiotemporal information for each group of similar segments. The clustering task in online phase is based on temporal micro cluster lifetime instead of time window technique which divides stream data into time bins and clusters each bin separately. For offline phase, a density based clustering approach is used to generate macro clusters depending on temporal micro clusters. The evaluation of the proposed algorithm on real data sets shows the efficiency and the effectiveness of the proposed algorithm and proved it is efficient alternative to time window technique.

  3. Defining objective clusters for rabies virus sequences using affinity propagation clustering.

    Directory of Open Access Journals (Sweden)

    Susanne Fischer

    2018-01-01

    Full Text Available Rabies is caused by lyssaviruses, and is one of the oldest known zoonoses. In recent years, more than 21,000 nucleotide sequences of rabies viruses (RABV, from the prototype species rabies lyssavirus, have been deposited in public databases. Subsequent phylogenetic analyses in combination with metadata suggest geographic distributions of RABV. However, these analyses somewhat experience technical difficulties in defining verifiable criteria for cluster allocations in phylogenetic trees inviting for a more rational approach. Therefore, we applied a relatively new mathematical clustering algorythm named 'affinity propagation clustering' (AP to propose a standardized sub-species classification utilizing full-genome RABV sequences. Because AP has the advantage that it is computationally fast and works for any meaningful measure of similarity between data samples, it has previously been applied successfully in bioinformatics, for analysis of microarray and gene expression data, however, cluster analysis of sequences is still in its infancy. Existing (516 and original (46 full genome RABV sequences were used to demonstrate the application of AP for RABV clustering. On a global scale, AP proposed four clusters, i.e. New World cluster, Arctic/Arctic-like, Cosmopolitan, and Asian as previously assigned by phylogenetic studies. By combining AP with established phylogenetic analyses, it is possible to resolve phylogenetic relationships between verifiably determined clusters and sequences. This workflow will be useful in confirming cluster distributions in a uniform transparent manner, not only for RABV, but also for other comparative sequence analyses.

  4. Clustering methods for the optimization of atomic cluster structure

    Science.gov (United States)

    Bagattini, Francesco; Schoen, Fabio; Tigli, Luca

    2018-04-01

    In this paper, we propose a revised global optimization method and apply it to large scale cluster conformation problems. In the 1990s, the so-called clustering methods were considered among the most efficient general purpose global optimization techniques; however, their usage has quickly declined in recent years, mainly due to the inherent difficulties of clustering approaches in large dimensional spaces. Inspired from the machine learning literature, we redesigned clustering methods in order to deal with molecular structures in a reduced feature space. Our aim is to show that by suitably choosing a good set of geometrical features coupled with a very efficient descent method, an effective optimization tool is obtained which is capable of finding, with a very high success rate, all known putative optima for medium size clusters without any prior information, both for Lennard-Jones and Morse potentials. The main result is that, beyond being a reliable approach, the proposed method, based on the idea of starting a computationally expensive deep local search only when it seems worth doing so, is capable of saving a huge amount of searches with respect to an analogous algorithm which does not employ a clustering phase. In this paper, we are not claiming the superiority of the proposed method compared to specific, refined, state-of-the-art procedures, but rather indicating a quite straightforward way to save local searches by means of a clustering scheme working in a reduced variable space, which might prove useful when included in many modern methods.

  5. Interaction of the anaphase-promoting complex/cyclosome and proteasome protein complexes with multiubiquitin chain-binding proteins

    DEFF Research Database (Denmark)

    Seeger, Michael; Hartmann-Petersen, Rasmus; Wilkinson, Caroline R M

    2003-01-01

    Fission yeast Rhp23 and Pus1 represent two families of multiubiquitin chain-binding proteins that associate with the proteasome. We show that both proteins bind to different regions of the proteasome subunit Mts4. The binding site for Pus1 was mapped to a cluster of repetitive sequences also found...... in the proteasome subunit SpRpn2 and the anaphase-promoting complex/cyclosome (APC/C) subunit Cut4. The putative role of Pus1 as a factor involved in allocation of ubiquitinylated substrates for the proteasome is discussed....

  6. Super-resolution imaging of ESCRT-proteins at HIV-1 assembly sites.

    Directory of Open Access Journals (Sweden)

    Jens Prescher

    2015-02-01

    Full Text Available The cellular endosomal sorting complex required for transport (ESCRT machinery is involved in membrane budding processes, such as multivesicular biogenesis and cytokinesis. In HIV-infected cells, HIV-1 hijacks the ESCRT machinery to drive HIV release. Early in the HIV-1 assembly process, the ESCRT-I protein Tsg101 and the ESCRT-related protein ALIX are recruited to the assembly site. Further downstream, components such as the ESCRT-III proteins CHMP4 and CHMP2 form transient membrane associated lattices, which are involved in virus-host membrane fission. Although various geometries of ESCRT-III assemblies could be observed, the actual membrane constriction and fission mechanism is not fully understood. Fission might be driven from inside the HIV-1 budding neck by narrowing the membranes from the outside by larger lattices surrounding the neck, or from within the bud. Here, we use super-resolution fluorescence microscopy to elucidate the size and structure of the ESCRT components Tsg101, ALIX, CHMP4B and CHMP2A during HIV-1 budding below the diffraction limit. To avoid the deleterious effects of using fusion proteins attached to ESCRT components, we performed measurements on the endogenous protein or, in the case of CHMP4B, constructs modified with the small HA tag. Due to the transient nature of the ESCRT interactions, the fraction of HIV-1 assembly sites with colocalizing ESCRT complexes was low (1.5%-3.4%. All colocalizing ESCRT clusters exhibited closed, circular structures with an average size (full-width at half-maximum between 45 and 60 nm or a diameter (determined using a Ripley's L-function analysis of roughly 60 to 100 nm. The size distributions for colocalizing clusters were narrower than for non-colocalizing clusters, and significantly smaller than the HIV-1 bud. Hence, our results support a membrane scission process driven by ESCRT protein assemblies inside a confined structure, such as the bud neck, rather than by large lattices

  7. Topological properties of complex networks in protein structures

    Science.gov (United States)

    Kim, Kyungsik; Jung, Jae-Won; Min, Seungsik

    2014-03-01

    We study topological properties of networks in structural classification of proteins. We model the native-state protein structure as a network made of its constituent amino-acids and their interactions. We treat four structural classes of proteins composed predominantly of α helices and β sheets and consider several proteins from each of these classes whose sizes range from amino acids of the Protein Data Bank. Particularly, we simulate and analyze the network metrics such as the mean degree, the probability distribution of degree, the clustering coefficient, the characteristic path length, the local efficiency, and the cost. This work was supported by the KMAR and DP under Grant WISE project (153-3100-3133-302-350).

  8. The correlation functions for the clustering of galaxies and Abell clusters

    International Nuclear Information System (INIS)

    Jones, B.J.T.; Jones, J.E.; Copenhagen Univ.

    1985-01-01

    The difference in amplitudes between the galaxy-galaxy correlation function and the correlation function between Abell clusters is a consequence of two facts. Firstly, most Abell clusters with z<0.08 lie in a relatively small volume of the sampled space, and secondly, the fraction of galaxies lying in Abell clusters differs considerably inside and outside of this volume. (The Abell clusters are confined to a smaller volume of space than are the galaxies.) We discuss the implications of this interpretation of the clustering correlation functions and present a simple model showing how such a situation may arise quite naturally in standard theories for galaxy formation. (orig.)

  9. Subspace K-means clustering.

    Science.gov (United States)

    Timmerman, Marieke E; Ceulemans, Eva; De Roover, Kim; Van Leeuwen, Karla

    2013-12-01

    To achieve an insightful clustering of multivariate data, we propose subspace K-means. Its central idea is to model the centroids and cluster residuals in reduced spaces, which allows for dealing with a wide range of cluster types and yields rich interpretations of the clusters. We review the existing related clustering methods, including deterministic, stochastic, and unsupervised learning approaches. To evaluate subspace K-means, we performed a comparative simulation study, in which we manipulated the overlap of subspaces, the between-cluster variance, and the error variance. The study shows that the subspace K-means algorithm is sensitive to local minima but that the problem can be reasonably dealt with by using partitions of various cluster procedures as a starting point for the algorithm. Subspace K-means performs very well in recovering the true clustering across all conditions considered and appears to be superior to its competitor methods: K-means, reduced K-means, factorial K-means, mixtures of factor analyzers (MFA), and MCLUST. The best competitor method, MFA, showed a performance similar to that of subspace K-means in easy conditions but deteriorated in more difficult ones. Using data from a study on parental behavior, we show that subspace K-means analysis provides a rich insight into the cluster characteristics, in terms of both the relative positions of the clusters (via the centroids) and the shape of the clusters (via the within-cluster residuals).

  10. Calcium Domains around Single and Clustered IP3 Receptors and Their Modulation by Buffers

    Science.gov (United States)

    Rüdiger, S.; Nagaiah, Ch.; Warnecke, G.; Shuai, J.W.

    2010-01-01

    Abstract We study Ca2+ release through single and clustered IP3 receptor channels on the ER membrane under presence of buffer proteins. Our computational scheme couples reaction-diffusion equations and a Markovian channel model and allows our investigating the effects of buffer proteins on local calcium concentrations and channel gating. We find transient and stationary elevations of calcium concentrations around active channels and show how they determine release amplitude. Transient calcium domains occur after closing of isolated channels and constitute an important part of the channel's feedback. They cause repeated openings (bursts) and mediate increased release due to Ca2+ buffering by immobile proteins. Stationary domains occur during prolonged activity of clustered channels, where the spatial proximity of IP3Rs produces a distinct [Ca2+] scale (0.5–10 μM), which is smaller than channel pore concentrations (>100 μM) but larger than transient levels. While immobile buffer affects transient levels only, mobile buffers in general reduce both transient and stationary domains, giving rise to Ca2+ evacuation and biphasic modulation of release amplitude. Our findings explain recent experiments in oocytes and provide a general framework for the understanding of calcium signals. PMID:20655827

  11. Globular clusters and galaxy halos

    International Nuclear Information System (INIS)

    Van Den Bergh, S.

    1984-01-01

    Using semipartial correlation coefficients and bootstrap techniques, a study is made of the important features of globular clusters with respect to the total number of galaxy clusters and dependence of specific galaxy cluster on parent galaxy type, cluster radii, luminosity functions and cluster ellipticity. It is shown that the ellipticity of LMC clusters correlates significantly with cluster luminosity functions, but not with cluster age. The cluter luminosity value above which globulars are noticeably flattened may differ by a factor of about 100 from galaxy to galaxy. Both in the Galaxy and in M31 globulars with small core radii have a Gaussian distribution over luminosity, whereas clusters with large core radii do not. In the cluster systems surrounding the Galaxy, M31 and NGC 5128 the mean radii of globular clusters was found to increase with the distance from the nucleus. Central galaxies in rich clusters have much higher values for specific globular cluster frequency than do other cluster ellipticals, suggesting that such central galaxies must already have been different from normal ellipticals at the time they were formed

  12. Adaptive Scaling of Cluster Boundaries for Large-Scale Social Media Data Clustering.

    Science.gov (United States)

    Meng, Lei; Tan, Ah-Hwee; Wunsch, Donald C

    2016-12-01

    The large scale and complex nature of social media data raises the need to scale clustering techniques to big data and make them capable of automatically identifying data clusters with few empirical settings. In this paper, we present our investigation and three algorithms based on the fuzzy adaptive resonance theory (Fuzzy ART) that have linear computational complexity, use a single parameter, i.e., the vigilance parameter to identify data clusters, and are robust to modest parameter settings. The contribution of this paper lies in two aspects. First, we theoretically demonstrate how complement coding, commonly known as a normalization method, changes the clustering mechanism of Fuzzy ART, and discover the vigilance region (VR) that essentially determines how a cluster in the Fuzzy ART system recognizes similar patterns in the feature space. The VR gives an intrinsic interpretation of the clustering mechanism and limitations of Fuzzy ART. Second, we introduce the idea of allowing different clusters in the Fuzzy ART system to have different vigilance levels in order to meet the diverse nature of the pattern distribution of social media data. To this end, we propose three vigilance adaptation methods, namely, the activation maximization (AM) rule, the confliction minimization (CM) rule, and the hybrid integration (HI) rule. With an initial vigilance value, the resulting clustering algorithms, namely, the AM-ART, CM-ART, and HI-ART, can automatically adapt the vigilance values of all clusters during the learning epochs in order to produce better cluster boundaries. Experiments on four social media data sets show that AM-ART, CM-ART, and HI-ART are more robust than Fuzzy ART to the initial vigilance value, and they usually achieve better or comparable performance and much faster speed than the state-of-the-art clustering algorithms that also do not require a predefined number of clusters.

  13. The Molecular Bases of the Dual Regulation of Bacterial Iron Sulfur Cluster Biogenesis by CyaY and IscX

    Directory of Open Access Journals (Sweden)

    Salvatore Adinolfi

    2018-02-01

    Full Text Available IscX (or YfhJ is a protein of unknown function which takes part in the iron-sulfur cluster assembly machinery, a highly specialized and essential metabolic pathway. IscX binds to iron with low affinity and interacts with IscS, the desulfurase central to cluster assembly. Previous studies have suggested a competition between IscX and CyaY, the bacterial ortholog of frataxin, for the same binding surface of IscS. This competition could suggest a link between the two proteins with a functional significance. Using a hybrid approach based on nuclear magnetic resonance, small angle scattering and biochemical methods, we show here that IscX is a modulator of the inhibitory properties of CyaY: by competing for the same site on IscS, the presence of IscX rescues the rates of enzymatic cluster formation which are inhibited by CyaY. The effect is stronger at low iron concentrations, whereas it becomes negligible at high iron concentrations. These results strongly suggest the mechanism of the dual regulation of iron sulfur cluster assembly under the control of iron as the effector.

  14. Isotopic clusters

    International Nuclear Information System (INIS)

    Geraedts, J.M.P.

    1983-01-01

    Spectra of isotopically mixed clusters (dimers of SF 6 ) are calculated as well as transition frequencies. The result leads to speculations about the suitability of the laser-cluster fragmentation process for isotope separation. (Auth.)

  15. Identification of stress responsive genes by studying specific relationships between mRNA and protein abundance.

    Science.gov (United States)

    Morimoto, Shimpei; Yahara, Koji

    2018-03-01

    Protein expression is regulated by the production and degradation of mRNAs and proteins but the specifics of their relationship are controversial. Although technological advances have enabled genome-wide and time-series surveys of mRNA and protein abundance, recent studies have shown paradoxical results, with most statistical analyses being limited to linear correlation, or analysis of variance applied separately to mRNA and protein datasets. Here, using recently analyzed genome-wide time-series data, we have developed a statistical analysis framework for identifying which types of genes or biological gene groups have significant correlation between mRNA and protein abundance after accounting for potential time delays. Our framework stratifies all genes in terms of the extent of time delay, conducts gene clustering in each stratum, and performs a non-parametric statistical test of the correlation between mRNA and protein abundance in a gene cluster. Consequently, we revealed stronger correlations than previously reported between mRNA and protein abundance in two metabolic pathways. Moreover, we identified a pair of stress responsive genes ( ADC17 and KIN1 ) that showed a highly similar time series of mRNA and protein abundance. Furthermore, we confirmed robustness of the analysis framework by applying it to another genome-wide time-series data and identifying a cytoskeleton-related gene cluster (keratin 18, keratin 17, and mitotic spindle positioning) that shows similar correlation. The significant correlation and highly similar changes of mRNA and protein abundance suggests a concerted role of these genes in cellular stress response, which we consider provides an answer to the question of the specific relationships between mRNA and protein in a cell. In addition, our framework for studying the relationship between mRNAs and proteins in a cell will provide a basis for studying specific relationships between mRNA and protein abundance after accounting for potential

  16. A comparative analysis of clustering algorithms: O{sub 2} migration in truncated hemoglobin I from transition networks

    Energy Technology Data Exchange (ETDEWEB)

    Cazade, Pierre-André; Berezovska, Ganna; Meuwly, Markus, E-mail: m.meuwly@unibas.ch [Department of Chemistry, University of Basel, Klingelbergstrasse 80, CH-4056 Basel (Switzerland); Zheng, Wenwei; Clementi, Cecilia [Department of Chemistry, Rice University, 6100 Main St., Houston, Texas 77005 (United States); Prada-Gracia, Diego; Rao, Francesco [School of Soft Matter Research, Freiburg Institute for Advanced Studies, Albertstrasse 19, 79104 Freiburg im Breisgau (Germany)

    2015-01-14

    The ligand migration network for O{sub 2}–diffusion in truncated Hemoglobin N is analyzed based on three different clustering schemes. For coordinate-based clustering, the conventional k–means and the kinetics-based Markov Clustering (MCL) methods are employed, whereas the locally scaled diffusion map (LSDMap) method is a collective-variable-based approach. It is found that all three methods agree well in their geometrical definition of the most important docking site, and all experimentally known docking sites are recovered by all three methods. Also, for most of the states, their population coincides quite favourably, whereas the kinetics of and between the states differs. One of the major differences between k–means and MCL clustering on the one hand and LSDMap on the other is that the latter finds one large primary cluster containing the Xe1a, IS1, and ENT states. This is related to the fact that the motion within the state occurs on similar time scales, whereas structurally the state is found to be quite diverse. In agreement with previous explicit atomistic simulations, the Xe3 pocket is found to be a highly dynamical site which points to its potential role as a hub in the network. This is also highlighted in the fact that LSDMap cannot identify this state. First passage time distributions from MCL clusterings using a one- (ligand-position) and two-dimensional (ligand-position and protein-structure) descriptor suggest that ligand- and protein-motions are coupled. The benefits and drawbacks of the three methods are discussed in a comparative fashion and highlight that depending on the questions at hand the best-performing method for a particular data set may differ.

  17. THE DYNAMICAL STATE OF BRIGHTEST CLUSTER GALAXIES AND THE FORMATION OF CLUSTERS

    International Nuclear Information System (INIS)

    Coziol, R.; Andernach, H.; Caretta, C. A.; Alamo-MartInez, K. A.; Tago, E.

    2009-01-01

    A large sample of Abell clusters of galaxies, selected for the likely presence of a dominant galaxy, is used to study the dynamical properties of the brightest cluster members (BCMs). From visual inspection of Digitized Sky Survey images combined with redshift information we identify 1426 candidate BCMs located in 1221 different redshift components associated with 1169 different Abell clusters. This is the largest sample published so far of such galaxies. From our own morphological classification we find that ∼92% of the BCMs in our sample are early-type galaxies and 48% are of cD type. We confirm what was previously observed based on much smaller samples, namely, that a large fraction of BCMs have significant peculiar velocities. From a subsample of 452 clusters having at least 10 measured radial velocities, we estimate a median BCM peculiar velocity of 32% of their host clusters' radial velocity dispersion. This suggests that most BCMs are not at rest in the potential well of their clusters. This phenomenon is common to galaxy clusters in our sample, and not a special trait of clusters hosting cD galaxies. We show that the peculiar velocity of the BCM is independent of cluster richness and only slightly dependent on the Bautz-Morgan type. We also find a weak trend for the peculiar velocity to rise with the cluster velocity dispersion. The strongest dependence is with the morphological type of the BCM: cD galaxies tend to have lower relative peculiar velocities than elliptical galaxies. This result points to a connection between the formation of the BCMs and that of their clusters. Our data are qualitatively consistent with the merging-groups scenario, where BCMs in clusters formed first in smaller subsystems comparable to compact groups of galaxies. In this scenario, clusters would have formed recently from the mergers of many such groups and would still be in a dynamically unrelaxed state.

  18. Semi-supervised clustering methods

    Science.gov (United States)

    Bair, Eric

    2013-01-01

    Cluster analysis methods seek to partition a data set into homogeneous subgroups. It is useful in a wide variety of applications, including document processing and modern genetics. Conventional clustering methods are unsupervised, meaning that there is no outcome variable nor is anything known about the relationship between the observations in the data set. In many situations, however, information about the clusters is available in addition to the values of the features. For example, the cluster labels of some observations may be known, or certain observations may be known to belong to the same cluster. In other cases, one may wish to identify clusters that are associated with a particular outcome variable. This review describes several clustering algorithms (known as “semi-supervised clustering” methods) that can be applied in these situations. The majority of these methods are modifications of the popular k-means clustering method, and several of them will be described in detail. A brief description of some other semi-supervised clustering algorithms is also provided. PMID:24729830

  19. Mutations in iron-sulfur cluster proteins that improve xylose utilization

    Science.gov (United States)

    Froehlich, Allan; Henningsen, Brooks; Covalla, Sean; Zelle, Rintze M.

    2018-03-20

    There is provided an engineered host cells comprising (a) one or more mutations in one or more endogenous genes encoding a protein associated with iron metabolism; and (b) at least one gene encoding a polypeptide having xylose isomerase activity, and methods of their use thereof.

  20. Clustering of correlated networks

    OpenAIRE

    Dorogovtsev, S. N.

    2003-01-01

    We obtain the clustering coefficient, the degree-dependent local clustering, and the mean clustering of networks with arbitrary correlations between the degrees of the nearest-neighbor vertices. The resulting formulas allow one to determine the nature of the clustering of a network.

  1. Analysis of Network Clustering Algorithms and Cluster Quality Metrics at Scale.

    Science.gov (United States)

    Emmons, Scott; Kobourov, Stephen; Gallant, Mike; Börner, Katy

    2016-01-01

    Notions of community quality underlie the clustering of networks. While studies surrounding network clustering are increasingly common, a precise understanding of the realtionship between different cluster quality metrics is unknown. In this paper, we examine the relationship between stand-alone cluster quality metrics and information recovery metrics through a rigorous analysis of four widely-used network clustering algorithms-Louvain, Infomap, label propagation, and smart local moving. We consider the stand-alone quality metrics of modularity, conductance, and coverage, and we consider the information recovery metrics of adjusted Rand score, normalized mutual information, and a variant of normalized mutual information used in previous work. Our study includes both synthetic graphs and empirical data sets of sizes varying from 1,000 to 1,000,000 nodes. We find significant differences among the results of the different cluster quality metrics. For example, clustering algorithms can return a value of 0.4 out of 1 on modularity but score 0 out of 1 on information recovery. We find conductance, though imperfect, to be the stand-alone quality metric that best indicates performance on the information recovery metrics. Additionally, our study shows that the variant of normalized mutual information used in previous work cannot be assumed to differ only slightly from traditional normalized mutual information. Smart local moving is the overall best performing algorithm in our study, but discrepancies between cluster evaluation metrics prevent us from declaring it an absolutely superior algorithm. Interestingly, Louvain performed better than Infomap in nearly all the tests in our study, contradicting the results of previous work in which Infomap was superior to Louvain. We find that although label propagation performs poorly when clusters are less clearly defined, it scales efficiently and accurately to large graphs with well-defined clusters.

  2. Identification of Protein Complexes from Tandem Affinity Purification/Mass Spectrometry Data via Biased Random Walk.

    Science.gov (United States)

    Cai, Bingjing; Wang, Haiying; Zheng, Huiru; Wang, Hui

    2015-01-01

    Systematic identification of protein complexes from protein-protein interaction networks (PPIs) is an important application of data mining in life science. Over the past decades, various new clustering techniques have been developed based on modelling PPIs as binary relations. Non-binary information of co-complex relations (prey/bait) in PPIs data derived from tandem affinity purification/mass spectrometry (TAP-MS) experiments has been unfairly disregarded. In this paper, we propose a Biased Random Walk based algorithm for detecting protein complexes from TAP-MS data, resulting in the random walk with restarting baits (RWRB). RWRB is developed based on Random walk with restart. The main contribution of RWRB is the incorporation of co-complex relations in TAP-MS PPI networks into the clustering process, by implementing a new restarting strategy during the process of random walk. Through experimentation on un-weighted and weighted TAP-MS data sets, we validated biological significance of our results by mapping them to manually curated complexes. Results showed that, by incorporating non-binary, co-membership information, significant improvement has been achieved in terms of both statistical measurements and biological relevance. Better accuracy demonstrates that the proposed method outperformed several state-of-the-art clustering algorithms for the detection of protein complexes in TAP-MS data.

  3. Structure and physical properties of silicon clusters and of vacancy clusters in bulk silicon

    International Nuclear Information System (INIS)

    Sieck, A.

    2000-01-01

    In this thesis the growth-pattern of free silicon clusters and vacancy clusters in bulk silicon is investigated. The aim is to describe and to better understand the cluster to bulk transition. Silicon structures in between clusters and solids feature new interesting physical properties. The structure and physical properties of silicon clusters can be revealed by a combination of theory and experiment, only. Low-energy clusters are determined with different optimization techniques and a density-functional based tight-binding method. Additionally, infrared and Raman spectra, and polarizabilities calculated within self-consistent field density-functional theory are provided for the smaller clusters. For clusters with 25 to 35 atoms an analysis of the shape of the clusters and the related mobilities in a buffer gas is given. Finally, the clusters observed in low-temperature experiments are identified via the best match between calculated properties and experimental data. Silicon clusters with 10 to 15 atoms have a tricapped trigonal prism as a common subunit. Clusters with up to about 25 atoms follow a prolate growth-path. In the range from 24 to 30 atoms the geometry of the clusters undergoes a transition towards compact spherical structures. Low-energy clusters with up to 240 atoms feature a bonding pattern strikingly different from the tetrahedral bonding in the solid. It follows that structures with dimensions of several Angstroem have electrical and optical properties different from the solid. The calculated stabilities and positron-lifetimes of vacancy clusters in bulk silicon indicate the positron-lifetimes of about 435 ps detected in irradiated silicon to be related to clusters of 9 or 10 vacancies. The vacancies in these clusters form neighboring hexa-rings and, therefore, minimize the number of dangling bonds. (orig.)

  4. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress

    Directory of Open Access Journals (Sweden)

    He eLiu

    2014-10-01

    Full Text Available Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay (EMSA demonstrated that TetR regulator bound directly to the promoter of this gene cluster. Consistently, the results of quantitative real-time PCR also showed alterations in expression of associated genes. Moreover, the proteins affected by TetR under oxidative stress were revealed by comparing proteomic profiles of wild-type and mutant strains via 1D SDS-PAGE and LC-MS/MS analyses. Taken together, our results demonstrated that tetR gene in this novel gene cluster contributed to cell survival under oxidative stress, and TetR protein played an important regulatory role in growth kinetics, biofilm-forming capability, SOD and catalase activity, and oxide detoxicating ability.

  5. Regulatory role of tetR gene in a novel gene cluster of Acidovorax avenae subsp. avenae RS-1 under oxidative stress.

    Science.gov (United States)

    Liu, He; Yang, Chun-Lan; Ge, Meng-Yu; Ibrahim, Muhammad; Li, Bin; Zhao, Wen-Jun; Chen, Gong-You; Zhu, Bo; Xie, Guan-Lin

    2014-01-01

    Acidovorax avenae subsp. avenae is the causal agent of bacterial brown stripe disease in rice. In this study, we characterized a novel horizontal transfer of a gene cluster, including tetR, on the chromosome of A. avenae subsp. avenae RS-1 by genome-wide analysis. TetR acted as a repressor in this gene cluster and the oxidative stress resistance was enhanced in tetR-deletion mutant strain. Electrophoretic mobility shift assay demonstrated that TetR regulator bound directly to the promoter of this gene cluster. Consistently, the results of quantitative real-time PCR also showed alterations in expression of associated genes. Moreover, the proteins affected by TetR under oxidative stress were revealed by comparing proteomic profiles of wild-type and mutant strains via 1D SDS-PAGE and LC-MS/MS analyses. Taken together, our results demonstrated that tetR gene in this novel gene cluster contributed to cell survival under oxidative stress, and TetR protein played an important regulatory role in growth kinetics, biofilm-forming capability, superoxide dismutase and catalase activity, and oxide detoxicating ability.

  6. Cluster Decline and Resilience

    DEFF Research Database (Denmark)

    Østergaard, Christian Richter; Park, Eun Kyung

    Most studies on regional clusters focus on identifying factors and processes that make clusters grow. However, sometimes technologies and market conditions suddenly shift, and clusters decline. This paper analyses the process of decline of the wireless communication cluster in Denmark, 1963......-2011. Our longitudinal study reveals that technological lock-in and exit of key firms have contributed to impairment of the cluster’s resilience in adapting to disruptions. Entrepreneurship has a positive effect on cluster resilience, while multinational companies have contradicting effects by bringing...... in new resources to the cluster but being quick to withdraw in times of crisis....

  7. The rotation of galaxy clusters

    International Nuclear Information System (INIS)

    Tovmassian, H.M.

    2015-01-01

    The method for detection of the galaxy cluster rotation based on the study of distribution of member galaxies with velocities lower and higher of the cluster mean velocity over the cluster image is proposed. The search for rotation is made for flat clusters with a/b> 1.8 and BMI type clusters which are expected to be rotating. For comparison there were studied also round clusters and clusters of NBMI type, the second by brightness galaxy in which does not differ significantly from the cluster cD galaxy. Seventeen out of studied 65 clusters are found to be rotating. It was found that the detection rate is sufficiently high for flat clusters, over 60 per cent, and clusters of BMI type with dominant cD galaxy, ≈ 35 per cent. The obtained results show that clusters were formed from the huge primordial gas clouds and preserved the rotation of the primordial clouds, unless they did not have mergings with other clusters and groups of galaxies, in the result of which the rotation has been prevented

  8. Agricultural Clusters in the Netherlands

    NARCIS (Netherlands)

    Schouten, M.A.; Heijman, W.J.M.

    2012-01-01

    Michael Porter was the first to use the term cluster in an economic context. He introduced the term in The Competitive Advantage of Nations (1990). The term cluster is also known as business cluster, industry cluster, competitive cluster or Porterian cluster. This article aims at determining and

  9. Organization and Dynamics of Receptor Proteins in a Plasma Membrane.

    Science.gov (United States)

    Koldsø, Heidi; Sansom, Mark S P

    2015-11-25

    The interactions of membrane proteins are influenced by their lipid environment, with key lipid species able to regulate membrane protein function. Advances in high-resolution microscopy can reveal the organization and dynamics of proteins and lipids within living cells at resolutions membranes of in vivo-like complexity. We explore the dynamics of proteins and lipids in crowded and complex plasma membrane models, thereby closing the gap in length and complexity between computations and experiments. Our simulations provide insights into the mutual interplay between lipids and proteins in determining mesoscale (20-100 nm) fluctuations of the bilayer, and in enabling oligomerization and clustering of membrane proteins.

  10. ProteinWorldDB: querying radical pairwise alignments among protein sets from complete genomes.

    Science.gov (United States)

    Otto, Thomas Dan; Catanho, Marcos; Tristão, Cristian; Bezerra, Márcia; Fernandes, Renan Mathias; Elias, Guilherme Steinberger; Scaglia, Alexandre Capeletto; Bovermann, Bill; Berstis, Viktors; Lifschitz, Sergio; de Miranda, Antonio Basílio; Degrave, Wim

    2010-03-01

    Many analyses in modern biological research are based on comparisons between biological sequences, resulting in functional, evolutionary and structural inferences. When large numbers of sequences are compared, heuristics are often used resulting in a certain lack of accuracy. In order to improve and validate results of such comparisons, we have performed radical all-against-all comparisons of 4 million protein sequences belonging to the RefSeq database, using an implementation of the Smith-Waterman algorithm. This extremely intensive computational approach was made possible with the help of World Community Grid, through the Genome Comparison Project. The resulting database, ProteinWorldDB, which contains coordinates of pairwise protein alignments and their respective scores, is now made available. Users can download, compare and analyze the results, filtered by genomes, protein functions or clusters. ProteinWorldDB is integrated with annotations derived from Swiss-Prot, Pfam, KEGG, NCBI Taxonomy database and gene ontology. The database is a unique and valuable asset, representing a major effort to create a reliable and consistent dataset of cross-comparisons of the whole protein content encoded in hundreds of completely sequenced genomes using a rigorous dynamic programming approach. The database can be accessed through http://proteinworlddb.org

  11. A GMBCG GALAXY CLUSTER CATALOG OF 55,424 RICH CLUSTERS FROM SDSS DR7

    International Nuclear Information System (INIS)

    Hao Jiangang; Annis, James; Johnston, David E.; McKay, Timothy A.; Evrard, August; Siegel, Seth R.; Gerdes, David; Koester, Benjamin P.; Rykoff, Eli S.; Rozo, Eduardo; Wechsler, Risa H.; Busha, Michael; Becker, Matthew; Sheldon, Erin

    2010-01-01

    We present a large catalog of optically selected galaxy clusters from the application of a new Gaussian Mixture Brightest Cluster Galaxy (GMBCG) algorithm to SDSS Data Release 7 data. The algorithm detects clusters by identifying the red-sequence plus brightest cluster galaxy (BCG) feature, which is unique for galaxy clusters and does not exist among field galaxies. Red-sequence clustering in color space is detected using an Error Corrected Gaussian Mixture Model. We run GMBCG on 8240 deg 2 of photometric data from SDSS DR7 to assemble the largest ever optical galaxy cluster catalog, consisting of over 55,000 rich clusters across the redshift range from 0.1 < z < 0.55. We present Monte Carlo tests of completeness and purity and perform cross-matching with X-ray clusters and with the maxBCG sample at low redshift. These tests indicate high completeness and purity across the full redshift range for clusters with 15 or more members.

  12. A GMBCG galaxy cluster catalog of 55,880 rich clusters from SDSS DR7

    Energy Technology Data Exchange (ETDEWEB)

    Hao, Jiangang; McKay, Timothy A.; Koester, Benjamin P.; Rykoff, Eli S.; Rozo, Eduardo; Annis, James; Wechsler, Risa H.; Evrard, August; Siegel, Seth R.; Becker, Matthew; Busha, Michael; /Fermilab /Michigan U. /Chicago U., Astron. Astrophys. Ctr. /UC, Santa Barbara /KICP, Chicago /KIPAC, Menlo Park /SLAC /Caltech /Brookhaven

    2010-08-01

    We present a large catalog of optically selected galaxy clusters from the application of a new Gaussian Mixture Brightest Cluster Galaxy (GMBCG) algorithm to SDSS Data Release 7 data. The algorithm detects clusters by identifying the red sequence plus Brightest Cluster Galaxy (BCG) feature, which is unique for galaxy clusters and does not exist among field galaxies. Red sequence clustering in color space is detected using an Error Corrected Gaussian Mixture Model. We run GMBCG on 8240 square degrees of photometric data from SDSS DR7 to assemble the largest ever optical galaxy cluster catalog, consisting of over 55,000 rich clusters across the redshift range from 0.1 < z < 0.55. We present Monte Carlo tests of completeness and purity and perform cross-matching with X-ray clusters and with the maxBCG sample at low redshift. These tests indicate high completeness and purity across the full redshift range for clusters with 15 or more members.

  13. Genome-Wide Analysis of Type VI System Clusters and Effectors in Burkholderia Species

    Directory of Open Access Journals (Sweden)

    Thao Thi Nguyen

    2018-02-01

    Full Text Available Type VI secretion system (T6SS has been discovered in a variety of gram-negative bacteria as a versatile weapon to stimulate the killing of eukaryotic cells or prokaryotic competitors. Type VI secretion effectors (T6SEs are well known as key virulence factors for important pathogenic bacteria. In many Burkholderia species, T6SS has evolved as the most complicated secretion pathway with distinguished types to translocate diverse T6SEs, suggesting their essential roles in this genus. Here we attempted to detect and characterize T6SSs and potential T6SEs in target genomes of plant-associated and environmental Burkholderia species based on computational analyses. In total, 66 potential functional T6SS clusters were found in 30 target Burkholderia bacterial genomes, of which 33% possess three or four clusters. The core proteins in each cluster were specified and phylogenetic trees of three components (i.e., TssC, TssD, TssL were constructed to elucidate the relationship among the identified T6SS clusters. Next, we identified 322 potential T6SEs in the target genomes based on homology searches and explored the important domains conserved in effector candidates. In addition, using the screening approach based on the profile hidden Markov model (pHMM of T6SEs that possess markers for type VI effectors (MIX motif (MIX T6SEs, 57 revealed proteins that were not included in training datasets were recognized as novel MIX T6SE candidates from the Burkholderia species. This approach could be useful to identify potential T6SEs from other bacterial genomes.

  14. Subspace K-means clustering

    NARCIS (Netherlands)

    Timmerman, Marieke E.; Ceulemans, Eva; De Roover, Kim; Van Leeuwen, Karla

    2013-01-01

    To achieve an insightful clustering of multivariate data, we propose subspace K-means. Its central idea is to model the centroids and cluster residuals in reduced spaces, which allows for dealing with a wide range of cluster types and yields rich interpretations of the clusters. We review the

  15. Clustering analysis

    International Nuclear Information System (INIS)

    Romli

    1997-01-01

    Cluster analysis is the name of group of multivariate techniques whose principal purpose is to distinguish similar entities from the characteristics they process.To study this analysis, there are several algorithms that can be used. Therefore, this topic focuses to discuss the algorithms, such as, similarity measures, and hierarchical clustering which includes single linkage, complete linkage and average linkage method. also, non-hierarchical clustering method, which is popular name K -mean method ' will be discussed. Finally, this paper will be described the advantages and disadvantages of every methods

  16. Cluster analysis

    CERN Document Server

    Everitt, Brian S; Leese, Morven; Stahl, Daniel

    2011-01-01

    Cluster analysis comprises a range of methods for classifying multivariate data into subgroups. By organizing multivariate data into such subgroups, clustering can help reveal the characteristics of any structure or patterns present. These techniques have proven useful in a wide range of areas such as medicine, psychology, market research and bioinformatics.This fifth edition of the highly successful Cluster Analysis includes coverage of the latest developments in the field and a new chapter dealing with finite mixture models for structured data.Real life examples are used throughout to demons

  17. Ethical implications of excessive cluster sizes in cluster randomised trials.

    Science.gov (United States)

    Hemming, Karla; Taljaard, Monica; Forbes, Gordon; Eldridge, Sandra M; Weijer, Charles

    2018-02-20

    The cluster randomised trial (CRT) is commonly used in healthcare research. It is the gold-standard study design for evaluating healthcare policy interventions. A key characteristic of this design is that as more participants are included, in a fixed number of clusters, the increase in achievable power will level off. CRTs with cluster sizes that exceed the point of levelling-off will have excessive numbers of participants, even if they do not achieve nominal levels of power. Excessively large cluster sizes may have ethical implications due to exposing trial participants unnecessarily to the burdens of both participating in the trial and the potential risks of harm associated with the intervention. We explore these issues through the use of two case studies. Where data are routinely collected, available at minimum cost and the intervention poses low risk, the ethical implications of excessively large cluster sizes are likely to be low (case study 1). However, to maximise the social benefit of the study, identification of excessive cluster sizes can allow for prespecified and fully powered secondary analyses. In the second case study, while there is no burden through trial participation (because the outcome data are routinely collected and non-identifiable), the intervention might be considered to pose some indirect risk to patients and risks to the healthcare workers. In this case study it is therefore important that the inclusion of excessively large cluster sizes is justifiable on other grounds (perhaps to show sustainability). In any randomised controlled trial, including evaluations of health policy interventions, it is important to minimise the burdens and risks to participants. Funders, researchers and research ethics committees should be aware of the ethical issues of excessively large cluster sizes in cluster trials. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is

  18. Minimalist's linux cluster

    International Nuclear Information System (INIS)

    Choi, Chang-Yeong; Kim, Jeong-Hyun; Kim, Seyong

    2004-01-01

    Using barebone PC components and NIC's, we construct a linux cluster which has 2-dimensional mesh structure. This cluster has smaller footprint, is less expensive, and use less power compared to conventional linux cluster. Here, we report our experience in building such a machine and discuss our current lattice project on the machine

  19. Cluster ion beam facilities

    International Nuclear Information System (INIS)

    Popok, V.N.; Prasalovich, S.V.; Odzhaev, V.B.; Campbell, E.E.B.

    2001-01-01

    A brief state-of-the-art review in the field of cluster-surface interactions is presented. Ionised cluster beams could become a powerful and versatile tool for the modification and processing of surfaces as an alternative to ion implantation and ion assisted deposition. The main effects of cluster-surface collisions and possible applications of cluster ion beams are discussed. The outlooks of the Cluster Implantation and Deposition Apparatus (CIDA) being developed in Guteborg University are shown

  20. The Role of Inflammation in the Pain, Fatigue, and Sleep Disturbance Symptom Cluster in Advanced Cancer.

    Science.gov (United States)

    Kwekkeboom, Kristine L; Tostrud, Lauren; Costanzo, Erin; Coe, Christopher L; Serlin, Ronald C; Ward, Sandra E; Zhang, Yingzi

    2018-05-01

    Symptom researchers have proposed a model of inflammatory cytokine activity and dysregulation in cancer to explain co-occurring symptoms including pain, fatigue, and sleep disturbance. We tested the hypothesis that psychological stress accentuates inflammation and that stress and inflammation contribute to one's experience of the pain, fatigue, and sleep disturbance symptom cluster (symptom cluster severity, symptom cluster distress) and its impact (symptom cluster interference with daily life, quality of life). We used baseline data from a symptom cluster management trial. Adult participants (N = 158) receiving chemotherapy for advanced cancer reported pain, fatigue, and sleep disturbance on enrollment. Before intervention, participants completed measures of demographics, perceived stress, symptom cluster severity, symptom cluster distress, symptom cluster interference with daily life, and quality of life and provided a blood sample for four inflammatory biomarkers (interleukin-1β, interleukin-6, tumor necrosis factor-α, and C-reactive protein). Stress was not directly related to any inflammatory biomarker. Stress and tumor necrosis factor-α were positively related to symptom cluster distress, although not symptom cluster severity. Tumor necrosis factor-α was indirectly related to symptom cluster interference with daily life, through its effect on symptom cluster distress. Stress was positively associated with symptom cluster interference with daily life and inversely with quality of life. Stress also had indirect effects on symptom cluster interference with daily life, through its effect on symptom cluster distress. The proposed inflammatory model of symptoms was partially supported. Investigators should test interventions that target stress as a contributing factor in co-occurring pain, fatigue, and sleep disturbance and explore other factors that may influence inflammatory biomarker levels within the context of an advanced cancer diagnosis and treatment

  1. Clustered DNA lesion repair in eukaryotes: Relevance to mutagenesis and cell survival

    Energy Technology Data Exchange (ETDEWEB)

    Sage, Evelyne [Institut Curie, Bat. 110, Centre Universitaire, 91405 Orsay (France); CNRS UMR3348, Bat. 110, Centre Universitaire, 91405 Orsay (France); Harrison, Lynn, E-mail: lclary@lsuhsc.edu [Department of Molecular and Cellular Physiology, LSUHSC-S, 1501 Kings Highway, Shreveport, LA 71130 (United States)

    2011-06-03

    A clustered DNA lesion, also known as a multiply damaged site, is defined as {>=}2 damages in the DNA within 1-2 helical turns. Only ionizing radiation and certain chemicals introduce DNA damage in the genome in this non-random way. What is now clear is that the lethality of a damaging agent is not just related to the types of DNA lesions introduced, but also to how the damage is distributed in the DNA. Clustered DNA lesions were first hypothesized to exist in the 1990s, and work has progressed where these complex lesions have been characterized and measured in irradiated as well as in non-irradiated cells. A clustered lesion can consist of single as well as double strand breaks, base damage and abasic sites, and the damages can be situated on the same strand or opposing strands. They include tandem lesions, double strand break (DSB) clusters and non-DSB clusters, and base excision repair as well as the DSB repair pathways can be required to remove these complex lesions. Due to the plethora of oxidative damage induced by ionizing radiation, and the repair proteins involved in their removal from the DNA, it has been necessary to study how repair systems handle these lesions using synthetic DNA damage. This review focuses on the repair process and mutagenic consequences of clustered lesions in yeast and mammalian cells. By examining the studies on synthetic clustered lesions, and the effects of low vs high LET radiation on mammalian cells or tissues, it is possible to extrapolate the potential biological relevance of these clustered lesions to the killing of tumor cells by radiotherapy and chemotherapy, and to the risk of cancer in non-tumor cells, and this will be discussed.

  2. Clustering by reordering of similarity and Laplacian matrices: Application to galaxy clusters

    Science.gov (United States)

    Mahmoud, E.; Shoukry, A.; Takey, A.

    2018-04-01

    Similarity metrics, kernels and similarity-based algorithms have gained much attention due to their increasing applications in information retrieval, data mining, pattern recognition and machine learning. Similarity Graphs are often adopted as the underlying representation of similarity matrices and are at the origin of known clustering algorithms such as spectral clustering. Similarity matrices offer the advantage of working in object-object (two-dimensional) space where visualization of clusters similarities is available instead of object-features (multi-dimensional) space. In this paper, sparse ɛ-similarity graphs are constructed and decomposed into strong components using appropriate methods such as Dulmage-Mendelsohn permutation (DMperm) and/or Reverse Cuthill-McKee (RCM) algorithms. The obtained strong components correspond to groups (clusters) in the input (feature) space. Parameter ɛi is estimated locally, at each data point i from a corresponding narrow range of the number of nearest neighbors. Although more advanced clustering techniques are available, our method has the advantages of simplicity, better complexity and direct visualization of the clusters similarities in a two-dimensional space. Also, no prior information about the number of clusters is needed. We conducted our experiments on two and three dimensional, low and high-sized synthetic datasets as well as on an astronomical real-dataset. The results are verified graphically and analyzed using gap statistics over a range of neighbors to verify the robustness of the algorithm and the stability of the results. Combining the proposed algorithm with gap statistics provides a promising tool for solving clustering problems. An astronomical application is conducted for confirming the existence of 45 galaxy clusters around the X-ray positions of galaxy clusters in the redshift range [0.1..0.8]. We re-estimate the photometric redshifts of the identified galaxy clusters and obtain acceptable values

  3. What Makes Clusters Decline?

    DEFF Research Database (Denmark)

    Østergaard, Christian Richter; Park, Eun Kyung

    2015-01-01

    Most studies on regional clusters focus on identifying factors and processes that make clusters grow. However, sometimes technologies and market conditions suddenly shift, and clusters decline. This paper analyses the process of decline of the wireless communication cluster in Denmark. The longit...... but being quick to withdraw in times of crisis....

  4. Applications of Cluster Analysis to the Creation of Perfectionism Profiles: A Comparison of two Clustering Approaches

    Directory of Open Access Journals (Sweden)

    Jocelyn H Bolin

    2014-04-01

    Full Text Available Although traditional clustering methods (e.g., K-means have been shown to be useful in the social sciences it is often difficult for such methods to handle situations where clusters in the population overlap or are ambiguous. Fuzzy clustering, a method already recognized in many disciplines, provides a more flexible alternative to these traditional clustering methods. Fuzzy clustering differs from other traditional clustering methods in that it allows for a case to belong to multiple clusters simultaneously. Unfortunately, fuzzy clustering techniques remain relatively unused in the social and behavioral sciences. The purpose of this paper is to introduce fuzzy clustering to these audiences who are currently relatively unfamiliar with the technique. In order to demonstrate the advantages associated with this method, cluster solutions of a common perfectionism measure were created using both fuzzy clustering and K-means clustering, and the results compared. Results of these analyses reveal that different cluster solutions are found by the two methods, and the similarity between the different clustering solutions depends on the amount of cluster overlap allowed for in fuzzy clustering.

  5. Applications of cluster analysis to the creation of perfectionism profiles: a comparison of two clustering approaches.

    Science.gov (United States)

    Bolin, Jocelyn H; Edwards, Julianne M; Finch, W Holmes; Cassady, Jerrell C

    2014-01-01

    Although traditional clustering methods (e.g., K-means) have been shown to be useful in the social sciences it is often difficult for such methods to handle situations where clusters in the population overlap or are ambiguous. Fuzzy clustering, a method already recognized in many disciplines, provides a more flexible alternative to these traditional clustering methods. Fuzzy clustering differs from other traditional clustering methods in that it allows for a case to belong to multiple clusters simultaneously. Unfortunately, fuzzy clustering techniques remain relatively unused in the social and behavioral sciences. The purpose of this paper is to introduce fuzzy clustering to these audiences who are currently relatively unfamiliar with the technique. In order to demonstrate the advantages associated with this method, cluster solutions of a common perfectionism measure were created using both fuzzy clustering and K-means clustering, and the results compared. Results of these analyses reveal that different cluster solutions are found by the two methods, and the similarity between the different clustering solutions depends on the amount of cluster overlap allowed for in fuzzy clustering.

  6. Novel bacterial gas sensor proteins with transition metal-containing prosthetic groups as active sites.

    Science.gov (United States)

    Aono, Shigetoshi

    2012-04-01

    Gas molecules function as signaling molecules in many biological regulatory systems responsible for transcription, chemotaxis, and other complex physiological processes. Gas sensor proteins play a crucial role in regulating such biological systems in response to gas molecules. New sensor proteins that sense oxygen or nitric oxide have recently been found, and they have been characterized by X-ray crystallographic and/or spectroscopic analysis. It has become clear that the interaction between a prosthetic group and gas molecules triggers dynamic structural changes in the protein backbone when a gas sensor protein senses gas molecules. Gas sensor proteins employ novel mechanisms to trigger conformational changes in the presence of a gas. In gas sensor proteins that have iron-sulfur clusters as active sites, the iron-sulfur clusters undergo structural changes, which trigger a conformational change. Heme-based gas sensor proteins reconstruct hydrogen-bonding networks around the heme and heme-bound ligand. Gas sensor proteins have two functional states, on and off, which are active and inactive, respectively, for subsequent signal transduction in response to their physiological effector molecules. To fully understand the structure-function relationships of gas sensor proteins, it is vital to perform X-ray crystal structure analyses of full-length proteins in both the on and off states.

  7. The HectoMAP Cluster Survey. I. redMaPPer Clusters

    Science.gov (United States)

    Sohn, Jubee; Geller, Margaret J.; Rines, Kenneth J.; Hwang, Ho Seong; Utsumi, Yousuke; Diaferio, Antonaldo

    2018-04-01

    We use the dense HectoMAP redshift survey to explore the properties of 104 redMaPPer cluster candidates. The redMaPPer systems in HectoMAP cover the full range of richness and redshift (0.08 systems included in the Subaru/Hyper Suprime-Cam public data release are bona fide clusters. The median number of spectroscopic members per cluster is ∼20. We include redshifts of 3547 member candidates listed in the redMaPPer catalog whether they are cluster members or not. We evaluate the redMaPPer membership probability spectroscopically. The purity (number of real systems) in redMaPPer exceeds 90% even at the lowest richness. Three massive galaxy clusters (M ∼ 2 × 1013 M ⊙) associated with X-ray emission in the HectoMAP region are not included in the public redMaPPer catalog with λ rich > 20, because they lie outside the cuts for this catalog.

  8. Herd Clustering: A synergistic data clustering approach using collective intelligence

    KAUST Repository

    Wong, Kachun

    2014-10-01

    Traditional data mining methods emphasize on analytical abilities to decipher data, assuming that data are static during a mining process. We challenge this assumption, arguing that we can improve the analysis by vitalizing data. In this paper, this principle is used to develop a new clustering algorithm. Inspired by herd behavior, the clustering method is a synergistic approach using collective intelligence called Herd Clustering (HC). The novel part is laid in its first stage where data instances are represented by moving particles. Particles attract each other locally and form clusters by themselves as shown in the case studies reported. To demonstrate its effectiveness, the performance of HC is compared to other state-of-the art clustering methods on more than thirty datasets using four performance metrics. An application for DNA motif discovery is also conducted. The results support the effectiveness of HC and thus the underlying philosophy. © 2014 Elsevier B.V.

  9. Investigation of total seed storage proteins of pakistani and japanese maize (zea mays l.) through sds-page markers

    International Nuclear Information System (INIS)

    Shinwari, Z.K.

    2014-01-01

    The assessment of genetic diversity among the members of a species is of vital importance for successful breeding and adaptability. In the present study 83 genotypes of maize of Pakistani and Japanese origin were evaluated for the total seed storage proteins using sodium dodecyl sulphate polyacrylamide gel electrophoresis (SDS-PAGE) through vertical slab unit. The total protein subunits were separated on 12% polyacrylamide gel using standard protocols. A total of 18 protein subunits were noted out of which 7 (39%) were monomorphic and 11 (61%) were polymorphic, with molecular weight ranging from 10 to 122 kDa. Coefficients of similarity among the accessions ranged between 0.89 and 1.00. The dendrogram obtained through UPGMA clustering method showed two main clusters: 1 and 2. First cluster comprised of 9 genotypes including Sahiwal-2002, while second cluster contained 74 genotypes including Aaiti-2002 and Sadaf. Over all a low level of polymorphism was observed in total seed storage protein patterns of maize genotypes from Pakistan as well as Japan. It is inferred from the present study that more genotypes of maize could be brought under study and more advanced biochemical techniques with more reliable results could be followed to bring assessment of genetic diversity of maize for planning breeding programs. (author)

  10. THE HST/ACS COMA CLUSTER SURVEY. IV. INTERGALACTIC GLOBULAR CLUSTERS AND THE MASSIVE GLOBULAR CLUSTER SYSTEM AT THE CORE OF THE COMA GALAXY CLUSTER

    International Nuclear Information System (INIS)

    Peng, Eric W.; Ferguson, Henry C.; Goudfrooij, Paul; Hammer, Derek; Lucey, John R.; Marzke, Ronald O.; Puzia, Thomas H.; Carter, David; Balcells, Marc; Bridges, Terry; Chiboucas, Kristin; Del Burgo, Carlos; Graham, Alister W.; Guzman, Rafael; Hudson, Michael J.; Matkovic, Ana

    2011-01-01

    Intracluster stellar populations are a natural result of tidal interactions in galaxy clusters. Measuring these populations is difficult, but important for understanding the assembly of the most massive galaxies. The Coma cluster of galaxies is one of the nearest truly massive galaxy clusters and is host to a correspondingly large system of globular clusters (GCs). We use imaging from the HST/ACS Coma Cluster Survey to present the first definitive detection of a large population of intracluster GCs (IGCs) that fills the Coma cluster core and is not associated with individual galaxies. The GC surface density profile around the central massive elliptical galaxy, NGC 4874, is dominated at large radii by a population of IGCs that extend to the limit of our data (R +4000 -5000 (systematic) IGCs out to this radius, and that they make up ∼70% of the central GC system, making this the largest GC system in the nearby universe. Even including the GC systems of other cluster galaxies, the IGCs still make up ∼30%-45% of the GCs in the cluster core. Observational limits from previous studies of the intracluster light (ICL) suggest that the IGC population has a high specific frequency. If the IGC population has a specific frequency similar to high-S N dwarf galaxies, then the ICL has a mean surface brightness of μ V ∼ 27 mag arcsec -2 and a total stellar mass of roughly 10 12 M sun within the cluster core. The ICL makes up approximately half of the stellar luminosity and one-third of the stellar mass of the central (NGC 4874+ICL) system. The color distribution of the IGC population is bimodal, with blue, metal-poor GCs outnumbering red, metal-rich GCs by a ratio of 4:1. The inner GCs associated with NGC 4874 also have a bimodal distribution in color, but with a redder metal-poor population. The fraction of red IGCs (20%), and the red color of those GCs, implies that IGCs can originate from the halos of relatively massive, L* galaxies, and not solely from the disruption of

  11. Range-clustering queries

    NARCIS (Netherlands)

    Abrahamsen, M.; de Berg, M.T.; Buchin, K.A.; Mehr, M.; Mehrabi, A.D.

    2017-01-01

    In a geometric k -clustering problem the goal is to partition a set of points in R d into k subsets such that a certain cost function of the clustering is minimized. We present data structures for orthogonal range-clustering queries on a point set S : given a query box Q and an integer k>2 , compute

  12. Innovation performance and clusters: a dynamic capability perspective on regional technology clusters

    OpenAIRE

    Röttmer, Nicole

    2009-01-01

    This research provides a novel, empirically tested, actionable theory of cluster innovativeness. Cluster innovativeness has for long been subject of research and resulting policy efforts. The cluster's endowment with assets, such as specialized labor, firms, research institutes, existing regional networks and a specific culture are, among others, recognized as sources of innovativeness. While the asset structure of clusters as been subject to a variety of research efforts, the evidence on the...

  13. Global Conservation of Protein Status between Cell Lines and Xenografts

    Directory of Open Access Journals (Sweden)

    Julian Biau

    2016-08-01

    Full Text Available Common preclinical models for testing anticancer treatment include cultured human tumor cell lines in monolayer, and xenografts derived from these cell lines in immunodeficient mice. Our goal was to determine how similar the xenografts are compared with their original cell line and to determine whether it is possible to predict the stability of a xenograft model beforehand. We studied a selection of 89 protein markers of interest in 14 human cell cultures and respective subcutaneous xenografts using the reverse-phase protein array technology. We specifically focused on proteins and posttranslational modifications involved in DNA repair, PI3K pathway, apoptosis, tyrosine kinase signaling, stress, cell cycle, MAPK/ERK signaling, SAPK/JNK signaling, NFκB signaling, and adhesion/cytoskeleton. Using hierarchical clustering, most cell culture-xenograft pairs cluster together, suggesting a global conservation of protein signature. Particularly, Akt, NFkB, EGFR, and Vimentin showed very stable protein expression and phosphorylation levels highlighting that 4 of 10 pathways were highly correlated whatever the model. Other proteins were heterogeneously conserved depending on the cell line. Finally, cell line models with low Akt pathway activation and low levels of Vimentin gave rise to more reliable xenograft models. These results may be useful for the extrapolation of cell culture experiments to in vivo models in novel targeted drug discovery.

  14. Fuzzy Clustering

    DEFF Research Database (Denmark)

    Berks, G.; Keyserlingk, Diedrich Graf von; Jantzen, Jan

    2000-01-01

    A symptom is a condition indicating the presence of a disease, especially, when regarded as an aid in diagnosis.Symptoms are the smallest units indicating the existence of a disease. A syndrome on the other hand is an aggregate, set or cluster of concurrent symptoms which together indicate...... and clustering are the basic concerns in medicine. Classification depends on definitions of the classes and their required degree of participant of the elements in the cases' symptoms. In medicine imprecise conditions are the rule and therefore fuzzy methods are much more suitable than crisp ones. Fuzzy c......-mean clustering is an easy and well improved tool, which has been applied in many medical fields. We used c-mean fuzzy clustering after feature extraction from an aphasia database. Factor analysis was applied on a correlation matrix of 26 symptoms of language disorders and led to five factors. The factors...

  15. Nanomanufacturing of titania interfaces with controlled structural and functional properties by supersonic cluster beam deposition

    International Nuclear Information System (INIS)

    Podestà, Alessandro; Borghi, Francesca; Indrieri, Marco; Bovio, Simone; Piazzoni, Claudio; Milani, Paolo

    2015-01-01

    Great emphasis is placed on the development of integrated approaches for the synthesis and the characterization of ad hoc nanostructured platforms, to be used as templates with controlled morphology and chemical properties for the investigation of specific phenomena of great relevance in interdisciplinary fields such as biotechnology, medicine, and advanced materials. Here, we discuss the crucial role and the advantages of thin film deposition strategies based on cluster-assembling from supersonic cluster beams. We select cluster-assembled nanostructured titania (ns-TiO 2 ) as a case study to demonstrate that accurate control over morphological parameters can be routinely achieved, and consequently, over several relevant interfacial properties and phenomena, like surface charging in a liquid electrolyte, and proteins and nanoparticles adsorption. In particular, we show that the very good control of nanoscale morphology is obtained by taking advantage of simple scaling laws governing the ballistic deposition regime of low-energy, mass-dispersed clusters with reduced surface mobility

  16. Nanomanufacturing of titania interfaces with controlled structural and functional properties by supersonic cluster beam deposition

    Energy Technology Data Exchange (ETDEWEB)

    Podestà, Alessandro, E-mail: alessandro.podesta@mi.infn.it, E-mail: pmilani@mi.infn.it; Borghi, Francesca; Indrieri, Marco; Bovio, Simone; Piazzoni, Claudio; Milani, Paolo, E-mail: alessandro.podesta@mi.infn.it, E-mail: pmilani@mi.infn.it [Centro Interdisciplinare Materiali e Interfacce Nanostrutturati (C.I.Ma.I.Na.), Dipartimento di Fisica, Università degli Studi di Milano, via Celoria 16, 20133 Milano (Italy)

    2015-12-21

    Great emphasis is placed on the development of integrated approaches for the synthesis and the characterization of ad hoc nanostructured platforms, to be used as templates with controlled morphology and chemical properties for the investigation of specific phenomena of great relevance in interdisciplinary fields such as biotechnology, medicine, and advanced materials. Here, we discuss the crucial role and the advantages of thin film deposition strategies based on cluster-assembling from supersonic cluster beams. We select cluster-assembled nanostructured titania (ns-TiO{sub 2}) as a case study to demonstrate that accurate control over morphological parameters can be routinely achieved, and consequently, over several relevant interfacial properties and phenomena, like surface charging in a liquid electrolyte, and proteins and nanoparticles adsorption. In particular, we show that the very good control of nanoscale morphology is obtained by taking advantage of simple scaling laws governing the ballistic deposition regime of low-energy, mass-dispersed clusters with reduced surface mobility.

  17. Nanomanufacturing of titania interfaces with controlled structural and functional properties by supersonic cluster beam deposition

    Science.gov (United States)

    Podestà, Alessandro; Borghi, Francesca; Indrieri, Marco; Bovio, Simone; Piazzoni, Claudio; Milani, Paolo

    2015-12-01

    Great emphasis is placed on the development of integrated approaches for the synthesis and the characterization of ad hoc nanostructured platforms, to be used as templates with controlled morphology and chemical properties for the investigation of specific phenomena of great relevance in interdisciplinary fields such as biotechnology, medicine, and advanced materials. Here, we discuss the crucial role and the advantages of thin film deposition strategies based on cluster-assembling from supersonic cluster beams. We select cluster-assembled nanostructured titania (ns-TiO2) as a case study to demonstrate that accurate control over morphological parameters can be routinely achieved, and consequently, over several relevant interfacial properties and phenomena, like surface charging in a liquid electrolyte, and proteins and nanoparticles adsorption. In particular, we show that the very good control of nanoscale morphology is obtained by taking advantage of simple scaling laws governing the ballistic deposition regime of low-energy, mass-dispersed clusters with reduced surface mobility.

  18. Photochemistry in rare gas clusters

    International Nuclear Information System (INIS)

    Moeller, T.; Haeften, K. von; Pietrowski, R. von

    1999-01-01

    In this contribution photochemical processes in pure rare gas clusters will be discussed. The relaxation dynamics of electronically excited He clusters is investigated with luminescence spectroscopy. After electronic excitation of He clusters many sharp lines are observed in the visible and infrared spectral range which can be attributed to He atoms and molecules desorbing from the cluster. It turns out that the desorption of electronically excited He atoms and molecules is an important decay channel. The findings for He clusters are compared with results for Ar clusters. While desorption of electronically excited He atoms is observed for all clusters containing up to several thousand atoms a corresponding process in Ar clusters is only observed for very small clusters (N<10). (orig.)

  19. Photochemistry in rare gas clusters

    Energy Technology Data Exchange (ETDEWEB)

    Moeller, T.; Haeften, K. von; Pietrowski, R. von [Deutsches Elektronen-Synchrotron (DESY), Hamburg (Germany). Hamburger Synchrotronstrahlungslabor; Laarman, T. [Universitaet Hamburg, II. Institut fuer Experimentalphysik, Luruper Chaussee 149, D-22761 Hamburg (Germany)

    1999-12-01

    In this contribution photochemical processes in pure rare gas clusters will be discussed. The relaxation dynamics of electronically excited He clusters is investigated with luminescence spectroscopy. After electronic excitation of He clusters many sharp lines are observed in the visible and infrared spectral range which can be attributed to He atoms and molecules desorbing from the cluster. It turns out that the desorption of electronically excited He atoms and molecules is an important decay channel. The findings for He clusters are compared with results for Ar clusters. While desorption of electronically excited He atoms is observed for all clusters containing up to several thousand atoms a corresponding process in Ar clusters is only observed for very small clusters (N<10). (orig.)

  20. Stochastic coupled cluster theory: Efficient sampling of the coupled cluster expansion

    Science.gov (United States)

    Scott, Charles J. C.; Thom, Alex J. W.

    2017-09-01

    We consider the sampling of the coupled cluster expansion within stochastic coupled cluster theory. Observing the limitations of previous approaches due to the inherently non-linear behavior of a coupled cluster wavefunction representation, we propose new approaches based on an intuitive, well-defined condition for sampling weights and on sampling the expansion in cluster operators of different excitation levels. We term these modifications even and truncated selections, respectively. Utilising both approaches demonstrates dramatically improved calculation stability as well as reduced computational and memory costs. These modifications are particularly effective at higher truncation levels owing to the large number of terms within the cluster expansion that can be neglected, as demonstrated by the reduction of the number of terms to be sampled when truncating at triple excitations by 77% and hextuple excitations by 98%.