WorldWideScience

Sample records for hox protein classification

  1. Solving Classification Problems for Large Sets of Protein Sequences with the Example of Hox and ParaHox Proteins

    Directory of Open Access Journals (Sweden)

    Stefanie D. Hueber

    2016-02-01

    Full Text Available Phylogenetic methods are key to providing models for how a given protein family evolved. However, these methods run into difficulties when sequence divergence is either too low or too high. Here, we provide a case study of Hox and ParaHox proteins so that additional insights can be gained using a new computational approach to help solve old classification problems. For two (Gsx and Cdx out of three ParaHox proteins the assignments differ between the currently most established view and four alternative scenarios. We use a non-phylogenetic, pairwise-sequence-similarity-based method to assess which of the previous predictions, if any, are best supported by the sequence-similarity relationships between Hox and ParaHox proteins. The overall sequence-similarities show Gsx to be most similar to Hox2–3, and Cdx to be most similar to Hox4–8. The results indicate that a purely pairwise-sequence-similarity-based approach can provide additional information not only when phylogenetic inference methods have insufficient information to provide reliable classifications (as was shown previously for central Hox proteins, but also when the sequence variation is so high that the resulting phylogenetic reconstructions are likely plagued by long-branch-attraction artifacts.

  2. The TALE face of Hox proteins in animal evolution.

    Science.gov (United States)

    Merabet, Samir; Galliot, Brigitte

    2015-01-01

    Hox genes are major regulators of embryonic development. One of their most conserved functions is to coordinate the formation of specific body structures along the anterior-posterior (AP) axis in Bilateria. This architectural role was at the basis of several morphological innovations across bilaterian evolution. In this review, we traced the origin of the Hox patterning system by considering the partnership with PBC and Meis proteins. PBC and Meis belong to the TALE-class of homeodomain-containing transcription factors and act as generic cofactors of Hox proteins for AP axis patterning in Bilateria. Recent data indicate that Hox proteins acquired the ability to interact with their TALE partners in the last common ancestor of Bilateria and Cnidaria. These interactions relied initially on a short peptide motif called hexapeptide (HX), which is present in Hox and non-Hox protein families. Remarkably, Hox proteins can also recruit the TALE cofactors by using specific PBC Interaction Motifs (SPIMs). We describe how a functional Hox/TALE patterning system emerged in eumetazoans through the acquisition of SPIMs. We anticipate that interaction flexibility could be found in other patterning systems, being at the heart of the astonishing morphological diversity observed in the animal kingdom.

  3. Multiple structure-intrinsic disorder interactions regulate and coordinate Hox protein function

    Science.gov (United States)

    Bondos, Sarah

    During animal development, Hox transcription factors determine fate of developing tissues to generate diverse organs and appendages. Hox proteins are famous for their bizarre mutant phenotypes, such as replacing antennae with legs. Clearly, the functions of individual Hox proteins must be distinct and reliable in vivo, or the organism risks malformation or death. However, within the Hox protein family, the DNA-binding homeodomains are highly conserved and the amino acids that contact DNA are nearly invariant. These observations raise the question: How do different Hox proteins correctly identify their distinct target genes using a common DNA binding domain? One possible means to modulate DNA binding is through the influence of the non-homeodomain protein regions, which differ significantly among Hox proteins. However genetic approaches never detected intra-protein interactions, and early biochemical attempts were hindered because the special features of ``intrinsically disordered'' sequences were not appreciated. We propose the first-ever structural model of a Hox protein to explain how specific contacts between distant, intrinsically disordered regions of the protein and the homeodomain regulate DNA binding and coordinate this activity with other Hox molecular functions.

  4. To Be Specific or Not: The Critical Relationship Between Hox And TALE Proteins.

    Science.gov (United States)

    Merabet, Samir; Mann, Richard S

    2016-06-01

    Hox proteins are key regulatory transcription factors that act in different tissues of the embryo to provide specific spatial and temporal coordinates to each cell. These patterning functions often depend on the presence of the TALE-homeodomain class cofactors, which form cooperative DNA-binding complexes with all Hox proteins. How this family of cofactors contributes to the highly diverse and specific functions of Hox proteins in vivo remains an important unsolved question. We review here the most recent advances in understanding the molecular mechanisms underlying Hox-TALE function. In particular, we discuss the role of DNA shape, DNA-binding affinity, and protein-protein interaction flexibility in dictating Hox-TALE specificity. We propose several models to explain how these mechanisms are integrated with each other in the context of the many distinct functions that Hox and TALE factors carry out in vivo. Copyright © 2016 Elsevier Ltd. All rights reserved.

  5. Human HOX Proteins Use Diverse and Context-Dependent Motifs to Interact with TALE Class Cofactors.

    Science.gov (United States)

    Dard, Amélie; Reboulet, Jonathan; Jia, Yunlong; Bleicher, Françoise; Duffraisse, Marilyne; Vanaker, Jean-Marc; Forcet, Christelle; Merabet, Samir

    2018-03-13

    HOX proteins achieve numerous functions by interacting with the TALE class PBX and MEIS cofactors. In contrast to this established partnership in development and disease, how HOX proteins could interact with PBX and MEIS remains unclear. Here, we present a systematic analysis of HOX/PBX/MEIS interaction properties, scanning all paralog groups with human and mouse HOX proteins in vitro and in live cells. We demonstrate that a previously characterized HOX protein motif known to be critical for HOX-PBX interactions becomes dispensable in the presence of MEIS in all except the two most anterior paralog groups. We further identify paralog-specific TALE-binding sites that are used in a highly context-dependent manner. One of these binding sites is involved in the proliferative activity of HOXA7 in breast cancer cells. Together these findings reveal an extraordinary level of interaction flexibility between HOX proteins and their major class of developmental cofactors. Copyright © 2018 The Author(s). Published by Elsevier Inc. All rights reserved.

  6. Functional dissection of the Hox protein Abdominal-B in Drosophila cell culture

    Energy Technology Data Exchange (ETDEWEB)

    Zhai, Zongzhao [Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beichen West Road, Chaoyang, Beijing 100101 (China); CellNetworks - Cluster of Excellence, Centre for Organismal Studies (COS) Heidelberg, University of Heidelberg, D-69120 Heidelberg (Germany); Graduate School of Chinese Academy of Sciences, Beijing 100039 (China); Yang, Xingke, E-mail: yangxk@ioz.ac.cn [Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beichen West Road, Chaoyang, Beijing 100101 (China); Lohmann, Ingrid, E-mail: ilohmann@flydev.org [CellNetworks - Cluster of Excellence, Centre for Organismal Studies (COS) Heidelberg, University of Heidelberg, D-69120 Heidelberg (Germany)

    2011-11-04

    Highlights: Black-Right-Pointing-Pointer ct340 CRM was identified to be the posterior spiracle enhancer of gene cut. Black-Right-Pointing-Pointer ct340 is under the direct transcriptional control of Hox protein Abd-B. Black-Right-Pointing-Pointer An efficient cloning system was developed to assay protein-DNA interaction. Black-Right-Pointing-Pointer New features of Abd-B dependent target gene regulation were detected. -- Abstract: Hox transcription factors regulate the morphogenesis along the anterior-posterior (A/P) body axis through the interaction with small cis-regulatory modules (CRMs) of their target gene, however so far very few Hox CRMs are known and have been analyzed in detail. In this study we have identified a new Hox CRM, ct340, which guides the expression of the cell type specification gene cut (ct) in the posterior spiracle under the direct control of the Hox protein Abdominal-B (Abd-B). Using the ct340 enhancer activity as readout, an efficient cloning system to generate VP16 activation domain fusion protein was developed to unambiguously test protein-DNA interaction in Drosophila cell culture. By functionally dissecting the Abd-B protein, new features of Abd-B dependent target gene regulation were detected. Due to its easy adaptability, this system can be generally used to map functional domains within sequence-specific transcriptional factors in Drosophila cell culture, and thus provide preliminary knowledge of the protein functional domain structure for further in vivo analysis.

  7. TALE factors poise promoters for activation by Hox proteins.

    Science.gov (United States)

    Choe, Seong-Kyu; Ladam, Franck; Sagerström, Charles G

    2014-01-27

    Hox proteins form complexes with TALE cofactors from the Pbx and Prep/Meis families to control transcription, but it remains unclear how Hox:TALE complexes function. Examining a Hoxb1b:TALE complex that regulates zebrafish hoxb1a transcription, we find maternally deposited TALE proteins at the hoxb1a promoter already during blastula stages. These TALE factors recruit histone-modifying enzymes to promote an active chromatin profile at the hoxb1a promoter and also recruit RNA polymerase II (RNAPII) and P-TEFb. However, in the presence of TALE factors, RNAPII remains phosphorylated on serine 5 and hoxb1a transcription is inefficient. By gastrula stages, Hoxb1b binds together with TALE factors to the hoxb1a promoter. This triggers P-TEFb-mediated transitioning of RNAPII to the serine 2-phosphorylated form and efficient hoxb1a transcription. We conclude that TALE factors access promoters during early embryogenesis to poise them for activation but that Hox proteins are required to trigger efficient transcription. Copyright © 2014 Elsevier Inc. All rights reserved.

  8. Functional dissection of the Hox protein Abdominal-B in Drosophila cell culture

    International Nuclear Information System (INIS)

    Zhai, Zongzhao; Yang, Xingke; Lohmann, Ingrid

    2011-01-01

    Highlights: ► ct340 CRM was identified to be the posterior spiracle enhancer of gene cut. ► ct340 is under the direct transcriptional control of Hox protein Abd-B. ► An efficient cloning system was developed to assay protein–DNA interaction. ► New features of Abd-B dependent target gene regulation were detected. -- Abstract: Hox transcription factors regulate the morphogenesis along the anterior–posterior (A/P) body axis through the interaction with small cis-regulatory modules (CRMs) of their target gene, however so far very few Hox CRMs are known and have been analyzed in detail. In this study we have identified a new Hox CRM, ct340, which guides the expression of the cell type specification gene cut (ct) in the posterior spiracle under the direct control of the Hox protein Abdominal-B (Abd-B). Using the ct340 enhancer activity as readout, an efficient cloning system to generate VP16 activation domain fusion protein was developed to unambiguously test protein–DNA interaction in Drosophila cell culture. By functionally dissecting the Abd-B protein, new features of Abd-B dependent target gene regulation were detected. Due to its easy adaptability, this system can be generally used to map functional domains within sequence-specific transcriptional factors in Drosophila cell culture, and thus provide preliminary knowledge of the protein functional domain structure for further in vivo analysis.

  9. Differential pleiotropy and HOX functional organization.

    Science.gov (United States)

    Sivanantharajah, Lovesha; Percival-Smith, Anthony

    2015-02-01

    Key studies led to the idea that transcription factors are composed of defined modular protein motifs or domains, each with separable, unique function. During evolution, the recombination of these modular domains could give rise to transcription factors with new properties, as has been shown using recombinant molecules. This archetypic, modular view of transcription factor organization is based on the analyses of a few transcription factors such as GAL4, which may represent extreme exemplars rather than an archetype or the norm. Recent work with a set of Homeotic selector (HOX) proteins has revealed differential pleiotropy: the observation that highly-conserved HOX protein motifs and domains make small, additive, tissue specific contributions to HOX activity. Many of these differentially pleiotropic HOX motifs may represent plastic sequence elements called short linear motifs (SLiMs). The coupling of differential pleiotropy with SLiMs, suggests that protein sequence changes in HOX transcription factors may have had a greater impact on morphological diversity during evolution than previously believed. Furthermore, differential pleiotropy may be the genetic consequence of an ensemble nature of HOX transcription factor allostery, where HOX proteins exist as an ensemble of states with the capacity to integrate an extensive array of developmental information. Given a new structural model for HOX functional domain organization, the properties of the archetypic TF may require reassessment. Copyright © 2014 Elsevier Inc. All rights reserved.

  10. Insights into Hox protein function from a large scale combinatorial analysis of protein domains.

    Directory of Open Access Journals (Sweden)

    Samir Merabet

    2011-10-01

    Full Text Available Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA, we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences.

  11. Characterization of Rice Homeobox Genes, OsHOX22 and OsHOX24, and Over-expression of OsHOX24 in Transgenic Arabidopsis Suggest their Role in Abiotic Stress Response

    Directory of Open Access Journals (Sweden)

    Annapurna eBhattacharjee

    2016-05-01

    Full Text Available Homeobox transcription factors are well known regulators of plant growth and development. In this study, we carried out functional analysis of two candidate stress-responsive HD-ZIP I class homeobox genes from rice, OsHOX22 and OsHOX24. These genes were highly upregulated under various abiotic stress conditions at different stages of rice development, including seedling, mature and reproductive stages. The transcript levels of these genes were enhanced significantly in the presence of plant hormones, including abscisic acid (ABA, auxin, salicylic acid and gibberellic acid. The recombinant full-length and truncated homeobox proteins were found to be localized in the nucleus. Electrophoretic mobility shift assay established the binding of these homeobox proteins with specific DNA sequences, AH1 (CAAT(A/TATTG and AH2 (CAAT(C/GATTG. Transactivation assays in yeast revealed the transcriptional activation potential of full-length OsHOX22 and OsHOX24 proteins. Homo- and hetero-dimerization capabilities of these proteins have also been demonstrated. Further, we identified putative novel interacting proteins of OsHOX22 and OsHOX24 via yeast-two hybrid analysis. Over-expression of OsHOX24 imparted higher sensitivity to stress hormone, ABA, and abiotic stresses in the transgenic Arabidopsis plants as revealed by various physiological and phenotypic assays. Microarray analysis revealed differential expression of several stress-responsive genes in transgenic lines as compared to wild-type. Many of these genes were found to be involved in transcriptional regulation and various metabolic pathways. Altogether, our results suggest the possible role of OsHOX22/OsHOX24 homeobox proteins as negative regulators in abiotic stress responses.

  12. Regulation of Silk Genes by Hox and Homeodomain Proteins in the Terminal Differentiated Silk Gland of the Silkworm Bombyx mori

    Science.gov (United States)

    Takiya, Shigeharu; Tsubota, Takuya; Kimoto, Mai

    2016-01-01

    The silk gland of the silkworm Bombyx mori is a long tubular organ that is divided into several subparts along its anteroposterior (AP) axis. As a trait of terminal differentiation of the silk gland, several silk protein genes are expressed with unique regional specificities. Most of the Hox and some of the homeobox genes are also expressed in the differentiated silk gland with regional specificities. The expression patterns of Hox genes in the silk gland roughly correspond to those in embryogenesis showing “colinearity”. The central Hox class protein Antennapedia (Antp) directly regulates the expression of several middle silk gland–specific silk genes, whereas the Lin-1/Isl-1/Mec3 (LIM)-homeodomain transcriptional factor Arrowhead (Awh) regulates the expression of posterior silk gland–specific genes for silk fiber proteins. We summarize our results and discuss the usefulness of the silk gland of Bombyx mori for analyzing the function of Hox genes. Further analyses of the regulatory mechanisms underlying the region-specific expression of silk genes will provide novel insights into the molecular bases for target-gene selection and regulation by Hox and homeodomain proteins. PMID:29615585

  13. Regulation of Silk Genes by Hox and Homeodomain Proteins in the Terminal Differentiated Silk Gland of the Silkworm Bombyx mori

    Directory of Open Access Journals (Sweden)

    Shigeharu Takiya

    2016-05-01

    Full Text Available The silk gland of the silkworm Bombyx mori is a long tubular organ that is divided into several subparts along its anteroposterior (AP axis. As a trait of terminal differentiation of the silk gland, several silk protein genes are expressed with unique regional specificities. Most of the Hox and some of the homeobox genes are also expressed in the differentiated silk gland with regional specificities. The expression patterns of Hox genes in the silk gland roughly correspond to those in embryogenesis showing “colinearity”. The central Hox class protein Antennapedia (Antp directly regulates the expression of several middle silk gland–specific silk genes, whereas the Lin-1/Isl-1/Mec3 (LIM-homeodomain transcriptional factor Arrowhead (Awh regulates the expression of posterior silk gland–specific genes for silk fiber proteins. We summarize our results and discuss the usefulness of the silk gland of Bombyx mori for analyzing the function of Hox genes. Further analyses of the regulatory mechanisms underlying the region-specific expression of silk genes will provide novel insights into the molecular bases for target-gene selection and regulation by Hox and homeodomain proteins.

  14. Effect of increased HoxB4 on human megakaryocytic development

    International Nuclear Information System (INIS)

    Zhong, Yiming; Sullenbarger, Brent; Lasky, Larry C.

    2010-01-01

    Research highlights: → HoxB4 overexpression in human TF1 cells increased the expression of CD61 and CD41a. → HoxB4 fusion protein enhanced megakaryocytic development of CD34 + cord blood cells. → Ectopic HoxB4 increased Tpo receptor expression and decreased c-Myb expression. → HoxB4 RNA silencing increased c-Myb expression and decreased Fli-1 expression. -- Abstract: In order to produce clinically useful quantities of platelets ex vivo we may need to firstly enhance early self-renewal of hematopoietic stem cells (HSCs) and/or megakaryocyte (Mk) progenitors. The homeodomain transcription factor HoxB4 has been shown to be an important regulator of stem cell renewal and hematopoiesis; however, its effect on megakaryopoiesis is unclear. In this study, we investigated the effect of HoxB4 overexpression or RNA silencing on megakaryocytic development in the human TF1 progenitor cell line; we then used recombinant tPTD-HoxB4 fusion protein to study the effect of exogenous HoxB4 on megakaryocytic development of human CD34 positively-selected cord blood cells. We found that ectopic HoxB4 in TF1 cells increased the antigen expression of CD61and CD41a, increased the gene expression of thrombopoietin receptor (TpoR), Scl-1, Cyclin D1, Fog-1 and Fli-1 while it decreased c-Myb expression. HoxB4 RNA silencing in TF1 cells decreased the expression of CD61 and CD41a and decreased Fli-1 expression while it increased the expression of c-Myb. Recombinant tPTD-HoxB4 fusion protein increased the percentages and absolute numbers of CD41a and CD61 positive cells during megakaryocytic differentiation of CD34 positively-selected cord blood cells and increased the numbers of colony-forming unit-megakaryocyte (CFU-Mk). Adding tPTD-HoxB4 fusion protein increased the gene expression of TpoR, Cyclin D1, Fog-1 and Fli-1 while it inhibited c-Myb expression. Our data suggest that increased HoxB4 enhanced early megakaryocytic development in human TF1 cells and CD34 positively-selected cord

  15. Effect of increased HoxB4 on human megakaryocytic development

    Energy Technology Data Exchange (ETDEWEB)

    Zhong, Yiming [Department of Pathology, The Ohio State University, Columbus, OH (United States); Program in Molecular, Cellular, and Developmental Biology, The Ohio State University, Columbus, OH (United States); Sullenbarger, Brent [Department of Pathology, The Ohio State University, Columbus, OH (United States); Lasky, Larry C., E-mail: Lasky.4@osu.edu [Department of Pathology, The Ohio State University, Columbus, OH (United States); Program in Molecular, Cellular, and Developmental Biology, The Ohio State University, Columbus, OH (United States)

    2010-07-30

    Research highlights: {yields} HoxB4 overexpression in human TF1 cells increased the expression of CD61 and CD41a. {yields} HoxB4 fusion protein enhanced megakaryocytic development of CD34{sup +} cord blood cells. {yields} Ectopic HoxB4 increased Tpo receptor expression and decreased c-Myb expression. {yields} HoxB4 RNA silencing increased c-Myb expression and decreased Fli-1 expression. -- Abstract: In order to produce clinically useful quantities of platelets ex vivo we may need to firstly enhance early self-renewal of hematopoietic stem cells (HSCs) and/or megakaryocyte (Mk) progenitors. The homeodomain transcription factor HoxB4 has been shown to be an important regulator of stem cell renewal and hematopoiesis; however, its effect on megakaryopoiesis is unclear. In this study, we investigated the effect of HoxB4 overexpression or RNA silencing on megakaryocytic development in the human TF1 progenitor cell line; we then used recombinant tPTD-HoxB4 fusion protein to study the effect of exogenous HoxB4 on megakaryocytic development of human CD34 positively-selected cord blood cells. We found that ectopic HoxB4 in TF1 cells increased the antigen expression of CD61and CD41a, increased the gene expression of thrombopoietin receptor (TpoR), Scl-1, Cyclin D1, Fog-1 and Fli-1 while it decreased c-Myb expression. HoxB4 RNA silencing in TF1 cells decreased the expression of CD61 and CD41a and decreased Fli-1 expression while it increased the expression of c-Myb. Recombinant tPTD-HoxB4 fusion protein increased the percentages and absolute numbers of CD41a and CD61 positive cells during megakaryocytic differentiation of CD34 positively-selected cord blood cells and increased the numbers of colony-forming unit-megakaryocyte (CFU-Mk). Adding tPTD-HoxB4 fusion protein increased the gene expression of TpoR, Cyclin D1, Fog-1 and Fli-1 while it inhibited c-Myb expression. Our data suggest that increased HoxB4 enhanced early megakaryocytic development in human TF1 cells and CD34

  16. Molecular insights into the origin of the Hox-TALE patterning system.

    Science.gov (United States)

    Hudry, Bruno; Thomas-Chollier, Morgane; Volovik, Yael; Duffraisse, Marilyne; Dard, Amélie; Frank, Dale; Technau, Ulrich; Merabet, Samir

    2014-03-18

    Despite tremendous body form diversity in nature, bilaterian animals share common sets of developmental genes that display conserved expression patterns in the embryo. Among them are the Hox genes, which define different identities along the anterior-posterior axis. Hox proteins exert their function by interaction with TALE transcription factors. Hox and TALE members are also present in some but not all non-bilaterian phyla, raising the question of how Hox-TALE interactions evolved to provide positional information. By using proteins from unicellular and multicellular lineages, we showed that these networks emerged from an ancestral generic motif present in Hox and other related protein families. Interestingly, Hox-TALE networks experienced additional and extensive molecular innovations that were likely crucial for differentiating Hox functions along body plans. Together our results highlight how homeobox gene families evolved during eukaryote evolution to eventually constitute a major patterning system in Eumetazoans. DOI: http://dx.doi.org/10.7554/eLife.01939.001.

  17. Targeting HOX and PBX transcription factors in ovarian cancer

    International Nuclear Information System (INIS)

    Morgan, Richard; Plowright, Lynn; Harrington, Kevin J; Michael, Agnieszka; Pandha, Hardev S

    2010-01-01

    Ovarian cancer still has a relatively poor prognosis due to the frequent occurrence of drug resistance, making the identification of new therapeutic targets an important goal. We have studied the role of HOX genes in the survival and proliferation of ovarian cancer cells. These are a family of homeodomain-containing transcription factors that determine cell and tissue identity in the early embryo, and have an anti-apoptotic role in a number of malignancies including lung and renal cancer. We used QPCR to determine HOX gene expression in normal ovary and in the ovarian cancer cell lines SK-OV3 and OV-90. We used a short peptide, HXR9, to disrupt the formation of HOX/PBX dimers and alter transcriptional regulation by HOX proteins. In this study we show that the ovarian cancer derived line SK-OV3, but not OV-90, exhibits highly dysregulated expression of members of the HOX gene family. Disrupting the interaction between HOX proteins and their co-factor PBX induces apoptosis in SK-OV3 cells and retards tumour growth in vivo. HOX/PBX binding is a potential target in ovarian cancer

  18. Glimpse into Hox and tale regulation of cell differentiation and reprogramming.

    Science.gov (United States)

    Cerdá-Esteban, Nuria; Spagnoli, Francesca M

    2014-01-01

    During embryonic development, cells become gradually restricted in their developmental potential and start elaborating lineage-specific transcriptional networks to ultimately acquire a unique differentiated state. Hox genes play a central role in specifying regional identities, thereby providing the cell with critical information on positional value along its differentiation path. The exquisite DNA-binding specificity of the Hox proteins is frequently dependent upon their interaction with members of the TALE family of homeodomain proteins. In addition to their function as Hox-cofactors, TALE homeoproteins control multiple crucial developmental processes through Hox-independent mechanisms. Here, we will review recent findings on the function of both Hox and TALE proteins in cell differentiation, referring mostly to vertebrate species. In addition, we will discuss the direct implications of this knowledge on cell plasticity and cell reprogramming. Copyright © 2013 Wiley Periodicals, Inc.

  19. Grappling with the HOX network in hematopoiesis and leukemia.

    Science.gov (United States)

    McGonigle, Glenda J; Lappin, Terence R J; Thompson, Alexander

    2008-05-01

    The mammalian HOX gene network encodes a family of proteins which act as master regulators of developmental processes such as embryogenesis and hematopoiesis. The complex arrangement, regulation and co-factor association of HOX has been an area of intense research, particularly in cancer biology, for over a decade. The concept of redeployment of embryonic regulators in the neoplastic arena has received support from many quarters. Observations of altered HOX gene expression in various solid tumours and leukemia appear to support the thesis that 'oncology recapitulates ontogeny' but the identification of critical HOX subsets and their functional role in cancer onset and maintenance requires further investigation. The application of novel techniques and model systems will continue to enhance our understanding of the HOX network in the years to come. Better understanding of the intricacy of the complex as well as identification of functional pathways and direct targets of the encoded proteins will permit harnessing of this family of genes for clinical application.

  20. HoxA Genes and the Fin-to-Limb Transition in Vertebrates

    Directory of Open Access Journals (Sweden)

    João Leite-Castro

    2016-02-01

    Full Text Available HoxA genes encode for important DNA-binding transcription factors that act during limb development, regulating primarily gene expression and, consequently, morphogenesis and skeletal differentiation. Within these genes, HoxA11 and HoxA13 were proposed to have played an essential role in the enigmatic evolutionary transition from fish fins to tetrapod limbs. Indeed, comparative gene expression analyses led to the suggestion that changes in their regulation might have been essential for the diversification of vertebrates’ appendages. In this review, we highlight three potential modifications in the regulation and function of these genes that may have boosted appendage evolution: (1 the expansion of polyalanine repeats in the HoxA11 and HoxA13 proteins; (2 the origin of +a novel long-non-coding RNA with a possible inhibitory function on HoxA11; and (3 the acquisition of cis-regulatory elements modulating 5’ HoxA transcription. We discuss the relevance of these mechanisms for appendage diversification reviewing the current state of the art and performing additional comparative analyses to characterize, in a phylogenetic framework, HoxA11 and HoxA13 expression, alanine composition within the encoded proteins, long-non-coding RNAs and cis-regulatory elements.

  1. Polycomb-dependent regulatory contacts between distant Hox loci in Drosophila

    DEFF Research Database (Denmark)

    Bantignies, Frédéric; Roure, Virginie; Comet, Itys

    2011-01-01

    In Drosophila melanogaster, Hox genes are organized in an anterior and a posterior cluster, called Antennapedia complex and bithorax complex, located on the same chromosome arm and separated by 10 Mb of DNA. Both clusters are repressed by Polycomb group (PcG) proteins. Here, we show that genes...... of the two Hox complexes can interact within nuclear PcG bodies in tissues where they are corepressed. This colocalization increases during development and depends on PcG proteins. Hox gene contacts are conserved in the distantly related Drosophila virilis species and they are part of a large gene...

  2. Functional comparison of the nematode Hox gene lin-39 in C. elegans and P. pacificus reveals evolutionary conservation of protein function despite divergence of primary sequences

    OpenAIRE

    Grandien, Kaj; Sommer, Ralf J.

    2001-01-01

    Hox transcription factors have been implicated in playing a central role in the evolution of animal morphology. Many studies indicate the evolutionary importance of regulatory changes in Hox genes, but little is known about the role of functional changes in Hox proteins. In the nematodes Pristionchus pacificus and Caenorhabditis elegans, developmental processes can be compared at the cellular, genetic, and molecular levels and differences in gene function can be identified. The Hox gene lin-3...

  3. Hox genes and evolution [version 1; referees: 3 approved

    Directory of Open Access Journals (Sweden)

    Steven M. Hrycaj

    2016-05-01

    Full Text Available Hox proteins are a deeply conserved group of transcription factors originally defined for their critical roles in governing segmental identity along the antero-posterior (AP axis in Drosophila. Over the last 30 years, numerous data generated in evolutionarily diverse taxa have clearly shown that changes in the expression patterns of these genes are closely associated with the regionalization of the AP axis, suggesting that Hox genes have played a critical role in the evolution of novel body plans within Bilateria. Despite this deep functional conservation and the importance of these genes in AP patterning, key questions remain regarding many aspects of Hox biology. In this commentary, we highlight recent reports that have provided novel insight into the origins of the mammalian Hox cluster, the role of Hox genes in the generation of a limbless body plan, and a novel putative mechanism in which Hox genes may encode specificity along the AP axis. Although the data discussed here offer a fresh perspective, it is clear that there is still much to learn about Hox biology and the roles it has played in the evolution of the Bilaterian body plan.

  4. Broken colinearity of the amphioxus Hox cluster

    Directory of Open Access Journals (Sweden)

    Pascual-Anaya Juan

    2012-12-01

    Full Text Available Abstract Background In most eumetazoans studied so far, Hox genes determine the identity of structures along the main body axis. They are usually linked in genomic clusters and, in the case of the vertebrate embryo, are expressed with spatial and temporal colinearity. Outside vertebrates, temporal colinearity has been reported in the cephalochordate amphioxus (the least derived living relative of the chordate ancestor but only for anterior and central genes, namely Hox1 to Hox4 and Hox6. However, most of the Hox gene expression patterns in amphioxus have not been reported. To gain global insights into the evolution of Hox clusters in chordates, we investigated a more extended expression profile of amphioxus Hox genes. Results Here we report an extended expression profile of the European amphioxus Branchiostoma lanceolatum Hox genes and describe that all Hox genes, except Hox13, are expressed during development. Interestingly, we report the breaking of both spatial and temporal colinearity for at least Hox6 and Hox14, which thus have escaped from the classical Hox code concept. We show a previously unidentified Hox6 expression pattern and a faint expression for posterior Hox genes in structures such as the posterior mesoderm, notochord, and hindgut. Unexpectedly, we found that amphioxus Hox14 had the most divergent expression pattern. This gene is expressed in the anterior cerebral vesicle and pharyngeal endoderm. Amphioxus Hox14 expression represents the first report of Hox gene expression in the most anterior part of the central nervous system. Nevertheless, despite these divergent expression patterns, amphioxus Hox6 and Hox14 seem to be still regulated by retinoic acid. Conclusions Escape from colinearity by Hox genes is not unusual in either vertebrates or amphioxus and we suggest that those genes escaping from it are probably associated with the patterning of lineage-specific morphological traits, requiring the loss of those developmental

  5. Novel interactions between vertebrate Hox genes

    NARCIS (Netherlands)

    Hooiveld, MHW; Morgan, R; Rieden, PID; Houtzager, E; Pannese, M; Damen, K; Boncinelli, E; Durston, AJ

    1999-01-01

    Understanding why metazoan Hox/HOM-C genes are expressed in spatiotemporal sequences showing colinearity with their genomic sequence is a central challenge in developmental biology. Here, we studied the consequences of ectopically expressing Hox genes to investigate whether Hox-Hox interactions

  6. Hox gene cluster of the ascidian, Halocynthia roretzi, reveals multiple ancient steps of cluster disintegration during ascidian evolution.

    Science.gov (United States)

    Sekigami, Yuka; Kobayashi, Takuya; Omi, Ai; Nishitsuji, Koki; Ikuta, Tetsuro; Fujiyama, Asao; Satoh, Noriyuki; Saiga, Hidetoshi

    2017-01-01

    Hox gene clusters with at least 13 paralog group (PG) members are common in vertebrate genomes and in that of amphioxus. Ascidians, which belong to the subphylum Tunicata (Urochordata), are phylogenetically positioned between vertebrates and amphioxus, and traditionally divided into two groups: the Pleurogona and the Enterogona. An enterogonan ascidian, Ciona intestinalis ( Ci ), possesses nine Hox genes localized on two chromosomes; thus, the Hox gene cluster is disintegrated. We investigated the Hox gene cluster of a pleurogonan ascidian, Halocynthia roretzi ( Hr ) to investigate whether Hox gene cluster disintegration is common among ascidians, and if so, how such disintegration occurred during ascidian or tunicate evolution. Our phylogenetic analysis reveals that the Hr Hox gene complement comprises nine members, including one with a relatively divergent Hox homeodomain sequence. Eight of nine Hr Hox genes were orthologous to Ci-Hox1 , 2, 3, 4, 5, 10, 12 and 13. Following the phylogenetic classification into 13 PGs, we designated Hr Hox genes as Hox1, 2, 3, 4, 5, 10, 11/12/13.a , 11/12/13.b and HoxX . To address the chromosomal arrangement of the nine Hox genes, we performed two-color chromosomal fluorescent in situ hybridization, which revealed that the nine Hox genes are localized on a single chromosome in Hr , distinct from their arrangement in Ci . We further examined the order of the nine Hox genes on the chromosome by chromosome/scaffold walking. This analysis suggested a gene order of Hox1 , 11/12/13.b, 11/12/13.a, 10, 5, X, followed by either Hox4, 3, 2 or Hox2, 3, 4 on the chromosome. Based on the present results and those previously reported in Ci , we discuss the establishment of the Hox gene complement and disintegration of Hox gene clusters during the course of ascidian or tunicate evolution. The Hox gene cluster and the genome must have experienced extensive reorganization during the course of evolution from the ancestral tunicate to Hr and Ci

  7. Composition and genomic organization of arthropod Hox clusters.

    Science.gov (United States)

    Pace, Ryan M; Grbić, Miodrag; Nagy, Lisa M

    2016-01-01

    The ancestral arthropod is believed to have had a clustered arrangement of ten Hox genes. Within arthropods, Hox gene mutations result in transformation of segment identities. Despite the fact that variation in segment number/character was common in the diversification of arthropods, few examples of Hox gene gains/losses have been correlated with morphological evolution. Furthermore, a full appreciation of the variation in the genomic arrangement of Hox genes in extant arthropods has not been recognized, as genome sequences from each major arthropod clade have not been reported until recently. Initial genomic analysis of the chelicerate Tetranychus urticae suggested that loss of Hox genes and Hox gene clustering might be more common than previously assumed. To further characterize the genomic evolution of arthropod Hox genes, we compared the genomic arrangement and general characteristics of Hox genes from representative taxa from each arthropod subphylum. In agreement with others, we find arthropods generally contain ten Hox genes arranged in a common orientation in the genome, with an increasing number of sampled species missing either Hox3 or abdominal-A orthologs. The genomic clustering of Hox genes in species we surveyed varies significantly, ranging from 0.3 to 13.6 Mb. In all species sampled, arthropod Hox genes are dispersed in the genome relative to the vertebrate Mus musculus. Differences in Hox cluster size arise from variation in the number of intervening genes, intergenic spacing, and the size of introns and UTRs. In the arthropods surveyed, Hox gene duplications are rare and four microRNAs are, in general, conserved in similar genomic positions relative to the Hox genes. The tightly clustered Hox complexes found in the vertebrates are not evident within arthropods, and differential patterns of Hox gene dispersion are found throughout the arthropods. The comparative genomic data continue to support an ancestral arthropod Hox cluster of ten genes with

  8. The HOX genes are expressed, in vivo, in human tooth germs: in vitro cAMP exposure of dental pulp cells results in parallel HOX network activation and neuronal differentiation.

    Science.gov (United States)

    D'Antò, Vincenzo; Cantile, Monica; D'Armiento, Maria; Schiavo, Giulia; Spagnuolo, Gianrico; Terracciano, Luigi; Vecchione, Raffaela; Cillo, Clemente

    2006-03-01

    Homeobox-containing genes play a crucial role in odontogenesis. After the detection of Dlx and Msx genes in overlapping domains along maxillary and mandibular processes, a homeobox odontogenic code has been proposed to explain the interaction between different homeobox genes during dental lamina patterning. No role has so far been assigned to the Hox gene network in the homeobox odontogenic code due to studies on specific Hox genes and evolutionary considerations. Despite its involvement in early patterning during embryonal development, the HOX gene network, the most repeat-poor regions of the human genome, controls the phenotype identity of adult eukaryotic cells. Here, according to our results, the HOX gene network appears to be active in human tooth germs between 18 and 24 weeks of development. The immunohistochemical localization of specific HOX proteins mostly concerns the epithelial tooth germ compartment. Furthermore, only a few genes of the network are active in embryonal retromolar tissues, as well as in ectomesenchymal dental pulp cells (DPC) grown in vitro from adult human molar. Exposure of DPCs to cAMP induces the expression of from three to nine total HOX genes of the network in parallel with phenotype modifications with traits of neuronal differentiation. Our observations suggest that: (i) by combining its component genes, the HOX gene network determines the phenotype identity of epithelial and ectomesenchymal cells interacting in the generation of human tooth germ; (ii) cAMP treatment activates the HOX network and induces, in parallel, a neuronal-like phenotype in human primary ectomesenchymal dental pulp cells. 2005 Wiley-Liss, Inc.

  9. Functional comparison of the nematode Hox gene lin-39 in C. elegans and P. pacificus reveals evolutionary conservation of protein function despite divergence of primary sequences.

    Science.gov (United States)

    Grandien, K; Sommer, R J

    2001-08-15

    Hox transcription factors have been implicated in playing a central role in the evolution of animal morphology. Many studies indicate the evolutionary importance of regulatory changes in Hox genes, but little is known about the role of functional changes in Hox proteins. In the nematodes Pristionchus pacificus and Caenorhabditis elegans, developmental processes can be compared at the cellular, genetic, and molecular levels and differences in gene function can be identified. The Hox gene lin-39 is involved in the regulation of nematode vulva development. Comparison of known lin-39 mutations in P. pacificus and C. elegans revealed both conservation and changes of gene function. Here, we study evolutionary changes of lin-39 function using hybrid transgenes and site-directed mutagenesis in an in vivo assay using C. elegans lin-39 mutants. Our data show that despite the functional differences of LIN-39 between the two species, Ppa-LIN-39, when driven by Cel-lin-39 regulatory elements, can functionally replace Cel-lin-39. Furthermore, we show that the MAPK docking and phosphorylation motifs unique for Cel-LIN-39 are dispensable for Cel-lin-39 function. Therefore, the evolution of lin-39 function is driven by changes in regulatory elements rather than changes in the protein itself.

  10. Rice homeobox transcription factor HOX1a positively regulates gibberellin responses by directly suppressing EL1.

    Science.gov (United States)

    Wen, Bi-Qing; Xing, Mei-Qing; Zhang, Hua; Dai, Cheng; Xue, Hong-Wei

    2011-11-01

    Homeobox transcription factors are involved in various aspects of plant development, including maintenance of the biosynthesis and signaling pathways of different hormones. However, few direct targets of homeobox proteins have been identified. We here show that overexpression of rice homeobox gene HOX1a resulted in enhanced gibberellin (GA) response, indicating a positive effect of HOX1a in GA signaling. HOX1a is induced by GA and encodes a homeobox transcription factor with transcription repression activity. In addition, HOX1a suppresses the transcription of early flowering1 (EL1), a negative regulator of GA signaling, and further electrophoretic mobility shift assay and chromatin immunoprecipitation analysis revealed that HOX1a directly bound to the promoter region of EL1 to suppress its expression and stimulate GA signaling. These results demonstrate that HOX1a functions as a positive regulator of GA signaling by suppressing EL1, providing informative hints on the study of GA signaling. © 2011 Institute of Botany, Chinese Academy of Sciences.

  11. The role of HoxA11 and HoxA13 in the evolution of novel fin morphologies in a representative batoid (Leucoraja erinacea

    Directory of Open Access Journals (Sweden)

    Shannon N. Barry

    2017-12-01

    Full Text Available Abstract Background Batoids exhibit unique body plans with derived fin morphologies, such as the anteriorly expanded pectoral fins that fuse to the head, or distally extended anterior pelvic fin lobes used for a modified swimming technique utilized by skates (Rajidae. The little skate (Leucoraja erinacea, exhibits both of these unique fin morphologies. These fin modifications are not present in a typical shark body plan, and little is known regarding the mechanisms underlying their development. A recent study identified a novel apical ectodermal ridge (AER associated with the development of the anterior pectoral fin in the little skate, but the role of the posterior HoxA genes was not featured during skate fin development. Results We present the first evidence for HoxA expression (HoxA11 and HoxA13 in novel AER domains associated with the development of three novel fin morphologies in a representative batoid, L. erinacea. We found HoxA13 expression associated with the recently described novel AER in the anterior pectoral fin, and HoxA11 expression in a novel AER domain in the anterior pelvic fin that we describe here. We find that both HoxA11 and HoxA13 are expressed in claspers, and while HoxA11 is expressed in pelvic fins and claspers, HoxA13 is expressed exclusively in developing claspers of males. Finally, HoxA11 expression is associated with the developing fin rays in paired fins. Conclusion Overall, these results indicate that the posterior HoxA genes play an important role in the morphological evolution of paired fins in a representative batoid. These data suggest that the batoids utilize a unique Hox code, where the posterior HoxA genes exhibit distinct expression patterns that are likely associated with specification of novel fin morphologies.

  12. HOX and TALE signatures specify human stromal stem cell populations from different sources.

    Science.gov (United States)

    Picchi, Jacopo; Trombi, Luisa; Spugnesi, Laura; Barachini, Serena; Maroni, Giorgia; Brodano, Giovanni Barbanti; Boriani, Stefano; Valtieri, Mauro; Petrini, Mario; Magli, Maria Cristina

    2013-04-01

    Human stromal stem cell populations reside in different tissues and anatomical sites, however a critical question related to their efficient use in regenerative medicine is whether they exhibit equivalent biological properties. Here, we compared cellular and molecular characteristics of stromal stem cells derived from the bone marrow, at different body sites (iliac crest, sternum, and vertebrae) and other tissues (dental pulp and colon). In particular, we investigated whether homeobox genes of the HOX and TALE subfamilies might provide suitable markers to identify distinct stromal cell populations, as HOX proteins control cell positional identity and, together with their co-factors TALE, are involved in orchestrating differentiation of adult tissues. Our results show that stromal populations from different sources, although immunophenotypically similar, display distinct HOX and TALE signatures, as well as different growth and differentiation abilities. Stromal stem cells from different tissues are characterized by specific HOX profiles, differing in the number and type of active genes, as well as in their level of expression. Conversely, bone marrow-derived cell populations can be essentially distinguished for the expression levels of specific HOX members, strongly suggesting that quantitative differences in HOX activity may be crucial. Taken together, our data indicate that the HOX and TALE profiles provide positional, embryological and hierarchical identity of human stromal stem cells. Furthermore, our data suggest that cell populations derived from different body sites may not represent equivalent cell sources for cell-based therapeutical strategies for regeneration and repair of specific tissues. Copyright © 2012 Wiley Periodicals, Inc.

  13. Hox gene clusters in the Indonesian coelacanth, Latimeria menadoensis

    Science.gov (United States)

    Koh, Esther G. L.; Lam, Kevin; Christoffels, Alan; Erdmann, Mark V.; Brenner, Sydney; Venkatesh, Byrappa

    2003-01-01

    The Hox genes encode transcription factors that play a key role in specifying body plans of metazoans. They are organized into clusters that contain up to 13 paralogue group members. The complex morphology of vertebrates has been attributed to the duplication of Hox clusters during vertebrate evolution. In contrast to the single Hox cluster in the amphioxus (Branchiostoma floridae), an invertebrate-chordate, mammals have four clusters containing 39 Hox genes. Ray-finned fishes (Actinopterygii) such as zebrafish and fugu possess more than four Hox clusters. The coelacanth occupies a basal phylogenetic position among lobe-finned fishes (Sarcopterygii), which gave rise to the tetrapod lineage. The lobe fins of sarcopterygians are considered to be the evolutionary precursors of tetrapod limbs. Thus, the characterization of Hox genes in the coelacanth should provide insights into the origin of tetrapod limbs. We have cloned the complete second exon of 33 Hox genes from the Indonesian coelacanth, Latimeria menadoensis, by extensive PCR survey and genome walking. Phylogenetic analysis shows that 32 of these genes have orthologs in the four mammalian HOX clusters, including three genes (HoxA6, D1, and D8) that are absent in ray-finned fishes. The remaining coelacanth gene is an ortholog of hoxc1 found in zebrafish but absent in mammals. Our results suggest that coelacanths have four Hox clusters bearing a gene complement more similar to mammals than to ray-finned fishes, but with an additional gene, HoxC1, which has been lost during the evolution of mammals from lobe-finned fishes. PMID:12547909

  14. Extensive polycistronism and antisense transcription in the mammalian Hox clusters.

    Directory of Open Access Journals (Sweden)

    Gaëll Mainguy

    Full Text Available The Hox clusters play a crucial role in body patterning during animal development. They encode both Hox transcription factor and micro-RNA genes that are activated in a precise temporal and spatial sequence that follows their chromosomal order. These remarkable collinear properties confer functional unit status for Hox clusters. We developed the TranscriptView platform to establish high resolution transcriptional profiling and report here that transcription in the Hox clusters is far more complex than previously described in both human and mouse. Unannotated transcripts can represent up to 60% of the total transcriptional output of a cluster. In particular, we identified 14 non-coding Transcriptional Units antisense to Hox genes, 10 of which (70% have a detectable mouse homolog. Most of these Transcriptional Units in both human and mouse present conserved sizeable sequences (>40 bp overlapping Hox transcripts, suggesting that these Hox antisense transcripts are functional. Hox clusters also display at least seven polycistronic clusters, i.e., different genes being co-transcribed on long isoforms (up to 30 kb. This work provides a reevaluated framework for understanding Hox gene function and dys-function. Such extensive transcriptions may provide a structural explanation for Hox clustering.

  15. A general scenario of Hox gene inventory variation among major sarcopterygian lineages

    Directory of Open Access Journals (Sweden)

    Wang Chaolin

    2011-01-01

    Full Text Available Abstract Background Hox genes are known to play a key role in shaping the body plan of metazoans. Evolutionary dynamics of these genes is therefore essential in explaining patterns of evolutionary diversity. Among extant sarcopterygians comprising both lobe-finned fishes and tetrapods, our knowledge of the Hox genes and clusters has largely been restricted in several model organisms such as frogs, birds and mammals. Some evolutionary gaps still exist, especially for those groups with derived body morphology or occupying key positions on the tree of life, hindering our understanding of how Hox gene inventory varied along the sarcopterygian lineage. Results We determined the Hox gene inventory for six sarcopterygian groups: lungfishes, caecilians, salamanders, snakes, turtles and crocodiles by comprehensive PCR survey and genome walking. Variable Hox genes in each of the six sarcopterygian group representatives, compared to the human Hox gene inventory, were further validated for their presence/absence by PCR survey in a number of related species representing a broad evolutionary coverage of the group. Turtles, crocodiles, birds and placental mammals possess the same 39 Hox genes. HoxD12 is absent in snakes, amphibians and probably lungfishes. HoxB13 is lost in frogs and caecilians. Lobe-finned fishes, amphibians and squamate reptiles possess HoxC3. HoxC1 is only present in caecilians and lobe-finned fishes. Similar to coelacanths, lungfishes also possess HoxA14, which is only found in lobe-finned fishes to date. Our Hox gene variation data favor the lungfish-tetrapod, turtle-archosaur and frog-salamander relationships and imply that the loss of HoxD12 is not directly related to digit reduction. Conclusions Our newly determined Hox inventory data provide a more complete scenario for evolutionary dynamics of Hox genes along the sarcopterygian lineage. Limbless, worm-like caecilians and snakes possess similar Hox gene inventories to animals with

  16. Unusual Gene Order and Organization of the Sea Urchin HoxCluster

    Energy Technology Data Exchange (ETDEWEB)

    Richardson, Paul M.; Lucas, Susan; Cameron, R. Andrew; Rowen,Lee; Nesbitt, Ryan; Bloom, Scott; Rast, Jonathan P.; Berney, Kevin; Arenas-Mena, Cesar; Martinez, Pedro; Davidson, Eric H.; Peterson, KevinJ.; Hood, Leroy

    2005-05-10

    The highly consistent gene order and axial colinear expression patterns found in vertebrate hox gene clusters are less well conserved across the rest of bilaterians. We report the first deuterostome instance of an intact hox cluster with a unique gene order where the paralog groups are not expressed in a sequential manner. The finished sequence from BAC clones from the genome of the sea urchin, Strongylocentrotus purpuratus, reveals a gene order wherein the anterior genes (Hox1, Hox2 and Hox3) lie nearest the posterior genes in the cluster such that the most 3' gene is Hox5. (The gene order is : 5'-Hox1,2, 3, 11/13c, 11/13b, '11/13a, 9/10, 8, 7, 6, 5 - 3)'. The finished sequence result is corroborated by restriction mapping evidence and BAC-end scaffold analyses. Comparisons with a putative ancestral deuterostome Hox gene cluster suggest that the rearrangements leading to the sea urchin gene order were many and complex.

  17. Evolutionary conservation of regulatory elements in vertebrate HOX gene clusters

    Energy Technology Data Exchange (ETDEWEB)

    Santini, Simona; Boore, Jeffrey L.; Meyer, Axel

    2003-12-31

    Due to their high degree of conservation, comparisons of DNA sequences among evolutionarily distantly-related genomes permit to identify functional regions in noncoding DNA. Hox genes are optimal candidate sequences for comparative genome analyses, because they are extremely conserved in vertebrates and occur in clusters. We aligned (Pipmaker) the nucleotide sequences of HoxA clusters of tilapia, pufferfish, striped bass, zebrafish, horn shark, human and mouse (over 500 million years of evolutionary distance). We identified several highly conserved intergenic sequences, likely to be important in gene regulation. Only a few of these putative regulatory elements have been previously described as being involved in the regulation of Hox genes, while several others are new elements that might have regulatory functions. The majority of these newly identified putative regulatory elements contain short fragments that are almost completely conserved and are identical to known binding sites for regulatory proteins (Transfac). The conserved intergenic regions located between the most rostrally expressed genes in the developing embryo are longer and better retained through evolution. We document that presumed regulatory sequences are retained differentially in either A or A clusters resulting from a genome duplication in the fish lineage. This observation supports both the hypothesis that the conserved elements are involved in gene regulation and the Duplication-Deletion-Complementation model.

  18. Dynamic nucleosome organization at hox promoters during zebrafish embryogenesis.

    Directory of Open Access Journals (Sweden)

    Steven E Weicksel

    Full Text Available Nucleosome organization at promoter regions plays an important role in regulating gene activity. Genome-wide studies in yeast, flies, worms, mammalian embryonic stem cells and transformed cell lines have found well-positioned nucleosomes flanking a nucleosome depleted region (NDR at transcription start sites. This nucleosome arrangement depends on DNA sequence (cis-elements as well as DNA binding factors and ATP-dependent chromatin modifiers (trans-factors. However, little is understood about how the nascent embryonic genome positions nucleosomes during development. This is particularly intriguing since the embryonic genome must undergo a broad reprogramming event upon fusion of sperm and oocyte. Using four stages of early embryonic zebrafish development, we map nucleosome positions at the promoter region of 37 zebrafish hox genes. We find that nucleosome arrangement at the hox promoters is a progressive process that takes place over several stages. At stages immediately after fertilization, nucleosomes appear to be largely disordered at hox promoter regions. At stages after activation of the embryonic genome, nucleosomes are detectable at hox promoters, with positions becoming more uniform and more highly occupied. Since the genomic sequence is invariant during embryogenesis, this progressive change in nucleosome arrangement suggests that trans-factors play an important role in organizing nucleosomes during embryogenesis. Separating hox genes into expressed and non-expressed groups shows that expressed promoters have better positioned and occupied nucleosomes, as well as distinct NDRs, than non-expressed promoters. Finally, by blocking the retinoic acid-signaling pathway, we disrupt early hox gene transcription, but observe no effect on nucleosome positions, suggesting that active hox transcription is not a driving force behind the arrangement of nucleosomes at the promoters of hox genes during early development.

  19. Lampreys, the jawless vertebrates, contain only two ParaHox gene clusters.

    Science.gov (United States)

    Zhang, Huixian; Ravi, Vydianathan; Tay, Boon-Hui; Tohari, Sumanty; Pillai, Nisha E; Prasad, Aravind; Lin, Qiang; Brenner, Sydney; Venkatesh, Byrappa

    2017-08-22

    ParaHox genes ( Gsx , Pdx , and Cdx ) are an ancient family of developmental genes closely related to the Hox genes. They play critical roles in the patterning of brain and gut. The basal chordate, amphioxus, contains a single ParaHox cluster comprising one member of each family, whereas nonteleost jawed vertebrates contain four ParaHox genomic loci with six or seven ParaHox genes. Teleosts, which have experienced an additional whole-genome duplication, contain six ParaHox genomic loci with six ParaHox genes. Jawless vertebrates, represented by lampreys and hagfish, are the most ancient group of vertebrates and are crucial for understanding the origin and evolution of vertebrate gene families. We have previously shown that lampreys contain six Hox gene loci. Here we report that lampreys contain only two ParaHox gene clusters (designated as α- and β-clusters) bearing five ParaHox genes ( Gsxα , Pdxα , Cdxα , Gsxβ , and Cdxβ ). The order and orientation of the three genes in the α-cluster are identical to that of the single cluster in amphioxus. However, the orientation of Gsxβ in the β-cluster is inverted. Interestingly, Gsxβ is expressed in the eye, unlike its homologs in jawed vertebrates, which are expressed mainly in the brain. The lamprey Pdxα is expressed in the pancreas similar to jawed vertebrate Pdx genes, indicating that the pancreatic expression of Pdx was acquired before the divergence of jawless and jawed vertebrate lineages. It is likely that the lamprey Pdxα plays a crucial role in pancreas specification and insulin production similar to the Pdx of jawed vertebrates.

  20. Deregulation of HOX B13 expression in urinary bladder cancer progression.

    Science.gov (United States)

    Marra, L; Cantile, M; Scognamiglio, G; Perdonà, S; La Mantia, E; Cerrone, M; Gigantino, V; Cillo, C; Caraglia, M; Pignata, S; Facchini, G; Botti, G; Chieffi, S; Chieffi, P; Franco, R

    2013-02-01

    Urinary bladder cancer is a common malignancy in industrialized countries. More than 90% of bladder cancer originates in the transitional cells. Bladder transitional cancer prognosis is, according to the most recent definition related to the level of tumor infiltration, characterized by two main phenotypes, Non Muscle Invasive Bladder Transitional Cancer (NMIBC) and Muscle Invasive Bladder Transitional Cancer (MIBC). The genetic profile and the clinical course of the two subtypes are completely different, however among NMIBC the prognosis is not completely predictable, since 20% of the cases experience a relapse, even in the form of MIBC. It has recently been reported that the chromosomal region 12q13-15, containing crucial cancer genes such as MDM2, CDK4, GLI and an entire cluster of HOX genes, is amplified in bladder cancer. HOX genes codify for transcriptionl factor, involved in embryonal development and cancer progression, with main nuclear expression. Particularly it was also described the strong involvement of HOX B13 in several tumors of urogenital system. In this study we have been investigated, by immunohistochemisty and quantitative Real Time PCR, the HOX B13 expression in bladder cancer evolution and progression, evaluating its ability to discriminate between NMIBC and MBCI phenotypes. Cytoplasmic HOX B13 delocalization significantly relates with muscle invasion (p 0.004). In addition in the series of NMIBC nuclear HOX B13 expression loss is significantly associated to shorter disease free survival (p-value=0.038) defining a potential prognostic role. Overexpression of HOX B13 in more aggressive phenotype is also demonstrate at gene level by quantitative RT-PCR. The de-regulation and delocalization of HOX B13 in urinary bladder cancer supports again the important role of HOX genes in tumor evolution and represents a starting point to establish an integrated analysis, in which HOX genes represent important prognostic and predictive markers for bladder

  1. Are Hox genes ancestrally involved in axial patterning? Evidence from the hydrozoan Clytia hemisphaerica (Cnidaria.

    Directory of Open Access Journals (Sweden)

    Roxane Chiori

    Full Text Available BACKGROUND: The early evolution and diversification of Hox-related genes in eumetazoans has been the subject of conflicting hypotheses concerning the evolutionary conservation of their role in axial patterning and the pre-bilaterian origin of the Hox and ParaHox clusters. The diversification of Hox/ParaHox genes clearly predates the origin of bilaterians. However, the existence of a "Hox code" predating the cnidarian-bilaterian ancestor and supporting the deep homology of axes is more controversial. This assumption was mainly based on the interpretation of Hox expression data from the sea anemone, but growing evidence from other cnidarian taxa puts into question this hypothesis. METHODOLOGY/PRINCIPAL FINDINGS: Hox, ParaHox and Hox-related genes have been investigated here by phylogenetic analysis and in situ hybridisation in Clytia hemisphaerica, an hydrozoan species with medusa and polyp stages alternating in the life cycle. Our phylogenetic analyses do not support an origin of ParaHox and Hox genes by duplication of an ancestral ProtoHox cluster, and reveal a diversification of the cnidarian HOX9-14 genes into three groups called A, B, C. Among the 7 examined genes, only those belonging to the HOX9-14 and the CDX groups exhibit a restricted expression along the oral-aboral axis during development and in the planula larva, while the others are expressed in very specialised areas at the medusa stage. CONCLUSIONS/SIGNIFICANCE: Cross species comparison reveals a strong variability of gene expression along the oral-aboral axis and during the life cycle among cnidarian lineages. The most parsimonious interpretation is that the Hox code, collinearity and conservative role along the antero-posterior axis are bilaterian innovations.

  2. Spatial expression of Hox cluster genes in the ontogeny of a sea urchin

    Science.gov (United States)

    Arenas-Mena, C.; Cameron, A. R.; Davidson, E. H.

    2000-01-01

    The Hox cluster of the sea urchin Strongylocentrous purpuratus contains ten genes in a 500 kb span of the genome. Only two of these genes are expressed during embryogenesis, while all of eight genes tested are expressed during development of the adult body plan in the larval stage. We report the spatial expression during larval development of the five 'posterior' genes of the cluster: SpHox7, SpHox8, SpHox9/10, SpHox11/13a and SpHox11/13b. The five genes exhibit a dynamic, largely mesodermal program of expression. Only SpHox7 displays extensive expression within the pentameral rudiment itself. A spatially sequential and colinear arrangement of expression domains is found in the somatocoels, the paired posterior mesodermal structures that will become the adult perivisceral coeloms. No such sequential expression pattern is observed in endodermal, epidermal or neural tissues of either the larva or the presumptive juvenile sea urchin. The spatial expression patterns of the Hox genes illuminate the evolutionary process by which the pentameral echinoderm body plan emerged from a bilateral ancestor.

  3. Multiple chromosomal rearrangements structured the ancestral vertebrate Hox-bearing protochromosomes.

    Directory of Open Access Journals (Sweden)

    Vincent J Lynch

    2009-01-01

    Full Text Available While the proposal that large-scale genome expansions occurred early in vertebrate evolution is widely accepted, the exact mechanisms of the expansion--such as a single or multiple rounds of whole genome duplication, bloc chromosome duplications, large-scale individual gene duplications, or some combination of these--is unclear. Gene families with a single invertebrate member but four vertebrate members, such as the Hox clusters, provided early support for Ohno's hypothesis that two rounds of genome duplication (the 2R-model occurred in the stem lineage of extant vertebrates. However, despite extensive study, the duplication history of the Hox clusters has remained unclear, calling into question its usefulness in resolving the role of large-scale gene or genome duplications in early vertebrates. Here, we present a phylogenetic analysis of the vertebrate Hox clusters and several linked genes (the Hox "paralogon" and show that different phylogenies are obtained for Dlx and Col genes than for Hox and ErbB genes. We show that these results are robust to errors in phylogenetic inference and suggest that these competing phylogenies can be resolved if two chromosomal crossover events occurred in the ancestral vertebrate. These results resolve conflicting data on the order of Hox gene duplications and the role of genome duplication in vertebrate evolution and suggest that a period of genome reorganization occurred after genome duplications in early vertebrates.

  4. Patterning of anteroposterior body axis displayed in the expression of Hox genes in sea cucumber Apostichopus japonicus.

    Science.gov (United States)

    Kikuchi, Mani; Omori, Akihito; Kurokawa, Daisuke; Akasaka, Koji

    2015-09-01

    The presence of an anteroposterior body axis is a fundamental feature of bilateria. Within this group, echinoderms have secondarily evolved pentameral symmetric body plans. Although all echinoderms present bilaterally symmetric larval stages, they dramatically rearrange their body axis and develop a pentaradial body plan during metamorphosis. Therefore, the location of their anteroposterior body axis in adult forms remains a contentious issue. Unlike other echinoderms, sea cucumbers present an obvious anteroposterior axis not rearranged during metamorphosis, thus representing an interesting group to study their anteroposterior axis patterning. Hox genes are known to play a broadly conserved role in anteroposterior axis patterning in deuterostomes. Here, we report the expression patterns of Hox genes from early development to pentactula stage in sea cucumber. In early larval stages, five Hox genes (AjHox1, AjHox7, AjHox8, AjHox11/13a, and AjHox11/13b) were expressed sequentially along the archenteron, suggesting that the role of anteroposterior patterning of the Hox genes is conserved in bilateral larvae of echinoderms. In doliolaria and pentactula stages, eight Hox genes (AjHox1, AjHox5, AjHox7, AjHox8, AjHox9/10, AjHox11/13a, AjHox11/13b, and AjHox11/13c) were expressed sequentially along the digestive tract, following a similar expression pattern to that found in the visceral mesoderm of other bilateria. Unlike other echinoderms, pentameral expression patterns of AjHox genes were not observed in sea cucumber. Altogether, we concluded that AjHox genes are involved in the patterning of the digestive tract in both larvae and metamorphosis of sea cucumbers. In addition, the anteroposterior axis in sea cucumbers might be patterned like that of other bilateria.

  5. Reduced Abd-B Hox function during kidney development results in lineage infidelity.

    Science.gov (United States)

    Magella, Bliss; Mahoney, Robert; Adam, Mike; Potter, S Steven

    2018-06-15

    Hox genes can function as key drivers of segment identity, with Hox mutations in Drosophila often resulting in dramatic homeotic transformations. In addition, however, they can serve other essential functions. In mammals, the study of Hox gene roles in development is complicated by the presence of four Hox clusters with a total of 39 genes showing extensive functional overlap. In this study, in order to better understand shared core Hox functions, we examined kidney development in mice with frameshift mutations of multiple Abd-B type Hox genes. The resulting phenotypes included dramatically reduced branching morphogenesis of the ureteric bud, premature depletion of nephron progenitors and abnormal development of the stromal compartment. Most unexpected, however, we also observed a cellular level lineage infidelity in nephron segments. Scattered cells within the proximal tubules, for example, expressed genes normally expressed only in collecting ducts. Multiple combinations of inappropriate nephron segment specific marker expression were found. In some cases, cells within a tubule showed incorrect identity, while in other cases cells showed ambiguous character, with simultaneous expression of genes associated with more than one nephron segment. These results give evidence that Hox genes have an overlapping core function at the cellular level in driving and/or maintaining correct differentiation decisions. Copyright © 2018 Elsevier Inc. All rights reserved.

  6. Distinct regulation of c-myb gene expression by HoxA9, Meis1 and Pbx proteins in normal hematopoietic progenitors and transformed myeloid cells

    International Nuclear Information System (INIS)

    Dassé, E; Volpe, G; Walton, D S; Wilson, N; Del Pozzo, W; O'Neill, L P; Slany, R K; Frampton, J; Dumon, S

    2012-01-01

    The proto-oncogenic protein c-Myb is an essential regulator of hematopoiesis and is frequently deregulated in hematological diseases such as lymphoma and leukemia. To gain insight into the mechanisms underlying the aberrant expression of c-Myb in myeloid leukemia, we analyzed and compared c-myb gene transcriptional regulation using two cell lines modeling normal hematopoietic progenitor cells (HPCs) and transformed myelomonocytic blasts. We report that the transcription factors HoxA9, Meis1, Pbx1 and Pbx2 bind in vivo to the c-myb locus and maintain its expression through different mechanisms in HPCs and leukemic cells. Our analysis also points to a critical role for Pbx2 in deregulating c-myb expression in murine myeloid cells cotransformed by the cooperative activity of HoxA9 and Meis1. This effect is associated with an intronic positioning of epigenetic marks and RNA polymerase II binding in the orthologous region of a previously described alternative promoter for c-myb. Taken together, our results could provide a first hint to explain the abnormal expression of c-myb in leukemic cells

  7. Calcisponges have a ParaHox gene and dynamic expression of dispersed NK homeobox genes.

    Science.gov (United States)

    Fortunato, Sofia A V; Adamski, Marcin; Ramos, Olivia Mendivil; Leininger, Sven; Liu, Jing; Ferrier, David E K; Adamska, Maja

    2014-10-30

    Sponges are simple animals with few cell types, but their genomes paradoxically contain a wide variety of developmental transcription factors, including homeobox genes belonging to the Antennapedia (ANTP) class, which in bilaterians encompass Hox, ParaHox and NK genes. In the genome of the demosponge Amphimedon queenslandica, no Hox or ParaHox genes are present, but NK genes are linked in a tight cluster similar to the NK clusters of bilaterians. It has been proposed that Hox and ParaHox genes originated from NK cluster genes after divergence of sponges from the lineage leading to cnidarians and bilaterians. On the other hand, synteny analysis lends support to the notion that the absence of Hox and ParaHox genes in Amphimedon is a result of secondary loss (the ghost locus hypothesis). Here we analysed complete suites of ANTP-class homeoboxes in two calcareous sponges, Sycon ciliatum and Leucosolenia complicata. Our phylogenetic analyses demonstrate that these calcisponges possess orthologues of bilaterian NK genes (Hex, Hmx and Msx), a varying number of additional NK genes and one ParaHox gene, Cdx. Despite the generation of scaffolds spanning multiple genes, we find no evidence of clustering of Sycon NK genes. All Sycon ANTP-class genes are developmentally expressed, with patterns suggesting their involvement in cell type specification in embryos and adults, metamorphosis and body plan patterning. These results demonstrate that ParaHox genes predate the origin of sponges, thus confirming the ghost locus hypothesis, and highlight the need to analyse the genomes of multiple sponge lineages to obtain a complete picture of the ancestral composition of the first animal genome.

  8. Hox gene regulation in the central nervous system of Drosophila

    Directory of Open Access Journals (Sweden)

    Maheshwar eGummalla

    2014-04-01

    Full Text Available Hox genes specify the structures that form along the anteroposterior (AP axis of bilateria. Within the genome, they often form clusters where, remarkably enough, their position within the clusters reflects the relative positions of the structures they specify along the AP axis. This correspondence between genomic organization and gene expression pattern has been conserved through evolution and provides a unique opportunity to study how chromosomal context affects gene regulation. In Drosophila, a general rule, often called posterior dominance, states that Hox genes specifying more posterior structures repress the expression of more anterior Hox genes. This rule explains the apparent spatial complementarity of Hox gene expression patterns in Drosophila. Here we review a noticeable exception to this rule where the more-posteriorly expressed Abd-B hox gene fails to repress the more-anterior abd-A gene in cells of the central nervous system (CNS. While Abd-B is required to repress ectopic expression of abd-A in the posterior epidermis, abd-A repression in the posterior CNS is accomplished by a different mechanism that involves a large 92kb long non-coding RNA (lncRNA encoded by the intergenic region separating abd-A and Abd-B (the iab8ncRNA. Dissection of this lncRNA revealed that abd-A is repressed by the lncRNA using two redundant mechanisms. The 1st mechanism is mediated by a microRNA (mir-iab-8 encoded by intronic sequence within the large iab8-ncRNA. Meanwhile, the second mechanism seems to involve transcriptional interference by the long iab-8 ncRNA on the abd-A promoter. Recent work demonstrating CNS-specific regulation of genes by ncRNAs in Drosophila, seem to highlight a potential role for the iab-8-ncRNA in the evolution of the Drosophila hox complexes

  9. An inhibitory switch derepressed by pbx, hox, and Meis/Prep1 partners regulates DNA-binding by pbx1 and E2a-pbx1 and is dispensable for myeloid immortalization by E2a-pbx1.

    Science.gov (United States)

    Calvo, K R; Knoepfler, P; McGrath, S; Kamps, M P

    1999-12-23

    The Pbx/Exd family of homeodomain (HD) proteins contribute to the transcriptional and developmental roles of other Hox and Meis/Prep1/Hth HD proteins through heterodimer formation. E2a-Pbx1 is an oncogenic derrivative of Pbx1 produced by the t(1;19) translocation in pediatric pre-B cell acute lymphoblastic leukemia. E2a-Pbx1 heterodimerizes with Hox but not with Meis/Prep1 proteins, produces acute myeloid leukemia in mice, and blocks differentiation of cultured murine myeloid progenitors. Here, we characterize negative and positive regulatory sequences that flank the Pbx1 HD and determine their importance for myeloid immortalization by E2a-Pbx1. A 25 residue predicted alpha helix preceding the Pbx1 HD bound the HD and prevented both its binding to DNA and its ability to heterodimerize with Hox proteins. Addition of 39 residues N-terminal to this inhibitory helix exposed a Pbx dimerization interface that orchestrated cooperative DNA-binding of E2a-Pbx1 and all Pbx proteins as homodimers and heterdimers. Sequences inhibiting DNA-binding and mediating Pbx dimerization coincided with those reported to have nuclear export function. An additional 103 residues N-terminal to the Pbx dimerization interface restored heterodimerization with Hox and Meis1/Prep1 proteins. This negative switch domain - comprised of the inhibitory helix and N-terminal regions required for its partner-mediated derepression - was dispensable for myeloid immortalization by E2a-Pbx1. While stabilizing the heterodimer, the 310 helix C-terminal to the Pbx1 HD was also dispensable for the ability of E2a-Pbx1 to heterodimerize with Hox proteins and immortalize myeloblasts. Retention of myeloid immortalization by E2a-Pbx1 proteins lacking all Pbx1 sequences N- or C-terminal to the HD indicates that Hox proteins, or a yet undefined factor that binds the Pbx1 HD and derepresses DNA-binding by the HD, cooperate with E2a-Pbx1 in myeloid immortalization.

  10. Isolation of Hox cluster genes from insects reveals an accelerated sequence evolution rate.

    Directory of Open Access Journals (Sweden)

    Heike Hadrys

    Full Text Available Among gene families it is the Hox genes and among metazoan animals it is the insects (Hexapoda that have attracted particular attention for studying the evolution of development. Surprisingly though, no Hox genes have been isolated from 26 out of 35 insect orders yet, and the existing sequences derive mainly from only two orders (61% from Hymenoptera and 22% from Diptera. We have designed insect specific primers and isolated 37 new partial homeobox sequences of Hox cluster genes (lab, pb, Hox3, ftz, Antp, Scr, abd-a, Abd-B, Dfd, and Ubx from six insect orders, which are crucial to insect phylogenetics. These new gene sequences provide a first step towards comparative Hox gene studies in insects. Furthermore, comparative distance analyses of homeobox sequences reveal a correlation between gene divergence rate and species radiation success with insects showing the highest rate of homeobox sequence evolution.

  11. Hox gene function and interaction in the milkweed bug Oncopeltus fasciatus (Hemiptera).

    Science.gov (United States)

    Angelini, David R; Liu, Paul Z; Hughes, Cynthia L; Kaufman, Thomas C

    2005-11-15

    Studies in genetic model organisms such as Drosophila have demonstrated that the homeotic complex (Hox) genes impart segmental identity during embryogenesis. Comparative studies in a wide range of other insect taxa have shown that the Hox genes are expressed in largely conserved domains along the anterior-posterior body axis, but whether they are performing the same functions in different insects is an open question. Most of the Hox genes have been studied functionally in only a few holometabolous insects that undergo metamorphosis. Thus, it is unclear how the Hox genes are functioning in the majority of direct-developing insects and other arthropods. To address this question, we used a combination of RNAi and in situ hybridization to reveal the expression, functions, and regulatory interactions of the Hox genes in the milkweed bug Oncopeltus fasciatus. Our results reveal many similarities and some interesting differences compared to Drosophila. We find that the gene Antennapedia is required for the identity of all three thoracic segments, while Ultrabithorax, abdominal-A and Abdominal-B cooperate to pattern the abdomen. The three abdominal genes exhibit posterior prevalence like in Drosophila, but apparently via some post-transcriptional mechanism. The functions of the head genes proboscipedia, Deformed, and Sex combs reduced were shown previously, and here we find that the complex temporal expression of pb in the labium is like that of other insects, but its regulatory relationship with Scr is unique. Overall, our data reveal that the evolution of insect Hox genes has included many small changes within general conservation of expression and function, and that the milkweed bug provides a useful model for understanding the roles of Hox genes in a direct-developing insect.

  12. Evolution of the snake body form reveals homoplasy in amniote Hox gene function.

    Science.gov (United States)

    Head, Jason J; Polly, P David

    2015-04-02

    Hox genes regulate regionalization of the axial skeleton in vertebrates, and changes in their expression have been proposed to be a fundamental mechanism driving the evolution of new body forms. The origin of the snake-like body form, with its deregionalized pre-cloacal axial skeleton, has been explained as either homogenization of Hox gene expression domains, or retention of standard vertebrate Hox domains with alteration of downstream expression that suppresses development of distinct regions. Both models assume a highly regionalized ancestor, but the extent of deregionalization of the primaxial domain (vertebrae, dorsal ribs) of the skeleton in snake-like body forms has never been analysed. Here we combine geometric morphometrics and maximum-likelihood analysis to show that the pre-cloacal primaxial domain of elongate, limb-reduced lizards and snakes is not deregionalized compared with limbed taxa, and that the phylogenetic structure of primaxial morphology in reptiles does not support a loss of regionalization in the evolution of snakes. We demonstrate that morphometric regional boundaries correspond to mapped gene expression domains in snakes, suggesting that their primaxial domain is patterned by a normally functional Hox code. Comparison of primaxial osteology in fossil and modern amniotes with Hox gene distributions within Amniota indicates that a functional, sequentially expressed Hox code patterned a subtle morphological gradient along the anterior-posterior axis in stem members of amniote clades and extant lizards, including snakes. The highly regionalized skeletons of extant archosaurs and mammals result from independent evolution in the Hox code and do not represent ancestral conditions for clades with snake-like body forms. The developmental origin of snakes is best explained by decoupling of the primaxial and abaxial domains and by increases in somite number, not by changes in the function of primaxial Hox genes.

  13. Classification of proteins: available structural space for molecular modeling.

    Science.gov (United States)

    Andreeva, Antonina

    2012-01-01

    The wealth of available protein structural data provides unprecedented opportunity to study and better understand the underlying principles of protein folding and protein structure evolution. A key to achieving this lies in the ability to analyse these data and to organize them in a coherent classification scheme. Over the past years several protein classifications have been developed that aim to group proteins based on their structural relationships. Some of these classification schemes explore the concept of structural neighbourhood (structural continuum), whereas other utilize the notion of protein evolution and thus provide a discrete rather than continuum view of protein structure space. This chapter presents a strategy for classification of proteins with known three-dimensional structure. Steps in the classification process along with basic definitions are introduced. Examples illustrating some fundamental concepts of protein folding and evolution with a special focus on the exceptions to them are presented.

  14. The vertebrate Hox gene regulatory network for hindbrain segmentation: Evolution and diversification: Coupling of a Hox gene regulatory network to hindbrain segmentation is an ancient trait originating at the base of vertebrates.

    Science.gov (United States)

    Parker, Hugo J; Bronner, Marianne E; Krumlauf, Robb

    2016-06-01

    Hindbrain development is orchestrated by a vertebrate gene regulatory network that generates segmental patterning along the anterior-posterior axis via Hox genes. Here, we review analyses of vertebrate and invertebrate chordate models that inform upon the evolutionary origin and diversification of this network. Evidence from the sea lamprey reveals that the hindbrain regulatory network generates rhombomeric compartments with segmental Hox expression and an underlying Hox code. We infer that this basal feature was present in ancestral vertebrates and, as an evolutionarily constrained developmental state, is fundamentally important for patterning of the vertebrate hindbrain across diverse lineages. Despite the common ground plan, vertebrates exhibit neuroanatomical diversity in lineage-specific patterns, with different vertebrates revealing variations of Hox expression in the hindbrain that could underlie this diversification. Invertebrate chordates lack hindbrain segmentation but exhibit some conserved aspects of this network, with retinoic acid signaling playing a role in establishing nested domains of Hox expression. © 2016 WILEY Periodicals, Inc.

  15. Cdx and Hox genes differentially regulate posterior axial growth in mammalian embryos

    NARCIS (Netherlands)

    Young, Teddy; Rowland, Jennifer Elizabeth; van de Ven, Cesca; Bialecka, Monika; Novoa, Ana; Carapuco, Marta; van Nes, Johan; de Graaff, Wim; Duluc, Isabelle; Freund, Jean-Noël; Beck, Felix; Mallo, Moises; Deschamps, Jacqueline

    2009-01-01

    Hox and Cdx transcription factors regulate embryonic positional identities. Cdx mutant mice display posterior body truncations of the axial skeleton, neuraxis, and caudal urorectal structures. We show that trunk Hox genes stimulate axial extension, as they can largely rescue these Cdx mutant

  16. Feature generation and representations for protein-protein interaction classification.

    Science.gov (United States)

    Lan, Man; Tan, Chew Lim; Su, Jian

    2009-10-01

    Automatic detecting protein-protein interaction (PPI) relevant articles is a crucial step for large-scale biological database curation. The previous work adopted POS tagging, shallow parsing and sentence splitting techniques, but they achieved worse performance than the simple bag-of-words representation. In this paper, we generated and investigated multiple types of feature representations in order to further improve the performance of PPI text classification task. Besides the traditional domain-independent bag-of-words approach and the term weighting methods, we also explored other domain-dependent features, i.e. protein-protein interaction trigger keywords, protein named entities and the advanced ways of incorporating Natural Language Processing (NLP) output. The integration of these multiple features has been evaluated on the BioCreAtIvE II corpus. The experimental results showed that both the advanced way of using NLP output and the integration of bag-of-words and NLP output improved the performance of text classification. Specifically, in comparison with the best performance achieved in the BioCreAtIvE II IAS, the feature-level and classifier-level integration of multiple features improved the performance of classification 2.71% and 3.95%, respectively.

  17. Protein structure: geometry, topology and classification

    Energy Technology Data Exchange (ETDEWEB)

    Taylor, William R.; May, Alex C.W.; Brown, Nigel P.; Aszodi, Andras [Division of Mathematical Biology, National Institute for Medical Research, London (United Kingdom)

    2001-04-01

    The structural principals of proteins are reviewed and analysed from a geometric perspective with a view to revealing the underlying regularities in their construction. Computer methods for the automatic comparison and classification of these structures are then reviewed with an analysis of the statistical significance of comparing different shapes. Following an analysis of the current state of the classification of proteins, more abstract geometric and topological representations are explored, including the occurrence of knotted topologies. The review concludes with a consideration of the origin of higher-level symmetries in protein structure. (author)

  18. Disruption of Hox9,10,11 function results in cellular level lineage infidelity in the kidney.

    Science.gov (United States)

    Drake, Keri A; Adam, Mike; Mahoney, Robert; Potter, S Steven

    2018-04-20

    Hox genes are important regulators of development. The 39 mammalian Hox genes have considerable functional overlap, greatly confounding their study. In this report, we generated mice with multiple combinations of paralogous and flanking Abd-B Hox gene mutations to investigate functional redundancies in kidney development. The resulting mice developed a number of kidney abnormalities, including hypoplasia, agenesis, and severe cysts, with distinct Hox functions observed in early metanephric kidney formation and nephron progenitor maintenance. Most surprising, however, was that extensive removal of Hox shared function in these kidneys resulted in cellular level lineage infidelity. Strikingly, mutant nephron tubules consisted of intermixed cells with proximal tubule, loop of Henle, and collecting duct identities, with some single cells expressing markers associated with more than one nephron segment. These results indicate that Hox genes are required for proper lineage selection/maintenance and full repression of genes involved in cell fate restriction in the developing kidney.

  19. Rice HOX12 Regulates Panicle Exsertion by Directly Modulating the Expression of ELONGATED UPPERMOST INTERNODE1[OPEN

    Science.gov (United States)

    Gao, Shaopei; Fang, Jun; Xu, Fan; Wang, Wei

    2016-01-01

    Bioactive gibberellins (GAs) are key endogenous regulators of plant growth. Previous work identified ELONGATED UPPERMOST INTERNODE1 (EUI1) as a GA-deactivating enzyme that plays an important role in panicle exsertion from the flag leaf sheath in rice (Oryza sativa). However, the mechanism that regulates EUI1 activity during development is still largely unexplored. In this study, we identified the dominant panicle enclosure mutant regulator of eui1 (ree1-D), whose phenotype is caused by the activation of the homeodomain-leucine zipper transcription factor HOX12. Diminished HOX12 expression by RNA interference enhanced panicle exsertion, mimicking the eui1 phenotype. HOX12 knockdown plants contain higher levels of the major biologically active GAs (such as GA1 and GA4) than the wild type. The expression of EUI1 is elevated in the ree1-D mutant but reduced in HOX12 knockdown plants. Interestingly, both HOX12 and EUI1 are predominantly expressed in panicles, where GA4 is highly accumulated. Yeast one-hybrid, electrophoretic mobility shift assay, and chromatin immunoprecipitation analyses showed that HOX12 physically interacts with the EUI1 promoter both in vitro and in vivo. Furthermore, plants overexpressing HOX12 in the eui1 mutant background retained the elongated uppermost internode phenotype. These results indicate that HOX12 acts directly through EUI1 to regulate panicle exsertion in rice. PMID:26977084

  20. Hox genes require homothorax and extradenticle for body wall identity specification but not for appendage identity specification during metamorphosis of Tribolium castaneum.

    Science.gov (United States)

    Smith, Frank W; Jockusch, Elizabeth L

    2014-11-01

    The establishment of segment identity is a key developmental process that allows for divergence along the anteroposterior body axis in arthropods. In Drosophila, the identity of a segment is determined by the complement of Hox genes it expresses. In many contexts, Hox transcription factors require the protein products of extradenticle (exd) and homothorax (hth) as cofactors to perform their identity specification functions. In holometabolous insects, segment identity may be specified twice, during embryogenesis and metamorphosis. To glean insight into the relationship between embryonic and metamorphic segmental identity specification, we have compared these processes in the flour beetle Tribolium castaneum, which develops ventral appendages during embryogenesis that later metamorphose into adult appendages with distinct morphologies. At metamorphosis, comparisons of RNAi phenotypes indicate that Hox genes function jointly with Tc-hth and Tc-exd to specify several region-specific aspects of the adult body wall. On the other hand, Hox genes specify appendage identities along the anteroposterior axis independently of Tc-hth/Tc-exd and Tc-hth/Tc-exd specify proximal vs. distal identity within appendages independently of Hox genes during this stage. During embryogenesis, Tc-hth and Tc-exd play a broad role in the segmentation process and are required for specification of body wall identities in the thorax; however, contrasting with results from other species, we did not obtain homeotic transformations of embryonic appendages in response to Tc-hth or Tc-exd RNAi. In general, the homeotic effects of interference with the function of Hox genes and Tc-hth/Tc-exd during metamorphosis did not match predictions based on embryonic roles of these genes. Comparing metamorphic patterning in T. castaneum to embryonic and post-embryonic development in hemimetabolous insects suggests that holometabolous metamorphosis combines patterning processes of both late embryogenesis and

  1. Regulation of number and size of digits by posterior Hox genes: a dose-dependent mechanism with potential evolutionary implications.

    Science.gov (United States)

    Zákány, J; Fromental-Ramain, C; Warot, X; Duboule, D

    1997-12-09

    The proper development of digits, in tetrapods, requires the activity of several genes of the HoxA and HoxD homeobox gene complexes. By using a variety of loss-of-function alleles involving the five Hox genes that have been described to affect digit patterning, we report here that the group 11, 12, and 13 genes control both the size and number of murine digits in a dose-dependent fashion, rather than through a Hox code involving differential qualitative functions. A similar dose-response is observed in the morphogenesis of the penian bone, the baculum, which further suggests that digits and external genitalia share this genetic control mechanism. A progressive reduction in the dose of Hox gene products led first to ectrodactyly, then to olygodactyly and adactyly. Interestingly, this transition between the pentadactyl to the adactyl formula went through a step of polydactyly. We propose that in the distal appendage of polydactylous short-digited ancestral tetrapods, such as Acanthostega, the HoxA complex was predominantly active. Subsequent recruitment of the HoxD complex contributed to both reductions in digit number and increase in digit length. Thus, transition through a polydactylous limb before reaching and stabilizing the pentadactyl pattern may have relied, at least in part, on asynchronous and independent changes in the regulation of HoxA and HoxD gene complexes.

  2. A saturation screen for cis-acting regulatory DNA in the Hox genes of Ciona intestinalis

    Energy Technology Data Exchange (ETDEWEB)

    Keys, David N.; Lee, Byung-in; Di Gregorio, Anna; Harafuji, Naoe; Detter, Chris; Wang, Mei; Kahsai, Orsalem; Ahn, Sylvia; Arellano, Andre; Zhang, Quin; Trong, Stephan; Doyle, Sharon A.; Satoh, Noriyuki; Satou, Yutaka; Saiga, Hidetoshi; Christian, Allen; Rokhsar, Dan; Hawkins, Trevor L.; Levine, Mike; Richardson, Paul

    2005-01-05

    A screen for the systematic identification of cis-regulatory elements within large (>100 kb) genomic domains containing Hox genes was performed by using the basal chordate Ciona intestinalis. Randomly generated DNA fragments from bacterial artificial chromosomes containing two clusters of Hox genes were inserted into a vector upstream of a minimal promoter and lacZ reporter gene. A total of 222 resultant fusion genes were separately electroporated into fertilized eggs, and their regulatory activities were monitored in larvae. In sum, 21 separable cis-regulatory elements were found. These include eight Hox linked domains that drive expression in nested anterior-posterior domains of ectodermally derived tissues. In addition to vertebrate-like CNS regulation, the discovery of cis-regulatory domains that drive epidermal transcription suggests that C. intestinalis has arthropod-like Hox patterning in the epidermis.

  3. Adaptive evolution of the Hox gene family for development in bats and dolphins.

    Directory of Open Access Journals (Sweden)

    Lu Liang

    Full Text Available Bats and cetaceans (i.e., whales, dolphins, porpoises are two kinds of mammals with unique locomotive styles and occupy novel niches. Bats are the only mammals capable of sustained flight in the sky, while cetaceans have returned to the aquatic environment and are specialized for swimming. Associated with these novel adaptations to their environment, various development changes have occurred to their body plans and associated structures. Given the importance of Hox genes in many aspects of embryonic development, we conducted an analysis of the coding regions of all Hox gene family members from bats (represented by Pteropus vampyrus, Pteropus alecto, Myotis lucifugus and Myotis davidii and cetaceans (represented by Tursiops truncatus for adaptive evolution using the available draft genome sequences. Differences in the selective pressures acting on many Hox genes in bats and cetaceans were found compared to other mammals. Positive selection, however, was not found to act on any of the Hox genes in the common ancestor of bats and only upon Hoxb9 in cetaceans. PCR amplification data from additional bat and cetacean species, and application of the branch-site test 2, showed that the Hoxb2 gene within bats had significant evidence of positive selection. Thus, our study, with genomic and newly sequenced Hox genes, identifies two candidate Hox genes that may be closely linked with developmental changes in bats and cetaceans, such as those associated with the pancreatic, neuronal, thymus shape and forelimb. In addition, the difference in our results from the genome-wide scan and newly sequenced data reveals that great care must be taken in interpreting results from draft genome data from a limited number of species, and deep genetic sampling of a particular clade is a powerful tool for generating complementary data to address this limitation.

  4. Deep time perspective on turtle neck evolution: chasing the Hox code by vertebral morphology.

    Science.gov (United States)

    Böhmer, Christine; Werneburg, Ingmar

    2017-08-21

    The unparalleled ability of turtle neck retraction is possible in three different modes, which characterize stem turtles, living side-necked (Pleurodira), and hidden-necked (Cryptodira) turtles, respectively. Despite the conservatism in vertebral count among turtles, there is significant functional and morphological regionalization in the cervical vertebral column. Since Hox genes play a fundamental role in determining the differentiation in vertebra morphology and based on our reconstruction of evolutionary genetics in deep time, we hypothesize genetic differences among the turtle groups and between turtles and other land vertebrates. We correlated anterior Hox gene expression and the quantifiable shape of the vertebrae to investigate the morphological modularity in the neck across living and extinct turtles. This permitted the reconstruction of the hypothetical ancestral Hox code pattern of the whole turtle clade. The scenario of the evolution of axial patterning in turtles indicates shifts in the spatial expression of HoxA-5 in relation to the reduction of cervical ribs in modern turtles and of HoxB-5 linked with a lower morphological differentiation between the anterior cervical vertebrae observed in cryptodirans. By comparison with the mammalian pattern, we illustrate how the fixed count of eight cervical vertebrae in turtles resulted from the emergence of the unique turtle shell.

  5. SCOWLP classification: Structural comparison and analysis of protein binding regions

    Directory of Open Access Journals (Sweden)

    Anders Gerd

    2008-01-01

    Full Text Available Abstract Background Detailed information about protein interactions is critical for our understanding of the principles governing protein recognition mechanisms. The structures of many proteins have been experimentally determined in complex with different ligands bound either in the same or different binding regions. Thus, the structural interactome requires the development of tools to classify protein binding regions. A proper classification may provide a general view of the regions that a protein uses to bind others and also facilitate a detailed comparative analysis of the interacting information for specific protein binding regions at atomic level. Such classification might be of potential use for deciphering protein interaction networks, understanding protein function, rational engineering and design. Description Protein binding regions (PBRs might be ideally described as well-defined separated regions that share no interacting residues one another. However, PBRs are often irregular, discontinuous and can share a wide range of interacting residues among them. The criteria to define an individual binding region can be often arbitrary and may differ from other binding regions within a protein family. Therefore, the rational behind protein interface classification should aim to fulfil the requirements of the analysis to be performed. We extract detailed interaction information of protein domains, peptides and interfacial solvent from the SCOWLP database and we classify the PBRs of each domain family. For this purpose, we define a similarity index based on the overlapping of interacting residues mapped in pair-wise structural alignments. We perform our classification with agglomerative hierarchical clustering using the complete-linkage method. Our classification is calculated at different similarity cut-offs to allow flexibility in the analysis of PBRs, feature especially interesting for those protein families with conflictive binding regions

  6. Large scale genomic reorganization of topological domains at the HoxD locus.

    Science.gov (United States)

    Fabre, Pierre J; Leleu, Marion; Mormann, Benjamin H; Lopez-Delisle, Lucille; Noordermeer, Daan; Beccari, Leonardo; Duboule, Denis

    2017-08-07

    The transcriptional activation of HoxD genes during mammalian limb development involves dynamic interactions with two topologically associating domains (TADs) flanking the HoxD cluster. In particular, the activation of the most posterior HoxD genes in developing digits is controlled by regulatory elements located in the centromeric TAD (C-DOM) through long-range contacts. To assess the structure-function relationships underlying such interactions, we measured compaction levels and TAD discreteness using a combination of chromosome conformation capture (4C-seq) and DNA FISH. We assessed the robustness of the TAD architecture by using a series of genomic deletions and inversions that impact the integrity of this chromatin domain and that remodel long-range contacts. We report multi-partite associations between HoxD genes and up to three enhancers. We find that the loss of native chromatin topology leads to the remodeling of TAD structure following distinct parameters. Our results reveal that the recomposition of TAD architectures after large genomic re-arrangements is dependent on a boundary-selection mechanism in which CTCF mediates the gating of long-range contacts in combination with genomic distance and sequence specificity. Accordingly, the building of a recomposed TAD at this locus depends on distinct functional and constitutive parameters.

  7. Transporter taxonomy - a comparison of different transport protein classification schemes.

    Science.gov (United States)

    Viereck, Michael; Gaulton, Anna; Digles, Daniela; Ecker, Gerhard F

    2014-06-01

    Currently, there are more than 800 well characterized human membrane transport proteins (including channels and transporters) and there are estimates that about 10% (approx. 2000) of all human genes are related to transport. Membrane transport proteins are of interest as potential drug targets, for drug delivery, and as a cause of side effects and drug–drug interactions. In light of the development of Open PHACTS, which provides an open pharmacological space, we analyzed selected membrane transport protein classification schemes (Transporter Classification Database, ChEMBL, IUPHAR/BPS Guide to Pharmacology, and Gene Ontology) for their ability to serve as a basis for pharmacology driven protein classification. A comparison of these membrane transport protein classification schemes by using a set of clinically relevant transporters as use-case reveals the strengths and weaknesses of the different taxonomy approaches.

  8. Regulation of number and size of digits by posterior Hox genes: A dose-dependent mechanism with potential evolutionary implications

    OpenAIRE

    Zákány, József; Fromental-Ramain, Catherine; Warot, Xavier; Duboule, Denis

    1997-01-01

    The proper development of digits, in tetrapods, requires the activity of several genes of the HoxA and HoxD homeobox gene complexes. By using a variety of loss-of-function alleles involving the five Hox genes that have been described to affect digit patterning, we report here that the group 11, 12, and 13 genes control both the size and number of murine digits in a dose-dependent fashion, rather than through a Hox code involving differential qualitative functions. A similar dose–response is o...

  9. Semi-Supervised Learning for Classification of Protein Sequence Data

    Directory of Open Access Journals (Sweden)

    Brian R. King

    2008-01-01

    Full Text Available Protein sequence data continue to become available at an exponential rate. Annotation of functional and structural attributes of these data lags far behind, with only a small fraction of the data understood and labeled by experimental methods. Classification methods that are based on semi-supervised learning can increase the overall accuracy of classifying partly labeled data in many domains, but very few methods exist that have shown their effect on protein sequence classification. We show how proven methods from text classification can be applied to protein sequence data, as we consider both existing and novel extensions to the basic methods, and demonstrate restrictions and differences that must be considered. We demonstrate comparative results against the transductive support vector machine, and show superior results on the most difficult classification problems. Our results show that large repositories of unlabeled protein sequence data can indeed be used to improve predictive performance, particularly in situations where there are fewer labeled protein sequences available, and/or the data are highly unbalanced in nature.

  10. Dataset of the HOX1 gene sequences of the wheat polyploids and their diploid relatives

    Directory of Open Access Journals (Sweden)

    Andrey B. Shcherban

    2018-02-01

    Full Text Available The TaHOX-1 gene of common wheat Triticum aestivum L. (BAD-genome encodes transcription factor (HD-Zip I which is characterized by the presence of a DNA-binding homeodomain (HD with an adjacent Leucine zipper (LZ motif. This gene can play a role in adapting plant to a variety of abiotic stresses, such as drought, cold, salinity etc., which strongly affect wheat production. However, it's both functional role in stress resistance and divergence during wheat evolution has not yet been elucidated. This data in brief article is associated with the research paper “Structural and functional divergence of homoeologous copies of the TaHOX-1 gene in polyploid wheats and their diploid ancestors”. The data set represents a recent survey of the primary HOX-1 gene sequences isolated from the first wheat allotetraploids (BA-genome and their corresponding Triticum and Aegilops diploid relatives. Specifically, we provide detailed information about the HOX-1 nucleotide sequences of the promoter region and both nucleotide and amino acid sequences of the gene. The sequencing data used here is available at DDBJ/EMBL/GenBank under the accession numbers MG000630-MG000698. Keywords: Wheat, Polyploid, HOX-1 gene, Homeodomain, Transcription factor, Promoter, Triticum, Aegilops

  11. A framework for classification of prokaryotic protein kinases.

    Directory of Open Access Journals (Sweden)

    Nidhi Tyagi

    Full Text Available BACKGROUND: Overwhelming majority of the Serine/Threonine protein kinases identified by gleaning archaeal and eubacterial genomes could not be classified into any of the well known Hanks and Hunter subfamilies of protein kinases. This is owing to the development of Hanks and Hunter classification scheme based on eukaryotic protein kinases which are highly divergent from their prokaryotic homologues. A large dataset of prokaryotic Serine/Threonine protein kinases recognized from genomes of prokaryotes have been used to develop a classification framework for prokaryotic Ser/Thr protein kinases. METHODOLOGY/PRINCIPAL FINDINGS: We have used traditional sequence alignment and phylogenetic approaches and clustered the prokaryotic kinases which represent 72 subfamilies with at least 4 members in each. Such a clustering enables classification of prokaryotic Ser/Thr kinases and it can be used as a framework to classify newly identified prokaryotic Ser/Thr kinases. After series of searches in a comprehensive sequence database we recognized that 38 subfamilies of prokaryotic protein kinases are associated to a specific taxonomic level. For example 4, 6 and 3 subfamilies have been identified that are currently specific to phylum proteobacteria, cyanobacteria and actinobacteria respectively. Similarly subfamilies which are specific to an order, sub-order, class, family and genus have also been identified. In addition to these, we also identify organism-diverse subfamilies. Members of these clusters are from organisms of different taxonomic levels, such as archaea, bacteria, eukaryotes and viruses. CONCLUSION/SIGNIFICANCE: Interestingly, occurrence of several taxonomic level specific subfamilies of prokaryotic kinases contrasts with classification of eukaryotic protein kinases in which most of the popular subfamilies of eukaryotic protein kinases occur diversely in several eukaryotes. Many prokaryotic Ser/Thr kinases exhibit a wide variety of modular

  12. Genetic interactions between Shox2 and Hox genes during the regional growth and development of the mouse limb.

    Science.gov (United States)

    Neufeld, Stanley J; Wang, Fan; Cobb, John

    2014-11-01

    The growth and development of the vertebrate limb relies on homeobox genes of the Hox and Shox families, with their independent mutation often giving dose-dependent effects. Here we investigate whether Shox2 and Hox genes function together during mouse limb development by modulating their relative dosage and examining the limb for nonadditive effects on growth. Using double mRNA fluorescence in situ hybridization (FISH) in single embryos, we first show that Shox2 and Hox genes have associated spatial expression dynamics, with Shox2 expression restricted to the proximal limb along with Hoxd9 and Hoxa11 expression, juxtaposing the distal expression of Hoxa13 and Hoxd13. By generating mice with all possible dosage combinations of mutant Shox2 alleles and HoxA/D cluster deletions, we then show that their coordinated proximal limb expression is critical to generate normally proportioned limb segments. These epistatic interactions tune limb length, where Shox2 underexpression enhances, and Shox2 overexpression suppresses, Hox-mutant phenotypes. Disruption of either Shox2 or Hox genes leads to a similar reduction in Runx2 expression in the developing humerus, suggesting their concerted action drives cartilage maturation during normal development. While we furthermore provide evidence that Hox gene function influences Shox2 expression, this regulation is limited in extent and is unlikely on its own to be a major explanation for their genetic interaction. Given the similar effect of human SHOX mutations on regional limb growth, Shox and Hox genes may generally function as genetic interaction partners during the growth and development of the proximal vertebrate limb. Copyright © 2014 by the Genetics Society of America.

  13. Evolutionary changes of Hox genes and relevant regulatory factors provide novel insights into mammalian morphological modifications.

    Science.gov (United States)

    Li, Kui; Sun, Xiaohui; Chen, Meixiu; Sun, Yingying; Tian, Ran; Wang, Zhengfei; Xu, Shixia; Yang, Guang

    2018-01-01

    The diversity of body plans of mammals accelerates the innovation of lifestyles and the extensive adaptation to different habitats, including terrestrial, aerial and aquatic habitats. However, the genetic basis of those phenotypic modifications, which have occurred during mammalian evolution, remains poorly explored. In the present study, we synthetically surveyed the evolutionary pattern of Hox clusters that played a powerful role in the morphogenesis along the head-tail axis of animal embryos and the main regulatory factors (Mll, Bmi1 and E2f6) that control the expression of Hox genes. A deflected density of repetitive elements and lineage-specific radical mutations of Mll have been determined in marine mammals with morphological changes, suggesting that evolutionary changes may alter Hox gene expression in these lineages, leading to the morphological modification of these lineages. Although no positive selection was detected at certain ancestor nodes of lineages, the increased ω values of Hox genes implied the relaxation of functional constraints of these genes during the mammalian evolutionary process. More importantly, 49 positively-selected sites were identified in mammalian lineages with phenotypic modifications, indicating adaptive evolution acting on Hox genes and regulatory factors. In addition, 3 parallel amino acid substitutions in some Hox genes were examined in marine mammals, which might be responsible for their streamlined body. © 2017 The Authors. Integrative Zoology published by International Society of Zoological Sciences, Institute of Zoology/Chinese Academy of Sciences and John Wiley & Sons Australia, Ltd.

  14. Protein classification using modified n-grams and skip-grams.

    Science.gov (United States)

    Islam, S M Ashiqul; Heil, Benjamin J; Kearney, Christopher Michel; Baker, Erich J

    2018-05-01

    Classification by supervised machine learning greatly facilitates the annotation of protein characteristics from their primary sequence. However, the feature generation step in this process requires detailed knowledge of attributes used to classify the proteins. Lack of this knowledge risks the selection of irrelevant features, resulting in a faulty model. In this study, we introduce a supervised protein classification method with a novel means of automating the work-intensive feature generation step via a Natural Language Processing (NLP)-dependent model, using a modified combination of n-grams and skip-grams (m-NGSG). A meta-comparison of cross-validation accuracy with twelve training datasets from nine different published studies demonstrates a consistent increase in accuracy of m-NGSG when compared to contemporary classification and feature generation models. We expect this model to accelerate the classification of proteins from primary sequence data and increase the accessibility of protein characteristic prediction to a broader range of scientists. m-NGSG is freely available at Bitbucket: https://bitbucket.org/sm_islam/mngsg/src. A web server is available at watson.ecs.baylor.edu/ngsg. erich_baker@baylor.edu. Supplementary data are available at Bioinformatics online.

  15. Transient Responses to NOTCH and TLX1/HOX11 Inhibition in T-Cell Acute Lymphoblastic Leukemia/Lymphoma

    OpenAIRE

    Rakowski, Lesley A.; Lehotzky, Erica A.; Chiang, Mark Y.

    2011-01-01

    To improve the treatment strategies of T-cell acute lymphoblastic leukemia/lymphoma (T-ALL), further efforts are needed to identify therapeutic targets. Dysregulated expression of HOX-type transcription factors occurs in 30-40% of cases of T-ALL. TLX1/HOX11 is the prototypical HOX-type transcription factor. TLX1 may be an attractive therapeutic target because mice that are deficient in TLX1 are healthy. To test this possibility, we developed a conditional doxycycline-regulated mouse model of ...

  16. Clustering of Tissue-Specific Sub-TADs Accompanies the Regulation of HoxA Genes in Developing Limbs

    Science.gov (United States)

    Berlivet, Soizik; Paquette, Denis; Dumouchel, Annie; Langlais, David; Dostie, Josée; Kmita, Marie

    2013-01-01

    HoxA genes exhibit central roles during development and causal mutations have been found in several human syndromes including limb malformation. Despite their importance, information on how these genes are regulated is lacking. Here, we report on the first identification of bona fide transcriptional enhancers controlling HoxA genes in developing limbs and show that these enhancers are grouped into distinct topological domains at the sub-megabase scale (sub-TADs). We provide evidence that target genes and regulatory elements physically interact with each other through contacts between sub-TADs rather than by the formation of discreet “DNA loops”. Interestingly, there is no obvious relationship between the functional domains of the enhancers within the limb and how they are partitioned among the topological domains, suggesting that sub-TAD formation does not rely on enhancer activity. Moreover, we show that suppressing the transcriptional activity of enhancers does not abrogate their contacts with HoxA genes. Based on these data, we propose a model whereby chromatin architecture defines the functional landscapes of enhancers. From an evolutionary standpoint, our data points to the convergent evolution of HoxA and HoxD regulation in the fin-to-limb transition, one of the major morphological innovations in vertebrates. PMID:24385922

  17. Clustering of tissue-specific sub-TADs accompanies the regulation of HoxA genes in developing limbs.

    Directory of Open Access Journals (Sweden)

    Soizik Berlivet

    Full Text Available HoxA genes exhibit central roles during development and causal mutations have been found in several human syndromes including limb malformation. Despite their importance, information on how these genes are regulated is lacking. Here, we report on the first identification of bona fide transcriptional enhancers controlling HoxA genes in developing limbs and show that these enhancers are grouped into distinct topological domains at the sub-megabase scale (sub-TADs. We provide evidence that target genes and regulatory elements physically interact with each other through contacts between sub-TADs rather than by the formation of discreet "DNA loops". Interestingly, there is no obvious relationship between the functional domains of the enhancers within the limb and how they are partitioned among the topological domains, suggesting that sub-TAD formation does not rely on enhancer activity. Moreover, we show that suppressing the transcriptional activity of enhancers does not abrogate their contacts with HoxA genes. Based on these data, we propose a model whereby chromatin architecture defines the functional landscapes of enhancers. From an evolutionary standpoint, our data points to the convergent evolution of HoxA and HoxD regulation in the fin-to-limb transition, one of the major morphological innovations in vertebrates.

  18. At the base of colinear Hox gene expression : cis-features and trans-factors orchestrating the initial phase of Hox cluster activation

    NARCIS (Netherlands)

    Neijts, Roel; Deschamps, Jacqueline

    2017-01-01

    Hox genes are crucial players in the generation and pattering of the vertebrate trunk and posterior body during embryogenesis. Their initial expression takes place shortly after the establishment of the primitive streak, in the posterior-most part of the mouse embryo and is a determinant step for

  19. HoxD10 gene delivery using adenovirus/adeno-associate hybrid virus inhibits the proliferation and tumorigenicity of GH4 pituitary lactotrope tumor cells

    International Nuclear Information System (INIS)

    Cho, Mi Ae; Yashar, Parham; Kim, Suk Kyoung; Noh, Taewoong; Gillam, Mary P.; Lee, Eun Jig; Jameson, J. Larry

    2008-01-01

    Prolactinoma is one of the most common types of pituitary adenoma. It has been reported that a variety of growth factors and cytokines regulating cell growth and angiogenesis play an important role in the growth of prolactinoma. HoxD10 has been shown to impair endothelial cell migration, block angiogenesis, and maintain a differentiated phenotype of cells. We investigated whether HoxD10 gene delivery could inhibit the growth of prolactinoma. Rat GH4 lactotrope tumor cells were infected with adenovirus/adeno-associated virus (Ad/AAV) hybrid vectors carrying the mouse HoxD10 gene (Hyb-HoxD10) or the β-galactosidase gene (Hyb-Gal). Hyb-HoxD10 expression inhibited GH4 cell proliferation in vitro. The expression of FGF-2 and cyclin D2 was inhibited in GH4 cells infected with Hyb-HoxD10. GH4 cells transduced with Hyb-HoxD10 did not form tumors in nude mice. These results indicate that the delivery of HoxD10 could potentially inhibit the growth of PRL-secreting tumors. This approach may be a useful tool for targeted therapy of prolactinoma and other neoplasms

  20. Benchmarking protein classification algorithms via supervised cross-validation

    NARCIS (Netherlands)

    Kertész-Farkas, A.; Dhir, S.; Sonego, P.; Pacurar, M.; Netoteia, S.; Nijveen, H.; Kuzniar, A.; Leunissen, J.A.M.; Kocsor, A.; Pongor, S.

    2008-01-01

    Development and testing of protein classification algorithms are hampered by the fact that the protein universe is characterized by groups vastly different in the number of members, in average protein size, similarity within group, etc. Datasets based on traditional cross-validation (k-fold,

  1. Hox gene colinear expression in the avian medulla oblongata is correlated with pseudorhombomeric domains.

    Science.gov (United States)

    Marín, Faustino; Aroca, Pilar; Puelles, Luis

    2008-11-15

    The medulla oblongata (or caudal hindbrain) is not overtly segmented, since it lacks observable interrhombomeric boundaries. However, quail-chick fate maps showed that it is formed by 5 pseudorhombomeres (r7-r11) which were empirically found to be delimited consistently at planes crossing through adjacent somites (Cambronero and Puelles, 2000). We aimed to reexamine the possible segmentation or rostrocaudal regionalisation of this brain region attending to molecular criteria. To this end, we studied the expression of Hox genes from groups 3 to 7 correlative to the differentiating nuclei of the medulla oblongata. Our results show that these genes are differentially expressed in the mature medulla oblongata, displaying instances of typical antero-posterior (3' to 5') Hox colinearity. The different sensory and motor columns, as well as the reticular formation, appear rostrocaudally regionalised according to spaced steps in their Hox expression pattern. The anterior limits of the respective expression domains largely fit boundaries defined between the experimental pseudorhombomeres. Therefore the medulla oblongata shows a Hox-related rostrocaudal molecular regionalisation comparable to that found among rhombomeres, and numerically consistent with the pseudorhombomere list. This suggests that medullary pseudorhombomeres share some AP patterning mechanisms with the rhombomeres present in the rostral, overtly-segmented hindbrain, irrespective of variant boundary properties.

  2. Combinatorial action of Grainyhead, Extradenticle and Notch in regulating Hox mediated apoptosis in Drosophila larval CNS.

    Science.gov (United States)

    Khandelwal, Risha; Sipani, Rashmi; Govinda Rajan, Sriivatsan; Kumar, Raviranjan; Joshi, Rohit

    2017-10-01

    Hox mediated neuroblast apoptosis is a prevalent way to pattern larval central nervous system (CNS) by different Hox genes, but the mechanism of this apoptosis is not understood. Our studies with Abdominal-A (Abd-A) mediated larval neuroblast (pNB) apoptosis suggests that AbdA, its cofactor Extradenticle (Exd), a helix-loop-helix transcription factor Grainyhead (Grh), and Notch signaling transcriptionally contribute to expression of RHG family of apoptotic genes. We find that Grh, AbdA, and Exd function together at multiple motifs on the apoptotic enhancer. In vivo mutagenesis of these motifs suggest that they are important for the maintenance of the activity of the enhancer rather than its initiation. We also find that Exd function is independent of its known partner homothorax in this apoptosis. We extend some of our findings to Deformed expressing region of sub-esophageal ganglia where pNBs undergo a similar Hox dependent apoptosis. We propose a mechanism where common players like Exd-Grh-Notch work with different Hox genes through region specific enhancers to pattern respective segments of larval central nervous system.

  3. Efficient Feature Selection and Classification of Protein Sequence Data in Bioinformatics

    Science.gov (United States)

    Faye, Ibrahima; Samir, Brahim Belhaouari; Md Said, Abas

    2014-01-01

    Bioinformatics has been an emerging area of research for the last three decades. The ultimate aims of bioinformatics were to store and manage the biological data, and develop and analyze computational tools to enhance their understanding. The size of data accumulated under various sequencing projects is increasing exponentially, which presents difficulties for the experimental methods. To reduce the gap between newly sequenced protein and proteins with known functions, many computational techniques involving classification and clustering algorithms were proposed in the past. The classification of protein sequences into existing superfamilies is helpful in predicting the structure and function of large amount of newly discovered proteins. The existing classification results are unsatisfactory due to a huge size of features obtained through various feature encoding methods. In this work, a statistical metric-based feature selection technique has been proposed in order to reduce the size of the extracted feature vector. The proposed method of protein classification shows significant improvement in terms of performance measure metrics: accuracy, sensitivity, specificity, recall, F-measure, and so forth. PMID:25045727

  4. Birth and upgrowth of the Hox topological domains during evolution

    NARCIS (Netherlands)

    Deschamps, Jacqueline

    The recently discovered chromatin compartments called topologically associating domains (TADs) are essential for the three-dimensional organization of regulatory interactions driving gene expression. A new study documents the emergence of a TAD flanking the amphioxus Hox cluster, prefiguring the

  5. Birth and upgrowth of the Hox topological domains during evolution

    NARCIS (Netherlands)

    Deschamps, J.

    2016-01-01

    The recently discovered chromatin compartments called topologically associating domains (TADs) are essential for the three-dimensional organization of regulatory interactions driving gene expression. A new study documents the emergence of a TAD flanking the amphioxus Hox cluster, prefiguring the

  6. 3D complex: a structural classification of protein complexes.

    Directory of Open Access Journals (Sweden)

    Emmanuel D Levy

    2006-11-01

    Full Text Available Most of the proteins in a cell assemble into complexes to carry out their function. It is therefore crucial to understand the physicochemical properties as well as the evolution of interactions between proteins. The Protein Data Bank represents an important source of information for such studies, because more than half of the structures are homo- or heteromeric protein complexes. Here we propose the first hierarchical classification of whole protein complexes of known 3-D structure, based on representing their fundamental structural features as a graph. This classification provides the first overview of all the complexes in the Protein Data Bank and allows nonredundant sets to be derived at different levels of detail. This reveals that between one-half and two-thirds of known structures are multimeric, depending on the level of redundancy accepted. We also analyse the structures in terms of the topological arrangement of their subunits and find that they form a small number of arrangements compared with all theoretically possible ones. This is because most complexes contain four subunits or less, and the large majority are homomeric. In addition, there is a strong tendency for symmetry in complexes, even for heteromeric complexes. Finally, through comparison of Biological Units in the Protein Data Bank with the Protein Quaternary Structure database, we identified many possible errors in quaternary structure assignments. Our classification, available as a database and Web server at http://www.3Dcomplex.org, will be a starting point for future work aimed at understanding the structure and evolution of protein complexes.

  7. Molecular integration of HoxB4 and STAT3 for self-renewal of hematopoietic stem cells: a model of molecular convergence for stemness.

    Science.gov (United States)

    Hong, Sung-Hyun; Yang, Seung-Jip; Kim, Tae-Min; Shim, Jae-Seung; Lee, Ho-Sun; Lee, Ga-Young; Park, Bo-Bae; Nam, Suk Woo; Ryoo, Zae Young; Oh, Il-Hoan

    2014-05-01

    The upregulation of HoxB4 promotes self-renewal of hematopoietic stem cells (HSCs) without overriding the normal stem cell pool size. A similar enhancement of HSC self-renewal occurs when signal transducer and activator of transcription 3 (STAT3) is activated in HSCs. In this study, to gain insight into the functional organization of individual transcription factors (TFs) that have similar effects on HSCs, we investigated the molecular interplay between HoxB4 and STAT3 in the regulation of HSC self-renewal. We found that while STAT3-C or HoxB4 similarly enhanced the in vitro self-renewal and in vivo repopulating activities of HSCs, simultaneous transduction of both TFs did not have additive effects, indicating their functional redundancy in HSCs. In addition, activation of STAT3 did not cause changes in the expression levels of HoxB4. In contrast, the inhibition of STAT3 activity in HoxB4-overexpressing hematopoietic cells significantly abrogated the enhancing effects of HoxB4, and the upregulation of HoxB4 caused a ligand-independent Tyr-phosphorylation of STAT3. Microarray analysis revealed a significant overlap of the transcriptomes regulated by STAT3 and HoxB4 in undifferentiated hematopoietic cells. Moreover, a gene set enrichment analysis showed significant overlap in the candidate TFs that can recapitulate the transcriptional changes induced by HoxB4 or STAT3. Interestingly, among these common TFs were the pluripotency-related genes Oct-4 and Nanog. These results indicate that tissue-specific TFs regulating HSC self-renewal are functionally organized to play an equivalent role in transcription and provide insights into the functional convergence of multiple entries of TFs toward a conserved transcription program for the stem cell state. © 2014 AlphaMed Press.

  8. Improving Classification of Protein Interaction Articles Using Context Similarity-Based Feature Selection.

    Science.gov (United States)

    Chen, Yifei; Sun, Yuxing; Han, Bing-Qing

    2015-01-01

    Protein interaction article classification is a text classification task in the biological domain to determine which articles describe protein-protein interactions. Since the feature space in text classification is high-dimensional, feature selection is widely used for reducing the dimensionality of features to speed up computation without sacrificing classification performance. Many existing feature selection methods are based on the statistical measure of document frequency and term frequency. One potential drawback of these methods is that they treat features separately. Hence, first we design a similarity measure between the context information to take word cooccurrences and phrase chunks around the features into account. Then we introduce the similarity of context information to the importance measure of the features to substitute the document and term frequency. Hence we propose new context similarity-based feature selection methods. Their performance is evaluated on two protein interaction article collections and compared against the frequency-based methods. The experimental results reveal that the context similarity-based methods perform better in terms of the F1 measure and the dimension reduction rate. Benefiting from the context information surrounding the features, the proposed methods can select distinctive features effectively for protein interaction article classification.

  9. HOX Gene Promoter Prediction and Inter-genomic Comparison: An Evo-Devo Study

    Directory of Open Access Journals (Sweden)

    Marla A. Endriga

    2010-10-01

    Full Text Available Homeobox genes direct the anterior-posterior axis of the body plan in eukaryotic organisms. Promoter regions upstream of the Hox genes jumpstart the transcription process. CpG islands found within the promoter regions can cause silencing of these promoters. The locations of the promoter regions and the CpG islands of Homeo sapiens sapiens (human, Pan troglodytes (chimpanzee, Mus musculus (mouse, and Rattus norvegicus (brown rat are compared and related to the possible influence on the specification of the mammalian body plan. The sequence of each gene in Hox clusters A-D of the mammals considered were retrieved from Ensembl and locations of promoter regions and CpG islands predicted using Exon Finder. The predicted promoter sequences were confirmed via BLAST and verified against the Eukaryotic Promoter Database. The significance of the locations was determined using the Kruskal-Wallis test. Among the four clusters, only promoter locations in cluster B showed significant difference. HOX B genes have been linked with the control of genes that direct the development of axial morphology, particularly of the vertebral column bones. The magnitude of variation among the body plans of closely-related species can thus be partially attributed to the promoter kind, location and number, and gene inactivation via CpG methylation.

  10. A three-way approach for protein function classification.

    Directory of Open Access Journals (Sweden)

    Hafeez Ur Rehman

    Full Text Available The knowledge of protein functions plays an essential role in understanding biological cells and has a significant impact on human life in areas such as personalized medicine, better crops and improved therapeutic interventions. Due to expense and inherent difficulty of biological experiments, intelligent methods are generally relied upon for automatic assignment of functions to proteins. The technological advancements in the field of biology are improving our understanding of biological processes and are regularly resulting in new features and characteristics that better describe the role of proteins. It is inevitable to neglect and overlook these anticipated features in designing more effective classification techniques. A key issue in this context, that is not being sufficiently addressed, is how to build effective classification models and approaches for protein function prediction by incorporating and taking advantage from the ever evolving biological information. In this article, we propose a three-way decision making approach which provides provisions for seeking and incorporating future information. We considered probabilistic rough sets based models such as Game-Theoretic Rough Sets (GTRS and Information-Theoretic Rough Sets (ITRS for inducing three-way decisions. An architecture of protein functions classification with probabilistic rough sets based three-way decisions is proposed and explained. Experiments are carried out on Saccharomyces cerevisiae species dataset obtained from Uniprot database with the corresponding functional classes extracted from the Gene Ontology (GO database. The results indicate that as the level of biological information increases, the number of deferred cases are reduced while maintaining similar level of accuracy.

  11. miR-7 and miR-218 epigenetically control tumor suppressor genes RASSF1A and Claudin-6 by targeting HoxB3 in breast cancer

    International Nuclear Information System (INIS)

    Li, Qiaoyan; Zhu, Fufan; Chen, Puxiang

    2012-01-01

    Highlights: ► Both miR-7 and miR-218 down-regulates HoxB3 expression by targeting the 3′-UTR of HoxB3 mRNA. ► A reverse correlation between the levels of endogenous miR-7, miR218 and HoxB3 expression. ► Epigenetic changes involve in the reactivation of HoxB3. ► Both miRNAs inhibits the cell cycle and clone formation of breast cancer cells. -- Abstract: Many microRNAs have been implicated as key regulators of cellular growth and differentiation and have been found to dysregulate proliferation in human tumors, including breast cancer. Cancer-linked microRNAs also alter the epigenetic landscape by way of DNA methylation and post-translational modifications of histones. Aberrations in Hox gene expression are important for oncogene or tumor suppressor during abnormal development and malignancy. Although recent studies suggest that HoxB3 is critical in breast cancer, the putative role(s) of microRNAs impinging on HoxB3 is not yet fully understood. In this study, we found that the expression levels of miR-7 and miR-218 were strongly and reversely associated with HoxB3 expression. Stable overexpression of miR-7 and miR-218 was accompanied by reactivation of tumor suppressor genes including RASSF1A and Claudin-6 by means of epigenetic switches in DNA methylation and histone modification, giving rise to inhibition of the cell cycle and clone formation of breast cancer cells. The current study provides a novel link between overexpression of collinear Hox genes and multiple microRNAs in human breast malignancy.

  12. Conservation of ParaHox genes' function in patterning of the digestive tract of the marine gastropod Gibbula varia

    Directory of Open Access Journals (Sweden)

    Steiner Gerhard

    2010-07-01

    Full Text Available Abstract Background Presence of all three ParaHox genes has been described in deuterostomes and lophotrochozoans, but to date one of these three genes, Xlox has not been reported from any ecdysozoan taxa and both Xlox and Gsx are absent in nematodes. There is evidence that the ParaHox genes were ancestrally a single chromosomal cluster. Colinear expression of the ParaHox genes in anterior, middle, and posterior tissues of several species studied so far suggest that these genes may be responsible for axial patterning of the digestive tract. So far, there are no data on expression of these genes in molluscs. Results We isolated the complete coding sequences of the three Gibbula varia ParaHox genes, and then tested their expression in larval and postlarval development. In Gibbula varia, the ParaHox genes participate in patterning of the digestive tract and are expressed in some cells of the neuroectoderm. The expression of these genes coincides with the gradual formation of the gut in the larva. Gva-Gsx patterns potential neural precursors of cerebral ganglia as well as of the apical sensory organ. During larval development this gene is involved in the formation of the mouth and during postlarval development it is expressed in the precursor cells involved in secretion of the radula, the odontoblasts. Gva-Xolx and Gva-Cdx are involved in gut patterning in the middle and posterior parts of digestive tract, respectively. Both genes are expressed in some ventral neuroectodermal cells; however the expression of Gva-Cdx fades in later larval stages while the expression of Gva-Xolx in these cells persists. Conclusions In Gibbula varia the ParaHox genes are expressed during anterior-posterior patterning of the digestive system. This colinearity is not easy to spot during early larval stages because the differentiated endothelial cells within the yolk permanently migrate to their destinations in the gut. After torsion, Gsx patterns the mouth and foregut

  13. Rotiferan Hox genes give new insights into the evolution of metazoan bodyplans

    DEFF Research Database (Denmark)

    Andreas, Fröbius C.; Funch, Peter

    2017-01-01

    The phylum Rotifera consists of minuscule, nonsegmented animals with a unique body plan and an unresolved phylogenetic position. The presence of pharyngeal articulated jaws supports an inclusion in Gnathifera nested in the Spiralia. Comparison of Hox genes, involved in animal body plan patterning...

  14. Structural classification of proteins using texture descriptors extracted from the cellular automata image.

    Science.gov (United States)

    Kavianpour, Hamidreza; Vasighi, Mahdi

    2017-02-01

    Nowadays, having knowledge about cellular attributes of proteins has an important role in pharmacy, medical science and molecular biology. These attributes are closely correlated with the function and three-dimensional structure of proteins. Knowledge of protein structural class is used by various methods for better understanding the protein functionality and folding patterns. Computational methods and intelligence systems can have an important role in performing structural classification of proteins. Most of protein sequences are saved in databanks as characters and strings and a numerical representation is essential for applying machine learning methods. In this work, a binary representation of protein sequences is introduced based on reduced amino acids alphabets according to surrounding hydrophobicity index. Many important features which are hidden in these long binary sequences can be clearly displayed through their cellular automata images. The extracted features from these images are used to build a classification model by support vector machine. Comparing to previous studies on the several benchmark datasets, the promising classification rates obtained by tenfold cross-validation imply that the current approach can help in revealing some inherent features deeply hidden in protein sequences and improve the quality of predicting protein structural class.

  15. HOx Radical Behavior in Urban, Biogenic and Mixed Environments

    Science.gov (United States)

    Cantrell, C. A.; Mauldin, L.; Schardt, N.; Mukherjee, A. D.

    2014-12-01

    The importance of HOx radicals in tropospheric chemistry is well-recognized. These roles include control of the lifetimes of a wide variety of trace gases, and control of photochemical ozone formation. The continued advance in understanding comes from laboratory investigations and field observations especially as part of comprehensive measurement campaigns. We participated in two recent observational campaigns aboard the NSF/NCAR C-130 aircraft platform: NOMADSS (Nitrogen, Oxidants, Mercury and Aerosol Distributions, Sources and Sinks) and FRAPPE (Front Range Atmospheric Pollution and Photochemistry Experiment). During these studies, a wide varieties of air masses were sampled ranging from fresh urban to rural both without and without biogenic influence to marine, and including the impacts of emissions from oil and gas extraction and animal production. Among the wide variety of parameters and species related to tropospheric chemistry that were measured, our group made observations of HOx and related species: OH, HO2, HO2+RO2, H2SO4, and stabilized Criegee intermediates (sCIs) using selected ion chemical ionization mass spectrometry. The paper discusses the functional dependence of these species on other measures of the chemical environment (e.g. NO, VOCs, j-values) as well as comparison of model estimates with the observations.

  16. Modeling the oxidative capacity of the atmosphere of the south coast air basin of California. 2. HOx radical production.

    Science.gov (United States)

    Griffin, Robert J

    2004-02-01

    The production of HOx radicals in the South Coast Air Basin of California is investigated during the smog episode of September 9, 1993 using the California Institute of Technology (CIT) air-quality model. Sources of HOx(hydroxyl, hydroperoxy, and organic peroxy radicals) incorporated into the associated gas-phase chemical mechanism include the combination of excited-state singlet oxygen (formed from ozone (O3) photolysis (hv)) with water, the photolysis of nitrous acid, hydrogen peroxide (H2O2), and carbonyl compounds (formaldehyde (HCHO) or higher aldehydes and ketones), the consumption of aldehydes and alkenes (ALK) by the nitrate radical, and the consumption of alkenes by O3 and the oxygen atom (O). At a given time or location for surface cells and vertical averages, each route of HOx formation may be the greatest contributor to overall formation except HCHO-hv, H2O2-hv, and ALK-O, the latter two of which are insignificant pathways in general. The contribution of the ALK-O3 pathway is dependent on the stoichiometric yield of OH, but this pathway, at least for the studied smog episode, may not be as generally significant as previous research suggests. Future emissions scenarios yield lower total HOx production rates and a shift in the relative importance of individual pathways.

  17. Classification of protein-protein interaction full-text documents using text and citation network features.

    Science.gov (United States)

    Kolchinsky, Artemy; Abi-Haidar, Alaa; Kaur, Jasleen; Hamed, Ahmed Abdeen; Rocha, Luis M

    2010-01-01

    We participated (as Team 9) in the Article Classification Task of the Biocreative II.5 Challenge: binary classification of full-text documents relevant for protein-protein interaction. We used two distinct classifiers for the online and offline challenges: 1) the lightweight Variable Trigonometric Threshold (VTT) linear classifier we successfully introduced in BioCreative 2 for binary classification of abstracts and 2) a novel Naive Bayes classifier using features from the citation network of the relevant literature. We supplemented the supplied training data with full-text documents from the MIPS database. The lightweight VTT classifier was very competitive in this new full-text scenario: it was a top-performing submission in this task, taking into account the rank product of the Area Under the interpolated precision and recall Curve, Accuracy, Balanced F-Score, and Matthew's Correlation Coefficient performance measures. The novel citation network classifier for the biomedical text mining domain, while not a top performing classifier in the challenge, performed above the central tendency of all submissions, and therefore indicates a promising new avenue to investigate further in bibliome informatics.

  18. Wavelet images and Chou's pseudo amino acid composition for protein classification.

    Science.gov (United States)

    Nanni, Loris; Brahnam, Sheryl; Lumini, Alessandra

    2012-08-01

    The last decade has seen an explosion in the collection of protein data. To actualize the potential offered by this wealth of data, it is important to develop machine systems capable of classifying and extracting features from proteins. Reliable machine systems for protein classification offer many benefits, including the promise of finding novel drugs and vaccines. In developing our system, we analyze and compare several feature extraction methods used in protein classification that are based on the calculation of texture descriptors starting from a wavelet representation of the protein. We then feed these texture-based representations of the protein into an Adaboost ensemble of neural network or a support vector machine classifier. In addition, we perform experiments that combine our feature extraction methods with a standard method that is based on the Chou's pseudo amino acid composition. Using several datasets, we show that our best approach outperforms standard methods. The Matlab code of the proposed protein descriptors is available at http://bias.csr.unibo.it/nanni/wave.rar .

  19. The homeobox BcHOX8 gene in Botrytis cinerea regulates vegetative growth and morphology.

    Directory of Open Access Journals (Sweden)

    Zsuzsanna Antal

    Full Text Available Filamentous growth and the capacity at producing conidia are two critical aspects of most fungal life cycles, including that of many plant or animal pathogens. Here, we report on the identification of a homeobox transcription factor encoding gene that plays a role in these two particular aspects of the development of the phytopathogenic fungus Botrytis cinerea. Deletion of the BcHOX8 gene in both the B. cinerea B05-10 and T4 strains causes similar phenotypes, among which a curved, arabesque-like, hyphal growth on hydrophobic surfaces; the mutants were hence named Arabesque. Expression of the BcHOX8 gene is higher in conidia and infection cushions than in developing appressorium or mycelium. In the Arabesque mutants, colony growth rate is reduced and abnormal infection cushions are produced. Asexual reproduction is also affected with abnormal conidiophore being formed, strongly reduced conidia production and dramatic changes in conidial morphology. Finally, the mutation affects the fungus ability to efficiently colonize different host plants. Analysis of the B. cinerea genome shows that BcHOX8 is one member of a nine putative homeobox genes family. Available gene expression data suggest that these genes are functional and sequence comparisons indicate that two of them would be specific to B. cinerea and its close relative Sclerotinia sclerotiorum.

  20. Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

    KAUST Repository

    Najibi, Seyed Morteza; Maadooliat, Mehdi; Zhou, Lan; Huang, Jianhua Z.; Gao, Xin

    2017-01-01

    Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.

  1. Protein Structure Classification and Loop Modeling Using Multiple Ramachandran Distributions

    KAUST Repository

    Najibi, Seyed Morteza

    2017-02-08

    Recently, the study of protein structures using angular representations has attracted much attention among structural biologists. The main challenge is how to efficiently model the continuous conformational space of the protein structures based on the differences and similarities between different Ramachandran plots. Despite the presence of statistical methods for modeling angular data of proteins, there is still a substantial need for more sophisticated and faster statistical tools to model the large-scale circular datasets. To address this need, we have developed a nonparametric method for collective estimation of multiple bivariate density functions for a collection of populations of protein backbone angles. The proposed method takes into account the circular nature of the angular data using trigonometric spline which is more efficient compared to existing methods. This collective density estimation approach is widely applicable when there is a need to estimate multiple density functions from different populations with common features. Moreover, the coefficients of adaptive basis expansion for the fitted densities provide a low-dimensional representation that is useful for visualization, clustering, and classification of the densities. The proposed method provides a novel and unique perspective to two important and challenging problems in protein structure research: structure-based protein classification and angular-sampling-based protein loop structure prediction.

  2. Role of Cdx and Hox genes in posterior axial extension in the mouse

    NARCIS (Netherlands)

    Young, T.

    2009-01-01

    Hox and Cdx genes are phylogenetically related transcription factor-encoding genes that control positional tissue identity during embryonic develop- ment. In addition, mutations impairing Cdx activity in mice elicit poste- rior body truncations, affecting the axial skeleton, the neuraxis and cau-

  3. Elman RNN based classification of proteins sequences on account of their mutual information.

    Science.gov (United States)

    Mishra, Pooja; Nath Pandey, Paras

    2012-10-21

    In the present work we have employed the method of estimating residue correlation within the protein sequences, by using the mutual information (MI) of adjacent residues, based on structural and solvent accessibility properties of amino acids. The long range correlation between nonadjacent residues is improved by constructing a mutual information vector (MIV) for a single protein sequence, like this each protein sequence is associated with its corresponding MIVs. These MIVs are given to Elman RNN to obtain the classification of protein sequences. The modeling power of MIV was shown to be significantly better, giving a new approach towards alignment free classification of protein sequences. We also conclude that sequence structural and solvent accessible property based MIVs are better predictor. Copyright © 2012 Elsevier Ltd. All rights reserved.

  4. Accumulation of transposable elements in Hox gene clusters during adaptive radiation of Anolis lizards.

    Science.gov (United States)

    Feiner, Nathalie

    2016-10-12

    Transposable elements (TEs) are DNA sequences that can insert elsewhere in the genome and modify genome structure and gene regulation. The role of TEs in evolution is contentious. One hypothesis posits that TE activity generates genomic incompatibilities that can cause reproductive isolation between incipient species. This predicts that TEs will accumulate during speciation events. Here, I tested the prediction that extant lineages with a relatively high rate of speciation have a high number of TEs in their genomes. I sequenced and analysed the TE content of a marker genomic region (Hox clusters) in Anolis lizards, a classic case of an adaptive radiation. Unlike other vertebrates, including closely related lizards, Anolis lizards have high numbers of TEs in their Hox clusters, genomic regions that regulate development of the morphological adaptations that characterize habitat specialists in these lizards. Following a burst of TE activity in the lineage leading to extant Anolis, TEs have continued to accumulate during or after speciation events, resulting in a positive relationship between TE density and lineage speciation rate. These results are consistent with the prediction that TE activity contributes to adaptive radiation by promoting speciation. Although there was no evidence that TE density per se is associated with ecological morphology, the activity of TEs in Hox clusters could have been a rich source for phenotypic variation that may have facilitated the rapid parallel morphological adaptation to microhabitats seen in extant Anolis lizards. © 2016 The Author(s).

  5. Hox code in embryos of Chinese soft-shelled turtle Pelodiscus sinensis correlates with the evolutionary innovation in the turtle.

    Science.gov (United States)

    Ohya, Yoshie Kawashima; Kuraku, Shigehiro; Kuratani, Shigeru

    2005-03-15

    Turtles have the most unusual body plan of the amniotes, with a dorsal shell consisting of modified ribs. Because this morphological change in the ribs can be described as an axial-level specific alteration, the evolution of the turtle carapace should depend on changes in the Hox code. To identify turtle-specific changes in developmental patterns, we cloned several Hox genes from the Chinese soft-shelled turtle, Pelodiscus sinensis, examined their expression patterns during embryogenesis, and compared them with those of chicken and mouse embryos. We detected possibly turtle-specific derived traits in Hoxc-6 expression, which is restricted to the paraxial part of the embryo; in the expression of Hoxa-5 and Hoxb-5, the transcripts of which were detected only at the cervical level; and in Hoxc-8 and Hoxa-7 expression, which is shifted anteriorly relative to that of the other two amniote groups. From the known functions of the Hox orthologs in model animals, these P. sinensis-specific changes apparently correlate with specializations in the turtle-specific body plan. Copyright 2005 Wiley-Liss, Inc.

  6. Differential Expression of Hox and Notch Genes in Larval and Adult Stages of Echinococcus granulosus.

    Science.gov (United States)

    Dezaki, Ebrahim Saedi; Yaghoobi, Mohammad Mehdi; Taheri, Elham; Almani, Pooya Ghaseminejad; Tohidi, Farideh; Gottstein, Bruno; Harandi, Majid Fasihi

    2016-10-01

    This investigation aimed to evaluate the differential expression of HoxB7 and notch genes in different developmental stages of Echinococcus granulosus sensu stricto. The expression of HoxB7 gene was observed at all developmental stages. Nevertheless, significant fold differences in the expression level was documented in the juvenile worm with 3 or more proglottids, the germinal layer from infected sheep, and the adult worm from an experimentally infected dog. The notch gene was expressed at all developmental stages of E. granulosus ; however, the fold difference was significantly increased at the microcysts in monophasic culture medium and the germinal layer of infected sheep in comparison with other stages. The findings demonstrated that the 2 aforementioned genes evaluated in the present study were differentially expressed at different developmental stages of the parasite and may contribute to some important biological processes of E. granulosus .

  7. Representation Learning for Class C G Protein-Coupled Receptors Classification

    Directory of Open Access Journals (Sweden)

    Raúl Cruz-Barbosa

    2018-03-01

    Full Text Available G protein-coupled receptors (GPCRs are integral cell membrane proteins of relevance for pharmacology. The complete tertiary structure including both extracellular and transmembrane domains has not been determined for any member of class C GPCRs. An alternative way to work on GPCR structural models is the investigation of their functionality through the analysis of their primary structure. For this, sequence representation is a key factor for the GPCRs’ classification context, where usually, feature engineering is carried out. In this paper, we propose the use of representation learning to acquire the features that best represent the class C GPCR sequences and at the same time to obtain a model for classification automatically. Deep learning methods in conjunction with amino acid physicochemical property indices are then used for this purpose. Experimental results assessed by the classification accuracy, Matthews’ correlation coefficient and the balanced error rate show that using a hydrophobicity index and a restricted Boltzmann machine (RBM can achieve performance results (accuracy of 92.9% similar to those reported in the literature. As a second proposal, we combine two or more physicochemical property indices instead of only one as the input for a deep architecture in order to add information from the sequences. Experimental results show that using three hydrophobicity-related index combinations helps to improve the classification performance (accuracy of 94.1% of an RBM better than those reported in the literature for class C GPCRs without using feature selection methods.

  8. Protein structure database search and evolutionary classification.

    Science.gov (United States)

    Yang, Jinn-Moon; Tung, Chi-Hua

    2006-01-01

    As more protein structures become available and structural genomics efforts provide structural models in a genome-wide strategy, there is a growing need for fast and accurate methods for discovering homologous proteins and evolutionary classifications of newly determined structures. We have developed 3D-BLAST, in part, to address these issues. 3D-BLAST is as fast as BLAST and calculates the statistical significance (E-value) of an alignment to indicate the reliability of the prediction. Using this method, we first identified 23 states of the structural alphabet that represent pattern profiles of the backbone fragments and then used them to represent protein structure databases as structural alphabet sequence databases (SADB). Our method enhanced BLAST as a search method, using a new structural alphabet substitution matrix (SASM) to find the longest common substructures with high-scoring structured segment pairs from an SADB database. Using personal computers with Intel Pentium4 (2.8 GHz) processors, our method searched more than 10 000 protein structures in 1.3 s and achieved a good agreement with search results from detailed structure alignment methods. [3D-BLAST is available at http://3d-blast.life.nctu.edu.tw].

  9. Cross-over between discrete and continuous protein structure space: insights into automatic classification and networks of protein structures.

    Directory of Open Access Journals (Sweden)

    Alberto Pascual-García

    2009-03-01

    Full Text Available Structural classifications of proteins assume the existence of the fold, which is an intrinsic equivalence class of protein domains. Here, we test in which conditions such an equivalence class is compatible with objective similarity measures. We base our analysis on the transitive property of the equivalence relationship, requiring that similarity of A with B and B with C implies that A and C are also similar. Divergent gene evolution leads us to expect that the transitive property should approximately hold. However, if protein domains are a combination of recurrent short polypeptide fragments, as proposed by several authors, then similarity of partial fragments may violate the transitive property, favouring the continuous view of the protein structure space. We propose a measure to quantify the violations of the transitive property when a clustering algorithm joins elements into clusters, and we find out that such violations present a well defined and detectable cross-over point, from an approximately transitive regime at high structure similarity to a regime with large transitivity violations and large differences in length at low similarity. We argue that protein structure space is discrete and hierarchic classification is justified up to this cross-over point, whereas at lower similarities the structure space is continuous and it should be represented as a network. We have tested the qualitative behaviour of this measure, varying all the choices involved in the automatic classification procedure, i.e., domain decomposition, alignment algorithm, similarity score, and clustering algorithm, and we have found out that this behaviour is quite robust. The final classification depends on the chosen algorithms. We used the values of the clustering coefficient and the transitivity violations to select the optimal choices among those that we tested. Interestingly, this criterion also favours the agreement between automatic and expert classifications

  10. Targeting protein-protein interaction between MLL1 and reciprocal proteins for leukemia therapy.

    Science.gov (United States)

    Wang, Zhi-Hui; Li, Dong-Dong; Chen, Wei-Lin; You, Qi-Dong; Guo, Xiao-Ke

    2018-01-15

    The mixed lineage leukemia protein-1 (MLL1), as a lysine methyltransferase, predominantly regulates the methylation of histone H3 lysine 4 (H3K4) and functions in hematopoietic stem cell (HSC) self-renewal. MLL1 gene fuses with partner genes that results in the generation of MLL1 fusion proteins (MLL1-FPs), which are frequently detected in acute leukemia. In the progress of leukemogenesis, a great deal of proteins cooperate with MLL1 to form multiprotein complexes serving for the dysregulation of H3K4 methylation, the overexpression of homeobox (HOX) cluster genes, and the consequent generation of leukemia. Hence, disrupting the interactions between MLL1 and the reciprocal proteins has been considered to be a new treatment strategy for leukemia. Here, we reviewed potential protein-protein interactions (PPIs) between MLL1 and its reciprocal proteins, and summarized the inhibitors to target MLL1 PPIs. The druggability of MLL1 PPIs for leukemia were also discussed. Copyright © 2017. Published by Elsevier Ltd.

  11. EG-05COMBINATION OF GENE COPY GAIN AND EPIGENETIC DEREGULATION ARE ASSOCIATED WITH THE ABERRANT EXPRESSION OF A STEM CELL RELATED HOX-SIGNATURE IN GLIOBLASTOMA

    Science.gov (United States)

    Kurscheid, Sebastian; Bady, Pierre; Sciuscio, Davide; Samarzija, Ivana; Shay, Tal; Vassallo, Irene; Van Criekinge, Wim; Domany, Eytan; Stupp, Roger; Delorenzi, Mauro; Hegi, Monika

    2014-01-01

    We previously reported a stem cell related HOX gene signature associated with resistance to chemo-radiotherapy (TMZ/RT- > TMZ) in glioblastoma. However, underlying mechanisms triggering overexpression remain mostly elusive. Interestingly, HOX genes are neither involved in the developing brain, nor expressed in normal brain, suggestive of an acquired gene expression signature during gliomagenesis. HOXA genes are located on CHR 7 that displays trisomy in most glioblastoma which strongly impacts gene expression on this chromosome, modulated by local regulatory elements. Furthermore we observed more pronounced DNA methylation across the HOXA locus as compared to non-tumoral brain (Human methylation 450K BeadChip Illumina; 59 glioblastoma, 5 non-tumoral brain sampes). CpG probes annotated for HOX-signature genes, contributing most to the variability, served as input into the analysis of DNA methylation and expression to identify key regulatory regions. The structural similarity of the observed correlation matrices between DNA methylation and gene expression in our cohort and an independent data-set from TCGA (106 glioblastoma) was remarkable (RV-coefficient, 0.84; p-value < 0.0001). We identified a CpG located in the promoter region of the HOXA10 locus exerting the strongest mean negative correlation between methylation and expression of the whole HOX-signature. Applying this analysis the same CpG emerged in the external set. We then determined the contribution of both, gene copy aberration (CNA) and methylation at the selected probe to explain expression of the HOX-signature using a linear model. Statistically significant results suggested an additive effect between gene dosage and methylation at the key CpG identified. Similarly, such an additive effect was also observed in the external data-set. Taken together, we hypothesize that overexpression of the stem-cell related HOX signature is triggered by gain of trisomy 7 and escape from compensatory DNA methylation at

  12. Automatic structure classification of small proteins using random forest

    Directory of Open Access Journals (Sweden)

    Hirst Jonathan D

    2010-07-01

    Full Text Available Abstract Background Random forest, an ensemble based supervised machine learning algorithm, is used to predict the SCOP structural classification for a target structure, based on the similarity of its structural descriptors to those of a template structure with an equal number of secondary structure elements (SSEs. An initial assessment of random forest is carried out for domains consisting of three SSEs. The usability of random forest in classifying larger domains is demonstrated by applying it to domains consisting of four, five and six SSEs. Results Random forest, trained on SCOP version 1.69, achieves a predictive accuracy of up to 94% on an independent and non-overlapping test set derived from SCOP version 1.73. For classification to the SCOP Class, Fold, Super-family or Family levels, the predictive quality of the model in terms of Matthew's correlation coefficient (MCC ranged from 0.61 to 0.83. As the number of constituent SSEs increases the MCC for classification to different structural levels decreases. Conclusions The utility of random forest in classifying domains from the place-holder classes of SCOP to the true Class, Fold, Super-family or Family levels is demonstrated. Issues such as introduction of a new structural level in SCOP and the merger of singleton levels can also be addressed using random forest. A real-world scenario is mimicked by predicting the classification for those protein structures from the PDB, which are yet to be assigned to the SCOP classification hierarchy.

  13. The C. elegans SoxC protein SEM-2 opposes differentiation factors to promote a proliferative blast cell fate in the postembryonic mesoderm

    Science.gov (United States)

    Tian, Chenxi; Shi, Herong; Colledge, Clark; Stern, Michael; Waterston, Robert; Liu, Jun

    2011-01-01

    The proper development of multicellular organisms requires precise regulation and coordination of cell fate specification, cell proliferation and differentiation. Abnormal regulation and coordination of these processes could lead to disease, including cancer. We have examined the function of the sole C. elegans SoxC protein, SEM-2, in the M lineage, which produces the postembryonic mesoderm. We found that SEM-2/SoxC is both necessary and sufficient to promote a proliferating blast cell fate, the sex myoblast fate, over a differentiated striated bodywall muscle fate. A number of factors control the specific expression of sem-2 in the sex myoblast precursors and their descendants. This includes direct control of sem-2 expression by a Hox-PBC complex. The crucial nature of the HOX/PBC factors in directly enhancing expression of this proliferative factor in the C. elegans M lineage suggests a possible more general link between Hox-PBC factors and SoxC proteins in regulating cell proliferation. PMID:21307099

  14. Distinct subsets of Eve-positive pericardial cells stabilise cardiac outflow and contribute to Hox gene-triggered heart morphogenesis in Drosophila.

    Science.gov (United States)

    Zmojdzian, Monika; de Joussineau, Svetlana; Da Ponte, Jean Philippe; Jagla, Krzysztof

    2018-01-17

    The Drosophila heart, composed of discrete subsets of cardioblasts and pericardial cells, undergoes Hox-triggered anterior-posterior morphogenesis, leading to a functional subdivision into heart proper and aorta, with its most anterior part forming a funnel-shaped cardiac outflow. Cardioblasts differentiate into Tin-positive 'working myocytes' and Svp-expressing ostial cells. However, developmental fates and functions of heart-associated pericardial cells remain elusive. Here, we show that the pericardial cells that express the transcription factor Even Skipped adopt distinct fates along the anterior-posterior axis. Among them, the most anterior Antp-Ubx-AbdA - negative cells form a novel cardiac outflow component we call the outflow hanging structure, whereas the Antp-expressing cells differentiate into wing heart precursors. Interestingly, Hox gene expression in the Even Skipped-positive cells not only underlies their antero-posterior diversification, but also influences heart morphogenesis in a non-cell-autonomous way. In brief, we identify a new cardiac outflow component derived from a subset of Even Skipped-expressing cells that stabilises the anterior heart tip, and demonstrate non-cell-autonomous effects of Hox gene expression in the Even Skipped-positive cells on heart morphogenesis. © 2018. Published by The Company of Biologists Ltd.

  15. 3D representations of amino acids—applications to protein sequence comparison and classification

    Directory of Open Access Journals (Sweden)

    Jie Li

    2014-08-01

    Full Text Available The amino acid sequence of a protein is the key to understanding its structure and ultimately its function in the cell. This paper addresses the fundamental issue of encoding amino acids in ways that the representation of such a protein sequence facilitates the decoding of its information content. We show that a feature-based representation in a three-dimensional (3D space derived from amino acid substitution matrices provides an adequate representation that can be used for direct comparison of protein sequences based on geometry. We measure the performance of such a representation in the context of the protein structural fold prediction problem. We compare the results of classifying different sets of proteins belonging to distinct structural folds against classifications of the same proteins obtained from sequence alone or directly from structural information. We find that sequence alone performs poorly as a structure classifier. We show in contrast that the use of the three dimensional representation of the sequences significantly improves the classification accuracy. We conclude with a discussion of the current limitations of such a representation and with a description of potential improvements.

  16. Ancient Pbx-Hox signatures define hundreds of vertebrate developmental enhancers

    Directory of Open Access Journals (Sweden)

    Parker Hugo J

    2011-12-01

    Full Text Available Abstract Background Gene regulation through cis-regulatory elements plays a crucial role in development and disease. A major aim of the post-genomic era is to be able to read the function of cis-regulatory elements through scrutiny of their DNA sequence. Whilst comparative genomics approaches have identified thousands of putative regulatory elements, our knowledge of their mechanism of action is poor and very little progress has been made in systematically de-coding them. Results Here, we identify ancient functional signatures within vertebrate conserved non-coding elements (CNEs through a combination of phylogenetic footprinting and functional assay, using genomic sequence from the sea lamprey as a reference. We uncover a striking enrichment within vertebrate CNEs for conserved binding-site motifs of the Pbx-Hox hetero-dimer. We further show that these predict reporter gene expression in a segment specific manner in the hindbrain and pharyngeal arches during zebrafish development. Conclusions These findings evoke an evolutionary scenario in which many CNEs evolved early in the vertebrate lineage to co-ordinate Hox-dependent gene-regulatory interactions that pattern the vertebrate head. In a broader context, our evolutionary analyses reveal that CNEs are composed of tightly linked transcription-factor binding-sites (TFBSs, which can be systematically identified through phylogenetic footprinting approaches. By placing a large number of ancient vertebrate CNEs into a developmental context, our findings promise to have a significant impact on efforts toward de-coding gene-regulatory elements that underlie vertebrate development, and will facilitate building general models of regulatory element evolution.

  17. Ancient expansion of the hox cluster in lepidoptera generated four homeobox genes implicated in extra-embryonic tissue formation.

    Directory of Open Access Journals (Sweden)

    Laura Ferguson

    2014-10-01

    Full Text Available Gene duplications within the conserved Hox cluster are rare in animal evolution, but in Lepidoptera an array of divergent Hox-related genes (Shx genes has been reported between pb and zen. Here, we use genome sequencing of five lepidopteran species (Polygonia c-album, Pararge aegeria, Callimorpha dominula, Cameraria ohridella, Hepialus sylvina plus a caddisfly outgroup (Glyphotaelius pellucidus to trace the evolution of the lepidopteran Shx genes. We demonstrate that Shx genes originated by tandem duplication of zen early in the evolution of large clade Ditrysia; Shx are not found in a caddisfly and a member of the basally diverging Hepialidae (swift moths. Four distinct Shx genes were generated early in ditrysian evolution, and were stably retained in all descendent Lepidoptera except the silkmoth which has additional duplications. Despite extensive sequence divergence, molecular modelling indicates that all four Shx genes have the potential to encode stable homeodomains. The four Shx genes have distinct spatiotemporal expression patterns in early development of the Speckled Wood butterfly (Pararge aegeria, with ShxC demarcating the future sites of extraembryonic tissue formation via strikingly localised maternal RNA in the oocyte. All four genes are also expressed in presumptive serosal cells, prior to the onset of zen expression. Lepidopteran Shx genes represent an unusual example of Hox cluster expansion and integration of novel genes into ancient developmental regulatory networks.

  18. Hox gene expression leads to differential hind leg development between honeybee castes.

    Science.gov (United States)

    Bomtorin, Ana Durvalina; Barchuk, Angel Roberto; Moda, Livia Maria; Simoes, Zila Luz Paulino

    2012-01-01

    Beyond the physiological and behavioural, differences in appendage morphology between the workers and queens of Apis mellifera are pre-eminent. The hind legs of workers, which are highly specialized pollinators, deserve special attention. The hind tibia of worker has an expanded bristle-free region used for carrying pollen and propolis, the corbicula. In queens this structure is absent. Although the morphological differences are well characterized, the genetic inputs driving the development of this alternative morphology remain unknown. Leg phenotype determination takes place between the fourth and fifth larval instar and herein we show that the morphogenesis is completed at brown-eyed pupa. Using results from the hybridization of whole genome-based oligonucleotide arrays with RNA samples from hind leg imaginal discs of pre-pupal honeybees of both castes we present a list of 200 differentially expressed genes. Notably, there are castes preferentially expressed cuticular protein genes and members of the P450 family. We also provide results of qPCR analyses determining the developmental transcription profiles of eight selected genes, including abdominal-A, distal-less and ultrabithorax (Ubx), whose roles in leg development have been previously demonstrated in other insect models. Ubx expression in workers hind leg is approximately 25 times higher than in queens. Finally, immunohistochemistry assays show that Ubx localization during hind leg development resembles the bristles localization in the tibia/basitarsus of the adult legs in both castes. Our data strongly indicate that the development of the hind legs diphenism characteristic of this corbiculate species is driven by a set of caste-preferentially expressed genes, such as those encoding cuticular protein genes, P450 and Hox proteins, in response to the naturally different diets offered to honeybees during the larval period.

  19. Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning

    Directory of Open Access Journals (Sweden)

    Tanel Pärnamaa

    2017-05-01

    Full Text Available High-throughput microscopy of many single cells generates high-dimensional data that are far from straightforward to analyze. One important problem is automatically detecting the cellular compartment where a fluorescently-tagged protein resides, a task relatively simple for an experienced human, but difficult to automate on a computer. Here, we train an 11-layer neural network on data from mapping thousands of yeast proteins, achieving per cell localization classification accuracy of 91%, and per protein accuracy of 99% on held-out images. We confirm that low-level network features correspond to basic image characteristics, while deeper layers separate localization classes. Using this network as a feature calculator, we train standard classifiers that assign proteins to previously unseen compartments after observing only a small number of training examples. Our results are the most accurate subcellular localization classifications to date, and demonstrate the usefulness of deep learning for high-throughput microscopy.

  20. Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning.

    Science.gov (United States)

    Pärnamaa, Tanel; Parts, Leopold

    2017-05-05

    High-throughput microscopy of many single cells generates high-dimensional data that are far from straightforward to analyze. One important problem is automatically detecting the cellular compartment where a fluorescently-tagged protein resides, a task relatively simple for an experienced human, but difficult to automate on a computer. Here, we train an 11-layer neural network on data from mapping thousands of yeast proteins, achieving per cell localization classification accuracy of 91%, and per protein accuracy of 99% on held-out images. We confirm that low-level network features correspond to basic image characteristics, while deeper layers separate localization classes. Using this network as a feature calculator, we train standard classifiers that assign proteins to previously unseen compartments after observing only a small number of training examples. Our results are the most accurate subcellular localization classifications to date, and demonstrate the usefulness of deep learning for high-throughput microscopy. Copyright © 2017 Parnamaa and Parts.

  1. Application of InterPro for the functional classification of the proteins ...

    Indian Academy of Sciences (India)

    InterPro (http://www.ebi.ac.uk/interpro/) is an integrated documentation resource for protein families, domains and sites, developed initially as a means of rationalizing the complementary efforts of the PROSITE, PRINTS, Pfam and ProDom database projects. It is a useful resource that aids the functional classification of ...

  2. Dissecting the hydrogenase expression and activity of transformed escherichia coli with the bidirectional NiFe-hydrogenase from synechocystis sp. PCC 6803

    International Nuclear Information System (INIS)

    Moon, Yu Ran; Lee, Min Hee; An, Byung Chull; Chung, Byung Yeoup; Kim, Jae Sung; Kim, Jin Hong; Park, Youn Il; Kim, Cha Soon

    2009-01-01

    Synechocystis bidirectional hydrogenase genes (hoxEFUYH) and their putative promoter regions and transformed into E. coli. The hox genes were transcribed in the E. coli cells carrying the vector construct of pCCIFOS::phox::hox or pCCIFOS::pT7::hox and translated into HoxEFUYH proteins, suggesting that the putative hox promoter can be constitutively activated in E. coli. Accordingly, the total hydrogenase activity was markedly increased up to 192% or 169% in the transformed cells, while the hydrogen uptake was decreased up to about 30% of the negative control. Although the gene expression of LexA, the only one transcription regulator proven for the hox genes, was substantially decreased in the pCCIFOS::phox::hox cells after γ-irradiation of 30 Gy, the expression levels of HoxEFUYH proteins were not altered significantly. Thus, it is also suggested that other transcription regulators as well as LexA might contribute to activation of the hox promoter in E. coli

  3. Gene identification and protein classification in microbial metagenomic sequence data via incremental clustering

    Directory of Open Access Journals (Sweden)

    Li Weizhong

    2008-04-01

    Full Text Available Abstract Background The identification and study of proteins from metagenomic datasets can shed light on the roles and interactions of the source organisms in their communities. However, metagenomic datasets are characterized by the presence of organisms with varying GC composition, codon usage biases etc., and consequently gene identification is challenging. The vast amount of sequence data also requires faster protein family classification tools. Results We present a computational improvement to a sequence clustering approach that we developed previously to identify and classify protein coding genes in large microbial metagenomic datasets. The clustering approach can be used to identify protein coding genes in prokaryotes, viruses, and intron-less eukaryotes. The computational improvement is based on an incremental clustering method that does not require the expensive all-against-all compute that was required by the original approach, while still preserving the remote homology detection capabilities. We present evaluations of the clustering approach in protein-coding gene identification and classification, and also present the results of updating the protein clusters from our previous work with recent genomic and metagenomic sequences. The clustering results are available via CAMERA, (http://camera.calit2.net. Conclusion The clustering paradigm is shown to be a very useful tool in the analysis of microbial metagenomic data. The incremental clustering method is shown to be much faster than the original approach in identifying genes, grouping sequences into existing protein families, and also identifying novel families that have multiple members in a metagenomic dataset. These clusters provide a basis for further studies of protein families.

  4. Classification of protein fold classes by knot theory and prediction of folds by neural networks: A combined theoretical and experimental approach

    DEFF Research Database (Denmark)

    Ramnarayan, K.; Bohr, Henrik; Jalkanen, Karl J.

    2008-01-01

    We present different means of classifying protein structure. One is made rigorous by mathematical knot invariants that coincide reasonably well with ordinary graphical fold classification and another classification is by packing analysis. Furthermore when constructing our mathematical fold...... classifications, we utilize standard neural network methods for predicting protein fold classes from amino acid sequences. We also make an analysis of the redundancy of the structural classifications in relation to function and ligand binding. Finally we advocate the use of combining the measurement of the VA...

  5. Protein-Protein Interaction Article Classification Using a Convolutional Recurrent Neural Network with Pre-trained Word Embeddings.

    Science.gov (United States)

    Matos, Sérgio; Antunes, Rui

    2017-12-13

    Curation of protein interactions from scientific articles is an important task, since interaction networks are essential for the understanding of biological processes associated with disease or pharmacological action for example. However, the increase in the number of publications that potentially contain relevant information turns this into a very challenging and expensive task. In this work we used a convolutional recurrent neural network for identifying relevant articles for extracting information regarding protein interactions. Using the BioCreative III Article Classification Task dataset, we achieved an area under the precision-recall curve of 0.715 and a Matthew's correlation coefficient of 0.600, which represents an improvement over previous works.

  6. Transporter Classification Database (TCDB)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Transporter Classification Database details a comprehensive classification system for membrane transport proteins known as the Transporter Classification (TC)...

  7. HOXA1 and TALE proteins display cross-regulatory interactions and form a combinatorial binding code on HOXA1 targets.

    Science.gov (United States)

    De Kumar, Bony; Parker, Hugo J; Paulson, Ariel; Parrish, Mark E; Pushel, Irina; Singh, Narendra Pratap; Zhang, Ying; Slaughter, Brian D; Unruh, Jay R; Florens, Laurence; Zeitlinger, Julia; Krumlauf, Robb

    2017-09-01

    Hoxa1 has diverse functional roles in differentiation and development. We identify and characterize properties of regions bound by HOXA1 on a genome-wide basis in differentiating mouse ES cells. HOXA1-bound regions are enriched for clusters of consensus binding motifs for HOX, PBX, and MEIS, and many display co-occupancy of PBX and MEIS. PBX and MEIS are members of the TALE family and genome-wide analysis of multiple TALE members (PBX, MEIS, TGIF, PREP1, and PREP2) shows that nearly all HOXA1 targets display occupancy of one or more TALE members. The combinatorial binding patterns of TALE proteins define distinct classes of HOXA1 targets, which may create functional diversity. Transgenic reporter assays in zebrafish confirm enhancer activities for many HOXA1-bound regions and the importance of HOX-PBX and TGIF motifs for their regulation. Proteomic analyses show that HOXA1 physically interacts on chromatin with PBX, MEIS, and PREP family members, but not with TGIF, suggesting that TGIF may have an independent input into HOXA1-bound regions. Therefore, TALE proteins appear to represent a wide repertoire of HOX cofactors, which may coregulate enhancers through distinct mechanisms. We also discover extensive auto- and cross-regulatory interactions among the Hoxa1 and TALE genes, indicating that the specificity of HOXA1 during development may be regulated though a complex cross-regulatory network of HOXA1 and TALE proteins. This study provides new insight into a regulatory network involving combinatorial interactions between HOXA1 and TALE proteins. © 2017 De Kumar et al.; Published by Cold Spring Harbor Laboratory Press.

  8. Bat Accelerated Regions Identify a Bat Forelimb Specific Enhancer in the HoxD Locus.

    Directory of Open Access Journals (Sweden)

    Betty M Booker

    2016-03-01

    Full Text Available The molecular events leading to the development of the bat wing remain largely unknown, and are thought to be caused, in part, by changes in gene expression during limb development. These expression changes could be instigated by variations in gene regulatory enhancers. Here, we used a comparative genomics approach to identify regions that evolved rapidly in the bat ancestor, but are highly conserved in other vertebrates. We discovered 166 bat accelerated regions (BARs that overlap H3K27ac and p300 ChIP-seq peaks in developing mouse limbs. Using a mouse enhancer assay, we show that five Myotis lucifugus BARs drive gene expression in the developing mouse limb, with the majority showing differential enhancer activity compared to the mouse orthologous BAR sequences. These include BAR116, which is located telomeric to the HoxD cluster and had robust forelimb expression for the M. lucifugus sequence and no activity for the mouse sequence at embryonic day 12.5. Developing limb expression analysis of Hoxd10-Hoxd13 in Miniopterus natalensis bats showed a high-forelimb weak-hindlimb expression for Hoxd10-Hoxd11, similar to the expression trend observed for M. lucifugus BAR116 in mice, suggesting that it could be involved in the regulation of the bat HoxD complex. Combined, our results highlight novel regulatory regions that could be instrumental for the morphological differences leading to the development of the bat wing.

  9. Support Vector Machines Trained with Evolutionary Algorithms Employing Kernel Adatron for Large Scale Classification of Protein Structures.

    Science.gov (United States)

    Arana-Daniel, Nancy; Gallegos, Alberto A; López-Franco, Carlos; Alanís, Alma Y; Morales, Jacob; López-Franco, Adriana

    2016-01-01

    With the increasing power of computers, the amount of data that can be processed in small periods of time has grown exponentially, as has the importance of classifying large-scale data efficiently. Support vector machines have shown good results classifying large amounts of high-dimensional data, such as data generated by protein structure prediction, spam recognition, medical diagnosis, optical character recognition and text classification, etc. Most state of the art approaches for large-scale learning use traditional optimization methods, such as quadratic programming or gradient descent, which makes the use of evolutionary algorithms for training support vector machines an area to be explored. The present paper proposes an approach that is simple to implement based on evolutionary algorithms and Kernel-Adatron for solving large-scale classification problems, focusing on protein structure prediction. The functional properties of proteins depend upon their three-dimensional structures. Knowing the structures of proteins is crucial for biology and can lead to improvements in areas such as medicine, agriculture and biofuels.

  10. Comparison of Enzymes / Non-Enzymes Proteins Classification Models Based on 3D, Composition, Sequences and Topological Indices

    OpenAIRE

    Munteanu, Cristian Robert

    2014-01-01

    Comparison of Enzymes / Non-Enzymes Proteins Classification Models Based on 3D, Composition, Sequences and Topological Indices, German Conference on Bioinformatics (GCB), Potsdam, Germany (September, 2007)

  11. Phylogeny-dominant classification of J-proteins in Arabidopsis thaliana and Brassica oleracea.

    Science.gov (United States)

    Zhang, Bin; Qiu, Han-Lin; Qu, Dong-Hai; Ruan, Ying; Chen, Dong-Hong

    2018-04-05

    Hsp40s or DnaJ/J-proteins are evolutionarily conserved in all organisms as co-chaperones of molecular chaperone HSP70s that mainly participate in maintaining cellular protein homeostasis, such as protein folding, assembly, stabilization, and translocation under normal conditions as well as refolding and degradation under environmental stresses. It has been reported that Arabidopsis J-proteins are classified into four classes (types A-D) according to domain organization, but their phylogenetic relationships are unknown. Here, we identified 129 J-proteins in the world-wide popular vegetable Brassica oleracea, a close relative of the model plant Arabidopsis, and also revised the information of Arabidopsis J-proteins based on the latest online bioresources. According to phylogenetic analysis with domain organization and gene structure as references, the J-proteins from Arabidopsis and B. oleracea were classified into 15 main clades (I-XV) separated by a number of undefined small branches with remote relationship. Based on the number of members, they respectively belong to multigene clades, oligo-gene clades, and mono-gene clades. The J-protein genes from different clades may function together or separately to constitute a complicated regulatory network. This study provides a constructive viewpoint for J-protein classification and an informative platform for further functional dissection and resistant genes discovery related to genetic improvement of crop plants.

  12. PRED-CLASS: cascading neural networks for generalized protein classification and genome-wide applications

    OpenAIRE

    Pasquier, Claude; Promponas, Vasilis; Hamodrakas, Stavros

    2009-01-01

    International audience; A cascading system of hierarchical, artificial neural networks (named PRED-CLASS) is presented for the generalized classification of proteins into four distinct classes-transmembrane, fibrous, globular, and mixed-from information solely encoded in their amino acid sequences. The architecture of the individual component networks is kept very simple, reducing the number of free parameters (network synaptic weights) for faster training, improved generalization, and the av...

  13. Molecular classification of fatty liver by high-throughput profiling of protein post-translational modifications.

    Science.gov (United States)

    Urasaki, Yasuyo; Fiscus, Ronald R; Le, Thuc T

    2016-04-01

    We describe an alternative approach to classifying fatty liver by profiling protein post-translational modifications (PTMs) with high-throughput capillary isoelectric focusing (cIEF) immunoassays. Four strains of mice were studied, with fatty livers induced by different causes, such as ageing, genetic mutation, acute drug usage, and high-fat diet. Nutrient-sensitive PTMs of a panel of 12 liver metabolic and signalling proteins were simultaneously evaluated with cIEF immunoassays, using nanograms of total cellular protein per assay. Changes to liver protein acetylation, phosphorylation, and O-N-acetylglucosamine glycosylation were quantified and compared between normal and diseased states. Fatty liver tissues could be distinguished from one another by distinctive protein PTM profiles. Fatty liver is currently classified by morphological assessment of lipid droplets, without identifying the underlying molecular causes. In contrast, high-throughput profiling of protein PTMs has the potential to provide molecular classification of fatty liver. Copyright © 2016 Pathological Society of Great Britain and Ireland. Published by John Wiley & Sons, Ltd.

  14. Classification of EA1-box proteins and new insights into their role during reproduction in grasses.

    Science.gov (United States)

    Uebler, Susanne; Márton, Mihaela L; Dresselhaus, Thomas

    2015-12-01

    EA1-box protein classification. Success in reproduction and vegetative development in flowering plants strongly depends on precise cell-to-cell signaling events mediated by secreted peptides.A small peptide family named as EA1-like (EAL) has been first described 10 years ago and includes EA1 involved in pollen tubes attraction by the female gametophyte and EAL1-regulating germ cell identity in maize. EALs consist of an N-terminal endoplasmic reticulum-targeting motif, the highly conserved EA1-box and a short C-terminal alanine-rich domain. Whereas EAL peptides are exclusively found in the Gramineae, the EA1-box is widely distributed throughout the plant kingdom. Based on in silico analysis and subcellular localization studies, we report here a new classification of EA1-box proteins in flowering plants. They can be distinguished into three protein classes: the already defined EAL proteins, the EAG (EA1-box glycine-rich) proteins and the EAC (EA1-box containing)proteins. While fusion proteins of EAL and EAC classes locate to the secretory pathway, EAGs are cytoplasmic and locate also to the nucleus. Moreover, we further show that the third EAL protein of Zea mays, EAL2, appears to be also involved in processes related to late embryogenic development as its peptide level increases after formation of leaf primordia. Immunohistochemical studies indicate its presence in the scutellar parenchyma and around the vasculature, where it is secreted to the extracellular space. In conclusion, the members of the maize EAL family possess very diverse functions during reproduction and it will now be exciting to elucidate the functions of EAGs and EACs in plants.

  15. The expression of HoxB5 and SPC in neonatal rat lung after exposure to fluoxetine.

    Science.gov (United States)

    Taghizadeh, Razieh; Taghipour, Zahra; Karimi, Akbar; Shamsizadeh, Ali; Taghavi, Mohammad Mohsen; Shariati, Mahdi; Shabanizadeh, Ahmad; Jafari Naveh, Hamid Reza; Bidaki, Reza; Aminzadeh, Fariba

    2016-01-01

    Approximately 10% of pregnant women suffer from pregnancy-associated depression. Fluoxetine, as a selective serotonin reuptake inhibitor, is being employed as a therapy for depressive disorders. The present study aimed to determine the effects of fluoxetine on neonatal lung development. Thirty pregnant Wistar rats (weighing 200-250 g) were treated daily with 7 mg/kg fluoxetine from gestation day 0 to gestation day 21, via gavage. The control group received a similar volume of distilled water only. Following delivery, the newborns and their lungs were immediately weighed in both of the groups. The right lung was fixed for histological assessments while the left lung was used for evaluation of the expression of SPC and HoxB5 by the real-time polymerase chain reaction method. Results have indicated that even though the body weight and the number of neonatal rats in both groups were the same, the lung weight of neonates exposed to fluoxetine was significantly different compared to the control group ( P fluoxetine treatment group morphologically appears to be similar to the pseudoglandular phase, whereas the control group lungs experienced more development. According to the upregulated expression of HoxB5 concerning histological findings, results of the present study showed that fluoxetine can influence lung growth and may in turn lead to delay in lung development. So establishment of studies to identify the effects of antidepressant drugs during pregnancy is deserved.

  16. Functional classification of protein structures by local structure matching in graph representation.

    Science.gov (United States)

    Mills, Caitlyn L; Garg, Rohan; Lee, Joslynn S; Tian, Liang; Suciu, Alexandru; Cooperman, Gene; Beuning, Penny J; Ondrechen, Mary Jo

    2018-03-31

    As a result of high-throughput protein structure initiatives, over 14,400 protein structures have been solved by structural genomics (SG) centers and participating research groups. While the totality of SG data represents a tremendous contribution to genomics and structural biology, reliable functional information for these proteins is generally lacking. Better functional predictions for SG proteins will add substantial value to the structural information already obtained. Our method described herein, Graph Representation of Active Sites for Prediction of Function (GRASP-Func), predicts quickly and accurately the biochemical function of proteins by representing residues at the predicted local active site as graphs rather than in Cartesian coordinates. We compare the GRASP-Func method to our previously reported method, structurally aligned local sites of activity (SALSA), using the ribulose phosphate binding barrel (RPBB), 6-hairpin glycosidase (6-HG), and Concanavalin A-like Lectins/Glucanase (CAL/G) superfamilies as test cases. In each of the superfamilies, SALSA and the much faster method GRASP-Func yield similar correct classification of previously characterized proteins, providing a validated benchmark for the new method. In addition, we analyzed SG proteins using our SALSA and GRASP-Func methods to predict function. Forty-one SG proteins in the RPBB superfamily, nine SG proteins in the 6-HG superfamily, and one SG protein in the CAL/G superfamily were successfully classified into one of the functional families in their respective superfamily by both methods. This improved, faster, validated computational method can yield more reliable predictions of function that can be used for a wide variety of applications by the community. © 2018 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.

  17. Quantitative Proteomics Analysis Reveals Novel Insights into Mechanisms of Action of Long Noncoding RNA Hox Transcript Antisense Intergenic RNA (HOTAIR) in HeLa Cells*

    Science.gov (United States)

    Zheng, Peng; Xiong, Qian; Wu, Ying; Chen, Ying; Chen, Zhuo; Fleming, Joy; Gao, Ding; Bi, Lijun; Ge, Feng

    2015-01-01

    Long noncoding RNAs (lncRNAs), which have emerged in recent years as a new and crucial layer of gene regulators, regulate various biological processes such as carcinogenesis and metastasis. HOTAIR (Hox transcript antisense intergenic RNA), a lncRNA overexpressed in most human cancers, has been shown to be an oncogenic lncRNA. Here, we explored the role of HOTAIR in HeLa cells and searched for proteins regulated by HOTAIR. To understand the mechanism of action of HOTAIR from a systems perspective, we employed a quantitative proteomic strategy to systematically identify potential targets of HOTAIR. The expression of 170 proteins was significantly dys-regulated after inhibition of HOTAIR, implying that they could be potential targets of HOTAIR. Analysis of this data at the systems level revealed major changes in proteins involved in diverse cellular components, including the cytoskeleton and the respiratory chain. Further functional studies on vimentin (VIM), a key protein involved in the cytoskeleton, revealed that HOTAIR exerts its effects on migration and invasion of HeLa cells, at least in part, through the regulation of VIM expression. Inhibition of HOTAIR leads to mitochondrial dysfunction and ultrastructural alterations, suggesting a novel role of HOTAIR in maintaining mitochondrial function in cancer cells. Our results provide novel insights into the mechanisms underlying the function of HOTAIR in cancer cells. We expect that the methods used in this study will become an integral part of functional studies of lncRNAs. PMID:25762744

  18. Effects of Aircraft Wake Dynamics on Measured and Simulated NO(x) and HO(x) Wake Chemistry. Appendix B

    Science.gov (United States)

    Lewellen, D. C.; Lewellen, W. S.

    2001-01-01

    High-resolution numerical large-eddy simulations of the near wake of a B757 including simplified NOx and HOx chemistry were performed to explore the effects of dynamics on chemistry in wakes of ages from a few seconds to several minutes. Dilution plays an important basic role in the NOx-O3 chemistry in the wake, while a more interesting interaction between the chemistry and dynamics occurs for the HOx species. These simulation results are compared with published measurements of OH and HO2 within a B757 wake under cruise conditions in the upper troposphere taken during the Subsonic Aircraft Contrail and Cloud Effects Special Study (SUCCESS) mission in May 1996. The simulation provides a much finer grained representation of the chemistry and dynamics of the early wake than is possible from the 1 s data samples taken in situ. The comparison suggests that the previously reported discrepancy of up to a factor of 20 - 50 between the SUCCESS measurements of the [HO2]/[OH] ratio and that predicted by simplified theoretical computations is due to the combined effects of large mixing rates around the wake plume edges and averaging over volumes containing large species fluctuations. The results demonstrate the feasibility of using three-dimensional unsteady large-eddy simulations with coupled chemistry to study such phenomena.

  19. The HoxD cluster is a dynamic and resilient TAD boundary controlling the segregation of antagonistic regulatory landscapes.

    Science.gov (United States)

    Rodríguez-Carballo, Eddie; Lopez-Delisle, Lucille; Zhan, Ye; Fabre, Pierre J; Beccari, Leonardo; El-Idrissi, Imane; Huynh, Thi Hanh Nguyen; Ozadam, Hakan; Dekker, Job; Duboule, Denis

    2017-11-15

    The mammalian HoxD cluster lies between two topologically associating domains (TADs) matching distinct enhancer-rich regulatory landscapes. During limb development, the telomeric TAD controls the early transcription of Hoxd genes in forearm cells, whereas the centromeric TAD subsequently regulates more posterior Hoxd genes in digit cells. Therefore, the TAD boundary prevents the terminal Hoxd13 gene from responding to forearm enhancers, thereby allowing proper limb patterning. To assess the nature and function of this CTCF-rich DNA region in embryos , we compared chromatin interaction profiles between proximal and distal limb bud cells isolated from mutant stocks where various parts of this boundary region were removed. The resulting progressive release in boundary effect triggered inter-TAD contacts, favored by the activity of the newly accessed enhancers. However, the boundary was highly resilient, and only a 400-kb deletion, including the whole-gene cluster, was eventually able to merge the neighboring TADs into a single structure. In this unified TAD, both proximal and distal limb enhancers nevertheless continued to work independently over a targeted transgenic reporter construct. We propose that the whole HoxD cluster is a dynamic TAD border and that the exact boundary position varies depending on both the transcriptional status and the developmental context. © 2017 Rodríguez-Carballo et al.; Published by Cold Spring Harbor Laboratory Press.

  20. Quantification and Classification of E. coli Proteome Utilization and Unused Protein Costs across Environments.

    Directory of Open Access Journals (Sweden)

    Edward J O'Brien

    2016-06-01

    Full Text Available The costs and benefits of protein expression are balanced through evolution. Expression of un-utilized protein (that have no benefits in the current environment incurs a quantifiable fitness costs on cellular growth rates; however, the magnitude and variability of un-utilized protein expression in natural settings is unknown, largely due to the challenge in determining environment-specific proteome utilization. We address this challenge using absolute and global proteomics data combined with a recently developed genome-scale model of Escherichia coli that computes the environment-specific cost and utility of the proteome on a per gene basis. We show that nearly half of the proteome mass is unused in certain environments and accounting for the cost of this unused protein expression explains >95% of the variance in growth rates of Escherichia coli across 16 distinct environments. Furthermore, reduction in unused protein expression is shown to be a common mechanism to increase cellular growth rates in adaptive evolution experiments. Classification of the unused protein reveals that the unused protein encodes several nutrient- and stress- preparedness functions, which may convey fitness benefits in varying environments. Thus, unused protein expression is the source of large and pervasive fitness costs that may provide the benefit of hedging against environmental change.

  1. Homeotic function of Drosophila Bithorax-Complex miRNAs mediates fertility by restricting multiple Hox genes and TALE cofactors in the central nervous system

    Science.gov (United States)

    Garaulet, Daniel L.; Castellanos, Monica; Bejarano, Fernando; Sanfilippo, Piero; Tyler, David M.; Allan, Douglas W.; Sánchez-Herrero, Ernesto; Lai, Eric C.

    2014-01-01

    The Drosophila Bithorax-Complex (BX-C) Hox cluster contains a bidirectionally-transcribed miRNA locus, and a deletion mutant (∆mir) lays no eggs and is completely sterile. We show these miRNAs are expressed and active in distinct spatial registers along the anterior-posterior axis in the central nervous system. ∆mir larvae derepress a network of direct homeobox gene targets in the posterior ventral nerve cord (VNC), including BX-C genes and their TALE cofactors. These are phenotypically critical targets, since sterility of ∆mir mutants was substantially rescued by heterozygosity of these genes. The posterior VNC contains Ilp7+ oviduct motoneurons, whose innervation and morphology are defective in ∆mir females, and substantially rescued by heterozygosity of ∆mir targets, especially within the BX-C. Collectively, we reveal (1) critical roles for Hox miRNAs that determine segment-specific expression of homeotic genes, which are not masked by transcriptional regulation, and (2) that BX-C miRNAs are essential for neural patterning and reproductive behavior. PMID:24909902

  2. Proteomic classification of breast cancer.

    LENUS (Irish Health Repository)

    Kamel, Dalia

    2012-11-01

    Being a significant health problem that affects patients in various age groups, breast cancer has been extensively studied to date. Recently, molecular breast cancer classification has advanced significantly with the availability of genomic profiling technologies. Proteomic technologies have also advanced from traditional protein assays including enzyme-linked immunosorbent assay, immunoblotting and immunohistochemistry to more comprehensive approaches including mass spectrometry and reverse phase protein lysate arrays (RPPA). The purpose of this manuscript is to review the current protein markers that influence breast cancer prediction and prognosis and to focus on novel advances in proteomic classification of breast cancer.

  3. Identification of a developmental gene expression signature, including HOX genes, for the normal human colonic crypt stem cell niche: overexpression of the signature parallels stem cell overpopulation during colon tumorigenesis.

    Science.gov (United States)

    Bhatlekar, Seema; Addya, Sankar; Salunek, Moreh; Orr, Christopher R; Surrey, Saul; McKenzie, Steven; Fields, Jeremy Z; Boman, Bruce M

    2014-01-15

    Our goal was to identify a unique gene expression signature for human colonic stem cells (SCs). Accordingly, we determined the gene expression pattern for a known SC-enriched region--the crypt bottom. Colonic crypts and isolated crypt subsections (top, middle, and bottom) were purified from fresh, normal, human, surgical specimens. We then used an innovative strategy that used two-color microarrays (∼18,500 genes) to compare gene expression in the crypt bottom with expression in the other crypt subsections (middle or top). Array results were validated by PCR and immunostaining. About 25% of genes analyzed were expressed in crypts: 88 preferentially in the bottom, 68 in the middle, and 131 in the top. Among genes upregulated in the bottom, ∼30% were classified as growth and/or developmental genes including several in the PI3 kinase pathway, a six-transmembrane protein STAMP1, and two homeobox (HOXA4, HOXD10) genes. qPCR and immunostaining validated that HOXA4 and HOXD10 are selectively expressed in the normal crypt bottom and are overexpressed in colon carcinomas (CRCs). Immunostaining showed that HOXA4 and HOXD10 are co-expressed with the SC markers CD166 and ALDH1 in cells at the normal crypt bottom, and the number of these co-expressing cells is increased in CRCs. Thus, our findings show that these two HOX genes are selectively expressed in colonic SCs and that HOX overexpression in CRCs parallels the SC overpopulation that occurs during CRC development. Our study suggests that developmental genes play key roles in the maintenance of normal SCs and crypt renewal, and contribute to the SC overpopulation that drives colon tumorigenesis.

  4. Molecular cytogenetic differentiation of paralogs of Hox paralogs in duplicated and re-diploidized genome of the North American paddlefish (Polyodon spathula)

    Czech Academy of Sciences Publication Activity Database

    Symonová, Radka; Havelka, M.; Amemiya, C. T.; Howell, M. W.; Kořínková, Tereza; Flajšhans, M.; Gela, D.; Ráb, Petr

    2017-01-01

    Roč. 18, č. 1 (2017), č. článku 19. ISSN 1471-2156 R&D Projects: GA ČR GA14-02940S; GA MŠk EF15_003/0000460 Institutional support: RVO:67985904 Keywords : hoxA/D paralogs mapping * sturgeon whole genome duplication * ancient fish genome * rediploidization Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Genetics and heredity (medical genetics to be 3) Impact factor: 2.266, year: 2016

  5. Evaluation of Semi-supervised Learning for Classification of Protein Crystallization Imagery.

    Science.gov (United States)

    Sigdel, Madhav; Dinç, İmren; Dinç, Semih; Sigdel, Madhu S; Pusey, Marc L; Aygün, Ramazan S

    2014-03-01

    In this paper, we investigate the performance of two wrapper methods for semi-supervised learning algorithms for classification of protein crystallization images with limited labeled images. Firstly, we evaluate the performance of semi-supervised approach using self-training with naïve Bayesian (NB) and sequential minimum optimization (SMO) as the base classifiers. The confidence values returned by these classifiers are used to select high confident predictions to be used for self-training. Secondly, we analyze the performance of Yet Another Two Stage Idea (YATSI) semi-supervised learning using NB, SMO, multilayer perceptron (MLP), J48 and random forest (RF) classifiers. These results are compared with the basic supervised learning using the same training sets. We perform our experiments on a dataset consisting of 2250 protein crystallization images for different proportions of training and test data. Our results indicate that NB and SMO using both self-training and YATSI semi-supervised approaches improve accuracies with respect to supervised learning. On the other hand, MLP, J48 and RF perform better using basic supervised learning. Overall, random forest classifier yields the best accuracy with supervised learning for our dataset.

  6. PCVMZM: Using the Probabilistic Classification Vector Machines Model Combined with a Zernike Moments Descriptor to Predict Protein-Protein Interactions from Protein Sequences.

    Science.gov (United States)

    Wang, Yanbin; You, Zhuhong; Li, Xiao; Chen, Xing; Jiang, Tonghai; Zhang, Jingting

    2017-05-11

    Protein-protein interactions (PPIs) are essential for most living organisms' process. Thus, detecting PPIs is extremely important to understand the molecular mechanisms of biological systems. Although many PPIs data have been generated by high-throughput technologies for a variety of organisms, the whole interatom is still far from complete. In addition, the high-throughput technologies for detecting PPIs has some unavoidable defects, including time consumption, high cost, and high error rate. In recent years, with the development of machine learning, computational methods have been broadly used to predict PPIs, and can achieve good prediction rate. In this paper, we present here PCVMZM, a computational method based on a Probabilistic Classification Vector Machines (PCVM) model and Zernike moments (ZM) descriptor for predicting the PPIs from protein amino acids sequences. Specifically, a Zernike moments (ZM) descriptor is used to extract protein evolutionary information from Position-Specific Scoring Matrix (PSSM) generated by Position-Specific Iterated Basic Local Alignment Search Tool (PSI-BLAST). Then, PCVM classifier is used to infer the interactions among protein. When performed on PPIs datasets of Yeast and H. Pylori , the proposed method can achieve the average prediction accuracy of 94.48% and 91.25%, respectively. In order to further evaluate the performance of the proposed method, the state-of-the-art support vector machines (SVM) classifier is used and compares with the PCVM model. Experimental results on the Yeast dataset show that the performance of PCVM classifier is better than that of SVM classifier. The experimental results indicate that our proposed method is robust, powerful and feasible, which can be used as a helpful tool for proteomics research.

  7. Measurements of HOx radicals and the total OH reactivity (kOH) in the planetary boundary layer over southern Finland aboard the Zeppelin NT airship during the PEGASOS field campaign.

    Science.gov (United States)

    Broch, Sebastian; Gomm, Sebastian; Fuchs, Hendrik; Hofzumahaus, Andreas; Holland, Frank; Bachner, Mathias; Bohn, Birger; Häseler, Rolf; Jäger, Julia; Kaiser, Jennifer; Keutsch, Frank; Li, Xin; Lohse, Insa; Rohrer, Franz; Thayer, Mitchell; Tillmann, Ralf; Wegener, Robert; Mentel, Thomas F.; Kiendler-Scharr, Astrid; Wahner, Andreas

    2014-05-01

    The concentration of hydroxyl (OH) and hydroperoxy (HO2) radicals (also named HOx) and the total OH reactivity were measured over southern Finland and during transfer flights over Germany, Denmark and Sweden aboard the Zeppelin NT airship within the framework of the Pan-European Gas-AeroSOls-climate interaction Study (PEGASOS) field campaign in 2013. The measurements were performed with a remotely controlled Laser Induced Fluorescence (LIF) instrument which was installed on top of the airship. Together with a comprehensive set of trace gas (O3, CO, NO, NO2, HCHO, HONO, VOCs), photolysis frequencies and aerosol measurements as well as the measurement of meteorological parameters, these data provide the possibility to test the current understanding of the chemical processes in the planetary boundary layer (PBL) over different landscapes and in different chemical regimes. The unique flight performance of the Zeppelin NT allowed us to measure transects at a constant altitude as well as vertical profiles within the range of 80 m to 1000 m above ground. The transect flights show changes in the HOx distribution and kOH while crossing different chemical regimes on the way from Friedrichshafen, Germany to Jämijärvi, Finland over Germany, Denmark and Sweden. Vertical profile flights over the boreal forest close to Jämijärvi and Hyytiälä (both Finland) gave the opportunity to investigate the layering of the PBL and with that the vertical distribution of HOx and kOH with a high spatial and temporal resolution. Gradients in the HOx concentration and kOH were measured between the different layers during the early morning hours. The maximum radical concentrations found during the campaign were 1.0 x 107 cm-3 for OH and 1.0 x 109 cm-3 for HO2. The total OH reactivity measured in Finland was much lower than what was reported before in the literature from ground based measurements and ranged from 1 s-1 to 6 s-1. Acknowledgement: PEGASOS project funded by the European

  8. Classification using diffraction patterns for single-particle analysis

    International Nuclear Information System (INIS)

    Hu, Hongli; Zhang, Kaiming; Meng, Xing

    2016-01-01

    An alternative method has been assessed; diffraction patterns derived from the single particle data set were used to perform the first round of classification in creating the initial averages for proteins data with symmetrical morphology. The test protein set was a collection of Caenorhabditis elegans small heat shock protein 17 obtained by Cryo EM, which has a tetrahedral (12-fold) symmetry. It is demonstrated that the initial classification on diffraction patterns is workable as well as the real-space classification that is based on the phase contrast. The test results show that the information from diffraction patterns has the enough details to make the initial model faithful. The potential advantage using the alternative method is twofold, the ability to handle the sets with poor signal/noise or/and that break the symmetry properties. - Highlights: • New classification method. • Create the accurate initial model. • Better in handling noisy data.

  9. Classification using diffraction patterns for single-particle analysis

    Energy Technology Data Exchange (ETDEWEB)

    Hu, Hongli; Zhang, Kaiming [Department of Biophysics, the Health Science Centre, Peking University, Beijing 100191 (China); Meng, Xing, E-mail: xmeng101@gmail.com [Wadsworth Centre, New York State Department of Health, Albany, New York 12201 (United States)

    2016-05-15

    An alternative method has been assessed; diffraction patterns derived from the single particle data set were used to perform the first round of classification in creating the initial averages for proteins data with symmetrical morphology. The test protein set was a collection of Caenorhabditis elegans small heat shock protein 17 obtained by Cryo EM, which has a tetrahedral (12-fold) symmetry. It is demonstrated that the initial classification on diffraction patterns is workable as well as the real-space classification that is based on the phase contrast. The test results show that the information from diffraction patterns has the enough details to make the initial model faithful. The potential advantage using the alternative method is twofold, the ability to handle the sets with poor signal/noise or/and that break the symmetry properties. - Highlights: • New classification method. • Create the accurate initial model. • Better in handling noisy data.

  10. LZW-Kernel: fast kernel utilizing variable length code blocks from LZW compressors for protein sequence classification.

    Science.gov (United States)

    Filatov, Gleb; Bauwens, Bruno; Kertész-Farkas, Attila

    2018-05-07

    Bioinformatics studies often rely on similarity measures between sequence pairs, which often pose a bottleneck in large-scale sequence analysis. Here, we present a new convolutional kernel function for protein sequences called the LZW-Kernel. It is based on code words identified with the Lempel-Ziv-Welch (LZW) universal text compressor. The LZW-Kernel is an alignment-free method, it is always symmetric, is positive, always provides 1.0 for self-similarity and it can directly be used with Support Vector Machines (SVMs) in classification problems, contrary to normalized compression distance (NCD), which often violates the distance metric properties in practice and requires further techniques to be used with SVMs. The LZW-Kernel is a one-pass algorithm, which makes it particularly plausible for big data applications. Our experimental studies on remote protein homology detection and protein classification tasks reveal that the LZW-Kernel closely approaches the performance of the Local Alignment Kernel (LAK) and the SVM-pairwise method combined with Smith-Waterman (SW) scoring at a fraction of the time. Moreover, the LZW-Kernel outperforms the SVM-pairwise method when combined with BLAST scores, which indicates that the LZW code words might be a better basis for similarity measures than local alignment approximations found with BLAST. In addition, the LZW-Kernel outperforms n-gram based mismatch kernels, hidden Markov model based SAM and Fisher kernel, and protein family based PSI-BLAST, among others. Further advantages include the LZW-Kernel's reliance on a simple idea, its ease of implementation, and its high speed, three times faster than BLAST and several magnitudes faster than SW or LAK in our tests. LZW-Kernel is implemented as a standalone C code and is a free open-source program distributed under GPLv3 license and can be downloaded from https://github.com/kfattila/LZW-Kernel. akerteszfarkas@hse.ru. Supplementary data are available at Bioinformatics Online.

  11. Classification of protein profiles using fuzzy clustering techniques

    DEFF Research Database (Denmark)

    Karemore, Gopal; Mullick, Jhinuk B.; Sujatha, R.

    2010-01-01

     Present  study  has  brought  out  a  comparison  of PCA  and  fuzzy  clustering  techniques  in  classifying  protein profiles  (chromatogram)  of  homogenates  of  different  tissue origins:  Ovarian,  Cervix,  Oral  cancers,  which  were  acquired using HPLC–LIF (High Performance Liquid...... Chromatography- Laser   Induced   Fluorescence)   method   developed   in   our laboratory. Study includes 11 chromatogram spectra each from oral,  cervical,  ovarian  cancers  as  well  as  healthy  volunteers. Generally  multivariate  analysis  like  PCA  demands  clear  data that   is   devoid   of   day......   PCA   mapping   in   classifying   various cancers from healthy spectra with classification rate up to 95 % from  60%.  Methods  are  validated  using  various  clustering indexes   and   shows   promising   improvement   in   developing optical pathology like HPLC-LIF for early detection of various...

  12. Efficient use of unlabeled data for protein sequence classification: a comparative study.

    Science.gov (United States)

    Kuksa, Pavel; Huang, Pai-Hsi; Pavlovic, Vladimir

    2009-04-29

    Recent studies in computational primary protein sequence analysis have leveraged the power of unlabeled data. For example, predictive models based on string kernels trained on sequences known to belong to particular folds or superfamilies, the so-called labeled data set, can attain significantly improved accuracy if this data is supplemented with protein sequences that lack any class tags-the unlabeled data. In this study, we present a principled and biologically motivated computational framework that more effectively exploits the unlabeled data by only using the sequence regions that are more likely to be biologically relevant for better prediction accuracy. As overly-represented sequences in large uncurated databases may bias the estimation of computational models that rely on unlabeled data, we also propose a method to remove this bias and improve performance of the resulting classifiers. Combined with state-of-the-art string kernels, our proposed computational framework achieves very accurate semi-supervised protein remote fold and homology detection on three large unlabeled databases. It outperforms current state-of-the-art methods and exhibits significant reduction in running time. The unlabeled sequences used under the semi-supervised setting resemble the unpolished gemstones; when used as-is, they may carry unnecessary features and hence compromise the classification accuracy but once cut and polished, they improve the accuracy of the classifiers considerably.

  13. Hoxa2 Selectively Enhances Meis Binding to Change a Branchial Arch Ground State

    Science.gov (United States)

    Amin, Shilu; Donaldson, Ian J.; Zannino, Denise A.; Hensman, James; Rattray, Magnus; Losa, Marta; Spitz, François; Ladam, Franck; Sagerström, Charles; Bobola, Nicoletta

    2015-01-01

    Summary Hox transcription factors (TFs) are essential for vertebrate development, but how these evolutionary conserved proteins function in vivo remains unclear. Because Hox proteins have notoriously low binding specificity, they are believed to bind with cofactors, mainly homeodomain TFs Pbx and Meis, to select their specific targets. We mapped binding of Meis, Pbx, and Hoxa2 in the branchial arches, a series of segments in the developing vertebrate head. Meis occupancy is largely similar in Hox-positive and -negative arches. Hoxa2, which specifies second arch (IIBA) identity, recognizes a subset of Meis prebound sites that contain Hox motifs. Importantly, at these sites Meis binding is strongly increased. This enhanced Meis binding coincides with active enhancers, which are linked to genes highly expressed in the IIBA and regulated by Hoxa2. These findings show that Hoxa2 operates as a tissue-specific cofactor, enhancing Meis binding to specific sites that provide the IIBA with its anatomical identity. PMID:25640223

  14. Can-CSC-GBE: Developing Cost-sensitive Classifier with Gentleboost Ensemble for breast cancer classification using protein amino acids and imbalanced data.

    Science.gov (United States)

    Ali, Safdar; Majid, Abdul; Javed, Syed Gibran; Sattar, Mohsin

    2016-06-01

    Early prediction of breast cancer is important for effective treatment and survival. We developed an effective Cost-Sensitive Classifier with GentleBoost Ensemble (Can-CSC-GBE) for the classification of breast cancer using protein amino acid features. In this work, first, discriminant information of the protein sequences related to breast tissue is extracted. Then, the physicochemical properties hydrophobicity and hydrophilicity of amino acids are employed to generate molecule descriptors in different feature spaces. For comparison, we obtained results by combining Cost-Sensitive learning with conventional ensemble of AdaBoostM1 and Bagging. The proposed Can-CSC-GBE system has effectively reduced the misclassification costs and thereby improved the overall classification performance. Our novel approach has highlighted promising results as compared to the state-of-the-art ensemble approaches. Copyright © 2016 Elsevier Ltd. All rights reserved.

  15. TALE transcription factors during early development of the vertebrate brain and eye.

    Science.gov (United States)

    Schulte, Dorothea; Frank, Dale

    2014-01-01

    Our brain's cognitive performance arises from the coordinated activities of billions of nerve cells. Despite a high degree of morphological and functional differences, all neurons of the vertebrate central nervous system (CNS) arise from a common field of multipotent progenitors. Cell fate specification and differentiation are directed by multistep processes that include inductive/external cues, such as the extracellular matrix or growth factors, and cell-intrinsic determinants, such as transcription factors and epigenetic modulators of proteins and DNA. Here we review recent findings implicating TALE-homeodomain proteins in these processes. Although originally identified as HOX-cofactors, TALE proteins also contribute to many physiological processes that do not require HOX-activity. Particular focus is, therefore, given to HOX-dependent and -independent functions of TALE proteins during early vertebrate brain development. Additionally, we provide an overview about known upstream and downstream factors of TALE proteins in the developing vertebrate brain and discuss general concepts of how TALE proteins function to modulate neuronal cell fate specification. Copyright © 2013 Wiley Periodicals, Inc.

  16. Comparative Analysis of Chromatin Binding by Sex Comb on Midleg (SCM) and Other Polycomb Group Repressors at a Drosophila Hox Gene▿

    OpenAIRE

    Wang, Liangjun; Jahren, Neal; Miller, Ellen L.; Ketel, Carrie S.; Mallin, Daniel R.; Simon, Jeffrey A.

    2010-01-01

    Sex Comb on Midleg (SCM) is a transcriptional repressor in the Polycomb group (PcG), but its molecular role in PcG silencing is not known. Although SCM can interact with Polycomb repressive complex 1 (PRC1) in vitro, biochemical studies have indicated that SCM is not a core constituent of PRC1 or PRC2. Nevertheless, SCM is just as critical for Drosophila Hox gene silencing as canonical subunits of these well-characterized PcG complexes. To address functional relationships between SCM and othe...

  17. PRED-CLASS: cascading neural networks for generalized protein classification and genome-wide applications.

    Science.gov (United States)

    Pasquier, C; Promponas, V J; Hamodrakas, S J

    2001-08-15

    A cascading system of hierarchical, artificial neural networks (named PRED-CLASS) is presented for the generalized classification of proteins into four distinct classes-transmembrane, fibrous, globular, and mixed-from information solely encoded in their amino acid sequences. The architecture of the individual component networks is kept very simple, reducing the number of free parameters (network synaptic weights) for faster training, improved generalization, and the avoidance of data overfitting. Capturing information from as few as 50 protein sequences spread among the four target classes (6 transmembrane, 10 fibrous, 13 globular, and 17 mixed), PRED-CLASS was able to obtain 371 correct predictions out of a set of 387 proteins (success rate approximately 96%) unambiguously assigned into one of the target classes. The application of PRED-CLASS to several test sets and complete proteomes of several organisms demonstrates that such a method could serve as a valuable tool in the annotation of genomic open reading frames with no functional assignment or as a preliminary step in fold recognition and ab initio structure prediction methods. Detailed results obtained for various data sets and completed genomes, along with a web sever running the PRED-CLASS algorithm, can be accessed over the World Wide Web at http://o2.biol.uoa.gr/PRED-CLASS.

  18. Multiple transcription factors directly regulate Hox gene lin-39 expression in ventral hypodermal cells of the C. elegans embryo and larva, including the hypodermal fate regulators LIN-26 and ELT-6.

    Science.gov (United States)

    Liu, Wan-Ju; Reece-Hoyes, John S; Walhout, Albertha J M; Eisenmann, David M

    2014-05-13

    Hox genes encode master regulators of regional fate specification during early metazoan development. Much is known about the initiation and regulation of Hox gene expression in Drosophila and vertebrates, but less is known in the non-arthropod invertebrate model system, C. elegans. The C. elegans Hox gene lin-39 is required for correct fate specification in the midbody region, including the Vulval Precursor Cells (VPCs). To better understand lin-39 regulation and function, we aimed to identify transcription factors necessary for lin-39 expression in the VPCs, and in particular sought factors that initiate lin-39 expression in the embryo. We used the yeast one-hybrid (Y1H) method to screen for factors that bound to 13 fragments from the lin-39 region: twelve fragments contained sequences conserved between C. elegans and two other nematode species, while one fragment was known to drive reporter gene expression in the early embryo in cells that generate the VPCs. Sixteen transcription factors that bind to eight lin-39 genomic fragments were identified in yeast, and we characterized several factors by verifying their physical interactions in vitro, and showing that reduction of their function leads to alterations in lin-39 levels and lin-39::GFP reporter expression in vivo. Three factors, the orphan nuclear hormone receptor NHR-43, the hypodermal fate regulator LIN-26, and the GATA factor ELT-6 positively regulate lin-39 expression in the embryonic precursors to the VPCs. In particular, ELT-6 interacts with an enhancer that drives GFP expression in the early embryo, and the ELT-6 site we identified is necessary for proper embryonic expression. These three factors, along with the factors ZTF-17, BED-3 and TBX-9, also positively regulate lin-39 expression in the larval VPCs. These results significantly expand the number of factors known to directly bind and regulate lin-39 expression, identify the first factors required for lin-39 expression in the embryo, and hint at a

  19. A concerted action of a paired-type homeobox gene, aristaless, and a homolog of Hox11/tlx homeobox gene, clawless, is essential for the distal tip development of the Drosophila leg.

    Science.gov (United States)

    Kojima, Tetsuya; Tsuji, Takuya; Saigo, Kaoru

    2005-03-15

    The subdivision of the developing field by region-specific expression of genes encoding transcription factors is an essential step during appendage development in arthropod and vertebrates. In Drosophila leg development, the distal-most region (pretarsus) is specified by the expression of homeobox genes, aristaless and Lim1, and its immediate neighbor (distal tarsus) is specified by the expression of a pair of Bar homeobox genes. Here, we show that one additional gene, clawless, which is a homolog of vertebrate Hox11/tlx homeobox gene family and formerly known as C15, is specifically expressed in the pretarsus and cooperatively acts with aristaless to repress Bar and possibly to activate Lim1. Similar to aristaless, the maximal expression of clawless requires Lim1 and its co-factor, Chip. Bar attenuates aristaless and clawless expression through Lim1 repression. Aristaless and Clawless proteins form a complex capable of binding to specific DNA targets, which cannot be well recognized solely by Aristaless or Clawless.

  20. Secondary structure classification of amino-acid sequences using state-space modeling

    OpenAIRE

    Brunnert, Marcus; Krahnke, Tillmann; Urfer, Wolfgang

    2001-01-01

    The secondary structure classification of amino acid sequences can be carried out by a statistical analysis of sequence and structure data using state-space models. Aiming at this classification, a modified filter algorithm programmed in S is applied to data of three proteins. The application leads to correct classifications of two proteins even when using relatively simple estimation methods for the parameters of the state-space models. Furthermore, it has been shown that the assumed initial...

  1. Automated classification of immunostaining patterns in breast tissue from the human protein atlas.

    Science.gov (United States)

    Swamidoss, Issac Niwas; Kårsnäs, Andreas; Uhlmann, Virginie; Ponnusamy, Palanisamy; Kampf, Caroline; Simonsson, Martin; Wählby, Carolina; Strand, Robin

    2013-01-01

    The Human Protein Atlas (HPA) is an effort to map the location of all human proteins (http://www.proteinatlas.org/). It contains a large number of histological images of sections from human tissue. Tissue micro arrays (TMA) are imaged by a slide scanning microscope, and each image represents a thin slice of a tissue core with a dark brown antibody specific stain and a blue counter stain. When generating antibodies for protein profiling of the human proteome, an important step in the quality control is to compare staining patterns of different antibodies directed towards the same protein. This comparison is an ultimate control that the antibody recognizes the right protein. In this paper, we propose and evaluate different approaches for classifying sub-cellular antibody staining patterns in breast tissue samples. The proposed methods include the computation of various features including gray level co-occurrence matrix (GLCM) features, complex wavelet co-occurrence matrix (CWCM) features, and weighted neighbor distance using compound hierarchy of algorithms representing morphology (WND-CHARM)-inspired features. The extracted features are used into two different multivariate classifiers (support vector machine (SVM) and linear discriminant analysis (LDA) classifier). Before extracting features, we use color deconvolution to separate different tissue components, such as the brownly stained positive regions and the blue cellular regions, in the immuno-stained TMA images of breast tissue. We present classification results based on combinations of feature measurements. The proposed complex wavelet features and the WND-CHARM features have accuracy similar to that of a human expert. Both human experts and the proposed automated methods have difficulties discriminating between nuclear and cytoplasmic staining patterns. This is to a large extent due to mixed staining of nucleus and cytoplasm. Methods for quantification of staining patterns in histopathology have many

  2. Automated classification of immunostaining patterns in breast tissue from the human protein Atlas

    Directory of Open Access Journals (Sweden)

    Issac Niwas Swamidoss

    2013-01-01

    Full Text Available Background: The Human Protein Atlas (HPA is an effort to map the location of all human proteins (http://www.proteinatlas.org/. It contains a large number of histological images of sections from human tissue. Tissue micro arrays (TMA are imaged by a slide scanning microscope, and each image represents a thin slice of a tissue core with a dark brown antibody specific stain and a blue counter stain. When generating antibodies for protein profiling of the human proteome, an important step in the quality control is to compare staining patterns of different antibodies directed towards the same protein. This comparison is an ultimate control that the antibody recognizes the right protein. In this paper, we propose and evaluate different approaches for classifying sub-cellular antibody staining patterns in breast tissue samples. Materials and Methods: The proposed methods include the computation of various features including gray level co-occurrence matrix (GLCM features, complex wavelet co-occurrence matrix (CWCM features, and weighted neighbor distance using compound hierarchy of algorithms representing morphology (WND-CHARM-inspired features. The extracted features are used into two different multivariate classifiers (support vector machine (SVM and linear discriminant analysis (LDA classifier. Before extracting features, we use color deconvolution to separate different tissue components, such as the brownly stained positive regions and the blue cellular regions, in the immuno-stained TMA images of breast tissue. Results: We present classification results based on combinations of feature measurements. The proposed complex wavelet features and the WND-CHARM features have accuracy similar to that of a human expert. Conclusions: Both human experts and the proposed automated methods have difficulties discriminating between nuclear and cytoplasmic staining patterns. This is to a large extent due to mixed staining of nucleus and cytoplasm. Methods for

  3. Comparative analysis of chromatin binding by Sex Comb on Midleg (SCM) and other polycomb group repressors at a Drosophila Hox gene.

    Science.gov (United States)

    Wang, Liangjun; Jahren, Neal; Miller, Ellen L; Ketel, Carrie S; Mallin, Daniel R; Simon, Jeffrey A

    2010-06-01

    Sex Comb on Midleg (SCM) is a transcriptional repressor in the Polycomb group (PcG), but its molecular role in PcG silencing is not known. Although SCM can interact with Polycomb repressive complex 1 (PRC1) in vitro, biochemical studies have indicated that SCM is not a core constituent of PRC1 or PRC2. Nevertheless, SCM is just as critical for Drosophila Hox gene silencing as canonical subunits of these well-characterized PcG complexes. To address functional relationships between SCM and other PcG components, we have performed chromatin immunoprecipitation studies using cultured Drosophila Schneider line 2 (S2) cells and larval imaginal discs. We find that SCM associates with a Polycomb response element (PRE) upstream of the Ubx gene which also binds PRC1, PRC2, and the DNA-binding PcG protein Pleiohomeotic (PHO). However, SCM is retained at this Ubx PRE despite genetic disruption or knockdown of PHO, PRC1, or PRC2, suggesting that SCM chromatin targeting does not require prior association of these other PcG components. Chromatin immunoprecipitations (IPs) to test the consequences of SCM genetic disruption or knockdown revealed that PHO association is unaffected, but reduced levels of PRE-bound PRC2 and PRC1 were observed. We discuss these results in light of current models for recruitment of PcG complexes to chromatin targets.

  4. Comparative Analysis of Chromatin Binding by Sex Comb on Midleg (SCM) and Other Polycomb Group Repressors at a Drosophila Hox Gene▿

    Science.gov (United States)

    Wang, Liangjun; Jahren, Neal; Miller, Ellen L.; Ketel, Carrie S.; Mallin, Daniel R.; Simon, Jeffrey A.

    2010-01-01

    Sex Comb on Midleg (SCM) is a transcriptional repressor in the Polycomb group (PcG), but its molecular role in PcG silencing is not known. Although SCM can interact with Polycomb repressive complex 1 (PRC1) in vitro, biochemical studies have indicated that SCM is not a core constituent of PRC1 or PRC2. Nevertheless, SCM is just as critical for Drosophila Hox gene silencing as canonical subunits of these well-characterized PcG complexes. To address functional relationships between SCM and other PcG components, we have performed chromatin immunoprecipitation studies using cultured Drosophila Schneider line 2 (S2) cells and larval imaginal discs. We find that SCM associates with a Polycomb response element (PRE) upstream of the Ubx gene which also binds PRC1, PRC2, and the DNA-binding PcG protein Pleiohomeotic (PHO). However, SCM is retained at this Ubx PRE despite genetic disruption or knockdown of PHO, PRC1, or PRC2, suggesting that SCM chromatin targeting does not require prior association of these other PcG components. Chromatin immunoprecipitations (IPs) to test the consequences of SCM genetic disruption or knockdown revealed that PHO association is unaffected, but reduced levels of PRE-bound PRC2 and PRC1 were observed. We discuss these results in light of current models for recruitment of PcG complexes to chromatin targets. PMID:20351181

  5. Transcription and Regulation of the Bidirectional Hydrogenase in the Cyanobacterium Nostoc sp. Strain PCC 7120▿

    Science.gov (United States)

    Sjöholm, Johannes; Oliveira, Paulo; Lindblad, Peter

    2007-01-01

    The filamentous, heterocystous cyanobacterium Nostoc sp. strain PCC 7120 (Anabaena sp. strain PCC 7120) possesses an uptake hydrogenase and a bidirectional enzyme, the latter being capable of catalyzing both H2 production and evolution. The completely sequenced genome of Nostoc sp. strain PCC 7120 reveals that the five structural genes encoding the bidirectional hydrogenase (hoxEFUYH) are separated in two clusters at a distance of approximately 8.8 kb. The transcription of the hox genes was examined under nitrogen-fixing conditions, and the results demonstrate that the cluster containing hoxE and hoxF can be transcribed as one polycistronic unit together with the open reading frame alr0750. The second cluster, containing hoxU, hoxY, and hoxH, is transcribed together with alr0763 and alr0765, located between the hox genes. Moreover, alr0760 and alr0761 form an additional larger operon. Nevertheless, Northern blot hybridizations revealed a rather complex transcription pattern in which the different hox genes are expressed differently. Transcriptional start points (TSPs) were identified 66 and 57 bp upstream from the start codon of alr0750 and hoxU, respectively. The transcriptions of the two clusters containing the hox genes are both induced under anaerobic conditions concomitantly with the induction of a higher level of hydrogenase activity. An additional TSP, within the annotated alr0760, 244 bp downstream from the suggested translation start codon, was identified. Electrophoretic mobility shift assays with purified LexA from Nostoc sp. strain PCC 7120 demonstrated specific interactions between the transcriptional regulator and both hox promoter regions. However, when LexA from Synechocystis sp. strain PCC 6803 was used, the purified protein interacted only with the promoter region of the alr0750-hoxE-hoxF operon. A search of the whole Nostoc sp. strain PCC 7120 genome demonstrated the presence of 216 putative LexA binding sites in total, including recA and rec

  6. Information assessment on predicting protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Gerstein Mark

    2004-10-01

    Full Text Available Abstract Background Identifying protein-protein interactions is fundamental for understanding the molecular machinery of the cell. Proteome-wide studies of protein-protein interactions are of significant value, but the high-throughput experimental technologies suffer from high rates of both false positive and false negative predictions. In addition to high-throughput experimental data, many diverse types of genomic data can help predict protein-protein interactions, such as mRNA expression, localization, essentiality, and functional annotation. Evaluations of the information contributions from different evidences help to establish more parsimonious models with comparable or better prediction accuracy, and to obtain biological insights of the relationships between protein-protein interactions and other genomic information. Results Our assessment is based on the genomic features used in a Bayesian network approach to predict protein-protein interactions genome-wide in yeast. In the special case, when one does not have any missing information about any of the features, our analysis shows that there is a larger information contribution from the functional-classification than from expression correlations or essentiality. We also show that in this case alternative models, such as logistic regression and random forest, may be more effective than Bayesian networks for predicting interactions. Conclusions In the restricted problem posed by the complete-information subset, we identified that the MIPS and Gene Ontology (GO functional similarity datasets as the dominating information contributors for predicting the protein-protein interactions under the framework proposed by Jansen et al. Random forests based on the MIPS and GO information alone can give highly accurate classifications. In this particular subset of complete information, adding other genomic data does little for improving predictions. We also found that the data discretizations used in the

  7. Interplay between structure and magnetism in HoxPr1-x alloys. 2. Resonant x-ray magnetic scattering

    DEFF Research Database (Denmark)

    Vigliante, A.; Christensen, M.J.; Hill, J.P.

    1998-01-01

    X-ray-scattering techniques have been used to study the crystal and magnetic structures of HoxPr1-x alloys in the form of thin films. Three distinct crystal structures are found as a function of concentration x, each of which has a characteristic magnetic structure. For x greater than or equal to 0.......6 a hexagonal-close-packed phase is found with the magnetic moments ordered in a basal-plane helix, whereas for 0.4 less than or equal to x... hexagonal-close-packed and remain nonmagnetic down to the lowest temperatures studied. Using x-ray magnetic resonance scattering techniques, we demonstrate that a small, static spin-density wave is induced within the alloy 5d band at both the Pr and Ho sites in both of the magnetically ordered phases...

  8. Multi-label literature classification based on the Gene Ontology graph

    Directory of Open Access Journals (Sweden)

    Lu Xinghua

    2008-12-01

    Full Text Available Abstract Background The Gene Ontology is a controlled vocabulary for representing knowledge related to genes and proteins in a computable form. The current effort of manually annotating proteins with the Gene Ontology is outpaced by the rate of accumulation of biomedical knowledge in literature, which urges the development of text mining approaches to facilitate the process by automatically extracting the Gene Ontology annotation from literature. The task is usually cast as a text classification problem, and contemporary methods are confronted with unbalanced training data and the difficulties associated with multi-label classification. Results In this research, we investigated the methods of enhancing automatic multi-label classification of biomedical literature by utilizing the structure of the Gene Ontology graph. We have studied three graph-based multi-label classification algorithms, including a novel stochastic algorithm and two top-down hierarchical classification methods for multi-label literature classification. We systematically evaluated and compared these graph-based classification algorithms to a conventional flat multi-label algorithm. The results indicate that, through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods can significantly improve predictions of the Gene Ontology terms implied by the analyzed text. Furthermore, the graph-based multi-label classifiers are capable of suggesting Gene Ontology annotations (to curators that are closely related to the true annotations even if they fail to predict the true ones directly. A software package implementing the studied algorithms is available for the research community. Conclusion Through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods have better potential than the conventional flat multi-label classification approach to facilitate

  9. Molecular mechanisms of conformational specificity: A study of Hox in vivo target DNA binding specificities and the structure of a Ure2p mutation that affects fibril formation rates

    Science.gov (United States)

    Bauer, William Joseph, Jr.

    The fate of an individual cell, or even an entire organism, is often determined by minute, yet very specific differences in the conformation of a single protein species. Very often, proteins take on alternate folds or even side chain conformations to deal with different situations present within the cell. These differences can be as large as a whole domain or as subtle as the alteration of a single amino acid side chain. Yet, even these seemingly minor side chain conformational differences can determine the development of a cell type during differentiation or even dictate whether a cell will live or die. Two examples of situations where minor conformational differences within a specific protein could lead to major differences in the life cycle of a cell are described herein. The first example describes the variations seen in DNA conformations which can lead to slightly different Hox protein binding conformations responsible for recognizing biologically relevant regulatory sites. These specific differences occur in the minor groove of the bound DNA and are limited to the conformation of only two side chains. The conformation of the bound DNA, however, is not solely determined by the sequence of the DNA, as multiple sequences can result in the same DNA conformation. The second example takes place in the context of a yeast prion protein which contains a mutation that decreases the frequency at which fibrils form. While the specific interactions leading to this physiological change were not directly detected, it can be ascertained from the crystal structure that the structural changes are subtle and most likely involve another binding partner. In both cases, these conformational changes are very slight but have a profound effect on the downstream processes.

  10. Molecular cloning and developmental expression of Tlx (Hox11) genes in zebrafish (Danio rerio).

    Science.gov (United States)

    Langenau, D M; Palomero, T; Kanki, J P; Ferrando, A A; Zhou, Y; Zon, L I; Look, A T

    2002-09-01

    Tlx (Hox11) genes are orphan homeobox genes that play critical roles in the regulation of early developmental processes in vertebrates. Here, we report the identification and expression patterns of three members of the zebrafish Tlx family. These genes share similar, but not identical, expression patterns with other vertebrate Tlx-1 and Tlx-3 genes. Tlx-1 is expressed early in the developing hindbrain and pharyngeal arches, and later in the putative splenic primordium. However, unlike its orthologues, zebrafish Tlx-1 is not expressed in the cranial sensory ganglia or spinal cord. Two homologues of Tlx-3 were identified: Tlx-3a and Tlx-3b, which are both expressed in discrete regions of the developing nervous system, including the cranial sensory ganglia and Rohon-Beard neurons. However, only Tlx-3a is expressed in the statoacoustic cranial ganglia, enteric neurons and non-neural tissues such as the fin bud and pharyngeal arches and Tlx-3b is only expressed in the dorsal root ganglia. Copyright 2002 Elsevier Science Ireland Ltd.

  11. A structural classification of substrate-binding proteins

    NARCIS (Netherlands)

    Berntsson, Ronnie P. -A.; Smits, Sander H. J.; Schmitt, Lutz; Slotboom, Dirk-Jan; Poolman, Bert

    2010-01-01

    Substrate-binding proteins (SBP) are associated with a wide variety of protein complexes. The proteins are part of ATP-binding cassette transporters for substrate uptake, ion gradient driven transporters, DNA-binding proteins, as well as channels and receptors from both pro-and eukaryotes. A wealth

  12. An improved classification of G-protein-coupled receptors using sequence-derived features

    Directory of Open Access Journals (Sweden)

    Peng Zhen-Ling

    2010-08-01

    Full Text Available Abstract Background G-protein-coupled receptors (GPCRs play a key role in diverse physiological processes and are the targets of almost two-thirds of the marketed drugs. The 3 D structures of GPCRs are largely unavailable; however, a large number of GPCR primary sequences are known. To facilitate the identification and characterization of novel receptors, it is therefore very valuable to develop a computational method to accurately predict GPCRs from the protein primary sequences. Results We propose a new method called PCA-GPCR, to predict GPCRs using a comprehensive set of 1497 sequence-derived features. The principal component analysis is first employed to reduce the dimension of the feature space to 32. Then, the resulting 32-dimensional feature vectors are fed into a simple yet powerful classification algorithm, called intimate sorting, to predict GPCRs at five levels. The prediction at the first level determines whether a protein is a GPCR or a non-GPCR. If it is predicted to be a GPCR, then it will be further predicted into certain family, subfamily, sub-subfamily and subtype by the classifiers at the second, third, fourth, and fifth levels, respectively. To train the classifiers applied at five levels, a non-redundant dataset is carefully constructed, which contains 3178, 1589, 4772, 4924, and 2741 protein sequences at the respective levels. Jackknife tests on this training dataset show that the overall accuracies of PCA-GPCR at five levels (from the first to the fifth can achieve up to 99.5%, 88.8%, 80.47%, 80.3%, and 92.34%, respectively. We further perform predictions on a dataset of 1238 GPCRs at the second level, and on another two datasets of 167 and 566 GPCRs respectively at the fourth level. The overall prediction accuracies of our method are consistently higher than those of the existing methods to be compared. Conclusions The comprehensive set of 1497 features is believed to be capable of capturing information about amino acid

  13. Annotation and Classification of CRISPR-Cas Systems.

    Science.gov (United States)

    Makarova, Kira S; Koonin, Eugene V

    2015-01-01

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas (CRISPR-associated proteins) is a prokaryotic adaptive immune system that is represented in most archaea and many bacteria. Among the currently known prokaryotic defense systems, the CRISPR-Cas genomic loci show unprecedented complexity and diversity. Classification of CRISPR-Cas variants that would capture their evolutionary relationships to the maximum possible extent is essential for comparative genomic and functional characterization of this theoretically and practically important system of adaptive immunity. To this end, a multipronged approach has been developed that combines phylogenetic analysis of the conserved Cas proteins with comparison of gene repertoires and arrangements in CRISPR-Cas loci. This approach led to the current classification of CRISPR-Cas systems into three distinct types and ten subtypes for each of which signature genes have been identified. Comparative genomic analysis of the CRISPR-Cas systems in new archaeal and bacterial genomes performed over the 3 years elapsed since the development of this classification makes it clear that new types and subtypes of CRISPR-Cas need to be introduced. Moreover, this classification system captures only part of the complexity of CRISPR-Cas organization and evolution, due to the intrinsic modularity and evolutionary mobility of these immunity systems, resulting in numerous recombinant variants. Moreover, most of the cas genes evolve rapidly, complicating the family assignment for many Cas proteins and the use of family profiles for the recognition of CRISPR-Cas subtype signatures. Further progress in the comparative analysis of CRISPR-Cas systems requires integration of the most sensitive sequence comparison tools, protein structure comparison, and refined approaches for comparison of gene neighborhoods.

  14. Annotation and Classification of CRISPR-Cas Systems

    Science.gov (United States)

    Makarova, Kira S.; Koonin, Eugene V.

    2018-01-01

    The clustered regularly interspaced short palindromic repeats (CRISPR)-Cas (CRISPR-associated proteins) is a prokaryotic adaptive immune system that is represented in most archaea and many bacteria. Among the currently known prokaryotic defense systems, the CRISPR-Cas genomic loci show unprecedented complexity and diversity. Classification of CRISPR-Cas variants that would capture their evolutionary relationships to the maximum possible extent is essential for comparative genomic and functional characterization of this theoretically and practically important system of adaptive immunity. To this end, a multipronged approach has been developed that combines phylogenetic analysis of the conserved Cas proteins with comparison of gene repertoires and arrangements in CRISPR-Cas loci. This approach led to the current classification of CRISPR-Cas systems into three distinct types and ten subtypes for each of which signature genes have been identified. Comparative genomic analysis of the CRISPR-Cas systems in new archaeal and bacterial genomes performed over the 3 years elapsed since the development of this classification makes it clear that new types and subtypes of CRISPR-Cas need to be introduced. Moreover, this classification system captures only part of the complexity of CRISPR-Cas organization and evolution, due to the intrinsic modularity and evolutionary mobility of these immunity systems, resulting in numerous recombinant variants. Moreover, most of the cas genes evolve rapidly, complicating the family assignment for many Cas proteins and the use of family profiles for the recognition of CRISPR-Cas subtype signatures. Further progress in the comparative analysis of CRISPR-Cas systems requires integration of the most sensitive sequence comparison tools, protein structure comparison, and refined approaches for comparison of gene neighborhoods. PMID:25981466

  15. Molecular Pathological Classification of Neurodegenerative Diseases: Turning towards Precision Medicine.

    Science.gov (United States)

    Kovacs, Gabor G

    2016-02-02

    Neurodegenerative diseases (NDDs) are characterized by selective dysfunction and loss of neurons associated with pathologically altered proteins that deposit in the human brain but also in peripheral organs. These proteins and their biochemical modifications can be potentially targeted for therapy or used as biomarkers. Despite a plethora of modifications demonstrated for different neurodegeneration-related proteins, such as amyloid-β, prion protein, tau, α-synuclein, TAR DNA-binding protein 43 (TDP-43), or fused in sarcoma protein (FUS), molecular classification of NDDs relies on detailed morphological evaluation of protein deposits, their distribution in the brain, and their correlation to clinical symptoms together with specific genetic alterations. A further facet of the neuropathology-based classification is the fact that many protein deposits show a hierarchical involvement of brain regions. This has been shown for Alzheimer and Parkinson disease and some forms of tauopathies and TDP-43 proteinopathies. The present paper aims to summarize current molecular classification of NDDs, focusing on the most relevant biochemical and morphological aspects. Since the combination of proteinopathies is frequent, definition of novel clusters of patients with NDDs needs to be considered in the era of precision medicine. Optimally, neuropathological categorizing of NDDs should be translated into in vivo detectable biomarkers to support better prediction of prognosis and stratification of patients for therapy trials.

  16. Domain duplication, divergence, and loss events in vertebrate Msx paralogs reveal phylogenomically informed disease markers.

    Science.gov (United States)

    Finnerty, John R; Mazza, Maureen E; Jezewski, Peter A

    2009-01-20

    Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx) in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal), were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies.

  17. Domain duplication, divergence, and loss events in vertebrate Msx paralogs reveal phylogenomically informed disease markers

    Directory of Open Access Journals (Sweden)

    Finnerty John R

    2009-01-01

    Full Text Available Abstract Background Msx originated early in animal evolution and is implicated in human genetic disorders. To reconstruct the functional evolution of Msx and inform the study of human mutations, we analyzed the phylogeny and synteny of 46 metazoan Msx proteins and tracked the duplication, diversification and loss of conserved motifs. Results Vertebrate Msx sequences sort into distinct Msx1, Msx2 and Msx3 clades. The sister-group relationship between MSX1 and MSX2 reflects their derivation from the 4p/5q chromosomal paralogon, a derivative of the original "MetaHox" cluster. We demonstrate physical linkage between Msx and other MetaHox genes (Hmx, NK1, Emx in a cnidarian. Seven conserved domains, including two Groucho repression domains (N- and C-terminal, were present in the ancestral Msx. In cnidarians, the Groucho domains are highly similar. In vertebrate Msx1, the N-terminal Groucho domain is conserved, while the C-terminal domain diverged substantially, implying a novel function. In vertebrate Msx2 and Msx3, the C-terminal domain was lost. MSX1 mutations associated with ectodermal dysplasia or orofacial clefting disorders map to conserved domains in a non-random fashion. Conclusion Msx originated from a MetaHox ancestor that also gave rise to Tlx, Demox, NK, and possibly EHGbox, Hox and ParaHox genes. Duplication, divergence or loss of domains played a central role in the functional evolution of Msx. Duplicated domains allow pleiotropically expressed proteins to evolve new functions without disrupting existing interaction networks. Human missense sequence variants reside within evolutionarily conserved domains, likely disrupting protein function. This phylogenomic evaluation of candidate disease markers will inform clinical and functional studies.

  18. A Classification Framework Applied to Cancer Gene Expression Profiles

    Directory of Open Access Journals (Sweden)

    Hussein Hijazi

    2013-01-01

    Full Text Available Classification of cancer based on gene expression has provided insight into possible treatment strategies. Thus, developing machine learning methods that can successfully distinguish among cancer subtypes or normal versus cancer samples is important. This work discusses supervised learning techniques that have been employed to classify cancers. Furthermore, a two-step feature selection method based on an attribute estimation method (e.g., ReliefF and a genetic algorithm was employed to find a set of genes that can best differentiate between cancer subtypes or normal versus cancer samples. The application of different classification methods (e.g., decision tree, k-nearest neighbor, support vector machine (SVM, bagging, and random forest on 5 cancer datasets shows that no classification method universally outperforms all the others. However, k-nearest neighbor and linear SVM generally improve the classification performance over other classifiers. Finally, incorporating diverse types of genomic data (e.g., protein-protein interaction data and gene expression increase the prediction accuracy as compared to using gene expression alone.

  19. Analysis of protein folds using protein contact networks

    Indian Academy of Sciences (India)

    is a well-recognized classification system of proteins, which is based on manual in- ... can easily correspond to the information in the 2D matrix. ..... [7] U K Muppirala and Zhijun Li, Protein Engineering, Design & Selection 19, 265 (2006).

  20. Monomorphism in humans and sequence differences among higher primates for a sequence tagged site (STS) in homeo box cluster 2 as assayed by denaturing gradient electrophoresis

    Energy Technology Data Exchange (ETDEWEB)

    Ruano, G.; Ruddle, F.H.; Kidd, K.K. (Yale Univ., New Haven, CT (United States)); Gray, M.R. (Tufts Univ., Boston, MA (United States)); Miki, Tetsuro (Osaka Univ. (Japan)); Ferguson-Smith, A.C. (Inst. of Animal Physiology and Genetics Research, Cambridge (United Kingdom))

    1990-03-11

    The human homeo box cluster 2 (HOX2) contains genes coding for DNA binding proteins involved in developmental control and is highly conserved between mouse and man. The authors have applied in concert the Polymerase Chain Reaction (PCR) and Denaturing Gradient Electrophoresis (DGE) to amplify defined primate HOX2 segments and to detect sequence differences among them. They have sequenced a PstI fragment 4 kb upstream from HOX 2.2 and synthesized primers delimiting both halves of 630 bp segment within it PCR on various unrelated humans and SC-PCR on chimpanzee, gorilla, orangutan and gibbon yielded products of the same length for each primer pair.

  1. Classification of G-protein coupled receptors based on a rich generation of convolutional neural network, N-gram transformation and multiple sequence alignments.

    Science.gov (United States)

    Li, Man; Ling, Cheng; Xu, Qi; Gao, Jingyang

    2018-02-01

    Sequence classification is crucial in predicting the function of newly discovered sequences. In recent years, the prediction of the incremental large-scale and diversity of sequences has heavily relied on the involvement of machine-learning algorithms. To improve prediction accuracy, these algorithms must confront the key challenge of extracting valuable features. In this work, we propose a feature-enhanced protein classification approach, considering the rich generation of multiple sequence alignment algorithms, N-gram probabilistic language model and the deep learning technique. The essence behind the proposed method is that if each group of sequences can be represented by one feature sequence, composed of homologous sites, there should be less loss when the sequence is rebuilt, when a more relevant sequence is added to the group. On the basis of this consideration, the prediction becomes whether a query sequence belonging to a group of sequences can be transferred to calculate the probability that the new feature sequence evolves from the original one. The proposed work focuses on the hierarchical classification of G-protein Coupled Receptors (GPCRs), which begins by extracting the feature sequences from the multiple sequence alignment results of the GPCRs sub-subfamilies. The N-gram model is then applied to construct the input vectors. Finally, these vectors are imported into a convolutional neural network to make a prediction. The experimental results elucidate that the proposed method provides significant performance improvements. The classification error rate of the proposed method is reduced by at least 4.67% (family level I) and 5.75% (family Level II), in comparison with the current state-of-the-art methods. The implementation program of the proposed work is freely available at: https://github.com/alanFchina/CNN .

  2. Expression of β-nerve growth factor and homeobox A10 in experimental cryptorchidism treated with exogenous nerve growth factor.

    Science.gov (United States)

    Xian, Hua; Xian, Yun; Liu, Lili; Wang, Yongjun; He, Jianghong; Huang, Jianfei

    2015-04-01

    With the exception of standard inguinal orchidopexy, treatment of cryptorchidism with human chorionic gonadotropin has been performed for several years; however, its side effects have limited its application. The β‑nerve growth factor (NGF) and homeobox A10 (HoxA10) genes are closely associated with the development of the testes. To the best of our knowledge, whether exogenous NGF alters the endogenous levels of NGF and HoxA10 in cryptorchidism in rats remains to be elucidated. The aim of the present study was to evaluate the gene and protein expression of NGF and HoxA10 in experimental cryptorchidism following treatment with exogenous NGF. A unilateral mechanical cryptorchidism model in Sprague-Dawley rats was established and different concentrations of exogenous NGF were administered to observe the effects of NGF on cryptorchidism. Changes in the gene and protein expression levels of NGF and HoxA10 in the cryptorchid tissues of each group were identified using one step reverse transcription-quantitative polymerase chain reaction, in situ hybridization with digoxigenin‑labeled‑β‑NGF RNA probes, immunofluorescence and immunohistochemistry, respectively. The expression levels of NGF and HoxA10 were markedly higher in the group treated with a high dose of exogenous NGF compared with the group treated with a low dose of exogenous NGF and the group treated with human chorionic gonadotropin. These results confirmed the potential therapeutic effect of exogenous NGF in human cryptorchidism.

  3. Protein clustering and RNA phylogenetic reconstruction of the influenza A [corrected] virus NS1 protein allow an update in classification and identification of motif conservation.

    Science.gov (United States)

    Sevilla-Reyes, Edgar E; Chavaro-Pérez, David A; Piten-Isidro, Elvira; Gutiérrez-González, Luis H; Santos-Mendoza, Teresa

    2013-01-01

    The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis.

  4. CDX2 hox gene product in a rat model of esophageal cancer

    Directory of Open Access Journals (Sweden)

    Rizzetto Christian

    2009-08-01

    Full Text Available Abstract Background Barrett's mucosa is the precursor of esophageal adenocarcinoma. The molecular mechanisms behind Barrett's carcinogenesis are largely unknown. Experimental models of longstanding esophageal reflux of duodenal-gastric contents may provide important information on the biological sequence of the Barrett's oncogenesis. Methods The expression of CDX2 hox-gene product was assessed in a rat model of Barrett's carcinogenesis. Seventy-four rats underwent esophago-jejunostomy with gastric preservation. Excluding perisurgical deaths, the animals were sacrificed at various times after the surgical treatment (Group A: 30 weeks. Results No Cdx2 expression was detected in either squamous epithelia of the proximal esophagus or squamous cell carcinomas. De novo Cdx2 expression was consistently documented in the proliferative zone of the squamous epithelium close to reflux ulcers (Group A: 68%; Group B: 64%; Group C: 80%, multilayered epithelium and intestinal metaplasia (Group A: 9%; Group B: 41%; Group C: 60%, and esophageal adenocarcinomas (Group B: 36%; Group C: 35%. A trend for increasing overall Cdx2 expression was documented during the course of the experiment (p = 0.001. Conclusion De novo expression of Cdx2 is an early event in the spectrum of the lesions induced by experimental gastro-esophageal reflux and should be considered as a key step in the morphogenesis of esophageal adenocarcinoma.

  5. Specificity of molecular interactions in transient protein-protein interaction interfaces.

    Science.gov (United States)

    Cho, Kyu-il; Lee, KiYoung; Lee, Kwang H; Kim, Dongsup; Lee, Doheon

    2006-11-15

    In this study, we investigate what types of interactions are specific to their biological function, and what types of interactions are persistent regardless of their functional category in transient protein-protein heterocomplexes. This is the first approach to analyze protein-protein interfaces systematically at the molecular interaction level in the context of protein functions. We perform systematic analysis at the molecular interaction level using classification and feature subset selection technique prevalent in the field of pattern recognition. To represent the physicochemical properties of protein-protein interfaces, we design 18 molecular interaction types using canonical and noncanonical interactions. Then, we construct input vector using the frequency of each interaction type in protein-protein interface. We analyze the 131 interfaces of transient protein-protein heterocomplexes in PDB: 33 protease-inhibitors, 52 antibody-antigens, 46 signaling proteins including 4 cyclin dependent kinase and 26 G-protein. Using kNN classification and feature subset selection technique, we show that there are specific interaction types based on their functional category, and such interaction types are conserved through the common binding mechanism, rather than through the sequence or structure conservation. The extracted interaction types are C(alpha)-- H...O==C interaction, cation...anion interaction, amine...amine interaction, and amine...cation interaction. With these four interaction types, we achieve the classification success rate up to 83.2% with leave-one-out cross-validation at k = 15. Of these four interaction types, C(alpha)--H...O==C shows binding specificity for protease-inhibitor complexes, while cation-anion interaction is predominant in signaling complexes. The amine ... amine and amine...cation interaction give a minor contribution to the classification accuracy. When combined with these two interactions, they increase the accuracy by 3.8%. In the case of

  6. Interplay of biopharmaceutics, biopharmaceutics drug disposition and salivary excretion classification systems

    Science.gov (United States)

    Idkaidek, Nasir M.

    2013-01-01

    The aim of this commentary is to investigate the interplay of Biopharmaceutics Classification System (BCS), Biopharmaceutics Drug Disposition Classification System (BDDCS) and Salivary Excretion Classification System (SECS). BCS first classified drugs based on permeability and solubility for the purpose of predicting oral drug absorption. Then BDDCS linked permeability with hepatic metabolism and classified drugs based on metabolism and solubility for the purpose of predicting oral drug disposition. On the other hand, SECS classified drugs based on permeability and protein binding for the purpose of predicting the salivary excretion of drugs. The role of metabolism, rather than permeability, on salivary excretion is investigated and the results are not in agreement with BDDCS. Conclusion The proposed Salivary Excretion Classification System (SECS) can be used as a guide for drug salivary excretion based on permeability (not metabolism) and protein binding. PMID:24493977

  7. Nucleoporin Nup98 associates with Trx/MLL and NSL histone-modifying complexes and regulates Hox gene expression.

    Science.gov (United States)

    Pascual-Garcia, Pau; Jeong, Jieun; Capelson, Maya

    2014-10-23

    The nuclear pore complex is a transport channel embedded in the nuclear envelope and made up of 30 different components termed nucleoporins (Nups). In addition to their classical role in transport, a subset of Nups has a conserved role in the regulation of transcription via direct binding to chromatin. The molecular details of this function remain obscure, and it is unknown how metazoan Nups are recruited to their chromatin locations or what transcription steps they regulate. Here, we demonstrate genome-wide and physical association between Nup98 and histone-modifying complexes MBD-R2/NSL [corrected] and Trx/MLL. Importantly, we identify a requirement for MBD-R2 in recruitment of Nup98 to many of its genomic target sites. Consistent with its interaction with the Trx/MLL complex, Nup98 is shown to be necessary for Hox gene expression in developing fly tissues. These findings introduce roles of Nup98 in epigenetic regulation that may underlie the basis of oncogenicity of Nup98 fusions in leukemia.

  8. The intersection of the extrinsic hedgehog and WNT/wingless signals with the intrinsic Hox code underpins branching pattern and tube shape diversity in the drosophila airways.

    Directory of Open Access Journals (Sweden)

    Ryo Matsuda

    2015-01-01

    Full Text Available The tubular networks of the Drosophila respiratory system and our vasculature show distinct branching patterns and tube shapes in different body regions. These local variations are crucial for organ function and organismal fitness. Organotypic patterns and tube geometries in branched networks are typically controlled by variations of extrinsic signaling but the impact of intrinsic factors on branch patterns and shapes is not well explored. Here, we show that the intersection of extrinsic hedgehog(hh and WNT/wingless (wg signaling with the tube-intrinsic Hox code of distinct segments specifies the tube pattern and shape of the Drosophila airways. In the cephalic part of the airways, hh signaling induces expression of the transcription factor (TF knirps (kni in the anterior dorsal trunk (DTa1. kni represses the expression of another TF spalt major (salm, making DTa1 a narrow and long tube. In DTa branches of more posterior metameres, Bithorax Complex (BX-C Hox genes autonomously divert hh signaling from inducing kni, thereby allowing DTa branches to develop as salm-dependent thick and short tubes. Moreover, the differential expression of BX-C genes is partly responsible for the anterior-to-posterior gradual increase of the DT tube diameter through regulating the expression level of Salm, a transcriptional target of WNT/wg signaling. Thus, our results highlight how tube intrinsic differential competence can diversify tube morphology without changing availabilities of extrinsic factors.

  9. Land-cover classification with an expert classification algorithm using digital aerial photographs

    Directory of Open Access Journals (Sweden)

    José L. de la Cruz

    2010-05-01

    Full Text Available The purpose of this study was to evaluate the usefulness of the spectral information of digital aerial sensors in determining land-cover classification using new digital techniques. The land covers that have been evaluated are the following, (1 bare soil, (2 cereals, including maize (Zea mays L., oats (Avena sativa L., rye (Secale cereale L., wheat (Triticum aestivum L. and barley (Hordeun vulgare L., (3 high protein crops, such as peas (Pisum sativum L. and beans (Vicia faba L., (4 alfalfa (Medicago sativa L., (5 woodlands and scrublands, including holly oak (Quercus ilex L. and common retama (Retama sphaerocarpa L., (6 urban soil, (7 olive groves (Olea europaea L. and (8 burnt crop stubble. The best result was obtained using an expert classification algorithm, achieving a reliability rate of 95%. This result showed that the images of digital airborne sensors hold considerable promise for the future in the field of digital classifications because these images contain valuable information that takes advantage of the geometric viewpoint. Moreover, new classification techniques reduce problems encountered using high-resolution images; while reliabilities are achieved that are better than those achieved with traditional methods.

  10. Biochemistry of the tale transcription factors PREP, MEIS, and PBX in vertebrates.

    Science.gov (United States)

    Longobardi, E; Penkov, D; Mateos, D; De Florian, G; Torres, M; Blasi, Francesco

    2014-01-01

    TALE (three amino acids loop extension) homeodomain transcription factors are required in various steps of embryo development, in many adult physiological functions, and are involved in important pathologies. This review focuses on the PREP, MEIS, and PBX sub-families of TALE factors and aims at giving information on their biochemical properties, i.e., structure, interactors, and interaction surfaces. Members of the three sets of protein form dimers in which the common partner is PBX but they can also directly interact with other proteins forming higher-order complexes, in particular HOX. Finally, recent advances in determining the genome-wide DNA-binding sites of PREP1, MEIS1, and PBX1, and their partial correspondence with the binding sites of some HOX proteins, are reviewed. These studies have generated a few general rules that can be applied to all members of the three gene families. PREP and MEIS recognize slightly different consensus sequences: PREP prefers to bind to promoters and to have PBX as a DNA-binding partner; MEIS prefers HOX as partner, and both PREP and MEIS drive PBX to their own binding sites. This outlines the clear individuality of the PREP and MEIS proteins, the former mostly devoted to basic cellular functions, the latter more to developmental functions. Copyright © 2013 Wiley Periodicals, Inc.

  11. The Most Developmentally Truncated Fishes Show Extensive Hox Gene Loss and Miniaturized Genomes

    Science.gov (United States)

    Malmstrøm, Martin; Britz, Ralf; Matschiner, Michael; Tørresen, Ole K; Hadiaty, Renny Kurnia; Yaakob, Norsham; Tan, Heok Hui; Jakobsen, Kjetill Sigurd; Salzburger, Walter; Rüber, Lukas

    2018-01-01

    Abstract The world’s smallest fishes belong to the genus Paedocypris. These miniature fishes are endemic to an extreme habitat: the peat swamp forests in Southeast Asia, characterized by highly acidic blackwater. This threatened habitat is home to a large array of fishes, including a number of miniaturized but also developmentally truncated species. Especially the genus Paedocypris is characterized by profound, organism-wide developmental truncation, resulting in sexually mature individuals of <8 mm in length with a larval phenotype. Here, we report on evolutionary simplification in the genomes of two species of the dwarf minnow genus Paedocypris using whole-genome sequencing. The two species feature unprecedented Hox gene loss and genome reduction in association with their massive developmental truncation. We also show how other genes involved in the development of musculature, nervous system, and skeleton have been lost in Paedocypris, mirroring its highly progenetic phenotype. Further, our analyses suggest two mechanisms responsible for the genome streamlining in Paedocypris in relation to other Cypriniformes: severe intron shortening and reduced repeat content. As the first report on the genomic sequence of a vertebrate species with organism-wide developmental truncation, the results of our work enhance our understanding of genome evolution and how genotypes are translated to phenotypes. In addition, as a naturally simplified system closely related to zebrafish, Paedocypris provides novel insights into vertebrate development. PMID:29684203

  12. The Most Developmentally Truncated Fishes Show Extensive Hox Gene Loss and Miniaturized Genomes.

    Science.gov (United States)

    Malmstrøm, Martin; Britz, Ralf; Matschiner, Michael; Tørresen, Ole K; Hadiaty, Renny Kurnia; Yaakob, Norsham; Tan, Heok Hui; Jakobsen, Kjetill Sigurd; Salzburger, Walter; Rüber, Lukas

    2018-04-01

    The world's smallest fishes belong to the genus Paedocypris. These miniature fishes are endemic to an extreme habitat: the peat swamp forests in Southeast Asia, characterized by highly acidic blackwater. This threatened habitat is home to a large array of fishes, including a number of miniaturized but also developmentally truncated species. Especially the genus Paedocypris is characterized by profound, organism-wide developmental truncation, resulting in sexually mature individuals of <8 mm in length with a larval phenotype. Here, we report on evolutionary simplification in the genomes of two species of the dwarf minnow genus Paedocypris using whole-genome sequencing. The two species feature unprecedented Hox gene loss and genome reduction in association with their massive developmental truncation. We also show how other genes involved in the development of musculature, nervous system, and skeleton have been lost in Paedocypris, mirroring its highly progenetic phenotype. Further, our analyses suggest two mechanisms responsible for the genome streamlining in Paedocypris in relation to other Cypriniformes: severe intron shortening and reduced repeat content. As the first report on the genomic sequence of a vertebrate species with organism-wide developmental truncation, the results of our work enhance our understanding of genome evolution and how genotypes are translated to phenotypes. In addition, as a naturally simplified system closely related to zebrafish, Paedocypris provides novel insights into vertebrate development.

  13. High resolution crystal structure of PedB: a structural basis for the classification of pediocin-like immunity proteins

    Directory of Open Access Journals (Sweden)

    Cha Sun-Shin

    2007-05-01

    Full Text Available Abstract Background Pediocin-like bacteriocins, ribosomally-synthesized antimicrobial peptides, are generally coexpressed with cognate immunity proteins in order to protect the bacteriocin-producer from its own bacteriocin. As a step for understanding the mode of action of immunity proteins, we determined the crystal structure of PedB, a pediocin-like immunity protein conferring immunity to pediocin PP-1. Results The 1.6 Å crystal structure of PedB reveals that PedB consists of an antiparallel four-helix bundle with a flexible C-terminal end. PedB shows structural similarity to an immunity protein against enterocin A (EntA-im but some disparity to an immunity protein against carnobacteriocin B2 (ImB2 in both the C-terminal conformation and the local structure constructed by α3, α4, and their connecting loop. Structure-inspired mutational studies reveal that deletion of the last seven residues of the C-terminus of PedB almost abolished its immunity activity. Conclusion The fact that PedB, EntA-im, and ImB2 share a four-helix bundle structure strongly suggests the structural conservation of this motif in the pediocin-like immunity proteins. The significant difference in the core structure and the C-terminal conformation provides a structural basis for the classification of pediocin-like immunity proteins. Our mutational study using C-terminal-shortened PedBs and the investigation of primary sequence of the C-terminal region, propose that several polar or charged residues in the extreme C-terminus of PedB which is crucial for the immunity are involved in the specific recognition of pediocin PP-1.

  14. SURF'S UP! – Protein classification by surface comparisons

    Indian Academy of Sciences (India)

    Prakash

    encounter large protein families with only a few members of ... server for analysis of functional relationships in protein families, as inferred from protein surface maps comparison ... features, SURF'S UP! can work with models obtained from comparative modelling. ... 1997) or, if the user is confident in the quality of automated.

  15. Developing Novel Protein-based Materials using Ultrabithorax: Production, Characterization, and Functionalization

    Science.gov (United States)

    Huang, Zhao

    2011-12-01

    Compared to 'conventional' materials made from metal, glass, or ceramics, protein-based materials have unique mechanical properties. Furthermore, the morphology, mechanical properties, and functionality of protein-based materials may be optimized via sequence engineering for use in a variety of applications, including textile materials, biosensors, and tissue engineering scaffolds. The development of recombinant DNA technology has enabled the production and engineering of protein-based materials ex vivo. However, harsh production conditions can compromise the mechanical properties of protein-based materials and diminish their ability to incorporate functional proteins. Developing a new generation of protein-based materials is crucial to (i) improve materials assembly conditions, (ii) create novel mechanical properties, and (iii) expand the capacity to carry functional protein/peptide sequences. This thesis describes development of novel protein-based materials using Ultrabithorax, a member of the Hox family of proteins that regulate developmental pathways in Drosophila melanogaster. The experiments presented (i) establish the conditions required for the assembly of Ubx-based materials, (ii) generate a wide range of Ubx morphologies, (iii) examine the mechanical properties of Ubx fibers, (iv) incorporate protein functions to Ubx-based materials via gene fusion, (v) pattern protein functions within the Ubx materials, and (vi) examine the biocompatibility of Ubx materials in vitro. Ubx-based materials assemble at mild conditions compatible with protein folding and activity, which enables Ubx chimeric materials to retain the function of appended proteins in spatial patterns determined by materials assembly. Ubx-based materials also display mechanical properties comparable to existing protein-based materials and demonstrate good biocompatibility with living cells in vitro. Taken together, this research demonstrates the unique features and future potential of novel Ubx

  16. Catalytic Properties of the Isolated Diaphorase Fragment of the NAD+-Reducing [NiFe]-Hydrogenase from Ralstonia eutropha

    Science.gov (United States)

    Lauterbach, Lars; Idris, Zulkifli; Vincent, Kylie A.; Lenz, Oliver

    2011-01-01

    The NAD+-reducing soluble hydrogenase (SH) from Ralstonia eutropha H16 catalyzes the H2-driven reduction of NAD+, as well as reverse electron transfer from NADH to H+, in the presence of O2. It comprises six subunits, HoxHYFUI2, and incorporates a [NiFe] H+/H2 cycling catalytic centre, two non-covalently bound flavin mononucleotide (FMN) groups and an iron-sulfur cluster relay for electron transfer. This study provides the first characterization of the diaphorase sub-complex made up of HoxF and HoxU. Sequence comparisons with the closely related peripheral subunits of Complex I in combination with UV/Vis spectroscopy and the quantification of the metal and FMN content revealed that HoxFU accommodates a [2Fe2S] cluster, FMN and a series of [4Fe4S] clusters. Protein film electrochemistry (PFE) experiments show clear electrocatalytic activity for both NAD+ reduction and NADH oxidation with minimal overpotential relative to the potential of the NAD+/NADH couple. Michaelis-Menten constants of 56 µM and 197 µM were determined for NADH and NAD+, respectively. Catalysis in both directions is product inhibited with K I values of around 0.2 mM. In PFE experiments, the electrocatalytic current was unaffected by O2, however in aerobic solution assays, a moderate superoxide production rate of 54 nmol per mg of protein was observed, meaning that the formation of reactive oxygen species (ROS) observed for the native SH can be attributed mainly to HoxFU. The results are discussed in terms of their implications for aerobic functioning of the SH and possible control mechanism for the direction of catalysis. PMID:22016788

  17. A novel Multi-Agent Ada-Boost algorithm for predicting protein structural class with the information of protein secondary structure.

    Science.gov (United States)

    Fan, Ming; Zheng, Bin; Li, Lihua

    2015-10-01

    Knowledge of the structural class of a given protein is important for understanding its folding patterns. Although a lot of efforts have been made, it still remains a challenging problem for prediction of protein structural class solely from protein sequences. The feature extraction and classification of proteins are the main problems in prediction. In this research, we extended our earlier work regarding these two aspects. In protein feature extraction, we proposed a scheme by calculating the word frequency and word position from sequences of amino acid, reduced amino acid, and secondary structure. For an accurate classification of the structural class of protein, we developed a novel Multi-Agent Ada-Boost (MA-Ada) method by integrating the features of Multi-Agent system into Ada-Boost algorithm. Extensive experiments were taken to test and compare the proposed method using four benchmark datasets in low homology. The results showed classification accuracies of 88.5%, 96.0%, 88.4%, and 85.5%, respectively, which are much better compared with the existing methods. The source code and dataset are available on request.

  18. RNAi analysis of Deformed, proboscipedia and Sex combs reduced in the milkweed bug Oncopeltus fasciatus: novel roles for Hox genes in the hemipteran head.

    Science.gov (United States)

    Hughes, C L; Kaufman, T C

    2000-09-01

    Insects have evolved a large variety of specialized feeding strategies, with a corresponding variability in mouthpart morphology. We have, however, little understanding of the developmental mechanisms that underlie this diversity. Until recently it was difficult to perform any analysis of gene function outside of the genetic model insects Drosophila melanogaster and Tribolium castaneum. In this paper, we report the use of dsRNA-mediated interference (RNAi) to dissect gene function in the development of the milkweed bug Oncopeltus fasciatus, which has specialized suctorial mouthparts. The Hox genes Deformed (Dfd), proboscipedia (pb) and Sex combs reduced (Scr) have previously been shown to be expressed in the gnathal appendages of this species. Strikingly, the milkweed bug was found to have an unusual expression pattern of pb. Here, by analyzing single and combination RNAi depletions, we find that Dfd, pb and Scr are used in the milkweed bug to specify the identity of the mouthparts. The exact roles of the genes, however, are different from what is known in the two genetic model insects. The maxillary appendages in the bug are determined by the activities of the genes Dfd and Scr, rather than Dfd and pb as in the fly and beetle. The mandibular appendages are specified by Dfd, but their unique morphology in Oncopeltus suggests that Dfd's target genes are different. As in flies and beetles, the labium is specified by the combined activities of pb and Scr, but again, the function of pb appears to be different. Additionally, the regulatory control of pb by the other two genes seems to be different in the bug than in either of the other species. These novelties in Hox function, expression pattern and regulatory relationships may have been important for the evolution of the unique Hemipteran head.

  19. Classification-based comparison of pre-processing methods for interpretation of mass spectrometry generated clinical datasets

    Directory of Open Access Journals (Sweden)

    Hoefsloot Huub CJ

    2009-05-01

    Full Text Available Abstract Background Mass spectrometry is increasingly being used to discover proteins or protein profiles associated with disease. Experimental design of mass-spectrometry studies has come under close scrutiny and the importance of strict protocols for sample collection is now understood. However, the question of how best to process the large quantities of data generated is still unanswered. Main challenges for the analysis are the choice of proper pre-processing and classification methods. While these two issues have been investigated in isolation, we propose to use the classification of patient samples as a clinically relevant benchmark for the evaluation of pre-processing methods. Results Two in-house generated clinical SELDI-TOF MS datasets are used in this study as an example of high throughput mass-spectrometry data. We perform a systematic comparison of two commonly used pre-processing methods as implemented in Ciphergen ProteinChip Software and in the Cromwell package. With respect to reproducibility, Ciphergen and Cromwell pre-processing are largely comparable. We find that the overlap between peaks detected by either Ciphergen ProteinChip Software or Cromwell is large. This is especially the case for the more stringent peak detection settings. Moreover, similarity of the estimated intensities between matched peaks is high. We evaluate the pre-processing methods using five different classification methods. Classification is done in a double cross-validation protocol using repeated random sampling to obtain an unbiased estimate of classification accuracy. No pre-processing method significantly outperforms the other for all peak detection settings evaluated. Conclusion We use classification of patient samples as a clinically relevant benchmark for the evaluation of pre-processing methods. Both pre-processing methods lead to similar classification results on an ovarian cancer and a Gaucher disease dataset. However, the settings for pre

  20. Chromatin-bound IκBα regulates a subset of polycomb target genes in differentiation and cancer.

    Science.gov (United States)

    Mulero, María Carmen; Ferres-Marco, Dolors; Islam, Abul; Margalef, Pol; Pecoraro, Matteo; Toll, Agustí; Drechsel, Nils; Charneco, Cristina; Davis, Shelly; Bellora, Nicolás; Gallardo, Fernando; López-Arribillaga, Erika; Asensio-Juan, Elena; Rodilla, Verónica; González, Jessica; Iglesias, Mar; Shih, Vincent; Mar Albà, M; Di Croce, Luciano; Hoffmann, Alexander; Miyamoto, Shigeki; Villà-Freixa, Jordi; López-Bigas, Nuria; Keyes, William M; Domínguez, María; Bigas, Anna; Espinosa, Lluís

    2013-08-12

    IκB proteins are the primary inhibitors of NF-κB. Here, we demonstrate that sumoylated and phosphorylated IκBα accumulates in the nucleus of keratinocytes and interacts with histones H2A and H4 at the regulatory region of HOX and IRX genes. Chromatin-bound IκBα modulates Polycomb recruitment and imparts their competence to be activated by TNFα. Mutations in the Drosophila IκBα gene cactus enhance the homeotic phenotype of Polycomb mutants, which is not counteracted by mutations in dorsal/NF-κB. Oncogenic transformation of keratinocytes results in cytoplasmic IκBα translocation associated with a massive activation of Hox. Accumulation of cytoplasmic IκBα was found in squamous cell carcinoma (SCC) associated with IKK activation and HOX upregulation. Copyright © 2013 Elsevier Inc. All rights reserved.

  1. A protein and mRNA expression-based classification of gastric cancer.

    Science.gov (United States)

    Setia, Namrata; Agoston, Agoston T; Han, Hye S; Mullen, John T; Duda, Dan G; Clark, Jeffrey W; Deshpande, Vikram; Mino-Kenudson, Mari; Srivastava, Amitabh; Lennerz, Jochen K; Hong, Theodore S; Kwak, Eunice L; Lauwers, Gregory Y

    2016-07-01

    The overall survival of gastric carcinoma patients remains poor despite improved control over known risk factors and surveillance. This highlights the need for new classifications, driven towards identification of potential therapeutic targets. Using sophisticated molecular technologies and analysis, three groups recently provided genetic and epigenetic molecular classifications of gastric cancer (The Cancer Genome Atlas, 'Singapore-Duke' study, and Asian Cancer Research Group). Suggested by these classifications, here, we examined the expression of 14 biomarkers in a cohort of 146 gastric adenocarcinomas and performed unsupervised hierarchical clustering analysis using less expensive and widely available immunohistochemistry and in situ hybridization. Ultimately, we identified five groups of gastric cancers based on Epstein-Barr virus (EBV) positivity, microsatellite instability, aberrant E-cadherin, and p53 expression; the remaining cases constituted a group characterized by normal p53 expression. In addition, the five categories correspond to the reported molecular subgroups by virtue of clinicopathologic features. Furthermore, evaluation between these clusters and survival using the Cox proportional hazards model showed a trend for superior survival in the EBV and microsatellite-instable related adenocarcinomas. In conclusion, we offer as a proposal a simplified algorithm that is able to reproduce the recently proposed molecular subgroups of gastric adenocarcinoma, using immunohistochemical and in situ hybridization techniques.

  2. Nonlinear programming for classification problems in machine learning

    Science.gov (United States)

    Astorino, Annabella; Fuduli, Antonio; Gaudioso, Manlio

    2016-10-01

    We survey some nonlinear models for classification problems arising in machine learning. In the last years this field has become more and more relevant due to a lot of practical applications, such as text and web classification, object recognition in machine vision, gene expression profile analysis, DNA and protein analysis, medical diagnosis, customer profiling etc. Classification deals with separation of sets by means of appropriate separation surfaces, which is generally obtained by solving a numerical optimization model. While linear separability is the basis of the most popular approach to classification, the Support Vector Machine (SVM), in the recent years using nonlinear separating surfaces has received some attention. The objective of this work is to recall some of such proposals, mainly in terms of the numerical optimization models. In particular we tackle the polyhedral, ellipsoidal, spherical and conical separation approaches and, for some of them, we also consider the semisupervised versions.

  3. Diversity and transcription of proteases involved in the maturation of hydrogenases in Nostoc punctiforme ATCC 29133 and Nostoc sp. strain PCC 7120

    Science.gov (United States)

    2009-01-01

    Background The last step in the maturation process of the large subunit of [NiFe]-hydrogenases is a proteolytic cleavage of the C-terminal by a hydrogenase specific protease. Contrary to other accessory proteins these hydrogenase proteases are believed to be specific whereby one type of hydrogenases specific protease only cleaves one type of hydrogenase. In cyanobacteria this is achieved by the gene product of either hupW or hoxW, specific for the uptake or the bidirectional hydrogenase respectively. The filamentous cyanobacteria Nostoc punctiforme ATCC 29133 and Nostoc sp strain PCC 7120 may contain a single uptake hydrogenase or both an uptake and a bidirectional hydrogenase respectively. Results In order to examine these proteases in cyanobacteria, transcriptional analyses were performed of hupW in Nostoc punctiforme ATCC 29133 and hupW and hoxW in Nostoc sp. strain PCC 7120. These studies revealed numerous transcriptional start points together with putative binding sites for NtcA (hupW) and LexA (hoxW). In order to investigate the diversity and specificity among hydrogeanse specific proteases we constructed a phylogenetic tree which revealed several subgroups that showed a striking resemblance to the subgroups previously described for [NiFe]-hydrogenases. Additionally the proteases specificity was also addressed by amino acid sequence analysis and protein-protein docking experiments with 3D-models derived from bioinformatic studies. These studies revealed a so called "HOXBOX"; an amino acid sequence specific for protease of Hox-type which might be involved in docking with the large subunit of the hydrogenase. Conclusion Our findings suggest that the hydrogenase specific proteases are under similar regulatory control as the hydrogenases they cleave. The result from the phylogenetic study also indicates that the hydrogenase and the protease have co-evolved since ancient time and suggests that at least one major horizontal gene transfer has occurred. This co

  4. Diversity and transcription of proteases involved in the maturation of hydrogenases in Nostoc punctiforme ATCC 29133 and Nostoc sp. strain PCC 7120

    Directory of Open Access Journals (Sweden)

    Lindblad Peter

    2009-03-01

    Full Text Available Abstract Background The last step in the maturation process of the large subunit of [NiFe]-hydrogenases is a proteolytic cleavage of the C-terminal by a hydrogenase specific protease. Contrary to other accessory proteins these hydrogenase proteases are believed to be specific whereby one type of hydrogenases specific protease only cleaves one type of hydrogenase. In cyanobacteria this is achieved by the gene product of either hupW or hoxW, specific for the uptake or the bidirectional hydrogenase respectively. The filamentous cyanobacteria Nostoc punctiforme ATCC 29133 and Nostoc sp strain PCC 7120 may contain a single uptake hydrogenase or both an uptake and a bidirectional hydrogenase respectively. Results In order to examine these proteases in cyanobacteria, transcriptional analyses were performed of hupW in Nostoc punctiforme ATCC 29133 and hupW and hoxW in Nostoc sp. strain PCC 7120. These studies revealed numerous transcriptional start points together with putative binding sites for NtcA (hupW and LexA (hoxW. In order to investigate the diversity and specificity among hydrogeanse specific proteases we constructed a phylogenetic tree which revealed several subgroups that showed a striking resemblance to the subgroups previously described for [NiFe]-hydrogenases. Additionally the proteases specificity was also addressed by amino acid sequence analysis and protein-protein docking experiments with 3D-models derived from bioinformatic studies. These studies revealed a so called "HOXBOX"; an amino acid sequence specific for protease of Hox-type which might be involved in docking with the large subunit of the hydrogenase. Conclusion Our findings suggest that the hydrogenase specific proteases are under similar regulatory control as the hydrogenases they cleave. The result from the phylogenetic study also indicates that the hydrogenase and the protease have co-evolved since ancient time and suggests that at least one major horizontal gene transfer

  5. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

    Science.gov (United States)

    Krallinger, Martin; Vazquez, Miguel; Leitner, Florian; Salgado, David; Chatr-Aryamontri, Andrew; Winter, Andrew; Perfetto, Livia; Briganti, Leonardo; Licata, Luana; Iannuccelli, Marta; Castagnoli, Luisa; Cesareni, Gianni; Tyers, Mike; Schneider, Gerold; Rinaldi, Fabio; Leaman, Robert; Gonzalez, Graciela; Matos, Sergio; Kim, Sun; Wilbur, W John; Rocha, Luis; Shatkay, Hagit; Tendulkar, Ashish V; Agarwal, Shashank; Liu, Feifan; Wang, Xinglong; Rak, Rafal; Noto, Keith; Elkan, Charles; Lu, Zhiyong; Dogan, Rezarta Islamaj; Fontaine, Jean-Fred; Andrade-Navarro, Miguel A; Valencia, Alfonso

    2011-10-03

    Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were evaluated by comparing

  6. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text

    Science.gov (United States)

    2011-01-01

    Background Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. Results A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were

  7. Collective Classification in Network Data

    OpenAIRE

    Sen, Prithviraj; Namata, Galileo; Bilgic, Mustafa; Getoor, Lise; University of Maryland; Galligher, Brian; Eliassi-Rad, Tina

    2008-01-01

    Many real-world applications produce networked data such as the world-wide web (hypertext documents connected via hyperlinks), social networks (for example, people connected by friendship links), communication networks (computers connected via communication links) and biological networks (for example, protein interaction networks). A recent focus in machine learning research has been to extend traditional machine learning classification techniques to classify nodes in such networks. In this a...

  8. Feature-Based Classification of Amino Acid Substitutions outside Conserved Functional Protein Domains

    Directory of Open Access Journals (Sweden)

    Branislava Gemovic

    2013-01-01

    Full Text Available There are more than 500 amino acid substitutions in each human genome, and bioinformatics tools irreplaceably contribute to determination of their functional effects. We have developed feature-based algorithm for the detection of mutations outside conserved functional domains (CFDs and compared its classification efficacy with the most commonly used phylogeny-based tools, PolyPhen-2 and SIFT. The new algorithm is based on the informational spectrum method (ISM, a feature-based technique, and statistical analysis. Our dataset contained neutral polymorphisms and mutations associated with myeloid malignancies from epigenetic regulators ASXL1, DNMT3A, EZH2, and TET2. PolyPhen-2 and SIFT had significantly lower accuracies in predicting the effects of amino acid substitutions outside CFDs than expected, with especially low sensitivity. On the other hand, only ISM algorithm showed statistically significant classification of these sequences. It outperformed PolyPhen-2 and SIFT by 15% and 13%, respectively. These results suggest that feature-based methods, like ISM, are more suitable for the classification of amino acid substitutions outside CFDs than phylogeny-based tools.

  9. SURF'S UP! – Protein classification by surface comparisons

    Indian Academy of Sciences (India)

    2006-12-12

    Dec 12, 2006 ... Large-scale genome sequencing and structural genomics projects generate numerous sequences and structures for 'hypothetical' proteins without functional characterizations. Detection of homology to experimentally characterized proteins can provide functional clues, but the accuracy of homology-based ...

  10. Classification of projection images of proteins with structural polymorphism by manifold: A simulation study for x-ray free-electron laser diffraction imaging

    Science.gov (United States)

    Yoshidome, Takashi; Oroguchi, Tomotaka; Nakasako, Masayoshi; Ikeguchi, Mitsunori

    2015-09-01

    Coherent x-ray diffraction imaging (CXDI) enables us to visualize noncrystalline sample particles with micrometer to submicrometer dimensions. Using x-ray free-electron laser (XFEL) sources, two-dimensional diffraction patterns are collected from fresh samples supplied to the irradiation area in the "diffraction-before-destruction" scheme. A recent significant increase in the intensity of the XFEL pulse is promising and will allow us to visualize the three-dimensional structures of proteins using XFEL-CXDI in the future. For the protocol proposed for molecular structure determination using future XFEL-CXDI [T. Oroguchi and M. Nakasako, Phys. Rev. E 87, 022712 (2013), 10.1103/PhysRevE.87.022712], we require an algorithm that can classify the data in accordance with the structural polymorphism of proteins arising from their conformational dynamics. However, most of the algorithms proposed primarily require the numbers of conformational classes, and then the results are biased by the numbers. To improve this point, here we examine whether a method based on the manifold concept can classify simulated XFEL-CXDI data with respect to the structural polymorphism of a protein that predominantly adopts two states. After random sampling of the conformations of the two states and in-between states from the trajectories of molecular dynamics simulations, a diffraction pattern is calculated from each conformation. Classification was performed by using our custom-made program suite named enma, in which the diffusion map (DM) method developed based on the manifold concept was implemented. We successfully classify most of the projection electron density maps phase retrieved from diffraction patterns into each of the two states and in-between conformations without the knowledge of the number of conformational classes. We also examined the classification of the projection electron density maps of each of the three states with respect to the Euler angle. The present results suggest

  11. Influence of the Li···π Interaction on the H/X···π Interactions in HOLi···C6H6···HOX/XOH (X=F, Cl, Br, I) complexes.

    Science.gov (United States)

    Zeng, Yanli; Wu, Wenjie; Li, Xiaoyan; Zheng, Shijun; Meng, Lingpeng

    2013-06-03

    The influences of the Li···π interaction of C6H6···LiOH on the H···π interaction of C6H6···HOX (X=F, Cl, Br, I) and the X···π interaction of C6H6···XOH (X=Cl, Br, I) are investigated by means of full electronic second-order Møller-Plesset perturbation theory calculations and "quantum theory of atoms in molecules" (QTAIM) studies. The binding energies, binding distances, infrared vibrational frequencies, and electron densities at the bond critical points (BCPs) of the hydrogen bonds and halogen bonds prove that the addition of the Li···π interaction to benzene weakens the H···π and X···π interactions. The influences of the Li···π interaction on H···π interactions are greater than those on X···π interactions; the influences of the H···π interactions on the Li···π interaction are greater than X···π interactions on Li···π interaction. The greater the influence of Li···π interaction on H/X···π interactions, the greater the influences of H/X···π interactions on Li···π interaction. QTAIM studies show that the intermolecular interactions of C6H6···HOX and C6H6···XOH are mainly of the π type. The electron densities at the BCPs of hydrogen bonds and halogen bonds decrease on going from bimolecular complexes to termolecular complexes, and the π-electron densities at the BCPs show the same pattern. Natural bond orbital analyses show that the Li···π interaction reduces electron transfer from C6 H6 to HOX and XOH. Copyright © 2013 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  12. Classifying Classifications

    DEFF Research Database (Denmark)

    Debus, Michael S.

    2017-01-01

    This paper critically analyzes seventeen game classifications. The classifications were chosen on the basis of diversity, ranging from pre-digital classification (e.g. Murray 1952), over game studies classifications (e.g. Elverdam & Aarseth 2007) to classifications of drinking games (e.g. LaBrie et...... al. 2013). The analysis aims at three goals: The classifications’ internal consistency, the abstraction of classification criteria and the identification of differences in classification across fields and/or time. Especially the abstraction of classification criteria can be used in future endeavors...... into the topic of game classifications....

  13. A comparative evaluation of sequence classification programs

    Directory of Open Access Journals (Sweden)

    Bazinet Adam L

    2012-05-01

    Full Text Available Abstract Background A fundamental problem in modern genomics is to taxonomically or functionally classify DNA sequence fragments derived from environmental sampling (i.e., metagenomics. Several different methods have been proposed for doing this effectively and efficiently, and many have been implemented in software. In addition to varying their basic algorithmic approach to classification, some methods screen sequence reads for ’barcoding genes’ like 16S rRNA, or various types of protein-coding genes. Due to the sheer number and complexity of methods, it can be difficult for a researcher to choose one that is well-suited for a particular analysis. Results We divided the very large number of programs that have been released in recent years for solving the sequence classification problem into three main categories based on the general algorithm they use to compare a query sequence against a database of sequences. We also evaluated the performance of the leading programs in each category on data sets whose taxonomic and functional composition is known. Conclusions We found significant variability in classification accuracy, precision, and resource consumption of sequence classification programs when used to analyze various metagenomics data sets. However, we observe some general trends and patterns that will be useful to researchers who use sequence classification programs.

  14. Proteopedia: 3D Visualization and Annotation of Transcription Factor-DNA Readout Modes

    Science.gov (United States)

    Dantas Machado, Ana Carolina; Saleebyan, Skyler B.; Holmes, Bailey T.; Karelina, Maria; Tam, Julia; Kim, Sharon Y.; Kim, Keziah H.; Dror, Iris; Hodis, Eran; Martz, Eric; Compeau, Patricia A.; Rohs, Remo

    2012-01-01

    3D visualization assists in identifying diverse mechanisms of protein-DNA recognition that can be observed for transcription factors and other DNA binding proteins. We used Proteopedia to illustrate transcription factor-DNA readout modes with a focus on DNA shape, which can be a function of either nucleotide sequence (Hox proteins) or base pairing…

  15. DockQ: A Quality Measure for Protein-Protein Docking Models.

    Directory of Open Access Journals (Sweden)

    Sankar Basu

    Full Text Available The state-of-the-art to assess the structural quality of docking models is currently based on three related yet independent quality measures: Fnat, LRMS, and iRMS as proposed and standardized by CAPRI. These quality measures quantify different aspects of the quality of a particular docking model and need to be viewed together to reveal the true quality, e.g. a model with relatively poor LRMS (>10Å might still qualify as 'acceptable' with a descent Fnat (>0.50 and iRMS (<3.0Å. This is also the reason why the so called CAPRI criteria for assessing the quality of docking models is defined by applying various ad-hoc cutoffs on these measures to classify a docking model into the four classes: Incorrect, Acceptable, Medium, or High quality. This classification has been useful in CAPRI, but since models are grouped in only four bins it is also rather limiting, making it difficult to rank models, correlate with scoring functions or use it as target function in machine learning algorithms. Here, we present DockQ, a continuous protein-protein docking model quality measure derived by combining Fnat, LRMS, and iRMS to a single score in the range [0, 1] that can be used to assess the quality of protein docking models. By using DockQ on CAPRI models it is possible to almost completely reproduce the original CAPRI classification into Incorrect, Acceptable, Medium and High quality. An average PPV of 94% at 90% Recall demonstrating that there is no need to apply predefined ad-hoc cutoffs to classify docking models. Since DockQ recapitulates the CAPRI classification almost perfectly, it can be viewed as a higher resolution version of the CAPRI classification, making it possible to estimate model quality in a more quantitative way using Z-scores or sum of top ranked models, which has been so valuable for the CASP community. The possibility to directly correlate a quality measure to a scoring function has been crucial for the development of scoring functions for

  16. A Hox Gene, Antennapedia, Regulates Expression of Multiple Major Silk Protein Genes in the Silkworm Bombyx mori.

    Science.gov (United States)

    Tsubota, Takuya; Tomita, Shuichiro; Uchino, Keiro; Kimoto, Mai; Takiya, Shigeharu; Kajiwara, Hideyuki; Yamazaki, Toshimasa; Sezutsu, Hideki

    2016-03-25

    Hoxgenes play a pivotal role in the determination of anteroposterior axis specificity during bilaterian animal development. They do so by acting as a master control and regulating the expression of genes important for development. Recently, however, we showed that Hoxgenes can also function in terminally differentiated tissue of the lepidopteranBombyx mori In this species,Antennapedia(Antp) regulates expression of sericin-1, a major silk protein gene, in the silk gland. Here, we investigated whether Antpcan regulate expression of multiple genes in this tissue. By means of proteomic, RT-PCR, and in situ hybridization analyses, we demonstrate that misexpression of Antpin the posterior silk gland induced ectopic expression of major silk protein genes such assericin-3,fhxh4, and fhxh5 These genes are normally expressed specifically in the middle silk gland as is Antp Therefore, the evidence strongly suggests that Antpactivates these silk protein genes in the middle silk gland. The putativesericin-1 activator complex (middle silk gland-intermolt-specific complex) can bind to the upstream regions of these genes, suggesting that Antpdirectly activates their expression. We also found that the pattern of gene expression was well conserved between B. moriand the wild species Bombyx mandarina, indicating that the gene regulation mechanism identified here is an evolutionarily conserved mechanism and not an artifact of the domestication of B. mori We suggest that Hoxgenes have a role as a master control in terminally differentiated tissues, possibly acting as a primary regulator for a range of physiological processes. © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

  17. Comprehensive inventory of protein complexes in the Protein Data Bank from consistent classification of interfaces

    Directory of Open Access Journals (Sweden)

    Gorin Andrey A

    2008-05-01

    Full Text Available Abstract Background Protein-protein interactions are ubiquitous and essential for all cellular processes. High-resolution X-ray crystallographic structures of protein complexes can reveal the details of their function and provide a basis for many computational and experimental approaches. Differentiation between biological and non-biological contacts and reconstruction of the intact complex is a challenging computational problem. A successful solution can provide additional insights into the fundamental principles of biological recognition and reduce errors in many algorithms and databases utilizing interaction information extracted from the Protein Data Bank (PDB. Results We have developed a method for identifying protein complexes in the PDB X-ray structures by a four step procedure: (1 comprehensively collecting all protein-protein interfaces; (2 clustering similar protein-protein interfaces together; (3 estimating the probability that each cluster is relevant based on a diverse set of properties; and (4 combining these scores for each PDB entry in order to predict the complex structure. The resulting clusters of biologically relevant interfaces provide a reliable catalog of evolutionary conserved protein-protein interactions. These interfaces, as well as the predicted protein complexes, are available from the Protein Interface Server (PInS website (see Availability and requirements section. Conclusion Our method demonstrates an almost two-fold reduction of the annotation error rate as evaluated on a large benchmark set of complexes validated from the literature. We also estimate relative contributions of each interface property to the accurate discrimination of biologically relevant interfaces and discuss possible directions for further improving the prediction method.

  18. Insights into the classification of small GTPases

    Directory of Open Access Journals (Sweden)

    Dominik Heider

    2010-05-01

    Full Text Available Dominik Heider1, Sascha Hauke3, Martin Pyka4, Daniel Kessler21Department of Bioinformatics, Center for Medical Biotechnology, 2Institute of Cell Biology (Cancer Research, University of Duisburg-Essen, Essen, Germany; 3Institute of Computer Science, University of Münster, Münster, Germany; 4Interdisciplinary Center for Clinical Research, University Hospital of Münster, Münster, GermanyAbstract: In this study we used a Random Forest-based approach for an assignment of small guanosine triphosphate proteins (GTPases to specific subgroups. Small GTPases represent an important functional group of proteins that serve as molecular switches in a wide range of fundamental cellular processes, including intracellular transport, movement and signaling events. These proteins have further gained a special emphasis in cancer research, because within the last decades a huge variety of small GTPases from different subgroups could be related to the development of all types of tumors. Using a random forest approach, we were able to identify the most important amino acid positions for the classification process within the small GTPases superfamily and its subgroups. These positions are in line with the results of earlier studies and have been shown to be the essential elements for the different functionalities of the GTPase families. Furthermore, we provide an accurate and reliable software tool (GTPasePred to identify potential novel GTPases and demonstrate its application to genome sequences.Keywords: cancer, machine learning, classification, Random Forests, proteins

  19. Using linear algebra for protein structural comparison and classification.

    Science.gov (United States)

    Gomide, Janaína; Melo-Minardi, Raquel; Dos Santos, Marcos Augusto; Neshich, Goran; Meira, Wagner; Lopes, Júlio César; Santoro, Marcelo

    2009-07-01

    In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD) and Latent Semantic Indexing (LSI) techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.

  20. Using linear algebra for protein structural comparison and classification

    Directory of Open Access Journals (Sweden)

    Janaína Gomide

    2009-01-01

    Full Text Available In this article, we describe a novel methodology to extract semantic characteristics from protein structures using linear algebra in order to compose structural signature vectors which may be used efficiently to compare and classify protein structures into fold families. These signatures are built from the pattern of hydrophobic intrachain interactions using Singular Value Decomposition (SVD and Latent Semantic Indexing (LSI techniques. Considering proteins as documents and contacts as terms, we have built a retrieval system which is able to find conserved contacts in samples of myoglobin fold family and to retrieve these proteins among proteins of varied folds with precision of up to 80%. The classifier is a web tool available at our laboratory website. Users can search for similar chains from a specific PDB, view and compare their contact maps and browse their structures using a JMol plug-in.

  1. Ebolavirus Classification Based on Natural Vectors

    Science.gov (United States)

    Zheng, Hui; Yin, Changchuan; Hoang, Tung; He, Rong Lucy; Yang, Jie

    2015-01-01

    According to the WHO, ebolaviruses have resulted in 8818 human deaths in West Africa as of January 2015. To better understand the evolutionary relationship of the ebolaviruses and infer virulence from the relationship, we applied the alignment-free natural vector method to classify the newest ebolaviruses. The dataset includes three new Guinea viruses as well as 99 viruses from Sierra Leone. For the viruses of the family of Filoviridae, both genus label classification and species label classification achieve an accuracy rate of 100%. We represented the relationships among Filoviridae viruses by Unweighted Pair Group Method with Arithmetic Mean (UPGMA) phylogenetic trees and found that the filoviruses can be separated well by three genera. We performed the phylogenetic analysis on the relationship among different species of Ebolavirus by their coding-complete genomes and seven viral protein genes (glycoprotein [GP], nucleoprotein [NP], VP24, VP30, VP35, VP40, and RNA polymerase [L]). The topology of the phylogenetic tree by the viral protein VP24 shows consistency with the variations of virulence of ebolaviruses. The result suggests that VP24 be a pharmaceutical target for treating or preventing ebolaviruses. PMID:25803489

  2. Evolution and classification of the CRISPR-Cas systems

    Science.gov (United States)

    S. Makarova, Kira; H. Haft, Daniel; Barrangou, Rodolphe; J. J. Brouns, Stan; Charpentier, Emmanuelle; Horvath, Philippe; Moineau, Sylvain; J. M. Mojica, Francisco; I. Wolf, Yuri; Yakunin, Alexander F.; van der Oost, John; V. Koonin, Eugene

    2012-01-01

    The CRISPR–Cas (clustered regularly interspaced short palindromic repeats–CRISPR-associated proteins) modules are adaptive immunity systems that are present in many archaea and bacteria. These defence systems are encoded by operons that have an extraordinarily diverse architecture and a high rate of evolution for both the cas genes and the unique spacer content. Here, we provide an updated analysis of the evolutionary relationships between CRISPR–Cas systems and Cas proteins. Three major types of CRISPR–Cas system are delineated, with a further division into several subtypes and a few chimeric variants. Given the complexity of the genomic architectures and the extremely dynamic evolution of the CRISPR–Cas systems, a unified classification of these systems should be based on multiple criteria. Accordingly, we propose a `polythetic' classification that integrates the phylogenies of the most common cas genes, the sequence and organization of the CRISPR repeats and the architecture of the CRISPR–cas loci. PMID:21552286

  3. Karakteristik Protein dan Nitrogen Non Protein Daging Ikan Cucut Lanyam (Charcharhinus limbatus (Characteristics of Protein and Non Protein Nitrogen in Lanyam Shark Muscle

    Directory of Open Access Journals (Sweden)

    Yuspihana Fitrial

    2017-02-01

    Based on protein solubility of Lanyam muscle at pH 1.5 to 12 obtained two points which is minimum solubility at pH 4.5 and pH 9. Based on the classification Osborn, Lanyam muscle contained albumin (28.64%, globulin (13:44%, prolamin (03.29%, glutelin (33.70%. Observation of non-protein nitrogen levels indicated that the washing process was very effective to reduce non-protein nitrogen levels up to 62.34% and urea levels up to 58% . Differential Scanning Calorimetry Study of Lanyam mince showed two types of protein that has a different stability to heat and after added 2.5% NaCl formed a peak which is a fusion of both these proteins

  4. Arabidopsis thaliana mTERF proteins: evolution and functional classification

    Directory of Open Access Journals (Sweden)

    Tatjana eKleine

    2012-10-01

    Full Text Available Organellar gene expression (OGE is crucial for plant development, photosynthesis and respiration, but our understanding of the mechanisms that control it is still relatively poor. Thus, OGE requires various nucleus-encoded proteins that promote transcription, splicing, trimming and editing of organellar RNAs, and regulate translation. In metazoans, proteins of the mitochondrial Transcription tERmination Factor (mTERF family interact with the mitochondrial chromosome and regulate transcriptional initiation and termination. Sequencing of the Arabidopsis thaliana genome led to the identification of a diversified MTERF gene family but, in contrast to mammalian mTERFs, knowledge about the function of these proteins in photosynthetic organisms is scarce. In this hypothesis article, I show that tandem duplications and one block duplication contributed to the large number of MTERF genes in A. thaliana, and propose that the expansion of the family is related to the evolution of land plants. The MTERF genes - especially the duplicated genes - display a number of distinct mRNA accumulation patterns, suggesting functional diversification of mTERF proteins to increase adaptability to environmental changes. Indeed, hypothetical functions for the different mTERF proteins can be predicted using co-expression analysis and gene ontology annotations. On this basis, mTERF proteins can be sorted into five groups. Members of the chloroplast and chloroplast-associated clusters are principally involved in chloroplast gene expression, embryogenesis and protein catabolism, while representatives of the mitochondrial cluster seem to participate in DNA and RNA metabolism in that organelle. Moreover, members of the mitochondrion-associated cluster and the low expression group may act in the nucleus and/or the cytosol. As proteins involved in OGE and presumably nuclear gene expression, mTERFs are ideal candidates for the coordination of the expression of organelle and nuclear

  5. Improved protein surface comparison and application to low-resolution protein structure data

    Directory of Open Access Journals (Sweden)

    Kihara Daisuke

    2010-12-01

    Full Text Available Abstract Background Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM, which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs. The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. Results The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Conclusions Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.

  6. Improved protein surface comparison and application to low-resolution protein structure data.

    Science.gov (United States)

    Sael, Lee; Kihara, Daisuke

    2010-12-14

    Recent advancements of experimental techniques for determining protein tertiary structures raise significant challenges for protein bioinformatics. With the number of known structures of unknown function expanding at a rapid pace, an urgent task is to provide reliable clues to their biological function on a large scale. Conventional approaches for structure comparison are not suitable for a real-time database search due to their slow speed. Moreover, a new challenge has arisen from recent techniques such as electron microscopy (EM), which provide low-resolution structure data. Previously, we have introduced a method for protein surface shape representation using the 3D Zernike descriptors (3DZDs). The 3DZD enables fast structure database searches, taking advantage of its rotation invariance and compact representation. The search results of protein surface represented with the 3DZD has showngood agreement with the existing structure classifications, but some discrepancies were also observed. The three new surface representations of backbone atoms, originally devised all-atom-surface representation, and the combination of all-atom surface with the backbone representation are examined. All representations are encoded with the 3DZD. Also, we have investigated the applicability of the 3DZD for searching protein EM density maps of varying resolutions. The surface representations are evaluated on structure retrieval using two existing classifications, SCOP and the CE-based classification. Overall, the 3DZDs representing backbone atoms show better retrieval performance than the original all-atom surface representation. The performance further improved when the two representations are combined. Moreover, we observed that the 3DZD is also powerful in comparing low-resolution structures obtained by electron microscopy.

  7. Problems of classification in the family Paramyxoviridae.

    Science.gov (United States)

    Rima, Bert; Collins, Peter; Easton, Andrew; Fouchier, Ron; Kurath, Gael; Lamb, Robert A; Lee, Benhur; Maisner, Andrea; Rota, Paul; Wang, Lin-Fa

    2018-05-01

    A number of unassigned viruses in the family Paramyxoviridae need to be classified either as a new genus or placed into one of the seven genera currently recognized in this family. Furthermore, numerous new paramyxoviruses continue to be discovered. However, attempts at classification have highlighted the difficulties that arise by applying historic criteria or criteria based on sequence alone to the classification of the viruses in this family. While the recent taxonomic change that elevated the previous subfamily Pneumovirinae into a separate family Pneumoviridae is readily justified on the basis of RNA dependent -RNA polymerase (RdRp or L protein) sequence motifs, using RdRp sequence comparisons for assignment to lower level taxa raises problems that would require an overhaul of the current criteria for assignment into genera in the family Paramyxoviridae. Arbitrary cut off points to delineate genera and species would have to be set if classification was based on the amino acid sequence of the RdRp alone or on pairwise analysis of sequence complementarity (PASC) of all open reading frames (ORFs). While these cut-offs cannot be made consistent with the current classification in this family, resorting to genus-level demarcation criteria with additional input from the biological context may afford a way forward. Such criteria would reflect the increasingly dynamic nature of virus taxonomy even if it would require a complete revision of the current classification.

  8. K-nearest uphill clustering in the protein structure space

    KAUST Repository

    Cui, Xuefeng; Gao, Xin

    2016-01-01

    The protein structure classification problem, which is to assign a protein structure to a cluster of similar proteins, is one of the most fundamental problems in the construction and application of the protein structure space. Early manually curated

  9. Genomewide identification, classification and analysis of NAC type ...

    Indian Academy of Sciences (India)

    Supplementary data: Genomewide identification, classification and analysis of NAC type gene family in maize. Xiaojian Peng, Yang Zhao, Xiaoming Li, Min Wu, Wenbo Chai, Lei Sheng, Yu Wang, Qing Dong,. Haiyang Jiang and Beijiu Cheng. J. Genet. 94, 377–390. Table 1. Detailed information of NAC proteins in maize.

  10. Diversity, classification and function of the plant protein kinase superfamily

    OpenAIRE

    Lehti-Shiu, Melissa D.; Shiu, Shin-Han

    2012-01-01

    Eukaryotic protein kinases belong to a large superfamily with hundreds to thousands of copies and are components of essentially all cellular functions. The goals of this study are to classify protein kinases from 25 plant species and to assess their evolutionary history in conjunction with consideration of their molecular functions. The protein kinase superfamily has expanded in the flowering plant lineage, in part through recent duplications. As a result, the flowering plant protein kinase r...

  11. Predicting protein complexes using a supervised learning method combined with local structural information.

    Science.gov (United States)

    Dong, Yadong; Sun, Yongqi; Qin, Chao

    2018-01-01

    The existing protein complex detection methods can be broadly divided into two categories: unsupervised and supervised learning methods. Most of the unsupervised learning methods assume that protein complexes are in dense regions of protein-protein interaction (PPI) networks even though many true complexes are not dense subgraphs. Supervised learning methods utilize the informative properties of known complexes; they often extract features from existing complexes and then use the features to train a classification model. The trained model is used to guide the search process for new complexes. However, insufficient extracted features, noise in the PPI data and the incompleteness of complex data make the classification model imprecise. Consequently, the classification model is not sufficient for guiding the detection of complexes. Therefore, we propose a new robust score function that combines the classification model with local structural information. Based on the score function, we provide a search method that works both forwards and backwards. The results from experiments on six benchmark PPI datasets and three protein complex datasets show that our approach can achieve better performance compared with the state-of-the-art supervised, semi-supervised and unsupervised methods for protein complex detection, occasionally significantly outperforming such methods.

  12. A proposal of criteria for the classification of systemic sclerosis.

    Science.gov (United States)

    Nadashkevich, Oleg; Davis, Paul; Fritzler, Marvin J

    2004-11-01

    Sensitive and specific criteria for the classification of systemic sclerosis are required by clinicians and investigators to achieve higher quality clinical studies and approaches to therapy. A clinical study of systemic sclerosis patients in Europe and Canada led to a set of criteria that achieve high sensitivity and specificity. Both clinical and laboratory investigations of patients with systemic sclerosis, related conditions and diseases with clinical features that can be mistaken as part of the systemic sclerosis spectrum were undertaken. Laboratory investigations included the detection of autoantibodies to centromere proteins, Scl-70 (topoisomerase I), and fibrillarin (U3-RNP). Based on the investigation of 269 systemic sclerosis patients and 720 patients presenting with related and confounding conditions, the following set of criteria for the classification of systemic sclerosis was proposed: 1) autoantibodies to: centromere proteins, Scl-70 (topo I), fibrillarin; 2) bibasilar pulmonary fibrosis; 3) contractures of the digital joints or prayer sign; 4) dermal thickening proximal to the wrists; 5) calcinosis cutis; 6) Raynaud's phenomenon; 7) esophageal distal hypomotility or reflux-esophagitis; 8) sclerodactyly or non-pitting digital edema; 9) teleangiectasias. The classification of definite SSc requires at least three of the above criteria. Criteria for the classification of systemic sclerosis have been proposed. Preliminary testing has defined the sensitivity and specificity of these criteria as high as 99% and 100%, respectively. Testing and validation of the proposed criteria by other clinical centers is required.

  13. PASTEC: an automatic transposable element classification tool.

    Directory of Open Access Journals (Sweden)

    Claire Hoede

    Full Text Available SUMMARY: The classification of transposable elements (TEs is key step towards deciphering their potential impact on the genome. However, this process is often based on manual sequence inspection by TE experts. With the wealth of genomic sequences now available, this task requires automation, making it accessible to most scientists. We propose a new tool, PASTEC, which classifies TEs by searching for structural features and similarities. This tool outperforms currently available software for TE classification. The main innovation of PASTEC is the search for HMM profiles, which is useful for inferring the classification of unknown TE on the basis of conserved functional domains of the proteins. In addition, PASTEC is the only tool providing an exhaustive spectrum of possible classifications to the order level of the Wicker hierarchical TE classification system. It can also automatically classify other repeated elements, such as SSR (Simple Sequence Repeats, rDNA or potential repeated host genes. Finally, the output of this new tool is designed to facilitate manual curation by providing to biologists with all the evidence accumulated for each TE consensus. AVAILABILITY: PASTEC is available as a REPET module or standalone software (http://urgi.versailles.inra.fr/download/repet/REPET_linux-x64-2.2.tar.gz. It requires a Unix-like system. There are two standalone versions: one of which is parallelized (requiring Sun grid Engine or Torque, and the other of which is not.

  14. Polycomb Group Protein PHF1 Regulates p53-dependent Cell Growth Arrest and Apoptosis*

    Science.gov (United States)

    Yang, Yang; Wang, Chenji; Zhang, Pingzhao; Gao, Kun; Wang, Dejie; Yu, Hongxiu; Zhang, Ting; Jiang, Sirui; Hexige, Saiyin; Hong, Zehui; Yasui, Akira; Liu, Jun O.; Huang, Haojie; Yu, Long

    2013-01-01

    Polycomb group protein PHF1 is well known as a component of a novel EED-EZH2·Polycomb repressive complex 2 complex and plays important roles in H3K27 methylation and Hox gene silencing. PHF1 is also involved in the response to DNA double-strand breaks in human cells, promotes nonhomologous end-joining processes through interaction with Ku70/Ku80. Here, we identified another function of PHF1 as a potential p53 pathway activator in a pathway screen using luminescence reporter assay. Subsequent studies showed PHF1 directly interacts with p53 proteins both in vivo and in vitro and co-localized in nucleus. PHF1 binds to the C-terminal regulatory domain of p53. Overexpression of PHF1 elevated p53 protein level and prolonged its turnover. Knockdown of PHF1 reduced p53 protein level and its target gene expression both in normal state and DNA damage response. Mechanically, PHF1 protects p53 proteins from MDM2-mediated ubiquitination and degradation. Furthermore, we showed that PHF1 regulates cell growth arrest and etoposide-induced apoptosis in a p53-dependent manner. Finally, PHF1 expression was significantly down-regulated in human breast cancer samples. Taken together, we establish PHF1 as a novel positive regulator of the p53 pathway. These data shed light on the potential roles of PHF1 in tumorigenesis and/or tumor progression. PMID:23150668

  15. A New Classification Approach Based on Multiple Classification Rules

    OpenAIRE

    Zhongmei Zhou

    2014-01-01

    A good classifier can correctly predict new data for which the class label is unknown, so it is important to construct a high accuracy classifier. Hence, classification techniques are much useful in ubiquitous computing. Associative classification achieves higher classification accuracy than some traditional rule-based classification approaches. However, the approach also has two major deficiencies. First, it generates a very large number of association classification rules, especially when t...

  16. Towards a systematic classification of protein folds

    DEFF Research Database (Denmark)

    Lindgård, Per-Anker; Bohr, Henrik

    1997-01-01

    structures are given a unique name, which simultaneously represent a linear string of physical coupling constants describing hinge spin interactions. We have defined a metric and a precise distance measure between the fold classes. An automated procedure is constructed in which any protein structure...

  17. Multiple promoters and alternative splicing: Hoxa5 transcriptional complexity in the mouse embryo.

    Directory of Open Access Journals (Sweden)

    Yan Coulombe

    2010-05-01

    Full Text Available The genomic organization of Hox clusters is fundamental for the precise spatio-temporal regulation and the function of each Hox gene, and hence for correct embryo patterning. Multiple overlapping transcriptional units exist at the Hoxa5 locus reflecting the complexity of Hox clustering: a major form of 1.8 kb corresponding to the two characterized exons of the gene and polyadenylated RNA species of 5.0, 9.5 and 11.0 kb. This transcriptional intricacy raises the question of the involvement of the larger transcripts in Hox function and regulation.We have undertaken the molecular characterization of the Hoxa5 larger transcripts. They initiate from two highly conserved distal promoters, one corresponding to the putative Hoxa6 promoter, and a second located nearby Hoxa7. Alternative splicing is also involved in the generation of the different transcripts. No functional polyadenylation sequence was found at the Hoxa6 locus and all larger transcripts use the polyadenylation site of the Hoxa5 gene. Some larger transcripts are potential Hoxa6/Hoxa5 bicistronic units. However, even though all transcripts could produce the genuine 270 a.a. HOXA5 protein, only the 1.8 kb form is translated into the protein, indicative of its essential role in Hoxa5 gene function. The Hoxa6 mutation disrupts the larger transcripts without major phenotypic impact on axial specification in their expression domain. However, Hoxa5-like skeletal anomalies are observed in Hoxa6 mutants and these defects can be explained by the loss of expression of the 1.8 kb transcript. Our data raise the possibility that the larger transcripts may be involved in Hoxa5 gene regulation.Our observation that the Hoxa5 larger transcripts possess a developmentally-regulated expression combined to the increasing sum of data on the role of long noncoding RNAs in transcriptional regulation suggest that the Hoxa5 larger transcripts may participate in the control of Hox gene expression.

  18. 78 FR 68983 - Cotton Futures Classification: Optional Classification Procedure

    Science.gov (United States)

    2013-11-18

    ...-AD33 Cotton Futures Classification: Optional Classification Procedure AGENCY: Agricultural Marketing... regulations to allow for the addition of an optional cotton futures classification procedure--identified and... response to requests from the U.S. cotton industry and ICE, AMS will offer a futures classification option...

  19. Classification of ancient mammal individuals using dental pulp MALDI-TOF MS peptide profiling.

    Directory of Open Access Journals (Sweden)

    Thi-Nguyen-Ny Tran

    Full Text Available BACKGROUND: The classification of ancient animal corpses at the species level remains a challenging task for forensic scientists and anthropologists. Severe damage and mixed, tiny pieces originating from several skeletons may render morphological classification virtually impossible. Standard approaches are based on sequencing mitochondrial and nuclear targets. METHODOLOGY/PRINCIPAL FINDINGS: We present a method that can accurately classify mammalian species using dental pulp and mass spectrometry peptide profiling. Our work was organized into three successive steps. First, after extracting proteins from the dental pulp collected from 37 modern individuals representing 13 mammalian species, trypsin-digested peptides were used for matrix-assisted laser desorption/ionization time-of-flight mass spectrometry analysis. The resulting peptide profiles accurately classified every individual at the species level in agreement with parallel cytochrome b gene sequencing gold standard. Second, using a 279-modern spectrum database, we blindly classified 33 of 37 teeth collected in 37 modern individuals (89.1%. Third, we classified 10 of 18 teeth (56% collected in 15 ancient individuals representing five mammal species including human, from five burial sites dating back 8,500 years. Further comparison with an upgraded database comprising ancient specimen profiles yielded 100% classification in ancient teeth. Peptide sequencing yield 4 and 16 different non-keratin proteins including collagen (alpha-1 type I and alpha-2 type I in human ancient and modern dental pulp, respectively. CONCLUSIONS/SIGNIFICANCE: Mass spectrometry peptide profiling of the dental pulp is a new approach that can be added to the arsenal of species classification tools for forensics and anthropology as a complementary method to DNA sequencing. The dental pulp is a new source for collagen and other proteins for the species classification of modern and ancient mammal individuals.

  20. Classification of Ancient Mammal Individuals Using Dental Pulp MALDI-TOF MS Peptide Profiling

    Science.gov (United States)

    Tran, Thi-Nguyen-Ny; Aboudharam, Gérard; Gardeisen, Armelle; Davoust, Bernard; Bocquet-Appel, Jean-Pierre; Flaudrops, Christophe; Belghazi, Maya; Raoult, Didier; Drancourt, Michel

    2011-01-01

    Background The classification of ancient animal corpses at the species level remains a challenging task for forensic scientists and anthropologists. Severe damage and mixed, tiny pieces originating from several skeletons may render morphological classification virtually impossible. Standard approaches are based on sequencing mitochondrial and nuclear targets. Methodology/Principal Findings We present a method that can accurately classify mammalian species using dental pulp and mass spectrometry peptide profiling. Our work was organized into three successive steps. First, after extracting proteins from the dental pulp collected from 37 modern individuals representing 13 mammalian species, trypsin-digested peptides were used for matrix-assisted laser desorption/ionization time-of-flight mass spectrometry analysis. The resulting peptide profiles accurately classified every individual at the species level in agreement with parallel cytochrome b gene sequencing gold standard. Second, using a 279–modern spectrum database, we blindly classified 33 of 37 teeth collected in 37 modern individuals (89.1%). Third, we classified 10 of 18 teeth (56%) collected in 15 ancient individuals representing five mammal species including human, from five burial sites dating back 8,500 years. Further comparison with an upgraded database comprising ancient specimen profiles yielded 100% classification in ancient teeth. Peptide sequencing yield 4 and 16 different non-keratin proteins including collagen (alpha-1 type I and alpha-2 type I) in human ancient and modern dental pulp, respectively. Conclusions/Significance Mass spectrometry peptide profiling of the dental pulp is a new approach that can be added to the arsenal of species classification tools for forensics and anthropology as a complementary method to DNA sequencing. The dental pulp is a new source for collagen and other proteins for the species classification of modern and ancient mammal individuals. PMID:21364886

  1. DockQ: A Quality Measure for Protein-Protein Docking Models

    Science.gov (United States)

    Basu, Sankar

    2016-01-01

    The state-of-the-art to assess the structural quality of docking models is currently based on three related yet independent quality measures: Fnat, LRMS, and iRMS as proposed and standardized by CAPRI. These quality measures quantify different aspects of the quality of a particular docking model and need to be viewed together to reveal the true quality, e.g. a model with relatively poor LRMS (>10Å) might still qualify as 'acceptable' with a descent Fnat (>0.50) and iRMS (iRMS to a single score in the range [0, 1] that can be used to assess the quality of protein docking models. By using DockQ on CAPRI models it is possible to almost completely reproduce the original CAPRI classification into Incorrect, Acceptable, Medium and High quality. An average PPV of 94% at 90% Recall demonstrating that there is no need to apply predefined ad-hoc cutoffs to classify docking models. Since DockQ recapitulates the CAPRI classification almost perfectly, it can be viewed as a higher resolution version of the CAPRI classification, making it possible to estimate model quality in a more quantitative way using Z-scores or sum of top ranked models, which has been so valuable for the CASP community. The possibility to directly correlate a quality measure to a scoring function has been crucial for the development of scoring functions for protein structure prediction, and DockQ should be useful in a similar development in the protein docking field. DockQ is available at http://github.com/bjornwallner/DockQ/ PMID:27560519

  2. Advances in the Function and Regulation of Hydrogenase in the Cyanobacterium Synechocystis PCC6803

    Science.gov (United States)

    Cassier-Chauvat, Corinne; Veaudor, Théo; Chauvat, Franck

    2014-01-01

    In order to use cyanobacteria for the biological production of hydrogen, it is important to thoroughly study the function and the regulation of the hydrogen-production machine in order to better understand its role in the global cell metabolism and identify bottlenecks limiting H2 production. Most of the recent advances in our understanding of the bidirectional [Ni-Fe] hydrogenase (Hox) came from investigations performed in the widely-used model cyanobacterium Synechocystis PCC6803 where Hox is the sole enzyme capable of combining electrons with protons to produce H2 under specific conditions. Recent findings suggested that the Hox enzyme can receive electrons from not only NAD(P)H as usually shown, but also, or even preferentially, from ferredoxin. Furthermore, plasmid-encoded functions and glutathionylation (the formation of a mixed-disulfide between the cysteines residues of a protein and the cysteine residue of glutathione) are proposed as possible new players in the function and regulation of hydrogen production. PMID:25365180

  3. Segmentation and classification of cell cycle phases in fluorescence imaging.

    Science.gov (United States)

    Ersoy, Ilker; Bunyak, Filiz; Chagin, Vadim; Cardoso, M Christina; Palaniappan, Kannappan

    2009-01-01

    Current chemical biology methods for studying spatiotemporal correlation between biochemical networks and cell cycle phase progression in live-cells typically use fluorescence-based imaging of fusion proteins. Stable cell lines expressing fluorescently tagged protein GFP-PCNA produce rich, dynamically varying sub-cellular foci patterns characterizing the cell cycle phases, including the progress during the S-phase. Variable fluorescence patterns, drastic changes in SNR, shape and position changes and abundance of touching cells require sophisticated algorithms for reliable automatic segmentation and cell cycle classification. We extend the recently proposed graph partitioning active contours (GPAC) for fluorescence-based nucleus segmentation using regional density functions and dramatically improve its efficiency, making it scalable for high content microscopy imaging. We utilize surface shape properties of GFP-PCNA intensity field to obtain descriptors of foci patterns and perform automated cell cycle phase classification, and give quantitative performance by comparing our results to manually labeled data.

  4. Is overall similarity classification less effortful than single-dimension classification?

    Science.gov (United States)

    Wills, Andy J; Milton, Fraser; Longmore, Christopher A; Hester, Sarah; Robinson, Jo

    2013-01-01

    It is sometimes argued that the implementation of an overall similarity classification is less effortful than the implementation of a single-dimension classification. In the current article, we argue that the evidence securely in support of this view is limited, and report additional evidence in support of the opposite proposition--overall similarity classification is more effortful than single-dimension classification. Using a match-to-standards procedure, Experiments 1A, 1B and 2 demonstrate that concurrent load reduces the prevalence of overall similarity classification, and that this effect is robust to changes in the concurrent load task employed, the level of time pressure experienced, and the short-term memory requirements of the classification task. Experiment 3 demonstrates that participants who produced overall similarity classifications from the outset have larger working memory capacities than those who produced single-dimension classifications initially, and Experiment 4 demonstrates that instructions to respond meticulously increase the prevalence of overall similarity classification.

  5. Classification and Lineage Tracing of SH2 Domains Throughout Eukaryotes.

    Science.gov (United States)

    Liu, Bernard A

    2017-01-01

    Today there exists a rapidly expanding number of sequenced genomes. Cataloging protein interaction domains such as the Src Homology 2 (SH2) domain across these various genomes can be accomplished with ease due to existing algorithms and predictions models. An evolutionary analysis of SH2 domains provides a step towards understanding how SH2 proteins integrated with existing signaling networks to position phosphotyrosine signaling as a crucial driver of robust cellular communication networks in metazoans. However organizing and tracing SH2 domain across organisms and understanding their evolutionary trajectory remains a challenge. This chapter describes several methodologies towards analyzing the evolutionary trajectory of SH2 domains including a global SH2 domain classification system, which facilitates annotation of new SH2 sequences essential for tracing the lineage of SH2 domains throughout eukaryote evolution. This classification utilizes a combination of sequence homology, protein domain architecture and the boundary positions between introns and exons within the SH2 domain or genes encoding these domains. Discrete SH2 families can then be traced across various genomes to provide insight into its origins. Furthermore, additional methods for examining potential mechanisms for divergence of SH2 domains from structural changes to alterations in the protein domain content and genome duplication will be discussed. Therefore a better understanding of SH2 domain evolution may enhance our insight into the emergence of phosphotyrosine signaling and the expansion of protein interaction domains.

  6. Structural and physical property study of sol-gel synthesized CoFe2-xHoxO4 nano ferrites

    Science.gov (United States)

    Patankar, K. K.; Ghone, D. M.; Mathe, V. L.; Kaushik, S. D.

    2018-05-01

    CoFe2-xHoxO4 (x = 0.00, 0.05, 0.10, 0.15, 0.20) ferrites were prepared by the suitably modified Sol-Gel technique. X-ray diffraction (XRD) analysis revealed that the substituted samples show phase pure formation till 10% substitution, which is far higher phase pure than the earlier reports. Upon further substitution an inevitable secondary phase of HoFeO3 along with the spinel phase despite regulating synthesis parameters in the sol-gel reaction route. These results are further corroborated more convincingly by room temperature neutron diffraction. Morphological features of the ferrites were studied by Scanning Electron Microscopy (SEM). The magnetic parameters viz. the saturation magnetization (Ms), coercivity (Hc) and remanence (Mr) were determined from room temperature isothermal magnetization. These parameters were found to decrease with increase in Ho substitution. The decrease in magnetization is analyzed in the light of exchange interactions between rare earth and transition metal ions. Magnetostriction measurements revealed interesting results and the presence of a secondary phase was found to be responsible for decreased measu-red magnetostriction values. The solubility limit of Ho in CoFe2O4 lattice is also reflected from the X-ray and neutron diffraction analysis and magnetostriction studies.

  7. Identification of group specific motifs in Beta-lactamase family of proteins

    Directory of Open Access Journals (Sweden)

    Saxena Akansha

    2009-12-01

    Full Text Available Abstract Background Beta-lactamases are one of the most serious threats to public health. In order to combat this threat we need to study the molecular and functional diversity of these enzymes and identify signatures specific to these enzymes. These signatures will enable us to develop inhibitors and diagnostic probes specific to lactamases. The existing classification of beta-lactamases was developed nearly 30 years ago when few lactamases were available. DLact database contain more than 2000 beta-lactamase, which can be used to study the molecular diversity and to identify signatures specific to this family. Methods A set of 2020 beta-lactamase proteins available in the DLact database http://59.160.102.202/DLact were classified using graph-based clustering of Best Bi-Directional Hits. Non-redundant (> 90 percent identical protein sequences from each group were aligned using T-Coffee and annotated using information available in literature. Motifs specific to each group were predicted using PRATT program. Results The graph-based classification of beta-lactamase proteins resulted in the formation of six groups (Four major groups containing 191, 726, 774 and 73 proteins while two minor groups containing 50 and 8 proteins. Based on the information available in literature, we found that each of the four major groups correspond to the four classes proposed by Ambler. The two minor groups were novel and do not contain molecular signatures of beta-lactamase proteins reported in literature. The group-specific motifs showed high sensitivity (> 70% and very high specificity (> 90%. The motifs from three groups (corresponding to class A, C and D had a high level of conservation at DNA as well as protein level whereas the motifs from the fourth group (corresponding to class B showed conservation at only protein level. Conclusion The graph-based classification of beta-lactamase proteins corresponds with the classification proposed by Ambler, thus there is

  8. ASH structure alignment package: Sensitivity and selectivity in domain classification

    Directory of Open Access Journals (Sweden)

    Toh Hiroyuki

    2007-04-01

    Full Text Available Abstract Background Structure alignment methods offer the possibility of measuring distant evolutionary relationships between proteins that are not visible by sequence-based analysis. However, the question of how structural differences and similarities ought to be quantified in this regard remains open. In this study we construct a training set of sequence-unique CATH and SCOP domains, from which we develop a scoring function that can reliably identify domains with the same CATH topology and SCOP fold classification. The score is implemented in the ASH structure alignment package, for which the source code and a web service are freely available from the PDBj website http://www.pdbj.org/ASH/. Results The new ASH score shows increased selectivity and sensitivity compared with values reported for several popular programs using the same test set of 4,298,905 structure pairs, yielding an area of .96 under the receiver operating characteristic (ROC curve. In addition, weak sequence homologies between similar domains are revealed that could not be detected by BLAST sequence alignment. Also, a subset of domain pairs is identified that exhibit high similarity, even though their CATH and SCOP classification differs. Finally, we show that the ranking of alignment programs based solely on geometric measures depends on the choice of the quality measure. Conclusion ASH shows high selectivity and sensitivity with regard to domain classification, an important step in defining distantly related protein sequence families. Moreover, the CPU cost per alignment is competitive with the fastest programs, making ASH a practical option for large-scale structure classification studies.

  9. Melody discrimination and protein fold classification

    Directory of Open Access Journals (Sweden)

    Robert P. Bywater

    2016-10-01

    Full Text Available One of the greatest challenges in theoretical biophysics and bioinformatics is the identification of protein folds from sequence data. This can be regarded as a pattern recognition problem. In this paper we report the use of a melody generation software where the inputs are derived from calculations of evolutionary information, secondary structure, flexibility, hydropathy and solvent accessibility from multiple sequence alignment data. The melodies so generated are derived from the sequence, and by inference, of the fold, in ways that give each fold a sound representation that may facilitate analysis, recognition, or comparison with other sequences.

  10. SAW Classification Algorithm for Chinese Text Classification

    OpenAIRE

    Xiaoli Guo; Huiyu Sun; Tiehua Zhou; Ling Wang; Zhaoyang Qu; Jiannan Zang

    2015-01-01

    Considering the explosive growth of data, the increased amount of text data’s effect on the performance of text categorization forward the need for higher requirements, such that the existing classification method cannot be satisfied. Based on the study of existing text classification technology and semantics, this paper puts forward a kind of Chinese text classification oriented SAW (Structural Auxiliary Word) algorithm. The algorithm uses the special space effect of Chinese text where words...

  11. IDM-PhyChm-Ens: intelligent decision-making ensemble methodology for classification of human breast cancer using physicochemical properties of amino acids.

    Science.gov (United States)

    Ali, Safdar; Majid, Abdul; Khan, Asifullah

    2014-04-01

    Development of an accurate and reliable intelligent decision-making method for the construction of cancer diagnosis system is one of the fast growing research areas of health sciences. Such decision-making system can provide adequate information for cancer diagnosis and drug discovery. Descriptors derived from physicochemical properties of protein sequences are very useful for classifying cancerous proteins. Recently, several interesting research studies have been reported on breast cancer classification. To this end, we propose the exploitation of the physicochemical properties of amino acids in protein primary sequences such as hydrophobicity (Hd) and hydrophilicity (Hb) for breast cancer classification. Hd and Hb properties of amino acids, in recent literature, are reported to be quite effective in characterizing the constituent amino acids and are used to study protein foldings, interactions, structures, and sequence-order effects. Especially, using these physicochemical properties, we observed that proline, serine, tyrosine, cysteine, arginine, and asparagine amino acids offer high discrimination between cancerous and healthy proteins. In addition, unlike traditional ensemble classification approaches, the proposed 'IDM-PhyChm-Ens' method was developed by combining the decision spaces of a specific classifier trained on different feature spaces. The different feature spaces used were amino acid composition, split amino acid composition, and pseudo amino acid composition. Consequently, we have exploited different feature spaces using Hd and Hb properties of amino acids to develop an accurate method for classification of cancerous protein sequences. We developed ensemble classifiers using diverse learning algorithms such as random forest (RF), support vector machines (SVM), and K-nearest neighbor (KNN) trained on different feature spaces. We observed that ensemble-RF, in case of cancer classification, performed better than ensemble-SVM and ensemble-KNN. Our

  12. Classification in context

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2004-01-01

    This paper surveys classification research literature, discusses various classification theories, and shows that the focus has traditionally been on establishing a scientific foundation for classification research. This paper argues that a shift has taken place, and suggests that contemporary...... classification research focus on contextual information as the guide for the design and construction of classification schemes....

  13. Classification

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2017-01-01

    This article presents and discusses definitions of the term “classification” and the related concepts “Concept/conceptualization,”“categorization,” “ordering,” “taxonomy” and “typology.” It further presents and discusses theories of classification including the influences of Aristotle...... and Wittgenstein. It presents different views on forming classes, including logical division, numerical taxonomy, historical classification, hermeneutical and pragmatic/critical views. Finally, issues related to artificial versus natural classification and taxonomic monism versus taxonomic pluralism are briefly...

  14. A novel ensemble and composite approach for classifying proteins ...

    African Journals Online (AJOL)

    For the fact that the location of proteins gave some details about the function of a protein whose location was uncertain, protein classification was regarded as a very important task in the field of biological data mining. However, the success of a human genome project led to a protein sequence explosion. There is a great ...

  15. 78 FR 54970 - Cotton Futures Classification: Optional Classification Procedure

    Science.gov (United States)

    2013-09-09

    ... Service 7 CFR Part 27 [AMS-CN-13-0043] RIN 0581-AD33 Cotton Futures Classification: Optional Classification Procedure AGENCY: Agricultural Marketing Service, USDA. ACTION: Proposed rule. SUMMARY: The... optional cotton futures classification procedure--identified and known as ``registration'' by the U.S...

  16. HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features.

    Science.gov (United States)

    Zaman, Rianon; Chowdhury, Shahana Yasmin; Rashid, Mahmood A; Sharma, Alok; Dehzangi, Abdollah; Shatabda, Swakkhar

    2017-01-01

    DNA-binding proteins often play important role in various processes within the cell. Over the last decade, a wide range of classification algorithms and feature extraction techniques have been used to solve this problem. In this paper, we propose a novel DNA-binding protein prediction method called HMMBinder. HMMBinder uses monogram and bigram features extracted from the HMM profiles of the protein sequences. To the best of our knowledge, this is the first application of HMM profile based features for the DNA-binding protein prediction problem. We applied Support Vector Machines (SVM) as a classification technique in HMMBinder. Our method was tested on standard benchmark datasets. We experimentally show that our method outperforms the state-of-the-art methods found in the literature.

  17. HMMBinder: DNA-Binding Protein Prediction Using HMM Profile Based Features

    Directory of Open Access Journals (Sweden)

    Rianon Zaman

    2017-01-01

    Full Text Available DNA-binding proteins often play important role in various processes within the cell. Over the last decade, a wide range of classification algorithms and feature extraction techniques have been used to solve this problem. In this paper, we propose a novel DNA-binding protein prediction method called HMMBinder. HMMBinder uses monogram and bigram features extracted from the HMM profiles of the protein sequences. To the best of our knowledge, this is the first application of HMM profile based features for the DNA-binding protein prediction problem. We applied Support Vector Machines (SVM as a classification technique in HMMBinder. Our method was tested on standard benchmark datasets. We experimentally show that our method outperforms the state-of-the-art methods found in the literature.

  18. Classification of refrigerants; Classification des fluides frigorigenes

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2001-07-01

    This document was made from the US standard ANSI/ASHRAE 34 published in 2001 and entitled 'designation and safety classification of refrigerants'. This classification allows to clearly organize in an international way the overall refrigerants used in the world thanks to a codification of the refrigerants in correspondence with their chemical composition. This note explains this codification: prefix, suffixes (hydrocarbons and derived fluids, azeotropic and non-azeotropic mixtures, various organic compounds, non-organic compounds), safety classification (toxicity, flammability, case of mixtures). (J.S.)

  19. Protonation/reduction dynamics at the [4Fe-4S] cluster of the hydrogen-forming cofactor in [FeFe]-hydrogenases.

    Science.gov (United States)

    Senger, Moritz; Mebs, Stefan; Duan, Jifu; Shulenina, Olga; Laun, Konstantin; Kertess, Leonie; Wittkamp, Florian; Apfel, Ulf-Peter; Happe, Thomas; Winkler, Martin; Haumann, Michael; Stripp, Sven T

    2018-01-31

    The [FeFe]-hydrogenases of bacteria and algae are the most efficient hydrogen conversion catalysts in nature. Their active-site cofactor (H-cluster) comprises a [4Fe-4S] cluster linked to a unique diiron site that binds three carbon monoxide (CO) and two cyanide (CN - ) ligands. Understanding microbial hydrogen conversion requires elucidation of the interplay of proton and electron transfer events at the H-cluster. We performed real-time spectroscopy on [FeFe]-hydrogenase protein films under controlled variation of atmospheric gas composition, sample pH, and reductant concentration. Attenuated total reflection Fourier-transform infrared spectroscopy was used to monitor shifts of the CO/CN - vibrational bands in response to redox and protonation changes. Three different [FeFe]-hydrogenases and several protein and cofactor variants were compared, including element and isotopic exchange studies. A protonated equivalent (HoxH) of the oxidized state (Hox) was found, which preferentially accumulated at acidic pH and under reducing conditions. We show that the one-electron reduced state Hred' represents an intrinsically protonated species. Interestingly, the formation of HoxH and Hred' was independent of the established proton pathway to the diiron site. Quantum chemical calculations of the respective CO/CN - infrared band patterns favored a cysteine ligand of the [4Fe-4S] cluster as the protonation site in HoxH and Hred'. We propose that proton-coupled electron transfer facilitates reduction of the [4Fe-4S] cluster and prevents premature formation of a hydride at the catalytic diiron site. Our findings imply that protonation events both at the [4Fe-4S] cluster and at the diiron site of the H-cluster are important in the hydrogen conversion reaction of [FeFe]-hydrogenases.

  20. TILLING in the two-rowed barley cultivar 'Barke' reveals preferred sites of functional diversity in the gene HvHox1

    Directory of Open Access Journals (Sweden)

    Komatsuda Takao

    2009-12-01

    Full Text Available Abstract Background The economic importance of cereals such as barley, and the demand for improved yield and quality require a better understanding of the genetic components that modulate biologically and commercially relevant traits. While Arabidopsis thaliana is the premiere model plant system, the spectrum of its traits cannot address all of the fundamental questions of crop plant development. Unlike Arabidopsis, barley is both a crop and a model system for scientific research, and it is increasingly being used for genetic and molecular investigations into the conserved biological processes of cereals. A common challenge in genetic studies in plants with large genomes arises from the very time-consuming work of associating mutant phenotypes with gene sequence information, especially if insertion mutagenesis is not routine, as in barley. Reverse genetics based on chemical mutagenesis represents the best solution to this obstacle. Findings In barley, we generated a new TILLING (Targeting Local Lesions IN Genomes resource comprising 10,279 M2 mutants in the two-rowed malting cultivar 'Barke,' which has been used in the generation of other genomic resources in barley (~150,000 ESTs, DH mapping population. The value of this new resource was tested using selected candidate genes. An average frequency of approximately one mutation per 0.5 Mb was determined by screening ten fragments of six different genes. The ethyl methanesulphonate (EMSmutagenesis efficiency was studied by recording and relating the mutagenesis-dependent effects found in the three mutant generations (M1-M3. A detailed analysis was performed for the homeodomain-leucine-zipper (HD-ZIP gene HvHox1. Thirty-one mutations were identified by screening a 1,270-bp fragment in 7,348 M2 lines. Three of the newly identified mutants exhibited either a six-rowed or an intermedium-spike phenotype, and one mutant displayed a significantly altered spikelet morphology compared to that of the 'Barke

  1. Cancer classification in the genomic era: five contemporary problems.

    Science.gov (United States)

    Song, Qingxuan; Merajver, Sofia D; Li, Jun Z

    2015-10-19

    Classification is an everyday instinct as well as a full-fledged scientific discipline. Throughout the history of medicine, disease classification is central to how we develop knowledge, make diagnosis, and assign treatment. Here, we discuss the classification of cancer and the process of categorizing cancer subtypes based on their observed clinical and biological features. Traditionally, cancer nomenclature is primarily based on organ location, e.g., "lung cancer" designates a tumor originating in lung structures. Within each organ-specific major type, finer subgroups can be defined based on patient age, cell type, histological grades, and sometimes molecular markers, e.g., hormonal receptor status in breast cancer or microsatellite instability in colorectal cancer. In the past 15+ years, high-throughput technologies have generated rich new data regarding somatic variations in DNA, RNA, protein, or epigenomic features for many cancers. These data, collected for increasingly large tumor cohorts, have provided not only new insights into the biological diversity of human cancers but also exciting opportunities to discover previously unrecognized cancer subtypes. Meanwhile, the unprecedented volume and complexity of these data pose significant challenges for biostatisticians, cancer biologists, and clinicians alike. Here, we review five related issues that represent contemporary problems in cancer taxonomy and interpretation. (1) How many cancer subtypes are there? (2) How can we evaluate the robustness of a new classification system? (3) How are classification systems affected by intratumor heterogeneity and tumor evolution? (4) How should we interpret cancer subtypes? (5) Can multiple classification systems co-exist? While related issues have existed for a long time, we will focus on those aspects that have been magnified by the recent influx of complex multi-omics data. Exploration of these problems is essential for data-driven refinement of cancer classification

  2. Genome-wide analyses and functional classification of proline repeat-rich proteins: potential role of eIF5A in eukaryotic evolution.

    Directory of Open Access Journals (Sweden)

    Ajeet Mandal

    Full Text Available The eukaryotic translation factor, eIF5A has been recently reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro and/or PPG (Pro-Pro-Gly-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids, only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution.

  3. Application of wavelet transform for PDZ domain classification.

    Directory of Open Access Journals (Sweden)

    Khaled Daqrouq

    Full Text Available PDZ domains have been identified as part of an array of signaling proteins that are often unrelated, except for the well-conserved structural PDZ domain they contain. These domains have been linked to many disease processes including common Avian influenza, as well as very rare conditions such as Fraser and Usher syndromes. Historically, based on the interactions and the nature of bonds they form, PDZ domains have most often been classified into one of three classes (class I, class II and others - class III, that is directly dependent on their binding partner. In this study, we report on three unique feature extraction approaches based on the bigram and trigram occurrence and existence rearrangements within the domain's primary amino acid sequences in assisting PDZ domain classification. Wavelet packet transform (WPT and Shannon entropy denoted by wavelet entropy (WE feature extraction methods were proposed. Using 115 unique human and mouse PDZ domains, the existence rearrangement approach yielded a high recognition rate (78.34%, which outperformed our occurrence rearrangements based method. The recognition rate was (81.41% with validation technique. The method reported for PDZ domain classification from primary sequences proved to be an encouraging approach for obtaining consistent classification results. We anticipate that by increasing the database size, we can further improve feature extraction and correct classification.

  4. MicroRNA expression profile in head and neck cancer: HOX-cluster embedded microRNA-196a and microRNA-10b dysregulation implicated in cell proliferation

    International Nuclear Information System (INIS)

    Severino, Patricia; Mathor, Monica Beatriz; Nunes, Fabio Daumas; Ragoussis, Jiannis; Tajara, Eloiza Helena; Brüggemann, Holger; Andreghetto, Flavia Maziero; Camps, Carme; Klingbeil, Maria de Fatima Garrido; Pereira, Welbert Oliveira de; Soares, Renata Machado; Moyses, Raquel; Wünsch-Filho, Victor

    2013-01-01

    Current evidence implicates aberrant microRNA expression patterns in human malignancies; measurement of microRNA expression may have diagnostic and prognostic applications. Roles for microRNAs in head and neck squamous cell carcinomas (HNSCC) are largely unknown. HNSCC, a smoking-related cancer, is one of the most common malignancies worldwide but reliable diagnostic and prognostic markers have not been discovered so far. Some studies have evaluated the potential use of microRNA as biomarkers with clinical application in HNSCC. MicroRNA expression profile of oral squamous cell carcinoma samples was determined by means of DNA microarrays. We also performed gain-of-function assays for two differentially expressed microRNA using two squamous cell carcinoma cell lines and normal oral keratinocytes. The effect of the over-expression of these molecules was evaluated by means of global gene expression profiling and cell proliferation assessment. Altered microRNA expression was detected for a total of 72 microRNAs. Among these we found well studied molecules, such as the miR-17-92 cluster, comprising potent oncogenic microRNA, and miR-34, recently found to interact with p53. HOX-cluster embedded miR-196a/b and miR-10b were up- and down-regulated, respectively, in tumor samples. Since validated HOX gene targets for these microRNAs are not consistently deregulated in HNSCC, we performed gain-of-function experiments, in an attempt to outline their possible role. Our results suggest that both molecules interfere in cell proliferation through distinct processes, possibly targeting a small set of genes involved in cell cycle progression. Functional data on miRNAs in HNSCC is still scarce. Our data corroborate current literature and brings new insights into the role of microRNAs in HNSCC. We also show that miR-196a and miR-10b, not previously associated with HNSCC, may play an oncogenic role in this disease through the deregulation of cell proliferation. The study of micro

  5. Classification of hydrocephalus: critical analysis of classification categories and advantages of "Multi-categorical Hydrocephalus Classification" (Mc HC).

    Science.gov (United States)

    Oi, Shizuo

    2011-10-01

    Hydrocephalus is a complex pathophysiology with disturbed cerebrospinal fluid (CSF) circulation. There are numerous numbers of classification trials published focusing on various criteria, such as associated anomalies/underlying lesions, CSF circulation/intracranial pressure patterns, clinical features, and other categories. However, no definitive classification exists comprehensively to cover the variety of these aspects. The new classification of hydrocephalus, "Multi-categorical Hydrocephalus Classification" (Mc HC), was invented and developed to cover the entire aspects of hydrocephalus with all considerable classification items and categories. Ten categories include "Mc HC" category I: onset (age, phase), II: cause, III: underlying lesion, IV: symptomatology, V: pathophysiology 1-CSF circulation, VI: pathophysiology 2-ICP dynamics, VII: chronology, VII: post-shunt, VIII: post-endoscopic third ventriculostomy, and X: others. From a 100-year search of publication related to the classification of hydrocephalus, 14 representative publications were reviewed and divided into the 10 categories. The Baumkuchen classification graph made from the round o'clock classification demonstrated the historical tendency of deviation to the categories in pathophysiology, either CSF or ICP dynamics. In the preliminary clinical application, it was concluded that "Mc HC" is extremely effective in expressing the individual state with various categories in the past and present condition or among the compatible cases of hydrocephalus along with the possible chronological change in the future.

  6. Classification

    Science.gov (United States)

    Clary, Renee; Wandersee, James

    2013-01-01

    In this article, Renee Clary and James Wandersee describe the beginnings of "Classification," which lies at the very heart of science and depends upon pattern recognition. Clary and Wandersee approach patterns by first telling the story of the "Linnaean classification system," introduced by Carl Linnacus (1707-1778), who is…

  7. Binary classification of protein molecules into intrinsically disordered and ordered segments

    Directory of Open Access Journals (Sweden)

    Gojobori Takashi

    2011-06-01

    Full Text Available Abstract Background Although structural domains in proteins (SDs are important, half of the regions in the human proteome are currently left with no SD assignments. These unassigned regions consist not only of novel SDs, but also of intrinsically disordered (ID regions since proteins, especially those in eukaryotes, generally contain a significant fraction of ID regions. As ID regions can be inferred from amino acid sequences, a method that combines SD and ID region assignments can determine the fractions of SDs and ID regions in any proteome. Results In contrast to other available ID prediction programs that merely identify likely ID regions, the DICHOT system we previously developed classifies the entire protein sequence into SDs and ID regions. Application of DICHOT to the human proteome revealed that residue-wise ID regions constitute 35%, SDs with similarity to PDB structures comprise 52%, while SDs with no similarity to PDB structures account for the remaining 13%. The last group consists of novel structural domains, termed cryptic domains, which serve as good targets of structural genomics. The DICHOT method applied to the proteomes of other model organisms indicated that eukaryotes generally have high ID contents, while prokaryotes do not. In human proteins, ID contents differ among subcellular localizations: nuclear proteins had the highest residue-wise ID fraction (47%, while mitochondrial proteins exhibited the lowest (13%. Phosphorylation and O-linked glycosylation sites were found to be located preferentially in ID regions. As O-linked glycans are attached to residues in the extracellular regions of proteins, the modification is likely to protect the ID regions from proteolytic cleavage in the extracellular environment. Alternative splicing events tend to occur more frequently in ID regions. We interpret this as evidence that natural selection is operating at the protein level in alternative splicing. Conclusions We classified

  8. Low Dimensional Representation of Fisher Vectors for Microscopy Image Classification.

    Science.gov (United States)

    Song, Yang; Li, Qing; Huang, Heng; Feng, Dagan; Chen, Mei; Cai, Weidong

    2017-08-01

    Microscopy image classification is important in various biomedical applications, such as cancer subtype identification, and protein localization for high content screening. To achieve automated and effective microscopy image classification, the representative and discriminative capability of image feature descriptors is essential. To this end, in this paper, we propose a new feature representation algorithm to facilitate automated microscopy image classification. In particular, we incorporate Fisher vector (FV) encoding with multiple types of local features that are handcrafted or learned, and we design a separation-guided dimension reduction method to reduce the descriptor dimension while increasing its discriminative capability. Our method is evaluated on four publicly available microscopy image data sets of different imaging types and applications, including the UCSB breast cancer data set, MICCAI 2015 CBTC challenge data set, and IICBU malignant lymphoma, and RNAi data sets. Our experimental results demonstrate the advantage of the proposed low-dimensional FV representation, showing consistent performance improvement over the existing state of the art and the commonly used dimension reduction techniques.

  9. Chronology of p53 protein accumulation in gastric carcinogenesis

    NARCIS (Netherlands)

    Craanen, M. E.; Blok, P.; Dekker, W.; Offerhaus, G. J.; Tytgat, G. N.

    1995-01-01

    p53 Protein accumulation in early gastric carcinoma was studied in relation to the histological type (Lauren classification) and the type of growth pattern, including the chronology of p53 protein accumulation during carcinogenesis. Forty five, paraffin embedded gastrectomy specimens from early

  10. Preparation of functional lupine protein fractions by dry separation

    NARCIS (Netherlands)

    Pelgrom, P.J.M.; Berghout, J.A.M.; Goot, van der A.J.; Boom, R.M.; Schutyser, M.A.I.

    2014-01-01

    Lupine protein concentrate is a promising ingredient that can be obtained by a combination of milling and air classification, generally called dry fractionation. This is a more sustainable route than conventional wet extraction and delivers a protein concentrate with native functional properties.

  11. TALE factors use two distinct functional modes to control an essential zebrafish gene expression program.

    Science.gov (United States)

    Ladam, Franck; Stanney, William; Donaldson, Ian J; Yildiz, Ozge; Bobola, Nicoletta; Sagerström, Charles G

    2018-06-18

    TALE factors are broadly expressed embryonically and known to function in complexes with transcription factors (TFs) like Hox proteins at gastrula/segmentation stages, but it is unclear if such generally expressed factors act by the same mechanism throughout embryogenesis. We identify a TALE-dependent gene regulatory network (GRN) required for anterior development and detect TALE occupancy associated with this GRN throughout embryogenesis. At blastula stages, we uncover a novel functional mode for TALE factors, where they occupy genomic DECA motifs with nearby NF-Y sites. We demonstrate that TALE and NF-Y form complexes and regulate chromatin state at genes of this GRN. At segmentation stages, GRN-associated TALE occupancy expands to include HEXA motifs near PBX:HOX sites. Hence, TALE factors control a key GRN, but utilize distinct DNA motifs and protein partners at different stages - a strategy that may also explain their oncogenic potential and may be employed by other broadly expressed TFs. © 2018, Ladam et al.

  12. Hand eczema classification

    DEFF Research Database (Denmark)

    Diepgen, T L; Andersen, Klaus Ejner; Brandao, F M

    2008-01-01

    of the disease is rarely evidence based, and a classification system for different subdiagnoses of hand eczema is not agreed upon. Randomized controlled trials investigating the treatment of hand eczema are called for. For this, as well as for clinical purposes, a generally accepted classification system...... A classification system for hand eczema is proposed. Conclusions It is suggested that this classification be used in clinical work and in clinical trials....

  13. Extraction of Protein-Protein Interaction from Scientific Articles by Predicting Dominant Keywords.

    Science.gov (United States)

    Koyabu, Shun; Phan, Thi Thanh Thuy; Ohkawa, Takenao

    2015-01-01

    For the automatic extraction of protein-protein interaction information from scientific articles, a machine learning approach is useful. The classifier is generated from training data represented using several features to decide whether a protein pair in each sentence has an interaction. Such a specific keyword that is directly related to interaction as "bind" or "interact" plays an important role for training classifiers. We call it a dominant keyword that affects the capability of the classifier. Although it is important to identify the dominant keywords, whether a keyword is dominant depends on the context in which it occurs. Therefore, we propose a method for predicting whether a keyword is dominant for each instance. In this method, a keyword that derives imbalanced classification results is tentatively assumed to be a dominant keyword initially. Then the classifiers are separately trained from the instance with and without the assumed dominant keywords. The validity of the assumed dominant keyword is evaluated based on the classification results of the generated classifiers. The assumption is updated by the evaluation result. Repeating this process increases the prediction accuracy of the dominant keyword. Our experimental results using five corpora show the effectiveness of our proposed method with dominant keyword prediction.

  14. HIPPI: highly accurate protein family classification with ensembles of HMMs

    Directory of Open Access Journals (Sweden)

    Nam-phuong Nguyen

    2016-11-01

    Full Text Available Abstract Background Given a new biological sequence, detecting membership in a known family is a basic step in many bioinformatics analyses, with applications to protein structure and function prediction and metagenomic taxon identification and abundance profiling, among others. Yet family identification of sequences that are distantly related to sequences in public databases or that are fragmentary remains one of the more difficult analytical problems in bioinformatics. Results We present a new technique for family identification called HIPPI (Hierarchical Profile Hidden Markov Models for Protein family Identification. HIPPI uses a novel technique to represent a multiple sequence alignment for a given protein family or superfamily by an ensemble of profile hidden Markov models computed using HMMER. An evaluation of HIPPI on the Pfam database shows that HIPPI has better overall precision and recall than blastp, HMMER, and pipelines based on HHsearch, and maintains good accuracy even for fragmentary query sequences and for protein families with low average pairwise sequence identity, both conditions where other methods degrade in accuracy. Conclusion HIPPI provides accurate protein family identification and is robust to difficult model conditions. Our results, combined with observations from previous studies, show that ensembles of profile Hidden Markov models can better represent multiple sequence alignments than a single profile Hidden Markov model, and thus can improve downstream analyses for various bioinformatic tasks. Further research is needed to determine the best practices for building the ensemble of profile Hidden Markov models. HIPPI is available on GitHub at https://github.com/smirarab/sepp .

  15. Classification of the web

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2004-01-01

    This paper discusses the challenges faced by investigations into the classification of the Web and outlines inquiries that are needed to use principles for bibliographic classification to construct classifications of the Web. This paper suggests that the classification of the Web meets challenges...... that call for inquiries into the theoretical foundation of bibliographic classification theory....

  16. Security classification of information

    Energy Technology Data Exchange (ETDEWEB)

    Quist, A.S.

    1993-04-01

    This document is the second of a planned four-volume work that comprehensively discusses the security classification of information. The main focus of Volume 2 is on the principles for classification of information. Included herein are descriptions of the two major types of information that governments classify for national security reasons (subjective and objective information), guidance to use when determining whether information under consideration for classification is controlled by the government (a necessary requirement for classification to be effective), information disclosure risks and benefits (the benefits and costs of classification), standards to use when balancing information disclosure risks and benefits, guidance for assigning classification levels (Top Secret, Secret, or Confidential) to classified information, guidance for determining how long information should be classified (classification duration), classification of associations of information, classification of compilations of information, and principles for declassifying and downgrading information. Rules or principles of certain areas of our legal system (e.g., trade secret law) are sometimes mentioned to .provide added support to some of those classification principles.

  17. Cancer Classification Based on Support Vector Machine Optimized by Particle Swarm Optimization and Artificial Bee Colony.

    Science.gov (United States)

    Gao, Lingyun; Ye, Mingquan; Wu, Changrong

    2017-11-29

    Intelligent optimization algorithms have advantages in dealing with complex nonlinear problems accompanied by good flexibility and adaptability. In this paper, the FCBF (Fast Correlation-Based Feature selection) method is used to filter irrelevant and redundant features in order to improve the quality of cancer classification. Then, we perform classification based on SVM (Support Vector Machine) optimized by PSO (Particle Swarm Optimization) combined with ABC (Artificial Bee Colony) approaches, which is represented as PA-SVM. The proposed PA-SVM method is applied to nine cancer datasets, including five datasets of outcome prediction and a protein dataset of ovarian cancer. By comparison with other classification methods, the results demonstrate the effectiveness and the robustness of the proposed PA-SVM method in handling various types of data for cancer classification.

  18. Classification of intestinal lymphangiectasia with protein-losing enteropathy: white villi type and non-white villi type.

    Science.gov (United States)

    Ohmiya, Naoki; Nakamura, Masanao; Yamamura, Takeshi; Yamada, Koji; Nagura, Asuka; Yoshimura, Toru; Hirooka, Yoshiki; Hirata, Ichiro; Goto, Hidemi

    2014-01-01

    We classified intestinal lymphangiectasia (IL) into two categories, the white and non-white villi types, and evaluated their clinical characteristics and therapeutic responses. Of the 988 patients who underwent double-balloon enteroscopy, 14 consecutive patients (7 men and 7 women, median age at onset 34 years) were enrolled with immunohistochemically confirmed IL with protein-losing enteropathy. Enteroscopically the white villi type (n = 8) showed white plaques and white-tipped villi were scattered in the small bowel, while non-white villi type (n = 6) showed that apparently normal but under more detailed observation, low and round villi with a normal color were diffused. The serum albumin levels and fecal α1-antitrypsin clearance before treatment were significantly worse in the non-white villi type (p = 0.017 and 0.039, respectively), whereas the serum immunoglobulin A and M levels were significantly lower in the white villi type (p = 0.010 and 0.046, respectively). At gastroscopy, a non-cirrhotic snakeskin appearance was significantly observed in the non-white villi type (p = 0.015). The corticosteroid response was better in the non-white villi type (p = 0.015). Two distinct subgroups were found in IL. This classification was useful in pathophysiological clustering and in predicting the therapeutic response. © 2014 S. Karger AG, Basel.

  19. Dry fractionation for production of functional pea protein concentrates

    NARCIS (Netherlands)

    Pelgrom, P.J.M.; Vissers, A.M.; Boom, R.M.; Schutyser, M.A.I.

    2013-01-01

    Dry milling in combination with air classification was evaluated as an alternative to conventional wet extraction of protein from yellow field peas (Pisum sativum). Major advantages of dry fractionation are retention of native functionality of proteins and its lower energy and water use. Peas were

  20. Phylogenetic classification and the universal tree.

    Science.gov (United States)

    Doolittle, W F

    1999-06-25

    From comparative analyses of the nucleotide sequences of genes encoding ribosomal RNAs and several proteins, molecular phylogeneticists have constructed a "universal tree of life," taking it as the basis for a "natural" hierarchical classification of all living things. Although confidence in some of the tree's early branches has recently been shaken, new approaches could still resolve many methodological uncertainties. More challenging is evidence that most archaeal and bacterial genomes (and the inferred ancestral eukaryotic nuclear genome) contain genes from multiple sources. If "chimerism" or "lateral gene transfer" cannot be dismissed as trivial in extent or limited to special categories of genes, then no hierarchical universal classification can be taken as natural. Molecular phylogeneticists will have failed to find the "true tree," not because their methods are inadequate or because they have chosen the wrong genes, but because the history of life cannot properly be represented as a tree. However, taxonomies based on molecular sequences will remain indispensable, and understanding of the evolutionary process will ultimately be enriched, not impoverished.

  1. Hazard classification methodology

    International Nuclear Information System (INIS)

    Brereton, S.J.

    1996-01-01

    This document outlines the hazard classification methodology used to determine the hazard classification of the NIF LTAB, OAB, and the support facilities on the basis of radionuclides and chemicals. The hazard classification determines the safety analysis requirements for a facility

  2. Prediction of human protein function according to Gene Ontology categories

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Gupta, Ramneek; Stærfeldt, Hans Henrik

    2003-01-01

    developed a method for prediction of protein function for a subset of classes from the Gene Ontology classification scheme. This subset includes several pharmaceutically interesting categories-transcription factors, receptors, ion channels, stress and immune response proteins, hormones and growth factors...

  3. Hox paralog group 2 genes control the migration of mouse pontine neurons through slit-robo signaling.

    Directory of Open Access Journals (Sweden)

    Marc J Geisen

    2008-06-01

    Full Text Available The pontine neurons (PN represent a major source of mossy fiber projections to the cerebellum. During mouse hindbrain development, PN migrate tangentially and sequentially along both the anteroposterior (AP and dorsoventral (DV axes. Unlike DV migration, which is controlled by the Netrin-1/Dcc attractive pathway, little is known about the molecular mechanisms guiding PN migration along the AP axis. Here, we show that Hoxa2 and Hoxb2 are required both intrinsically and extrinsically to maintain normal AP migration of subsets of PN, by preventing their premature ventral attraction towards the midline. Moreover, the migration defects observed in Hoxa2 and Hoxb2 mutant mice were phenocopied in compound Robo1;Robo2, Slit1;Slit2, and Robo2;Slit2 knockout animals, indicating that these guidance molecules act downstream of Hox genes to control PN migration. Indeed, using chromatin immunoprecipitation assays, we further demonstrated that Robo2 is a direct target of Hoxa2 in vivo and that maintenance of high Robo and Slit expression levels was impaired in Hoxa2 mutant mice. Lastly, the analysis of Phox2b-deficient mice indicated that the facial motor nucleus is a major Slit signaling source required to prevent premature ventral migration of PN. These findings provide novel insights into the molecular control of neuronal migration from transcription factor to regulation of guidance receptor and ligand expression. Specifically, they address the question of how exposure to multiple guidance cues along the AP and DV axes is regulated at the transcriptional level and in turn translated into stereotyped migratory responses during tangential migration of neurons in the developing mammalian brain.

  4. Classification of EC 3.1.1.3 bacterial true lipases using phylogenetic ...

    African Journals Online (AJOL)

    To obtain an overview of this industrially and very important class of enzymes and their characteristics, we collected and classified bacterial lipases sequences available from protein databases. Here we proposed an updated and revised classification of family I bacterial "true" lipases based mainly on a comparison of their ...

  5. Empirical investigations into full-text protein interaction Article Categorization Task (ACT) in the BioCreative II.5 Challenge.

    Science.gov (United States)

    Lan, Man; Su, Jian

    2010-01-01

    The selection of protein interaction documents is one important application for biology research and has a direct impact on the quality of downstream BioNLP applications, i.e., information extraction and retrieval, summarization, QA, etc. The BioCreative II.5 Challenge Article Categorization task (ACT) involves doing a binary text classification to determine whether a given structured full-text article contains protein interaction information. This may be the first attempt at classification of full-text protein interaction documents in wide community. In this paper, we compare and evaluate the effectiveness of different section types in full-text articles for text classification. Moreover, in practice, the less number of true-positive samples results in unstable performance and unreliable classifier trained on it. Previous research on learning with skewed class distributions has altered the class distribution using oversampling and downsampling. We also investigate the skewed protein interaction classification and analyze the effect of various issues related to the choice of external sources, oversampling training sets, classifiers, etc. We report on the various factors above to show that 1) a full-text biomedical article contains a wealth of scientific information important to users that may not be completely represented by abstracts and/or keywords, which improves the accuracy performance of classification and 2) reinforcing true-positive samples significantly increases the accuracy and stability performance of classification.

  6. Classification, disease, and diagnosis.

    Science.gov (United States)

    Jutel, Annemarie

    2011-01-01

    Classification shapes medicine and guides its practice. Understanding classification must be part of the quest to better understand the social context and implications of diagnosis. Classifications are part of the human work that provides a foundation for the recognition and study of illness: deciding how the vast expanse of nature can be partitioned into meaningful chunks, stabilizing and structuring what is otherwise disordered. This article explores the aims of classification, their embodiment in medical diagnosis, and the historical traditions of medical classification. It provides a brief overview of the aims and principles of classification and their relevance to contemporary medicine. It also demonstrates how classifications operate as social framing devices that enable and disable communication, assert and refute authority, and are important items for sociological study.

  7. Standard classification: Physics

    International Nuclear Information System (INIS)

    1977-01-01

    This is a draft standard classification of physics. The conception is based on the physics part of the systematic catalogue of the Bayerische Staatsbibliothek and on the classification given in standard textbooks. The ICSU-AB classification now used worldwide by physics information services was not taken into account. (BJ) [de

  8. Effective Feature Selection for Classification of Promoter Sequences.

    Directory of Open Access Journals (Sweden)

    Kouser K

    Full Text Available Exploring novel computational methods in making sense of biological data has not only been a necessity, but also productive. A part of this trend is the search for more efficient in silico methods/tools for analysis of promoters, which are parts of DNA sequences that are involved in regulation of expression of genes into other functional molecules. Promoter regions vary greatly in their function based on the sequence of nucleotides and the arrangement of protein-binding short-regions called motifs. In fact, the regulatory nature of the promoters seems to be largely driven by the selective presence and/or the arrangement of these motifs. Here, we explore computational classification of promoter sequences based on the pattern of motif distributions, as such classification can pave a new way of functional analysis of promoters and to discover the functionally crucial motifs. We make use of Position Specific Motif Matrix (PSMM features for exploring the possibility of accurately classifying promoter sequences using some of the popular classification techniques. The classification results on the complete feature set are low, perhaps due to the huge number of features. We propose two ways of reducing features. Our test results show improvement in the classification output after the reduction of features. The results also show that decision trees outperform SVM (Support Vector Machine, KNN (K Nearest Neighbor and ensemble classifier LibD3C, particularly with reduced features. The proposed feature selection methods outperform some of the popular feature transformation methods such as PCA and SVD. Also, the methods proposed are as accurate as MRMR (feature selection method but much faster than MRMR. Such methods could be useful to categorize new promoters and explore regulatory mechanisms of gene expressions in complex eukaryotic species.

  9. Feature selection and nearest centroid classification for protein mass spectrometry

    Directory of Open Access Journals (Sweden)

    Levner Ilya

    2005-03-01

    Full Text Available Abstract Background The use of mass spectrometry as a proteomics tool is poised to revolutionize early disease diagnosis and biomarker identification. Unfortunately, before standard supervised classification algorithms can be employed, the "curse of dimensionality" needs to be solved. Due to the sheer amount of information contained within the mass spectra, most standard machine learning techniques cannot be directly applied. Instead, feature selection techniques are used to first reduce the dimensionality of the input space and thus enable the subsequent use of classification algorithms. This paper examines feature selection techniques for proteomic mass spectrometry. Results This study examines the performance of the nearest centroid classifier coupled with the following feature selection algorithms. Student-t test, Kolmogorov-Smirnov test, and the P-test are univariate statistics used for filter-based feature ranking. From the wrapper approaches we tested sequential forward selection and a modified version of sequential backward selection. Embedded approaches included shrunken nearest centroid and a novel version of boosting based feature selection we developed. In addition, we tested several dimensionality reduction approaches, namely principal component analysis and principal component analysis coupled with linear discriminant analysis. To fairly assess each algorithm, evaluation was done using stratified cross validation with an internal leave-one-out cross-validation loop for automated feature selection. Comprehensive experiments, conducted on five popular cancer data sets, revealed that the less advocated sequential forward selection and boosted feature selection algorithms produce the most consistent results across all data sets. In contrast, the state-of-the-art performance reported on isolated data sets for several of the studied algorithms, does not hold across all data sets. Conclusion This study tested a number of popular feature

  10. K-nearest uphill clustering in the protein structure space

    KAUST Repository

    Cui, Xuefeng

    2016-08-26

    The protein structure classification problem, which is to assign a protein structure to a cluster of similar proteins, is one of the most fundamental problems in the construction and application of the protein structure space. Early manually curated protein structure classifications (e.g., SCOP and CATH) are very successful, but recently suffer the slow updating problem because of the increased throughput of newly solved protein structures. Thus, fully automatic methods to cluster proteins in the protein structure space have been designed and developed. In this study, we observed that the SCOP superfamilies are highly consistent with clustering trees representing hierarchical clustering procedures, but the tree cutting is very challenging and becomes the bottleneck of clustering accuracy. To overcome this challenge, we proposed a novel density-based K-nearest uphill clustering method that effectively eliminates noisy pairwise protein structure similarities and identifies density peaks as cluster centers. Specifically, the density peaks are identified based on K-nearest uphills (i.e., proteins with higher densities) and K-nearest neighbors. To our knowledge, this is the first attempt to apply and develop density-based clustering methods in the protein structure space. Our results show that our density-based clustering method outperforms the state-of-the-art clustering methods previously applied to the problem. Moreover, we observed that computational methods and human experts could produce highly similar clusters at high precision values, while computational methods also suggest to split some large superfamilies into smaller clusters. © 2016 Elsevier B.V.

  11. Experimental data suggesting that inflammation mediated rat liver mitochondrial dysfunction results from secondary hypoxia rather than from direct effects of inflammatory mediators

    Directory of Open Access Journals (Sweden)

    Adelheid eWeidinger

    2013-06-01

    Full Text Available Systemic inflammatory response (SIR comprises direct effects of inflammatory mediators (IM and indirect effects, such as secondary circulatory failure which results in tissue hypoxia (HOX. These two key components, SIR and HOX, cause multiple organ failure (MOF. Since HOX and IM occur and interact simultaneously in vivo, it is difficult to clarify their individual pathological impact. To eliminate this interaction, precision cut liver slices (PCLS were used in this study aiming to dissect the effects of HOX and IM on mitochondrial function, integrity of cellular membrane and the expression of genes associated with inflammation. HOX was induced by incubating PCLS or rat liver mitochondria at pO2<1% followed by reoxygenation (HOX/ROX model. Inflammatory injury was stimulated by incubating PCLS with IM (IM model. We found upregulation of inducible nitric oxide synthase (iNOS expression only in the IM model, while heme oxygenase 1 (HO-1 expression was upregulated only in the HOX/ROX model. Elevated expression of interleukin 6 (IL-6 was found in both models reflecting converging pathways regulating the expression of this gene. Both models caused damage to hepatocytes resulting in the release of alanine aminotransferase (ALT. The leakage of aspartate aminotransferase (AST was observed only during the hypoxic phase in the HOX/ROX model. The reoxygenation phase of HOX, but not IM, drastically impaired mitochondrial electron supply via complex I and II. Additional experiments performed with isolated mitochondria showed that free iron, released during HOX, is likely a key prerequisite of mitochondrial dysfunction induced during the reoxygenation phase. Our data suggests that mitochondrial dysfunction, previously observed in in vivo SIR-models is the result of secondary circulatory failure inducing HOX rather than the result of a direct interaction of IM with liver cells.

  12. The paradox of atheoretical classification

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2016-01-01

    A distinction can be made between “artificial classifications” and “natural classifications,” where artificial classifications may adequately serve some limited purposes, but natural classifications are overall most fruitful by allowing inference and thus many different purposes. There is strong...... support for the view that a natural classification should be based on a theory (and, of course, that the most fruitful theory provides the most fruitful classification). Nevertheless, atheoretical (or “descriptive”) classifications are often produced. Paradoxically, atheoretical classifications may...... be very successful. The best example of a successful “atheoretical” classification is probably the prestigious Diagnostic and Statistical Manual of Mental Disorders (DSM) since its third edition from 1980. Based on such successes one may ask: Should the claim that classifications ideally are natural...

  13. COMe: the ontology of bioinorganic proteins

    Directory of Open Access Journals (Sweden)

    Contrino Sergio

    2004-02-01

    Full Text Available Abstract Background Many characterised proteins contain metal ions, small organic molecules or modified residues. In contrast, the huge amount of data generated by genome projects consists exclusively of sequences with almost no annotation. One of the goals of the structural genomics initiative is to provide representative three-dimensional (3-D structures for as many protein/domain folds as possible to allow successful homology modelling. However, important functional features such as metal co-ordination or a type of prosthetic group are not always conserved in homologous proteins. So far, the problem of correct annotation of bioinorganic proteins has been largely ignored by the bioinformatics community and information on bioinorganic centres obtained by methods other than crystallography or NMR is only available in literature databases. Results COMe (Co-Ordination of Metals represents the ontology for bioinorganic and other small molecule centres in complex proteins. COMe consists of three types of entities: 'bioinorganic motif' (BIM, 'molecule' (MOL, and 'complex proteins' (PRX, with each entity being assigned a unique identifier. A BIM consists of at least one centre (metal atom, inorganic cluster, organic molecule and two or more endogenous and/or exogenous ligands. BIMs are represented as one-dimensional (1-D strings and 2-D diagrams. A MOL entity represents a 'small molecule' which, when in complex with one or more polypeptides, forms a functional protein. The PRX entities refer to the functional proteins as well as to separate protein domains and subunits. The complex proteins in COMe are subdivided into three categories: (i metalloproteins, (ii organic prosthetic group proteins and (iii modified amino acid proteins. The data are currently stored in both XML format and a relational database and are available at http://www.ebi.ac.uk/come/. Conclusion COMe provides the classification of proteins according to their 'bioinorganic' features

  14. Monitoring nanotechnology using patent classifications: an overview and comparison of nanotechnology classification schemes

    Energy Technology Data Exchange (ETDEWEB)

    Jürgens, Björn, E-mail: bjurgens@agenciaidea.es [Agency of Innovation and Development of Andalusia, CITPIA PATLIB Centre (Spain); Herrero-Solana, Victor, E-mail: victorhs@ugr.es [University of Granada, SCImago-UGR (SEJ036) (Spain)

    2017-04-15

    Patents are an essential information source used to monitor, track, and analyze nanotechnology. When it comes to search nanotechnology-related patents, a keyword search is often incomplete and struggles to cover such an interdisciplinary discipline. Patent classification schemes can reveal far better results since they are assigned by experts who classify the patent documents according to their technology. In this paper, we present the most important classifications to search nanotechnology patents and analyze how nanotechnology is covered in the main patent classification systems used in search systems nowadays: the International Patent Classification (IPC), the United States Patent Classification (USPC), and the Cooperative Patent Classification (CPC). We conclude that nanotechnology has a significantly better patent coverage in the CPC since considerable more nanotechnology documents were retrieved than by using other classifications, and thus, recommend its use for all professionals involved in nanotechnology patent searches.

  15. Monitoring nanotechnology using patent classifications: an overview and comparison of nanotechnology classification schemes

    International Nuclear Information System (INIS)

    Jürgens, Björn; Herrero-Solana, Victor

    2017-01-01

    Patents are an essential information source used to monitor, track, and analyze nanotechnology. When it comes to search nanotechnology-related patents, a keyword search is often incomplete and struggles to cover such an interdisciplinary discipline. Patent classification schemes can reveal far better results since they are assigned by experts who classify the patent documents according to their technology. In this paper, we present the most important classifications to search nanotechnology patents and analyze how nanotechnology is covered in the main patent classification systems used in search systems nowadays: the International Patent Classification (IPC), the United States Patent Classification (USPC), and the Cooperative Patent Classification (CPC). We conclude that nanotechnology has a significantly better patent coverage in the CPC since considerable more nanotechnology documents were retrieved than by using other classifications, and thus, recommend its use for all professionals involved in nanotechnology patent searches.

  16. Artificial neural network classification using a minimal training set - Comparison to conventional supervised classification

    Science.gov (United States)

    Hepner, George F.; Logan, Thomas; Ritter, Niles; Bryant, Nevin

    1990-01-01

    Recent research has shown an artificial neural network (ANN) to be capable of pattern recognition and the classification of image data. This paper examines the potential for the application of neural network computing to satellite image processing. A second objective is to provide a preliminary comparison and ANN classification. An artificial neural network can be trained to do land-cover classification of satellite imagery using selected sites representative of each class in a manner similar to conventional supervised classification. One of the major problems associated with recognition and classifications of pattern from remotely sensed data is the time and cost of developing a set of training sites. This reseach compares the use of an ANN back propagation classification procedure with a conventional supervised maximum likelihood classification procedure using a minimal training set. When using a minimal training set, the neural network is able to provide a land-cover classification superior to the classification derived from the conventional classification procedure. This research is the foundation for developing application parameters for further prototyping of software and hardware implementations for artificial neural networks in satellite image and geographic information processing.

  17. Small-scale classification schemes

    DEFF Research Database (Denmark)

    Hertzum, Morten

    2004-01-01

    Small-scale classification schemes are used extensively in the coordination of cooperative work. This study investigates the creation and use of a classification scheme for handling the system requirements during the redevelopment of a nation-wide information system. This requirements...... classification inherited a lot of its structure from the existing system and rendered requirements that transcended the framework laid out by the existing system almost invisible. As a result, the requirements classification became a defining element of the requirements-engineering process, though its main...... effects remained largely implicit. The requirements classification contributed to constraining the requirements-engineering process by supporting the software engineers in maintaining some level of control over the process. This way, the requirements classification provided the software engineers...

  18. Molecular classification of liver cirrhosis in a rat model by proteomics and bioinformatics.

    Science.gov (United States)

    Xu, Xiu-Qin; Leow, Chon K; Lu, Xin; Zhang, Xuegong; Liu, Jun S; Wong, Wing-Hung; Asperger, Arndt; Deininger, Sören; Eastwood Leung, Hon-Chiu

    2004-10-01

    Liver cirrhosis is a worldwide health problem. Reliable, noninvasive methods for early detection of liver cirrhosis are not available. Using a three-step approach, we classified sera from rats with liver cirrhosis following different treatment insults. The approach consisted of: (i) protein profiling using surface-enhanced laser desorption/ionization (SELDI) technology; (ii) selection of a statistically significant serum biomarker set using machine learning algorithms; and (iii) identification of selected serum biomarkers by peptide sequencing. We generated serum protein profiles from three groups of rats: (i) normal (n=8), (ii) thioacetamide-induced liver cirrhosis (n=22), and (iii) bile duct ligation-induced liver fibrosis (n=5) using a weak cation exchanger surface. Profiling data were further analyzed by a recursive support vector machine algorithm to select a panel of statistically significant biomarkers for class prediction. Sensitivity and specificity of classification using the selected protein marker set were higher than 92%. A consistently down-regulated 3495 Da protein in cirrhosis samples was one of the selected significant biomarkers. This 3495 Da protein was purified on-chip and trypsin digested. Further structural characterization of this biomarkers candidate was done by using cross-platform matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS) peptide mass fingerprinting (PMF) and matrix-assisted laser desorption/ionization time of flight/time of flight (MALDI-TOF/TOF) tandem mass spectrometry (MS/MS). Combined data from PMF and MS/MS spectra of two tryptic peptides suggested that this 3495 Da protein shared homology to a histidine-rich glycoprotein. These results demonstrated a novel approach to discovery of new biomarkers for early detection of liver cirrhosis and classification of liver diseases.

  19. Database Description - Yeast Interacting Proteins Database | LSDB Archive [Life Science Database Archive metadata

    Lifescience Database Archive (English)

    Full Text Available List Contact us Yeast Interacting Proteins Database Database Description General information of database Database... name Yeast Interacting Proteins Database Alternative name - DOI 10.18908/lsdba.nbdc00742-000 Creator C...-ken 277-8561 Tel: +81-4-7136-3989 FAX: +81-4-7136-3979 E-mail : Database classif...s cerevisiae Taxonomy ID: 4932 Database description Information on interactions and related information obta...l Acad Sci U S A. 2001 Apr 10;98(8):4569-74. Epub 2001 Mar 13. External Links: Original website information Database

  20. Gynecomastia Classification for Surgical Management: A Systematic Review and Novel Classification System.

    Science.gov (United States)

    Waltho, Daniel; Hatchell, Alexandra; Thoma, Achilleas

    2017-03-01

    Gynecomastia is a common deformity of the male breast, where certain cases warrant surgical management. There are several surgical options, which vary depending on the breast characteristics. To guide surgical management, several classification systems for gynecomastia have been proposed. A systematic review was performed to (1) identify all classification systems for the surgical management of gynecomastia, and (2) determine the adequacy of these classification systems to appropriately categorize the condition for surgical decision-making. The search yielded 1012 articles, and 11 articles were included in the review. Eleven classification systems in total were ascertained, and a total of 10 unique features were identified: (1) breast size, (2) skin redundancy, (3) breast ptosis, (4) tissue predominance, (5) upper abdominal laxity, (6) breast tuberosity, (7) nipple malposition, (8) chest shape, (9) absence of sternal notch, and (10) breast skin elasticity. On average, classification systems included two or three of these features. Breast size and ptosis were the most commonly included features. Based on their review of the current classification systems, the authors believe the ideal classification system should be universal and cater to all causes of gynecomastia; be surgically useful and easy to use; and should include a comprehensive set of clinically appropriate patient-related features, such as breast size, breast ptosis, tissue predominance, and skin redundancy. None of the current classification systems appears to fulfill these criteria.

  1. Incorporating functional inter-relationships into protein function prediction algorithms

    Directory of Open Access Journals (Sweden)

    Kumar Vipin

    2009-05-01

    Full Text Available Abstract Background Functional classification schemes (e.g. the Gene Ontology that serve as the basis for annotation efforts in several organisms are often the source of gold standard information for computational efforts at supervised protein function prediction. While successful function prediction algorithms have been developed, few previous efforts have utilized more than the protein-to-functional class label information provided by such knowledge bases. For instance, the Gene Ontology not only captures protein annotations to a set of functional classes, but it also arranges these classes in a DAG-based hierarchy that captures rich inter-relationships between different classes. These inter-relationships present both opportunities, such as the potential for additional training examples for small classes from larger related classes, and challenges, such as a harder to learn distinction between similar GO terms, for standard classification-based approaches. Results We propose a method to enhance the performance of classification-based protein function prediction algorithms by addressing the issue of using these interrelationships between functional classes constituting functional classification schemes. Using a standard measure for evaluating the semantic similarity between nodes in an ontology, we quantify and incorporate these inter-relationships into the k-nearest neighbor classifier. We present experiments on several large genomic data sets, each of which is used for the modeling and prediction of over hundred classes from the GO Biological Process ontology. The results show that this incorporation produces more accurate predictions for a large number of the functional classes considered, and also that the classes benefitted most by this approach are those containing the fewest members. In addition, we show how our proposed framework can be used for integrating information from the entire GO hierarchy for improving the accuracy of

  2. Information gathering for CLP classification

    Directory of Open Access Journals (Sweden)

    Ida Marcello

    2011-01-01

    Full Text Available Regulation 1272/2008 includes provisions for two types of classification: harmonised classification and self-classification. The harmonised classification of substances is decided at Community level and a list of harmonised classifications is included in the Annex VI of the classification, labelling and packaging Regulation (CLP. If a chemical substance is not included in the harmonised classification list it must be self-classified, based on available information, according to the requirements of Annex I of the CLP Regulation. CLP appoints that the harmonised classification will be performed for carcinogenic, mutagenic or toxic to reproduction substances (CMR substances and for respiratory sensitisers category 1 and for other hazard classes on a case-by-case basis. The first step of classification is the gathering of available and relevant information. This paper presents the procedure for gathering information and to obtain data. The data quality is also discussed.

  3. Systematic classification of the His-Me finger superfamily.

    Science.gov (United States)

    Jablonska, Jagoda; Matelska, Dorota; Steczkiewicz, Kamil; Ginalski, Krzysztof

    2017-11-16

    The His-Me finger endonucleases, also known as HNH or ββα-metal endonucleases, form a large and diverse protein superfamily. The His-Me finger domain can be found in proteins that play an essential role in cells, including genome maintenance, intron homing, host defense and target offense. Its overall structural compactness and non-specificity make it a perfectly-tailored pathogenic module that participates on both sides of inter- and intra-organismal competition. An extremely low sequence similarity across the superfamily makes it difficult to identify and classify new His-Me fingers. Using state-of-the-art distant homology detection methods, we provide an updated and systematic classification of His-Me finger proteins. In this work, we identified over 100 000 proteins and clustered them into 38 groups, of which three groups are new and cannot be found in any existing public domain database of protein families. Based on an analysis of sequences, structures, domain architectures, and genomic contexts, we provide a careful functional annotation of the poorly characterized members of this superfamily. Our results may inspire further experimental investigations that should address the predicted activity and clarify the potential substrates, to provide more detailed insights into the fundamental biological roles of these proteins. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Parabolic section and distance excess of space curves applied to protein structure classification

    DEFF Research Database (Denmark)

    Røgen, Peter; Karlsson, Per W.

    2008-01-01

    Proteins are long chain molecules that fold up into beautiful and complicated three-dimensional structures before fulfilling their biological functions in the living organisms. With the aim of providing an efficient tool for describing the proteins' native folds, we present a global geometric mea...... measure of a space curve. This geometric measure allows us to define descriptors of protein structures that quantify how parallel the secondary structure elements of a protein are. These descriptors are C-2 in deformations of the protein structure, are evaluated very fast and reliably...

  5. Mechanical stretching of proteins-a theoretical survey of the Protein Data Bank

    International Nuclear Information System (INIS)

    Sulkowska, Joanna I; Cieplak, Marek

    2007-01-01

    The mechanical stretching of single proteins has been studied experimentally for about 50 proteins, yielding a variety of force patterns and peak forces. Here we perform a theoretical survey of proteins of known native structure and map out the landscape of possible dynamical behaviours under stretching at constant speed. We consider 7510 proteins comprising not more than 150 amino acids and 239 longer proteins. The model used is constructed based on the native geometry. It is solved by methods of molecular dynamics and validated by comparing the theoretical predictions to experimental results. We characterize the distribution of peak forces and investigate correlations with the system size and with the structure classification as characterized by the CATH scheme. Despite the presence of such correlations, proteins with the same CATH index may belong to different classes of dynamical behaviour. We identify proteins with the biggest forces and show that they belong to few topology classes. We determine which protein segments act as mechanical clamps and show that, in most cases, they correspond to long stretches of parallel β-strands, but other mechanisms are also possible. (topical review)

  6. A systematic classification of Plasmodium falciparum P-loop NTPases: structural and functional correlation

    Directory of Open Access Journals (Sweden)

    Chauhan Virander S

    2009-04-01

    Full Text Available Abstract Background The P-loop NTPases constitute one of the largest groups of globular protein domains that play highly diverse functional roles in most of the organisms. Even with the availability of nearly 300 different Hidden Markov Models representing the P-loop NTPase superfamily, not many P-loop NTPases are known in Plasmodium falciparum. A number of characteristic attributes of the genome have resulted into the lack of knowledge about this functionally diverse, but important class of proteins. Method In the study, protein sequences with characteristic motifs of NTPase domain (Walker A and Walker B are computationally extracted from the P. falciparum database. A detailed secondary structure analysis, functional classification, phylogenetic and orthology studies of the NTPase domain of repertoire of 97 P. falciparum P-loop NTPases is carried out. Results Based upon distinct sequence features and secondary structure profile of the P-loop domain of obtained sequences, a cladistic classification is also conceded: nucleotide kinases and GTPases, ABC and SMC family, SF1/2 helicases, AAA+ and AAA protein families. Attempts are made to identify any ortholog(s for each of these proteins in other Plasmodium sp. as well as its vertebrate host, Homo sapiens. A number of P. falciparum P-loop NTPases that have no homologue in the host, as well as those annotated as hypothetical proteins and lack any characteristic functional domain are identified. Conclusion The study suggests a strong correlation between sequence and secondary structure profile of P-loop domains and functional roles of these proteins and thus provides an opportunity to speculate the role of many hypothetical proteins. The study provides a methodical framework for the characterization of biologically diverse NTPases in the P. falciparum genome. The efforts made in the analysis are first of its kind; and the results augment to explore the functional role of many of these proteins from

  7. ASIST SIG/CR Classification Workshop 2000: Classification for User Support and Learning.

    Science.gov (United States)

    Soergel, Dagobert

    2001-01-01

    Reports on papers presented at the 62nd Annual Meeting of ASIST (American Society for Information Science and Technology) for the Special Interest Group in Classification Research (SIG/CR). Topics include types of knowledge; developing user-oriented classifications, including domain analysis; classification in the user interface; and automatic…

  8. Couinaud's classification v.s. Cho's classification. Their feasibility in the right hepatic lobe

    International Nuclear Information System (INIS)

    Shioyama, Yasukazu; Ikeda, Hiroaki; Sato, Motohito; Yoshimi, Fuyo; Kishi, Kazushi; Sato, Morio; Kimura, Masashi

    2008-01-01

    The objective of this study was to investigate if the new classification system proposed by Cho is feasible to clinical usage comparing with the classical Couinaud's one. One hundred consecutive cases of abdominal CT were studied using a 64 or an 8 slice multislice CT and created three dimensional portal vein images for analysis by the Workstation. We applied both Cho's classification and the classical Couinaud's one for each cases according to their definitions. Three diagnostic radiologists assessed their feasibility as category one (unable to classify) to five (clear to classify with total suit with the original classification criteria). And in each cases, we tried to judge whether Cho's or the classical Couinaud' classification could more easily transmit anatomical information. Analyzers could classified portal veins clearly (category 5) in 77 to 80% of cases and clearly (category 5) or almost clearly (category 4) in 86-93% along with both classifications. In the feasibility of classification, there was no statistically significant difference between two classifications. In 15 cases we felt that using Couinaud's classification is more convenient for us to transmit anatomical information to physicians than using Cho's one, because in these cases we noticed two large portal veins ramify from right main portal vein cranialy and caudaly and then we could not classify P5 as a branch of antero-ventral segment (AVS). Conversely in 17 cases we felt Cho's classification is more convenient because we could not divide right posterior branch as P6 and P7 and in these cases the right posterior portal vein ramified to several small branches. The anterior fissure vein was clearly noticed in only 60 cases. Comparing the classical Couinaud's classification and Cho's one in feasility of classification, there was no statistically significant difference. We propose we routinely report hepatic anatomy with the classical Couinauds classification and in the preoperative cases we

  9. HPSLPred: An Ensemble Multi-Label Classifier for Human Protein Subcellular Location Prediction with Imbalanced Source.

    Science.gov (United States)

    Wan, Shixiang; Duan, Yucong; Zou, Quan

    2017-09-01

    Predicting the subcellular localization of proteins is an important and challenging problem. Traditional experimental approaches are often expensive and time-consuming. Consequently, a growing number of research efforts employ a series of machine learning approaches to predict the subcellular location of proteins. There are two main challenges among the state-of-the-art prediction methods. First, most of the existing techniques are designed to deal with multi-class rather than multi-label classification, which ignores connections between multiple labels. In reality, multiple locations of particular proteins imply that there are vital and unique biological significances that deserve special focus and cannot be ignored. Second, techniques for handling imbalanced data in multi-label classification problems are necessary, but never employed. For solving these two issues, we have developed an ensemble multi-label classifier called HPSLPred, which can be applied for multi-label classification with an imbalanced protein source. For convenience, a user-friendly webserver has been established at http://server.malab.cn/HPSLPred. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  10. Large-scale gene function analysis with the PANTHER classification system.

    Science.gov (United States)

    Mi, Huaiyu; Muruganujan, Anushya; Casagrande, John T; Thomas, Paul D

    2013-08-01

    The PANTHER (protein annotation through evolutionary relationship) classification system (http://www.pantherdb.org/) is a comprehensive system that combines gene function, ontology, pathways and statistical analysis tools that enable biologists to analyze large-scale, genome-wide data from sequencing, proteomics or gene expression experiments. The system is built with 82 complete genomes organized into gene families and subfamilies, and their evolutionary relationships are captured in phylogenetic trees, multiple sequence alignments and statistical models (hidden Markov models or HMMs). Genes are classified according to their function in several different ways: families and subfamilies are annotated with ontology terms (Gene Ontology (GO) and PANTHER protein class), and sequences are assigned to PANTHER pathways. The PANTHER website includes a suite of tools that enable users to browse and query gene functions, and to analyze large-scale experimental data with a number of statistical tests. It is widely used by bench scientists, bioinformaticians, computer scientists and systems biologists. In the 2013 release of PANTHER (v.8.0), in addition to an update of the data content, we redesigned the website interface to improve both user experience and the system's analytical capability. This protocol provides a detailed description of how to analyze genome-wide experimental data with the PANTHER classification system.

  11. An efficient heuristic method for active feature acquisition and its application to protein-protein interaction prediction

    Directory of Open Access Journals (Sweden)

    Thahir Mohamed

    2012-11-01

    Full Text Available Abstract Background Machine learning approaches for classification learn the pattern of the feature space of different classes, or learn a boundary that separates the feature space into different classes. The features of the data instances are usually available, and it is only the class-labels of the instances that are unavailable. For example, to classify text documents into different topic categories, the words in the documents are features and they are readily available, whereas the topic is what is predicted. However, in some domains obtaining features may be resource-intensive because of which not all features may be available. An example is that of protein-protein interaction prediction, where not only are the labels ('interacting' or 'non-interacting' unavailable, but so are some of the features. It may be possible to obtain at least some of the missing features by carrying out a few experiments as permitted by the available resources. If only a few experiments can be carried out to acquire missing features, which proteins should be studied and which features of those proteins should be determined? From the perspective of machine learning for PPI prediction, it would be desirable that those features be acquired which when used in training the classifier, the accuracy of the classifier is improved the most. That is, the utility of the feature-acquisition is measured in terms of how much acquired features contribute to improving the accuracy of the classifier. Active feature acquisition (AFA is a strategy to preselect such instance-feature combinations (i.e. protein and experiment combinations for maximum utility. The goal of AFA is the creation of optimal training set that would result in the best classifier, and not in determining the best classification model itself. Results We present a heuristic method for active feature acquisition to calculate the utility of acquiring a missing feature. This heuristic takes into account the change in

  12. Classification with support hyperplanes

    NARCIS (Netherlands)

    G.I. Nalbantov (Georgi); J.C. Bioch (Cor); P.J.F. Groenen (Patrick)

    2006-01-01

    textabstractA new classification method is proposed, called Support Hy- perplanes (SHs). To solve the binary classification task, SHs consider the set of all hyperplanes that do not make classification mistakes, referred to as semi-consistent hyperplanes. A test object is classified using

  13. Classification of Flotation Frothers

    Directory of Open Access Journals (Sweden)

    Jan Drzymala

    2018-02-01

    Full Text Available In this paper, a scheme of flotation frothers classification is presented. The scheme first indicates the physical system in which a frother is present and four of them i.e., pure state, aqueous solution, aqueous solution/gas system and aqueous solution/gas/solid system are distinguished. As a result, there are numerous classifications of flotation frothers. The classifications can be organized into a scheme described in detail in this paper. The frother can be present in one of four physical systems, that is pure state, aqueous solution, aqueous solution/gas and aqueous solution/gas/solid system. It results from the paper that a meaningful classification of frothers relies on choosing the physical system and next feature, trend, parameter or parameters according to which the classification is performed. The proposed classification can play a useful role in characterizing and evaluation of flotation frothers.

  14. AN ANALYSIS OF PROTEIN AND CALORIE CONSUMPTION IN CENTRAL JAVA

    Directory of Open Access Journals (Sweden)

    Agus Widarjono

    2014-10-01

    Full Text Available AbstractThis study analyses calorie and protein consumption in Central Java Province. The calorie and protein elasticity are derived from demand elasticity of the ten food groups encompassing cereals, fish, meats, eggs and milk, vegetables, fruits, oil and fats, prepared foods and drinks, other foods and tobacco products. Most of calorie and protein-price elasticity of the ten food group are negative. Consumption of calorie and protein are most responsive to prices of eggs and milk. Protein -price elasticities are less elastic than calorie-price elasticities. Calorie and protein-income elasticity are positive and become lesselastic in moving from lower to higher income households but protein is less responsive to income change than calorie. Keywords: QUAIDS, demand elasticity, nutrient elasticity, Central Java ProvinceJEL Classification numbers: D12, O12AbstrakStudi ini menganlisis konsumsi kalori dan protein di Jawa Tengah. Elastisitas kalori dan protein dihitung dari elastisitas permintaan dari 10 kelompok komoditi makanan yang terdiri dari padi-padian, ikan, daging, telur dan susu, sayur-sayuran, buah-buahan, minyak dan lemak, makanan dan minumun jadi, makanan lainnya dan tembakau dan hasilnya. Sebagian besar elastisitas harga kalori dan protein adalah negatif. Konsumsi kalori dan protein adalah kelompok makanan yang paling responsif terhadap perubahan harga telur dan susu. Elastisitas pendapatan kalori dan protein adalah positif tetapi menjadi lebih elastis untuk rumah tangga berpenghasilan lebih tinggi tetapi protein kurang responsif terhadap perubahan pendapatan daripada kalori.Keywords: QUAIDS, elastisitas permintaan, elastisitas nutrisi, Jawa TengahJEL Classification numbers: D12, O12

  15. DOE LLW classification rationale

    International Nuclear Information System (INIS)

    Flores, A.Y.

    1991-01-01

    This report was about the rationale which the US Department of Energy had with low-level radioactive waste (LLW) classification. It is based on the Nuclear Regulatory Commission's classification system. DOE site operators met to review the qualifications and characteristics of the classification systems. They evaluated performance objectives, developed waste classification tables, and compiled dose limits on the waste. A goal of the LLW classification system was to allow each disposal site the freedom to develop limits to radionuclide inventories and concentrations according to its own site-specific characteristics. This goal was achieved with the adoption of a performance objectives system based on a performance assessment, with site-specific environmental conditions and engineered disposal systems

  16. Asteroid taxonomic classifications

    International Nuclear Information System (INIS)

    Tholen, D.J.

    1989-01-01

    This paper reports on three taxonomic classification schemes developed and applied to the body of available color and albedo data. Asteroid taxonomic classifications according to two of these schemes are reproduced

  17. Hydroxyl and Hydroperoxy Radical Chemistry during the MCMA-2006 Field Campaign: Measurement and Model Comparison

    Science.gov (United States)

    Dusanter, S.; Vimal, D.; Stevens, P. S.; Volkamer, R.; Molina, L. T.

    2007-12-01

    The Mexico City Metropolitan Area (MCMA) field campaign, held in March 2006, was a unique opportunity to collect data in one of the most polluted megacities in the world. Such environments exhibit a complex oxidation chemistry involving a strong coupling between odd hydrogen radicals (HOX=OH+HO2) and nitrogen oxides species (NOX=NO+NO2). High levels of volatile organic compounds (VOCs) and NOX control the HOX budget and lead to elevated tropospheric ozone formation. The HOX-NOX coupling can be investigated by comparing measured and model-predicted HOx concentrations. Atmospheric HOX concentrations were measured by the Indiana University laser-induced fluorescence instrument and data were collected at the Instituto Mexicano del Petroleo between 14 and 31 March. Measured hydroxyl radical (OH) concentrations are comparable to that measured in less polluted urban environments and suggest that the OH concentrations are highly buffered under high NOX conditions. In contrast, hydroperoxy radical (HO2) concentrations are more sensitive to the NOX levels and are highly variable between different urban sites. Enhanced levels of OH and HO2 radicals were observed on several days between 9h30-11h00 AM and suggest an additional HOX source for the morning hours and/or a fast HOX cycling under the high NOX conditions of the MCMA. A preliminary investigation of the HOX chemistry occurring in the MCMA urban atmosphere was performed using a photochemical box model based on the Regional Atmospheric Chemistry Mechanism (RACM). Model comparisons will be presented and the agreement between measured and predicted HOX concentrations will be discussed.

  18. The sequence, structure and evolutionary features of HOTAIR in mammals

    Science.gov (United States)

    2011-01-01

    Background An increasing number of long noncoding RNAs (lncRNAs) have been identified recently. Different from all the others that function in cis to regulate local gene expression, the newly identified HOTAIR is located between HoxC11 and HoxC12 in the human genome and regulates HoxD expression in multiple tissues. Like the well-characterised lncRNA Xist, HOTAIR binds to polycomb proteins to methylate histones at multiple HoxD loci, but unlike Xist, many details of its structure and function, as well as the trans regulation, remain unclear. Moreover, HOTAIR is involved in the aberrant regulation of gene expression in cancer. Results To identify conserved domains in HOTAIR and study the phylogenetic distribution of this lncRNA, we searched the genomes of 10 mammalian and 3 non-mammalian vertebrates for matches to its 6 exons and the two conserved domains within the 1800 bp exon6 using Infernal. There was just one high-scoring hit for each mammal, but many low-scoring hits were found in both mammals and non-mammalian vertebrates. These hits and their flanking genes in four placental mammals and platypus were examined to determine whether HOTAIR contained elements shared by other lncRNAs. Several of the hits were within unknown transcripts or ncRNAs, many were within introns of, or antisense to, protein-coding genes, and conservation of the flanking genes was observed only between human and chimpanzee. Phylogenetic analysis revealed discrete evolutionary dynamics for orthologous sequences of HOTAIR exons. Exon1 at the 5' end and a domain in exon6 near the 3' end, which contain domains that bind to multiple proteins, have evolved faster in primates than in other mammals. Structures were predicted for exon1, two domains of exon6 and the full HOTAIR sequence. The sequence and structure of two fragments, in exon1 and the domain B of exon6 respectively, were identified to robustly occur in predicted structures of exon1, domain B of exon6 and the full HOTAIR in mammals

  19. Non-uniform tube representation of proteins

    DEFF Research Database (Denmark)

    Hansen, Mikael Sonne

    Treating the full protein structure is often neither computationally nor physically possible. Instead one is forced to consider various reduced models capturing the properties of interest. Previous work have used tubular neighborhoods of the C-alpha backbone. However, assigning a unique radius...... might not correctly capture volume exclusion - of crucial importance when trying to understand a proteins $3$d-structure. We propose a new reduced model treating the protein as a non-uniform tube with a radius reflecting the positions of atoms. The tube representation is well suited considering X......-ray crystallographic resolution ~ 3Å while a varying radius accounts for the different sizes of side chains. Such a non-uniform tube better capture the protein geometry and has numerous applications in structural/computational biology from the classification of protein structures to sequence-structure prediction....

  20. PANDORA: keyword-based analysis of protein sets by integration of annotation sources.

    Science.gov (United States)

    Kaplan, Noam; Vaaknin, Avishay; Linial, Michal

    2003-10-01

    Recent advances in high-throughput methods and the application of computational tools for automatic classification of proteins have made it possible to carry out large-scale proteomic analyses. Biological analysis and interpretation of sets of proteins is a time-consuming undertaking carried out manually by experts. We have developed PANDORA (Protein ANnotation Diagram ORiented Analysis), a web-based tool that provides an automatic representation of the biological knowledge associated with any set of proteins. PANDORA uses a unique approach of keyword-based graphical analysis that focuses on detecting subsets of proteins that share unique biological properties and the intersections of such sets. PANDORA currently supports SwissProt keywords, NCBI Taxonomy, InterPro entries and the hierarchical classification terms from ENZYME, SCOP and GO databases. The integrated study of several annotation sources simultaneously allows a representation of biological relations of structure, function, cellular location, taxonomy, domains and motifs. PANDORA is also integrated into the ProtoNet system, thus allowing testing thousands of automatically generated clusters. We illustrate how PANDORA enhances the biological understanding of large, non-uniform sets of proteins originating from experimental and computational sources, without the need for prior biological knowledge on individual proteins.

  1. Determining effects of non-synonymous SNPs on protein-protein interactions using supervised and semi-supervised learning.

    Directory of Open Access Journals (Sweden)

    Nan Zhao

    2014-05-01

    Full Text Available Single nucleotide polymorphisms (SNPs are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs have been found near or inside the protein-protein interaction (PPI interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor. Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1 a 2-class problem (strengthening/weakening PPI mutations, (2 another 2-class problem (mutations that disrupt/preserve a PPI, and (3 a 3-class classification (detrimental/neutral/beneficial mutation effects. In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the

  2. Determining Effects of Non-synonymous SNPs on Protein-Protein Interactions using Supervised and Semi-supervised Learning

    Science.gov (United States)

    Zhao, Nan; Han, Jing Ginger; Shyu, Chi-Ren; Korkin, Dmitry

    2014-01-01

    Single nucleotide polymorphisms (SNPs) are among the most common types of genetic variation in complex genetic disorders. A growing number of studies link the functional role of SNPs with the networks and pathways mediated by the disease-associated genes. For example, many non-synonymous missense SNPs (nsSNPs) have been found near or inside the protein-protein interaction (PPI) interfaces. Determining whether such nsSNP will disrupt or preserve a PPI is a challenging task to address, both experimentally and computationally. Here, we present this task as three related classification problems, and develop a new computational method, called the SNP-IN tool (non-synonymous SNP INteraction effect predictor). Our method predicts the effects of nsSNPs on PPIs, given the interaction's structure. It leverages supervised and semi-supervised feature-based classifiers, including our new Random Forest self-learning protocol. The classifiers are trained based on a dataset of comprehensive mutagenesis studies for 151 PPI complexes, with experimentally determined binding affinities of the mutant and wild-type interactions. Three classification problems were considered: (1) a 2-class problem (strengthening/weakening PPI mutations), (2) another 2-class problem (mutations that disrupt/preserve a PPI), and (3) a 3-class classification (detrimental/neutral/beneficial mutation effects). In total, 11 different supervised and semi-supervised classifiers were trained and assessed resulting in a promising performance, with the weighted f-measure ranging from 0.87 for Problem 1 to 0.70 for the most challenging Problem 3. By integrating prediction results of the 2-class classifiers into the 3-class classifier, we further improved its performance for Problem 3. To demonstrate the utility of SNP-IN tool, it was applied to study the nsSNP-induced rewiring of two disease-centered networks. The accurate and balanced performance of SNP-IN tool makes it readily available to study the rewiring of

  3. Probing Enzyme-Surface Interactions via Protein Engineering and Single-Molecule Techniques

    Science.gov (United States)

    2017-06-26

    SECURITY CLASSIFICATION OF: The overall objective of this research was to exploit protein engineering and fluorescence single-molecule methods to...enhance our understanding of the interaction of proteins and surfaces. Given this objective, the specific aims of this research were to: 1) exploit the...incorporation of unnatural amino acids in proteins to introduce single-molecule probes (i.e., fluorophores for fluorescence resonance energy transfer

  4. Modified Marsh Classification of the Duodenal Biopsies of a Large Database Covering 10 Years

    Directory of Open Access Journals (Sweden)

    Cansu Abayli

    2014-02-01

    Full Text Available Purpose: Celiac is an autoimmune disease caused by of gluten proteins which can be found in multi-grain food like wheat, barley and oat. The disease affects more than 1% of population and characterized by intestinal inflammation. In celiac disease, mucosal damage is a dynamic process. It is shown that it has autoimmune components. It is also T-Cell mediated and can be categorised as a chronic inflammatory disease. The purpose of this study is to make modified Marsh classification of the duodenal biopsies that came to our department in the 10 years. The study deals with reassessment of all events and uncovering the low graded events that were not diagnosed. Material and Methods: 467 biopsies (diagnosed between 2001 and 2011 at the Cukurova University, Faculty of Medicine, Department of Pathology were taken and analyzed by two pathologists. Each sample was reevaluated without taking the previous reports into consideration and scored by using modified Marsh classification. Results: According to Modified Marsh Classification total of 48 cases were diagnosed as Type 1. Total of 6 cases according to Modified Marsh Classification was diagnosed as Type 2. Total of 11 cases according to Modified Marsh Classification was diagnosed as Type 3a. Total of 5 cases, according to Modified Marsh Classification, was diagnosed as Type 3b. Total of 6 cases according to Modified Marsh Classification was diagnosed as Type 3c. Conclusion: As a result of this study, it has been found that Modified Marsh Classification is a very important standardization tool for detection of suspicious duodenal biopsies and for early case examinations.

  5. 32 CFR 2001.15 - Classification guides.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 6 2010-07-01 2010-07-01 false Classification guides. 2001.15 Section 2001.15..., NATIONAL ARCHIVES AND RECORDS ADMINISTRATION CLASSIFIED NATIONAL SECURITY INFORMATION Classification § 2001.15 Classification guides. (a) Preparation of classification guides. Originators of classification...

  6. FACET CLASSIFICATIONS OF E-LEARNING TOOLS

    Directory of Open Access Journals (Sweden)

    Olena Yu. Balalaieva

    2013-12-01

    Full Text Available The article deals with the classification of e-learning tools based on the facet method, which suggests the separation of the parallel set of objects into independent classification groups; at the same time it is not assumed rigid classification structure and pre-built finite groups classification groups are formed by a combination of values taken from the relevant facets. An attempt to systematize the existing classification of e-learning tools from the standpoint of classification theory is made for the first time. Modern Ukrainian and foreign facet classifications of e-learning tools are described; their positive and negative features compared to classifications based on a hierarchical method are analyzed. The original author's facet classification of e-learning tools is proposed.

  7. The 2017 international classification of the Ehlers-Danlos syndromes

    DEFF Research Database (Denmark)

    Malfait, Fransiska; Francomano, Clair; Byers, Peter H

    2017-01-01

    The Ehlers-Danlos syndromes (EDS) are a clinically and genetically heterogeneous group of heritable connective tissue disorders (HCTDs) characterized by joint hypermobility, skin hyperextensibility, and tissue fragility. Over the past two decades, the Villefranche Nosology, which delineated six s...... that the revised International EDS Classification will serve as a new standard for the diagnosis of EDS and will provide a framework for future research purposes. © 2017 Wiley Periodicals, Inc......., and mutations have been identified in an array of novel genes. The International EDS Consortium proposes a revised EDS classification, which recognizes 13 subtypes. For each of the subtypes, we propose a set of clinical criteria that are suggestive for the diagnosis. However, in view of the vast genetic...... revised the clinical criteria for hypermobile EDS in order to allow for a better distinction from other joint hypermobility disorders. To satisfy research needs, we also propose a pathogenetic scheme, that regroups EDS subtypes for which the causative proteins function within the same pathway. We hope...

  8. Maxillectomy defects: a suggested classification scheme.

    Science.gov (United States)

    Akinmoladun, V I; Dosumu, O O; Olusanya, A A; Ikusika, O F

    2013-06-01

    The term "maxillectomy" has been used to describe a variety of surgical procedures for a spectrum of diseases involving a diverse anatomical site. Hence, classifications of maxillectomy defects have often made communication difficult. This article highlights this problem, emphasises the need for a uniform system of classification and suggests a classification system which is simple and comprehensive. Articles related to this subject, especially those with specified classifications of maxillary surgical defects were sourced from the internet through Google, Scopus and PubMed using the search terms maxillectomy defects classification. A manual search through available literature was also done. The review of the materials revealed many classifications and modifications of classifications from the descriptive, reconstructive and prosthodontic perspectives. No globally acceptable classification exists among practitioners involved in the management of diseases in the mid-facial region. There were over 14 classifications of maxillary defects found in the English literature. Attempts made to address the inadequacies of previous classifications have tended to result in cumbersome and relatively complex classifications. A single classification that is based on both surgical and prosthetic considerations is most desirable and is hereby proposed.

  9. Recognition of functional sites in protein structures.

    Science.gov (United States)

    Shulman-Peleg, Alexandra; Nussinov, Ruth; Wolfson, Haim J

    2004-06-04

    Recognition of regions on the surface of one protein, that are similar to a binding site of another is crucial for the prediction of molecular interactions and for functional classifications. We first describe a novel method, SiteEngine, that assumes no sequence or fold similarities and is able to recognize proteins that have similar binding sites and may perform similar functions. We achieve high efficiency and speed by introducing a low-resolution surface representation via chemically important surface points, by hashing triangles of physico-chemical properties and by application of hierarchical scoring schemes for a thorough exploration of global and local similarities. We proceed to rigorously apply this method to functional site recognition in three possible ways: first, we search a given functional site on a large set of complete protein structures. Second, a potential functional site on a protein of interest is compared with known binding sites, to recognize similar features. Third, a complete protein structure is searched for the presence of an a priori unknown functional site, similar to known sites. Our method is robust and efficient enough to allow computationally demanding applications such as the first and the third. From the biological standpoint, the first application may identify secondary binding sites of drugs that may lead to side-effects. The third application finds new potential sites on the protein that may provide targets for drug design. Each of the three applications may aid in assigning a function and in classification of binding patterns. We highlight the advantages and disadvantages of each type of search, provide examples of large-scale searches of the entire Protein Data Base and make functional predictions.

  10. Constructing criticality by classification

    DEFF Research Database (Denmark)

    Machacek, Erika

    2017-01-01

    " in the bureaucratic practice of classification: Experts construct material criticality in assessments as they allot information on the materials to the parameters of the assessment framework. In so doing, they ascribe a new set of connotations to the materials, namely supply risk, and their importance to clean energy......, legitimizing a criticality discourse.Specifically, the paper introduces a typology delineating the inferences made by the experts from their produced recommendations in the classification of rare earth element criticality. The paper argues that the classification is a specific process of constructing risk....... It proposes that the expert bureaucratic practice of classification legitimizes (i) the valorisation that was made in the drafting of the assessment framework for the classification, and (ii) political operationalization when enacted that might have (non-)distributive implications for the allocation of public...

  11. Origin and Diversification of Basic-Helix-Loop-Helix Proteins in Plants

    OpenAIRE

    Pires, Nuno; Dolan, Liam

    2009-01-01

    Basic helix-loop-helix (bHLH) proteins are a class of transcription factors found throughout eukaryotic organisms. Classification of the complete sets of bHLH proteins in the sequenced genomes of Arabidopsis thaliana and Oryza sativa (rice) has defined the diversity of these proteins among flowering plants. However, the evolutionary relationships of different plant bHLH groups and the diversity of bHLH proteins in more ancestral groups of plants are currently unknown. In this study, we use wh...

  12. 12 CFR 403.4 - Derivative classification.

    Science.gov (United States)

    2010-01-01

    ... SAFEGUARDING OF NATIONAL SECURITY INFORMATION § 403.4 Derivative classification. (a) Use of derivative classification. (1) Unlike original classification which is an initial determination, derivative classification... 12 Banks and Banking 4 2010-01-01 2010-01-01 false Derivative classification. 403.4 Section 403.4...

  13. Present perspectives on the automated classification of the G-protein coupled receptors (GPCRs) at the protein sequence level

    DEFF Research Database (Denmark)

    Davies, Matthew N; Gloriam, David E; Secker, Andrew

    2011-01-01

    The G-protein coupled receptors--or GPCRs--comprise simultaneously one of the largest and one of the most multi-functional protein families known to modern-day molecular bioscience. From a drug discovery and pharmaceutical industry perspective, the GPCRs constitute one of the most commercially an...

  14. Supernova Photometric Lightcurve Classification

    Science.gov (United States)

    Zaidi, Tayeb; Narayan, Gautham

    2016-01-01

    This is a preliminary report on photometric supernova classification. We first explore the properties of supernova light curves, and attempt to restructure the unevenly sampled and sparse data from assorted datasets to allow for processing and classification. The data was primarily drawn from the Dark Energy Survey (DES) simulated data, created for the Supernova Photometric Classification Challenge. This poster shows a method for producing a non-parametric representation of the light curve data, and applying a Random Forest classifier algorithm to distinguish between supernovae types. We examine the impact of Principal Component Analysis to reduce the dimensionality of the dataset, for future classification work. The classification code will be used in a stage of the ANTARES pipeline, created for use on the Large Synoptic Survey Telescope alert data and other wide-field surveys. The final figure-of-merit for the DES data in the r band was 60% for binary classification (Type I vs II).Zaidi was supported by the NOAO/KPNO Research Experiences for Undergraduates (REU) Program which is funded by the National Science Foundation Research Experiences for Undergraduates Program (AST-1262829).

  15. Project implementation : classification of organic soils and classification of marls - training of INDOT personnel.

    Science.gov (United States)

    2012-09-01

    This is an implementation project for the research completed as part of the following projects: SPR3005 Classification of Organic Soils : and SPR3227 Classification of Marl Soils. The methods developed for the classification of both soi...

  16. 45 CFR 601.5 - Derivative classification.

    Science.gov (United States)

    2010-10-01

    ... CLASSIFICATION AND DECLASSIFICATION OF NATIONAL SECURITY INFORMATION § 601.5 Derivative classification. Distinct... 45 Public Welfare 3 2010-10-01 2010-10-01 false Derivative classification. 601.5 Section 601.5... classification guide, need not possess original classification authority. (a) If a person who applies derivative...

  17. Learning features for tissue classification with the classification restricted Boltzmann machine

    DEFF Research Database (Denmark)

    van Tulder, Gijs; de Bruijne, Marleen

    2014-01-01

    Performance of automated tissue classification in medical imaging depends on the choice of descriptive features. In this paper, we show how restricted Boltzmann machines (RBMs) can be used to learn features that are especially suited for texture-based tissue classification. We introduce the convo...... outperform conventional RBM-based feature learning, which is unsupervised and uses only a generative learning objective, as well as often-used filter banks. We show that a mixture of generative and discriminative learning can produce filters that give a higher classification accuracy....

  18. SVM-Fold: a tool for discriminative multi-class protein fold and superfamily recognition.

    Science.gov (United States)

    Melvin, Iain; Ie, Eugene; Kuang, Rui; Weston, Jason; Stafford, William Noble; Leslie, Christina

    2007-05-22

    Predicting a protein's structural class from its amino acid sequence is a fundamental problem in computational biology. Much recent work has focused on developing new representations for protein sequences, called string kernels, for use with support vector machine (SVM) classifiers. However, while some of these approaches exhibit state-of-the-art performance at the binary protein classification problem, i.e. discriminating between a particular protein class and all other classes, few of these studies have addressed the real problem of multi-class superfamily or fold recognition. Moreover, there are only limited software tools and systems for SVM-based protein classification available to the bioinformatics community. We present a new multi-class SVM-based protein fold and superfamily recognition system and web server called SVM-Fold, which can be found at http://svm-fold.c2b2.columbia.edu. Our system uses an efficient implementation of a state-of-the-art string kernel for sequence profiles, called the profile kernel, where the underlying feature representation is a histogram of inexact matching k-mer frequencies. We also employ a novel machine learning approach to solve the difficult multi-class problem of classifying a sequence of amino acids into one of many known protein structural classes. Binary one-vs-the-rest SVM classifiers that are trained to recognize individual structural classes yield prediction scores that are not comparable, so that standard "one-vs-all" classification fails to perform well. Moreover, SVMs for classes at different levels of the protein structural hierarchy may make useful predictions, but one-vs-all does not try to combine these multiple predictions. To deal with these problems, our method learns relative weights between one-vs-the-rest classifiers and encodes information about the protein structural hierarchy for multi-class prediction. In large-scale benchmark results based on the SCOP database, our code weighting approach

  19. Topological properties of complex networks in protein structures

    Science.gov (United States)

    Kim, Kyungsik; Jung, Jae-Won; Min, Seungsik

    2014-03-01

    We study topological properties of networks in structural classification of proteins. We model the native-state protein structure as a network made of its constituent amino-acids and their interactions. We treat four structural classes of proteins composed predominantly of α helices and β sheets and consider several proteins from each of these classes whose sizes range from amino acids of the Protein Data Bank. Particularly, we simulate and analyze the network metrics such as the mean degree, the probability distribution of degree, the clustering coefficient, the characteristic path length, the local efficiency, and the cost. This work was supported by the KMAR and DP under Grant WISE project (153-3100-3133-302-350).

  20. Highly multiplexed and quantitative cell-surface protein profiling using genetically barcoded antibodies.

    Science.gov (United States)

    Pollock, Samuel B; Hu, Amy; Mou, Yun; Martinko, Alexander J; Julien, Olivier; Hornsby, Michael; Ploder, Lynda; Adams, Jarrett J; Geng, Huimin; Müschen, Markus; Sidhu, Sachdev S; Moffat, Jason; Wells, James A

    2018-03-13

    Human cells express thousands of different surface proteins that can be used for cell classification, or to distinguish healthy and disease conditions. A method capable of profiling a substantial fraction of the surface proteome simultaneously and inexpensively would enable more accurate and complete classification of cell states. We present a highly multiplexed and quantitative surface proteomic method using genetically barcoded antibodies called phage-antibody next-generation sequencing (PhaNGS). Using 144 preselected antibodies displayed on filamentous phage (Fab-phage) against 44 receptor targets, we assess changes in B cell surface proteins after the development of drug resistance in a patient with acute lymphoblastic leukemia (ALL) and in adaptation to oncogene expression in a Myc-inducible Burkitt lymphoma model. We further show PhaNGS can be applied at the single-cell level. Our results reveal that a common set of proteins including FLT3, NCR3LG1, and ROR1 dominate the response to similar oncogenic perturbations in B cells. Linking high-affinity, selective, genetically encoded binders to NGS enables direct and highly multiplexed protein detection, comparable to RNA-sequencing for mRNA. PhaNGS has the potential to profile a substantial fraction of the surface proteome simultaneously and inexpensively to enable more accurate and complete classification of cell states. Copyright © 2018 the Author(s). Published by PNAS.

  1. Classification of smooth Fano polytopes

    DEFF Research Database (Denmark)

    Øbro, Mikkel

    A simplicial lattice polytope containing the origin in the interior is called a smooth Fano polytope, if the vertices of every facet is a basis of the lattice. The study of smooth Fano polytopes is motivated by their connection to toric varieties. The thesis concerns the classification of smooth...... Fano polytopes up to isomorphism. A smooth Fano -polytope can have at most vertices. In case of vertices an explicit classification is known. The thesis contains the classification in case of vertices. Classifications of smooth Fano -polytopes for fixed exist only for . In the thesis an algorithm...... for the classification of smooth Fano -polytopes for any given is presented. The algorithm has been implemented and used to obtain the complete classification for ....

  2. Maximum mutual information regularized classification

    KAUST Repository

    Wang, Jim Jing-Yan

    2014-09-07

    In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descend method in an iterative algorithm. Experiments on two real world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.

  3. Maximum mutual information regularized classification

    KAUST Repository

    Wang, Jim Jing-Yan; Wang, Yi; Zhao, Shiguang; Gao, Xin

    2014-01-01

    In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descend method in an iterative algorithm. Experiments on two real world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.

  4. Engineering Synechocystis PCC6803 for hydrogen production: influence on the tolerance to oxidative and sugar stresses.

    Directory of Open Access Journals (Sweden)

    Marcia Ortega-Ramos

    Full Text Available In the prospect of engineering cyanobacteria for the biological photoproduction of hydrogen, we have studied the hydrogen production machine in the model unicellular strain Synechocystis PCC6803 through gene deletion, and overexpression (constitutive or controlled by the growth temperature. We demonstrate that the hydrogenase-encoding hoxEFUYH operon is dispensable to standard photoautotrophic growth in absence of stress, and it operates in cell defense against oxidative (H₂O₂ and sugar (glucose and glycerol stresses. Furthermore, we showed that the simultaneous over-production of the proteins HoxEFUYH and HypABCDE (assembly of hydrogenase, combined to an increase in nickel availability, led to an approximately 20-fold increase in the level of active hydrogenase. These novel results and mutants have major implications for those interested in hydrogenase, hydrogen production and redox metabolism, and their connections with environmental conditions.

  5. Proteome scale identification, classification and structural analysis of iron-binding proteins in bread wheat.

    Science.gov (United States)

    Verma, Shailender Kumar; Sharma, Ankita; Sandhu, Padmani; Choudhary, Neha; Sharma, Shailaja; Acharya, Vishal; Akhter, Yusuf

    2017-05-01

    Bread wheat is one of the major staple foods of worldwide population and iron plays a significant role in growth and development of the plant. In this report, we are presenting the genome wide identification of iron-binding proteins in bread wheat. The wheat genome derived putative proteome was screened for identification of iron-binding sequence motifs. Out of 602 putative iron-binding proteins, 130 were able to produce reliable structural models by homology techniques and further analyzed for the presence of iron-binding structural motifs. The computationally identified proteins appear to bind to ferrous and ferric ions and showed diverse coordination geometries. Glu, His, Asp and Cys amino acid residues were found to be mostly involved in iron binding. We have classified these proteins on the basis of their localization in the different cellular compartments. The identified proteins were further classified into their protein folds, families and functional classes ranging from structure maintenance of cellular components, regulation of gene expression, post translational modification, membrane proteins, enzymes, signaling and storage proteins. This comprehensive report regarding structural iron binding proteome provides useful insights into the diversity of iron binding proteins of wheat plants and further utilized to study their roles in plant growth, development and physiology. Copyright © 2017 Elsevier Inc. All rights reserved.

  6. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases...... the accuracy at the same time. The test example is classified using simpler and smaller model. The training examples in a particular cluster share the common vocabulary. At the time of clustering, we do not take into account the labels of the training examples. After the clusters have been created......, the classifier is trained on each cluster having reduced dimensionality and less number of examples. The experimental results show that the proposed model outperforms the existing classification models for the task of suspicious email detection and topic categorization on the Reuters-21578 and 20 Newsgroups...

  7. Classification of Beta-lactamases and penicillin binding proteins using ligand-centric network models.

    Directory of Open Access Journals (Sweden)

    Hakime Öztürk

    Full Text Available β-lactamase mediated antibiotic resistance is an important health issue and the discovery of new β-lactam type antibiotics or β-lactamase inhibitors is an area of intense research. Today, there are about a thousand β-lactamases due to the evolutionary pressure exerted by these ligands. While β-lactamases hydrolyse the β-lactam ring of antibiotics, rendering them ineffective, Penicillin-Binding Proteins (PBPs, which share high structural similarity with β-lactamases, also confer antibiotic resistance to their host organism by acquiring mutations that allow them to continue their participation in cell wall biosynthesis. In this paper, we propose a novel approach to include ligand sharing information for classifying and clustering β-lactamases and PBPs in an effort to elucidate the ligand induced evolution of these β-lactam binding proteins. We first present a detailed summary of the β-lactamase and PBP families in the Protein Data Bank, as well as the compounds they bind to. Then, we build two different types of networks in which the proteins are represented as nodes, and two proteins are connected by an edge with a weight that depends on the number of shared identical or similar ligands. These models are analyzed under three different edge weight settings, namely unweighted, weighted, and normalized weighted. A detailed comparison of these six networks showed that the use of ligand sharing information to cluster proteins resulted in modules comprising proteins with not only sequence similarity but also functional similarity. Consideration of ligand similarity highlighted some interactions that were not detected in the identical ligand network. Analysing the β-lactamases and PBPs using ligand-centric network models enabled the identification of novel relationships, suggesting that these models can be used to examine other protein families to obtain information on their ligand induced evolutionary paths.

  8. Ontologies vs. Classification Systems

    DEFF Research Database (Denmark)

    Madsen, Bodil Nistrup; Erdman Thomsen, Hanne

    2009-01-01

    What is an ontology compared to a classification system? Is a taxonomy a kind of classification system or a kind of ontology? These are questions that we meet when working with people from industry and public authorities, who need methods and tools for concept clarification, for developing meta...... data sets or for obtaining advanced search facilities. In this paper we will present an attempt at answering these questions. We will give a presentation of various types of ontologies and briefly introduce terminological ontologies. Furthermore we will argue that classification systems, e.g. product...... classification systems and meta data taxonomies, should be based on ontologies....

  9. The book classification of William Torrey Harris: influences of Bacon and Hegel in library classification

    Directory of Open Access Journals (Sweden)

    Rodrigo de Sales

    2017-09-01

    Full Text Available The studies of library classification generally interact with the historical contextualization approach and with the classification ideas typical of Philosophy. In the 19th century, the North-American philosopher and educator William Torrey Harris developed a book classification at the St. Louis Public School, based on Francis Bacon and Georg Wilhelm Friedrich Hegel. The objective of this essay is to analyze Harris’s classification, reflecting upon his theoretical and philosophical backgrounds. To achieve such objective, this essay adopts a critical-descriptive approach for analysis. Results show some influences of Bacon and Hegel in Harris’s classification.

  10. Zinc-finger proteins in health and disease.

    Science.gov (United States)

    Cassandri, Matteo; Smirnov, Artem; Novelli, Flavia; Pitolli, Consuelo; Agostini, Massimiliano; Malewicz, Michal; Melino, Gerry; Raschellà, Giuseppe

    2017-01-01

    Zinc-finger proteins (ZNFs) are one of the most abundant groups of proteins and have a wide range of molecular functions. Given the wide variety of zinc-finger domains, ZNFs are able to interact with DNA, RNA, PAR (poly-ADP-ribose) and other proteins. Thus, ZNFs are involved in the regulation of several cellular processes. In fact, ZNFs are implicated in transcriptional regulation, ubiquitin-mediated protein degradation, signal transduction, actin targeting, DNA repair, cell migration, and numerous other processes. The aim of this review is to provide a comprehensive summary of the current state of knowledge of this class of proteins. Firstly, we describe the actual classification of ZNFs, their structure and functions. Secondly, we focus on the biological role of ZNFs in the development of organisms under normal physiological and pathological conditions.

  11. Reliability of Oronasal Fistula Classification.

    Science.gov (United States)

    Sitzman, Thomas J; Allori, Alexander C; Matic, Damir B; Beals, Stephen P; Fisher, David M; Samson, Thomas D; Marcus, Jeffrey R; Tse, Raymond W

    2018-01-01

    Objective Oronasal fistula is an important complication of cleft palate repair that is frequently used to evaluate surgical quality, yet reliability of fistula classification has never been examined. The objective of this study was to determine the reliability of oronasal fistula classification both within individual surgeons and between multiple surgeons. Design Using intraoral photographs of children with repaired cleft palate, surgeons rated the location of palatal fistulae using the Pittsburgh Fistula Classification System. Intrarater and interrater reliability scores were calculated for each region of the palate. Participants Eight cleft surgeons rated photographs obtained from 29 children. Results Within individual surgeons reliability for each region of the Pittsburgh classification ranged from moderate to almost perfect (κ = .60-.96). By contrast, reliability between surgeons was lower, ranging from fair to substantial (κ = .23-.70). Between-surgeon reliability was lowest for the junction of the soft and hard palates (κ = .23). Within-surgeon and between-surgeon reliability were almost perfect for the more general classification of fistula in the secondary palate (κ = .95 and κ = .83, respectively). Conclusions This is the first reliability study of fistula classification. We show that the Pittsburgh Fistula Classification System is reliable when used by an individual surgeon, but less reliable when used among multiple surgeons. Comparisons of fistula occurrence among surgeons may be subject to less bias if they use the more general classification of "presence or absence of fistula of the secondary palate" rather than the Pittsburgh Fistula Classification System.

  12. 5 CFR 1312.7 - Derivative classification.

    Science.gov (United States)

    2010-01-01

    ..., DOWNGRADING, DECLASSIFICATION AND SAFEGUARDING OF NATIONAL SECURITY INFORMATION Classification and Declassification of National Security Information § 1312.7 Derivative classification. A derivative classification... 5 Administrative Personnel 3 2010-01-01 2010-01-01 false Derivative classification. 1312.7 Section...

  13. Proteins in olive fruit and oil.

    Science.gov (United States)

    Montealegre, Cristina; Esteve, Clara; García, Maria Concepción; García-Ruiz, Carmen; Marina, Maria Luisa

    2014-01-01

    This paper is a comprehensive review grouping the information on the extraction, characterization, and quantitation of olive and olive oil proteins and providing a practical guide about these proteins. Most characterized olive proteins are located in the fruit, mainly in the seed, where different oleosins and storage proteins have been found. Unlike the seed, the olive pulp contains a lower protein content having been described a polypeptide of 4.6 kDa and a thaumain-like protein. Other important proteins studied in olive fruits have been enzymes which could play important roles in olives characteristics. Part of these proteins is transferred from the fruit to the oil during the manufacturing process of olive oil. In fact, the same polypeptide of 4.6 kDa found in the pulp has been described in the olive oil and, additionally, the presence of other proteins and enzymes have also been described. Protein profiles have recently been proposed as an interesting strategy for the varietal classification of olive fruits and oils. Nevertheless, there is still a lot of knowledge without being explored requiring new studies focused on the determination and characterization of these proteins.

  14. Munitions Classification Library

    Science.gov (United States)

    2016-04-04

    members of the community to make their own additions to any, or all, of the classification libraries . The next phase entailed data collection over less......Include area code) 04/04/2016 Final Report August 2014 - August 2015 MUNITIONS CLASSIFICATION LIBRARY Mr. Craig Murray, Parsons Dr. Thomas H. Bell, Leidos

  15. Protein Subcellular Localization with Gaussian Kernel Discriminant Analysis and Its Kernel Parameter Selection.

    Science.gov (United States)

    Wang, Shunfang; Nie, Bing; Yue, Kun; Fei, Yu; Li, Wenjia; Xu, Dongshu

    2017-12-15

    Kernel discriminant analysis (KDA) is a dimension reduction and classification algorithm based on nonlinear kernel trick, which can be novelly used to treat high-dimensional and complex biological data before undergoing classification processes such as protein subcellular localization. Kernel parameters make a great impact on the performance of the KDA model. Specifically, for KDA with the popular Gaussian kernel, to select the scale parameter is still a challenging problem. Thus, this paper introduces the KDA method and proposes a new method for Gaussian kernel parameter selection depending on the fact that the differences between reconstruction errors of edge normal samples and those of interior normal samples should be maximized for certain suitable kernel parameters. Experiments with various standard data sets of protein subcellular localization show that the overall accuracy of protein classification prediction with KDA is much higher than that without KDA. Meanwhile, the kernel parameter of KDA has a great impact on the efficiency, and the proposed method can produce an optimum parameter, which makes the new algorithm not only perform as effectively as the traditional ones, but also reduce the computational time and thus improve efficiency.

  16. IAEA Classification of Uranium Deposits

    International Nuclear Information System (INIS)

    Bruneton, Patrice

    2014-01-01

    Classifications of uranium deposits follow two general approaches, focusing on: • descriptive features such as the geotectonic position, the host rock type, the orebody morphology, …… : « geologic classification »; • or on genetic aspects: « genetic classification »

  17. 32 CFR 2400.15 - Classification guides.

    Science.gov (United States)

    2010-07-01

    ... REGULATIONS TO IMPLEMENT E.O. 12356; OFFICE OF SCIENCE AND TECHNOLOGY POLICY INFORMATION SECURITY PROGRAM Derivative Classification § 2400.15 Classification guides. (a) OSTP shall issue and maintain classification guides to facilitate the proper and uniform derivative classification of information. These guides shall...

  18. New guidelines for dam safety classification

    International Nuclear Information System (INIS)

    Dascal, O.

    1999-01-01

    Elements are outlined of recommended new guidelines for safety classification of dams. Arguments are provided for the view that dam classification systems should require more than one system as follows: (a) classification for selection of design criteria, operation procedures and emergency measures plans, based on potential consequences of a dam failure - the hazard classification of water retaining structures; (b) classification for establishment of surveillance activities and for safety evaluation of dams, based on the probability and consequences of failure - the risk classification of water retaining structures; and (c) classification for establishment of water management plans, for safety evaluation of the entire project, for preparation of emergency measures plans, for definition of the frequency and extent of maintenance operations, and for evaluation of changes and modifications required - the hazard classification of the project. The hazard classification of the dam considers, as consequence, mainly the loss of lives or persons in jeopardy and the property damages to third parties. Difficulties in determining the risk classification of the dam lie in the fact that no tool exists to evaluate the probability of the dam's failure. To overcome this, the probability of failure can be substituted for by a set of dam characteristics that express the failure potential of the dam and its foundation. The hazard classification of the entire project is based on the probable consequences of dam failure influencing: loss of life, persons in jeopardy, property and environmental damage. The classification scheme is illustrated for dam threatening events such as earthquakes and floods. 17 refs., 5 tabs

  19. 7 CFR 28.911 - Review classification.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Review classification. 28.911 Section 28.911... REGULATIONS COTTON CLASSING, TESTING, AND STANDARDS Cotton Classification and Market News Service for Producers Classification § 28.911 Review classification. (a) A producer may request one review...

  20. Packet Classification by Multilevel Cutting of the Classification Space: An Algorithmic-Architectural Solution for IP Packet Classification in Next Generation Networks

    Directory of Open Access Journals (Sweden)

    Motasem Aldiab

    2008-01-01

    Full Text Available Traditionally, the Internet provides only a “best-effort” service, treating all packets going to the same destination equally. However, providing differentiated services for different users based on their quality requirements is increasingly becoming a demanding issue. For this, routers need to have the capability to distinguish and isolate traffic belonging to different flows. This ability to determine the flow each packet belongs to is called packet classification. Technology vendors are reluctant to support algorithmic solutions for classification due to their nondeterministic performance. Although content addressable memories (CAMs are favoured by technology vendors due to their deterministic high-lookup rates, they suffer from the problems of high-power consumption and high-silicon cost. This paper provides a new algorithmic-architectural solution for packet classification that mixes CAMs with algorithms based on multilevel cutting of the classification space into smaller spaces. The provided solution utilizes the geometrical distribution of rules in the classification space. It provides the deterministic performance of CAMs, support for dynamic updates, and added flexibility for system designers.

  1. Microsatellite Instability Use in Mismatch Repair Gene Sequence Variant Classification

    Directory of Open Access Journals (Sweden)

    Bryony A. Thompson

    2015-03-01

    Full Text Available Inherited mutations in the DNA mismatch repair genes (MMR can cause MMR deficiency and increased susceptibility to colorectal and endometrial cancer. Microsatellite instability (MSI is the defining molecular signature of MMR deficiency. The clinical classification of identified MMR gene sequence variants has a direct impact on the management of patients and their families. For a significant proportion of cases sequence variants of uncertain clinical significance (also known as unclassified variants are identified, constituting a challenge for genetic counselling and clinical management of families. The effect on protein function of these variants is difficult to interpret. The presence or absence of MSI in tumours can aid in determining the pathogenicity of associated unclassified MMR gene variants. However, there are some considerations that need to be taken into account when using MSI for variant interpretation. The use of MSI and other tumour characteristics in MMR gene sequence variant classification will be explored in this review.

  2. The new WHO 2016 classification of brain tumors-what neurosurgeons need to know.

    Science.gov (United States)

    Banan, Rouzbeh; Hartmann, Christian

    2017-03-01

    The understanding of molecular alterations of tumors has severely changed the concept of classification in all fields of pathology. The availability of high-throughput technologies such as next-generation sequencing allows for a much more precise definition of tumor entities. Also in the field of brain tumors a dramatic increase of knowledge has occurred over the last years partially calling into question the purely morphologically based concepts that were used as exclusive defining criteria in the WHO 2007 classification. Review of the WHO 2016 classification of brain tumors as well as a search and review of publications in the literature relevant for brain tumor classification from 2007 up to now. The idea of incorporating the molecular features in classifying tumors of the central nervous system led the authors of the new WHO 2016 classification to encounter inevitable conceptual problems, particularly with respect to linking morphology to molecular alterations. As a solution they introduced the concept of a "layered diagnosis" to the classification of brain tumors that still allows at a lower level a purely morphologically based diagnosis while partially forcing the incorporation of molecular characteristics for an "integrated diagnosis" at the highest diagnostic level. In this context the broad availability of molecular assays was debated. On the one hand molecular antibodies specifically targeting mutated proteins should be available in nearly all neuropathological laboratories. On the other hand, different high-throughput assays are accessible only in few first-world neuropathological institutions. As examples oligodendrogliomas are now primarily defined by molecular characteristics since the required assays are generally established, whereas molecular grouping of ependymomas, found to clearly outperform morphologically based tumor interpretation, was rejected from inclusion in the WHO 2016 classification because the required assays are currently only

  3. 7 CFR 1794.31 - Classification.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 12 2010-01-01 2010-01-01 false Classification. 1794.31 Section 1794.31 Agriculture... Classification. (a) Electric and telecommunications programs. RUS will normally determine the proper environmental classification of projects based on its evaluation of the project description set forth in the...

  4. 32 CFR 2400.34 - Classification.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 6 2010-07-01 2010-07-01 false Classification. 2400.34 Section 2400.34 National... Government Information § 2400.34 Classification. (a) Foreign government information classified by a foreign government or international organization of governments shall retain its original classification designation...

  5. Seismic texture classification. Final report

    Energy Technology Data Exchange (ETDEWEB)

    Vinther, R.

    1997-12-31

    The seismic texture classification method, is a seismic attribute that can both recognize the general reflectivity styles and locate variations from these. The seismic texture classification performs a statistic analysis for the seismic section (or volume) aiming at describing the reflectivity. Based on a set of reference reflectivities the seismic textures are classified. The result of the seismic texture classification is a display of seismic texture categories showing both the styles of reflectivity from the reference set and interpolations and extrapolations from these. The display is interpreted as statistical variations in the seismic data. The seismic texture classification is applied to seismic sections and volumes from the Danish North Sea representing both horizontal stratifications and salt diapers. The attribute succeeded in recognizing both general structure of successions and variations from these. Also, the seismic texture classification is not only able to display variations in prospective areas (1-7 sec. TWT) but can also be applied to deep seismic sections. The seismic texture classification is tested on a deep reflection seismic section (13-18 sec. TWT) from the Baltic Sea. Applied to this section the seismic texture classification succeeded in locating the Moho, which could not be located using conventional interpretation tools. The seismic texture classification is a seismic attribute which can display general reflectivity styles and deviations from these and enhance variations not found by conventional interpretation tools. (LN)

  6. 28 CFR 345.20 - Position classification.

    Science.gov (United States)

    2010-07-01

    ... 28 Judicial Administration 2 2010-07-01 2010-07-01 false Position classification. 345.20 Section... INDUSTRIES (FPI) INMATE WORK PROGRAMS Position Classification § 345.20 Position classification. (a) Inmate... the objectives and principles of pay classification as a part of the routine orientation of new FPI...

  7. 7 CFR 51.2284 - Size classification.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Size classification. 51.2284 Section 51.2284... Size classification. The following classifications are provided to describe the size of any lot... shall conform to the requirements of the specified classification as defined below: (a) Halves. Lot...

  8. 22 CFR 9.8 - Classification challenges.

    Science.gov (United States)

    2010-04-01

    ... 22 Foreign Relations 1 2010-04-01 2010-04-01 false Classification challenges. 9.8 Section 9.8 Foreign Relations DEPARTMENT OF STATE GENERAL SECURITY INFORMATION REGULATIONS § 9.8 Classification... classification status is improper are expected and encouraged to challenge the classification status of the...

  9. 32 CFR 2001.21 - Original classification.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 6 2010-07-01 2010-07-01 false Original classification. 2001.21 Section 2001.21... Markings § 2001.21 Original classification. (a) Primary markings. At the time of original classification... authority. The name and position, or personal identifier, of the original classification authority shall...

  10. Tissue Classification

    DEFF Research Database (Denmark)

    Van Leemput, Koen; Puonti, Oula

    2015-01-01

    Computational methods for automatically segmenting magnetic resonance images of the brain have seen tremendous advances in recent years. So-called tissue classification techniques, aimed at extracting the three main brain tissue classes (white matter, gray matter, and cerebrospinal fluid), are now...... well established. In their simplest form, these methods classify voxels independently based on their intensity alone, although much more sophisticated models are typically used in practice. This article aims to give an overview of often-used computational techniques for brain tissue classification...

  11. LocateP: Genome-scale subcellular-location predictor for bacterial proteins

    Directory of Open Access Journals (Sweden)

    Zhou Miaomiao

    2008-03-01

    Full Text Available Abstract Background In the past decades, various protein subcellular-location (SCL predictors have been developed. Most of these predictors, like TMHMM 2.0, SignalP 3.0, PrediSi and Phobius, aim at the identification of one or a few SCLs, whereas others such as CELLO and Psortb.v.2.0 aim at a broader classification. Although these tools and pipelines can achieve a high precision in the accurate prediction of signal peptides and transmembrane helices, they have a much lower accuracy when other sequence characteristics are concerned. For instance, it proved notoriously difficult to identify the fate of proteins carrying a putative type I signal peptidase (SPIase cleavage site, as many of those proteins are retained in the cell membrane as N-terminally anchored membrane proteins. Moreover, most of the SCL classifiers are based on the classification of the Swiss-Prot database and consequently inherited the inconsistency of that SCL classification. As accurate and detailed SCL prediction on a genome scale is highly desired by experimental researchers, we decided to construct a new SCL prediction pipeline: LocateP. Results LocateP combines many of the existing high-precision SCL identifiers with our own newly developed identifiers for specific SCLs. The LocateP pipeline was designed such that it mimics protein targeting and secretion processes. It distinguishes 7 different SCLs within Gram-positive bacteria: intracellular, multi-transmembrane, N-terminally membrane anchored, C-terminally membrane anchored, lipid-anchored, LPxTG-type cell-wall anchored, and secreted/released proteins. Moreover, it distinguishes pathways for Sec- or Tat-dependent secretion and alternative secretion of bacteriocin-like proteins. The pipeline was tested on data sets extracted from literature, including experimental proteomics studies. The tests showed that LocateP performs as well as, or even slightly better than other SCL predictors for some locations and outperforms

  12. 7 CFR 51.1860 - Color classification.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Color classification. 51.1860 Section 51.1860... STANDARDS) United States Standards for Fresh Tomatoes 1 Color Classification § 51.1860 Color classification... illustrating the color classification requirements, as set forth in this section. This visual aid may be...

  13. 22 CFR 42.11 - Classification symbols.

    Science.gov (United States)

    2010-04-01

    ... 22 Foreign Relations 1 2010-04-01 2010-04-01 false Classification symbols. 42.11 Section 42.11... NATIONALITY ACT, AS AMENDED Classification and Foreign State Chargeability § 42.11 Classification symbols. A... visa symbol to show the classification of the alien. Immigrants Symbol Class Section of law Immediate...

  14. 46 CFR 503.54 - Original classification.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 9 2010-10-01 2010-10-01 false Original classification. 503.54 Section 503.54 Shipping... Program § 503.54 Original classification. (a) No Commission Member or employee has the authority to... classification, it shall be sent to the appropriate agency with original classification authority over the...

  15. Pitch Based Sound Classification

    DEFF Research Database (Denmark)

    Nielsen, Andreas Brinch; Hansen, Lars Kai; Kjems, U

    2006-01-01

    A sound classification model is presented that can classify signals into music, noise and speech. The model extracts the pitch of the signal using the harmonic product spectrum. Based on the pitch estimate and a pitch error measure, features are created and used in a probabilistic model with soft......-max output function. Both linear and quadratic inputs are used. The model is trained on 2 hours of sound and tested on publicly available data. A test classification error below 0.05 with 1 s classification windows is achieved. Further more it is shown that linear input performs as well as a quadratic......, and that even though classification gets marginally better, not much is achieved by increasing the window size beyond 1 s....

  16. Network-Based Integration of GWAS and Gene Expression Identifies a HOX-Centric Network Associated with Serous Ovarian Cancer Risk.

    Science.gov (United States)

    Kar, Siddhartha P; Tyrer, Jonathan P; Li, Qiyuan; Lawrenson, Kate; Aben, Katja K H; Anton-Culver, Hoda; Antonenkova, Natalia; Chenevix-Trench, Georgia; Baker, Helen; Bandera, Elisa V; Bean, Yukie T; Beckmann, Matthias W; Berchuck, Andrew; Bisogna, Maria; Bjørge, Line; Bogdanova, Natalia; Brinton, Louise; Brooks-Wilson, Angela; Butzow, Ralf; Campbell, Ian; Carty, Karen; Chang-Claude, Jenny; Chen, Yian Ann; Chen, Zhihua; Cook, Linda S; Cramer, Daniel; Cunningham, Julie M; Cybulski, Cezary; Dansonka-Mieszkowska, Agnieszka; Dennis, Joe; Dicks, Ed; Doherty, Jennifer A; Dörk, Thilo; du Bois, Andreas; Dürst, Matthias; Eccles, Diana; Easton, Douglas F; Edwards, Robert P; Ekici, Arif B; Fasching, Peter A; Fridley, Brooke L; Gao, Yu-Tang; Gentry-Maharaj, Aleksandra; Giles, Graham G; Glasspool, Rosalind; Goode, Ellen L; Goodman, Marc T; Grownwald, Jacek; Harrington, Patricia; Harter, Philipp; Hein, Alexander; Heitz, Florian; Hildebrandt, Michelle A T; Hillemanns, Peter; Hogdall, Estrid; Hogdall, Claus K; Hosono, Satoyo; Iversen, Edwin S; Jakubowska, Anna; Paul, James; Jensen, Allan; Ji, Bu-Tian; Karlan, Beth Y; Kjaer, Susanne K; Kelemen, Linda E; Kellar, Melissa; Kelley, Joseph; Kiemeney, Lambertus A; Krakstad, Camilla; Kupryjanczyk, Jolanta; Lambrechts, Diether; Lambrechts, Sandrina; Le, Nhu D; Lee, Alice W; Lele, Shashi; Leminen, Arto; Lester, Jenny; Levine, Douglas A; Liang, Dong; Lissowska, Jolanta; Lu, Karen; Lubinski, Jan; Lundvall, Lene; Massuger, Leon; Matsuo, Keitaro; McGuire, Valerie; McLaughlin, John R; McNeish, Iain A; Menon, Usha; Modugno, Francesmary; Moysich, Kirsten B; Narod, Steven A; Nedergaard, Lotte; Ness, Roberta B; Nevanlinna, Heli; Odunsi, Kunle; Olson, Sara H; Orlow, Irene; Orsulic, Sandra; Weber, Rachel Palmieri; Pearce, Celeste Leigh; Pejovic, Tanja; Pelttari, Liisa M; Permuth-Wey, Jennifer; Phelan, Catherine M; Pike, Malcolm C; Poole, Elizabeth M; Ramus, Susan J; Risch, Harvey A; Rosen, Barry; Rossing, Mary Anne; Rothstein, Joseph H; Rudolph, Anja; Runnebaum, Ingo B; Rzepecka, Iwona K; Salvesen, Helga B; Schildkraut, Joellen M; Schwaab, Ira; Shu, Xiao-Ou; Shvetsov, Yurii B; Siddiqui, Nadeem; Sieh, Weiva; Song, Honglin; Southey, Melissa C; Sucheston-Campbell, Lara E; Tangen, Ingvild L; Teo, Soo-Hwang; Terry, Kathryn L; Thompson, Pamela J; Timorek, Agnieszka; Tsai, Ya-Yu; Tworoger, Shelley S; van Altena, Anne M; Van Nieuwenhuysen, Els; Vergote, Ignace; Vierkant, Robert A; Wang-Gohrke, Shan; Walsh, Christine; Wentzensen, Nicolas; Whittemore, Alice S; Wicklund, Kristine G; Wilkens, Lynne R; Woo, Yin-Ling; Wu, Xifeng; Wu, Anna; Yang, Hannah; Zheng, Wei; Ziogas, Argyrios; Sellers, Thomas A; Monteiro, Alvaro N A; Freedman, Matthew L; Gayther, Simon A; Pharoah, Paul D P

    2015-10-01

    Genome-wide association studies (GWAS) have so far reported 12 loci associated with serous epithelial ovarian cancer (EOC) risk. We hypothesized that some of these loci function through nearby transcription factor (TF) genes and that putative target genes of these TFs as identified by coexpression may also be enriched for additional EOC risk associations. We selected TF genes within 1 Mb of the top signal at the 12 genome-wide significant risk loci. Mutual information, a form of correlation, was used to build networks of genes strongly coexpressed with each selected TF gene in the unified microarray dataset of 489 serous EOC tumors from The Cancer Genome Atlas. Genes represented in this dataset were subsequently ranked using a gene-level test based on results for germline SNPs from a serous EOC GWAS meta-analysis (2,196 cases/4,396 controls). Gene set enrichment analysis identified six networks centered on TF genes (HOXB2, HOXB5, HOXB6, HOXB7 at 17q21.32 and HOXD1, HOXD3 at 2q31) that were significantly enriched for genes from the risk-associated end of the ranked list (P < 0.05 and FDR < 0.05). These results were replicated (P < 0.05) using an independent association study (7,035 cases/21,693 controls). Genes underlying enrichment in the six networks were pooled into a combined network. We identified a HOX-centric network associated with serous EOC risk containing several genes with known or emerging roles in serous EOC development. Network analysis integrating large, context-specific datasets has the potential to offer mechanistic insights into cancer susceptibility and prioritize genes for experimental characterization. ©2015 American Association for Cancer Research.

  17. Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics.

    Science.gov (United States)

    Chepelev, Leonid L; Riazanov, Alexandre; Kouznetsov, Alexandre; Low, Hong Sang; Dumontier, Michel; Baker, Christopher J O

    2011-07-26

    The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI) framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of our integrative methodology in the context of

  18. Prototype semantic infrastructure for automated small molecule classification and annotation in lipidomics

    Directory of Open Access Journals (Sweden)

    Dumontier Michel

    2011-07-01

    Full Text Available Abstract Background The development of high-throughput experimentation has led to astronomical growth in biologically relevant lipids and lipid derivatives identified, screened, and deposited in numerous online databases. Unfortunately, efforts to annotate, classify, and analyze these chemical entities have largely remained in the hands of human curators using manual or semi-automated protocols, leaving many novel entities unclassified. Since chemical function is often closely linked to structure, accurate structure-based classification and annotation of chemical entities is imperative to understanding their functionality. Results As part of an exploratory study, we have investigated the utility of semantic web technologies in automated chemical classification and annotation of lipids. Our prototype framework consists of two components: an ontology and a set of federated web services that operate upon it. The formal lipid ontology we use here extends a part of the LiPrO ontology and draws on the lipid hierarchy in the LIPID MAPS database, as well as literature-derived knowledge. The federated semantic web services that operate upon this ontology are deployed within the Semantic Annotation, Discovery, and Integration (SADI framework. Structure-based lipid classification is enacted by two core services. Firstly, a structural annotation service detects and enumerates relevant functional groups for a specified chemical structure. A second service reasons over lipid ontology class descriptions using the attributes obtained from the annotation service and identifies the appropriate lipid classification. We extend the utility of these core services by combining them with additional SADI services that retrieve associations between lipids and proteins and identify publications related to specified lipid types. We analyze the performance of SADI-enabled eicosanoid classification relative to the LIPID MAPS classification and reflect on the contribution of

  19. Classification of Osteogenesis Imperfecta revisited

    NARCIS (Netherlands)

    van Dijk, F. S.; Pals, G.; van Rijn, R. R.; Nikkels, P. G. J.; Cobben, J. M.

    2010-01-01

    In 1979 Sillence proposed a classification of Osteogenesis Imperfecta (OI) in OI types I, II, III and IV. In 2004 and 2007 this classification was expanded with OI types V-VIII because of distinct clinical features and/or different causative gene mutations. We propose a revised classification of OI

  20. 14 CFR 1203.701 - Classification.

    Science.gov (United States)

    2010-01-01

    ... 14 Aeronautics and Space 5 2010-01-01 2010-01-01 false Classification. 1203.701 Section 1203.701... Government Information § 1203.701 Classification. (a) Foreign government information that is classified by a foreign entity shall either retain its original classification designation or be marked with a United...

  1. 32 CFR 1602.7 - Classification.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 6 2010-07-01 2010-07-01 false Classification. 1602.7 Section 1602.7 National Defense Other Regulations Relating to National Defense SELECTIVE SERVICE SYSTEM DEFINITIONS § 1602.7 Classification. Classification is the exercise of the power to determine claims or questions with respect to...

  2. 32 CFR 644.426 - Classification.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 4 2010-07-01 2010-07-01 true Classification. 644.426 Section 644.426 National... HANDBOOK Disposal Disposal of Fee-Owned Real Property and Easement Interests § 644.426 Classification... required by the special acts, classification will be coordinated with the interested Federal agency. The...

  3. 46 CFR 132.210 - Classification.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 4 2010-10-01 2010-10-01 false Classification. 132.210 Section 132.210 Shipping COAST... Portable and Semiportable Fire Extinguishers § 132.210 Classification. (a) Each portable fire extinguisher... Classification Type Size Halon 1211, 1301, and 1211-1301 mixtures kgs. (lbs.) Foam, liters (gallons) Carbon...

  4. Relationship between serum high-sensitivity C-reactive protein and modified TOAST classification as well as OCSP subtypes in patients with acute ischemic stroke

    Directory of Open Access Journals (Sweden)

    Hua-jun CHANG

    2014-10-01

    Full Text Available This paper aims to investigate the relationship between serum high-sensitivity C-reactive protein (hs-CRP level and modified TOAST classification as well as OCSP subtypes in patients with acute ischemic stroke. Serum hs-CRP was measured in 240 patients with acute ischemic stroke and 120 normal controls. All patients were classified according to modified TOAST classification and OCSP criteria. Serum hs-CRP levels in acute ischemic stroke group were significantly higher than those in normal control group [(13.68 ± 6.92 mg/L vs (3.98 ± 0.76 mg/L; t = 6.922, P = 0.002]. Among modified TOAST subtypes, the highest serum hs-CRP level was in cardioembolism (CE group [(16.82 ± 6.16 mg/L], followed by arterothrombosis (AT group [(15.17 ± 5.68 mg/L], stroke of undetermined etiology (SUD group [(10.06 ± 3.89 mg/L] and small artery disease (SAD group [(9.86 ± 3.75 mg/L, P = 0.027]. Among OCSP subtypes, the highest serum hs-CRP level was in total anterior circulation infarct (TACI group [(17.02 ± 6.98 mg/L], followed by posterior circulation infarct (POCI group [(15.91 ± 7.12 mg/L], partial anterior circulation infarct (PACI group [(12.83 ± 4.95 mg/L] and lacunar infarct (LACI group [(10.61 ± 5.73 mg/L, P = 0.005]. Serum hs-CRP levels are various in different modified TOAST and OCSP subtypes, which may reflect etiological and pathophysiological diversity of acute ischemic stroke, guide clinical treatment and help to predict prognosis. doi: 10.3969/j.issn.1672-6731.2014.10.013

  5. Library Classification 2020

    Science.gov (United States)

    Harris, Christopher

    2013-01-01

    In this article the author explores how a new library classification system might be designed using some aspects of the Dewey Decimal Classification (DDC) and ideas from other systems to create something that works for school libraries in the year 2020. By examining what works well with the Dewey Decimal System, what features should be carried…

  6. 14 CFR 298.3 - Classification.

    Science.gov (United States)

    2010-01-01

    ... 14 Aeronautics and Space 4 2010-01-01 2010-01-01 false Classification. 298.3 Section 298.3... REGULATIONS EXEMPTIONS FOR AIR TAXI AND COMMUTER AIR CARRIER OPERATIONS General § 298.3 Classification. (a) There is hereby established a classification of air carriers, designated as “air taxi operators,” which...

  7. In vivo subsite classification and diagnosis of oral cancers using Raman spectroscopy

    Directory of Open Access Journals (Sweden)

    Aditi Sahu

    2016-09-01

    Full Text Available Oral cancers suffer from poor disease-free survival rates due to delayed diagnosis. Noninvasive, rapid, objective approaches as adjuncts to visual inspection can help in better management of oral cancers. Raman spectroscopy (RS has shown potential in identification of oral premalignant and malignant conditions and also in the detection of early cancer changes like cancer-field-effects (CFE at buccal mucosa subsite. Anatomic differences between different oral subsites have also been reported using RS. In this study, anatomical differences between subsites and their possible influence on healthy vs pathological classification were evaluated on 85 oral cancer and 72 healthy subjects. Spectra were acquired from buccal mucosa, lip and tongue in healthy, contralateral (internal healthy control, premalignant and cancer conditions using fiber-optic Raman spectrometer. Mean spectra indicate predominance of lipids in healthy buccal mucosa, contribution of both lipids and proteins in lip while major dominance of protein in tongue spectra. From healthy to tumor, changes in protein secondary-structure, DNA and heme-related features were observed. Principal component linear discriminant analysis (PC-LDA followed by leave-one-out-cross-validation (LOOCV was used for data analysis. Findings indicate buccal mucosa and tongue are distinct entities, while lip misclassifies with both these subsites. Additionally, the diagnostic algorithm for individual subsites gave improved classification efficiencies with respect to the pooled subsites model. However, as the pooled subsites model yielded 98% specificity and 100% sensitivity, this model may be more useful for preliminary screening applications. Large-scale validation studies are a pre-requisite before envisaging future clinical applications.

  8. 32 CFR 2700.22 - Classification guides.

    Science.gov (United States)

    2010-07-01

    ... SECURITY INFORMATION REGULATIONS Derivative Classification § 2700.22 Classification guides. OMSN shall... direct derivative classification, shall identify the information to be protected in specific and uniform...

  9. Update on diabetes classification.

    Science.gov (United States)

    Thomas, Celeste C; Philipson, Louis H

    2015-01-01

    This article highlights the difficulties in creating a definitive classification of diabetes mellitus in the absence of a complete understanding of the pathogenesis of the major forms. This brief review shows the evolving nature of the classification of diabetes mellitus. No classification scheme is ideal, and all have some overlap and inconsistencies. The only diabetes in which it is possible to accurately diagnose by DNA sequencing, monogenic diabetes, remains undiagnosed in more than 90% of the individuals who have diabetes caused by one of the known gene mutations. The point of classification, or taxonomy, of disease, should be to give insight into both pathogenesis and treatment. It remains a source of frustration that all schemes of diabetes mellitus continue to fall short of this goal. Copyright © 2015 Elsevier Inc. All rights reserved.

  10. ViCTree: An automated framework for taxonomic classification from protein sequences.

    Science.gov (United States)

    Modha, Sejal; Thanki, Anil; Cotmore, Susan F; Davison, Andrew J; Hughes, Joseph

    2018-02-20

    The increasing rate of submission of genetic sequences into public databases is providing a growing resource for classifying the organisms that these sequences represent. To aid viral classification, we have developed ViCTree, which automatically integrates the relevant sets of sequences in NCBI GenBank and transforms them into an interactive maximum likelihood phylogenetic tree that can be updated automatically. ViCTree incorporates ViCTreeView, which is a JavaScript-based visualisation tool that enables the tree to be explored interactively in the context of pairwise distance data. To demonstrate utility, ViCTree was applied to subfamily Densovirinae of family Parvoviridae. This led to the identification of six new species of insect virus. ViCTree is open-source and can be run on any Linux- or Unix-based computer or cluster. A tutorial, the documentation and the source code are available under a GPL3 license, and can be accessed at http://bioinformatics.cvr.ac.uk/victree_web/. sejal.modha@glasgow.ac.uk.

  11. The classification of osteonecrosis in patients with cancer: validation of a new radiological classification system

    International Nuclear Information System (INIS)

    Niinimäki, T.; Niinimäki, J.; Halonen, J.; Hänninen, P.; Harila-Saari, A.; Niinimäki, R.

    2015-01-01

    Aim: To validate a new, non-joint-specific radiological classification system that is suitable regardless of the site of the osteonecrosis (ON) in patients with cancer. Material and methods: Critical deficiencies in the existing ON classification systems were identified and a new, non-joint-specific radiological classification system was developed. Seventy-two magnetic resonance imaging (MRI) images of patients with cancer and ON lesions were graded, and the validation of the new system was performed by assessing inter- and intra-observer reliability. Results: Intra-observer reliability of ON grading was good or very good, with kappa values of 0.79–0.86. Interobserver agreement was lower but still good, with kappa values of 0.62–0.77. Ninety-eight percent of all intra- or interobserver differences were within one grade. Interobserver reliability of assessing the location of ON was very good, with kappa values of 0.93–0.98. Conclusion: All the available radiological ON classification systems are joint specific. This limitation has spurred the development of multiple systems, which has led to the insufficient use of classifications in ON studies among patients with cancer. The introduced radiological classification system overcomes the problem of joint-specificity, was found to be reliable, and can be used to classify all ON lesions regardless of the affected site. - Highlights: • Patients with cancer may have osteonecrosis lesions at multiple sites. • There is no non-joint-specific osteonecrosis classification available. • We introduced a new non-joint-specific osteonecrosis classification. • The validation was performed by assessing inter- and intra-observer reliability. • The classification was reliable and could be used regardless of the affected site.

  12. Border Lakes land-cover classification

    Science.gov (United States)

    Marvin Bauer; Brian Loeffelholz; Doug. Shinneman

    2009-01-01

    This document contains metadata and description of land-cover classification of approximately 5.1 million acres of land bordering Minnesota, U.S.A. and Ontario, Canada. The classification focused on the separation and identification of specific forest-cover types. Some separation of the nonforest classes also was performed. The classification was derived from multi-...

  13. The reliability and reproducibility of the Hertel classification for comminuted proximal humeral fractures compared with the Neer classification

    NARCIS (Netherlands)

    Iordens, Gijs I. T.; Mahabier, Kiran C.; Buisman, Florian E.; Schep, Niels W. L.; Muradin, Galied S. R.; Beenen, Ludo F. M.; Patka, Peter; van Lieshout, Esther M. M.; den Hartog, Dennis

    2016-01-01

    The Neer classification is the most commonly used fracture classification system for proximal humeral fractures. Inter- and intra-observer agreement is limited, especially for comminuted fractures. A possibly more straightforward and reliable classification system is the Hertel classification. The

  14. Classification of movement disorders.

    Science.gov (United States)

    Fahn, Stanley

    2011-05-01

    The classification of movement disorders has evolved. Even the terminology has shifted, from an anatomical one of extrapyramidal disorders to a phenomenological one of movement disorders. The history of how this shift came about is described. The history of both the definitions and the classifications of the various neurologic conditions is then reviewed. First is a review of movement disorders as a group; then, the evolving classifications for 3 of them--parkinsonism, dystonia, and tremor--are covered in detail. Copyright © 2011 Movement Disorder Society.

  15. Information Classification on University Websites

    DEFF Research Database (Denmark)

    Nawaz, Ather; Clemmensen, Torkil; Hertzum, Morten

    2011-01-01

    Websites are increasingly used as a medium for providing information to university students. The quality of a university website depends on how well the students’ information classification fits with the structure of the information on the website. This paper investigates the information classifi......Websites are increasingly used as a medium for providing information to university students. The quality of a university website depends on how well the students’ information classification fits with the structure of the information on the website. This paper investigates the information...... classification of 14 Danish and 14 Pakistani students and compares it with the information classification of their university website. Brainstorming, card sorting, and task exploration activities were used to discover similarities and differences in the participating students’ classification of website...... information and their ability to navigate the websites. The results of the study indicate group differences in user classification and related taskperformance differences. The main implications of the study are that (a) the edit distance appears a useful measure in cross-country HCI research and practice...

  16. Information Classification on University Websites

    DEFF Research Database (Denmark)

    Nawaz, Ather; Clemmensen, Torkil; Hertzum, Morten

    2011-01-01

    Websites are increasingly used as a medium for providing information to university students. The quality of a university website depends on how well the students’ information classification fits with the structure of the information on the website. This paper investigates the information classifi......Websites are increasingly used as a medium for providing information to university students. The quality of a university website depends on how well the students’ information classification fits with the structure of the information on the website. This paper investigates the information...... classification of 14 Danish and 14 Pakistani students and compares it with the information classification of their university website. Brainstorming, card sorting, and task exploration activities were used to discover similarities and differences in the participating students’ classification of website...... information and their ability to navigate the websites. The results of the study indicate group differences in user classification and related task-performance differences. The main implications of the study are that (a) the edit distance appears a useful measure in cross-country HCI research and practice...

  17. Colombia: Territorial classification

    International Nuclear Information System (INIS)

    Mendoza Morales, Alberto

    1998-01-01

    The article is about the approaches of territorial classification, thematic axes, handling principles and territorial occupation, politician and administrative units and administration regions among other topics. Understanding as Territorial Classification the space distribution on the territory of the country, of the geographical configurations, the human communities, the political-administrative units and the uses of the soil, urban and rural, existent and proposed

  18. Latent classification models

    DEFF Research Database (Denmark)

    Langseth, Helge; Nielsen, Thomas Dyhre

    2005-01-01

    parametric family ofdistributions.  In this paper we propose a new set of models forclassification in continuous domains, termed latent classificationmodels. The latent classification model can roughly be seen ascombining the \\NB model with a mixture of factor analyzers,thereby relaxing the assumptions...... classification model, and wedemonstrate empirically that the accuracy of the proposed model issignificantly higher than the accuracy of other probabilisticclassifiers....

  19. Overexpression of HOXA4 and HOXA9 genes promotes self-renewal and contributes to colon cancer stem cell overpopulation.

    Science.gov (United States)

    Bhatlekar, Seema; Viswanathan, Vignesh; Fields, Jeremy Z; Boman, Bruce M

    2018-02-01

    Because HOX genes encode master regulatory transcription factors that regulate stem cells (SCs) during development and aberrant expression of HOX genes occurs in various cancers, our goal was to determine if dysregulation of HOX genes is involved in the SC origin of colorectal cancer (CRC). We previously reported that HOXA4 and HOXD10 are expressed in the colonic SC niche and are overexpressed in CRC. HOX gene expression was studied in SCs from human colon tissue and CRC cells (CSCs) using qPCR and immunostaining. siRNA-mediated knockdown of HOX expression was used to evaluate the role of HOX genes in modulating cancer SC (CSC) phenotype at the level of proliferation, SC marker expression, and sphere formation. All-trans-retinoic-acid (ATRA), a differentiation-inducing agent was evaluated for its effects on HOX expression and CSC growth. We found that HOXA4 and HOXA9 are up-regulated in CRC SCs. siRNA knockdown of HOXA4 and HOXA9 reduced: (i) proliferation and sphere-formation and (ii) gene expression of known SC markers (ALDH1, CD166, LGR5). These results indicate that proliferation and self-renewal ability of CRC SCs are reduced in HOXA4 and HOXA9 knockdown cells. ATRA decreased HOXA4, HOXA9, and HOXD10 expression in parallel with reduction in ALDH1 expression, self-renewal, and proliferation. Overall, our findings indicate that overexpression of HOXA4 and HOXA9 contributes to self-renewal and overpopulation of SCs in CRC. Strategies designed to modulate HOX expression may provide ways to target malignant SCs and to develop more effective therapies for CRC. © 2017 Wiley Periodicals, Inc.

  20. Multikilovolt Coherent X-Ray Generation for Protein Analysis and Biological Threat Reduction

    National Research Council Canada - National Science Library

    Rhodes, Charles

    2003-01-01

    Efforts to trump bioterrorism can be sharply advanced by the development of new modalities for the rapid measurement and quantitative classification of protein structural and regulatory information. This report (1...

  1. Arabic text classification using Polynomial Networks

    Directory of Open Access Journals (Sweden)

    Mayy M. Al-Tahrawi

    2015-10-01

    Full Text Available In this paper, an Arabic statistical learning-based text classification system has been developed using Polynomial Neural Networks. Polynomial Networks have been recently applied to English text classification, but they were never used for Arabic text classification. In this research, we investigate the performance of Polynomial Networks in classifying Arabic texts. Experiments are conducted on a widely used Arabic dataset in text classification: Al-Jazeera News dataset. We chose this dataset to enable direct comparisons of the performance of Polynomial Networks classifier versus other well-known classifiers on this dataset in the literature of Arabic text classification. Results of experiments show that Polynomial Networks classifier is a competitive algorithm to the state-of-the-art ones in the field of Arabic text classification.

  2. Angle′s Molar Classification Revisited

    Directory of Open Access Journals (Sweden)

    Devanshi Yadav

    2014-01-01

    Results: Of the 500 pretreatment study casts assessed 52.4% were definitive Class I, 23.6% were Class II, 2.6% were Class III and the ambiguous cases were 21%. These could be easily classified with our method of classification. Conclusion: This improvised classification technique will help orthodontists in making classification of malocclusion accurate and simple.

  3. The Oxford classification of IgA nephropathy: rationale, clinicopathological correlations, and classification

    NARCIS (Netherlands)

    Cattran, Daniel C.; Coppo, Rosanna; Cook, H. Terence; Feehally, John; Roberts, Ian S. D.; Troyanov, Stéphan; Alpers, Charles E.; Amore, Alessandro; Barratt, Jonathan; Berthoux, Francois; Bonsib, Stephen; Bruijn, Jan A.; D'Agati, Vivette; D'Amico, Giuseppe; Emancipator, Steven; Emma, Francesco; Ferrario, Franco; Fervenza, Fernando C.; Florquin, Sandrine; Fogo, Agnes; Geddes, Colin C.; Groene, Hermann-Josef; Haas, Mark; Herzenberg, Andrew M.; Hill, Prue A.; Hogg, Ronald J.; Hsu, Stephen I.; Jennette, J. Charles; Joh, Kensuke; Julian, Bruce A.; Kawamura, Tetsuya; Lai, Fernand M.; Leung, Chi Bon; Li, Lei-Shi; Li, Philip K. T.; Liu, Zhi-Hong; Mackinnon, Bruce; Mezzano, Sergio; Schena, F. Paolo; Tomino, Yasuhiko; Walker, Patrick D.; Wang, Haiyan; Weening, Jan J.; Yoshikawa, Nori; Zhang, Hong

    2009-01-01

    IgA nephropathy is the most common glomerular disease worldwide, yet there is no international consensus for its pathological or clinical classification. Here a new classification for IgA nephropathy is presented by an international consensus working group. The goal of this new system was to

  4. An edit script for taxonomic classifications

    Directory of Open Access Journals (Sweden)

    Valiente Gabriel

    2005-08-01

    Full Text Available Abstract Background The NCBI taxonomy provides one of the most powerful ways to navigate sequence data bases but currently users are forced to formulate queries according to a single taxonomic classification. Given that there is not universal agreement on the classification of organisms, providing a single classification places constraints on the questions biologists can ask. However, maintaining multiple classifications is burdensome in the face of a constantly growing NCBI classification. Results In this paper, we present a solution to the problem of generating modifications of the NCBI taxonomy, based on the computation of an edit script that summarises the differences between two classification trees. Our algorithms find the shortest possible edit script based on the identification of all shared subtrees, and only take time quasi linear in the size of the trees because classification trees have unique node labels. Conclusion These algorithms have been recently implemented, and the software is freely available for download from http://darwin.zoology.gla.ac.uk/~rpage/forest/.

  5. Classification of huminite-ICCP System 1994

    Energy Technology Data Exchange (ETDEWEB)

    Sykorova, I. [Institute of Rock Structure and Mechanics, Academy of Science of the Czech Republic, V Holesovicka 41, 182 09 Prague 8 (Czech Republic); Pickel, W. [Coal and Organic Petrology Services Pty Ltd, 23/80 Box Road, Taren Point, NSW 2229 (Australia); Christanis, K. [Department of Geology, University of Patras, 26500 Rio-Patras (Greece); Wolf, M. [Mergelskull 29, 47802 Krefeld (Germany); Taylor, G.H. [15 Hawkesbury Cres, Farrer Act 2607 (Australia); Flores, D. [Departamento de Geologia, Faculdade de Ciencias do Porto, Praca de Gomes Teixeira, 4099-002 Porto (Portugal)

    2005-04-12

    In the new classification (ICCP System 1994), the maceral group huminite has been revised from the previous classification (ICCP, 1971. Int. Handbook Coal Petr., suppl. to 2nd ed.) to accommodate the nomenclature to changes in the other maceral groups, especially the changes in the vitrinite classification (ICCP, 1998. The new vitrinite classification (ICCP System 1994). Fuel 77, 349-358.). The vitrinite and huminite systems have been correlated so that down to the level of sub-maceral groups, the two systems can be used in parallel. At the level of macerals and for finer classifications, the analyst now has, according to the nature of the coal and the purpose of the analysis, a choice of using either of the two classification systems for huminite and vitrinite. This is in accordance with the new ISO Coal Classification that covers low rank coals as well and allows for the simultaneous use of the huminite and vitrinite nomenclature for low rank coals.

  6. Efficient Fingercode Classification

    Science.gov (United States)

    Sun, Hong-Wei; Law, Kwok-Yan; Gollmann, Dieter; Chung, Siu-Leung; Li, Jian-Bin; Sun, Jia-Guang

    In this paper, we present an efficient fingerprint classification algorithm which is an essential component in many critical security application systems e. g. systems in the e-government and e-finance domains. Fingerprint identification is one of the most important security requirements in homeland security systems such as personnel screening and anti-money laundering. The problem of fingerprint identification involves searching (matching) the fingerprint of a person against each of the fingerprints of all registered persons. To enhance performance and reliability, a common approach is to reduce the search space by firstly classifying the fingerprints and then performing the search in the respective class. Jain et al. proposed a fingerprint classification algorithm based on a two-stage classifier, which uses a K-nearest neighbor classifier in its first stage. The fingerprint classification algorithm is based on the fingercode representation which is an encoding of fingerprints that has been demonstrated to be an effective fingerprint biometric scheme because of its ability to capture both local and global details in a fingerprint image. We enhance this approach by improving the efficiency of the K-nearest neighbor classifier for fingercode-based fingerprint classification. Our research firstly investigates the various fast search algorithms in vector quantization (VQ) and the potential application in fingerprint classification, and then proposes two efficient algorithms based on the pyramid-based search algorithms in VQ. Experimental results on DB1 of FVC 2004 demonstrate that our algorithms can outperform the full search algorithm and the original pyramid-based search algorithms in terms of computational efficiency without sacrificing accuracy.

  7. Soliton concepts and protein structure

    Science.gov (United States)

    Krokhotin, Andrei; Niemi, Antti J.; Peng, Xubiao

    2012-03-01

    Structural classification shows that the number of different protein folds is surprisingly small. It also appears that proteins are built in a modular fashion from a relatively small number of components. Here we propose that the modular building blocks are made of the dark soliton solution of a generalized discrete nonlinear Schrödinger equation. We find that practically all protein loops can be obtained simply by scaling the size and by joining together a number of copies of the soliton, one after another. The soliton has only two loop-specific parameters, and we compute their statistical distribution in the Protein Data Bank (PDB). We explicitly construct a collection of 200 sets of parameters, each determining a soliton profile that describes a different short loop. The ensuing profiles cover practically all those proteins in PDB that have a resolution which is better than 2.0 Å, with a precision such that the average root-mean-square distance between the loop and its soliton is less than the experimental B-factor fluctuation distance. We also present two examples that describe how the loop library can be employed both to model and to analyze folded proteins.

  8. Classification of parotidectomy: a proposed modification to the European Salivary Gland Society classification system.

    Science.gov (United States)

    Wong, Wai Keat; Shetty, Subhaschandra

    2017-08-01

    Parotidectomy remains the mainstay of treatment for both benign and malignant lesions of the parotid gland. There exists a wide range of possible surgical options in parotidectomy in terms of extent of parotid tissue removed. There is increasing need for uniformity of terminology resulting from growing interest in modifications of the conventional parotidectomy. It is, therefore, of paramount importance for a standardized classification system in describing extent of parotidectomy. Recently, the European Salivary Gland Society (ESGS) proposed a novel classification system for parotidectomy. The aim of this study is to evaluate this system. A classification system proposed by the ESGS was critically re-evaluated and modified to increase its accuracy and its acceptability. Modifications mainly focused on subdividing Levels I and II into IA, IB, IIA, and IIB. From June 2006 to June 2016, 126 patients underwent 130 parotidectomies at our hospital. The classification system was tested in that cohort of patient. While the ESGS classification system is comprehensive, it does not cover all possibilities. The addition of Sublevels IA, IB, IIA, and IIB may help to address some of the clinical situations seen and is clinically relevant. We aim to test the modified classification system for partial parotidectomy to address some of the challenges mentioned.

  9. The future of general classification

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2013-01-01

    Discusses problems related to accessing multiple collections using a single retrieval language. Surveys the concepts of interoperability and switching language. Finds that mapping between more indexing languages always will be an approximation. Surveys the issues related to general classification...... and contrasts that to special classifications. Argues for the use of general classifications to provide access to collections nationally and internationally....

  10. Clinical classification of cancer cachexia: phenotypic correlates in human skeletal muscle.

    Directory of Open Access Journals (Sweden)

    Neil Johns

    Full Text Available BACKGROUND: Cachexia affects the majority of patients with advanced cancer and is associated with a reduction in treatment tolerance, response to therapy, and duration of survival. One impediment towards the effective treatment of cachexia is a validated classification system. METHODS: 41 patients with resectable upper gastrointestinal (GI or pancreatic cancer underwent characterisation for cachexia based on weight-loss (WL and/or low muscularity (LM. Four diagnostic criteria were used >5%WL, >10%WL, LM, and LM+>2%WL. All patients underwent biopsy of the rectus muscle. Analysis included immunohistochemistry for fibre size and type, protein and nucleic acid concentration, Western blots for markers of autophagy, SMAD signalling, and inflammation. FINDINGS: Compared with non-cachectic cancer patients, patients with LM or LM+>2%WL, mean muscle fibre diameter was reduced by about 25% (p = 0.02 and p = 0.001 respectively. No significant difference in fibre diameter was observed if patients had WL alone. Regardless of classification, there was no difference in fibre number or proportion of fibre type across all myosin heavy chain isoforms. Mean muscle protein content was reduced and the ratio of RNA/DNA decreased in patients with either >5%WL or LM+>2%WL. Compared with non-cachectic patients, SMAD3 protein levels were increased in patients with >5%WL (p = 0.022 and with >10%WL, beclin (p = 0.05 and ATG5 (p = 0.01 protein levels were increased. There were no differences in phospho-NFkB or phospho-STAT3 levels across any of the groups. CONCLUSION: Muscle fibre size, biochemical composition and pathway phenotype can vary according to whether the diagnostic criteria for cachexia are based on weight loss alone, a measure of low muscularity alone or a combination of the two. For intervention trials where the primary end-point is a change in muscle mass or function, use of combined diagnostic criteria may allow identification of a more

  11. Serum protein fingerprint of patients with gastric cancer by SELDI ...

    African Journals Online (AJOL)

    STORAGESEVER

    2010-04-12

    Apr 12, 2010 ... Software (BPS) to construct the classification tree of gastric cancer. Briefly, the ..... that modulates lipid trafficking and immune responses. It is also the ... Therefore, we can not get the structures, functions of the proteins, and it ...

  12. Bosniak Classification system

    DEFF Research Database (Denmark)

    Graumann, Ole; Osther, Susanne Sloth; Karstoft, Jens

    2014-01-01

    Background: The Bosniak classification is a diagnostic tool for the differentiation of cystic changes in the kidney. The process of categorizing renal cysts may be challenging, involving a series of decisions that may affect the final diagnosis and clinical outcome such as surgical management....... Purpose: To investigate the inter- and intra-observer agreement among experienced uroradiologists when categorizing complex renal cysts according to the Bosniak classification. Material and Methods: The original categories of 100 cystic renal masses were chosen as “Gold Standard” (GS), established...... to the calculated weighted κ all readers performed “very good” for both inter-observer and intra-observer variation. Most variation was seen in cysts catagorized as Bosniak II, IIF, and III. These results show that radiologists who evaluate complex renal cysts routinely may apply the Bosniak classification...

  13. Quantification and Classification of E. coli Proteome Utilization and Unused Protein Costs across Environments

    DEFF Research Database (Denmark)

    O'Brien, Edward J.; Utrilla, Jose; Palsson, Bernhard

    2016-01-01

    The costs and benefits of protein expression are balanced through evolution. Expression of un-utilized protein (that have no benefits in the current environment) incurs a quantifiable fitness costs on cellular growth rates; however, the magnitude and variability of un-utilized protein expression...... in varying environments. Thus, unused protein expression is the source of large and pervasive fitness costs that may provide the benefit of hedging against environmental change....... in natural settings is unknown, largely due to the challenge in determining environment-specific proteome utilization. We address this challenge using absolute and global proteomics data combined with a recently developed genome-scale model of Escherichia coli that computes the environment-specific cost...

  14. Definition and classification of epilepsy. Classification of epileptic seizures 2016

    Directory of Open Access Journals (Sweden)

    K. Yu. Mukhin

    2017-01-01

    Full Text Available Epilepsy is one of the most common neurological diseases, especially in childhood and adolescence. The incidence varies from 15 to 113 cases per 100 000 population with the maximum among children under 1 year old. The prevalence of epilepsy is high, ranging from 5 to 8 cases (in some regions – 10 cases per 1000 children under 15 years old. Classification of the disease has great importance for diagnosis, treatment and prognosis. The article presents a novel strategy for classification of epileptic seizures, developed in 2016. It contains a number of brand new concepts, including a very important one, saying that some seizures, previously considered as generalized or focal only, can be, in fact, both focal and generalized. They include tonic, atonic, myoclonic seizures and epileptic spasms. The term “secondarily generalized seizure” is replace by the term “bilateral tonic-clonic seizure” (as soon as it is not a separate type of epileptic seizures, and the term reflects the spread of discharge from any area of cerebral cortex and evolution of any types of focal seizures. International League Against Epilepsy recommends to abandon the term “pseudo-epileptic seizures” and replace it by the term “psychogenic non-epileptic seizures”. If a doctor is not sure that seizures have epileptic nature, the term “paroxysmal event” should be used without specifying the disease. The conception of childhood epileptic encephalopathies, developed within this novel classification project, is one of the most significant achievements, since in this case not only the seizures, but even epileptiform activity can induce severe disorders of higher mental functions. In addition to detailed description of the new strategy for classification of epileptic seizures, the article contains a comprehensive review of the existing principles of epilepsy and epileptic seizures classification.

  15. Protein Secondary Structure Prediction Using AutoEncoder Network and Bayes Classifier

    Science.gov (United States)

    Wang, Leilei; Cheng, Jinyong

    2018-03-01

    Protein secondary structure prediction is belong to bioinformatics,and it's important in research area. In this paper, we propose a new prediction way of protein using bayes classifier and autoEncoder network. Our experiments show some algorithms including the construction of the model, the classification of parameters and so on. The data set is a typical CB513 data set for protein. In terms of accuracy, the method is the cross validation based on the 3-fold. Then we can get the Q3 accuracy. Paper results illustrate that the autoencoder network improved the prediction accuracy of protein secondary structure.

  16. 5 CFR 2500.3 - Original classification.

    Science.gov (United States)

    2010-01-01

    ... 5 Administrative Personnel 3 2010-01-01 2010-01-01 false Original classification. 2500.3 Section... SECURITY REGULATION § 2500.3 Original classification. No one in the Office of Administration has been granted authority for original classification of information. ...

  17. BayesMotif: de novo protein sorting motif discovery from impure datasets.

    Science.gov (United States)

    Hu, Jianjun; Zhang, Fan

    2010-01-18

    Protein sorting is the process that newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals is needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with a less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence dataset. It also shows that the false positive removal procedure can help to identify true motifs even when there is only 20% of the input sequences containing true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of

  18. 10 CFR 61.55 - Waste classification.

    Science.gov (United States)

    2010-01-01

    ... REGULATORY COMMISSION (CONTINUED) LICENSING REQUIREMENTS FOR LAND DISPOSAL OF RADIOACTIVE WASTE Technical Requirements for Land Disposal Facilities § 61.55 Waste classification. (a) Classification of waste for near surface disposal—(1) Considerations. Determination of the classification of radioactive waste involves two...

  19. Evolution of homeobox genes.

    Science.gov (United States)

    Holland, Peter W H

    2013-01-01

    Many homeobox genes encode transcription factors with regulatory roles in animal and plant development. Homeobox genes are found in almost all eukaryotes, and have diversified into 11 gene classes and over 100 gene families in animal evolution, and 10 to 14 gene classes in plants. The largest group in animals is the ANTP class which includes the well-known Hox genes, plus other genes implicated in development including ParaHox (Cdx, Xlox, Gsx), Evx, Dlx, En, NK4, NK3, Msx, and Nanog. Genomic data suggest that the ANTP class diversified by extensive tandem duplication to generate a large array of genes, including an NK gene cluster and a hypothetical ProtoHox gene cluster that duplicated to generate Hox and ParaHox genes. Expression and functional data suggest that NK, Hox, and ParaHox gene clusters acquired distinct roles in patterning the mesoderm, nervous system, and gut. The PRD class is also diverse and includes Pax2/5/8, Pax3/7, Pax4/6, Gsc, Hesx, Otx, Otp, and Pitx genes. PRD genes are not generally arranged in ancient genomic clusters, although the Dux, Obox, and Rhox gene clusters arose in mammalian evolution as did several non-clustered PRD genes. Tandem duplication and genome duplication expanded the number of homeobox genes, possibly contributing to the evolution of developmental complexity, but homeobox gene loss must not be ignored. Evolutionary changes to homeobox gene expression have also been documented, including Hox gene expression patterns shifting in concert with segmental diversification in vertebrates and crustaceans, and deletion of a Pitx1 gene enhancer in pelvic-reduced sticklebacks. WIREs Dev Biol 2013, 2:31-45. doi: 10.1002/wdev.78 For further resources related to this article, please visit the WIREs website. The author declares that he has no conflicts of interest. Copyright © 2012 Wiley Periodicals, Inc.

  20. Vietnamese Document Representation and Classification

    Science.gov (United States)

    Nguyen, Giang-Son; Gao, Xiaoying; Andreae, Peter

    Vietnamese is very different from English and little research has been done on Vietnamese document classification, or indeed, on any kind of Vietnamese language processing, and only a few small corpora are available for research. We created a large Vietnamese text corpus with about 18000 documents, and manually classified them based on different criteria such as topics and styles, giving several classification tasks of different difficulty levels. This paper introduces a new syllable-based document representation at the morphological level of the language for efficient classification. We tested the representation on our corpus with different classification tasks using six classification algorithms and two feature selection techniques. Our experiments show that the new representation is effective for Vietnamese categorization, and suggest that best performance can be achieved using syllable-pair document representation, an SVM with a polynomial kernel as the learning algorithm, and using Information gain and an external dictionary for feature selection.

  1. Reprogramming Antagonizes the Oncogenicity of HOXA13-Long Noncoding RNA HOTTIP Axis in Gastric Cancer Cells.

    Science.gov (United States)

    Wu, Deng-Chyang; Wang, Sophie S W; Liu, Chung-Jung; Wuputra, Kenly; Kato, Kohsuke; Lee, Yen-Liang; Lin, Ying-Chu; Tsai, Ming-Ho; Ku, Chia-Chen; Lin, Wen-Hsin; Wang, Shin-Wei; Kishikawa, Shotaro; Noguchi, Michiya; Wu, Chu-Chieh; Chen, Yi-Ting; Chai, Chee-Yin; Lin, Chen-Lung Steve; Kuo, Kung-Kai; Yang, Ya-Han; Miyoshi, Hiroyuki; Nakamura, Yukio; Saito, Shigeo; Nagata, Kyosuke; Lin, Chang-Shen; Yokoyama, Kazunari K

    2017-10-01

    Reprogramming of cancer cells into induced pluripotent stem cells (iPSCs) is a compelling idea for inhibiting oncogenesis, especially through modulation of homeobox proteins in this reprogramming process. We examined the role of various long noncoding RNAs (lncRNAs)-homeobox protein HOXA13 axis on the switching of the oncogenic function of bone morphogenetic protein 7 (BMP7), which is significantly lost in the gastric cancer cell derived iPS-like cells (iPSLCs). BMP7 promoter activation occurred through the corecruitment of HOXA13, mixed-lineage leukemia 1 lysine N-methyltransferase, WD repeat-containing protein 5, and lncRNA HoxA transcript at the distal tip (HOTTIP) to commit the epigenetic changes to the trimethylation of lysine 4 on histone H3 in cancer cells. By contrast, HOXA13 inhibited BMP7 expression in iPSLCs via the corecruitment of HOXA13, enhancer of zeste homolog 2, Jumonji and AT rich interactive domain 2, and lncRNA HoxA transcript antisense RNA (HOTAIR) to various cis-element of the BMP7 promoter. Knockdown experiments demonstrated that HOTTIP contributed positively, but HOTAIR regulated negatively to HOXA13-mediated BMP7 expression in cancer cells and iPSLCs, respectively. These findings indicate that the recruitment of HOXA13-HOTTIP and HOXA13-HOTAIR to different sites in the BMP7 promoter is crucial for the oncogenic fate of human gastric cells. Reprogramming with octamer-binding protein 4 and Jun dimerization protein 2 can inhibit tumorigenesis by switching off BMP7. Stem Cells 2017;35:2115-2128. © 2017 The Authors Stem Cells published by Wiley Periodicals, Inc. on behalf of AlphaMed Press.

  2. Bosniak classification system

    DEFF Research Database (Denmark)

    Graumann, Ole; Osther, Susanne Sloth; Karstoft, Jens

    2016-01-01

    BACKGROUND: The Bosniak classification was originally based on computed tomographic (CT) findings. Magnetic resonance (MR) and contrast-enhanced ultrasonography (CEUS) imaging may demonstrate findings that are not depicted at CT, and there may not always be a clear correlation between the findings...... at MR and CEUS imaging and those at CT. PURPOSE: To compare diagnostic accuracy of MR, CEUS, and CT when categorizing complex renal cystic masses according to the Bosniak classification. MATERIAL AND METHODS: From February 2011 to June 2012, 46 complex renal cysts were prospectively evaluated by three...... readers. Each mass was categorized according to the Bosniak classification and CT was chosen as gold standard. Kappa was calculated for diagnostic accuracy and data was compared with pathological results. RESULTS: CT images found 27 BII, six BIIF, seven BIII, and six BIV. Forty-three cysts could...

  3. 46 CFR Sec. 18 - Group classification.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 8 2010-10-01 2010-10-01 false Group classification. Sec. 18 Section 18 Shipping... Sec. 18 Group classification. In the preparation of specifications, Job Orders, Supplemental Job... inserted thereon: Number Classification 41 Maintenance Repairs (deck, engine and stewards department...

  4. 22 CFR 9.6 - Derivative classification.

    Science.gov (United States)

    2010-04-01

    ... 22 Foreign Relations 1 2010-04-01 2010-04-01 false Derivative classification. 9.6 Section 9.6 Foreign Relations DEPARTMENT OF STATE GENERAL SECURITY INFORMATION REGULATIONS § 9.6 Derivative classification. (a) Definition. Derivative classification is the incorporating, paraphrasing, restating or...

  5. 22 CFR 9.4 - Original classification.

    Science.gov (United States)

    2010-04-01

    ... 22 Foreign Relations 1 2010-04-01 2010-04-01 false Original classification. 9.4 Section 9.4 Foreign Relations DEPARTMENT OF STATE GENERAL SECURITY INFORMATION REGULATIONS § 9.4 Original classification. (a) Definition. Original classification is the initial determination that certain information...

  6. 28 CFR 524.73 - Classification procedures.

    Science.gov (United States)

    2010-07-01

    ... 28 Judicial Administration 2 2010-07-01 2010-07-01 false Classification procedures. 524.73 Section 524.73 Judicial Administration BUREAU OF PRISONS, DEPARTMENT OF JUSTICE INMATE ADMISSION, CLASSIFICATION, AND TRANSFER CLASSIFICATION OF INMATES Central Inmate Monitoring (CIM) System § 524.73...

  7. Gender classification under extended operating conditions

    Science.gov (United States)

    Rude, Howard N.; Rizki, Mateen

    2014-06-01

    Gender classification is a critical component of a robust image security system. Many techniques exist to perform gender classification using facial features. In contrast, this paper explores gender classification using body features extracted from clothed subjects. Several of the most effective types of features for gender classification identified in literature were implemented and applied to the newly developed Seasonal Weather And Gender (SWAG) dataset. SWAG contains video clips of approximately 2000 samples of human subjects captured over a period of several months. The subjects are wearing casual business attire and outer garments appropriate for the specific weather conditions observed in the Midwest. The results from a series of experiments are presented that compare the classification accuracy of systems that incorporate various types and combinations of features applied to multiple looks at subjects at different image resolutions to determine a baseline performance for gender classification.

  8. Radon classification of building ground

    International Nuclear Information System (INIS)

    Slunga, E.

    1988-01-01

    The Laboratories of Building Technology and Soil Mechanics and Foundation Engineering at the Helsinki University of Technology in cooperation with The Ministry of the Environment have proposed a radon classification for building ground. The proposed classification is based on the radon concentration in soil pores and on the permeability of the foundation soil. The classification includes four radon classes: negligible, normal, high and very high. Depending on the radon class the radon-technical solution for structures is chosen. It is proposed that the classification be done in general terms in connection with the site investigations for the planning of land use and in more detail in connection with the site investigations for an individual house. (author)

  9. Formalization of the classification pattern: survey of classification modeling in information systems engineering.

    Science.gov (United States)

    Partridge, Chris; de Cesare, Sergio; Mitchell, Andrew; Odell, James

    2018-01-01

    Formalization is becoming more common in all stages of the development of information systems, as a better understanding of its benefits emerges. Classification systems are ubiquitous, no more so than in domain modeling. The classification pattern that underlies these systems provides a good case study of the move toward formalization in part because it illustrates some of the barriers to formalization, including the formal complexity of the pattern and the ontological issues surrounding the "one and the many." Powersets are a way of characterizing the (complex) formal structure of the classification pattern, and their formalization has been extensively studied in mathematics since Cantor's work in the late nineteenth century. One can use this formalization to develop a useful benchmark. There are various communities within information systems engineering (ISE) that are gradually working toward a formalization of the classification pattern. However, for most of these communities, this work is incomplete, in that they have not yet arrived at a solution with the expressiveness of the powerset benchmark. This contrasts with the early smooth adoption of powerset by other information systems communities to, for example, formalize relations. One way of understanding the varying rates of adoption is recognizing that the different communities have different historical baggage. Many conceptual modeling communities emerged from work done on database design, and this creates hurdles to the adoption of the high level of expressiveness of powersets. Another relevant factor is that these communities also often feel, particularly in the case of domain modeling, a responsibility to explain the semantics of whatever formal structures they adopt. This paper aims to make sense of the formalization of the classification pattern in ISE and surveys its history through the literature, starting from the relevant theoretical works of the mathematical literature and gradually shifting focus

  10. Ototoxicity (cochleotoxicity) classifications: A review.

    Science.gov (United States)

    Crundwell, Gemma; Gomersall, Phil; Baguley, David M

    2016-01-01

    Drug-mediated ototoxicity, specifically cochleotoxicity, is a concern for patients receiving medications for the treatment of serious illness. A number of classification schemes exist, most of which are based on pure-tone audiometry, in order to assist non-audiological/non-otological specialists in the identification and monitoring of iatrogenic hearing loss. This review identifies the primary classification systems used in cochleototoxicity monitoring. By bringing together classifications published in discipline-specific literature, the paper aims to increase awareness of their relative strengths and limitations in the assessment and monitoring of ototoxic hearing loss and to indicate how future classification systems may improve upon the status-quo. Literature review. PubMed identified 4878 articles containing the search term ototox*. A systematic search identified 13 key classification systems. Cochleotoxicity classification systems can be divided into those which focus on hearing change from a baseline audiogram and those that focus on the functional impact of the hearing loss. Common weaknesses of these grading scales included a lack of sensitivity to small adverse changes in hearing thresholds, a lack of high-frequency audiometry (>8 kHz), and lack of indication of which changes are likely to be clinically significant for communication and quality of life.

  11. 10 CFR 1045.37 - Classification guides.

    Science.gov (United States)

    2010-01-01

    ... 10 Energy 4 2010-01-01 2010-01-01 false Classification guides. 1045.37 Section 1045.37 Energy DEPARTMENT OF ENERGY (GENERAL PROVISIONS) NUCLEAR CLASSIFICATION AND DECLASSIFICATION Generation and Review of Documents Containing Restricted Data and Formerly Restricted Data § 1045.37 Classification guides...

  12. 75 FR 10529 - Mail Classification Change

    Science.gov (United States)

    2010-03-08

    ... POSTAL REGULATORY COMMISSION [Docket Nos. MC2010-19; Order No. 415] Mail Classification Change...-filed Postal Service request to make a minor modification to the Mail Classification Schedule. The.... concerning a change in classification which reflects a change in terminology from Bulk Mailing Center (BMC...

  13. 32 CFR 1602.13 - Judgmental Classification.

    Science.gov (United States)

    2010-07-01

    ... 32 National Defense 6 2010-07-01 2010-07-01 false Judgmental Classification. 1602.13 Section 1602.13 National Defense Other Regulations Relating to National Defense SELECTIVE SERVICE SYSTEM DEFINITIONS § 1602.13 Judgmental Classification. A classification action relating to a registrant's claim for...

  14. 6 CFR 7.26 - Derivative classification.

    Science.gov (United States)

    2010-01-01

    ... 6 Domestic Security 1 2010-01-01 2010-01-01 false Derivative classification. 7.26 Section 7.26 Domestic Security DEPARTMENT OF HOMELAND SECURITY, OFFICE OF THE SECRETARY CLASSIFIED NATIONAL SECURITY INFORMATION Classified Information § 7.26 Derivative classification. (a) Derivative classification is defined...

  15. Hydropedological insights when considering catchment classification

    NARCIS (Netherlands)

    Bouma, J.; Droogers, P.; Sonneveld, M.P.W.; Ritsema, C.J.; Hunink, J.E.; Immerzeel, W.W.; Kauffman, S.

    2011-01-01

    Soil classification systems are analysed to explore the potential of developing classification systems for catchments. Soil classifications are useful to create systematic order in the overwhelming quantity of different soils in the world and to extrapolate data available for a given soil type to

  16. Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.

    Directory of Open Access Journals (Sweden)

    Fábio R de Moraes

    Full Text Available Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR from free surface residues (FSR. We formulated a linear discriminative analysis (LDA classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/ are suitable for such a task. Receiver operating characteristic (ROC analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study

  17. Improving predictions of protein-protein interfaces by combining amino acid-specific classifiers based on structural and physicochemical descriptors with their weighted neighbor averages.

    Science.gov (United States)

    de Moraes, Fábio R; Neshich, Izabella A P; Mazoni, Ivan; Yano, Inácio H; Pereira, José G C; Salim, José A; Jardine, José G; Neshich, Goran

    2014-01-01

    Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now

  18. Improving Predictions of Protein-Protein Interfaces by Combining Amino Acid-Specific Classifiers Based on Structural and Physicochemical Descriptors with Their Weighted Neighbor Averages

    Science.gov (United States)

    de Moraes, Fábio R.; Neshich, Izabella A. P.; Mazoni, Ivan; Yano, Inácio H.; Pereira, José G. C.; Salim, José A.; Jardine, José G.; Neshich, Goran

    2014-01-01

    Protein-protein interactions are involved in nearly all regulatory processes in the cell and are considered one of the most important issues in molecular biology and pharmaceutical sciences but are still not fully understood. Structural and computational biology contributed greatly to the elucidation of the mechanism of protein interactions. In this paper, we present a collection of the physicochemical and structural characteristics that distinguish interface-forming residues (IFR) from free surface residues (FSR). We formulated a linear discriminative analysis (LDA) classifier to assess whether chosen descriptors from the BlueStar STING database (http://www.cbi.cnptia.embrapa.br/SMS/) are suitable for such a task. Receiver operating characteristic (ROC) analysis indicates that the particular physicochemical and structural descriptors used for building the linear classifier perform much better than a random classifier and in fact, successfully outperform some of the previously published procedures, whose performance indicators were recently compared by other research groups. The results presented here show that the selected set of descriptors can be utilized to predict IFRs, even when homologue proteins are missing (particularly important for orphan proteins where no homologue is available for comparative analysis/indication) or, when certain conformational changes accompany interface formation. The development of amino acid type specific classifiers is shown to increase IFR classification performance. Also, we found that the addition of an amino acid conservation attribute did not improve the classification prediction. This result indicates that the increase in predictive power associated with amino acid conservation is exhausted by adequate use of an extensive list of independent physicochemical and structural parameters that, by themselves, fully describe the nano-environment at protein-protein interfaces. The IFR classifier developed in this study is now

  19. 14 CFR 1203.407 - Duration of classification.

    Science.gov (United States)

    2010-01-01

    ... 14 Aeronautics and Space 5 2010-01-01 2010-01-01 false Duration of classification. 1203.407... PROGRAM Guides for Original Classification § 1203.407 Duration of classification. (a) Information shall be... date or event for declassification shall be set by the original classification authority at the time...

  20. HIV classification using coalescent theory

    Energy Technology Data Exchange (ETDEWEB)

    Zhang, Ming [Los Alamos National Laboratory; Letiner, Thomas K [Los Alamos National Laboratory; Korber, Bette T [Los Alamos National Laboratory

    2008-01-01

    Algorithms for subtype classification and breakpoint detection of HIV-I sequences are based on a classification system of HIV-l. Hence, their quality highly depend on this system. Due to the history of creation of the current HIV-I nomenclature, the current one contains inconsistencies like: The phylogenetic distance between the subtype B and D is remarkably small compared with other pairs of subtypes. In fact, it is more like the distance of a pair of subsubtypes Robertson et al. (2000); Subtypes E and I do not exist any more since they were discovered to be composed of recombinants Robertson et al. (2000); It is currently discussed whether -- instead of CRF02 being a recombinant of subtype A and G -- subtype G should be designated as a circulating recombination form (CRF) nd CRF02 as a subtype Abecasis et al. (2007); There are 8 complete and over 400 partial HIV genomes in the LANL-database which belong neither to a subtype nor to a CRF (denoted by U). Moreover, the current classification system is somehow arbitrary like all complex classification systems that were created manually. To this end, it is desirable to deduce the classification system of HIV systematically by an algorithm. Of course, this problem is not restricted to HIV, but applies to all fast mutating and recombining viruses. Our work addresses the simpler subproblem to score classifications of given input sequences of some virus species (classification denotes a partition of the input sequences in several subtypes and CRFs). To this end, we reconstruct ancestral recombination graphs (ARG) of the input sequences under restrictions determined by the given classification. These restritions are imposed in order to ensure that the reconstructed ARGs do not contradict the classification under consideration. Then, we find the ARG with maximal probability by means of Markov Chain Monte Carlo methods. The probability of the most probable ARG is interpreted as a score for the classification. To our

  1. A practicable approach for periodontal classification

    Science.gov (United States)

    Mittal, Vishnu; Bhullar, Raman Preet K.; Bansal, Rachita; Singh, Karanprakash; Bhalodi, Anand; Khinda, Paramjit K.

    2013-01-01

    The Diagnosis and classification of periodontal diseases has remained a dilemma since long. Two distinct concepts have been used to define diseases: Essentialism and Nominalism. Essentialistic concept implies the real existence of disease whereas; nominalistic concept states that the names of diseases are the convenient way of stating concisely the endpoint of a diagnostic process. It generally advances from assessment of symptoms and signs toward knowledge of causation and gives a feasible option to name the disease for which etiology is either unknown or it is too complex to access in routine clinical practice. Various classifications have been proposed by the American Academy of Periodontology (AAP) in 1986, 1989 and 1999. The AAP 1999 classification is among the most widely used classification. But this classification also has demerits which provide impediment for its use in day to day practice. Hence a classification and diagnostic system is required which can help the clinician to access the patient's need and provide a suitable treatment which is in harmony with the diagnosis for that particular case. Here is an attempt to propose a practicable classification and diagnostic system of periodontal diseases for better treatment outcome. PMID:24379855

  2. Lossless Compression of Classification-Map Data

    Science.gov (United States)

    Hua, Xie; Klimesh, Matthew

    2009-01-01

    A lossless image-data-compression algorithm intended specifically for application to classification-map data is based on prediction, context modeling, and entropy coding. The algorithm was formulated, in consideration of the differences between classification maps and ordinary images of natural scenes, so as to be capable of compressing classification- map data more effectively than do general-purpose image-data-compression algorithms. Classification maps are typically generated from remote-sensing images acquired by instruments aboard aircraft (see figure) and spacecraft. A classification map is a synthetic image that summarizes information derived from one or more original remote-sensing image(s) of a scene. The value assigned to each pixel in such a map is the index of a class that represents some type of content deduced from the original image data for example, a type of vegetation, a mineral, or a body of water at the corresponding location in the scene. When classification maps are generated onboard the aircraft or spacecraft, it is desirable to compress the classification-map data in order to reduce the volume of data that must be transmitted to a ground station.

  3. A Hybrid Dry and Aqueous Fractionation Method to Obtain Protein-Rich Fractions from Quinoa (Chenopodium quinoa Willd)

    NARCIS (Netherlands)

    Avila Ruiz, Geraldine; Arts, Anke; Minor, Marcel; Schutyser, Maarten

    2016-01-01

    Combination of dry and aqueous fractionation is investigated to obtain protein-rich fractions from quinoa in a milder and more sustainable way compared to conventional wet fractionation. Dry fractionation of quinoa involved milling and subsequent air classification, generating a protein-enriched

  4. 6 CFR 7.30 - Classification challenges.

    Science.gov (United States)

    2010-01-01

    ... 6 Domestic Security 1 2010-01-01 2010-01-01 false Classification challenges. 7.30 Section 7.30... INFORMATION Classified Information § 7.30 Classification challenges. (a) Authorized holders of information... classified are encouraged and expected to challenge the classification status of that information pursuant to...

  5. 7 CFR 51.1904 - Maturity classification.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Maturity classification. 51.1904 Section 51.1904... STANDARDS) United States Consumer Standards for Fresh Tomatoes Size and Maturity Classification § 51.1904 Maturity classification. Tomatoes which are characteristically red when ripe, but are not overripe or soft...

  6. Classification of radiological procedures

    International Nuclear Information System (INIS)

    1989-01-01

    A classification for departments in Danish hospitals which use radiological procedures. The classification codes consist of 4 digits, where the first 2 are the codes for the main groups. The first digit represents the procedure's topographical object and the second the techniques. The last 2 digits describe individual procedures. (CLS)

  7. Expected Classification Accuracy

    Directory of Open Access Journals (Sweden)

    Lawrence M. Rudner

    2005-08-01

    Full Text Available Every time we make a classification based on a test score, we should expect some number..of misclassifications. Some examinees whose true ability is within a score range will have..observed scores outside of that range. A procedure for providing a classification table of..true and expected scores is developed for polytomously scored items under item response..theory and applied to state assessment data. A simplified procedure for estimating the..table entries is also presented.

  8. 12 CFR 560.160 - Asset classification.

    Science.gov (United States)

    2010-01-01

    ... 12 Banks and Banking 5 2010-01-01 2010-01-01 false Asset classification. 560.160 Section 560.160... Lending and Investment Provisions Applicable to all Savings Associations § 560.160 Asset classification... consistent with, or reconcilable to, the asset classification system used by OTS in its Thrift Activities...

  9. 33 CFR 154.1216 - Facility classification.

    Science.gov (United States)

    2010-07-01

    ... 33 Navigation and Navigable Waters 2 2010-07-01 2010-07-01 false Facility classification. 154.1216... Vegetable Oils Facilities § 154.1216 Facility classification. (a) The Coast Guard classifies facilities that... classification of a facility that handles, stores, or transports animal fats or vegetable oils. The COTP may...

  10. 76 FR 47614 - Mail Classification Change

    Science.gov (United States)

    2011-08-05

    ... POSTAL REGULATORY COMMISSION [Docket No. MC2011-27; Order No. 785] Mail Classification Change...-filed Postal Service request for a change in classification to the ``Reply Rides Free'' program. The... Service filed a notice of classification change pursuant to 39 CFR 3020.90 and 3020.91 concerning the...

  11. JNK Signaling: Regulation and Functions Based on Complex Protein-Protein Partnerships

    Science.gov (United States)

    Zeke, András; Misheva, Mariya

    2016-01-01

    SUMMARY The c-Jun N-terminal kinases (JNKs), as members of the mitogen-activated protein kinase (MAPK) family, mediate eukaryotic cell responses to a wide range of abiotic and biotic stress insults. JNKs also regulate important physiological processes, including neuronal functions, immunological actions, and embryonic development, via their impact on gene expression, cytoskeletal protein dynamics, and cell death/survival pathways. Although the JNK pathway has been under study for >20 years, its complexity is still perplexing, with multiple protein partners of JNKs underlying the diversity of actions. Here we review the current knowledge of JNK structure and isoforms as well as the partnerships of JNKs with a range of intracellular proteins. Many of these proteins are direct substrates of the JNKs. We analyzed almost 100 of these target proteins in detail within a framework of their classification based on their regulation by JNKs. Examples of these JNK substrates include a diverse assortment of nuclear transcription factors (Jun, ATF2, Myc, Elk1), cytoplasmic proteins involved in cytoskeleton regulation (DCX, Tau, WDR62) or vesicular transport (JIP1, JIP3), cell membrane receptors (BMPR2), and mitochondrial proteins (Mcl1, Bim). In addition, because upstream signaling components impact JNK activity, we critically assessed the involvement of signaling scaffolds and the roles of feedback mechanisms in the JNK pathway. Despite a clarification of many regulatory events in JNK-dependent signaling during the past decade, many other structural and mechanistic insights are just beginning to be revealed. These advances open new opportunities to understand the role of JNK signaling in diverse physiological and pathophysiological states. PMID:27466283

  12. Overweight and Obesity Prevalence Among School-Aged Nunavik Inuit Children According to Three Body Mass Index Classification Systems.

    Science.gov (United States)

    Medehouenou, Thierry Comlan Marc; Ayotte, Pierre; St-Jean, Audray; Meziou, Salma; Roy, Cynthia; Muckle, Gina; Lucas, Michel

    2015-07-01

    Little is known about the suitability of three commonly used body mass index (BMI) classification system for Indigenous children. This study aims to estimate overweight and obesity prevalence among school-aged Nunavik Inuit children according to International Obesity Task Force (IOTF), Centers for Disease Control and Prevention (CDC), and World Health Organization (WHO) BMI classification systems, to measure agreement between those classification systems, and to investigate whether BMI status as defined by these classification systems is associated with levels of metabolic and inflammatory biomarkers. Data were collected on 290 school-aged children (aged 8-14 years; 50.7% girls) from the Nunavik Child Development Study with data collected in 2005-2010. Anthropometric parameters were measured and blood sampled. Participants were classified as normal weight, overweight, and obese according to BMI classification systems. Weighted kappa (κw) statistics assessed agreement between different BMI classification systems, and multivariate analysis of variance ascertained their relationship with metabolic and inflammatory biomarkers. The combined prevalence rate of overweight/obesity was 26.9% (with 6.6% obesity) with IOTF, 24.1% (11.0%) with CDC, and 40.4% (12.8%) with WHO classification systems. Agreement was the highest between IOTF and CDC (κw = .87) classifications, and substantial for IOTF and WHO (κw = .69) and for CDC and WHO (κw = .73). Insulin and high-sensitivity C-reactive protein plasma levels were significantly higher from normal weight to obesity, regardless of classification system. Among obese subjects, higher insulin level was observed with IOTF. Compared with other systems, IOTF classification appears to be more specific to identify overweight and obesity in Inuit children. Copyright © 2015 Society for Adolescent Health and Medicine. Published by Elsevier Inc. All rights reserved.

  13. A pentatonic classification of extreme events

    International Nuclear Information System (INIS)

    Eliazar, Iddo; Cohen, Morrel H.

    2015-01-01

    In this paper we present a classification of the extreme events – very small and very large outcomes – of positive-valued random variables. The classification distinguishes five different categories of randomness, ranging from the very ‘mild’ to the very ‘wild’. In analogy with the common five-tone musical scale we term the classification ‘pentatonic’. The classification is based on the analysis of the inherent Gibbsian ‘forces’ and ‘temperatures’ existing on the logarithmic scale of the random variables under consideration, and provides a statistical-physics insight regarding the nature of these random variables. The practical application of the pentatonic classification is remarkably straightforward, it can be performed by non-experts, and it is demonstrated via an array of examples

  14. Sow-activity classification from acceleration patterns

    DEFF Research Database (Denmark)

    Escalante, Hugo Jair; Rodriguez, Sara V.; Cordero, Jorge

    2013-01-01

    sow-activity classification can be approached with standard machine learning methods for pattern classification. Individual predictions for elements of times series of arbitrary length are combined to classify it as a whole. An extensive comparison of representative learning algorithms, including......This paper describes a supervised learning approach to sow-activity classification from accelerometer measurements. In the proposed methodology, pairs of accelerometer measurements and activity types are considered as labeled instances of a usual supervised classification task. Under this scenario...... neural networks, support vector machines, and ensemble methods, is presented. Experimental results are reported using a data set for sow-activity classification collected in a real production herd. The data set, which has been widely used in related works, includes measurements from active (Feeding...

  15. Systematic Characterisation of Cellular Localisation and Expression Profiles of Proteins Containing MHC Ligands

    DEFF Research Database (Denmark)

    Juncker, Agnieszka; Larsen, Mette Voldby; Weinhold, Nils

    2009-01-01

    Background: Presentation of peptides on Major Histocompatibility Complex (MHC) molecules is the cornerstone in immune system activation and increased knowledge of the characteristics of MHC ligands and their source proteins is highly desirable. Methodology/Principal Finding: In the present large......-scale study, we used a large data set of proteins containing experimentally identified MHC class I or II ligands and examined the proteins according to their expression profiles at the mRNA level and their Gene Ontology (GO) classification within the cellular component ontology. Proteins encoded by highly...

  16. Active Learning for Text Classification

    OpenAIRE

    Hu, Rong

    2011-01-01

    Text classification approaches are used extensively to solve real-world challenges. The success or failure of text classification systems hangs on the datasets used to train them, without a good dataset it is impossible to build a quality system. This thesis examines the applicability of active learning in text classification for the rapid and economical creation of labelled training data. Four main contributions are made in this thesis. First, we present two novel selection strategies to cho...

  17. Unsupervised Classification Using Immune Algorithm

    OpenAIRE

    Al-Muallim, M. T.; El-Kouatly, R.

    2012-01-01

    Unsupervised classification algorithm based on clonal selection principle named Unsupervised Clonal Selection Classification (UCSC) is proposed in this paper. The new proposed algorithm is data driven and self-adaptive, it adjusts its parameters to the data to make the classification operation as fast as possible. The performance of UCSC is evaluated by comparing it with the well known K-means algorithm using several artificial and real-life data sets. The experiments show that the proposed U...

  18. 7 CFR 51.1903 - Size classification.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Size classification. 51.1903 Section 51.1903... STANDARDS) United States Consumer Standards for Fresh Tomatoes Size and Maturity Classification § 51.1903 Size classification. The following terms may be used for describing the size of the tomatoes in any lot...

  19. Analysis of hepatocellular carcinoma and metastatic hepatic carcinoma via functional modules in a protein-protein interaction network

    Directory of Open Access Journals (Sweden)

    Jun Pan

    2014-01-01

    Full Text Available Introduction: This study aims to identify protein clusters with potential functional relevance in the pathogenesis of hepatocellular carcinoma (HCC and metastatic hepatic carcinoma using network analysis. Materials and Methods: We used human protein interaction data to build a protein-protein interaction network with Cytoscape and then derived functional clusters using MCODE. Combining the gene expression profiles, we calculated the functional scores for the clusters and selected statistically significant clusters. Meanwhile, Gene Ontology was used to assess the functionality of these clusters. Finally, a support vector machine was trained on the gold standard data sets. Results: The differentially expressed genes of HCC were mainly involved in metabolic and signaling processes. We acquired 13 significant modules from the gene expression profiles. The area under the curve value based on the differentially expressed modules were 98.31%, which outweighed the classification with DEGs. Conclusions: Differentially expressed modules are valuable to screen biomarkers combined with functional modules.

  20. Boosted classification trees result in minor to modest improvement in the accuracy in classifying cardiovascular outcomes compared to conventional classification trees

    Science.gov (United States)

    Austin, Peter C; Lee, Douglas S

    2011-01-01

    Purpose: Classification trees are increasingly being used to classifying patients according to the presence or absence of a disease or health outcome. A limitation of classification trees is their limited predictive accuracy. In the data-mining and machine learning literature, boosting has been developed to improve classification. Boosting with classification trees iteratively grows classification trees in a sequence of reweighted datasets. In a given iteration, subjects that were misclassified in the previous iteration are weighted more highly than subjects that were correctly classified. Classifications from each of the classification trees in the sequence are combined through a weighted majority vote to produce a final classification. The authors' objective was to examine whether boosting improved the accuracy of classification trees for predicting outcomes in cardiovascular patients. Methods: We examined the utility of boosting classification trees for classifying 30-day mortality outcomes in patients hospitalized with either acute myocardial infarction or congestive heart failure. Results: Improvements in the misclassification rate using boosted classification trees were at best minor compared to when conventional classification trees were used. Minor to modest improvements to sensitivity were observed, with only a negligible reduction in specificity. For predicting cardiovascular mortality, boosted classification trees had high specificity, but low sensitivity. Conclusions: Gains in predictive accuracy for predicting cardiovascular outcomes were less impressive than gains in performance observed in the data mining literature. PMID:22254181

  1. GRIP: A web-based system for constructing Gold Standard datasets for protein-protein interaction prediction

    Directory of Open Access Journals (Sweden)

    Zheng Huiru

    2009-01-01

    Full Text Available Abstract Background Information about protein interaction networks is fundamental to understanding protein function and cellular processes. Interaction patterns among proteins can suggest new drug targets and aid in the design of new therapeutic interventions. Efforts have been made to map interactions on a proteomic-wide scale using both experimental and computational techniques. Reference datasets that contain known interacting proteins (positive cases and non-interacting proteins (negative cases are essential to support computational prediction and validation of protein-protein interactions. Information on known interacting and non interacting proteins are usually stored within databases. Extraction of these data can be both complex and time consuming. Although, the automatic construction of reference datasets for classification is a useful resource for researchers no public resource currently exists to perform this task. Results GRIP (Gold Reference dataset constructor from Information on Protein complexes is a web-based system that provides researchers with the functionality to create reference datasets for protein-protein interaction prediction in Saccharomyces cerevisiae. Both positive and negative cases for a reference dataset can be extracted, organised and downloaded by the user. GRIP also provides an upload facility whereby users can submit proteins to determine protein complex membership. A search facility is provided where a user can search for protein complex information in Saccharomyces cerevisiae. Conclusion GRIP is developed to retrieve information on protein complex, cellular localisation, and physical and genetic interactions in Saccharomyces cerevisiae. Manual construction of reference datasets can be a time consuming process requiring programming knowledge. GRIP simplifies and speeds up this process by allowing users to automatically construct reference datasets. GRIP is free to access at http://rosalind.infj.ulst.ac.uk/GRIP/.

  2. Morphological classification of plant cell deaths

    DEFF Research Database (Denmark)

    van Doorn, W.G.; Beers, E.P.; Dangl, J.L.

    2011-01-01

    , which can express features of both necrosis and vacuolar cell death, PCD in starchy cereal endosperm and during self-incompatibility. The present classification is not static, but will be subject to further revision, especially when specific biochemical pathways are better defined....... the classification of PCD in plants. Here we suggest a classification based on morphological criteria. According to this classification, the use of the term 'apoptosis' is not justified in plants, but at least two classes of PCD can be distinguished: vacuolar cell death and necrosis. During vacuolar cell death...

  3. 22 CFR 9a.4 - Classification.

    Science.gov (United States)

    2010-04-01

    ... 22 Foreign Relations 1 2010-04-01 2010-04-01 false Classification. 9a.4 Section 9a.4 Foreign... ENERGY PROGRAMS; RELATED MATERIAL § 9a.4 Classification. (a) Section 1 of E.O. 11932, August 4, 1976.... If the officer determines that the information or material warrants classification, he shall assign...

  4. 7 CFR 51.1402 - Size classification.

    Science.gov (United States)

    2010-01-01

    ... 7 Agriculture 2 2010-01-01 2010-01-01 false Size classification. 51.1402 Section 51.1402... STANDARDS) United States Standards for Grades of Pecans in the Shell 1 Size Classification § 51.1402 Size classification. Size of pecans may be specified in connection with the grade in accordance with one of the...

  5. Classification of high resolution satellite images

    OpenAIRE

    Karlsson, Anders

    2003-01-01

    In this thesis the Support Vector Machine (SVM)is applied on classification of high resolution satellite images. Sveral different measures for classification, including texture mesasures, 1st order statistics, and simple contextual information were evaluated. Additionnally, the image was segmented, using an enhanced watershed method, in order to improve the classification accuracy.

  6. An assessment of machine and statistical learning approaches to inferring networks of protein-protein interactions

    Directory of Open Access Journals (Sweden)

    Browne Fiona

    2006-12-01

    Full Text Available Protein-protein interactions (PPI play a key role in many biological systems. Over the past few years, an explosion in availability of functional biological data obtained from high-throughput technologies to infer PPI has been observed. However, results obtained from such experiments show high rates of false positives and false negatives predictions as well as systematic predictive bias. Recent research has revealed that several machine and statistical learning methods applied to integrate relatively weak, diverse sources of large-scale functional data may provide improved predictive accuracy and coverage of PPI. In this paper we describe the effects of applying different computational, integrative methods to predict PPI in Saccharomyces cerevisiae. We investigated the predictive ability of combining different sets of relatively strong and weak predictive datasets. We analysed several genomic datasets ranging from mRNA co-expression to marginal essentiality. Moreover, we expanded an existing multi-source dataset from S. cerevisiae by constructing a new set of putative interactions extracted from Gene Ontology (GO- driven annotations in the Saccharomyces Genome Database. Different classification techniques: Simple Naive Bayesian (SNB, Multilayer Perceptron (MLP and K-Nearest Neighbors (KNN were evaluated. Relatively simple classification methods (i.e. less computing intensive and mathematically complex, such as SNB, have been proven to be proficient at predicting PPI. SNB produced the “highest” predictive quality obtaining an area under Receiver Operating Characteristic (ROC curve (AUC value of 0.99. The lowest AUC value of 0.90 was obtained by the KNN classifier. This assessment also demonstrates the strong predictive power of GO-driven models, which offered predictive performance above 0.90 using the different machine learning and statistical techniques. As the predictive power of single-source datasets became weaker MLP and SNB performed

  7. Phylogenetic classification of bony fishes.

    Science.gov (United States)

    Betancur-R, Ricardo; Wiley, Edward O; Arratia, Gloria; Acero, Arturo; Bailly, Nicolas; Miya, Masaki; Lecointre, Guillaume; Ortí, Guillermo

    2017-07-06

    Fish classifications, as those of most other taxonomic groups, are being transformed drastically as new molecular phylogenies provide support for natural groups that were unanticipated by previous studies. A brief review of the main criteria used by ichthyologists to define their classifications during the last 50 years, however, reveals slow progress towards using an explicit phylogenetic framework. Instead, the trend has been to rely, in varying degrees, on deep-rooted anatomical concepts and authority, often mixing taxa with explicit phylogenetic support with arbitrary groupings. Two leading sources in ichthyology frequently used for fish classifications (JS Nelson's volumes of Fishes of the World and W. Eschmeyer's Catalog of Fishes) fail to adopt a global phylogenetic framework despite much recent progress made towards the resolution of the fish Tree of Life. The first explicit phylogenetic classification of bony fishes was published in 2013, based on a comprehensive molecular phylogeny ( www.deepfin.org ). We here update the first version of that classification by incorporating the most recent phylogenetic results. The updated classification presented here is based on phylogenies inferred using molecular and genomic data for nearly 2000 fishes. A total of 72 orders (and 79 suborders) are recognized in this version, compared with 66 orders in version 1. The phylogeny resolves placement of 410 families, or ~80% of the total of 514 families of bony fishes currently recognized. The ordinal status of 30 percomorph families included in this study, however, remains uncertain (incertae sedis in the series Carangaria, Ovalentaria, or Eupercaria). Comments to support taxonomic decisions and comparisons with conflicting taxonomic groups proposed by others are presented. We also highlight cases were morphological support exist for the groups being classified. This version of the phylogenetic classification of bony fishes is substantially improved, providing resolution

  8. Functional classifications for cerebral palsy: correlations between the gross motor function classification system (GMFCS), the manual ability classification system (MACS) and the communication function classification system (CFCS).

    Science.gov (United States)

    Compagnone, Eliana; Maniglio, Jlenia; Camposeo, Serena; Vespino, Teresa; Losito, Luciana; De Rinaldis, Marta; Gennaro, Leonarda; Trabacca, Antonio

    2014-11-01

    This study aimed to investigate a possible correlation between the gross motor function classification system-expanded and revised (GMFCS-E&R), the manual abilities classification system (MACS) and the communication function classification system (CFCS) functional levels in children with cerebral palsy (CP) by CP subtype. It was also geared to verify whether there is a correlation between these classification systems and intellectual functioning (IF) and parental socio-economic status (SES). A total of 87 children (47 males and 40 females, age range 4-18 years, mean age 8.9±4.2) were included in the study. A strong correlation was found between the three classifications: Level V of the GMFCS-E&R corresponds to Level V of the MACS (rs=0.67, p=0.001); the same relationship was found for the CFCS and the MACS (rs=0.73, p<0.001) and for the GMFCS-E&R and the CFCS (rs=0.61, p=0.001). The correlations between the IQ and the global functional disability profile were strong or moderate (GMFCS and IQ: rs=0.66, p=0.001; MACS and IQ: rs=0.58, p=0.001; CFCS and MACS: rs=0.65, p=0.001). The Kruskal-Wallis test was used to determine if there were differences between the GMFCS-E&R, the CFCS and the MACS by CP type. CP types showed different scores for the IQ level (Chi-square=8.59, df=2, p=0.014), the GMFCS-E&R (Chi-square=36.46, df=2, p<0.001), the CFCS (Chi-square=12.87, df=2, p=0.002), and the MACS Level (Chi-square=13.96, df=2, p<0.001) but no significant differences emerged for the SES (Chi-square=1.19, df=2, p=0.554). This study shows how the three functional classifications (GMFCS-E&R, CFCS and MACS) complement each other to provide a better description of the functional profile of CP. The systematic evaluation of the IQ can provide useful information about a possible future outcome for every functional level. The SES does not appear to affect functional profiles. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Clustering and classification of email contents

    Directory of Open Access Journals (Sweden)

    Izzat Alsmadi

    2015-01-01

    Full Text Available Information users depend heavily on emails’ system as one of the major sources of communication. Its importance and usage are continuously growing despite the evolution of mobile applications, social networks, etc. Emails are used on both the personal and professional levels. They can be considered as official documents in communication among users. Emails’ data mining and analysis can be conducted for several purposes such as: Spam detection and classification, subject classification, etc. In this paper, a large set of personal emails is used for the purpose of folder and subject classifications. Algorithms are developed to perform clustering and classification for this large text collection. Classification based on NGram is shown to be the best for such large text collection especially as text is Bi-language (i.e. with English and Arabic content.

  10. Classification of Building Object Types

    DEFF Research Database (Denmark)

    Jørgensen, Kaj Asbjørn

    2011-01-01

    made. This is certainly the case in the Danish development. Based on the theories about these abstraction mechanisms, the basic principles for classification systems are presented and the observed misconceptions are analyses and explained. Furthermore, it is argued that the purpose of classification...... systems has changed and that new opportunities should be explored. Some proposals for new applications are presented and carefully aligned with IT opportunities. Especially, the use of building modelling will give new benefits and many of the traditional uses of classification systems will instead...... be managed by software applications and on the basis of building models. Classification systems with taxonomies of building object types have many application opportunities but can still be beneficial in data exchange between building construction partners. However, this will be performed by new methods...

  11. Using a multidisciplinary classification in nursing : The international classification of functioning disability and health

    NARCIS (Netherlands)

    Van Achterberg, T; Holleman, G; Heijnen-Kaales, Y; Van der Brug, Y; Roodbol, G; Stallinga, HA; Hellema, F; Frederiks, CMA

    This paper reports a study to explore systematically the usefulness of the International Classification of Functioning, Disability and Health to nurses giving patient care. The International Classification of Functioning, Disability and Health has a history of more than 20 years. Although this World

  12. Using a multidisciplinary classification in nursing: the International Classification of Functioning Disability and Health

    NARCIS (Netherlands)

    van Achterberg, Theo; Holleman, Gerda; Heijnen-Kaales, Yvonne; van der Brug, Ype; Roodbol, Gabriël; Stallinga, Hillegonda A.; Hellema, Fokje; Frederiks, Carla M. A.

    2005-01-01

    This paper reports a study to explore systematically the usefulness of the International Classification of Functioning, Disability and Health to nurses giving patient care. The International Classification of Functioning, Disability and Health has a history of more than 20 years. Although this World

  13. 46 CFR 76.50-5 - Classification.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 3 2010-10-01 2010-10-01 false Classification. 76.50-5 Section 76.50-5 Shipping COAST... Classification. (a) Hand portable fire extinguishers and semiportable fire extinguishing systems shall be... extinguishing systems are set forth in table 76.50-5(c). Table 76.50-5(c) Classification Type Size Soda acid and...

  14. 46 CFR 193.50-5 - Classification.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 7 2010-10-01 2010-10-01 false Classification. 193.50-5 Section 193.50-5 Shipping COAST... Details § 193.50-5 Classification. (a) Hand portable fire extinguishers and semiportable fire...) Classification Type Size Soda-acid and water, gals. Foam, gals. Carbon dioxide, lbs. Dry chemical, lbs. A II 21/2...

  15. The Classification of Romanian High-Schools

    Science.gov (United States)

    Ivan, Ion; Milodin, Daniel; Naie, Lucian

    2006-01-01

    The article tries to tackle the issue of high-schools classification from one city, district or from Romania. The classification criteria are presented. The National Database of Education is also presented and the application of criteria is illustrated. An algorithm for high-school multi-rang classification is proposed in order to build classes of…

  16. Recursive automatic classification algorithms

    Energy Technology Data Exchange (ETDEWEB)

    Bauman, E V; Dorofeyuk, A A

    1982-03-01

    A variational statement of the automatic classification problem is given. The dependence of the form of the optimal partition surface on the form of the classification objective functional is investigated. A recursive algorithm is proposed for maximising a functional of reasonably general form. The convergence problem is analysed in connection with the proposed algorithm. 8 references.

  17. Spectroscopic classification of transients

    DEFF Research Database (Denmark)

    Stritzinger, M. D.; Fraser, M.; Hummelmose, N. N.

    2017-01-01

    We report the spectroscopic classification of several transients based on observations taken with the Nordic Optical Telescope (NOT) equipped with ALFOSC, over the nights 23-25 August 2017.......We report the spectroscopic classification of several transients based on observations taken with the Nordic Optical Telescope (NOT) equipped with ALFOSC, over the nights 23-25 August 2017....

  18. Cellular image classification

    CERN Document Server

    Xu, Xiang; Lin, Feng

    2017-01-01

    This book introduces new techniques for cellular image feature extraction, pattern recognition and classification. The authors use the antinuclear antibodies (ANAs) in patient serum as the subjects and the Indirect Immunofluorescence (IIF) technique as the imaging protocol to illustrate the applications of the described methods. Throughout the book, the authors provide evaluations for the proposed methods on two publicly available human epithelial (HEp-2) cell datasets: ICPR2012 dataset from the ICPR'12 HEp-2 cell classification contest and ICIP2013 training dataset from the ICIP'13 Competition on cells classification by fluorescent image analysis. First, the reading of imaging results is significantly influenced by one’s qualification and reading systems, causing high intra- and inter-laboratory variance. The authors present a low-order LP21 fiber mode for optical single cell manipulation and imaging staining patterns of HEp-2 cells. A focused four-lobed mode distribution is stable and effective in optical...

  19. [Landscape classification: research progress and development trend].

    Science.gov (United States)

    Liang, Fa-Chao; Liu, Li-Ming

    2011-06-01

    Landscape classification is the basis of the researches on landscape structure, process, and function, and also, the prerequisite for landscape evaluation, planning, protection, and management, directly affecting the precision and practicability of landscape research. This paper reviewed the research progress on the landscape classification system, theory, and methodology, and summarized the key problems and deficiencies of current researches. Some major landscape classification systems, e. g. , LANMAP and MUFIC, were introduced and discussed. It was suggested that a qualitative and quantitative comprehensive classification based on the ideology of functional structure shape and on the integral consideration of landscape classification utility, landscape function, landscape structure, physiogeographical factors, and human disturbance intensity should be the major research directions in the future. The integration of mapping, 3S technology, quantitative mathematics modeling, computer artificial intelligence, and professional knowledge to enhance the precision of landscape classification would be the key issues and the development trend in the researches of landscape classification.

  20. Classifications of Patterned Hair Loss: A Review.

    Science.gov (United States)

    Gupta, Mrinal; Mysore, Venkataram

    2016-01-01

    Patterned hair loss is the most common cause of hair loss seen in both the sexes after puberty. Numerous classification systems have been proposed by various researchers for grading purposes. These systems vary from the simpler systems based on recession of the hairline to the more advanced multifactorial systems based on the morphological and dynamic parameters that affect the scalp and the hair itself. Most of these preexisting systems have certain limitations. Currently, the Hamilton-Norwood classification system for males and the Ludwig system for females are most commonly used to describe patterns of hair loss. In this article, we review the various classification systems for patterned hair loss in both the sexes. Relevant articles were identified through searches of MEDLINE and EMBASE. Search terms included but were not limited to androgenic alopecia classification, patterned hair loss classification, male pattern baldness classification, and female pattern hair loss classification. Further publications were identified from the reference lists of the reviewed articles.

  1. Classifications of patterned hair loss: a review

    Directory of Open Access Journals (Sweden)

    Mrinal Gupta

    2016-01-01

    Full Text Available Patterned hair loss is the most common cause of hair loss seen in both the sexes after puberty. Numerous classification systems have been proposed by various researchers for grading purposes. These systems vary from the simpler systems based on recession of the hairline to the more advanced multifactorial systems based on the morphological and dynamic parameters that affect the scalp and the hair itself. Most of these preexisting systems have certain limitations. Currently, the Hamilton-Norwood classification system for males and the Ludwig system for females are most commonly used to describe patterns of hair loss. In this article, we review the various classification systems for patterned hair loss in both the sexes. Relevant articles were identified through searches of MEDLINE and EMBASE. Search terms included but were not limited to androgenic alopecia classification, patterned hair loss classification, male pattern baldness classification, and female pattern hair loss classification. Further publications were identified from the reference lists of the reviewed articles.

  2. Protist classification and the kingdoms of organisms.

    Science.gov (United States)

    Whittaker, R H; Margulis, L

    1978-04-01

    Traditional classification imposed a division into plant-like and animal-like forms on the unicellular eukaryotes, or protists; in a current view the protists are a diverse assemblage of plant-, animal- and fungus-like groups. Classification of these into phyla is difficult because of their relatively simple structure and limited geological record, but study of ultrastructure and other characteristics is providing new insight on protist classification. Possible classifications are discussed, and a summary classification of the living world into kingdoms (Monera, Protista, Fungi, Animalia, Plantae) and phyla is suggested. This classification also suggests groupings of phyla into superphyla and form-superphyla, and a broadened kingdom Protista (including green algae, oomycotes and slime molds but excluding red and brown algae). The classification thus seeks to offer a compromise between the protist and protoctist kingdoms of Whittaker and Margulis and to combine a full listing of phyla with grouping of these for synoptic treatment.

  3. miR-10 in development and cancer

    DEFF Research Database (Denmark)

    Lund, Anders Henrik

    2010-01-01

    The microRNA (miRNA) miR-10 family has attracted attention because of its conservation and the position of the miR-10 genes within the Hox clusters of developmental regulators. In several species, miR-10 is coexpressed with a set of Hox genes and has been found to regulate the translation of Hox ...... function to the miRNA repertoire.Cell Death and Differentiation advance online publication, 22 May 2009; doi:10.1038/cdd.2009.58....

  4. 46 CFR 95.50-5 - Classification.

    Science.gov (United States)

    2010-10-01

    ... 46 Shipping 4 2010-10-01 2010-10-01 false Classification. 95.50-5 Section 95.50-5 Shipping COAST... Details § 95.50-5 Classification. (a) Hand portable fire extinguishers and semiportable fire extinguishing... extinguishing systems are set forth in Table 95.50-5(c). Table 95.50-5(c) Classification Type Size Soda-acid and...

  5. Pattern Classification with Memristive Crossbar Circuits

    Science.gov (United States)

    2016-03-31

    Pattern Classification with Memristive Crossbar Circuits Dmitri B. Strukov Department of Electrical and Computer Engineering Department UC Santa...pattern classification ; deep learning; convolutional neural network networks. Introduction Deep-learning convolutional neural networks (DLCNN), which...the best classification performances on a variety of benchmark tasks [1]. The major challenge in building fast and energy- efficient networks of this

  6. Casemix classification systems.

    Science.gov (United States)

    Fetter, R B

    1999-01-01

    The idea of using casemix classification to manage hospital services is not new, but has been limited by available technology. It was not until after the introduction of Medicare in the United States in 1965 that serious attempts were made to measure hospital production in order to contain spiralling costs. This resulted in a system of casemix classification known as diagnosis related groups (DRGs). This paper traces the development of DRGs and their evolution from the initial version to the All Patient Refined DRGs developed in 1991.

  7. An Algebro-Topological Description of Protein Domain Structure

    Science.gov (United States)

    Penner, Robert Clark; Knudsen, Michael; Wiuf, Carsten; Andersen, Jørgen Ellegaard

    2011-01-01

    The space of possible protein structures appears vast and continuous, and the relationship between primary, secondary and tertiary structure levels is complex. Protein structure comparison and classification is therefore a difficult but important task since structure is a determinant for molecular interaction and function. We introduce a novel mathematical abstraction based on geometric topology to describe protein domain structure. Using the locations of the backbone atoms and the hydrogen bonds, we build a combinatorial object – a so-called fatgraph. The description is discrete yet gives rise to a 2-dimensional mathematical surface. Thus, each protein domain corresponds to a particular mathematical surface with characteristic topological invariants, such as the genus (number of holes) and the number of boundary components. Both invariants are global fatgraph features reflecting the interconnectivity of the domain by hydrogen bonds. We introduce the notion of robust variables, that is variables that are robust towards minor changes in the structure/fatgraph, and show that the genus and the number of boundary components are robust. Further, we invesigate the distribution of different fatgraph variables and show how only four variables are capable of distinguishing different folds. We use local (secondary) and global (tertiary) fatgraph features to describe domain structures and illustrate that they are useful for classification of domains in CATH. In addition, we combine our method with two other methods thereby using primary, secondary, and tertiary structure information, and show that we can identify a large percentage of new and unclassified structures in CATH. PMID:21629687

  8. Video genre classification using multimodal features

    Science.gov (United States)

    Jin, Sung Ho; Bae, Tae Meon; Choo, Jin Ho; Ro, Yong Man

    2003-12-01

    We propose a video genre classification method using multimodal features. The proposed method is applied for the preprocessing of automatic video summarization or the retrieval and classification of broadcasting video contents. Through a statistical analysis of low-level and middle-level audio-visual features in video, the proposed method can achieve good performance in classifying several broadcasting genres such as cartoon, drama, music video, news, and sports. In this paper, we adopt MPEG-7 audio-visual descriptors as multimodal features of video contents and evaluate the performance of the classification by feeding the features into a decision tree-based classifier which is trained by CART. The experimental results show that the proposed method can recognize several broadcasting video genres with a high accuracy and the classification performance with multimodal features is superior to the one with unimodal features in the genre classification.

  9. Modified Angle's Classification for Primary Dentition.

    Science.gov (United States)

    Chandranee, Kaushik Narendra; Chandranee, Narendra Jayantilal; Nagpal, Devendra; Lamba, Gagandeep; Choudhari, Purva; Hotwani, Kavita

    2017-01-01

    This study aims to propose a modification of Angle's classification for primary dentition and to assess its applicability in children from Central India, Nagpur. Modification in Angle's classification has been proposed for application in primary dentition. Small roman numbers i/ii/iii are used for primary dentition notation to represent Angle's Class I/II/III molar relationships as in permanent dentition, respectively. To assess applicability of modified Angle's classification a cross-sectional preschool 2000 children population from central India; 3-6 years of age residing in Nagpur metropolitan city of Maharashtra state were selected randomly as per the inclusion and exclusion criteria. Majority 93.35% children were found to have bilateral Class i followed by 2.5% bilateral Class ii and 0.2% bilateral half cusp Class iii molar relationships as per the modified Angle's classification for primary dentition. About 3.75% children had various combinations of Class ii relationships and 0.2% children were having Class iii subdivision relationship. Modification of Angle's classification for application in primary dentition has been proposed. A cross-sectional investigation using new classification revealed various 6.25% Class ii and 0.4% Class iii molar relationships cases in preschool children population in a metropolitan city of Nagpur. Application of the modified Angle's classification to other population groups is warranted to validate its routine application in clinical pediatric dentistry.

  10. Additional sex combs-like 1 belongs to the enhancer of trithorax and Polycomb Group and genetically interacts with Cbx2 in mice

    Science.gov (United States)

    Fisher, C.L.; Lee, I.; Bloyer, S.; Bozza, S.; Chevalier, J.; Dahl, A; Bodner, C.; Helgason, C. D.; Hess, J.L.; Humphries, R.K.; Brock, H.W.

    2009-01-01

    The Additional sex combs (Asx) gene of Drosophila behaves genetically as an enhancer of trithorax and Polycomb (ETP) in displaying bidirectional homeotic phenotypes, suggesting that is required for maintenance of both activation and silencing of Hox genes. There are 3 murine homologs of Asx called Additional sex combs-like1, 2, and-3. Asxl1 is required for normal adult hematopoiesis; however its embryonic function is unknown. We used a targeted mouse mutant line Asxl1tm1Bc to determine if Asxl1 is required to silence and activate Hox genes in mice during axial patterning. The mutant embryos exhibit simultaneous anterior and posterior transformations of the axial skeleton, consistent with a role for Asxl1 in activation and silencing of Hox genes. Transformations of the axial skeleton are enhanced in compound mutant embryos for the Polycomb group gene M33/Cbx2. Hox a4, a7, and c8 are derepressed in Asxl1tm1Bc mutants in the antero-posterior axis, but Hox c8 expression is reduced in the brain of mutants, consistent with Asxl1 being required both for activation and repression of Hox genes. We discuss the genetic and molecular definition of ETPs, and suggest that the function of Asxl1 depends on its cellular context. PMID:19833123

  11. 14 CFR 1203.501 - Applying derivative classification markings.

    Science.gov (United States)

    2010-01-01

    ... INFORMATION SECURITY PROGRAM Derivative Classification § 1203.501 Applying derivative classification markings. Persons who apply derivative classification markings shall: (a) Observe and respect original... 14 Aeronautics and Space 5 2010-01-01 2010-01-01 false Applying derivative classification markings...

  12. Improvement of Classification of Enterprise Circulating Funds

    OpenAIRE

    Rohanova Hanna O.

    2014-01-01

    The goal of the article lies in revelation of possibilities of increase of efficiency of managing enterprise circulating funds by means of improvement of their classification features. Having analysed approaches of many economists to classification of enterprise circulating funds, systemised and supplementing them, the article offers grouping classification features of enterprise circulating funds. In the result of the study the article offers an expanded classification of circulating funds, ...

  13. GIS: a comprehensive source for protein structure similarities.

    Science.gov (United States)

    Guerler, Aysam; Knapp, Ernst-Walter

    2010-07-01

    A web service for analysis of protein structures that are sequentially or non-sequentially similar was generated. Recently, the non-sequential structure alignment algorithm GANGSTA+ was introduced. GANGSTA+ can detect non-sequential structural analogs for proteins stated to possess novel folds. Since GANGSTA+ ignores the polypeptide chain connectivity of secondary structure elements (i.e. alpha-helices and beta-strands), it is able to detect structural similarities also between proteins whose sequences were reshuffled during evolution. GANGSTA+ was applied in an all-against-all comparison on the ASTRAL40 database (SCOP version 1.75), which consists of >10,000 protein domains yielding about 55 x 10(6) possible protein structure alignments. Here, we provide the resulting protein structure alignments as a public web-based service, named GANGSTA+ Internet Services (GIS). We also allow to browse the ASTRAL40 database of protein structures with GANGSTA+ relative to an externally given protein structure using different constraints to select specific results. GIS allows us to analyze protein structure families according to the SCOP classification scheme. Additionally, users can upload their own protein structures for pairwise protein structure comparison, alignment against all protein structures of the ASTRAL40 database (SCOP version 1.75) or symmetry analysis. GIS is publicly available at http://agknapp.chemie.fu-berlin.de/gplus.

  14. Evolutionary hierarchy of vertebrate-like heterotrimeric G protein families.

    Science.gov (United States)

    Krishnan, Arunkumar; Mustafa, Arshi; Almén, Markus Sällman; Fredriksson, Robert; Williams, Michael J; Schiöth, Helgi B

    2015-10-01

    Heterotrimeric G proteins perform a crucial role as molecular switches controlling various cellular responses mediated by G protein-coupled receptor (GPCR) signaling pathway. Recent data have shown that the vertebrate-like G protein families are found across metazoans and their closest unicellular relatives. However, an overall evolutionary hierarchy of vertebrate-like G proteins, including gene family annotations and in particular mapping individual gene gain/loss events across diverse holozoan lineages is still incomplete. Here, with more expanded invertebrate taxon sampling, we have reconstructed phylogenetic trees for each of the G protein classes/families and provide a robust classification and hierarchy of vertebrate-like heterotrimeric G proteins. Our results further extend the evidence that the common ancestor (CA) of holozoans had at least five ancestral Gα genes corresponding to all major vertebrate Gα classes and contain a total of eight genes including two Gβ and one Gγ. Our results also indicate that the GNAI/O-like gene likely duplicated in the last CA of metazoans to give rise to GNAI- and GNAO-like genes, which are conserved across invertebrates. Moreover, homologs of GNB1-4 paralogon- and GNB5 family-like genes are found in most metazoans and that the unicellular holozoans encode two ancestral Gβ genes. Similarly, most bilaterian invertebrates encode two Gγ genes which include a representative of the GNG gene cluster and a putative homolog of GNG13. Interestingly, our results also revealed key evolutionary events such as the Drosophila melanogaster eye specific Gβ subunit that is found conserved in most arthropods and several previously unidentified species specific expansions within Gαi/o, Gαs, Gαq, Gα12/13 classes and the GNB1-4 paralogon. Also, we provide an overall proposed evolutionary scenario on the expansions of all G protein families in vertebrate tetraploidizations. Our robust classification/hierarchy is essential to further

  15. Trends and concepts in fern classification

    Science.gov (United States)

    Christenhusz, Maarten J. M.; Chase, Mark W.

    2014-01-01

    Background and Aims Throughout the history of fern classification, familial and generic concepts have been highly labile. Many classifications and evolutionary schemes have been proposed during the last two centuries, reflecting different interpretations of the available evidence. Knowledge of fern structure and life histories has increased through time, providing more evidence on which to base ideas of possible relationships, and classification has changed accordingly. This paper reviews previous classifications of ferns and presents ideas on how to achieve a more stable consensus. Scope An historical overview is provided from the first to the most recent fern classifications, from which conclusions are drawn on past changes and future trends. The problematic concept of family in ferns is discussed, with a particular focus on how this has changed over time. The history of molecular studies and the most recent findings are also presented. Key Results Fern classification generally shows a trend from highly artificial, based on an interpretation of a few extrinsic characters, via natural classifications derived from a multitude of intrinsic characters, towards more evolutionary circumscriptions of groups that do not in general align well with the distribution of these previously used characters. It also shows a progression from a few broad family concepts to systems that recognized many more narrowly and highly controversially circumscribed families; currently, the number of families recognized is stabilizing somewhere between these extremes. Placement of many genera was uncertain until the arrival of molecular phylogenetics, which has rapidly been improving our understanding of fern relationships. As a collective category, the so-called ‘fern allies’ (e.g. Lycopodiales, Psilotaceae, Equisetaceae) were unsurprisingly found to be polyphyletic, and the term should be abandoned. Lycopodiaceae, Selaginellaceae and Isoëtaceae form a clade (the lycopods) that is

  16. Trends and concepts in fern classification.

    Science.gov (United States)

    Christenhusz, Maarten J M; Chase, Mark W

    2014-03-01

    Throughout the history of fern classification, familial and generic concepts have been highly labile. Many classifications and evolutionary schemes have been proposed during the last two centuries, reflecting different interpretations of the available evidence. Knowledge of fern structure and life histories has increased through time, providing more evidence on which to base ideas of possible relationships, and classification has changed accordingly. This paper reviews previous classifications of ferns and presents ideas on how to achieve a more stable consensus. An historical overview is provided from the first to the most recent fern classifications, from which conclusions are drawn on past changes and future trends. The problematic concept of family in ferns is discussed, with a particular focus on how this has changed over time. The history of molecular studies and the most recent findings are also presented. Fern classification generally shows a trend from highly artificial, based on an interpretation of a few extrinsic characters, via natural classifications derived from a multitude of intrinsic characters, towards more evolutionary circumscriptions of groups that do not in general align well with the distribution of these previously used characters. It also shows a progression from a few broad family concepts to systems that recognized many more narrowly and highly controversially circumscribed families; currently, the number of families recognized is stabilizing somewhere between these extremes. Placement of many genera was uncertain until the arrival of molecular phylogenetics, which has rapidly been improving our understanding of fern relationships. As a collective category, the so-called 'fern allies' (e.g. Lycopodiales, Psilotaceae, Equisetaceae) were unsurprisingly found to be polyphyletic, and the term should be abandoned. Lycopodiaceae, Selaginellaceae and Isoëtaceae form a clade (the lycopods) that is sister to all other vascular plants, whereas

  17. Classification across gene expression microarray studies

    Directory of Open Access Journals (Sweden)

    Kuner Ruprecht

    2009-12-01

    Full Text Available Abstract Background The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies remains still a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive and histological grade (low/high of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM, predictive analysis of microarrays (PAM, random forest (RF and k-top scoring pairs (kTSP. Guided by considerations relevant for classification across studies we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In

  18. 32 CFR 2400.14 - Use of derivative classification.

    Science.gov (United States)

    2010-07-01

    ... SECURITY PROGRAM Derivative Classification § 2400.14 Use of derivative classification. (a) Derivative... 32 National Defense 6 2010-07-01 2010-07-01 false Use of derivative classification. 2400.14... person who applies derivative classification markings believes that the paraphrasing, restating, or...

  19. The Importance of Classification to Business Model Research

    OpenAIRE

    Susan Lambert

    2015-01-01

    Purpose: To bring to the fore the scientific significance of classification and its role in business model theory building. To propose a method by which existing classifications of business models can be analyzed and new ones developed. Design/Methodology/Approach: A review of the scholarly literature relevant to classifications of business models is presented along with a brief overview of classification theory applicable to business model research. Existing business model classification...

  20. Asynchronous data-driven classification of weapon systems

    International Nuclear Information System (INIS)

    Jin, Xin; Mukherjee, Kushal; Gupta, Shalabh; Ray, Asok; Phoha, Shashi; Damarla, Thyagaraju

    2009-01-01

    This communication addresses real-time weapon classification by analysis of asynchronous acoustic data, collected from microphones on a sensor network. The weapon classification algorithm consists of two parts: (i) feature extraction from time-series data using symbolic dynamic filtering (SDF), and (ii) pattern classification based on the extracted features using the language measure (LM) and support vector machine (SVM). The proposed algorithm has been tested on field data, generated by firing of two types of rifles. The results of analysis demonstrate high accuracy and fast execution of the pattern classification algorithm with low memory requirements. Potential applications include simultaneous shooter localization and weapon classification with soldier-wearable networked sensors. (rapid communication)