WorldWideScience

Sample records for retrotransposon protein function

  1. Retrotransposons and non-protein coding RNAs

    DEFF Research Database (Denmark)

    Mourier, Tobias; Willerslev, Eske

    2009-01-01

    does not merely represent spurious transcription. We review examples of functional RNAs transcribed from retrotransposons, and address the collection of non-protein coding RNAs derived from transposable element sequences, including numerous human microRNAs and the neuronal BC RNAs. Finally, we review...

  2. [Ulysses retrotransposon aspartate proteinase (Drosophila virilis)].

    Science.gov (United States)

    Volkov, D A; Savvateeva, L V; Dergousova, N I; Rumsh, L D

    2002-01-01

    Retrotransposones are mobile genetic elements occurring in genomes of bacteria, plants or animals. Retrotransposones were found to contain nucleotide sequences encoding proteins which are homological to retroviral aspartic proteinases. Our research has been focused on Ulysses which is mobile genetic element found in Drosophila virilis. We suggested a primary structure of Ulysses proteinase using comparative analysis of amino acid sequences of retroviral proteinases and proteinases from retrotransposones. The appropriate cDNA fragment has been cloned and expressed in E. coli. The purification of recombinant protein (12 kD) has been carried out by affinity chromatography using pepstatine-agarose. The obtained protein has proteolytic activity at optimum pH 5.5 like the majority of aspartic proteinases.

  3. The Microprocessor controls the activity of mammalian retrotransposons

    DEFF Research Database (Denmark)

    Heras, Sara R.; Macias, Sara; Plass, Mireya

    2013-01-01

    RNA biogenesis, also recognizes and binds RNAs derived from human long interspersed element 1 (LINE-1), Alu and SVA retrotransposons. Expression analyses demonstrate that cells lacking a functional Microprocessor accumulate LINE-1 mRNA and encoded proteins. Furthermore, we show that structured regions...

  4. SREBP controls oxygen-dependent mobilization of retrotransposons in fission yeast.

    Directory of Open Access Journals (Sweden)

    Alfica Sehgal

    2007-08-01

    Full Text Available Retrotransposons are mobile genetic elements that proliferate through an RNA intermediate. Transposons do not encode transcription factors and thus rely on host factors for mRNA expression and survival. Despite information regarding conditions under which elements are upregulated, much remains to be learned about the regulatory mechanisms or factors controlling retrotransposon expression. Here, we report that low oxygen activates the fission yeast Tf2 family of retrotransposons. Sre1, the yeast ortholog of the mammalian membrane-bound transcription factor sterol regulatory element binding protein (SREBP, directly induces the expression and mobilization of Tf2 retrotransposons under low oxygen. Sre1 binds to DNA sequences in the Tf2 long terminal repeat that functions as an oxygen-dependent promoter. We find that Tf2 solo long terminal repeats throughout the genome direct oxygen-dependent expression of adjacent coding and noncoding sequences, providing a potential mechanism for the generation of oxygen-dependent gene expression.

  5. Copia and Gypsy retrotransposons activity in sunflower (Helianthus annuus L.)

    Science.gov (United States)

    2009-01-01

    Background Retrotransposons are heterogeneous sequences, widespread in eukaryotic genomes, which refer to the so-called mobile DNA. They resemble retroviruses, both in their structure and for their ability to transpose within the host genome, of which they make up a considerable portion. Copia- and Gypsy-like retrotransposons are the two main classes of retroelements shown to be ubiquitous in plant genomes. Ideally, the retrotransposons life cycle results in the synthesis of a messenger RNA and then self-encoded proteins to process retrotransposon mRNA in double stranded extra-chromosomal cDNA copies which may integrate in new chromosomal locations. Results The RT-PCR and IRAP protocol were applied to detect the presence of Copia and Gypsy retrotransposon transcripts and of new events of integration in unstressed plants of a sunflower (Helianthus annuus L.) selfed line. Results show that in sunflower retrotransposons transcription occurs in all analyzed organs (embryos, leaves, roots, and flowers). In one out of sixty-four individuals analyzed, retrotransposons transcription resulted in the integration of a new element into the genome. Conclusion These results indicate that the retrotransposon life cycle is firmly controlled at a post transcriptional level. A possible silencing mechanism is discussed. PMID:20030800

  6. Mammalian-specific genomic functions: Newly acquired traits generated by genomic imprinting and LTR retrotransposon-derived genes in mammals.

    Science.gov (United States)

    Kaneko-Ishino, Tomoko; Ishino, Fumitoshi

    2015-01-01

    Mammals, including human beings, have evolved a unique viviparous reproductive system and a highly developed central nervous system. How did these unique characteristics emerge in mammalian evolution, and what kinds of changes did occur in the mammalian genomes as evolution proceeded? A key conceptual term in approaching these issues is "mammalian-specific genomic functions", a concept covering both mammalian-specific epigenetics and genetics. Genomic imprinting and LTR retrotransposon-derived genes are reviewed as the representative, mammalian-specific genomic functions that are essential not only for the current mammalian developmental system, but also mammalian evolution itself. First, the essential roles of genomic imprinting in mammalian development, especially related to viviparous reproduction via placental function, as well as the emergence of genomic imprinting in mammalian evolution, are discussed. Second, we introduce the novel concept of "mammalian-specific traits generated by mammalian-specific genes from LTR retrotransposons", based on the finding that LTR retrotransposons served as a critical driving force in the mammalian evolution via generating mammalian-specific genes.

  7. LINE retrotransposon RNA is an essential structural and functional epigenetic component of a core neocentromeric chromatin.

    Directory of Open Access Journals (Sweden)

    Anderly C Chueh

    2009-01-01

    Full Text Available We have previously identified and characterized the phenomenon of ectopic human centromeres, known as neocentromeres. Human neocentromeres form epigenetically at euchromatic chromosomal sites and are structurally and functionally similar to normal human centromeres. Recent studies have indicated that neocentromere formation provides a major mechanism for centromere repositioning, karyotype evolution, and speciation. Using a marker chromosome mardel(10 containing a neocentromere formed at the normal chromosomal 10q25 region, we have previously mapped a 330-kb CENP-A-binding domain and described an increased prevalence of L1 retrotransposons in the underlying DNA sequences of the CENP-A-binding clusters. Here, we investigated the potential role of the L1 retrotransposons in the regulation of neocentromere activity. Determination of the transcriptional activity of a panel of full-length L1s (FL-L1s across a 6-Mb region spanning the 10q25 neocentromere chromatin identified one of the FL-L1 retrotransposons, designated FL-L1b and residing centrally within the CENP-A-binding clusters, to be transcriptionally active. We demonstrated the direct incorporation of the FL-L1b RNA transcripts into the CENP-A-associated chromatin. RNAi-mediated knockdown of the FL-L1b RNA transcripts led to a reduction in CENP-A binding and an impaired mitotic function of the 10q25 neocentromere. These results indicate that LINE retrotransposon RNA is a previously undescribed essential structural and functional component of the neocentromeric chromatin and that retrotransposable elements may serve as a critical epigenetic determinant in the chromatin remodelling events leading to neocentromere formation.

  8. BARE retrotransposons are translated and replicated via distinct RNA pools.

    Directory of Open Access Journals (Sweden)

    Wei Chang

    Full Text Available The replication of Long Terminal Repeat (LTR retrotransposons, which can constitute over 80% of higher plant genomes, resembles that of retroviruses. A major question for retrotransposons and retroviruses is how the two conflicting roles of their transcripts, in translation and reverse transcription, are balanced. Here, we show that the BARE retrotransposon, despite its organization into just one open reading frame, produces three distinct classes of transcripts. One is capped, polyadenylated, and translated, but cannot be copied into cDNA. The second is not capped or polyadenylated, but is destined for packaging and ultimate reverse transcription. The third class is capped, polyadenylated, and spliced to favor production of a subgenomic RNA encoding only Gag, the protein forming virus-like particles. Moreover, the BARE2 subfamily, which cannot synthesize Gag and is parasitic on BARE1, does not produce the spliced sub-genomic RNA for translation but does make the replication competent transcripts, which are packaged into BARE1 particles. To our knowledge, this is first demonstration of distinct RNA pools for translation and transcription for any retrotransposon.

  9. The RNAPII-CTD Maintains Genome Integrity through Inhibition of Retrotransposon Gene Expression and Transposition.

    Directory of Open Access Journals (Sweden)

    Maria J Aristizabal

    2015-10-01

    Full Text Available RNA polymerase II (RNAPII contains a unique C-terminal domain that is composed of heptapeptide repeats and which plays important regulatory roles during gene expression. RNAPII is responsible for the transcription of most protein-coding genes, a subset of non-coding genes, and retrotransposons. Retrotransposon transcription is the first step in their multiplication cycle, given that the RNA intermediate is required for the synthesis of cDNA, the material that is ultimately incorporated into a new genomic location. Retrotransposition can have grave consequences to genome integrity, as integration events can change the gene expression landscape or lead to alteration or loss of genetic information. Given that RNAPII transcribes retrotransposons, we sought to investigate if the RNAPII-CTD played a role in the regulation of retrotransposon gene expression. Importantly, we found that the RNAPII-CTD functioned to maintaining genome integrity through inhibition of retrotransposon gene expression, as reducing CTD length significantly increased expression and transposition rates of Ty1 elements. Mechanistically, the increased Ty1 mRNA levels in the rpb1-CTD11 mutant were partly due to Cdk8-dependent alterations to the RNAPII-CTD phosphorylation status. In addition, Cdk8 alone contributed to Ty1 gene expression regulation by altering the occupancy of the gene-specific transcription factor Ste12. Loss of STE12 and TEC1 suppressed growth phenotypes of the RNAPII-CTD truncation mutant. Collectively, our results implicate Ste12 and Tec1 as general and important contributors to the Cdk8, RNAPII-CTD regulatory circuitry as it relates to the maintenance of genome integrity.

  10. Drosophila: Retrotransposons Making up Telomeres.

    Science.gov (United States)

    Casacuberta, Elena

    2017-07-19

    Drosophila and extant species are the best-studied telomerase exception. In this organism, telomere elongation is coupled with targeted retrotransposition of Healing Transposon (HeT-A) and Telomere Associated Retrotransposon (TART) with sporadic additions of Telomere Associated and HeT-A Related (TAHRE), all three specialized non-Long Terminal Repeat (non-LTR) retrotransposons. These three very special retroelements transpose in head to tail arrays, always in the same orientation at the end of the chromosomes but never in interior locations. Apparently, retrotransposon and telomerase telomeres might seem very different, but a detailed view of their mechanisms reveals similarities explaining how the loss of telomerase in a Drosophila ancestor could successfully have been replaced by the telomere retrotransposons. In this review, we will discover that although HeT-A, TART, and TAHRE are still the only examples to date where their targeted transposition is perfectly tamed into the telomere biology of Drosophila, there are other examples of retrotransposons that manage to successfully integrate inside and at the end of telomeres. Because the aim of this special issue is viral integration at telomeres, understanding the base of the telomerase exceptions will help to obtain clues on similar strategies that mobile elements and viruses could have acquired in order to ensure their survival in the host genome.

  11. The Influence of LINE-1 and SINE Retrotransposons on Mammalian Genomes.

    Science.gov (United States)

    Richardson, Sandra R; Doucet, Aurélien J; Kopera, Huira C; Moldovan, John B; Garcia-Perez, José Luis; Moran, John V

    2015-04-01

    Transposable elements have had a profound impact on the structure and function of mammalian genomes. The retrotransposon Long INterspersed Element-1 (LINE-1 or L1), by virtue of its replicative mobilization mechanism, comprises ∼17% of the human genome. Although the vast majority of human LINE-1 sequences are inactive molecular fossils, an estimated 80-100 copies per individual retain the ability to mobilize by a process termed retrotransposition. Indeed, LINE-1 is the only active, autonomous retrotransposon in humans and its retrotransposition continues to generate both intra-individual and inter-individual genetic diversity. Here, we briefly review the types of transposable elements that reside in mammalian genomes. We will focus our discussion on LINE-1 retrotransposons and the non-autonomous Short INterspersed Elements (SINEs) that rely on the proteins encoded by LINE-1 for their mobilization. We review cases where LINE-1-mediated retrotransposition events have resulted in genetic disease and discuss how the characterization of these mutagenic insertions led to the identification of retrotransposition-competent LINE-1s in the human and mouse genomes. We then discuss how the integration of molecular genetic, biochemical, and modern genomic technologies have yielded insight into the mechanism of LINE-1 retrotransposition, the impact of LINE-1-mediated retrotransposition events on mammalian genomes, and the host cellular mechanisms that protect the genome from unabated LINE-1-mediated retrotransposition events. Throughout this review, we highlight unanswered questions in LINE-1 biology that provide exciting opportunities for future research. Clearly, much has been learned about LINE-1 and SINE biology since the publication of Mobile DNA II thirteen years ago. Future studies should continue to yield exciting discoveries about how these retrotransposons contribute to genetic diversity in mammalian genomes.

  12. [Non-LTR retrotransposons: LINEs and SINEs in plant genome].

    Science.gov (United States)

    Cheng, Xu-Dong; Ling, Hong-Qing

    2006-06-01

    Retrotransposons are one of the drivers of genome evolution. They include LTR (long terminal repeat) retrotransposons, which widespread in Eukaryotagenomes, show structural similarity to retroviruses. Non-LTR retrotransposons were first discovered in animal genomes and then identified as ubiquitous components of nuclear genomes in many species across the plant kingdom. They constitute a large fraction of the repetitive DNA. Non-LTR retrotransposons are divided into LINEs (long interspersed nuclear elements) and SINEs (short interspersed nuclear elements). Transposition of non-LTR retrotransposons is rarely observed in plants indicating that most of them are inactive and/or under regulation of the host genome. Transposition is poorly understood, but experimental evidence from other genetic systems shows that LINEs are able to transpose autonomously while non-autonomous SINEs depend on the reverse transcription machinery of other retrotransposons. Phylogenic analysis shows LINEs are probably the most ancient class of retrotransposons in plant genomes, while the origin of SINEs is unknown. This review sums up the above data and wants to show readers a clear picture of non-LTR retrotransposons.

  13. Envelope-like retrotransposons in the plant kingdom: evidence of their presence in gymnosperms (Pinus pinaster).

    Science.gov (United States)

    Miguel, Célia; Simões, Marta; Oliveira, Maria Margarida; Rocheta, Margarida

    2008-11-01

    Retroviruses differ from retrotransposons due to their infective capacity, which depends critically on the encoded envelope. Some plant retroelements contain domains reminiscent of the env of animal retroviruses but the number of such elements described to date is restricted to angiosperms. We show here the first evidence of the presence of putative env-like gene sequences in a gymnosperm species, Pinus pinaster (maritime pine). Using a degenerate primer approach for conserved domains of RNaseH gene, three clones from putative envelope-like retrotransposons (PpRT2, PpRT3, and PpRT4) were identified. The env-like sequences of P. pinaster clones are predicted to encode proteins with transmembrane domains. These sequences showed identity scores of up to 30% with env-like sequences belonging to different organisms. A phylogenetic analysis based on protein alignment of deduced aminoacid sequences revealed that these clones clustered with env-containing plant retrotransposons, as well as with retrotransposons from invertebrate organisms. The differences found among the sequences of maritime pine clones isolated here suggest the existence of different putative classes of env-like retroelements. The identification for the first time of env-like genes in a gymnosperm species may support the ancestrality of retroviruses among plants shedding light on their role in plant evolution.

  14. LTR retrotransposon landscape in Medicago truncatula: more rapid removal than in rice

    Directory of Open Access Journals (Sweden)

    Liu Jin-Song

    2008-08-01

    Full Text Available Abstract Background Long terminal repeat retrotransposons (LTR elements are ubiquitous Eukaryotic TEs that transpose through RNA intermediates. Accounting for significant proportion of many plant genomes, LTR elements have been well established as one of the major forces underlying the evolution of plant genome size, structure and function. The accessibility of more than 40% of genomic sequences of the model legume Medicago truncatula (Mt has made the comprehensive study of its LTR elements possible. Results We use a newly developed tool LTR_FINDER to identify LTR retrotransposons in the Mt genome and detect 526 full-length elements as well as a great number of copies related to them. These elements constitute about 9.6% of currently available genomic sequences. They are classified into 85 families of which 64 are reported for the first time. The majority of the LTR retrotransposons belong to either Copia or Gypsy superfamily and the others are categorized as TRIMs or LARDs by their length. We find that the copy-number of Copia-like families is 3 times more than that of Gypsy-like ones but the latter contribute more to the genome. The analysis of PBS and protein-coding domain structure of the LTR families reveals that they tend to use only 4–5 types of tRNAs and many families have quite conservative ORFs besides known TE domains. For several important families, we describe in detail their abundance, conservation, insertion time and structure. We investigate the amplification-deletion pattern of the elements and find that the detectable full-length elements are relatively young and most of them were inserted within the last 0.52 MY. We also estimate that more than ten million bp of the Mt genomic sequences have been removed by the deletion of LTR elements and the removal of the full-length structures in Mt has been more rapid than in rice. Conclusion This report is the first comprehensive description and analysis of LTR retrotransposons in the

  15. Determinants of Genomic RNA Encapsidation in the Saccharomyces cerevisiae Long Terminal Repeat Retrotransposons Ty1 and Ty3

    Directory of Open Access Journals (Sweden)

    Katarzyna Pachulska-Wieczorek

    2016-07-01

    Full Text Available Long-terminal repeat (LTR retrotransposons are transposable genetic elements that replicate intracellularly, and can be considered progenitors of retroviruses. Ty1 and Ty3 are the most extensively characterized LTR retrotransposons whose RNA genomes provide the template for both protein translation and genomic RNA that is packaged into virus-like particles (VLPs and reverse transcribed. Genomic RNAs are not divided into separate pools of translated and packaged RNAs, therefore their trafficking and packaging into VLPs requires an equilibrium between competing events. In this review, we focus on Ty1 and Ty3 genomic RNA trafficking and packaging as essential steps of retrotransposon propagation. We summarize the existing knowledge on genomic RNA sequences and structures essential to these processes, the role of Gag proteins in repression of genomic RNA translation, delivery to VLP assembly sites, and encapsidation.

  16. LTR-retrotransposons-based molecular markers in cultivated ...

    African Journals Online (AJOL)

    GRACE

    2006-07-03

    Jul 3, 2006 ... LTR-retrotransposons represent a standard component of the Gossypium Genome (Zaki and Abdel Ghany,. 2003). The analysis of the molecular existence and distribution of ancient and active LTR-retrotransposons, therefore, provides a comprehensive evaluation of the evolutionary history of Gossypium.

  17. Characterization of active reverse transcriptase and nucleoprotein complexes of the yeast retrotransposon Ty3 in vitro.

    Science.gov (United States)

    Cristofari, G; Gabus, C; Ficheux, D; Bona, M; Le Grice, S F; Darlix, J L

    1999-12-17

    Human immunodeficiency virus (HIV) and the distantly related yeast Ty3 retrotransposon encode reverse transcriptase (RT) and a nucleic acid-binding protein designated nucleocapsid protein (NCp) with either one or two zinc fingers, required for HIV-1 replication and Ty3 transposition, respectively. In vitro binding of HIV-1 NCp7 to viral 5' RNA and primer tRNA(3)(Lys) catalyzes formation of nucleoprotein complexes resembling the virion nucleocapsid. Nucleocapsid complex formation functions in viral RNA dimerization and tRNA annealing to the primer binding site (PBS). RT is recruited in these nucleoprotein complexes and synthesizes minus-strand cDNA initiated at the PBS. Recent results on yeast Ty3 have shown that the homologous NCp9 promotes annealing of primer tRNA(i)(Met) to a 5'-3' bipartite PBS, allowing RNA:tRNA dimer formation and initiation of cDNA synthesis at the 5' PBS (). To compare specific cDNA synthesis in a retrotransposon and HIV-1, we have established a Ty3 model system comprising Ty3 RNA with the 5'-3' PBS, primer tRNA(i)(Met), NCp9, and for the first time, highly purified Ty3 RT. Here we report that Ty3 RT is as active as retroviral HIV-1 or murine leukemia virus RT using a synthetic template-primer system. Moreover, and in contrast to what was found with retroviral RTs, retrotransposon Ty3 RT was unable to direct cDNA synthesis by self-priming. We also show that Ty3 nucleoprotein complexes were formed in vitro and that the N terminus of NCp9, but not the zinc finger, is required for complex formation, tRNA annealing to the PBS, RNA dimerization, and primer tRNA-directed cDNA synthesis by Ty3 RT. These results indicate that NCp9 chaperones bona fide cDNA synthesis by RT in the yeast Ty3 retrotransposon, as illustrated for NCp7 in HIV-1, reinforcing the notion that Ty3 NCp9 is an ancestor of HIV-1 NCp7.

  18. Convergent evolution of ribonuclease h in LTR retrotransposons and retroviruses.

    Science.gov (United States)

    Ustyantsev, Kirill; Novikova, Olga; Blinov, Alexander; Smyshlyaev, Georgy

    2015-05-01

    Ty3/Gypsy long terminals repeat (LTR) retrotransposons are structurally and phylogenetically close to retroviruses. Two notable structural differences between these groups of genetic elements are 1) the presence in retroviruses of an additional envelope gene, env, which mediates infection, and 2) a specific dual ribonuclease H (RNH) domain encoded by the retroviral pol gene. However, similar to retroviruses, many Ty3/Gypsy LTR retrotransposons harbor additional env-like genes, promoting concepts of the infective mode of these retrotransposons. Here, we provide a further line of evidence of similarity between retroviruses and some Ty3/Gypsy LTR retrotransposons. We identify that, together with their additional genes, plant Ty3/Gypsy LTR retrotransposons of the Tat group have a second RNH, as do retroviruses. Most importantly, we show that the resulting dual RNHs of Tat LTR retrotransposons and retroviruses emerged independently, providing strong evidence for their convergent evolution. The convergent resemblance of Tat LTR retrotransposons and retroviruses may indicate similar selection pressures acting on these diverse groups of elements and reveal potential evolutionary constraints on their structure. We speculate that dual RNH is required to accelerate retrotransposon evolution through increased rates of strand transfer events and subsequent recombination events. © The Author 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  19. Repetitive DNA and Plant Domestication: Variation in Copy Number and Proximity to Genes of LTR-Retrotransposons among Wild and Cultivated Sunflower (Helianthus annuus) Genotypes.

    Science.gov (United States)

    Mascagni, Flavia; Barghini, Elena; Giordani, Tommaso; Rieseberg, Loren H; Cavallini, Andrea; Natali, Lucia

    2015-11-24

    The sunflower (Helianthus annuus) genome contains a very large proportion of transposable elements, especially long terminal repeat retrotransposons. However, knowledge on the retrotransposon-related variability within this species is still limited. We used next-generation sequencing (NGS) technologies to perform a quantitative and qualitative survey of intraspecific variation of the retrotransposon fraction of the genome across 15 genotypes--7 wild accessions and 8 cultivars--of H. annuus. By mapping the Illumina reads of the 15 genotypes onto a library of sunflower long terminal repeat retrotransposons, we observed considerable variability in redundancy among genotypes, at both superfamily and family levels. In another analysis, we mapped Illumina paired reads to two sets of sequences, that is, long terminal repeat retrotransposons and protein-encoding sequences, and evaluated the extent of retrotransposon proximity to genes in the sunflower genome by counting the number of paired reads in which one read mapped to a retrotransposon and the other to a gene. Large variability among genotypes was also ascertained for retrotransposon proximity to genes. Both long terminal repeat retrotransposon redundancy and proximity to genes varied among retrotransposon families and also between cultivated and wild genotypes. Such differences are discussed in relation to the possible role of long terminal repeat retrotransposons in the domestication of sunflower. © The Author(s) 2015. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  20. Prediction of retrotransposons and assessment of genetic variability based on developed retrotransposon-based insertion polymorphism (RBIP) markers in Pyrus L.

    Science.gov (United States)

    Jiang, Shuang; Zong, Yu; Yue, Xiaoyan; Postman, Joseph; Teng, Yuanwen; Cai, Danying

    2015-02-01

    Interspecific hybridization has been considered the major mode of evolution in Pyrus (pear), and thus, the genetic relationships within this genus have not been well documented. Retrotransposons are ubiquitous components of plant genomes and 42.4 % of the pear genome was reported to be long terminal repeat (LTR) retrotransposons, implying that retrotransposons might be significant in the evolution of Pyrus. In this study, 1,836 putative full-length LTR retrotransposons were isolated and 196 retrotransposon-based insertion polymorphism (RBIP) primers were developed, of which 24 pairs to the Ppcr1 subfamily of copia retrotransposons were used to analyze genetic diversity among 110 Pyrus accessions from Eurasia. Our results showed that Ppcr1 replicated many times in the development of cultivated Asian pears. The genetic structure analysis and the unweighted pair group method with arithmetic mean (UPGMA) dendrogram indicated that all accessions could be divided into Oriental and Occidental groups. In Oriental pears, wild pea pears clustered separately into independent groups in accordance with their morphological classifications. Cultivars of P. ussuriensis Maxim, P. pyrifolia Nakai, and P. pyrifolia Chinese white pear were mingled together, which inferred that hybridization events occurred during the development of the cultivated Asian pears. In Occidental pears, two clades were obtained in the UPGMA dendrogram in accordance with their geographical distribution; one contained the European species and the other included species from North Africa and West Asia. New findings in this study will be important to further understand the phylogeny of Pyrus and origins of cultivated pears.

  1. Citrus and Prunuscopia-like retrotransposons.

    Science.gov (United States)

    Asíns, M J; Monforte, A J; Mestre, P F; Carbonell, E A

    1999-08-01

    Many of the world's most important citrus cultivars ("Washington Navel", satsumas, clementines) have arisen through somatic mutation. This phenomenon occurs fairly often in the various species and varieties of the genus.The presence of copia-like retrotransposons has been investigated in fruit trees, especially citrus, by using a PCR assay designed to detect copia-like reverse transcriptase (RT) sequences. Amplification products from a genotype of each the following species Citrus sinensis, Citrus grandis, Citrus clementina, Prunus armeniaca and Prunus amygdalus, were cloned and some of them sequenced. Southern-blot hybridization using RT clones as probes showed that multiple copies are integrated throughout the citrus genome, while only 1-3 copies are detected in the P. armeniaca genome, which is in accordance with the Citrus and Prunus genome sizes. Sequence analysis of RT clones allowed a search for homologous sequences within three gene banks. The most similar ones correspond to RT domains of copia-like retrotransposons from unrelated plant species. Cluster analysis of these sequences has shown a great heterogeneity among RT domains cloned from the same genotype. This finding supports the hypothesis that horizontal transmission of retrotransposons has occurred in the past. The species presenting a RT sequence most similar to citrus RT clones is Gnetum montanum, a gymnosperm whose distribution area coincides with two of the main centers of origin of Citrus spp. A new C-methylated restriction DNA fragment containing a RT sequence is present in navel sweet oranges, but not in Valencia oranges from which the former originated suggesting, that retrotransposon activity might be, at least in part, involved in the genetic variability among sweet orange cultivars. Given that retrotransposons are quite abundant throughout the citrus genome, their activity should be investigated thoroughly before commercializing any transgenic citrus plant where the transgene(s) is part

  2. Full Length Research Paper LTR-retrotransposons-based molecular ...

    African Journals Online (AJOL)

    LTR-retrotransposons possess unique properties that make them appropriate for investigating relationships between closely related species and populations. The aim of the current study was to employ Ty1-copia group retrotransposons as molecular markers in cultivated Egyptian cottons, G. barbadense L. Restriction site ...

  3. Sequencing the extrachromosomal circular mobilome reveals retrotransposon activity in plants.

    Directory of Open Access Journals (Sweden)

    Sophie Lanciano

    2017-02-01

    Full Text Available Retrotransposons are mobile genetic elements abundant in plant and animal genomes. While efficiently silenced by the epigenetic machinery, they can be reactivated upon stress or during development. Their level of transcription not reflecting their transposition ability, it is thus difficult to evaluate their contribution to the active mobilome. Here we applied a simple methodology based on the high throughput sequencing of extrachromosomal circular DNA (eccDNA forms of active retrotransposons to characterize the repertoire of mobile retrotransposons in plants. This method successfully identified known active retrotransposons in both Arabidopsis and rice material where the epigenome is destabilized. When applying mobilome-seq to developmental stages in wild type rice, we identified PopRice as a highly active retrotransposon producing eccDNA forms in the wild type endosperm. The mobilome-seq strategy opens new routes for the characterization of a yet unexplored fraction of plant genomes.

  4. Sequencing the extrachromosomal circular mobilome reveals retrotransposon activity in plants.

    Science.gov (United States)

    Lanciano, Sophie; Carpentier, Marie-Christine; Llauro, Christel; Jobet, Edouard; Robakowska-Hyzorek, Dagmara; Lasserre, Eric; Ghesquière, Alain; Panaud, Olivier; Mirouze, Marie

    2017-02-01

    Retrotransposons are mobile genetic elements abundant in plant and animal genomes. While efficiently silenced by the epigenetic machinery, they can be reactivated upon stress or during development. Their level of transcription not reflecting their transposition ability, it is thus difficult to evaluate their contribution to the active mobilome. Here we applied a simple methodology based on the high throughput sequencing of extrachromosomal circular DNA (eccDNA) forms of active retrotransposons to characterize the repertoire of mobile retrotransposons in plants. This method successfully identified known active retrotransposons in both Arabidopsis and rice material where the epigenome is destabilized. When applying mobilome-seq to developmental stages in wild type rice, we identified PopRice as a highly active retrotransposon producing eccDNA forms in the wild type endosperm. The mobilome-seq strategy opens new routes for the characterization of a yet unexplored fraction of plant genomes.

  5. Impact of Low-Energy Ion Beam Implantation on the Expression of Ty1-copia-like Retrotransposons in Wheat (Triticum aestivum)

    International Nuclear Information System (INIS)

    Ya Huiyuan; Jiao Zhen; Gu Yunhong; Wang Weidong; Qin Guangyong; Huo Yuping

    2007-01-01

    Retrotransposon-like elements are major constituents of most eukaryotic genomes. For example, they account for roughly 90% of the wheat (Triticum aestivum) genome. Previous study on a wheat strain treated by low-energy N + ions indicated the variations in AFLP (Amplified Fragment Length Polymorphism ) markers. One such variation was caused by the re-activation of Ty1-copia-like retrotransposons, implying that the mutagenic effects of low-energy ions might work through elevated activation of retrotransposons. In this paper an expression profile of Ty1-copia-like retrotransposons in wheat treated by low-energy N + ions is reported. The reverse transcriptase (RT) domains of these retrotransposons were amplified by reverse-transcriptional polymerase chain reaction (RT-PCR) and sequentially cloned. 42 and 65 clones were obtained from the treated (CL) and control materials (CK), respectively. Sequence analysis of each clone was performed by software. Phylogeny and classification were calculated responding to the sequences of the RT domains. All the results show that there is much difference in the RT domain between the control sample and the treated sample. Especially, the RT domains from the treated group encode significantly more functional ORF (open reading frames) than those from the control sample. This observation suggests that the treated sample has higher activation of retrotransposons, possibly as a consequence of low-energy ion beam irradiation. It also suggests that retrotransposons in the two groups impact the host gene expression in two different ways and carry out different functions in wheat cells

  6. Retrotransposon Domestication and Control in Dictyostelium discoideum

    Directory of Open Access Journals (Sweden)

    Marek Malicki

    2017-10-01

    Full Text Available Transposable elements, identified in all eukaryotes, are mobile genetic units that can change their genomic position. Transposons usually employ an excision and reintegration mechanism, by which they change position, but not copy number. In contrast, retrotransposons amplify via RNA intermediates, increasing their genomic copy number. Hence, they represent a particular threat to the structural and informational integrity of the invaded genome. The social amoeba Dictyostelium discoideum, model organism of the evolutionary Amoebozoa supergroup, features a haploid, gene-dense genome that offers limited space for damage-free transposition. Several of its contemporary retrotransposons display intrinsic integration preferences, for example by inserting next to transfer RNA genes or other retroelements. Likely, any retrotransposons that invaded the genome of the amoeba in a non-directed manner were lost during evolution, as this would result in decreased fitness of the organism. Thus, the positional preference of the Dictyostelium retroelements might represent a domestication of the selfish elements. Likewise, the reduced danger of such domesticated transposable elements led to their accumulation, and they represent about 10% of the current genome of D. discoideum. To prevent the uncontrolled spreading of retrotransposons, the amoeba employs control mechanisms including RNA interference and heterochromatization. Here, we review TRE5-A, DIRS-1 and Skipper-1, as representatives of the three retrotransposon classes in D. discoideum, which make up 5.7% of the Dictyostelium genome. We compile open questions with respect to their mobility and cellular regulation, and suggest strategies, how these questions might be addressed experimentally.

  7. Human Retrotransposon Insertion Polymorphisms Are Associated with Health and Disease via Gene Regulatory Phenotypes

    Directory of Open Access Journals (Sweden)

    Lu Wang

    2017-08-01

    Full Text Available The human genome hosts several active families of transposable elements (TEs, including the Alu, LINE-1, and SVA retrotransposons that are mobilized via reverse transcription of RNA intermediates. We evaluated how insertion polymorphisms generated by human retrotransposon activity may be related to common health and disease phenotypes that have been previously interrogated through genome-wide association studies (GWAS. To address this question, we performed a genome-wide screen for retrotransposon polymorphism disease associations that are linked to TE induced gene regulatory changes. Our screen first identified polymorphic retrotransposon insertions found in linkage disequilibrium (LD with single nucleotide polymorphisms that were previously associated with common complex diseases by GWAS. We further narrowed this set of candidate disease associated retrotransposon polymorphisms by identifying insertions that are located within tissue-specific enhancer elements. We then performed expression quantitative trait loci analysis on the remaining set of candidates in order to identify polymorphic retrotransposon insertions that are associated with gene expression changes in B-cells of the human immune system. This progressive and stringent screen yielded a list of six retrotransposon insertions as the strongest candidates for TE polymorphisms that lead to disease via enhancer-mediated changes in gene regulation. For example, we found an SVA insertion within a cell-type specific enhancer located in the second intron of the B4GALT1 gene. B4GALT1 encodes a glycosyltransferase that functions in the glycosylation of the Immunoglobulin G (IgG antibody in such a way as to convert its activity from pro- to anti-inflammatory. The disruption of the B4GALT1 enhancer by the SVA insertion is associated with down-regulation of the gene in B-cells, which would serve to keep the IgG molecule in a pro-inflammatory state. Consistent with this idea, the B4GALT1 enhancer

  8. Transferability of retrotransposon primers derived from Persimmon (Diospyros kaki Thunb.) across other plant species.

    Science.gov (United States)

    Du, X Y; Hu, Q N; Zhang, Q L; Wang, Y B; Luo, Z R

    2013-06-06

    Retrotransposon-based molecular markers are powerful molecular tools. However, these markers are not readily available due to the difficulty in obtaining species-specific retrotransposon primers. Although recent techniques enabling the rapid isolation of retrotransposon sequences have facilitated primer development, this process nonetheless remains time-consuming and costly. Therefore, research into the transferability of retrotransposon primers developed from one plant species onto others would be of great value. The present study investigated the transferability of retrotransposon primers derived from 'Luotian-tianshi' persimmon (Diospyros kaki Thunb.) across other fruit crops, as well as within the genus using inter-retrotransposon amplified polymorphism molecular marker. Fourteen of the 26 retrotransposon primers tested (53.85%) produced robust and reproducible amplification products across all fruit crops tested, indicating their applicability across plant species. Four of the 13 fruit crops showed the best transferability performances: persimmon, grape, citrus, and peach. Furthermore, similarity coefficients and UPGMA clustering indicated that these primers could further offer a potential tool for germplasm differentiation, parentage identification, genetic diversity assessment, classification, and phylogenetic studies across a variety of plant species. Transferability was further confirmed by examining published primers derived from Rosaceae, Gramineae, and Solanaceae. This study is one of the few currently available studies concerning the transferability of retrotransposon primers across plant species in general, and is the first successful study of the transferability of retrotransposon primers derived from persimmon. The primers presented here will help reduce costs for future retrotransposon primer development and therefore contribute to the popularization of retrotransposon molecular markers.

  9. Retrotransposon hypomethylation in melanoma and expression of a placenta-specific gene.

    Directory of Open Access Journals (Sweden)

    Erin C Macaulay

    Full Text Available In the human placenta, DNA hypomethylation permits the expression of retrotransposon-derived genes that are normally silenced by methylation in somatic tissues. We previously identified hypomethylation of a retrotransposon-derived transcript of the voltage-gated potassium channel gene KCNH5 that is expressed only in human placenta. However, an RNA sequence from this placental-specific transcript has been reported in melanoma. This study examined the promoter methylation and expression of the retrotransposon-derived KCNH5 transcript in 25 melanoma cell lines to determine whether the acquisition of 'placental' epigenetic marks is a feature of melanoma. Methylation and gene expression analysis revealed hypomethylation of this retrotransposon in melanoma cell lines, particularly in those samples that express the placental KCNH5 transcript. Therefore we propose that hypomethylation of the placental-specific KCNH5 promoter is frequently associated with KCNH5 expression in melanoma cells. Our findings show that melanoma can develop hypomethylation of a retrotransposon-derived gene; a characteristic notably shared with the normal placenta.

  10. Modeling the amplification dynamics of human Alu retrotransposons.

    Directory of Open Access Journals (Sweden)

    Dale J Hedges

    2005-09-01

    Full Text Available Retrotransposons have had a considerable impact on the overall architecture of the human genome. Currently, there are three lineages of retrotransposons (Alu, L1, and SVA that are believed to be actively replicating in humans. While estimates of their copy number, sequence diversity, and levels of insertion polymorphism can readily be obtained from existing genomic sequence data and population sampling, a detailed understanding of the temporal pattern of retrotransposon amplification remains elusive. Here we pose the question of whether, using genomic sequence and population frequency data from extant taxa, one can adequately reconstruct historical amplification patterns. To this end, we developed a computer simulation that incorporates several known aspects of primate Alu retrotransposon biology and accommodates sampling effects resulting from the methods by which mobile elements are typically discovered and characterized. By modeling a number of amplification scenarios and comparing simulation-generated expectations to empirical data gathered from existing Alu subfamilies, we were able to statistically reject a number of amplification scenarios for individual subfamilies, including that of a rapid expansion or explosion of Alu amplification at the time of human-chimpanzee divergence.

  11. Modeling the amplification dynamics of human alu retrotransposons.

    Directory of Open Access Journals (Sweden)

    2005-09-01

    Full Text Available Retrotransposons have had a considerable impact on the overall architecture of the human genome. Currently, there are three lineages of retrotransposons (Alu, L1, and SVA that are believed to be actively replicating in humans. While estimates of their copy number, sequence diversity, and levels of insertion polymorphism can readily be obtained from existing genomic sequence data and population sampling, a detailed understanding of the temporal pattern of retrotransposon amplification remains elusive. Here we pose the question of whether, using genomic sequence and population frequency data from extant taxa, one can adequately reconstruct historical amplification patterns. To this end, we developed a computer simulation that incorporates several known aspects of primate Alu retrotransposon biology and accommodates sampling effects resulting from the methods by which mobile elements are typically discovered and characterized. By modeling a number of amplification scenarios and comparing simulation-generated expectations to empirical data gathered from existing Alu subfamilies, we were able to statistically reject a number of amplification scenarios for individual subfamilies, including that of a rapid expansion or explosion of Alu amplification at the time of human-chimpanzee divergence.

  12. iPBS: a universal method for DNA fingerprinting and retrotransposon isolation.

    Science.gov (United States)

    Kalendar, Ruslan; Antonius, Kristiina; Smýkal, Petr; Schulman, Alan H

    2010-11-01

    Molecular markers are essential in plant and animal breeding and biodiversity applications, in human forensics, and for map-based cloning of genes. The long terminal repeat (LTR) retrotransposons are well suited as molecular markers. As dispersed and ubiquitous transposable elements, their "copy and paste" life cycle of replicative transposition leads to new genome insertions without excision of the original element. Both the overall structure of retrotransposons and the domains responsible for the various phases of their replication are highly conserved in all eukaryotes. Nevertheless, up to a year has been required to develop a retrotransposon marker system in a new species, involving cloning and sequencing steps as well as the development of custom primers. Here, we describe a novel PCR-based method useful both as a marker system in its own right and for the rapid isolation of retrotransposon termini and full-length elements, making it ideal for "orphan crops" and other species with underdeveloped marker systems. The method, iPBS amplification, is based on the virtually universal presence of a tRNA complement as a reverse transcriptase primer binding site (PBS) in LTR retrotransposons. The method differs from earlier retrotransposon isolation methods because it is applicable not only to endogenous retroviruses and retroviruses, but also to both Gypsy and Copia LTR retrotransposons, as well as to non-autonomous LARD and TRIM elements, throughout the plant kingdom and to animals. Furthermore, the inter-PBS amplification technique as such has proved to be a powerful DNA fingerprinting technology without the need for prior sequence knowledge.

  13. Infection-Induced Retrotransposon-Derived Noncoding RNAs Enhance Herpesviral Gene Expression via the NF-κB Pathway.

    Directory of Open Access Journals (Sweden)

    John Karijolich

    Full Text Available Short interspersed nuclear elements (SINEs are highly abundant, RNA polymerase III-transcribed noncoding retrotransposons that are silenced in somatic cells but activated during certain stresses including viral infection. How these induced SINE RNAs impact the host-pathogen interaction is unknown. Here we reveal that during murine gammaherpesvirus 68 (MHV68 infection, rapidly induced SINE RNAs activate the antiviral NF-κB signaling pathway through both mitochondrial antiviral-signaling protein (MAVS-dependent and independent mechanisms. However, SINE RNA-based signaling is hijacked by the virus to enhance viral gene expression and replication. B2 RNA expression stimulates IKKβ-dependent phosphorylation of the major viral lytic cycle transactivator protein RTA, thereby enhancing its activity and increasing progeny virion production. Collectively, these findings suggest that SINE RNAs participate in the innate pathogen response mechanism, but that herpesviruses have evolved to co-opt retrotransposon activation for viral benefit.

  14. Host factors that promote retrotransposon integration are similar in distantly related eukaryotes.

    Directory of Open Access Journals (Sweden)

    Sudhir Kumar Rai

    2017-12-01

    Full Text Available Retroviruses and Long Terminal Repeat (LTR-retrotransposons have distinct patterns of integration sites. The oncogenic potential of retrovirus-based vectors used in gene therapy is dependent on the selection of integration sites associated with promoters. The LTR-retrotransposon Tf1 of Schizosaccharomyces pombe is studied as a model for oncogenic retroviruses because it integrates into the promoters of stress response genes. Although integrases (INs encoded by retroviruses and LTR-retrotransposons are responsible for catalyzing the insertion of cDNA into the host genome, it is thought that distinct host factors are required for the efficiency and specificity of integration. We tested this hypothesis with a genome-wide screen of host factors that promote Tf1 integration. By combining an assay for transposition with a genetic assay that measures cDNA recombination we could identify factors that contribute differentially to integration. We utilized this assay to test a collection of 3,004 S. pombe strains with single gene deletions. Using these screens and immunoblot measures of Tf1 proteins, we identified a total of 61 genes that promote integration. The candidate integration factors participate in a range of processes including nuclear transport, transcription, mRNA processing, vesicle transport, chromatin structure and DNA repair. Two candidates, Rhp18 and the NineTeen complex were tested in two-hybrid assays and were found to interact with Tf1 IN. Surprisingly, a number of pathways we identified were found previously to promote integration of the LTR-retrotransposons Ty1 and Ty3 in Saccharomyces cerevisiae, indicating the contribution of host factors to integration are common in distantly related organisms. The DNA repair factors are of particular interest because they may identify the pathways that repair the single stranded gaps flanking the sites of strand transfer following integration of LTR retroelements.

  15. Host factors that promote retrotransposon integration are similar in distantly related eukaryotes.

    Science.gov (United States)

    Rai, Sudhir Kumar; Sangesland, Maya; Lee, Michael; Esnault, Caroline; Cui, Yujin; Chatterjee, Atreyi Ghatak; Levin, Henry L

    2017-12-01

    Retroviruses and Long Terminal Repeat (LTR)-retrotransposons have distinct patterns of integration sites. The oncogenic potential of retrovirus-based vectors used in gene therapy is dependent on the selection of integration sites associated with promoters. The LTR-retrotransposon Tf1 of Schizosaccharomyces pombe is studied as a model for oncogenic retroviruses because it integrates into the promoters of stress response genes. Although integrases (INs) encoded by retroviruses and LTR-retrotransposons are responsible for catalyzing the insertion of cDNA into the host genome, it is thought that distinct host factors are required for the efficiency and specificity of integration. We tested this hypothesis with a genome-wide screen of host factors that promote Tf1 integration. By combining an assay for transposition with a genetic assay that measures cDNA recombination we could identify factors that contribute differentially to integration. We utilized this assay to test a collection of 3,004 S. pombe strains with single gene deletions. Using these screens and immunoblot measures of Tf1 proteins, we identified a total of 61 genes that promote integration. The candidate integration factors participate in a range of processes including nuclear transport, transcription, mRNA processing, vesicle transport, chromatin structure and DNA repair. Two candidates, Rhp18 and the NineTeen complex were tested in two-hybrid assays and were found to interact with Tf1 IN. Surprisingly, a number of pathways we identified were found previously to promote integration of the LTR-retrotransposons Ty1 and Ty3 in Saccharomyces cerevisiae, indicating the contribution of host factors to integration are common in distantly related organisms. The DNA repair factors are of particular interest because they may identify the pathways that repair the single stranded gaps flanking the sites of strand transfer following integration of LTR retroelements.

  16. Stress-induced rearrangement of Fusarium retrotransposon sequences.

    Science.gov (United States)

    Anaya, N; Roncero, M I

    1996-11-27

    Rearrangement of fusarium oxysporum retrotransposon skippy was induced by growth in the presence of potassium chlorate. Three fungal strains, one sensitive to chlorate (Co60) and two resistant to chlorate and deficient for nitrate reductase (Co65 and Co94), were studied by Southern analysis of their genomic DNA. Polymorphism was detected in their hybridization banding pattern, relative to the wild type grown in the absence of chlorate, using various enzymes with or without restriction sites within the retrotransposon. Results were consistent with the assumption that three different events had occurred in strain Co60: genomic amplification of skippy yielding tandem arrays of the element, generation of new skippy sequences, and deletion of skippy sequences. Amplification of Co60 genomic DNA using the polymerase chain reaction and divergent primers derived from the retrotransposon generated a new band, corresponding to one long terminal repeat plus flanking sequences, that was not present in the wild-type strain. Molecular analysis of nitrate reductase-deficient mutants showed that generation and deletion of skippy sequences, but not genomic amplification in tandem repeats, had occurred in their genomes.

  17. Retrotransposons as regulators of gene expression.

    Science.gov (United States)

    Elbarbary, Reyad A; Lucas, Bronwyn A; Maquat, Lynne E

    2016-02-12

    Transposable elements (TEs) are both a boon and a bane to eukaryotic organisms, depending on where they integrate into the genome and how their sequences function once integrated. We focus on two types of TEs: long interspersed elements (LINEs) and short interspersed elements (SINEs). LINEs and SINEs are retrotransposons; that is, they transpose via an RNA intermediate. We discuss how LINEs and SINEs have expanded in eukaryotic genomes and contribute to genome evolution. An emerging body of evidence indicates that LINEs and SINEs function to regulate gene expression by affecting chromatin structure, gene transcription, pre-mRNA processing, or aspects of mRNA metabolism. We also describe how adenosine-to-inosine editing influences SINE function and how ongoing retrotransposition is countered by the body's defense mechanisms. Copyright © 2016, American Association for the Advancement of Science.

  18. Proteolytic Processing and Assembly of gag and gag-pol Proteins of TED, a Baculovirus-Associated Retrotransposon of the Gypsy Family

    Science.gov (United States)

    Hajek, Kathryn L.; Friesen, Paul D.

    1998-01-01

    TED (transposable element D) is an env-containing member of the gypsy family of retrotransposons that represents a possible retrovirus of invertebrates. This lepidopteran (moth) retroelement contains gag and pol genes that encode proteins capable of forming viruslike particles (VLP) with reverse transcriptase. Since VLP are likely intermediates in TED transposition, we investigated the roles of gag and pol in TED capsid assembly and maturation. By using constructed baculovirus vectors and TED Gag-specific antiserum, we show that the principal translation product of gag (Pr55gag) is cleaved to produce a single VLP structural protein, p37gag. Replacement of Asp436 within the retrovirus-like active site of the pol-encoded protease (PR) abolished Pr55gag cleavage and demonstrated the requirement for PR in capsid processing. As shown by expression of an in-frame fusion of TED gag and pol, PR is derived from the Gag-Pol polyprotein Pr195gag-pol. The PR cleavage site within Pr55gag was mapped to a position near the junction of a basic, nucleocapsid-like domain and a C-terminal acidic domain. Once released by cleavage, the C-terminal fragment was not detected. This acidic fragment was dispensable for VLP assembly, as demonstrated by the formation of VLP by C-terminal Pr55gag truncation proteins and replacement of the acidic domain with a heterologous protein. In contrast, C-terminal deletions that extended into the adjacent nucleocapsid-like domain of Pr55gag abolished VLP recovery and demonstrated that this central region contributes to VLP assembly or stability, or both. Collectively, these data suggest that the single TED protein p37gag provides both capsid and nucleocapsid functions. TED may therefore use a simple processing strategy for VLP assembly and genome packaging. PMID:9765414

  19. Activation of an endogenous retrotransposon associated with epigenetic changes in Lotus japonicus

    DEFF Research Database (Denmark)

    Fukai, Eigo; Stougaard, Jens; Hayashi, Makoto

    2013-01-01

    Long terminal repeat retrotransposons occupy a large portion of genomes in flowering plants. In spite of their abundance, the majority are silenced and rarely transpose. One of the examples of a highly active retrotransposon is Lotus Retrotransposon 1(LORE1), of the model legume Lotus japonicus...... significance of LORE1 as a member of chromovirus, a chromodomain containing clade of the Gypsy superfamily. Then we discuss possibilities and methodologies for using endogenous transposable elements as mutagens to generate gene tagging populations in plants...

  20. LTR retrotransposons in fungi.

    Directory of Open Access Journals (Sweden)

    Anna Muszewska

    Full Text Available Transposable elements with long terminal direct repeats (LTR TEs are one of the best studied groups of mobile elements. They are ubiquitous elements present in almost all eukaryotic genomes. Their number and state of conservation can be a highlight of genome dynamics. We searched all published fungal genomes for LTR-containing retrotransposons, including both complete, functional elements and remnant copies. We identified a total of over 66,000 elements, all of which belong to the Ty1/Copia or Ty3/Gypsy superfamilies. Most of the detected Gypsy elements represent Chromoviridae, i.e. they carry a chromodomain in the pol ORF. We analyzed our data from a genome-ecology perspective, looking at the abundance of various types of LTR TEs in individual genomes and at the highest-copy element from each genome. The TE content is very variable among the analyzed genomes. Some genomes are very scarce in LTR TEs (8000 elements. The data shows that transposon expansions in fungi usually involve an increase both in the copy number of individual elements and in the number of element types. The majority of the highest-copy TEs from all genomes are Ty3/Gypsy transposons. Phylogenetic analysis of these elements suggests that TE expansions have appeared independently of each other, in distant genomes and at different taxonomical levels. We also analyzed the evolutionary relationships between protein domains encoded by the transposon pol ORF and we found that the protease is the fastest evolving domain whereas reverse transcriptase and RNase H evolve much slower and in correlation with each other.

  1. New aspartic proteinase of Ulysses retrotransposon from Drosophila virilis.

    Science.gov (United States)

    Volkov, D A; Dergousova, N I; Rumsh, L D

    2004-06-01

    This work is focused on the investigation of a proteinase of Ulysses mobile genetic element from Drosophila virilis. The primary structure of this proteinase is suggested based on comparative analysis of amino acid sequences of aspartic proteinases from retroviruses and retrotransposons. The corresponding cDNA fragment has been cloned and expressed in E. coli. The protein accumulated in inclusion bodies. The recombinant protein (12 kD) was subjected to refolding and purified by affinity chromatography on pepstatin-agarose. Proteolytic activity of the protein was determined using oligopeptide substrates melittin and insulin B-chain. It was found that the maximum of the proteolytic activity is displayed at pH 5.5 as for the majority of aspartic proteinases. We observed that hydrolysis of B-chain of insulin was totally inhibited by pepstatin A in the micromolar concentration range. The molecular weight of the monomer of the Ulysses proteinase was determined by MALDI-TOF mass-spectrometry.

  2. Sequencing the extrachromosomal circular mobilome reveals retrotransposon activity in plants

    OpenAIRE

    Lanciano, Sophie; Carpentier, M. C.; Llauro, C.; Jobet, E.; Robakowska-Hyzorek, D.; Lasserre, E.; Ghesquière, Alain; Panaud, O.; Mirouze, Marie

    2017-01-01

    Retrotransposons are mobile genetic elements abundant in plant and animal genomes. While efficiently silenced by the epigenetic machinery, they can be reactivated upon stress or during development. Their level of transcription not reflecting their transposition ability, it is thus difficult to evaluate their contribution to the active mobilome. Here we applied a simple methodology based on the high throughput sequencing of extrachromosomal circular DNA (eccDNA) forms of active retrotransposon...

  3. Retrotransposon-Encoded Reverse Transcriptase in the Genesis, Progression and Cellular Plasticity of Human Cancer

    International Nuclear Information System (INIS)

    Sinibaldi-Vallebona, Paola; Matteucci, Claudia; Spadafora, Corrado

    2011-01-01

    LINE-1 (Long Interspersed Nuclear Elements) and HERVs (Human Endogenous Retroviruses) are two families of autonomously replicating retrotransposons that together account for about 28% of the human genome. Genes harbored within LINE-1 and HERV retrotransposons, particularly those encoding the reverse transcriptase (RT) enzyme, are generally expressed at low levels in differentiated cells, but their expression is upregulated in transformed cells and embryonic tissues. Here we discuss a recently discovered RT-dependent mechanism that operates in tumorigenesis and reversibly modulates phenotypic and functional variations associated with tumor progression. Downregulation of active LINE-1 elements drastically reduces the tumorigenic potential of cancer cells, paralleled by reduced proliferation and increased differentiation. Pharmacological RT inhibitors (e.g., nevirapine and efavirenz) exert similar effects on tumorigenic cell lines, both in culture and in animal models. The HERV-K family play a distinct complementary role in stress-dependent transition of melanoma cells from an adherent, non-aggressive, to a non-adherent, highly malignant, growth phenotype. In synthesis, the retrotransposon-encoded RT is increasingly emerging as a key regulator of tumor progression and a promising target in a novel anti-cancer therapy

  4. An evolutionary arms race between KRAB zinc-finger genes ZNF91/93 and SVA/L1 retrotransposons

    NARCIS (Netherlands)

    Jacobs, F.M.J.; Greenberg, D.; Nguyen, N.; Haeussler, M.; Ewing, A.D.; Katzman, S.; Paten, B.; Salama, S.R.; Haussler, D.

    2014-01-01

    Throughout evolution primate genomes have been modified by waves of retrotransposon insertions1, 2, 3. For each wave, the host eventually finds a way to repress retrotransposon transcription and prevent further insertions. In mouse embryonic stem cells, transcriptional silencing of retrotransposons

  5. Not so bad after all: retroviruses and long terminal repeat retrotransposons as a source of new genes in vertebrates.

    Science.gov (United States)

    Naville, M; Warren, I A; Haftek-Terreau, Z; Chalopin, D; Brunet, F; Levin, P; Galiana, D; Volff, J-N

    2016-04-01

    Viruses and transposable elements, once considered as purely junk and selfish sequences, have repeatedly been used as a source of novel protein-coding genes during the evolution of most eukaryotic lineages, a phenomenon called 'molecular domestication'. This is exemplified perfectly in mammals and other vertebrates, where many genes derived from long terminal repeat (LTR) retroelements (retroviruses and LTR retrotransposons) have been identified through comparative genomics and functional analyses. In particular, genes derived from gag structural protein and envelope (env) genes, as well as from the integrase-coding and protease-coding sequences, have been identified in humans and other vertebrates. Retroelement-derived genes are involved in many important biological processes including placenta formation, cognitive functions in the brain and immunity against retroelements, as well as in cell proliferation, apoptosis and cancer. These observations support an important role of retroelement-derived genes in the evolution and diversification of the vertebrate lineage. Copyright © 2016 European Society of Clinical Microbiology and Infectious Diseases. Published by Elsevier Ltd. All rights reserved.

  6. Profiling of Human Molecular Pathways Affected by Retrotransposons at the Level of Regulation by Transcription Factor Proteins

    Science.gov (United States)

    Nikitin, Daniil; Penzar, Dmitry; Garazha, Andrew; Sorokin, Maxim; Tkachev, Victor; Borisov, Nicolas; Poltorak, Alexander; Prassolov, Vladimir; Buzdin, Anton A.

    2018-01-01

    Endogenous retroviruses and retrotransposons also termed retroelements (REs) are mobile genetic elements that were active until recently in human genome evolution. REs regulate gene expression by actively reshaping chromatin structure or by directly providing transcription factor binding sites (TFBSs). We aimed to identify molecular processes most deeply impacted by the REs in human cells at the level of TFBS regulation. By using ENCODE data, we identified ~2 million TFBS overlapping with putatively regulation-competent human REs located in 5-kb gene promoter neighborhood (~17% of all TFBS in promoter neighborhoods; ~9% of all RE-linked TFBS). Most of REs hosting TFBS were highly diverged repeats, and for the evolutionary young (0–8% diverged) elements we identified only ~7% of all RE-linked TFBS. The gene-specific distributions of RE-linked TFBS generally correlated with the distributions for all TFBS. However, several groups of molecular processes were highly enriched in the RE-linked TFBS regulation. They were strongly connected with the immunity and response to pathogens, with the negative regulation of gene transcription, ubiquitination, and protein degradation, extracellular matrix organization, regulation of STAT signaling, fatty acids metabolism, regulation of GTPase activity, protein targeting to Golgi, regulation of cell division and differentiation, development and functioning of perception organs and reproductive system. By contrast, the processes most weakly affected by the REs were linked with the conservative aspects of embryo development. We also identified differences in the regulation features by the younger and older fractions of the REs. The regulation by the older fraction of the REs was linked mainly with the immunity, cell adhesion, cAMP, IGF1R, Notch, Wnt, and integrin signaling, neuronal development, chondroitin sulfate and heparin metabolism, and endocytosis. The younger REs regulate other aspects of immunity, cell cycle progression and

  7. Profiling of Human Molecular Pathways Affected by Retrotransposons at the Level of Regulation by Transcription Factor Proteins

    Directory of Open Access Journals (Sweden)

    Daniil Nikitin

    2018-01-01

    Full Text Available Endogenous retroviruses and retrotransposons also termed retroelements (REs are mobile genetic elements that were active until recently in human genome evolution. REs regulate gene expression by actively reshaping chromatin structure or by directly providing transcription factor binding sites (TFBSs. We aimed to identify molecular processes most deeply impacted by the REs in human cells at the level of TFBS regulation. By using ENCODE data, we identified ~2 million TFBS overlapping with putatively regulation-competent human REs located in 5-kb gene promoter neighborhood (~17% of all TFBS in promoter neighborhoods; ~9% of all RE-linked TFBS. Most of REs hosting TFBS were highly diverged repeats, and for the evolutionary young (0–8% diverged elements we identified only ~7% of all RE-linked TFBS. The gene-specific distributions of RE-linked TFBS generally correlated with the distributions for all TFBS. However, several groups of molecular processes were highly enriched in the RE-linked TFBS regulation. They were strongly connected with the immunity and response to pathogens, with the negative regulation of gene transcription, ubiquitination, and protein degradation, extracellular matrix organization, regulation of STAT signaling, fatty acids metabolism, regulation of GTPase activity, protein targeting to Golgi, regulation of cell division and differentiation, development and functioning of perception organs and reproductive system. By contrast, the processes most weakly affected by the REs were linked with the conservative aspects of embryo development. We also identified differences in the regulation features by the younger and older fractions of the REs. The regulation by the older fraction of the REs was linked mainly with the immunity, cell adhesion, cAMP, IGF1R, Notch, Wnt, and integrin signaling, neuronal development, chondroitin sulfate and heparin metabolism, and endocytosis. The younger REs regulate other aspects of immunity, cell cycle

  8. Switching of dominant retrotransposon silencing strategies from posttranscriptional to transcriptional mechanisms during male germ-cell development in mice.

    Directory of Open Access Journals (Sweden)

    Kota Inoue

    2017-07-01

    Full Text Available Mammalian genomes harbor millions of retrotransposon copies, some of which are transpositionally active. In mouse prospermatogonia, PIWI-interacting small RNAs (piRNAs combat retrotransposon activity to maintain the genomic integrity. The piRNA system destroys retrotransposon-derived RNAs and guides de novo DNA methylation at some retrotransposon promoters. However, it remains unclear whether DNA methylation contributes to retrotransposon silencing in prospermatogonia. We have performed comprehensive studies of DNA methylation and polyA(+ RNAs (transcriptome in developing male germ cells from Pld6/Mitopld and Dnmt3l knockout mice, which are defective in piRNA biogenesis and de novo DNA methylation, respectively. The Dnmt3l mutation greatly reduced DNA methylation levels at most retrotransposons, but its impact on their RNA abundance was limited in prospermatogonia. In Pld6 mutant germ cells, although only a few retrotransposons exhibited reduced DNA methylation, many showed increased expression at the RNA level. More detailed analysis of RNA sequencing, nascent RNA quantification, profiling of cleaved RNA ends, and the results obtained from double knockout mice suggest that PLD6 works mainly at the posttranscriptional level. The increase in retrotransposon expression was larger in Pld6 mutants than it was in Dnmt3l mutants, suggesting that RNA degradation by the piRNA system plays a more important role than does DNA methylation in prospermatogonia. However, DNA methylation had a long-term effect: hypomethylation caused by the Pld6 or Dnmt3l mutation resulted in increased retrotransposon expression in meiotic spermatocytes. Thus, posttranscriptional silencing plays an important role in the early stage of germ cell development, then transcriptional silencing becomes important in later stages. In addition, intergenic and intronic retrotransposon sequences, in particular those containing the antisense L1 promoters, drove ectopic expression of nearby

  9. Evolutionary genomics revealed interkingdom distribution of Tcn1-like chromodomain-containing Gypsy LTR retrotransposons among fungi and plants

    Directory of Open Access Journals (Sweden)

    Blinov Alexander

    2010-04-01

    Full Text Available Abstract Background Chromodomain-containing Gypsy LTR retrotransposons or chromoviruses are widely distributed among eukaryotes and have been found in plants, fungi and vertebrates. The previous comprehensive survey of chromoviruses from mosses (Bryophyta suggested that genomes of non-seed plants contain the clade which is closely related to the retrotransposons from fungi. The origin, distribution and evolutionary history of this clade remained unclear mainly due to the absence of information concerning the diversity and distribution of LTR retrotransposons in other groups of non-seed plants as well as in fungal genomes. Results In present study we preformed in silico analysis of chromodomain-containing LTR retrotransposons in 25 diverse fungi and a number of plant species including spikemoss Selaginella moellendorffii (Lycopodiophyta coupled with an experimental survey of chromodomain-containing Gypsy LTR retrotransposons from diverse non-seed vascular plants (lycophytes, ferns, and horsetails. Our mining of Gypsy LTR retrotransposons in genomic sequences allowed identification of numerous families which have not been described previously in fungi. Two new well-supported clades, Galahad and Mordred, as well as several other previously unknown lineages of chromodomain-containing Gypsy LTR retrotransposons were described based on the results of PCR-mediated survey of LTR retrotransposon fragments from ferns, horsetails and lycophytes. It appeared that one of the clades, namely Tcn1 clade, was present in basidiomycetes and non-seed plants including mosses (Bryophyta and lycophytes (genus Selaginella. Conclusions The interkingdom distribution is not typical for chromodomain-containing LTR retrotransposons clades which are usually very specific for a particular taxonomic group. Tcn1-like LTR retrotransposons from fungi and non-seed plants demonstrated high similarity to each other which can be explained by strong selective constraints and the

  10. Identification of an internal ribosome entry segment in the 5' region of the mouse VL30 retrotransposon and its use in the development of retroviral vectors.

    Science.gov (United States)

    López-Lastra, M; Ulrici, S; Gabus, C; Darlix, J L

    1999-10-01

    Mouse virus-like 30S RNAs (VL30m) constitute a family of retrotransposons, present at 100 to 200 copies, dispersed in the mouse genome. They display little sequence homology to Moloney murine leukemia virus (MoMLV), do not encode virus-like proteins, and have not been implicated in retroviral carcinogenesis. However, VL30 RNAs are efficiently packaged into MLV particles that are propagated in cell culture. In this study, we addressed whether the 5' region of VL30m could replace the 5' leader of MoMLV functionally in a recombinant vector construct. Our data confirm that the putative packaging sequence of VL30 is located within the 5' region (nucleotides 362 to 1149 with respect to the cap structure) and that it can replace the packaging sequence of MoMLV. We also show that VL30m contains an internal ribosome entry segment (IRES) in the 5' region, as do MoMLV, Friend murine leukemia virus, Harvey murine sarcoma virus, and avian reticuloendotheliosis virus type A. Our data show that both the packaging and IRES functions of the 5' region of VL30m RNA can be efficiently used to develop retrotransposon-based vectors.

  11. PpRT1: the first complete gypsy-like retrotransposon isolated in Pinus pinaster.

    Science.gov (United States)

    Rocheta, Margarida; Cordeiro, Jorge; Oliveira, M; Miguel, Célia

    2007-02-01

    We have isolated and characterized a complete retrotransposon sequence, named PpRT1, from the genome of Pinus pinaster. PpRT1 is 5,966 bp long and is closely related to IFG7 gypsy retrotransposon from Pinus radiata. The long terminal repeats (LTRs) have 333 bp each and show a 5.4% sequence divergence between them. In addition to the characteristic polypurine tract (PPT) and the primer binding site (PBS), PpRT1 carries internal regions with homology to retroviral genes gag and pol. The pol region contains sequence motifs related to the enzymes protease, reverse transcriptase, RNAseH and integrase in the same typical order known for Ty3/gypsy-like retrotransposons. PpRT1 was extended from an EST database sequence indicating that its transcription is occurring in pine tissues. Southern blot analyses indicate however, that PpRT1 is present in a unique or a low number of copies in the P. pinaster genome. The differences in nucleotide sequence found between PpRT1 and IFG7 may explain the strikingly different copy number in the two pine species genome. Based on the homologies observed when comparing LTR region among different gypsy elements we propose that the highly conserved LTR regions may be useful to amplify other retrotransposon sequences of the same or close retrotransposon family.

  12. LTRsift: a graphical user interface for semi-automatic classification and postprocessing of de novo detected LTR retrotransposons.

    Science.gov (United States)

    Steinbiss, Sascha; Kastens, Sascha; Kurtz, Stefan

    2012-11-07

    Long terminal repeat (LTR) retrotransposons are a class of eukaryotic mobile elements characterized by a distinctive sequence similarity-based structure. Hence they are well suited for computational identification. Current software allows for a comprehensive genome-wide de novo detection of such elements. The obvious next step is the classification of newly detected candidates resulting in (super-)families. Such a de novo classification approach based on sequence-based clustering of transposon features has been proposed before, resulting in a preliminary assignment of candidates to families as a basis for subsequent manual refinement. However, such a classification workflow is typically split across a heterogeneous set of glue scripts and generic software (for example, spreadsheets), making it tedious for a human expert to inspect, curate and export the putative families produced by the workflow. We have developed LTRsift, an interactive graphical software tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations. Its user-friendly interface offers customizable filtering and classification functionality, displaying the putative candidate groups, their members and their internal structure in a hierarchical fashion. To ease manual work, it also supports graphical user interface-driven reassignment, splitting and further annotation of candidates. Export of grouped candidate sets in standard formats is possible. In two case studies, we demonstrate how LTRsift can be employed in the context of a genome-wide LTR retrotransposon survey effort. LTRsift is a useful and convenient tool for semi-automated classification of newly detected LTR retrotransposons based on their internal features. Its efficient implementation allows for convenient and seamless filtering and classification in an integrated environment. Developed for life scientists, it is helpful in postprocessing and refining the output of software for predicting LTR

  13. Analysis of Hopi/Osr27 and Houba/Tos5/Osr13 retrotransposons in rice

    Directory of Open Access Journals (Sweden)

    Gozde Yuzbasioglu

    2016-03-01

    Full Text Available We investigated Hopi/Osr27 (gypsy and Houba/Tos5/Osr13 (copy retrotransposon movements in 10-day-old roots and leaves of Oryza sativa cvs. Ipsala, Beser and Osmancik-97. Seeds from these three cultivars were germinated between filter papers in Petri dishes for 10 days. Three biologically independent (nonrelated seeds were germinated for each cultivar. Then, roots and leaves grown from the same rice plant were harvested and used for genomic DNA isolation. Inter-retrotransposon amplified polymorphism–polymerase chain reaction with suitable primers was performed with each DNA template to analyze the movements of Hopi/Osr27 and Houba/Tos5/Osr13 retrotransposons. Polymorphism ratios were evaluated both among cultivars and among roots and leaves from the same cultivar. The polymorphism ratios ranged from 0% to 17% for Hopi/Osr27 and from 10% to 87% for Houba/Tos5/Osr13. The obtained results at retrotransposon and varietal levels indicated that the retrotransposon type and genotype dependence are responsible for the occurrence of different variations. Transposable elements are very important for understanding the relationship between cultivars and evolution. Our findings are expected to contribute to the understanding of spontaneous genomic insertion events and their effects on the genetic and epigenetic changes during rice development.

  14. Evolutionary characterization of Ty3/gypsy-like LTR retrotransposons in the parasitic cestode Echinococcus granulosus.

    Science.gov (United States)

    Bae, Young-An

    2016-11-01

    Cyclophyllidean cestodes including Echinococcus granulosus have a smaller genome and show characteristics such as loss of the gut, a segmented body plan, and accelerated growth rate in hosts compared with other tissue-invading helminths. In an effort to address the molecular mechanism relevant to genome shrinkage, the evolutionary status of long-terminal-repeat (LTR) retrotransposons, which are known as the most potent genomic modulators, was investigated in the E. granulosus draft genome. A majority of the E. granulosus LTR retrotransposons were classified into a novel characteristic clade, named Saci-2, of the Ty3/gypsy family, while the remaining elements belonged to the CsRn1 clade of identical family. Their nucleotide sequences were heavily corrupted by frequent base substitutions and segmental losses. The ceased mobile activity of the major retrotransposons and the following intrinsic DNA loss in their inactive progenies might have contributed to decrease in genome size. Apart from the degenerate copies, a gag gene originating from a CsRn1-like element exhibited substantial evidences suggesting its domestication including a preserved coding profile and transcriptional activity, the presence of syntenic orthologues in cestodes, and selective pressure acting on the gene. To my knowledge, the endogenized gag gene is reported for the first time in invertebrates, though its biological function remains elusive.

  15. The role of retrotransposons in gene family expansions: insights from the mouse Abp gene family.

    Science.gov (United States)

    Janoušek, Václav; Karn, Robert C; Laukaitis, Christina M

    2013-05-29

    Retrotransposons have been suggested to provide a substrate for non-allelic homologous recombination (NAHR) and thereby promote gene family expansion. Their precise role, however, is controversial. Here we ask whether retrotransposons contributed to the recent expansions of the Androgen-binding protein (Abp) gene families that occurred independently in the mouse and rat genomes. Using dot plot analysis, we found that the most recent duplication in the Abp region of the mouse genome is flanked by L1Md_T elements. Analysis of the sequence of these elements revealed breakpoints that are the relicts of the recombination that caused the duplication, confirming that the duplication arose as a result of NAHR using L1 elements as substrates. L1 and ERVII retrotransposons are considerably denser in the Abp regions than in one Mb flanking regions, while other repeat types are depleted in the Abp regions compared to flanking regions. L1 retrotransposons preferentially accumulated in the Abp gene regions after lineage separation and roughly followed the pattern of Abp gene expansion. By contrast, the proportion of shared vs. lineage-specific ERVII repeats in the Abp region resembles the rest of the genome. We confirmed the role of L1 repeats in Abp gene duplication with the identification of recombinant L1Md_T elements at the edges of the most recent mouse Abp gene duplication. High densities of L1 and ERVII repeats were found in the Abp gene region with abrupt transitions at the region boundaries, suggesting that their higher densities are tightly associated with Abp gene duplication. We observed that the major accumulation of L1 elements occurred after the split of the mouse and rat lineages and that there is a striking overlap between the timing of L1 accumulation and expansion of the Abp gene family in the mouse genome. Establishing a link between the accumulation of L1 elements and the expansion of the Abp gene family and identification of an NAHR-related breakpoint in

  16. How a retrotransposon exploits the plant's heat stress response for its activation.

    Directory of Open Access Journals (Sweden)

    Vladimir V Cavrak

    2014-01-01

    Full Text Available Retrotransposons are major components of plant and animal genomes. They amplify by reverse transcription and reintegration into the host genome but their activity is usually epigenetically silenced. In plants, genomic copies of retrotransposons are typically associated with repressive chromatin modifications installed and maintained by RNA-directed DNA methylation. To escape this tight control, retrotransposons employ various strategies to avoid epigenetic silencing. Here we describe the mechanism developed by ONSEN, an LTR-copia type retrotransposon in Arabidopsis thaliana. ONSEN has acquired a heat-responsive element recognized by plant-derived heat stress defense factors, resulting in transcription and production of full length extrachromosomal DNA under elevated temperatures. Further, the ONSEN promoter is free of CG and CHG sites, and the reduction of DNA methylation at the CHH sites is not sufficient to activate the element. Since dividing cells have a more pronounced heat response, the extrachromosomal ONSEN DNA, capable of reintegrating into the genome, accumulates preferentially in the meristematic tissue of the shoot. The recruitment of a major plant heat shock transcription factor in periods of heat stress exploits the plant's heat stress response to achieve the transposon's activation, making it impossible for the host to respond appropriately to stress without losing control over the invader.

  17. Transcriptionally active LTR retrotransposons in Eucalyptus genus are differentially expressed and insertionally polymorphic.

    Science.gov (United States)

    Marcon, Helena Sanches; Domingues, Douglas Silva; Silva, Juliana Costa; Borges, Rafael Junqueira; Matioli, Fábio Filippi; Fontes, Marcos Roberto de Mattos; Marino, Celso Luis

    2015-08-14

    In Eucalyptus genus, studies on genome composition and transposable elements (TEs) are particularly scarce. Nearly half of the recently released Eucalyptus grandis genome is composed by retrotransposons and this data provides an important opportunity to understand TE dynamics in Eucalyptus genome and transcriptome. We characterized nine families of transcriptionally active LTR retrotransposons from Copia and Gypsy superfamilies in Eucalyptus grandis genome and we depicted genomic distribution and copy number in two Eucalyptus species. We also evaluated genomic polymorphism and transcriptional profile in three organs of five Eucalyptus species. We observed contrasting genomic and transcriptional behavior in the same family among different species. RLC_egMax_1 was the most prevalent family and RLC_egAngela_1 was the family with the lowest copy number. Most families of both superfamilies have their insertions occurring Eucalyptus species. Using EST analysis and qRT-PCRs, we observed transcriptional activity in several tissues and in all evaluated species. In some families, osmotic stress increases transcript values. Our strategy was successful in isolating transcriptionally active retrotransposons in Eucalyptus, and each family has a particular genomic and transcriptional pattern. Overall, our results show that retrotransposon activity have differentially affected genome and transcriptome among Eucalyptus species.

  18. Optical tweezers reveal how proteins alter replication

    Science.gov (United States)

    Chaurasiya, Kathy

    Single molecule force spectroscopy is a powerful method that explores the DNA interaction properties of proteins involved in a wide range of fundamental biological processes such as DNA replication, transcription, and repair. We use optical tweezers to capture and stretch a single DNA molecule in the presence of proteins that bind DNA and alter its mechanical properties. We quantitatively characterize the DNA binding mechanisms of proteins in order to provide a detailed understanding of their function. In this work, we focus on proteins involved in replication of Escherichia coli (E. coli ), endogenous eukaryotic retrotransposons Ty3 and LINE-1, and human immunodeficiency virus (HIV). DNA polymerases replicate the entire genome of the cell, and bind both double-stranded DNA (dsDNA) and single-stranded DNA (ssDNA) during DNA replication. The replicative DNA polymerase in the widely-studied model system E. coli is the DNA polymerase III subunit alpha (DNA pol III alpha). We use optical tweezers to determine that UmuD, a protein that regulates bacterial mutagenesis through its interactions with DNA polymerases, specifically disrupts alpha binding to ssDNA. This suggests that UmuD removes alpha from its ssDNA template to allow DNA repair proteins access to the damaged DNA, and to facilitate exchange of the replicative polymerase for an error-prone translesion synthesis (TLS) polymerase that inserts nucleotides opposite the lesions, so that bacterial DNA replication may proceed. This work demonstrates a biophysical mechanism by which E. coli cells tolerate DNA damage. Retroviruses and retrotransposons reproduce by copying their RNA genome into the nuclear DNA of their eukaryotic hosts. Retroelements encode proteins called nucleic acid chaperones, which rearrange nucleic acid secondary structure and are therefore required for successful replication. The chaperone activity of these proteins requires strong binding affinity for both single- and double-stranded nucleic

  19. LTRsift: a graphical user interface for semi-automatic classification and postprocessing of de novo detected LTR retrotransposons

    Directory of Open Access Journals (Sweden)

    Steinbiss Sascha

    2012-11-01

    Full Text Available Abstract Background Long terminal repeat (LTR retrotransposons are a class of eukaryotic mobile elements characterized by a distinctive sequence similarity-based structure. Hence they are well suited for computational identification. Current software allows for a comprehensive genome-wide de novo detection of such elements. The obvious next step is the classification of newly detected candidates resulting in (super-families. Such a de novo classification approach based on sequence-based clustering of transposon features has been proposed before, resulting in a preliminary assignment of candidates to families as a basis for subsequent manual refinement. However, such a classification workflow is typically split across a heterogeneous set of glue scripts and generic software (for example, spreadsheets, making it tedious for a human expert to inspect, curate and export the putative families produced by the workflow. Results We have developed LTRsift, an interactive graphical software tool for semi-automatic postprocessing of de novo predicted LTR retrotransposon annotations. Its user-friendly interface offers customizable filtering and classification functionality, displaying the putative candidate groups, their members and their internal structure in a hierarchical fashion. To ease manual work, it also supports graphical user interface-driven reassignment, splitting and further annotation of candidates. Export of grouped candidate sets in standard formats is possible. In two case studies, we demonstrate how LTRsift can be employed in the context of a genome-wide LTR retrotransposon survey effort. Conclusions LTRsift is a useful and convenient tool for semi-automated classification of newly detected LTR retrotransposons based on their internal features. Its efficient implementation allows for convenient and seamless filtering and classification in an integrated environment. Developed for life scientists, it is helpful in postprocessing and refining

  20. Genome-wide analysis of LTR-retrotransposons in oil palm.

    Science.gov (United States)

    Beulé, Thierry; Agbessi, Mawussé Dt; Dussert, Stephane; Jaligot, Estelle; Guyot, Romain

    2015-10-15

    The oil palm (Elaeis guineensis Jacq.) is a major cultivated crop and the world's largest source of edible vegetable oil. The genus Elaeis comprises two species E. guineensis, the commercial African oil palm and E. oleifera, which is used in oil palm genetic breeding. The recent publication of both the African oil palm genome assembly and the first draft sequence of its Latin American relative now allows us to tackle the challenge of understanding the genome composition, structure and evolution of these palm genomes through the annotation of their repeated sequences. In this study, we identified, annotated and compared Transposable Elements (TE) from the African and Latin American oil palms. In a first step, Transposable Element databases were built through de novo detection in both genome sequences then the TE content of both genomes was estimated. Then putative full-length retrotransposons with Long Terminal Repeats (LTRs) were further identified in the E. guineensis genome for characterization of their structural diversity, copy number and chromosomal distribution. Finally, their relative expression in several tissues was determined through in silico analysis of publicly available transcriptome data. Our results reveal a congruence in the transpositional history of LTR retrotransposons between E. oleifera and E. guineensis, especially the Sto-4 family. Also, we have identified and described 583 full-length LTR-retrotransposons in the Elaeis guineensis genome. Our work shows that these elements are most likely no longer mobile and that no recent insertion event has occurred. Moreover, the analysis of chromosomal distribution suggests a preferential insertion of Copia elements in gene-rich regions, whereas Gypsy elements appear to be evenly distributed throughout the genome. Considering the high proportion of LTR retrotransposon in the oil palm genome, our work will contribute to a greater understanding of their impact on genome organization and evolution

  1. Identification of an Internal Ribosome Entry Segment in the 5′ Region of the Mouse VL30 Retrotransposon and Its Use in the Development of Retroviral Vectors

    Science.gov (United States)

    López-Lastra, Marcelo; Ulrici, Sandrine; Gabus, Caroline; Darlix, Jean-Luc

    1999-01-01

    Mouse virus-like 30S RNAs (VL30m) constitute a family of retrotransposons, present at 100 to 200 copies, dispersed in the mouse genome. They display little sequence homology to Moloney murine leukemia virus (MoMLV), do not encode virus-like proteins, and have not been implicated in retroviral carcinogenesis. However, VL30 RNAs are efficiently packaged into MLV particles that are propagated in cell culture. In this study, we addressed whether the 5′ region of VL30m could replace the 5′ leader of MoMLV functionally in a recombinant vector construct. Our data confirm that the putative packaging sequence of VL30 is located within the 5′ region (nucleotides 362 to 1149 with respect to the cap structure) and that it can replace the packaging sequence of MoMLV. We also show that VL30m contains an internal ribosome entry segment (IRES) in the 5′ region, as do MoMLV, Friend murine leukemia virus, Harvey murine sarcoma virus, and avian reticuloendotheliosis virus type A. Our data show that both the packaging and IRES functions of the 5′ region of VL30m RNA can be efficiently used to develop retrotransposon-based vectors. PMID:10482590

  2. Identification of retrotransposon-like sequences in Iranian river buffalo

    African Journals Online (AJOL)

    ONOS

    2010-03-29

    % of a genome (Waterston et al., 2002). Mobile elements can be divided into two classes: Class I includes retrotransposons and class II includes DNA tran- sposons ... including dog, cat, horse, cattle, donkey, kangaroo, etc.

  3. Long Terminal Repeat Retrotransposon Content in Eight Diploid Sunflower Species Inferred from Next-Generation Sequence Data

    Science.gov (United States)

    Tetreault, Hannah M.; Ungerer, Mark C.

    2016-01-01

    The most abundant transposable elements (TEs) in plant genomes are Class I long terminal repeat (LTR) retrotransposons represented by superfamilies gypsy and copia. Amplification of these superfamilies directly impacts genome structure and contributes to differential patterns of genome size evolution among plant lineages. Utilizing short-read Illumina data and sequence information from a panel of Helianthus annuus (sunflower) full-length gypsy and copia elements, we explore the contribution of these sequences to genome size variation among eight diploid Helianthus species and an outgroup taxon, Phoebanthus tenuifolius. We also explore transcriptional dynamics of these elements in both leaf and bud tissue via RT-PCR. We demonstrate that most LTR retrotransposon sublineages (i.e., families) display patterns of similar genomic abundance across species. A small number of LTR retrotransposon sublineages exhibit lineage-specific amplification, particularly in the genomes of species with larger estimated nuclear DNA content. RT-PCR assays reveal that some LTR retrotransposon sublineages are transcriptionally active across all species and tissue types, whereas others display species-specific and tissue-specific expression. The species with the largest estimated genome size, H. agrestis, has experienced amplification of LTR retrotransposon sublineages, some of which have proliferated independently in other lineages in the Helianthus phylogeny. PMID:27233667

  4. Diaspora, a large family of Ty3-gypsy retrotransposons in Glycine max, is an envelope-less member of an endogenous plant retrovirus lineage.

    Science.gov (United States)

    Yano, Sho T; Panbehi, Bahman; Das, Arpita; Laten, Howard M

    2005-05-05

    The chromosomes of higher plants are littered with retrotransposons that, in many cases, constitute as much as 80% of plant genomes. Long terminal repeat retrotransposons have been especially successful colonizers of the chromosomes of higher plants and examinations of their function, evolution, and dispersal are essential to understanding the evolution of eukaryotic genomes. In soybean, several families of retrotransposons have been identified, including at least two that, by virtue of the presence of an envelope-like gene, may constitute endogenous retroviruses. However, most elements are highly degenerate and are often sequestered in regions of the genome that sequencing projects initially shun. In addition, finding potentially functional copies from genomic DNA is rare. This study provides a mechanism to surmount these issues to generate a consensus sequence that can then be functionally and phylogenetically evaluated. Diaspora is a multicopy member of the Ty3-gypsy-like family of LTR retrotransposons and comprises at least 0.5% of the soybean genome. Although the Diaspora family is highly degenerate, and with the exception of this report, is not represented in the Genbank nr database, a full-length consensus sequence was generated from short overlapping sequences using a combination of experimental and in silico methods. Diaspora is 11,737 bp in length and contains a single 1892-codon ORF that encodes a gag-pol polyprotein. Phylogenetic analysis indicates that it is closely related to Athila and Calypso retroelements from Arabidopsis and soybean, respectively. These in turn form the framework of an endogenous retrovirus lineage whose members possess an envelope-like gene. Diaspora appears to lack any trace of this coding region. A combination of empirical sequencing and retrieval of unannotated Genome Survey Sequence database entries was successfully used to construct a full-length representative of the Diaspora family in Glycine max. Diaspora is presently the

  5. Retrotransposons. An RNA polymerase III subunit determines sites of retrotransposon integration.

    Science.gov (United States)

    Bridier-Nahmias, Antoine; Tchalikian-Cosson, Aurélie; Baller, Joshua A; Menouni, Rachid; Fayol, Hélène; Flores, Amando; Saïb, Ali; Werner, Michel; Voytas, Daniel F; Lesage, Pascale

    2015-05-01

    Mobile genetic elements are ubiquitous. Their integration site influences genome stability and gene expression. The Ty1 retrotransposon of the yeast Saccharomyces cerevisiae integrates upstream of RNA polymerase III (Pol III)-transcribed genes, yet the primary determinant of target specificity has remained elusive. Here we describe an interaction between Ty1 integrase and the AC40 subunit of Pol III and demonstrate that AC40 is the predominant determinant targeting Ty1 integration upstream of Pol III-transcribed genes. Lack of an integrase-AC40 interaction dramatically alters target site choice, leading to a redistribution of Ty1 insertions in the genome, mainly to chromosome ends. The mechanism of target specificity allows Ty1 to proliferate and yet minimizes genetic damage to its host. Copyright © 2015, American Association for the Advancement of Science.

  6. Assessment of genetic variation for the LINE-1 retrotransposon from next generation sequence data

    Directory of Open Access Journals (Sweden)

    Ramos Kenneth

    2010-10-01

    Full Text Available Abstract Background In humans, copies of the Long Interspersed Nuclear Element 1 (LINE-1 retrotransposon comprise 21% of the reference genome, and have been shown to modulate expression and produce novel splice isoforms of transcripts from genes that span or neighbor the LINE-1 insertion site. Results In this work, newly released pilot data from the 1000 Genomes Project is analyzed to detect previously unreported full length insertions of the retrotransposon LINE-1. By direct analysis of the sequence data, we have identified 22 previously unreported LINE-1 insertion sites within the sequence data reported for a mother/father/daughter trio. Conclusions It is demonstrated here that next generation sequencing data, as well as emerging high quality datasets from individual genome projects allow us to assess the amount of heterogeneity with respect to the LINE-1 retrotransposon amongst humans, and provide us with a wealth of testable hypotheses as to the impact that this diversity may have on the health of individuals and populations.

  7. Effects of As2O3 on DNA methylation, genomic instability, and LTR retrotransposon polymorphism in Zea mays.

    Science.gov (United States)

    Erturk, Filiz Aygun; Aydin, Murat; Sigmaz, Burcu; Taspinar, M Sinan; Arslan, Esra; Agar, Guleray; Yagci, Semra

    2015-12-01

    Arsenic is a well-known toxic substance on the living organisms. However, limited efforts have been made to study its DNA methylation, genomic instability, and long terminal repeat (LTR) retrotransposon polymorphism causing properties in different crops. In the present study, effects of As2O3 (arsenic trioxide) on LTR retrotransposon polymorphism and DNA methylation as well as DNA damage in Zea mays seedlings were investigated. The results showed that all of arsenic doses caused a decreasing genomic template stability (GTS) and an increasing Random Amplified Polymorphic DNAs (RAPDs) profile changes (DNA damage). In addition, increasing DNA methylation and LTR retrotransposon polymorphism characterized a model to explain the epigenetically changes in the gene expression were also found. The results of this experiment have clearly shown that arsenic has epigenetic effect as well as its genotoxic effect. Especially, the increasing of polymorphism of some LTR retrotransposon under arsenic stress may be a part of the defense system against the stress.

  8. DIRS1-like retrotransposons are widely distributed among Decapoda and are particularly present in hydrothermal vent organisms

    Directory of Open Access Journals (Sweden)

    Bonnivard Eric

    2009-04-01

    Full Text Available Abstract Background Transposable elements are major constituents of eukaryote genomes and have a great impact on genome structure and stability. Considering their mutational abilities, TEs can contribute to the genetic diversity and evolution of organisms. Knowledge of their distribution among several genomes is an essential condition to study their dynamics and to better understand their role in species evolution. DIRS1-like retrotransposons are a particular group of retrotransposons according to their mode of transposition that implies a tyrosine recombinase. To date, they have been described in a restricted number of species in comparison with the LTR retrotransposons. In this paper, we determine the distribution of DIRS1-like elements among 25 decapod species, 10 of them living in hydrothermal vents that correspond to particularly unstable environments. Results Using PCR approaches, we have identified 15 new DIRS1-like families in 15 diverse decapod species (shrimps, lobsters, crabs and galatheid crabs. Hydrothermal organisms show a particularly great diversity of DIRS1-like elements with 5 families characterized among Alvinocarididae shrimps and 3 in the galatheid crab Munidopsis recta. Phylogenic analyses show that these elements are divergent toward the DIRS1-like families previously described in other crustaceans and arthropods and form a new clade called AlDIRS1. At larger scale, the distribution of DIRS1-like retrotransposons appears more or less patchy depending on the taxa considered. Indeed, a scattered distribution can be observed in the infraorder Brachyura whereas all the species tested in infraorders Caridea and Astacidea harbor some DIRS1-like elements. Conclusion Our results lead to nearly double both the number of DIRS1-like elements described to date, and the number of species known to harbor these ones. In this study, we provide the first degenerate primers designed to look specifically for DIRS1-like retrotransposons. They

  9. Genetic diversity of cultivated flax (Linum usitatissimum L.) germplasm assessed by retrotransposon-based markers.

    Science.gov (United States)

    Smýkal, P; Bačová-Kerteszová, N; Kalendar, R; Corander, J; Schulman, A H; Pavelek, M

    2011-05-01

    Retrotransposon segments were characterized and inter-retrotransposon amplified polymorphism (IRAP) markers developed for cultivated flax (Linum usitatissimum L.) and the Linum genus. Over 75 distinct long terminal repeat retrotransposon segments were cloned, the first set for Linum, and specific primers designed for them. IRAP was then used to evaluate genetic diversity among 708 accessions of cultivated flax comprising 143 landraces, 387 varieties, and 178 breeding lines. These included both traditional and modern, oil (86), fiber (351), and combined-use (271) accessions, originating from 36 countries, and 10 wild Linum species. The set of 10 most polymorphic primers yielded 141 reproducible informative data points per accession, with 52% polymorphism and a 0.34 Shannon diversity index. The maximal genetic diversity was detected among wild Linum species (100% IRAP polymorphism and 0.57 Jaccard similarity), while diversity within cultivated germplasm decreased from landraces (58%, 0.63) to breeding lines (48%, 0.85) and cultivars (50%, 0.81). Application of Bayesian methods for clustering resulted in the robust identification of 20 clusters of accessions, which were unstratified according to origin or user type. This indicates an overlap in genetic diversity despite disruptive selection for fiber versus oil types. Nevertheless, eight clusters contained high proportions (70-100%) of commercial cultivars, whereas two clusters were rich (60%) in landraces. These findings provide a basis for better flax germplasm management, core collection establishment, and exploration of diversity in breeding, as well as for exploration of the role of retrotransposons in flax genome dynamics.

  10. Links between human LINE-1 retrotransposons and hepatitis virus-related hepatocellular carcinoma

    Science.gov (United States)

    Honda, Tomoyuki

    2016-05-01

    Hepatocellular carcinoma (HCC) accounts for approximately 80% of liver cancers, the third most frequent cause of cancer mortality. The most prevalent risk factors for HCC are infections by hepatitis B or hepatitis C virus. Findings suggest that hepatitis virus-related HCC might be a cancer in which LINE-1 retrotransposons, often termed L1, activity plays a potential role. Firstly, hepatitis viruses can suppress host defense factors that also control L1 mobilization. Secondly, many recent studies also have indicated that hypomethylation of L1 affects the prognosis of HCC patients. Thirdly, endogenous L1 retrotransposition was demonstrated to activate oncogenic pathways in HCC. Fourthly, several L1 chimeric transcripts with host or viral genes are found in hepatitis virus-related HCC. Such lines of evidence suggest a linkage between L1 retrotransposons and hepatitis virus-related HCC. Here, I briefly summarize current understandings of the association between hepatitis virus-related HCC and L1. Then, I discuss potential mechanisms of how hepatitis viruses drive the development of HCC via L1 retrotransposons. An increased understanding of the contribution of L1 to hepatitis virus-related HCC may provide unique insights related to the development of novel therapeutics for this disease.

  11. L1 retrotransposition is activated by Ten-eleven-translocation protein 1 and repressed by methyl-CpG binding proteins.

    Science.gov (United States)

    Zhang, Peng; Ludwig, Anne K; Hastert, Florian D; Rausch, Cathia; Lehmkuhl, Anne; Hellmann, Ines; Smets, Martha; Leonhardt, Heinrich; Cardoso, M Cristina

    2017-09-03

    One of the major functions of DNA methylation is the repression of transposable elements, such as the long-interspersed nuclear element 1 (L1). The underlying mechanism(s), however, are unclear. Here, we addressed how retrotransposon activation and mobilization are regulated by methyl-cytosine modifying ten-eleven-translocation (Tet) proteins and how this is modulated by methyl-CpG binding domain (MBD) proteins. We show that Tet1 activates both, endogenous and engineered L1 retrotransposons. Furthermore, we found that Mecp2 and Mbd2 repress Tet1-mediated activation of L1 by preventing 5hmC formation at the L1 promoter. Finally, we demonstrate that the methyl-CpG binding domain, as well as the adjacent non-sequence specific DNA binding domain of Mecp2 are each sufficient to mediate repression of Tet1-induced L1 mobilization. Our study reveals a mechanism how L1 elements get activated in the absence of Mecp2 and suggests that Tet1 may contribute to Mecp2/Mbd2-deficiency phenotypes, such as the Rett syndrome. We propose that the balance between methylation "reader" and "eraser/writer" controls L1 retrotransposition.

  12. Ulysses transposable element of Drosophila shows high structural similarities to functional domains of retroviruses.

    Science.gov (United States)

    Evgen'ev, M B; Corces, V G; Lankenau, D H

    1992-06-05

    We have determined the DNA structure of the Ulysses transposable element of Drosophila virilis and found that this transposon is 10,653 bp and is flanked by two unusually large direct repeats 2136 bp long. Ulysses shows the characteristic organization of LTR-containing retrotransposons, with matrix and capsid protein domains encoded in the first open reading frame. In addition, Ulysses contains protease, reverse transcriptase, RNase H and integrase domains encoded in the second open reading frame. Ulysses lacks a third open reading frame present in some retrotransposons that could encode an env-like protein. A dendrogram analysis based on multiple alignments of the protease, reverse transcriptase, RNase H, integrase and tRNA primer binding site of all known Drosophila LTR-containing retrotransposon sequences establishes a phylogenetic relationship of Ulysses to other retrotransposons and suggests that Ulysses belongs to a new family of this type of elements.

  13. Recurrent emergence of structural variants of LTR retrotransposon CsRn1 evolving novel expression strategy and their selective expansion in a carcinogenic liver fluke, Clonorchis sinensis.

    Science.gov (United States)

    Kim, Seon-Hee; Kong, Yoon; Bae, Young-An

    2017-06-01

    Autonomous retrotransposons, in which replication and transcription are coupled, encode the essential gag and pol genes as a fusion or separate overlapping form(s) that are expressed in single transcripts regulated by a common upstream promoter. The element-specific expression strategies have driven development of relevant translational recoding mechanisms including ribosomal frameshifting to satisfy the protein stoichiometry critical for the assembly of infectious virus-like particles. Retrotransposons with different recoding strategies exhibit a mosaic distribution pattern across the diverse families of reverse transcribing elements, even though their respective distributions are substantially skewed towards certain family groups. However, only a few investigations to date have focused on the emergence of retrotransposons evolving novel expression strategy and causal genetic drivers of the structural variants. In this study, the bulk of genomic and transcribed sequences of a Ty3/gypsy-like CsRn1 retrotransposon in Clonorchis sinensis were analyzed for the comprehensive examination of its expression strategy. Our results demonstrated that structural variants with single open reading frame (ORF) have recurrently emerged from precedential CsRn1 copies encoding overlapping gag-pol ORFs by a single-nucleotide insertion in an upstream region of gag stop codon. In the parasite genome, some of the newly evolved variants appeared to undergo proliferative burst as active master lineages together with their ancestral copies. The genetic event was similarly observed in Opisthorchis viverrini, the closest neighbor of C. sinensis, whereas the resulting structural variants might have failed to overcome purifying selection and comprised minor remnant copies in the Opisthorchis genome. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Forward and reverse genetics: The LORE1 retrotransposon insertion mutants

    DEFF Research Database (Denmark)

    Fukai, Eigo; Malolepszy, Anna; Sandal, Niels Nørgaard

    2014-01-01

    The endogenous Lotus retrotransposon 1 (LORE1) transposes in the germ line of Lotus japonicus plants that carry an active element. This feature of LORE1 has been exploited for generation of a large non-transgenic insertion mutant population, where insertions have been annotated using next-generat...

  15. Different histories of two highly variable LTR retrotransposons in sunflower species.

    Science.gov (United States)

    Mascagni, Flavia; Cavallini, Andrea; Giordani, Tommaso; Natali, Lucia

    2017-11-15

    In the Helianthus genus, very large intra- and interspecific variability related to two specific retrotransposons of Helianthus annuus (Helicopia and SURE) exists. When comparing these two sequences to sunflower sequence databases recently produced by our lab, the Helicopia family was shown to belong to the Maximus/SIRE lineage of the Sirevirus genus of the Copia superfamily, whereas the SURE element (whose superfamily was not even previously identified) was classified as a Gypsy element of the Ogre/Tat lineage of the Metavirus genus. Bioinformatic analysis of the two retrotransposon families revealed their genomic abundance and relative proliferation timing. The genomic abundance of these families differed significantly among 12 Helianthus species. The ratio between the abundance of long terminal repeats and their reverse transcriptases suggested that the SURE family has relatively more solo long terminal repeats than does Helicopia. Pairwise comparisons of Illumina reads encoding the reverse transcriptase domain indicated that SURE amplification may have occurred more recently than that of Helicopia. Finally, the analysis of population structure based on the SURE and Helicopia polymorphisms of 32 Helianthus species evidenced two subpopulations, which roughly corresponded to species of the Helianthus and Divaricati/Ciliares sections. However, a number of species showed an admixed structure, confirming the importance of interspecific hybridisation in the evolution of this genus. In general, these two retrotransposon families differentially contributed to interspecific variability, emphasising the need to refer to specific families when studying genome evolution. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. Plant centromeric retrotransposons: a structural and cytogenetic perspective

    Czech Academy of Sciences Publication Activity Database

    Neumann, Pavel; Navrátilová, Alice; Koblížková, Andrea; Kejnovský, Eduard; Hřibová, Eva; Hobza, Roman; Widmer, A.; Doležel, Jaroslav; Macas, Jiří

    2011-01-01

    Roč. 2, č. 4 (2011), s. 1-16 ISSN 1759-8753 R&D Projects: GA AV ČR KJB500960802; GA MŠk(CZ) LC06004; GA ČR GA522/09/0083 Institutional research plan: CEZ:AV0Z50510513; CEZ:AV0Z50040507; CEZ:AV0Z50040702; CEZ:AV0Z50380511 Keywords : plant chromosomes * retrotransposons * cytogenetic perspective Subject RIV: EB - Genetics ; Molecular Biology

  17. Structural characterization of copia-type retrotransposons leads to insights into the marker development in a biofuel crop, Jatropha curcas L.

    Science.gov (United States)

    2013-01-01

    Background Recently, Jatropha curcas L. has attracted worldwide attention for its potential as a source of biodiesel. However, most DNA markers have demonstrated high levels of genetic similarity among and within jatropha populations around the globe. Despite promising features of copia-type retrotransposons as ideal genetic tools for gene tagging, mutagenesis, and marker-assisted selection, they have not been characterized in the jatropha genome yet. Here, we examined the diversity, evolution, and genome-wide organization of copia-type retrotransposons in the Asian, African, and Mesoamerican accessions of jatropha, then introduced a retrotransposon-based marker for this biofuel crop. Results In total, 157 PCR fragments that were amplified using the degenerate primers for the reverse transcriptase (RT) domain of copia-type retroelements were sequenced and aligned to construct the neighbor-joining tree. Phylogenetic analysis demonstrated that isolated copia RT sequences were classified into ten families, which were then grouped into three lineages. An in-depth study of the jatropha genome for the RT sequences of each family led to the characterization of full consensus sequences of the jatropha copia-type families. Estimated copy numbers of target sequences were largely different among families, as was presence of genes within 5 kb flanking regions for each family. Five copia-type families were as appealing candidates for the development of DNA marker systems. A candidate marker from family Jc7 was particularly capable of detecting genetic variation among different jatropha accessions. Fluorescence in situ hybridization (FISH) to metaphase chromosomes reveals that copia-type retrotransposons are scattered across chromosomes mainly located in the distal part regions. Conclusion This is the first report on genome-wide analysis and the cytogenetic mapping of copia-type retrotransposons of jatropha, leading to the discovery of families bearing high potential as DNA

  18. Retrotransposon-Based Molecular Markers for Analysis of Genetic Diversity within the Genus Linum

    Science.gov (United States)

    Melnikova, Nataliya V.; Kudryavtseva, Anna V.; Zelenin, Alexander V.; Lakunina, Valentina A.; Yurkevich, Olga Yu.; Speranskaya, Anna S.; Dmitriev, Alexey A.; Krinitsina, Anastasia A.; Belenikin, Maxim S.; Uroshlev, Leonid A.; Snezhkina, Anastasiya V.; Sadritdinova, Asiya F.; Koroban, Nadezda V.; Amosova, Alexandra V.; Samatadze, Tatiana E.; Guzenko, Elena V.; Lemesh, Valentina A.; Savilova, Anastasya M.; Rachinskaia, Olga A.; Kishlyan, Natalya V.; Rozhmina, Tatiana A.; Bolsheva, Nadezhda L.; Muravenko, Olga V.

    2014-01-01

    SSAP method was used to study the genetic diversity of 22 Linum species from sections Linum, Adenolinum, Dasylinum, Stellerolinum, and 46 flax cultivars. All the studied flax varieties were distinguished using SSAP for retrotransposons FL9 and FL11. Thus, the validity of SSAP method was demonstrated for flax marking, identification of accessions in genebank collections, and control during propagation of flax varieties. Polymorphism of Fl1a, Fl1b, and Cassandra insertions were very low in flax varieties, but these retrotransposons were successfully used for the investigation of Linum species. Species clusterization based on SSAP markers was in concordance with their taxonomic division into sections Dasylinum, Stellerolinum, Adenolinum, and Linum. All species of sect. Adenolinum clustered apart from species of sect. Linum. The data confirmed the accuracy of the separation in these sections. Members of section Linum are not as closely related as members of other sections, so taxonomic revision of this section is desirable. L. usitatissimum accessions genetically distant from modern flax cultivars were revealed in our work. These accessions are of utmost interest for flax breeding and introduction of new useful traits into flax cultivars. The chromosome localization of Cassandra retrotransposon in Linum species was determined. PMID:25243121

  19. Identification of a non-LTR retrotransposon from the gypsy moth

    Science.gov (United States)

    K.J. Garner; J.M. Slavicek

    1999-01-01

    A family of highly repetitive elements, named LDT1, has been identified in the gypsy moth, Lymantria dispar. The complete element is 5.4 kb in length and lacks long-terminal repeats, The element contains two open reading frames with a significant amino acid sequence similarity to several non-LTR retrotransposons. The first open reading frame contains...

  20. Ancient Origin of the U2 Small Nuclear RNA Gene-Targeting Non-LTR Retrotransposons Utopia.

    Science.gov (United States)

    Kojima, Kenji K; Jurka, Jerzy

    2015-01-01

    Most non-long terminal repeat (non-LTR) retrotransposons encoding a restriction-like endonuclease show target-specific integration into repetitive sequences such as ribosomal RNA genes and microsatellites. However, only a few target-specific lineages of non-LTR retrotransposons are distributed widely and no lineage is found across the eukaryotic kingdoms. Here we report the most widely distributed lineage of target sequence-specific non-LTR retrotransposons, designated Utopia. Utopia is found in three supergroups of eukaryotes: Amoebozoa, SAR, and Opisthokonta. Utopia is inserted into a specific site of U2 small nuclear RNA genes with different strength of specificity for each family. Utopia families from oomycetes and wasps show strong target specificity while only a small number of Utopia copies from reptiles are flanked with U2 snRNA genes. Oomycete Utopia families contain an "archaeal" RNase H domain upstream of reverse transcriptase (RT), which likely originated from a plant RNase H gene. Analysis of Utopia from oomycetes indicates that multiple lineages of Utopia have been maintained inside of U2 genes with few copy numbers. Phylogenetic analysis of RT suggests the monophyly of Utopia, and it likely dates back to the early evolution of eukaryotes.

  1. Retrotransposon-associated long non-coding RNAs in mice and men

    Czech Academy of Sciences Publication Activity Database

    Ganesh, Sravya; Svoboda, Petr

    2016-01-01

    Roč. 468, č. 6 (2016), s. 1049-1060 ISSN 0031-6768 R&D Projects: GA ČR(CZ) GBP305/12/G034; GA MŠk LO1419 EU Projects: European Commission 647403; European Commission 607720 Institutional support: RVO:68378050 Keywords : lncRNA * Retrotransposon * line * sine * ltr * MaLR Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.156, year: 2016

  2. A 5-methylcytosine DNA glycosylase/lyase demethylates the retrotransposon Tos17 and promotes its transposition in rice

    KAUST Repository

    La, Honggui; Ding, Bo; Mishra, Gyan Prakash; Zhou, Bo; Yang, Hongmei; Bellizzi, Maria Del Rosario; Chen, Songbiao; Meyers, Blake C.; Peng, Zhaohua; Zhu, Jian-Kang; Wang, Guoliang

    2011-01-01

    DNA 5-methylcytosine (5-meC) is an important epigenetic mark for transcriptional gene silencing in many eukaryotes. In Arabidopsis, 5-meC DNA glycosylase/lyases actively remove 5-meC to counter-act transcriptional gene silencing in a locus-specific manner, and have been suggested to maintain the expression of transposons. However, it is unclear whether plant DNA demethylases can promote the transposition of transposons. Here we report the functional characterization of the DNA glycosylase/lyase DNG701 in rice. DNG701 encodes a large (1,812 amino acid residues) DNA glycosylase domain protein. Recombinant DNG701 protein showed 5-meC DNA glycosylase and lyase activities in vitro. Knockout or knockdown of DNG701 in rice plants led to DNA hypermethylation and reduced expression of the retrotransposon Tos17. Tos17 showed less transposition in calli derived from dng701 knockout mutant seeds compared with that in wild-type calli. Overexpression of DNG701 in both rice calli and transgenic plants substantially reduced DNA methylation levels of Tos17 and enhanced its expression. The overexpression also led to more frequent transposition of Tos17 in calli. Our results demonstrate that rice DNG701 is a 5-meC DNA glycosylase/lyase responsible for the demethylation of Tos17 and this DNA demethylase plays a critical role in promoting Tos17 transposition in rice calli.

  3. A 5-methylcytosine DNA glycosylase/lyase demethylates the retrotransposon Tos17 and promotes its transposition in rice

    KAUST Repository

    La, Honggui

    2011-09-06

    DNA 5-methylcytosine (5-meC) is an important epigenetic mark for transcriptional gene silencing in many eukaryotes. In Arabidopsis, 5-meC DNA glycosylase/lyases actively remove 5-meC to counter-act transcriptional gene silencing in a locus-specific manner, and have been suggested to maintain the expression of transposons. However, it is unclear whether plant DNA demethylases can promote the transposition of transposons. Here we report the functional characterization of the DNA glycosylase/lyase DNG701 in rice. DNG701 encodes a large (1,812 amino acid residues) DNA glycosylase domain protein. Recombinant DNG701 protein showed 5-meC DNA glycosylase and lyase activities in vitro. Knockout or knockdown of DNG701 in rice plants led to DNA hypermethylation and reduced expression of the retrotransposon Tos17. Tos17 showed less transposition in calli derived from dng701 knockout mutant seeds compared with that in wild-type calli. Overexpression of DNG701 in both rice calli and transgenic plants substantially reduced DNA methylation levels of Tos17 and enhanced its expression. The overexpression also led to more frequent transposition of Tos17 in calli. Our results demonstrate that rice DNG701 is a 5-meC DNA glycosylase/lyase responsible for the demethylation of Tos17 and this DNA demethylase plays a critical role in promoting Tos17 transposition in rice calli.

  4. Linking maternal and somatic 5S rRNA types with different sequence-specific non-LTR retrotransposons.

    Science.gov (United States)

    Locati, Mauro D; Pagano, Johanna F B; Ensink, Wim A; van Olst, Marina; van Leeuwen, Selina; Nehrdich, Ulrike; Zhu, Kongju; Spaink, Herman P; Girard, Geneviève; Rauwerda, Han; Jonker, Martijs J; Dekker, Rob J; Breit, Timo M

    2017-04-01

    5S rRNA is a ribosomal core component, transcribed from many gene copies organized in genomic repeats. Some eukaryotic species have two 5S rRNA types defined by their predominant expression in oogenesis or adult tissue. Our next-generation sequencing study on zebrafish egg, embryo, and adult tissue identified maternal-type 5S rRNA that is exclusively accumulated during oogenesis, replaced throughout the embryogenesis by a somatic-type, and thus virtually absent in adult somatic tissue. The maternal-type 5S rDNA contains several thousands of gene copies on chromosome 4 in tandem repeats with small intergenic regions, whereas the somatic-type is present in only 12 gene copies on chromosome 18 with large intergenic regions. The nine-nucleotide variation between the two 5S rRNA types likely affects TFIII binding and riboprotein L5 binding, probably leading to storage of maternal-type rRNA. Remarkably, these sequence differences are located exactly at the sequence-specific target site for genome integration by the 5S rRNA-specific Mutsu retrotransposon family. Thus, we could define maternal- and somatic-type MutsuDr subfamilies. Furthermore, we identified four additional maternal-type and two new somatic-type MutsuDr subfamilies, each with their own target sequence. This target-site specificity, frequently intact maternal-type retrotransposon elements, plus specific presence of Mutsu retrotransposon RNA and piRNA in egg and adult tissue, suggest an involvement of retrotransposons in achieving the differential copy number of the two types of 5S rDNA loci. © 2017 Locati et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  5. Comparative genomic analysis reveals multiple long terminal repeats, lineage-specific amplification, and frequent interelement recombination for Cassandra retrotransposon in pear (Pyrus bretschneideri Rehd.).

    Science.gov (United States)

    Yin, Hao; Du, Jianchang; Li, Leiting; Jin, Cong; Fan, Lian; Li, Meng; Wu, Jun; Zhang, Shaoling

    2014-06-04

    Cassandra transposable elements belong to a specific group of terminal-repeat retrotransposons in miniature (TRIM). Although Cassandra TRIM elements have been found in almost all vascular plants, detailed investigations on the nature, abundance, amplification timeframe, and evolution have not been performed in an individual genome. We therefore conducted a comprehensive analysis of Cassandra retrotransposons using the newly sequenced pear genome along with four other Rosaceae species, including apple, peach, mei, and woodland strawberry. Our data reveal several interesting findings for this particular retrotransposon family: 1) A large number of the intact copies contain three, four, or five long terminal repeats (LTRs) (∼20% in pear); 2) intact copies and solo LTRs with or without target site duplications are both common (∼80% vs. 20%) in each genome; 3) the elements exhibit an overall unbiased distribution among the chromosomes; 4) the elements are most successfully amplified in pear (5,032 copies); and 5) the evolutionary relationships of these elements vary among different lineages, species, and evolutionary time. These results indicate that Cassandra retrotransposons contain more complex structures (elements with multiple LTRs) than what we have known previously, and that frequent interelement unequal recombination followed by transposition may play a critical role in shaping and reshaping host genomes. Thus this study provides insights into the property, propensity, and molecular mechanisms governing the formation and amplification of Cassandra retrotransposons, and enhances our understanding of the structural variation, evolutionary history, and transposition process of LTR retrotransposons in plants. © The Author(s) 2014. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  6. Insertion of a solo LTR retrotransposon associates with spur mutations in 'Red Delicious' apple (Malus × domestica).

    Science.gov (United States)

    Han, Mengxue; Sun, Qibao; Zhou, Junyong; Qiu, Huarong; Guo, Jing; Lu, Lijuan; Mu, Wenlei; Sun, Jun

    2017-09-01

    Insertion of a solo LTR, which possesses strong bidirectional, stem-specific promoter activities, is associated with the evolution of a dwarfing apple spur mutation. Spur mutations in apple scions revolutionized global apple production. Since long terminal repeat (LTR) retrotransposons are tightly related to natural mutations, inter-retrotransposon-amplified polymorphism technique and genome walking were used to find sequences in the apple genome based on these LTRs. In 'Red Delicious' spur mutants, a novel, 2190-bp insertion was identified as a spur-specific, solo LTR (sLTR) located at the 1038th nucleotide of another sLTR, which was 1536 bp in length. This insertion-within-an-insertion was localized within a preexisting Gypsy-50 retrotransposon at position 3,762,767 on chromosome 4. The analysis of transcriptional activity of the two sLTRs (the 2190- and 1536-bp inserts) indicated that the 2190-bp sLTR is a promoter, capable of bidirectional transcription. GUS expression in the 2190-bp-sense and 2190-bp-antisense transgenic lines was prominent in stems. In contrast, no promoter activity from either the sense or the antisense strand of the 1536-bp sLTR was detected. From ~150 kb of DNA on each side of the 2190 bp, sLTR insertion site, corresponding to 300 kb of the 'Golden Delicious' genome, 23 genes were predicted. Ten genes had predicted functions that could affect shoot development. This first report, of a sLTR insertion associated with the evolution of apple spur mutation, will facilitate apple breeding, cloning of spur-related genes, and discovery of mechanisms behind dwarf habit.

  7. Identification and chromosomal localization of the monkey retrotransposon in Mesa sp

    Czech Academy of Sciences Publication Activity Database

    Balint-Kurti, P.; Clendennen, S.; Doleželová, Marie; Valárik, Miroslav; Doležel, Jaroslav; Beetham, G. M.

    2000-01-01

    Roč. 263, č. 6 (2000), s. 908-915 ISSN 0026-8925 R&D Projects: GA ČR GV521/96/K117; GA AV ČR IAA5020803; GA MŠk ME 376 Institutional research plan: CEZ:AV0Z5038910 Keywords : In situ hybridization * chromosomal localization * monkey retrotransposon Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.462, year: 2000

  8. LTR retrotransposon dynamics in the evolution of the olive (Olea europaea) genome

    Czech Academy of Sciences Publication Activity Database

    Barghini, E.; Natali, L.; Giordani, T.; Cossu, R.M.; Scalabrin, S.; Cattonaro, F.; Šimková, Hana; Vrána, Jan; Doležel, Jaroslav; Morgante, M.; Cavallini, A.

    2015-01-01

    Roč. 22, č. 1 (2015), s. 91-100 ISSN 1340-2838 R&D Projects: GA ČR GBP501/12/G090; GA MŠk(CZ) LO1204 Institutional support: RVO:61389030 Keywords : LTR retrotransposons * next-generation sequencing * olive Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 5.267, year: 2015

  9. Altruistic functions for selfish DNA.

    Science.gov (United States)

    Faulkner, Geoffrey J; Carninci, Piero

    2009-09-15

    Mammalian genomes are comprised of 30-50% transposed elements (TEs). The vast majority of these TEs are truncated and mutated fragments of retrotransposons that are no longer capable of transposition. Although initially regarded as important factors in the evolution of gene regulatory networks, TEs are now commonly perceived as neutrally evolving and non-functional genomic elements. In a major development, recent works have strongly contradicted this "selfish DNA" or "junk DNA" dogma by demonstrating that TEs use a host of novel promoters to generate RNA on a massive scale across most eukaryotic cells. This transcription frequently functions to control the expression of protein-coding genes via alternative promoters, cis regulatory non protein-coding RNAs and the formation of double stranded short RNAs. If considered in sum, these findings challenge the designation of TEs as selfish and neutrally evolving genomic elements. Here, we will expand upon these themes and discuss challenges in establishing novel TE functions in vivo.

  10. Efficient DNA Fingerprinting Based on the Targeted Sequencing of Active Retrotransposon Insertion Sites Using a Bench-Top High-Throughput Sequencing Platform

    OpenAIRE

    Monden, Yuki; Yamamoto, Ayaka; Shindo, Akiko; Tahara, Makoto

    2014-01-01

    In many crop species, DNA fingerprinting is required for the precise identification of cultivars to protect the rights of breeders. Many families of retrotransposons have multiple copies throughout the eukaryotic genome and their integrated copies are inherited genetically. Thus, their insertion polymorphisms among cultivars are useful for DNA fingerprinting. In this study, we conducted a DNA fingerprinting based on the insertion polymorphisms of active retrotransposon families (Rtsp-1 and LI...

  11. Development of an efficient retrotransposon-based fingerprinting method for rapid pea variety identification.

    Science.gov (United States)

    Smýkal, Petr

    2006-01-01

    Fast and efficient DNA fingerprinting of crop cultivars and individuals is frequently used in both theoretical population genetics and in practical breeding. Numerous DNA marker technologies exist and the ratio of speed, cost and accuracy are of importance. Therefore even in species where highly accurate and polymorphic marker systems are available, such as microsatellite SSR (simple sequence repeats), also alternative methods may be of interest. Thanks to their high abundance and ubiquity, temporary mobile retrotransposable elements come into recent focus. Their properties, such as genome wide distribution and well-defined origin of individual insertions by descent, predetermine them for use as molecular markers. In this study, several Ty3-gypsy type retrotransposons have been developed and adopted for the inter-retrotransposon amplified polymorphism (IRAP) method, which is suitable for fast and efficient pea cultivar fingerprinting. The method can easily distinguish even between genetically closely related pea cultivars and provide high polymorphic information content (PIC) in a single PCR analysis.

  12. Recent expansion of heat-activated retrotransposons in the coral symbiont Symbiodinium microadriaticum

    KAUST Repository

    Chen, Jit Ern

    2017-10-20

    Rising sea surface temperature is the main cause of global coral reef decline. Abnormally high temperatures trigger the breakdown of the symbiotic association between corals and their photosynthetic symbionts in the genus Symbiodinium. Higher genetic variation resulting from shorter generation times has previously been proposed to provide increased adaptability to Symbiodinium compared to the host. Retrotransposition is a significant source of genetic variation in eukaryotes and some transposable elements are specifically expressed under adverse environmental conditions. We present transcriptomic and phylogenetic evidence for the existence of heat stress-activated Ty1-copia-type LTR retrotransposons in the coral symbiont Symbiodinium microadriaticum. Genome-wide analyses of emergence patterns of these elements further indicate recent expansion events in the genome of S. microadriaticum. Our findings suggest that acute temperature increases can activate specific retrotransposons in the Symbiodinium genome with potential impacts on the rate of retrotransposition and the generation of genetic variation under heat stress.The ISME Journal advance online publication, 20 October 2017; doi:10.1038/ismej.2017.179.

  13. A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons

    Czech Academy of Sciences Publication Activity Database

    Steinbauerová, Veronika; Neumann, Pavel; Novák, Petr; Macas, Jiří

    2011-01-01

    Roč. 139, 11-12 (2011), s. 1543-1555 ISSN 0016-6707 Institutional research plan: CEZ:AV0Z50510513 Keywords : Additional ORFs * LTR retrotransposons * Repetitive DNA * Plant genome Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 2.148, year: 2011

  14. Comparative studies of the endonucleases from two related Xenopus laevis retrotransposons, Tx1L and Tx2L: target site specificity and evolutionary implications.

    Science.gov (United States)

    Christensen, S; Pont-Kingdon, G; Carroll, D

    2000-01-01

    In the genome of the South African frog, Xenopus laevis, there are two complex families of transposable elements, Tx1 and Tx2, that have identical overall structures, but distinct sequences. In each family there are approximately 1500 copies of an apparent DNA-based element (Tx1D and Tx2D). Roughly 10% of these elements in each family are interrupted by a non-LTR retrotransposon (Tx1L and Tx2L). Each retrotransposon is flanked by a 23-bp target duplication of a specific D element sequence. In earlier work, we showed that the endonuclease domain (Tx1L EN) located in the second open reading frame (ORF2) of Tx1L encodes a protein that makes a single-strand cut precisely at the expected site within its target sequence, supporting the idea that Tx1L is a site-specific retrotransposon. In this study, we express the endonuclease domain of Tx2L (Tx2L EN) and compare the target preferences of the two enzymes. Each endonuclease shows some preference for its cognate target, on the order of 5-fold over the non-cognate target. The observed discrimination is not sufficient, however, to explain the observation that no cross-occupancy is observed - that is, L elements of one family have never been found within D elements of the other family. Possible sources of additional specificity are discussed. We also compare two hypotheses regarding the genome duplication event that led to the contemporary pseudotetraploid character of Xenopus laevis in light of the Tx1L and Tx2L data.

  15. Young, intact and nested retrotransposons are abundant in the onion and asparagus genomes.

    Science.gov (United States)

    Vitte, C; Estep, M C; Leebens-Mack, J; Bennetzen, J L

    2013-09-01

    Although monocotyledonous plants comprise one of the two major groups of angiosperms and include >65 000 species, comprehensive genome analysis has been focused mainly on the Poaceae (grass) family. Due to this bias, most of the conclusions that have been drawn for monocot genome evolution are based on grasses. It is not known whether these conclusions apply to many other monocots. To extend our understanding of genome evolution in the monocots, Asparagales genomic sequence data were acquired and the structural properties of asparagus and onion genomes were analysed. Specifically, several available onion and asparagus bacterial artificial chromosomes (BACs) with contig sizes >35 kb were annotated and analysed, with a particular focus on the characterization of long terminal repeat (LTR) retrotransposons. The results reveal that LTR retrotransposons are the major components of the onion and garden asparagus genomes. These elements are mostly intact (i.e. with two LTRs), have mainly inserted within the past 6 million years and are piled up into nested structures. Analysis of shotgun genomic sequence data and the observation of two copies for some transposable elements (TEs) in annotated BACs indicates that some families have become particularly abundant, as high as 4-5 % (asparagus) or 3-4 % (onion) of the genome for the most abundant families, as also seen in large grass genomes such as wheat and maize. Although previous annotations of contiguous genomic sequences have suggested that LTR retrotransposons were highly fragmented in these two Asparagales genomes, the results presented here show that this was largely due to the methodology used. In contrast, this current work indicates an ensemble of genomic features similar to those observed in the Poaceae.

  16. The Sinbad retrotransposon from the genome of the human blood fluke, Schistosoma mansoni, and the distribution of related Pao-like elements

    Directory of Open Access Journals (Sweden)

    Morales Maria E

    2005-02-01

    Full Text Available Abstract Background Of the major families of long terminal repeat (LTR retrotransposons, the Pao/BEL family is probably the least well studied. It is becoming apparent that numerous LTR retrotransposons and other mobile genetic elements have colonized the genome of the human blood fluke, Schistosoma mansoni. Results A proviral form of Sinbad, a new LTR retrotransposon, was identified in the genome of S. mansoni. Phylogenetic analysis indicated that Sinbad belongs to one of five discreet subfamilies of Pao/BEL like elements. BLAST searches of whole genomes and EST databases indicated that members of this clade occurred in species of the Insecta, Nematoda, Echinodermata and Chordata, as well as Platyhelminthes, but were absent from all plants, fungi and lower eukaryotes examined. Among the deuterostomes examined, only aquatic species harbored these types of elements. All four species of nematode examined were positive for Sinbad sequences, although among insect and vertebrate genomes, some were positive and some negative. The full length, consensus Sinbad retrotransposon was 6,287 bp long and was flanked at its 5'- and 3'-ends by identical LTRs of 386 bp. Sinbad displayed a triple Cys-His RNA binding motif characteristic of Gag of Pao/BEL-like elements, followed by the enzymatic domains of protease, reverse transcriptase (RT, RNAseH, and integrase, in that order. A phylogenetic tree of deduced RT sequences from 26 elements revealed that Sinbad was most closely related to an unnamed element from the zebrafish Danio rerio and to Saci-1, also from S. mansoni. It was also closely related to Pao from Bombyx mori and to Ninja of Drosophila simulans. Sinbad was only distantly related to the other schistosome LTR retrotransposons Boudicca, Gulliver, Saci-2, Saci-3, and Fugitive, which are gypsy-like. Southern hybridization and bioinformatics analyses indicated that there were about 50 copies of Sinbad in the S. mansoni genome. The presence of ESTs

  17. Identification and characterization of argonaute protein, Ago2 and its associated small RNAs in Schistosoma japonicum.

    Directory of Open Access Journals (Sweden)

    Pengfei Cai

    Full Text Available BACKGROUND: The complex life cycle of the genus Schistosoma drives the parasites to employ subtle developmentally dependent gene regulatory machineries. Small non-coding RNAs (sncRNAs are essential gene regulatory factors that, through their impact on mRNA and genome stability, control stage-specific gene expression. Abundant sncRNAs have been identified in this genus. However, their functionally associated partners, Argonaute family proteins, which are the key components of the RNA-induced silencing complex (RISC, have not yet been fully explored. METHODOLOGY/PRINCIPAL FINDINGS: Two monoclonal antibodies (mAbs specific to Schistosoma japonicum Argonaute protein Ago2 (SjAgo2, but not SjAgo1 and SjAgo3, were generated. Soluble adult worm antigen preparation (SWAP was subjected to immunoprecipitation with the mAbs and the captured SjAgo2 protein was subsequently confirmed by Western blot and mass spectrometry (MS analysis. The small RNA population associated with native SjAgo2 in adult parasites was extracted from the immunoprecipitated complex and subjected to library construction. High-through-put sequencing of these libraries yielded a total of ≈50 million high-quality reads. Classification of these small RNAs showed that endogenous siRNAs (endo-siRNAs generated from transposable elements (TEs, especially from the subclasses of LINE and LTR, were prominent. Further bioinformatics analysis revealed that siRNAs derived from ten types of well-defined retrotransposons were dramatically enriched in the SjAgo2-specific libraries compared to small RNA libraries constructed with total small RNAs from separated adult worms. These results suggest that a key function of SjAgo2 is to maintain genome stability through suppressing the activities of retrotransposons. CONCLUSIONS/SIGNIFICANCE: In this study, we identified and characterized one of the three S. japonicum Argonautes, SjAgo2, and its associated small RNAs were found to be predominantly derived

  18. Retrotransposon silencing by DNA methylation can drive mammalian genomic imprinting.

    Directory of Open Access Journals (Sweden)

    Shunsuke Suzuki

    2007-04-01

    Full Text Available Among mammals, only eutherians and marsupials are viviparous and have genomic imprinting that leads to parent-of-origin-specific differential gene expression. We used comparative analysis to investigate the origin of genomic imprinting in mammals. PEG10 (paternally expressed 10 is a retrotransposon-derived imprinted gene that has an essential role for the formation of the placenta of the mouse. Here, we show that an orthologue of PEG10 exists in another therian mammal, the marsupial tammar wallaby (Macropus eugenii, but not in a prototherian mammal, the egg-laying platypus (Ornithorhynchus anatinus, suggesting its close relationship to the origin of placentation in therian mammals. We have discovered a hitherto missing link of the imprinting mechanism between eutherians and marsupials because tammar PEG10 is the first example of a differentially methylated region (DMR associated with genomic imprinting in marsupials. Surprisingly, the marsupial DMR was strictly limited to the 5' region of PEG10, unlike the eutherian DMR, which covers the promoter regions of both PEG10 and the adjacent imprinted gene SGCE. These results not only demonstrate a common origin of the DMR-associated imprinting mechanism in therian mammals but provide the first demonstration that DMR-associated genomic imprinting in eutherians can originate from the repression of exogenous DNA sequences and/or retrotransposons by DNA methylation.

  19. Regulation of rice root development by a retrotransposon acting as a microRNA sponge.

    Science.gov (United States)

    Cho, Jungnam; Paszkowski, Jerzy

    2017-08-26

    It is well documented that transposable elements (TEs) can regulate the expression of neighbouring genes. However, their ability to act in trans and influence ectopic loci has been reported rarely. We searched in rice transcriptomes for tissue-specific expression of TEs and found them to be regulated developmentally. They often shared sequence homology with co-expressed genes and contained potential microRNA-binding sites, which suggested possible contributions to gene regulation. In fact, we have identified a retrotransposon that is highly transcribed in roots and whose spliced transcript constitutes a target mimic for miR171. miR171 destabilizes mRNAs encoding the root-specific family of SCARECROW-Like transcription factors. We demonstrate that retrotransposon-derived transcripts act as decoys for miR171, triggering its degradation and thus results in the root-specific accumulation of SCARECROW-Like mRNAs. Such transposon-mediated post-transcriptional control of miR171 levels is conserved in diverse rice species.

  20. The role of retrotransposons in gene family expansions in the human and mouse genomes

    Czech Academy of Sciences Publication Activity Database

    Janoušek, Václav; Laukaitis, C. M.; Yanchukov, Alexey; Karn, R. C.

    2016-01-01

    Roč. 8, č. 9 (2016), s. 2632-2650 ISSN 1759-6653 R&D Projects: GA MŠk EE2.3.20.0303 Institutional support: RVO:68081766 Keywords : gene families * transposable elements * retrotransposons * LINE * LTR * SINE Subject RIV: EB - Genetics ; Molecular Biology Impact factor: 3.979, year: 2016

  1. Complete sequence of Tvv1, a family of Ty 1 copia-like retrotransposons of Vitis vinifera L., reconstituted by chromosome walking.

    Science.gov (United States)

    Pelsy, F.; Merdinoglu, D.

    2002-09-01

    A chromosome-walking strategy was used to sequence and characterize retrotransposons in the grapevine genome. The reconstitution of a family of retroelements, named Tvv1, was achieved by six successive steps. These elements share a single, highly conserved open reading frame 4,153 nucleotides-long, putatively encoding the gag, pro, int, rt and rh proteins. Comparison of the Tvv1 open reading frame coding potential with those of drosophila copia and tobacco Tnt1, revealed that Tvv1 is closely related to Ty 1 copia-like retrotransposons. A highly variable untranslated leader region, upstream of the open reading frame, allowed us to differentiate Tvv1 variants, which represent a family of at least 28 copies, in varying sizes. This internal region is flanked by two long terminal repeats in direct orientation, sized between 149 and 157 bp. Among elements theoretically sized from 4,970 to 5,550 bp, we describe the full-length sequence of a reference element Tvv1-1, 5,343 nucleotides-long. The full-length sequence of Tvv1-1 compared to pea PDR1 shows a 53.3% identity. In addition, both elements contain long terminal repeats of nearly the same size in which the U5 region could be entirely absent. Therefore, we assume that Tvv1 and PDR1 could constitute a particular class of short LTRs retroelements.

  2. Genome-wide analysis of LTR-retrotransposon diversity and its impact on the evolution of the genus Helianthus (L.).

    Science.gov (United States)

    Mascagni, Flavia; Giordani, Tommaso; Ceccarelli, Marilena; Cavallini, Andrea; Natali, Lucia

    2017-08-18

    Genome divergence by mobile elements activity and recombination is a continuous process that plays a key role in the evolution of species. Nevertheless, knowledge on retrotransposon-related variability among species belonging to the same genus is still limited. Considering the importance of the genus Helianthus, a model system for studying the ecological genetics of speciation and adaptation, we performed a comparative analysis of the repetitive genome fraction across ten species and one subspecies of sunflower, focusing on long terminal repeat retrotransposons at superfamily, lineage and sublineage levels. After determining the relative genome size of each species, genomic DNA was isolated and subjected to Illumina sequencing. Then, different assembling and clustering approaches allowed exploring the repetitive component of all genomes. On average, repetitive DNA in Helianthus species represented more than 75% of the genome, being composed mostly by long terminal repeat retrotransposons. Also, the prevalence of Gypsy over Copia superfamily was observed and, among lineages, Chromovirus was by far the most represented. Although nearly all the same sublineages are present in all species, we found considerable variability in the abundance of diverse retrotransposon lineages and sublineages, especially between annual and perennial species. This large variability should indicate that different events of amplification or loss related to these elements occurred following species separation and should have been involved in species differentiation. Our data allowed us inferring on the extent of interspecific repetitive DNA variation related to LTR-RE abundance, investigating the relationship between changes of LTR-RE abundance and the evolution of the genus, and determining the degree of coevolution of different LTR-RE lineages or sublineages between and within species. Moreover, the data suggested that LTR-RE abundance in a species was affected by the annual or perennial

  3. Low levels of LTR retrotransposon deletion by ectopic recombination in the gigantic genomes of salamanders.

    Science.gov (United States)

    Frahry, Matthew Blake; Sun, Cheng; Chong, Rebecca A; Mueller, Rachel Lockridge

    2015-02-01

    Across the tree of life, species vary dramatically in nuclear genome size. Mutations that add or remove sequences from genomes-insertions or deletions, or indels-are the ultimate source of this variation. Differences in the tempo and mode of insertion and deletion across taxa have been proposed to contribute to evolutionary diversity in genome size. Among vertebrates, most of the largest genomes are found within the salamanders, an amphibian clade with genome sizes ranging from ~14 to ~120 Gb. Salamander genomes have been shown to experience slower rates of DNA loss through small (i.e., genomes. However, no studies have addressed DNA loss from salamander genomes resulting from larger deletions. Here, we focus on one type of large deletion-ectopic-recombination-mediated removal of LTR retrotransposon sequences. In ectopic recombination, double-strand breaks are repaired using a "wrong" (i.e., ectopic, or non-allelic) template sequence-typically another locus of similar sequence. When breaks occur within the LTR portions of LTR retrotransposons, ectopic-recombination-mediated repair can produce deletions that remove the internal transposon sequence and the equivalent of one of the two LTR sequences. These deletions leave a signature in the genome-a solo LTR sequence. We compared levels of solo LTRs in the genomes of four salamander species with levels present in five vertebrates with smaller genomes. Our results demonstrate that salamanders have low levels of solo LTRs, suggesting that ectopic-recombination-mediated deletion of LTR retrotransposons occurs more slowly than in other vertebrates with smaller genomes.

  4. Retrotransposon Proliferation Coincident with the Evolution of Dioecy in Asparagus.

    Science.gov (United States)

    Harkess, Alex; Mercati, Francesco; Abbate, Loredana; McKain, Michael; Pires, J Chris; Sala, Tea; Sunseri, Francesco; Falavigna, Agostino; Leebens-Mack, Jim

    2016-09-08

    Current phylogenetic sampling reveals that dioecy and an XY sex chromosome pair evolved once, or possibly twice, in the genus Asparagus Although there appear to be some lineage-specific polyploidization events, the base chromosome number of 2n = 2× = 20 is relatively conserved across the Asparagus genus. Regardless, dioecious species tend to have larger genomes than hermaphroditic species. Here, we test whether this genome size expansion in dioecious species is related to a polyploidization and subsequent chromosome fusion, or to retrotransposon proliferation in dioecious species. We first estimate genome sizes, or use published values, for four hermaphrodites and four dioecious species distributed across the phylogeny, and show that dioecious species typically have larger genomes than hermaphroditic species. Utilizing a phylogenomic approach, we find no evidence for ancient polyploidization contributing to increased genome sizes of sampled dioecious species. We do find support for an ancient whole genome duplication (WGD) event predating the diversification of the Asparagus genus. Repetitive DNA content of the four hermaphroditic and four dioecious species was characterized based on randomly sampled whole genome shotgun sequencing, and common elements were annotated. Across our broad phylogenetic sampling, Ty-1 Copia retroelements, in particular, have undergone a marked proliferation in dioecious species. In the absence of a detectable WGD event, retrotransposon proliferation is the most likely explanation for the precipitous increase in genome size in dioecious Asparagus species. Copyright © 2016 Harkess et al.

  5. Expression of protein-coding genes embedded in ribosomal DNA

    DEFF Research Database (Denmark)

    Johansen, Steinar D; Haugen, Peik; Nielsen, Henrik

    2007-01-01

    Ribosomal DNA (rDNA) is a specialised chromosomal location that is dedicated to high-level transcription of ribosomal RNA genes. Interestingly, rDNAs are frequently interrupted by parasitic elements, some of which carry protein genes. These are non-LTR retrotransposons and group II introns that e...... in the nucleolus....

  6. Protein function prediction using neighbor relativity in protein-protein interaction network.

    Science.gov (United States)

    Moosavi, Sobhan; Rahgozar, Masoud; Rahimi, Amir

    2013-04-01

    There is a large gap between the number of discovered proteins and the number of functionally annotated ones. Due to the high cost of determining protein function by wet-lab research, function prediction has become a major task for computational biology and bioinformatics. Some researches utilize the proteins interaction information to predict function for un-annotated proteins. In this paper, we propose a novel approach called "Neighbor Relativity Coefficient" (NRC) based on interaction network topology which estimates the functional similarity between two proteins. NRC is calculated for each pair of proteins based on their graph-based features including distance, common neighbors and the number of paths between them. In order to ascribe function to an un-annotated protein, NRC estimates a weight for each neighbor to transfer its annotation to the unknown protein. Finally, the unknown protein will be annotated by the top score transferred functions. We also investigate the effect of using different coefficients for various types of functions. The proposed method has been evaluated on Saccharomyces cerevisiae and Homo sapiens interaction networks. The performance analysis demonstrates that NRC yields better results in comparison with previous protein function prediction approaches that utilize interaction network. Copyright © 2012 Elsevier Ltd. All rights reserved.

  7. PwRn1, a novel Ty3/gypsy-like retrotransposon of Paragonimus westermani: molecular characters and its differentially preserved mobile potential according to host chromosomal polyploidy

    Directory of Open Access Journals (Sweden)

    Kong Yoon

    2008-10-01

    Full Text Available Abstract Background Retrotransposons have been known to involve in the remodeling and evolution of host genome. These reverse transcribing elements, which show a complex evolutionary pathway with diverse intermediate forms, have been comprehensively analyzed from a wide range of host genomes, while the information remains limited to only a few species in the phylum Platyhelminthes. Results A LTR retrotransposon and its homologs with a strong phylogenetic affinity toward CsRn1 of Clonorchis sinensis were isolated from a trematode parasite Paragonimus westermani via a degenerate PCR method and from an insect species Anopheles gambiae by in silico analysis of the whole mosquito genome, respectively. These elements, designated PwRn1 and AgCR-1 – AgCR-14 conserved unique features including a t-RNATrp primer binding site and the unusual CHCC signature of Gag proteins. Their flanking LTRs displayed >97% nucleotide identities and thus, these elements were likely to have expanded recently in the trematode and insect genomes. They evolved heterogeneous expression strategies: a single fused ORF, two separate ORFs with an identical reading frame and two ORFs overlapped by -1 frameshifting. Phylogenetic analyses suggested that the elements with the separate ORFs had evolved from an ancestral form(s with the overlapped ORFs. The mobile potential of PwRn1 was likely to be maintained differentially in association with the karyotype of host genomes, as was examined by the presence/absence of intergenomic polymorphism and mRNA transcripts. Conclusion Our results on the structural diversity of CsRn1-like elements can provide a molecular tool to dissect a more detailed evolutionary episode of LTR retrotransposons. The PwRn1-associated genomic polymorphism, which is substantial in diploids, will also be informative in addressing genomic diversification following inter-/intra-specific hybridization in P. westermani populations.

  8. Topology-function conservation in protein-protein interaction networks.

    Science.gov (United States)

    Davis, Darren; Yaveroğlu, Ömer Nebil; Malod-Dognin, Noël; Stojmirovic, Aleksandar; Pržulj, Nataša

    2015-05-15

    Proteins underlay the functioning of a cell and the wiring of proteins in protein-protein interaction network (PIN) relates to their biological functions. Proteins with similar wiring in the PIN (topology around them) have been shown to have similar functions. This property has been successfully exploited for predicting protein functions. Topological similarity is also used to guide network alignment algorithms that find similarly wired proteins between PINs of different species; these similarities are used to transfer annotation across PINs, e.g. from model organisms to human. To refine these functional predictions and annotation transfers, we need to gain insight into the variability of the topology-function relationships. For example, a function may be significantly associated with specific topologies, while another function may be weakly associated with several different topologies. Also, the topology-function relationships may differ between different species. To improve our understanding of topology-function relationships and of their conservation among species, we develop a statistical framework that is built upon canonical correlation analysis. Using the graphlet degrees to represent the wiring around proteins in PINs and gene ontology (GO) annotations to describe their functions, our framework: (i) characterizes statistically significant topology-function relationships in a given species, and (ii) uncovers the functions that have conserved topology in PINs of different species, which we term topologically orthologous functions. We apply our framework to PINs of yeast and human, identifying seven biological process and two cellular component GO terms to be topologically orthologous for the two organisms. © The Author 2015. Published by Oxford University Press.

  9. Functionality of system components: Conservation of protein function in protein feature space

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Ussery, David; Brunak, Søren

    2003-01-01

    well on organisms other than the one on which it was trained. We evaluate the performance of such a method, ProtFun, which relies on protein features as its sole input, and show that the method gives similar performance for most eukaryotes and performs much better than anticipated on archaea......Many protein features useful for prediction of protein function can be predicted from sequence, including posttranslational modifications, subcellular localization, and physical/chemical properties. We show here that such protein features are more conserved among orthologs than paralogs, indicating...... they are crucial for protein function and thus subject to selective pressure. This means that a function prediction method based on sequence-derived features may be able to discriminate between proteins with different function even when they have highly similar structure. Also, such a method is likely to perform...

  10. Genome-wide LORE1 retrotransposon mutagenesis and high-throughput insertion detection in Lotus japonicus

    DEFF Research Database (Denmark)

    Urbanski, Dorian Fabian; Malolepszy, Anna; Stougaard, Jens

    2012-01-01

    Insertion mutants facilitate functional analysis of genes, but for most plant species it has been difficult to identify a suitable mutagen and to establish large populations for reverse genetics. The main challenge is developing efficient high-throughput procedures for both mutagenesis and insert......Insertion mutants facilitate functional analysis of genes, but for most plant species it has been difficult to identify a suitable mutagen and to establish large populations for reverse genetics. The main challenge is developing efficient high-throughput procedures for both mutagenesis...... plants. The identified insertions showed that the endogenous LORE1 retrotransposon is well suited for insertion mutagenesis due to its homogenous gene targeting and exonic insertion preference. Since LORE1 transposition occurs in the germline, harvesting seeds from a single founder line and cultivating...... progeny generates a complete mutant population. This ease of LORE1 mutagenesis combined with the efficient FSTpoolit protocol, which exploits 2D pooling, Illumina sequencing, and automated data analysis, allows highly cost-efficient development of a comprehensive reverse genetic resource....

  11. Protein-protein interaction network-based detection of functionally similar proteins within species.

    Science.gov (United States)

    Song, Baoxing; Wang, Fen; Guo, Yang; Sang, Qing; Liu, Min; Li, Dengyun; Fang, Wei; Zhang, Deli

    2012-07-01

    Although functionally similar proteins across species have been widely studied, functionally similar proteins within species showing low sequence similarity have not been examined in detail. Identification of these proteins is of significant importance for understanding biological functions, evolution of protein families, progression of co-evolution, and convergent evolution and others which cannot be obtained by detection of functionally similar proteins across species. Here, we explored a method of detecting functionally similar proteins within species based on graph theory. After denoting protein-protein interaction networks using graphs, we split the graphs into subgraphs using the 1-hop method. Proteins with functional similarities in a species were detected using a method of modified shortest path to compare these subgraphs and to find the eligible optimal results. Using seven protein-protein interaction networks and this method, some functionally similar proteins with low sequence similarity that cannot detected by sequence alignment were identified. By analyzing the results, we found that, sometimes, it is difficult to separate homologous from convergent evolution. Evaluation of the performance of our method by gene ontology term overlap showed that the precision of our method was excellent. Copyright © 2012 Wiley Periodicals, Inc.

  12. Functional aspects of protein flexibility

    DEFF Research Database (Denmark)

    Teilum, Kaare; Olsen, Johan G; Kragelund, Birthe B

    2009-01-01

    this into an intuitive perception of protein function is challenging. Flexibility is of overwhelming importance for protein function, and the changes in protein structure during interactions with binding partners can be dramatic. The present review addresses protein flexibility, focusing on protein-ligand interactions...

  13. The Alu neurodegeneration hypothesis: A primate-specific mechanism for neuronal transcription noise, mitochondrial dysfunction, and manifestation of neurodegenerative disease.

    Science.gov (United States)

    Larsen, Peter A; Lutz, Michael W; Hunnicutt, Kelsie E; Mihovilovic, Mirta; Saunders, Ann M; Yoder, Anne D; Roses, Allen D

    2017-07-01

    It is hypothesized that retrotransposons have played a fundamental role in primate evolution and that enhanced neurologic retrotransposon activity in humans may underlie the origin of higher cognitive function. As a potential consequence of this enhanced activity, it is likely that neurons are susceptible to deleterious retrotransposon pathways that can disrupt mitochondrial function. An example is observed in the TOMM40 gene, encoding a β-barrel protein critical for mitochondrial preprotein transport. Primate-specific Alu retrotransposons have repeatedly inserted into TOMM40 introns, and at least one variant associated with late-onset Alzheimer's disease originated from an Alu insertion event. We provide evidence of enriched Alu content in mitochondrial genes and postulate that Alus can disrupt mitochondrial populations in neurons, thereby setting the stage for progressive neurologic dysfunction. This Alu neurodegeneration hypothesis is compatible with decades of research and offers a plausible mechanism for the disruption of neuronal mitochondrial homeostasis, ultimately cascading into neurodegenerative disease. Copyright © 2017 The Authors. Published by Elsevier Inc. All rights reserved.

  14. A LTR copia retrotransposon and Mutator transposons interrupt Pgip genes in cultivated and wild wheats.

    Science.gov (United States)

    Di Giovanni, Michela; Cenci, Alberto; Janni, Michela; D'Ovidio, Renato

    2008-04-01

    Polygalacturonase-inhibiting proteins (PGIPs) are leucine-rich repeat (LRR) proteins involved in plant defence. Wheat pgip genes have been isolated from the B (Tapgip1) and D (Tapgip2) genomes, and now we report the identification of pgip genes from the A genomes of wild and cultivated wheats. By Southern blots and sequence analysis of BAC clones we demonstrated that wheat contains a single copy pgip gene per genome and the one from the A genome, pgip3, is inactivated by the insertion of a long terminal repeat copia retrotranspon within the fourth LRR. We demonstrated also that this retrotransposon insertion is present in Triticum urartu and all the polyploidy wheats assayed, but is absent in T. monococcum (Tmpgip3), suggesting that this insertion took place after the divergence between T. monococcum and T. urartu, but before the formation of the polyploid wheats. We identified also two independent insertion events of new Class II transposable elements, Vacuna, belonging to the Mutator superfamily, that interrupted the Tdipgip1 gene of T. turgidum ssp. dicoccoides. The occurrence of these transposons within the coding region of Tdipgip1 facilitated the mapping of the Pgip locus in the pericentric region of the short arm of chromosome group 7. We speculate that the inactivation of pgip genes are tolerated because of redundancy of PGIP activities in the wheat genome.

  15. Isolation and characterization of reverse transcriptase fragments of LTR retrotransposons from the genome of Chenopodium quinoa (Amaranthaceae).

    Science.gov (United States)

    Kolano, Bozena; Bednara, Edyta; Weiss-Schneeweiss, Hanna

    2013-10-01

    High heterogeneity was observed among conserved domains of reverse transcriptase ( rt ) isolated from quinoa. Only one Ty1- copia rt was highly amplified. Reverse transcriptase sequences were located predominantly in pericentromeric region of quinoa chromosomes. The heterogeneity, genomic abundance, and chromosomal distribution of reverse transcriptase (rt)-coding fragments of Ty1-copia and Ty3-gypsy long terminal repeat retrotransposons were analyzed in the Chenopodium quinoa genome. Conserved domains of the rt gene were amplified and characterized using degenerate oligonucleotide primer pairs. Sequence analyses indicated that half of Ty1-copia rt (51 %) and 39 % of Ty3-gypsy rt fragments contained intact reading frames. High heterogeneity among rt sequences was observed for both Ty1-copia and Ty3-gypsy rt amplicons, with Ty1-copia more heterogeneous than Ty3-gypsy. Most of the isolated rt fragments were present in quinoa genome in low copy numbers, with only one highly amplified Ty1-copia rt sequence family. The gypsy-like RNase H fragments co-amplified with Ty1-copia-degenerate primers were shown to be highly amplified in the quinoa genome indicating either higher abundance of some gypsy families of which rt domains could not be amplified, or independent evolution of this gypsy-region in quinoa. Both Ty1-copia and Ty3-gypsy retrotransposons were preferentially located in pericentromeric heterochromatin of quinoa chromosomes. Phylogenetic analyses of newly amplified rt fragments together with well-characterized retrotransposon families from other organisms allowed identification of major lineages of retroelements in the genome of quinoa and provided preliminary insight into their evolutionary dynamics.

  16. Coevolution between a family of parasite virulence effectors and a class of LINE-1 retrotransposons.

    Directory of Open Access Journals (Sweden)

    Soledad Sacristán

    2009-10-01

    Full Text Available Parasites are able to evolve rapidly and overcome host defense mechanisms, but the molecular basis of this adaptation is poorly understood. Powdery mildew fungi (Erysiphales, Ascomycota are obligate biotrophic parasites infecting nearly 10,000 plant genera. They obtain their nutrients from host plants through specialized feeding structures known as haustoria. We previously identified the AVR(k1 powdery mildew-specific gene family encoding effectors that contribute to the successful establishment of haustoria. Here, we report the extensive proliferation of the AVR(k1 gene family throughout the genome of B. graminis, with sequences diverging in formae speciales adapted to infect different hosts. Also, importantly, we have discovered that the effectors have coevolved with a particular family of LINE-1 retrotransposons, named TE1a. The coevolution of these two entities indicates a mutual benefit to the association, which could ultimately contribute to parasite adaptation and success. We propose that the association would benefit 1 the powdery mildew fungus, by providing a mechanism for amplifying and diversifying effectors and 2 the associated retrotransposons, by providing a basis for their maintenance through selection in the fungal genome.

  17. Architectures and Functional Coverage of Protein-Protein Interfaces

    Science.gov (United States)

    Tuncbag, Nurcan; Gursoy, Attila; Guney, Emre; Nussinov, Ruth; Keskin, Ozlem

    2008-01-01

    The diverse range of cellular functions is performed by a limited number of protein folds existing in nature. One may similarly expect that cellular functional diversity would be covered by a limited number of protein-protein interface architectures. Here, we present 8205 interface clusters, each representing unique interface architecture. This dataset of protein-protein interfaces is analyzed and compared with older datasets. We observe that the number of both biological and crystal interfaces increase significantly compared to the number of PDB entries. Further, we find that the number of distinct interface architectures grows at a much faster rate than the number of folds and is yet to level off. We further analyze the growth trend of the functional coverage by constructing functional interaction networks from interfaces. The functional coverage is also found to steadily increase. Interestingly, we also observe that despite the diversity of interface architectures, some are more favorable and frequently used, and of particular interest, those are the ones which are also preferred in single chains. PMID:18620705

  18. Efficient DNA fingerprinting based on the targeted sequencing of active retrotransposon insertion sites using a bench-top high-throughput sequencing platform.

    Science.gov (United States)

    Monden, Yuki; Yamamoto, Ayaka; Shindo, Akiko; Tahara, Makoto

    2014-10-01

    In many crop species, DNA fingerprinting is required for the precise identification of cultivars to protect the rights of breeders. Many families of retrotransposons have multiple copies throughout the eukaryotic genome and their integrated copies are inherited genetically. Thus, their insertion polymorphisms among cultivars are useful for DNA fingerprinting. In this study, we conducted a DNA fingerprinting based on the insertion polymorphisms of active retrotransposon families (Rtsp-1 and LIb) in sweet potato. Using 38 cultivars, we identified 2,024 insertion sites in the two families with an Illumina MiSeq sequencing platform. Of these insertion sites, 91.4% appeared to be polymorphic among the cultivars and 376 cultivar-specific insertion sites were identified, which were converted directly into cultivar-specific sequence-characterized amplified region (SCAR) markers. A phylogenetic tree was constructed using these insertion sites, which corresponded well with known pedigree information, thereby indicating their suitability for genetic diversity studies. Thus, the genome-wide comparative analysis of active retrotransposon insertion sites using the bench-top MiSeq sequencing platform is highly effective for DNA fingerprinting without any requirement for whole genome sequence information. This approach may facilitate the development of practical polymerase chain reaction-based cultivar diagnostic system and could also be applied to the determination of genetic relationships. © The Author 2014. Published by Oxford University Press on behalf of Kazusa DNA Research Institute.

  19. Egg-specific expression of protein with DNA methyltransferase activity in the biocarcinogenic liver fluke Clonorchis sinensis.

    Science.gov (United States)

    Kim, Seon-Hee; Cho, Hye-Jeong; Sohn, Woon-Mok; Ahn, Chun-Seob; Kong, Yoon; Yang, Hyun-Jong; Bae, Young-An

    2015-08-01

    Despite recent reports regarding the biology of cytosine methylation in Schistosoma mansoni, the impact of the regulatory machinery remains unclear in diverse platyhelminthes. This ambiguity is reinforced by discoveries of DNA methyltransferase 2 (DNMT2)-only organisms and the substrate specificity of DNMT2 preferential to RNA molecules. Here, we characterized a novel DNA methyltransferase, named CsDNMT2, in a liver fluke Clonorchis sinensis. The protein exhibited structural properties conserved in other members of the DNMT2 family. The native and recombinant CsDNMT2 exhibited considerable enzymatic activity on DNA. The spatiotemporal expression of CsDNMT2 mirrored that of 5-methylcytosine (5 mC), both of which were elevated in the C. sinensis eggs. However, CsDNMT2 and 5 mC were marginally detected in other histological regions of C. sinensis adults including ovaries and seminal receptacle. The methylation site seemed not related to genomic loci occupied by progenies of an active long-terminal-repeat retrotransposon. Taken together, our data strongly suggest that C. sinensis has preserved the functional DNA methylation machinery and that DNMT2 acts as a genuine alternative to DNMT1/DNMT3 to methylate DNA in the DNMT2-only organism. The epigenetic regulation would target functional genes primarily involved in the formation and/or maturation of eggs, rather than retrotransposons.

  20. Protein kinase substrate identification on functional protein arrays

    Directory of Open Access Journals (Sweden)

    Zhou Fang

    2008-02-01

    Full Text Available Abstract Background Over the last decade, kinases have emerged as attractive therapeutic targets for a number of different diseases, and numerous high throughput screening efforts in the pharmaceutical community are directed towards discovery of compounds that regulate kinase function. The emerging utility of systems biology approaches has necessitated the development of multiplex tools suitable for proteomic-scale experiments to replace lower throughput technologies such as mass spectroscopy for the study of protein phosphorylation. Recently, a new approach for identifying substrates of protein kinases has applied the miniaturized format of functional protein arrays to characterize phosphorylation for thousands of candidate protein substrates in a single experiment. This method involves the addition of protein kinases in solution to arrays of immobilized proteins to identify substrates using highly sensitive radioactive detection and hit identification algorithms. Results To date, the factors required for optimal performance of protein array-based kinase substrate identification have not been described. In the current study, we have carried out a detailed characterization of the protein array-based method for kinase substrate identification, including an examination of the effects of time, buffer compositions, and protein concentration on the results. The protein array approach was compared to standard solution-based assays for assessing substrate phosphorylation, and a correlation of greater than 80% was observed. The results presented here demonstrate how novel substrates for protein kinases can be quickly identified from arrays containing thousands of human proteins to provide new clues to protein kinase function. In addition, a pooling-deconvolution strategy was developed and applied that enhances characterization of specific kinase-substrate relationships and decreases reagent consumption. Conclusion Functional protein microarrays are an

  1. Canola/rapeseed protein-functionality and nutrition

    Directory of Open Access Journals (Sweden)

    Wanasundara Janitha P.D.

    2016-07-01

    Full Text Available Protein rich meal is a valuable co-product of canola/rapeseed oil extraction. Seed storage proteins that include cruciferin (11S and napin (2S dominate the protein complement of canola while oleosins, lipid transfer proteins and other minor proteins of non-storage nature are also found. Although oil-free canola meal contains 36–40% protein on a dry weight basis, non-protein components including fibre, polymeric phenolics, phytates and sinapine, etc. of the seed coat and cellular components make protein less suitable for food use. Separation of canola protein from non-protein components is a technical challenge but necessary to obtain full nutritional and functional potential of protein. Process conditions of raw material and protein preparation are critical of nutritional and functional value of the final protein product. The storage proteins of canola can satisfy many nutritional and functional requirements for food applications. Protein macromolecules of canola also provide functionalities required in applications beyond edible uses; there exists substantial potential as a source of plant protein and a renewable biopolymer. Available information at present is mostly based on the protein products that can be obtained as mixtures of storage protein types and other chemical constituents of the seed; therefore, full potential of canola storage proteins is yet to be revealed.

  2. DNA methylation inhibits expression and transposition of the Neurospora Tad retrotransposon.

    Science.gov (United States)

    Zhou, Y; Cambareri, E B; Kinsey, J A

    2001-06-01

    Tad is a LINE-like retrotransposon of the filamentous fungus Neurospora crassa. We have analyzed both expression and transposition of this element using strains with a single copy of Tad located in the 5' noncoding sequences of the am (glutamate dehydrogenase) gene. Tad in this position has been shown to carry a de novo cytosine methylation signal which causes reversible methylation of both Tad and am upstream sequences. Here we find that methylation of the Tad sequences inhibits both Tad expression and transposition. This inhibition can be relieved by the use of 5-azacytidine, a drug which reduces cytosine methylation, or by placing the Tad/am sequences in a dim-2 genetic background.

  3. Origins of Protein Functions in Cells

    Science.gov (United States)

    Seelig, Burchard; Pohorille, Andrzej

    2011-01-01

    In modern organisms proteins perform a majority of cellular functions, such as chemical catalysis, energy transduction and transport of material across cell walls. Although great strides have been made towards understanding protein evolution, a meaningful extrapolation from contemporary proteins to their earliest ancestors is virtually impossible. In an alternative approach, the origin of water-soluble proteins was probed through the synthesis and in vitro evolution of very large libraries of random amino acid sequences. In combination with computer modeling and simulations, these experiments allow us to address a number of fundamental questions about the origins of proteins. Can functionality emerge from random sequences of proteins? How did the initial repertoire of functional proteins diversify to facilitate new functions? Did this diversification proceed primarily through drawing novel functionalities from random sequences or through evolution of already existing proto-enzymes? Did protein evolution start from a pool of proteins defined by a frozen accident and other collections of proteins could start a different evolutionary pathway? Although we do not have definitive answers to these questions yet, important clues have been uncovered. In one example (Keefe and Szostak, 2001), novel ATP binding proteins were identified that appear to be unrelated in both sequence and structure to any known ATP binding proteins. One of these proteins was subsequently redesigned computationally to bind GTP through introducing several mutations that introduce targeted structural changes to the protein, improve its binding to guanine and prevent water from accessing the active center. This study facilitates further investigations of individual evolutionary steps that lead to a change of function in primordial proteins. In a second study (Seelig and Szostak, 2007), novel enzymes were generated that can join two pieces of RNA in a reaction for which no natural enzymes are known

  4. Functional assignment to JEV proteins using SVM.

    Science.gov (United States)

    Sahoo, Ganesh Chandra; Dikhit, Manas Ranjan; Das, Pradeep

    2008-01-01

    Identification of different protein functions facilitates a mechanistic understanding of Japanese encephalitis virus (JEV) infection and opens novel means for drug development. Support vector machines (SVM), useful for predicting the functional class of distantly related proteins, is employed to ascribe a possible functional class to Japanese encephalitis virus protein. Our study from SVMProt and available JE virus sequences suggests that structural and nonstructural proteins of JEV genome possibly belong to diverse protein functions, are expected to occur in the life cycle of JE virus. Protein functions common to both structural and non-structural proteins are iron-binding, metal-binding, lipid-binding, copper-binding, transmembrane, outer membrane, channels/Pores - Pore-forming toxins (proteins and peptides) group of proteins. Non-structural proteins perform functions like actin binding, zinc-binding, calcium-binding, hydrolases, Carbon-Oxygen Lyases, P-type ATPase, proteins belonging to major facilitator family (MFS), secreting main terminal branch (MTB) family, phosphotransfer-driven group translocators and ATP-binding cassette (ABC) family group of proteins. Whereas structural proteins besides belonging to same structural group of proteins (capsid, structural, envelope), they also perform functions like nuclear receptor, antibiotic resistance, RNA-binding, DNA-binding, magnesium-binding, isomerase (intra-molecular), oxidoreductase and participate in type II (general) secretory pathway (IISP).

  5. Protein domain recurrence and order can enhance prediction of protein functions

    KAUST Repository

    Abdel Messih, Mario A.

    2012-09-07

    Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been developed to infer the protein functions based on either the sequences or domains of proteins. The existing methods, however, ignore the recurrence and the order of the protein domains in this function inference. Results: We developed two new methods to infer protein functions based on protein domain recurrence and domain order. Our first method, DRDO, calculates the posterior probability of the Gene Ontology terms based on domain recurrence and domain order information, whereas our second method, DRDO-NB, relies on the nave Bayes methodology using the same domain architecture information. Our large-scale benchmark comparisons show strong improvements in the accuracy of the protein function inference achieved by our new methods, demonstrating that domain recurrence and order can provide important information for inference of protein functions. The Author(s) 2012. Published by Oxford University Press.

  6. Biases in the experimental annotations of protein function and their effect on our understanding of protein function space.

    Directory of Open Access Journals (Sweden)

    Alexandra M Schnoes

    Full Text Available The ongoing functional annotation of proteins relies upon the work of curators to capture experimental findings from scientific literature and apply them to protein sequence and structure data. However, with the increasing use of high-throughput experimental assays, a small number of experimental studies dominate the functional protein annotations collected in databases. Here, we investigate just how prevalent is the "few articles - many proteins" phenomenon. We examine the experimentally validated annotation of proteins provided by several groups in the GO Consortium, and show that the distribution of proteins per published study is exponential, with 0.14% of articles providing the source of annotations for 25% of the proteins in the UniProt-GOA compilation. Since each of the dominant articles describes the use of an assay that can find only one function or a small group of functions, this leads to substantial biases in what we know about the function of many proteins. Mass-spectrometry, microscopy and RNAi experiments dominate high throughput experiments. Consequently, the functional information derived from these experiments is mostly of the subcellular location of proteins, and of the participation of proteins in embryonic developmental pathways. For some organisms, the information provided by different studies overlap by a large amount. We also show that the information provided by high throughput experiments is less specific than those provided by low throughput experiments. Given the experimental techniques available, certain biases in protein function annotation due to high-throughput experiments are unavoidable. Knowing that these biases exist and understanding their characteristics and extent is important for database curators, developers of function annotation programs, and anyone who uses protein function annotation data to plan experiments.

  7. PIWI Proteins and PIWI-Interacting RNA

    DEFF Research Database (Denmark)

    Han, Yi Neng; Li, Yuan; Xia, Sheng Qiang

    2017-01-01

    tissue types as well and play important roles in transposon silencing, epigenetic regulation, gene and protein regulation, genome rearrangement, spermatogenesis and germ stem-cell maintenance. PIWI proteins were first discovered in Drosophila and they play roles in spermatogenesis, germline stem-cell......P-Element induced wimpy testis (PIWI)-interacting RNAs (piRNAs) are a type of noncoding RNAs (ncRNAs) and interact with PIWI proteins. piRNAs were primarily described in the germline, but emerging evidence revealed that piRNAs are expressed in a tissue-specific manner among multiple human somatic...... maintenance, self-renewal, retrotransposons silencing and the male germline mobility control. A growing number of studies have demonstrated that several piRNA and PIWI proteins are aberrantly expressed in various kinds of cancers and may probably serve as a novel biomarker and therapeutic target for cancer...

  8. Biases in the Experimental Annotations of Protein Function and Their Effect on Our Understanding of Protein Function Space

    Science.gov (United States)

    Schnoes, Alexandra M.; Ream, David C.; Thorman, Alexander W.; Babbitt, Patricia C.; Friedberg, Iddo

    2013-01-01

    The ongoing functional annotation of proteins relies upon the work of curators to capture experimental findings from scientific literature and apply them to protein sequence and structure data. However, with the increasing use of high-throughput experimental assays, a small number of experimental studies dominate the functional protein annotations collected in databases. Here, we investigate just how prevalent is the “few articles - many proteins” phenomenon. We examine the experimentally validated annotation of proteins provided by several groups in the GO Consortium, and show that the distribution of proteins per published study is exponential, with 0.14% of articles providing the source of annotations for 25% of the proteins in the UniProt-GOA compilation. Since each of the dominant articles describes the use of an assay that can find only one function or a small group of functions, this leads to substantial biases in what we know about the function of many proteins. Mass-spectrometry, microscopy and RNAi experiments dominate high throughput experiments. Consequently, the functional information derived from these experiments is mostly of the subcellular location of proteins, and of the participation of proteins in embryonic developmental pathways. For some organisms, the information provided by different studies overlap by a large amount. We also show that the information provided by high throughput experiments is less specific than those provided by low throughput experiments. Given the experimental techniques available, certain biases in protein function annotation due to high-throughput experiments are unavoidable. Knowing that these biases exist and understanding their characteristics and extent is important for database curators, developers of function annotation programs, and anyone who uses protein function annotation data to plan experiments. PMID:23737737

  9. Transcriptional analysis of the HeT-A retrotransposon in mutant and wild type stocks reveals high sequence variability at Drosophila telomeres and other unusual features

    Directory of Open Access Journals (Sweden)

    Piñeyro David

    2011-11-01

    Full Text Available Abstract Background Telomere replication in Drosophila depends on the transposition of a domesticated retroelement, the HeT-A retrotransposon. The sequence of the HeT-A retrotransposon changes rapidly resulting in differentiated subfamilies. This pattern of sequence change contrasts with the essential function with which the HeT-A is entrusted and brings about questions concerning the extent of sequence variability, the telomere contribution of different subfamilies, and whether wild type and mutant Drosophila stocks show different HeT-A scenarios. Results A detailed study on the variability of HeT-A reveals that both the level of variability and the number of subfamilies are higher than previously reported. Comparisons between GIII, a strain with longer telomeres, and its parental strain Oregon-R indicate that both strains have the same set of HeT-A subfamilies. Finally, the presence of a highly conserved splicing pattern only in its antisense transcripts indicates a putative regulatory, functional or structural role for the HeT-A RNA. Interestingly, our results also suggest that most HeT-A copies are actively expressed regardless of which telomere and where in the telomere they are located. Conclusions Our study demonstrates how the HeT-A sequence changes much faster than previously reported resulting in at least nine different subfamilies most of which could actively contribute to telomere extension in Drosophila. Interestingly, the only significant difference observed between Oregon-R and GIII resides in the nature and proportion of the antisense transcripts, suggesting a possible mechanism that would in part explain the longer telomeres of the GIII stock.

  10. Genomic change, retrotransposon mobilization and extensive cytosine methylation alteration in Brassica napus introgressions from two intertribal hybridizations.

    Science.gov (United States)

    Zhang, Xueli; Ge, Xianhong; Shao, Yujiao; Sun, Genlou; Li, Zaiyun

    2013-01-01

    Hybridization and introgression represent important means for the transfer and/or de novo origination of traits and play an important role in facilitating speciation and plant breeding. Two sets of introgression lines in Brassica napus L. were previously established by its intertribal hybridizations with two wild species and long-term selection. In this study, the methods of amplified fragment length polymorphisms (AFLP), sequence-specific amplification polymorphism (SSAP) and methylation-sensitive amplified polymorphism (MSAP) were used to determine their genomic change, retrotransposon mobilization and cytosine methylation alteration in these lines. The genomic change revealed by the loss or gain of AFLP bands occurred for ∼10% of the total bands amplified in the two sets of introgressions, while no bands specific for wild species were detected. The new and absent SSAP bands appeared for 9 out of 11 retrotransposons analyzed, with low frequency of new bands and their total percentage of about 5% in both sets. MSAP analysis indicated that methylation changes were common in these lines (33.4-39.8%) and the hypermethylation was more frequent than hypomethylation. Our results suggested that certain extents of genetic and epigenetic alterations were induced by hybridization and alien DNA introgression. The cryptic mechanism of these changes and potential application of these lines in breeding were also discussed.

  11. Inferring the Functions of Proteins from the Interrelationships between Functional Categories.

    Science.gov (United States)

    Taha, Kamal

    2018-01-01

    This study proposes a new method to determine the functions of an unannotated protein. The proteins and amino acid residues mentioned in biomedical texts associated with an unannotated protein can be considered as characteristics terms for , which are highly predictive of the potential functions of . Similarly, proteins and amino acid residues mentioned in biomedical texts associated with proteins annotated with a functional category can be considered as characteristics terms of . We introduce in this paper an information extraction system called IFP_IFC that predicts the functions of an unannotated protein by representing and each functional category by a vector of weights. Each weight reflects the degree of association between a characteristic term and (or a characteristic term and ). First, IFP_IFC constructs a network, whose nodes represent the different functional categories, and its edges the interrelationships between the nodes. Then, it determines the functions of by employing random walks with restarts on the mentioned network. The walker is the vector of . Finally, is assigned to the functional categories of the nodes in the network that are visited most by the walker. We evaluated the quality of IFP_IFC by comparing it experimentally with two other systems. Results showed marked improvement.

  12. Piwi Is Required to Limit Exhaustion of Aging Somatic Stem Cells

    Directory of Open Access Journals (Sweden)

    Pedro Sousa-Victor

    2017-09-01

    Full Text Available Sophisticated mechanisms that preserve genome integrity are critical to ensure the maintenance of regenerative capacity while preventing transformation of somatic stem cells (SCs, yet little is known about mechanisms regulating genome maintenance in these cells. Here, we show that intestinal stem cells (ISCs induce the Argonaute family protein Piwi in response to JAK/STAT signaling during acute proliferative episodes. Piwi function is critical to ensure heterochromatin maintenance, suppress retrotransposon activation, and prevent DNA damage in homeostasis and under regenerative pressure. Accordingly, loss of Piwi results in the loss of actively dividing ISCs and their progenies by apoptosis. We further show that Piwi expression is sufficient to allay age-related retrotransposon expression, DNA damage, apoptosis, and mis-differentiation phenotypes in the ISC lineage, improving epithelial homeostasis. Our data identify a role for Piwi in the regulation of somatic SC function, and they highlight the importance of retrotransposon control in somatic SC maintenance.

  13. Incorporating functional inter-relationships into protein function prediction algorithms

    Directory of Open Access Journals (Sweden)

    Kumar Vipin

    2009-05-01

    Full Text Available Abstract Background Functional classification schemes (e.g. the Gene Ontology that serve as the basis for annotation efforts in several organisms are often the source of gold standard information for computational efforts at supervised protein function prediction. While successful function prediction algorithms have been developed, few previous efforts have utilized more than the protein-to-functional class label information provided by such knowledge bases. For instance, the Gene Ontology not only captures protein annotations to a set of functional classes, but it also arranges these classes in a DAG-based hierarchy that captures rich inter-relationships between different classes. These inter-relationships present both opportunities, such as the potential for additional training examples for small classes from larger related classes, and challenges, such as a harder to learn distinction between similar GO terms, for standard classification-based approaches. Results We propose a method to enhance the performance of classification-based protein function prediction algorithms by addressing the issue of using these interrelationships between functional classes constituting functional classification schemes. Using a standard measure for evaluating the semantic similarity between nodes in an ontology, we quantify and incorporate these inter-relationships into the k-nearest neighbor classifier. We present experiments on several large genomic data sets, each of which is used for the modeling and prediction of over hundred classes from the GO Biological Process ontology. The results show that this incorporation produces more accurate predictions for a large number of the functional classes considered, and also that the classes benefitted most by this approach are those containing the fewest members. In addition, we show how our proposed framework can be used for integrating information from the entire GO hierarchy for improving the accuracy of

  14. Protein Functionalized Nanodiamond Arrays

    Directory of Open Access Journals (Sweden)

    Liu YL

    2010-01-01

    Full Text Available Abstract Various nanoscale elements are currently being explored for bio-applications, such as in bio-images, bio-detection, and bio-sensors. Among them, nanodiamonds possess remarkable features such as low bio-cytotoxicity, good optical property in fluorescent and Raman spectra, and good photostability for bio-applications. In this work, we devise techniques to position functionalized nanodiamonds on self-assembled monolayer (SAMs arrays adsorbed on silicon and ITO substrates surface using electron beam lithography techniques. The nanodiamond arrays were functionalized with lysozyme to target a certain biomolecule or protein specifically. The optical properties of the nanodiamond-protein complex arrays were characterized by a high throughput confocal microscope. The synthesized nanodiamond-lysozyme complex arrays were found to still retain their functionality in interacting with E. coli.

  15. Genomic change, retrotransposon mobilization and extensive cytosine methylation alteration in Brassica napus introgressions from two intertribal hybridizations.

    Directory of Open Access Journals (Sweden)

    Xueli Zhang

    Full Text Available Hybridization and introgression represent important means for the transfer and/or de novo origination of traits and play an important role in facilitating speciation and plant breeding. Two sets of introgression lines in Brassica napus L. were previously established by its intertribal hybridizations with two wild species and long-term selection. In this study, the methods of amplified fragment length polymorphisms (AFLP, sequence-specific amplification polymorphism (SSAP and methylation-sensitive amplified polymorphism (MSAP were used to determine their genomic change, retrotransposon mobilization and cytosine methylation alteration in these lines. The genomic change revealed by the loss or gain of AFLP bands occurred for ∼10% of the total bands amplified in the two sets of introgressions, while no bands specific for wild species were detected. The new and absent SSAP bands appeared for 9 out of 11 retrotransposons analyzed, with low frequency of new bands and their total percentage of about 5% in both sets. MSAP analysis indicated that methylation changes were common in these lines (33.4-39.8% and the hypermethylation was more frequent than hypomethylation. Our results suggested that certain extents of genetic and epigenetic alterations were induced by hybridization and alien DNA introgression. The cryptic mechanism of these changes and potential application of these lines in breeding were also discussed.

  16. Tye7 regulates yeast Ty1 retrotransposon sense and antisense transcription in response to adenylic nucleotides stress.

    Science.gov (United States)

    Servant, Géraldine; Pinson, Benoit; Tchalikian-Cosson, Aurélie; Coulpier, Fanny; Lemoine, Sophie; Pennetier, Carole; Bridier-Nahmias, Antoine; Todeschini, Anne Laure; Fayol, Hélène; Daignan-Fornier, Bertrand; Lesage, Pascale

    2012-07-01

    Transposable elements play a fundamental role in genome evolution. It is proposed that their mobility, activated under stress, induces mutations that could confer advantages to the host organism. Transcription of the Ty1 LTR-retrotransposon of Saccharomyces cerevisiae is activated in response to a severe deficiency in adenylic nucleotides. Here, we show that Ty2 and Ty3 are also stimulated under these stress conditions, revealing the simultaneous activation of three active Ty retrotransposon families. We demonstrate that Ty1 activation in response to adenylic nucleotide depletion requires the DNA-binding transcription factor Tye7. Ty1 is transcribed in both sense and antisense directions. We identify three Tye7 potential binding sites in the region of Ty1 DNA sequence where antisense transcription starts. We show that Tye7 binds to Ty1 DNA and regulates Ty1 antisense transcription. Altogether, our data suggest that, in response to adenylic nucleotide reduction, TYE7 is induced and activates Ty1 mRNA transcription, possibly by controlling Ty1 antisense transcription. We also provide the first evidence that Ty1 antisense transcription can be regulated by environmental stress conditions, pointing to a new level of control of Ty1 activity by stress, as Ty1 antisense RNAs play an important role in regulating Ty1 mobility at both the transcriptional and post-transcriptional stages.

  17. UET: a database of evolutionarily-predicted functional determinants of protein sequences that cluster as functional sites in protein structures.

    Science.gov (United States)

    Lua, Rhonald C; Wilson, Stephen J; Konecki, Daniel M; Wilkins, Angela D; Venner, Eric; Morgan, Daniel H; Lichtarge, Olivier

    2016-01-04

    The structure and function of proteins underlie most aspects of biology and their mutational perturbations often cause disease. To identify the molecular determinants of function as well as targets for drugs, it is central to characterize the important residues and how they cluster to form functional sites. The Evolutionary Trace (ET) achieves this by ranking the functional and structural importance of the protein sequence positions. ET uses evolutionary distances to estimate functional distances and correlates genotype variations with those in the fitness phenotype. Thus, ET ranks are worse for sequence positions that vary among evolutionarily closer homologs but better for positions that vary mostly among distant homologs. This approach identifies functional determinants, predicts function, guides the mutational redesign of functional and allosteric specificity, and interprets the action of coding sequence variations in proteins, people and populations. Now, the UET database offers pre-computed ET analyses for the protein structure databank, and on-the-fly analysis of any protein sequence. A web interface retrieves ET rankings of sequence positions and maps results to a structure to identify functionally important regions. This UET database integrates several ways of viewing the results on the protein sequence or structure and can be found at http://mammoth.bcm.tmc.edu/uet/. © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

  18. Structural symmetry and protein function.

    Science.gov (United States)

    Goodsell, D S; Olson, A J

    2000-01-01

    The majority of soluble and membrane-bound proteins in modern cells are symmetrical oligomeric complexes with two or more subunits. The evolutionary selection of symmetrical oligomeric complexes is driven by functional, genetic, and physicochemical needs. Large proteins are selected for specific morphological functions, such as formation of rings, containers, and filaments, and for cooperative functions, such as allosteric regulation and multivalent binding. Large proteins are also more stable against denaturation and have a reduced surface area exposed to solvent when compared with many individual, smaller proteins. Large proteins are constructed as oligomers for reasons of error control in synthesis, coding efficiency, and regulation of assembly. Symmetrical oligomers are favored because of stability and finite control of assembly. Several functions limit symmetry, such as interaction with DNA or membranes, and directional motion. Symmetry is broken or modified in many forms: quasisymmetry, in which identical subunits adopt similar but different conformations; pleomorphism, in which identical subunits form different complexes; pseudosymmetry, in which different molecules form approximately symmetrical complexes; and symmetry mismatch, in which oligomers of different symmetries interact along their respective symmetry axes. Asymmetry is also observed at several levels. Nearly all complexes show local asymmetry at the level of side chain conformation. Several complexes have reciprocating mechanisms in which the complex is asymmetric, but, over time, all subunits cycle through the same set of conformations. Global asymmetry is only rarely observed. Evolution of oligomeric complexes may favor the formation of dimers over complexes with higher cyclic symmetry, through a mechanism of prepositioned pairs of interacting residues. However, examples have been found for all of the crystallographic point groups, demonstrating that functional need can drive the evolution of

  19. Roles for text mining in protein function prediction.

    Science.gov (United States)

    Verspoor, Karin M

    2014-01-01

    The Human Genome Project has provided science with a hugely valuable resource: the blueprints for life; the specification of all of the genes that make up a human. While the genes have all been identified and deciphered, it is proteins that are the workhorses of the human body: they are essential to virtually all cell functions and are the primary mechanism through which biological function is carried out. Hence in order to fully understand what happens at a molecular level in biological organisms, and eventually to enable development of treatments for diseases where some aspect of a biological system goes awry, we must understand the functions of proteins. However, experimental characterization of protein function cannot scale to the vast amount of DNA sequence data now available. Computational protein function prediction has therefore emerged as a problem at the forefront of modern biology (Radivojac et al., Nat Methods 10(13):221-227, 2013).Within the varied approaches to computational protein function prediction that have been explored, there are several that make use of biomedical literature mining. These methods take advantage of information in the published literature to associate specific proteins with specific protein functions. In this chapter, we introduce two main strategies for doing this: association of function terms, represented as Gene Ontology terms (Ashburner et al., Nat Genet 25(1):25-29, 2000), to proteins based on information in published articles, and a paradigm called LEAP-FS (Literature-Enhanced Automated Prediction of Functional Sites) in which literature mining is used to validate the predictions of an orthogonal computational protein function prediction method.

  20. Function and structure of GFP-like proteins in the protein data bank.

    Science.gov (United States)

    Ong, Wayne J-H; Alvarez, Samuel; Leroux, Ivan E; Shahid, Ramza S; Samma, Alex A; Peshkepija, Paola; Morgan, Alicia L; Mulcahy, Shawn; Zimmer, Marc

    2011-04-01

    The RCSB protein databank contains 266 crystal structures of green fluorescent proteins (GFP) and GFP-like proteins. This is the first systematic analysis of all the GFP-like structures in the pdb. We have used the pdb to examine the function of fluorescent proteins (FP) in nature, aspects of excited state proton transfer (ESPT) in FPs, deformation from planarity of the chromophore and chromophore maturation. The conclusions reached in this review are that (1) The lid residues are highly conserved, particularly those on the "top" of the β-barrel. They are important to the function of GFP-like proteins, perhaps in protecting the chromophore or in β-barrel formation. (2) The primary/ancestral function of GFP-like proteins may well be to aid in light induced electron transfer. (3) The structural prerequisites for light activated proton pumps exist in many structures and it's possible that like bioluminescence, proton pumps are secondary functions of GFP-like proteins. (4) In most GFP-like proteins the protein matrix exerts a significant strain on planar chromophores forcing most GFP-like proteins to adopt non-planar chromophores. These chromophoric deviations from planarity play an important role in determining the fluorescence quantum yield. (5) The chemospatial characteristics of the chromophore cavity determine the isomerization state of the chromophore. The cavities of highlighter proteins that can undergo cis/trans isomerization have chemospatial properties that are common to both cis and trans GFP-like proteins.

  1. Insights into Hox protein function from a large scale combinatorial analysis of protein domains.

    Directory of Open Access Journals (Sweden)

    Samir Merabet

    2011-10-01

    Full Text Available Protein function is encoded within protein sequence and protein domains. However, how protein domains cooperate within a protein to modulate overall activity and how this impacts functional diversification at the molecular and organism levels remains largely unaddressed. Focusing on three domains of the central class Drosophila Hox transcription factor AbdominalA (AbdA, we used combinatorial domain mutations and most known AbdA developmental functions as biological readouts to investigate how protein domains collectively shape protein activity. The results uncover redundancy, interactivity, and multifunctionality of protein domains as salient features underlying overall AbdA protein activity, providing means to apprehend functional diversity and accounting for the robustness of Hox-controlled developmental programs. Importantly, the results highlight context-dependency in protein domain usage and interaction, allowing major modifications in domains to be tolerated without general functional loss. The non-pleoitropic effect of domain mutation suggests that protein modification may contribute more broadly to molecular changes underlying morphological diversification during evolution, so far thought to rely largely on modification in gene cis-regulatory sequences.

  2. Quantitative protein localization signatures reveal an association between spatial and functional divergences of proteins.

    Science.gov (United States)

    Loo, Lit-Hsin; Laksameethanasan, Danai; Tung, Yi-Ling

    2014-03-01

    Protein subcellular localization is a major determinant of protein function. However, this important protein feature is often described in terms of discrete and qualitative categories of subcellular compartments, and therefore it has limited applications in quantitative protein function analyses. Here, we present Protein Localization Analysis and Search Tools (PLAST), an automated analysis framework for constructing and comparing quantitative signatures of protein subcellular localization patterns based on microscopy images. PLAST produces human-interpretable protein localization maps that quantitatively describe the similarities in the localization patterns of proteins and major subcellular compartments, without requiring manual assignment or supervised learning of these compartments. Using the budding yeast Saccharomyces cerevisiae as a model system, we show that PLAST is more accurate than existing, qualitative protein localization annotations in identifying known co-localized proteins. Furthermore, we demonstrate that PLAST can reveal protein localization-function relationships that are not obvious from these annotations. First, we identified proteins that have similar localization patterns and participate in closely-related biological processes, but do not necessarily form stable complexes with each other or localize at the same organelles. Second, we found an association between spatial and functional divergences of proteins during evolution. Surprisingly, as proteins with common ancestors evolve, they tend to develop more diverged subcellular localization patterns, but still occupy similar numbers of compartments. This suggests that divergence of protein localization might be more frequently due to the development of more specific localization patterns over ancestral compartments than the occupation of new compartments. PLAST enables systematic and quantitative analyses of protein localization-function relationships, and will be useful to elucidate protein

  3. Scoring functions for protein-protein interactions.

    Science.gov (United States)

    Moal, Iain H; Moretti, Rocco; Baker, David; Fernández-Recio, Juan

    2013-12-01

    The computational evaluation of protein-protein interactions will play an important role in organising the wealth of data being generated by high-throughput initiatives. Here we discuss future applications, report recent developments and identify areas requiring further investigation. Many functions have been developed to quantify the structural and energetic properties of interacting proteins, finding use in interrelated challenges revolving around the relationship between sequence, structure and binding free energy. These include loop modelling, side-chain refinement, docking, multimer assembly, affinity prediction, affinity change upon mutation, hotspots location and interface design. Information derived from models optimised for one of these challenges can be used to benefit the others, and can be unified within the theoretical frameworks of multi-task learning and Pareto-optimal multi-objective learning. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Recognition of functional sites in protein structures.

    Science.gov (United States)

    Shulman-Peleg, Alexandra; Nussinov, Ruth; Wolfson, Haim J

    2004-06-04

    Recognition of regions on the surface of one protein, that are similar to a binding site of another is crucial for the prediction of molecular interactions and for functional classifications. We first describe a novel method, SiteEngine, that assumes no sequence or fold similarities and is able to recognize proteins that have similar binding sites and may perform similar functions. We achieve high efficiency and speed by introducing a low-resolution surface representation via chemically important surface points, by hashing triangles of physico-chemical properties and by application of hierarchical scoring schemes for a thorough exploration of global and local similarities. We proceed to rigorously apply this method to functional site recognition in three possible ways: first, we search a given functional site on a large set of complete protein structures. Second, a potential functional site on a protein of interest is compared with known binding sites, to recognize similar features. Third, a complete protein structure is searched for the presence of an a priori unknown functional site, similar to known sites. Our method is robust and efficient enough to allow computationally demanding applications such as the first and the third. From the biological standpoint, the first application may identify secondary binding sites of drugs that may lead to side-effects. The third application finds new potential sites on the protein that may provide targets for drug design. Each of the three applications may aid in assigning a function and in classification of binding patterns. We highlight the advantages and disadvantages of each type of search, provide examples of large-scale searches of the entire Protein Data Base and make functional predictions.

  5. Prediction of functional sites in proteins using conserved functional group analysis.

    Science.gov (United States)

    Innis, C Axel; Anand, A Prem; Sowdhamini, R

    2004-04-02

    A detailed knowledge of a protein's functional site is an absolute prerequisite for understanding its mode of action at the molecular level. However, the rapid pace at which sequence and structural information is being accumulated for proteins greatly exceeds our ability to determine their biochemical roles experimentally. As a result, computational methods are required which allow for the efficient processing of the evolutionary information contained in this wealth of data, in particular that related to the nature and location of functionally important sites and residues. The method presented here, referred to as conserved functional group (CFG) analysis, relies on a simplified representation of the chemical groups found in amino acid side-chains to identify functional sites from a single protein structure and a number of its sequence homologues. We show that CFG analysis can fully or partially predict the location of functional sites in approximately 96% of the 470 cases tested and that, unlike other methods available, it is able to tolerate wide variations in sequence identity. In addition, we discuss its potential in a structural genomics context, where automation, scalability and efficiency are critical, and an increasing number of protein structures are determined with no prior knowledge of function. This is exemplified by our analysis of the hypothetical protein Ydde_Ecoli, whose structure was recently solved by members of the North East Structural Genomics consortium. Although the proposed active site for this protein needs to be validated experimentally, this example illustrates the scope of CFG analysis as a general tool for the identification of residues likely to play an important role in a protein's biochemical function. Thus, our method offers a convenient solution to rapidly and automatically process the vast amounts of data that are beginning to emerge from structural genomics projects.

  6. The functional properties, modification and utilization of whey proteins

    Directory of Open Access Journals (Sweden)

    B. G. Venter

    1986-03-01

    Full Text Available Whey protein has an excellent nutritional value and exhibits a functional potential. In comparison with certain other food proteins, the whey protein content of essential amino acids is extremely favourable for human consumption. Depending on the heat-treatment history thereof, soluble whey proteins with utilizable functional properties, apart from high biological value, true digestibility, protein efficiency ratio and nett protein utilization, can be recovered. Various technological and chemical recovery processes have been designed. Chemically and enzymatically modified whey protein is manufactured to obtain technological and functional advantages. The important functional properties of whey proteins, namely hydration, gelation, emulsifying and foaming properties, are reviewed.

  7. MIWI2 as an Effector of DNA Methylation and Gene Silencing in Embryonic Male Germ Cells

    Directory of Open Access Journals (Sweden)

    Kanako Kojima-Kita

    2016-09-01

    Full Text Available During the development of mammalian embryonic germ cells, global demethylation and de novo DNA methylation take place. In mouse embryonic germ cells, two PIWI family proteins, MILI and MIWI2, are essential for the de novo DNA methylation of retrotransposons, presumably through PIWI-interacting RNAs (piRNAs. Although piRNA-associated MIWI2 has been reported to play critical roles in the process, its molecular mechanisms have remained unclear. To identify the mechanism, transgenic mice were produced; they contained a fusion protein of MIWI2 and a zinc finger (ZF that recognized the promoter region of a type A LINE-1 gene. The ZF-MIWI2 fusion protein brought about DNA methylation, suppression of the type A LINE-1 gene, and a partial rescue of the impaired spermatogenesis of MILI-null mice. In addition, ZF-MIWI2 was associated with the proteins involved in DNA methylation. These data indicate that MIWI2 functions as an effector of de novo DNA methylation of the retrotransposon.

  8. Epigenetic control of mammalian LINE-1 retrotransposon by retinoblastoma proteins

    Energy Technology Data Exchange (ETDEWEB)

    Montoya-Durango, Diego E. [Department of Biochemistry and Molecular Biology and Center for Genetics and Molecular Medicine, University of Louisville School of Medicine Health Sciences Center, Louisville, KY 40202 (United States); Liu, Yongqing [James Graham Brown Cancer Center and Department of Ophthalmology and Visual Sciences, University of Louisville School of Medicine Health Sciences Center, Louisville, KY 40202 (United States); Teneng, Ivo; Kalbfleisch, Ted; Lacy, Mary E.; Steffen, Marlene C. [Department of Biochemistry and Molecular Biology and Center for Genetics and Molecular Medicine, University of Louisville School of Medicine Health Sciences Center, Louisville, KY 40202 (United States); Ramos, Kenneth S., E-mail: kenneth.ramos@louisville.edu [Department of Biochemistry and Molecular Biology and Center for Genetics and Molecular Medicine, University of Louisville School of Medicine Health Sciences Center, Louisville, KY 40202 (United States)

    2009-06-01

    Long interspersed nuclear elements (LINEs or L1 elements) are targeted for epigenetic silencing during early embryonic development and remain inactive in most cells and tissues. Here we show that E2F-Rb family complexes participate in L1 elements epigenetic regulation via nucleosomal histone modifications and recruitment of histone deacetylases (HDACs) HDAC1 and HDAC2. Our experiments demonstrated that (i) Rb and E2F interact with human and mouse L1 elements, (ii) L1 elements are deficient in both heterochromatin-associated histone marks H3 tri methyl K9 and H4 tri methyl K20 in Rb family triple knock out (Rb, p107, and p130) fibroblasts (TKO), (iii) L1 promoter exhibits increased histone H3 acetylation in the absence of HDAC1 and HDAC2 recruitment, (iv) L1 expression in TKO fibroblasts is upregulated compared to wild type counterparts, (v) L1 expression increases in the presence of the HDAC inhibitor TSA. On the basis of these findings we propose a model in which L1 sequences throughout the genome serve as centers for heterochromatin formation in an Rb family-dependent manner. As such, Rb proteins and L1 elements may play key roles in heterochromatin formation beyond pericentromeric chromosomal regions. These findings describe a novel mechanism of L1 reactivation in mammalian cells mediated by failure of corepressor protein recruitment by Rb, loss of histone epigenetic marks, heterochromatin formation, and increased histone H3 acetylation.

  9. Epigenetic control of mammalian LINE-1 retrotransposon by retinoblastoma proteins

    International Nuclear Information System (INIS)

    Montoya-Durango, Diego E.; Liu, Yongqing; Teneng, Ivo; Kalbfleisch, Ted; Lacy, Mary E.; Steffen, Marlene C.; Ramos, Kenneth S.

    2009-01-01

    Long interspersed nuclear elements (LINEs or L1 elements) are targeted for epigenetic silencing during early embryonic development and remain inactive in most cells and tissues. Here we show that E2F-Rb family complexes participate in L1 elements epigenetic regulation via nucleosomal histone modifications and recruitment of histone deacetylases (HDACs) HDAC1 and HDAC2. Our experiments demonstrated that (i) Rb and E2F interact with human and mouse L1 elements, (ii) L1 elements are deficient in both heterochromatin-associated histone marks H3 tri methyl K9 and H4 tri methyl K20 in Rb family triple knock out (Rb, p107, and p130) fibroblasts (TKO), (iii) L1 promoter exhibits increased histone H3 acetylation in the absence of HDAC1 and HDAC2 recruitment, (iv) L1 expression in TKO fibroblasts is upregulated compared to wild type counterparts, (v) L1 expression increases in the presence of the HDAC inhibitor TSA. On the basis of these findings we propose a model in which L1 sequences throughout the genome serve as centers for heterochromatin formation in an Rb family-dependent manner. As such, Rb proteins and L1 elements may play key roles in heterochromatin formation beyond pericentromeric chromosomal regions. These findings describe a novel mechanism of L1 reactivation in mammalian cells mediated by failure of corepressor protein recruitment by Rb, loss of histone epigenetic marks, heterochromatin formation, and increased histone H3 acetylation.

  10. Linking structural features of protein complexes and biological function.

    Science.gov (United States)

    Sowmya, Gopichandran; Breen, Edmond J; Ranganathan, Shoba

    2015-09-01

    Protein-protein interaction (PPI) establishes the central basis for complex cellular networks in a biological cell. Association of proteins with other proteins occurs at varying affinities, yet with a high degree of specificity. PPIs lead to diverse functionality such as catalysis, regulation, signaling, immunity, and inhibition, playing a crucial role in functional genomics. The molecular principle of such interactions is often elusive in nature. Therefore, a comprehensive analysis of known protein complexes from the Protein Data Bank (PDB) is essential for the characterization of structural interface features to determine structure-function relationship. Thus, we analyzed a nonredundant dataset of 278 heterodimer protein complexes, categorized into major functional classes, for distinguishing features. Interestingly, our analysis has identified five key features (interface area, interface polar residue abundance, hydrogen bonds, solvation free energy gain from interface formation, and binding energy) that are discriminatory among the functional classes using Kruskal-Wallis rank sum test. Significant correlations between these PPI interface features amongst functional categories are also documented. Salt bridges correlate with interface area in regulator-inhibitors (r = 0.75). These representative features have implications for the prediction of potential function of novel protein complexes. The results provide molecular insights for better understanding of PPIs and their relation to biological functions. © 2015 The Protein Society.

  11. Exploring Protein Function Using the Saccharomyces Genome Database.

    Science.gov (United States)

    Wong, Edith D

    2017-01-01

    Elucidating the function of individual proteins will help to create a comprehensive picture of cell biology, as well as shed light on human disease mechanisms, possible treatments, and cures. Due to its compact genome, and extensive history of experimentation and annotation, the budding yeast Saccharomyces cerevisiae is an ideal model organism in which to determine protein function. This information can then be leveraged to infer functions of human homologs. Despite the large amount of research and biological data about S. cerevisiae, many proteins' functions remain unknown. Here, we explore ways to use the Saccharomyces Genome Database (SGD; http://www.yeastgenome.org ) to predict the function of proteins and gain insight into their roles in various cellular processes.

  12. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-05-25

    The number of available protein sequences in public databases is increasing exponentially. However, a significant fraction of these sequences lack functional annotation which is essential to our understanding of how biological systems and processes operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching these predicted models, using global and local similarities, through three independent enzyme commission (EC) and gene ontology (GO) function libraries. The method was tested on 250 “hard” proteins, which lack homologous templates in both structure and function libraries. The results show that this method outperforms the conventional prediction methods based on sequence similarity or threading. Additionally, our method could be improved even further by incorporating protein-protein interaction information. Overall, the method we use provides an efficient approach for automated functional annotation of non-homologous proteins, starting from their sequence.

  13. Unveiling protein functions through the dynamics of the interaction network.

    Directory of Open Access Journals (Sweden)

    Irene Sendiña-Nadal

    Full Text Available Protein interaction networks have become a tool to study biological processes, either for predicting molecular functions or for designing proper new drugs to regulate the main biological interactions. Furthermore, such networks are known to be organized in sub-networks of proteins contributing to the same cellular function. However, the protein function prediction is not accurate and each protein has traditionally been assigned to only one function by the network formalism. By considering the network of the physical interactions between proteins of the yeast together with a manual and single functional classification scheme, we introduce a method able to reveal important information on protein function, at both micro- and macro-scale. In particular, the inspection of the properties of oscillatory dynamics on top of the protein interaction network leads to the identification of misclassification problems in protein function assignments, as well as to unveil correct identification of protein functions. We also demonstrate that our approach can give a network representation of the meta-organization of biological processes by unraveling the interactions between different functional classes.

  14. Biomimetic devices functionalized by membrane channel proteins

    Science.gov (United States)

    Schmidt, Jacob

    2004-03-01

    We are developing a new family of active materials which derive their functional properties from membrane proteins. These materials have two primary components: the proteins and the membranes themselves. I will discuss our recent work directed toward development of a generic platform for a "plug-and-play" philosophy of membrane protein engineering. By creating a stable biomimetic polymer membrane a single molecular monolayer thick, we will enable the exploitation of the function of any membrane protein, from pores and pumps to sensors and energy transducers. Our initial work has centered on the creation, study, and characterization of the biomimetic membranes. We are attempting to make large areas of membrane monolayers using Langmuir-Blodgett film formation as well as through arrays of microfabricated black lipid membrane-type septa. A number of techniques allow the insertion of protein into the membranes. As a benchmark, we have been employing a model system of voltage-gated pore proteins, which have electrically controllable porosities. I will report on the progress of this work, the characterization of the membranes, protein insertion processes, and the yield and functionality of the composite.

  15. Improving protein function prediction methods with integrated literature data

    Directory of Open Access Journals (Sweden)

    Gabow Aaron P

    2008-04-01

    Full Text Available Abstract Background Determining the function of uncharacterized proteins is a major challenge in the post-genomic era due to the problem's complexity and scale. Identifying a protein's function contributes to an understanding of its role in the involved pathways, its suitability as a drug target, and its potential for protein modifications. Several graph-theoretic approaches predict unidentified functions of proteins by using the functional annotations of better-characterized proteins in protein-protein interaction networks. We systematically consider the use of literature co-occurrence data, introduce a new method for quantifying the reliability of co-occurrence and test how performance differs across species. We also quantify changes in performance as the prediction algorithms annotate with increased specificity. Results We find that including information on the co-occurrence of proteins within an abstract greatly boosts performance in the Functional Flow graph-theoretic function prediction algorithm in yeast, fly and worm. This increase in performance is not simply due to the presence of additional edges since supplementing protein-protein interactions with co-occurrence data outperforms supplementing with a comparably-sized genetic interaction dataset. Through the combination of protein-protein interactions and co-occurrence data, the neighborhood around unknown proteins is quickly connected to well-characterized nodes which global prediction algorithms can exploit. Our method for quantifying co-occurrence reliability shows superior performance to the other methods, particularly at threshold values around 10% which yield the best trade off between coverage and accuracy. In contrast, the traditional way of asserting co-occurrence when at least one abstract mentions both proteins proves to be the worst method for generating co-occurrence data, introducing too many false positives. Annotating the functions with greater specificity is harder

  16. The Effect of Pulsed Streamer-like Discharge in Liquid on Transcriptional Activation of Retrotransposon Genes of a Red Alga, Porphyra Yezoensis

    OpenAIRE

    Ohno, T.; Li, Z.; Lin, X.F.; Zhang, W.B.; Takano, H.; Takio, S.; Namihira, T.; Akiyama, H.; オオノ, ツヨシ; ナミヒラ, タカオ; アキヤマ, ヒデノリ; 大野, 剛史; 浪平, 隆男; 秋山, 秀典

    2007-01-01

    Retrotransposons are mobile genetic elements thataccomplished transposition via an RNA intermediate.These elements can be transcriptionally activated by stressfactors, such as UV light, ozone, pathogens, woundingand drought. A red alga, porphyra yezoensis has recentlybeen recognized as a model plant for fundamental andapplied study in marine biological science. In this paper,pulsed streamer-like discharge in liquid was used as a newstress condition, and the transcription level of a copia-like...

  17. Functions of intrinsic disorder in transmembrane proteins

    DEFF Research Database (Denmark)

    Kjaergaard, Magnus; Kragelund, Birthe B.

    2017-01-01

    Intrinsic disorder is common in integral membrane proteins, particularly in the intracellular domains. Despite this observation, these domains are not always recognized as being disordered. In this review, we will discuss the biological functions of intrinsically disordered regions of membrane...... receptors. The functions of the disordered regions are many and varied. We will discuss selected examples including: (1) Organization of receptors, kinases, phosphatases and second messenger sources into signaling complexes. (2) Modulation of the membrane-embedded domain function by ball-and-chain like...... mechanisms. (3) Trafficking of membrane proteins. (4) Transient membrane associations. (5) Post-translational modifications most notably phosphorylation and (6) disorder-linked isoform dependent function. We finish the review by discussing the future challenges facing the membrane protein community regarding...

  18. Studying Membrane Protein Structure and Function Using Nanodiscs

    DEFF Research Database (Denmark)

    Huda, Pie

    The structure and dynamic of membrane proteins can provide valuable information about general functions, diseases and effects of various drugs. Studying membrane proteins are a challenge as an amphiphilic environment is necessary to stabilise the protein in a functionally and structurally relevant...... form. This is most typically achieved through the use of detergent based reconstitution systems. However, time and again such systems fail to provide a suitable environment causing aggregation and inactivation. Nanodiscs are self-assembled lipoproteins containing two membrane scaffold proteins...... and a lipid bilayer in defined nanometer size, which can act as a stabiliser for membrane proteins. This enables both functional and structural investigation of membrane proteins in a detergent free environment which is closer to the native situation. Understanding the self-assembly of nanodiscs is important...

  19. Structure-based inference of molecular functions of proteins of unknown function from Berkeley Structural Genomics Center

    Energy Technology Data Exchange (ETDEWEB)

    Kim, Sung-Hou; Shin, Dong Hae; Hou, Jingtong; Chandonia, John-Marc; Das, Debanu; Choi, In-Geol; Kim, Rosalind; Kim, Sung-Hou

    2007-09-02

    Advances in sequence genomics have resulted in an accumulation of a huge number of protein sequences derived from genome sequences. However, the functions of a large portion of them cannot be inferred based on the current methods of sequence homology detection to proteins of known functions. Three-dimensional structure can have an important impact in providing inference of molecular function (physical and chemical function) of a protein of unknown function. Structural genomics centers worldwide have been determining many 3-D structures of the proteins of unknown functions, and possible molecular functions of them have been inferred based on their structures. Combined with bioinformatics and enzymatic assay tools, the successful acceleration of the process of protein structure determination through high throughput pipelines enables the rapid functional annotation of a large fraction of hypothetical proteins. We present a brief summary of the process we used at the Berkeley Structural Genomics Center to infer molecular functions of proteins of unknown function.

  20. Analysis of substructural variation in families of enzymatic proteins with applications to protein function prediction

    Directory of Open Access Journals (Sweden)

    Fofanov Viacheslav Y

    2010-05-01

    Full Text Available Abstract Background Structural variations caused by a wide range of physico-chemical and biological sources directly influence the function of a protein. For enzymatic proteins, the structure and chemistry of the catalytic binding site residues can be loosely defined as a substructure of the protein. Comparative analysis of drug-receptor substructures across and within species has been used for lead evaluation. Substructure-level similarity between the binding sites of functionally similar proteins has also been used to identify instances of convergent evolution among proteins. In functionally homologous protein families, shared chemistry and geometry at catalytic sites provide a common, local point of comparison among proteins that may differ significantly at the sequence, fold, or domain topology levels. Results This paper describes two key results that can be used separately or in combination for protein function analysis. The Family-wise Analysis of SubStructural Templates (FASST method uses all-against-all substructure comparison to determine Substructural Clusters (SCs. SCs characterize the binding site substructural variation within a protein family. In this paper we focus on examples of automatically determined SCs that can be linked to phylogenetic distance between family members, segregation by conformation, and organization by homology among convergent protein lineages. The Motif Ensemble Statistical Hypothesis (MESH framework constructs a representative motif for each protein cluster among the SCs determined by FASST to build motif ensembles that are shown through a series of function prediction experiments to improve the function prediction power of existing motifs. Conclusions FASST contributes a critical feedback and assessment step to existing binding site substructure identification methods and can be used for the thorough investigation of structure-function relationships. The application of MESH allows for an automated

  1. Text mining improves prediction of protein functional sites.

    Directory of Open Access Journals (Sweden)

    Karin M Verspoor

    Full Text Available We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites. The structure analysis was carried out using Dynamics Perturbation Analysis (DPA, which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions.

  2. Text Mining Improves Prediction of Protein Functional Sites

    Science.gov (United States)

    Cohn, Judith D.; Ravikumar, Komandur E.

    2012-01-01

    We present an approach that integrates protein structure analysis and text mining for protein functional site prediction, called LEAP-FS (Literature Enhanced Automated Prediction of Functional Sites). The structure analysis was carried out using Dynamics Perturbation Analysis (DPA), which predicts functional sites at control points where interactions greatly perturb protein vibrations. The text mining extracts mentions of residues in the literature, and predicts that residues mentioned are functionally important. We assessed the significance of each of these methods by analyzing their performance in finding known functional sites (specifically, small-molecule binding sites and catalytic sites) in about 100,000 publicly available protein structures. The DPA predictions recapitulated many of the functional site annotations and preferentially recovered binding sites annotated as biologically relevant vs. those annotated as potentially spurious. The text-based predictions were also substantially supported by the functional site annotations: compared to other residues, residues mentioned in text were roughly six times more likely to be found in a functional site. The overlap of predictions with annotations improved when the text-based and structure-based methods agreed. Our analysis also yielded new high-quality predictions of many functional site residues that were not catalogued in the curated data sources we inspected. We conclude that both DPA and text mining independently provide valuable high-throughput protein functional site predictions, and that integrating the two methods using LEAP-FS further improves the quality of these predictions. PMID:22393388

  3. Human Milk: Bioactive Proteins/Peptides and Functional Properties.

    Science.gov (United States)

    Lönnerdal, Bo

    2016-06-23

    Breastfeeding has been associated with many benefits, both in the short and in the long term. Infants being breastfed generally have less illness and have better cognitive development at 1 year of age than formula-fed infants. Later in life, they have a lower risk of obesity, diabetes and cardiovascular disease. Several components in breast milk may be responsible for these different outcomes, but bioactive proteins/peptides likely play a major role. Some proteins in breast milk are comparatively resistant towards digestion and may therefore exert their functions in the gastrointestinal tract in intact form or as larger fragments. Other milk proteins may be partially digested in the upper small intestine and the resulting peptides may exert functions in the lower small intestine. Lactoferrin, lysozyme and secretory IgA have been found intact in the stool of breastfed infants and are therefore examples of proteins that are resistant against proteolytic degradation in the gut. Together, these proteins serve protective roles against infection and support immune function in the immature infant. α-lactalbumin, β-casein, κ-casein and osteopontin are examples of proteins that are partially digested in the upper small intestine, and the resulting peptides influence functions in the gut. Such functions include stimulation of immune function, mineral and trace element absorption and defense against infection. © 2016 Nestec Ltd., Vevey/S. Karger AG, Basel.

  4. Transcription Factor Functional Protein-Protein Interactions in Plant Defense Responses

    Directory of Open Access Journals (Sweden)

    Murilo S. Alves

    2014-03-01

    Full Text Available Responses to biotic stress in plants lead to dramatic reprogramming of gene expression, favoring stress responses at the expense of normal cellular functions. Transcription factors are master regulators of gene expression at the transcriptional level, and controlling the activity of these factors alters the transcriptome of the plant, leading to metabolic and phenotypic changes in response to stress. The functional analysis of interactions between transcription factors and other proteins is very important for elucidating the role of these transcriptional regulators in different signaling cascades. In this review, we present an overview of protein-protein interactions for the six major families of transcription factors involved in plant defense: basic leucine zipper containing domain proteins (bZIP, amino-acid sequence WRKYGQK (WRKY, myelocytomatosis related proteins (MYC, myeloblastosis related proteins (MYB, APETALA2/ ETHYLENE-RESPONSIVE ELEMENT BINDING FACTORS (AP2/EREBP and no apical meristem (NAM, Arabidopsis transcription activation factor (ATAF, and cup-shaped cotyledon (CUC (NAC. We describe the interaction partners of these transcription factors as molecular responses during pathogen attack and the key components of signal transduction pathways that take place during plant defense responses. These interactions determine the activation or repression of response pathways and are crucial to understanding the regulatory networks that modulate plant defense responses.

  5. Nutritional and functional properties of whey proteins concentrate and isolate

    Directory of Open Access Journals (Sweden)

    Zoran Herceg

    2006-12-01

    Full Text Available Whey protein fractions represent 18 - 20 % of total milk nitrogen content. Nutritional value in addition to diverse physico - chemical and functional properties make whey proteins highly suitable for application in foodstuffs. In the most cases, whey proteins are used because of their functional properties. Whey proteins possess favourable functional characteristics such as gelling, water binding, emulsification and foaming ability. Due to application of new process techniques (membrane fractionation techniques, it is possible to produce various whey - protein based products. The most important products based on the whey proteins are whey protein concentrates (WPC and whey protein isolates (WPI. The aim of this paper was to give comprehensive review of nutritional and functional properties of the most common used whey proteins (whey protein concentrate - WPC and whey protein isolate - WPI in the food industry.

  6. Random heteropolymers preserve protein function in foreign environments

    Science.gov (United States)

    Panganiban, Brian; Qiao, Baofu; Jiang, Tao; DelRe, Christopher; Obadia, Mona M.; Nguyen, Trung Dac; Smith, Anton A. A.; Hall, Aaron; Sit, Izaac; Crosby, Marquise G.; Dennis, Patrick B.; Drockenmuller, Eric; Olvera de la Cruz, Monica; Xu, Ting

    2018-03-01

    The successful incorporation of active proteins into synthetic polymers could lead to a new class of materials with functions found only in living systems. However, proteins rarely function under the conditions suitable for polymer processing. On the basis of an analysis of trends in protein sequences and characteristic chemical patterns on protein surfaces, we designed four-monomer random heteropolymers to mimic intrinsically disordered proteins for protein solubilization and stabilization in non-native environments. The heteropolymers, with optimized composition and statistical monomer distribution, enable cell-free synthesis of membrane proteins with proper protein folding for transport and enzyme-containing plastics for toxin bioremediation. Controlling the statistical monomer distribution in a heteropolymer, rather than the specific monomer sequence, affords a new strategy to interface with biological systems for protein-based biomaterials.

  7. Dissociation of activated protein C functions by elimination of protein S cofactor enhancement.

    LENUS (Irish Health Repository)

    Harmon, Shona

    2008-11-07

    Activated protein C (APC) plays a critical anticoagulant role in vivo by inactivating procoagulant factor Va and factor VIIIa and thus down-regulating thrombin generation. In addition, APC bound to the endothelial cell protein C receptor can initiate protease-activated receptor-1 (PAR-1)-mediated cytoprotective signaling. Protein S constitutes a critical cofactor for the anticoagulant function of APC but is not known to be involved in regulating APC-mediated protective PAR-1 signaling. In this study we utilized a site-directed mutagenesis strategy to characterize a putative protein S binding region within the APC Gla domain. Three single amino acid substitutions within the APC Gla domain (D35T, D36A, and A39V) were found to mildly impair protein S-dependent anticoagulant activity (<2-fold) but retained entirely normal cytoprotective activity. However, a single amino acid substitution (L38D) ablated the ability of protein S to function as a cofactor for this APC variant. Consequently, in assays of protein S-dependent factor Va proteolysis using purified proteins or in the plasma milieu, APC-L38D variant exhibited minimal residual anticoagulant activity compared with wild type APC. Despite the location of Leu-38 in the Gla domain, APC-L38D interacted normally with endothelial cell protein C receptor and retained its ability to trigger PAR-1 mediated cytoprotective signaling in a manner indistinguishable from that of wild type APC. Consequently, elimination of protein S cofactor enhancement of APC anticoagulant function represents a novel and effective strategy by which to separate the anticoagulant and cytoprotective functions of APC for potential therapeutic gain.

  8. A collaborative filtering approach for protein-protein docking scoring functions.

    Science.gov (United States)

    Bourquard, Thomas; Bernauer, Julie; Azé, Jérôme; Poupon, Anne

    2011-04-22

    A protein-protein docking procedure traditionally consists in two successive tasks: a search algorithm generates a large number of candidate conformations mimicking the complex existing in vivo between two proteins, and a scoring function is used to rank them in order to extract a native-like one. We have already shown that using Voronoi constructions and a well chosen set of parameters, an accurate scoring function could be designed and optimized. However to be able to perform large-scale in silico exploration of the interactome, a near-native solution has to be found in the ten best-ranked solutions. This cannot yet be guaranteed by any of the existing scoring functions. In this work, we introduce a new procedure for conformation ranking. We previously developed a set of scoring functions where learning was performed using a genetic algorithm. These functions were used to assign a rank to each possible conformation. We now have a refined rank using different classifiers (decision trees, rules and support vector machines) in a collaborative filtering scheme. The scoring function newly obtained is evaluated using 10 fold cross-validation, and compared to the functions obtained using either genetic algorithms or collaborative filtering taken separately. This new approach was successfully applied to the CAPRI scoring ensembles. We show that for 10 targets out of 12, we are able to find a near-native conformation in the 10 best ranked solutions. Moreover, for 6 of them, the near-native conformation selected is of high accuracy. Finally, we show that this function dramatically enriches the 100 best-ranking conformations in near-native structures.

  9. dAdd1 and dXNP prevent genome instability by maintaining HP1a localization at Drosophila telomeres.

    Science.gov (United States)

    Chavez, Joselyn; Murillo-Maldonado, Juan Manuel; Bahena, Vanessa; Cruz, Ana Karina; Castañeda-Sortibrán, América; Rodriguez-Arnaiz, Rosario; Zurita, Mario; Valadez-Graham, Viviana

    2017-12-01

    Telomeres are important contributors to genome stability, as they prevent linear chromosome end degradation and contribute to the avoidance of telomeric fusions. An important component of the telomeres is the heterochromatin protein 1a (HP1a). Mutations in Su(var)205, the gene encoding HP1a in Drosophila, result in telomeric fusions, retrotransposon regulation loss and larger telomeres, leading to chromosome instability. Previously, it was found that several proteins physically interact with HP1a, including dXNP and dAdd1 (orthologues to the mammalian ATRX gene). In this study, we found that mutations in the genes encoding the dXNP and dAdd1 proteins affect chromosome stability, causing chromosomal aberrations, including telomeric defects, similar to those observed in Su(var)205 mutants. In somatic cells, we observed that dXNP and dAdd1 participate in the silencing of the telomeric HTT array of retrotransposons, preventing anomalous retrotransposon transcription and integration. Furthermore, the lack of dAdd1 results in the loss of HP1a from the telomeric regions without affecting other chromosomal HP1a binding sites; mutations in dxnp also affected HP1a localization but not at all telomeres, suggesting a specialized role for dAdd1 and dXNP proteins in locating HP1a at the tips of the chromosomes. These results place dAdd1 as an essential regulator of HP1a localization and function in the telomere heterochromatic domain.

  10. Functionalization of whey proteins by reactive supercritical fluid extrusion

    Directory of Open Access Journals (Sweden)

    Khanitta Ruttarattanamongkol

    2012-09-01

    Full Text Available Whey protein, a by-product from cheese-making, is often used in a variety of food formulations due to its unsurpassednutritional quality and inherent functional properties. However, the possibilities for the improvement and upgrading of wheyprotein utilization still need to be explored. Reactive supercritical fluid extrusion (SCFX is a novel technique that has beenrecently reported to successfully functionalize commercially available whey proteins into a product with enhanced functionalproperties. The specific goal of this review is to provide fundamental understanding of the reinforcement mechanism andprocessing of protein functionalization by reactive SCFX process. The superimposed extrusion variables and their interactionmechanism affect the physico-chemical properties of whey proteins. By understanding the structure, functional properties andprocessing relationships of such materials, the rational design criteria for novel functionalized proteins could be developedand effectively utilized in food systems.

  11. Identification of SSR and retrotransposon-based molecular markers linked to morphological characters in oily sunfl ower (Helianthus annuus L.) under natural and water-limited states.

    Science.gov (United States)

    Ali, Soleimani Gezeljeh; Darvishzadeh, Reza; Ebrahimi, Asa; Bihamta, Mohammad Reza

    2018-03-01

    Sunflower is an important source of edible oil. Drought is known as an important factor limiting the growth and productivity of field crops in most parts of the world. Agricultural biotechnology mainly aims at developing crops with higher tolerance to the challenging environmental conditions, such as drought. This study examined a number of morphological characters, along with relative water content (RWC) in 100 inbred sunflower lines. A 10 × 10 simple lattice design with two replications was employed to measure the mentioned parameters under natural and water-limited states during two successive years. In molecular trial, 30 simple sequence repeat (SSR) primer pairs, as well as 14 inter-retrotransposon amplified polymorphism (IRAP) and 14 retrotransposon-microsatellite amplified polymorphism (REMAP) primer combinations were used for DNA fingerprinting of the lines. Most of the examined characters had lower average values under water-limited than natural states. Maximum and minimum reductions were observed in the cases of yield and oil percentage, respectively. The broad-sense heritabilities for all the examined characters were 0.20-0.73 and 0.10-0.34 under natural and water-limited states, respectively. In the studied samples, 8.97% of the 435 possible locus pairs of the SSRs represented significant linkage disequilibrium (LD) levels. In the association analysis using SSR markers, 22 and 21 markers were identified (P ≤ 0.05) for the studied characters under natural and water-limited states, respectively. The corresponding values were 50 and 37 using retrotransposon-based molecular markers. Some detected markers were communal between the characters under water-limited and natural states. This was in line with the phenotypic correlations detected between the characters. Communal markers facilitate the simultaneous selection of several characters and can thus improve the efficacy of selection based on markers in the plant-breeding activities.

  12. Usher protein functions in hair cells and photoreceptors.

    Science.gov (United States)

    Cosgrove, Dominic; Zallocchi, Marisa

    2014-01-01

    The 10 different genes associated with the deaf/blind disorder, Usher syndrome, encode a number of structurally and functionally distinct proteins, most expressed as multiple isoforms/protein variants. Functional characterization of these proteins suggests a role in stereocilia development in cochlear hair cells, likely owing to adhesive interactions in hair bundles. In mature hair cells, homodimers of the Usher cadherins, cadherin 23 and protocadherin 15, interact to form a structural fiber, the tip link, and the linkages that anchor the taller stereocilia's actin cytoskeleton core to the shorter adjacent stereocilia and the elusive mechanotransduction channels, explaining the deafness phenotype when these molecular interactions are perturbed. The conundrum is that photoreceptors lack a synonymous mechanotransduction apparatus, and so a common theory for Usher protein function in the two neurosensory cell types affected in Usher syndrome is lacking. Recent evidence linking photoreceptor cell dysfunction in the shaker 1 mouse model for Usher syndrome to light-induced protein translocation defects, combined with localization of an Usher protein interactome at the periciliary region of the photoreceptors suggests Usher proteins might regulate protein trafficking between the inner and outer segments of photoreceptors. A distinct Usher protein complex is trafficked to the ribbon synapses of hair cells, and synaptic defects have been reported in Usher mutants in both hair cells and photoreceptors. This review aims to clarify what is known about Usher protein function at the synaptic and apical poles of hair cells and photoreceptors and the prospects for identifying a unifying pathobiological mechanism to explain deaf/blindness in Usher syndrome. Copyright © 2013 Elsevier Ltd. All rights reserved.

  13. Automated quantitative assessment of proteins' biological function in protein knowledge bases.

    Science.gov (United States)

    Mayr, Gabriele; Lepperdinger, Günter; Lackner, Peter

    2008-01-01

    Primary protein sequence data are archived in databases together with information regarding corresponding biological functions. In this respect, UniProt/Swiss-Prot is currently the most comprehensive collection and it is routinely cross-examined when trying to unravel the biological role of hypothetical proteins. Bioscientists frequently extract single entries and further evaluate those on a subjective basis. In lieu of a standardized procedure for scoring the existing knowledge regarding individual proteins, we here report about a computer-assisted method, which we applied to score the present knowledge about any given Swiss-Prot entry. Applying this quantitative score allows the comparison of proteins with respect to their sequence yet highlights the comprehension of functional data. pfs analysis may be also applied for quality control of individual entries or for database management in order to rank entry listings.

  14. Automated Quantitative Assessment of Proteins' Biological Function in Protein Knowledge Bases

    Directory of Open Access Journals (Sweden)

    Gabriele Mayr

    2008-01-01

    Full Text Available Primary protein sequence data are archived in databases together with information regarding corresponding biological functions. In this respect, UniProt/Swiss-Prot is currently the most comprehensive collection and it is routinely cross-examined when trying to unravel the biological role of hypothetical proteins. Bioscientists frequently extract single entries and further evaluate those on a subjective basis. In lieu of a standardized procedure for scoring the existing knowledge regarding individual proteins, we here report about a computer-assisted method, which we applied to score the present knowledge about any given Swiss-Prot entry. Applying this quantitative score allows the comparison of proteins with respect to their sequence yet highlights the comprehension of functional data. pfs analysis may be also applied for quality control of individual entries or for database management in order to rank entry listings.

  15. AVID: An integrative framework for discovering functional relationships among proteins

    Directory of Open Access Journals (Sweden)

    Keating Amy E

    2005-06-01

    Full Text Available Abstract Background Determining the functions of uncharacterized proteins is one of the most pressing problems in the post-genomic era. Large scale protein-protein interaction assays, global mRNA expression analyses and systematic protein localization studies provide experimental information that can be used for this purpose. The data from such experiments contain many false positives and false negatives, but can be processed using computational methods to provide reliable information about protein-protein relationships and protein function. An outstanding and important goal is to predict detailed functional annotation for all uncharacterized proteins that is reliable enough to effectively guide experiments. Results We present AVID, a computational method that uses a multi-stage learning framework to integrate experimental results with sequence information, generating networks reflecting functional similarities among proteins. We illustrate use of the networks by making predictions of detailed Gene Ontology (GO annotations in three categories: molecular function, biological process, and cellular component. Applied to the yeast Saccharomyces cerevisiae, AVID provides 37,451 pair-wise functional linkages between 4,191 proteins. These relationships are ~65–78% accurate, as assessed by cross-validation testing. Assignments of highly detailed functional descriptors to proteins, based on the networks, are estimated to be ~67% accurate for GO categories describing molecular function and cellular component and ~52% accurate for terms describing biological process. The predictions cover 1,490 proteins with no previous annotation in GO and also assign more detailed functions to many proteins annotated only with less descriptive terms. Predictions made by AVID are largely distinct from those made by other methods. Out of 37,451 predicted pair-wise relationships, the greatest number shared in common with another method is 3,413. Conclusion AVID provides

  16. Scoring protein relationships in functional interaction networks predicted from sequence data.

    Directory of Open Access Journals (Sweden)

    Gaston K Mazandu

    Full Text Available UNLABELLED: The abundance of diverse biological data from various sources constitutes a rich source of knowledge, which has the power to advance our understanding of organisms. This requires computational methods in order to integrate and exploit these data effectively and elucidate local and genome wide functional connections between protein pairs, thus enabling functional inferences for uncharacterized proteins. These biological data are primarily in the form of sequences, which determine functions, although functional properties of a protein can often be predicted from just the domains it contains. Thus, protein sequences and domains can be used to predict protein pair-wise functional relationships, and thus contribute to the function prediction process of uncharacterized proteins in order to ensure that knowledge is gained from sequencing efforts. In this work, we introduce information-theoretic based approaches to score protein-protein functional interaction pairs predicted from protein sequence similarity and conserved protein signature matches. The proposed schemes are effective for data-driven scoring of connections between protein pairs. We applied these schemes to the Mycobacterium tuberculosis proteome to produce a homology-based functional network of the organism with a high confidence and coverage. We use the network for predicting functions of uncharacterised proteins. AVAILABILITY: Protein pair-wise functional relationship scores for Mycobacterium tuberculosis strain CDC1551 sequence data and python scripts to compute these scores are available at http://web.cbio.uct.ac.za/~gmazandu/scoringschemes.

  17. Recognizing the SINEs of Infection: Regulation of Retrotransposon Expression and Modulation of Host Cell Processes

    Directory of Open Access Journals (Sweden)

    William Dunker

    2017-12-01

    Full Text Available Short interspersed elements (SINEs are a family of retrotransposons evolutionarily derived from cellular RNA polymerase III transcripts. Over evolutionary time, SINEs have expanded throughout the human genome and today comprise ~11% of total chromosomal DNA. While generally transcriptionally silent in healthy somatic cells, SINE expression increases during a variety of types of stresses, including DNA virus infection. The relevance of SINE expression to viral infection was largely unexplored, however, recent years have seen great progress towards defining the impact of SINE expression on viral replication and host gene expression. Here we review the origin and diversity of SINE elements and their transcriptional control, with an emphasis on how their expression impacts host cell biology during viral infection.

  18. The contact activation proteins: a structure/function overview

    NARCIS (Netherlands)

    Meijers, J. C.; McMullen, B. A.; Bouma, B. N.

    1992-01-01

    In recent years, extensive knowledge has been obtained on the structure/function relationships of blood coagulation proteins. In this overview, we present recent developments on the structure/function relationships of the contact activation proteins: factor XII, high molecular weight kininogen,

  19. Alkylation damage by lipid electrophiles targets functional protein systems.

    Science.gov (United States)

    Codreanu, Simona G; Ullery, Jody C; Zhu, Jing; Tallman, Keri A; Beavers, William N; Porter, Ned A; Marnett, Lawrence J; Zhang, Bing; Liebler, Daniel C

    2014-03-01

    Protein alkylation by reactive electrophiles contributes to chemical toxicities and oxidative stress, but the functional impact of alkylation damage across proteomes is poorly understood. We used Click chemistry and shotgun proteomics to profile the accumulation of proteome damage in human cells treated with lipid electrophile probes. Protein target profiles revealed three damage susceptibility classes, as well as proteins that were highly resistant to alkylation. Damage occurred selectively across functional protein interaction networks, with the most highly alkylation-susceptible proteins mapping to networks involved in cytoskeletal regulation. Proteins with lower damage susceptibility mapped to networks involved in protein synthesis and turnover and were alkylated only at electrophile concentrations that caused significant toxicity. Hierarchical susceptibility of proteome systems to alkylation may allow cells to survive sublethal damage while protecting critical cell functions.

  20. Alkylation Damage by Lipid Electrophiles Targets Functional Protein Systems*

    Science.gov (United States)

    Codreanu, Simona G.; Ullery, Jody C.; Zhu, Jing; Tallman, Keri A.; Beavers, William N.; Porter, Ned A.; Marnett, Lawrence J.; Zhang, Bing; Liebler, Daniel C.

    2014-01-01

    Protein alkylation by reactive electrophiles contributes to chemical toxicities and oxidative stress, but the functional impact of alkylation damage across proteomes is poorly understood. We used Click chemistry and shotgun proteomics to profile the accumulation of proteome damage in human cells treated with lipid electrophile probes. Protein target profiles revealed three damage susceptibility classes, as well as proteins that were highly resistant to alkylation. Damage occurred selectively across functional protein interaction networks, with the most highly alkylation-susceptible proteins mapping to networks involved in cytoskeletal regulation. Proteins with lower damage susceptibility mapped to networks involved in protein synthesis and turnover and were alkylated only at electrophile concentrations that caused significant toxicity. Hierarchical susceptibility of proteome systems to alkylation may allow cells to survive sublethal damage while protecting critical cell functions. PMID:24429493

  1. Identification and characterization of REC66, a Ty1-copia-like retrotransposon in the genome of red flower of Mirabilis jalapa L.

    Directory of Open Access Journals (Sweden)

    Shunri Jiang

    2017-01-01

    Full Text Available Mirabilis jalapa Lis the most commonly grown ornamental species of Mirabilis and is available in a range of brilliant colors. However, genetic research on Mirabilis jalapa Lis limited. Using fluorescent differential display (FDD screening, we report the identification of a novel Ty1-copia-like retrotransposon in the genome of the red flower of Mirabilis jalapa L, and we named it REC66based on its sequence homology to the GAG protein from Ty1-copiaretrotransposon. Using degenerate primers based on the DNA sequence of REC66, a total of fourteen different variants in reverse transcriptase (RT sequence were recovered from the genomic DNA. These RT sequences show a high degree of heterogeneity characterized mainly by deletion mutation; they can be divided into three subfamilies, of which the majority encode defective RT. This is the first report of a Ty1-copiaretrotransposon in Mirabilis jalapa L. The finding could be helpful for the development of new molecular markers for genetic studies, particularly on the origin and evolutionary relationships of M. jalapa L, and the study of Ty1-copiaretrotransposons and plant genome evolution in the genus Mirabilisor family Nyctaginaceae.

  2. AcEST: BP918427 [AcEST

    Lifescience Database Archive (English)

    Full Text Available tative uncharacterized protein OS=Vitis... 116 7e-25 tr|Q8RZ67|Q8RZ67_ORYSJ Putative rice retrotransposon retrofit... + Sbjct: 384 ATVRIILSLAVTSGLRLHKLDVKNAFLHGFLNEEVYMEQPPGYTDPY 430 >tr|Q8RZ67|Q8RZ67_ORYSJ Putative rice retrotransposon retrofit

  3. Isolation of two new retrotransposon sequences and development of molecular and cytological markers for Dasypyrum villosum (L.).

    Science.gov (United States)

    Zhang, Jie; Jiang, Yun; Xuan, Pu; Guo, Yuanlin; Deng, Guangbing; Yu, Maoqun; Long, Hai

    2017-10-01

    Dasypyrum villosum is a valuable genetic resource for wheat improvement. With the aim to efficiently monitor the D. villosum chromatin introduced into common wheat, two novel retrotransposon sequences were isolated by RAPD, and were successfully converted to D. villosum-specific SCAR markers. In addition, we constructed a chromosomal karyotype of D. villosum. Our results revealed that different accessions of D. villosum showed slightly different signal patterns, indicating that distribution of repeats did not diverge significantly among D. villosum accessions. The two SCAR markers and FISH karyotype of D. villosum could be used for efficient and precise identification of D. villosum chromatin in wheat breeding.

  4. Usher protein functions in hair cells and photoreceptors

    OpenAIRE

    Cosgrove, Dominic; Zallocchi, Marisa

    2013-01-01

    The 10 different genes associated with the deaf/blind disorder, Usher syndrome, encode a number of structurally and functionally distinct proteins, most expressed as multiple isoforms/protein variants. Functional characterization of these proteins suggests a role in stereocilia development in cochlear hair cells, likely owing to adhesive interactions in hair bundles. In mature hair cells, homodimers of the Usher cadherins, cadherin 23 and protocadherin 15, interact to form a structural fiber,...

  5. Structure and function of nanoparticle-protein conjugates

    International Nuclear Information System (INIS)

    Aubin-Tam, M-E; Hamad-Schifferli, K

    2008-01-01

    Conjugation of proteins to nanoparticles has numerous applications in sensing, imaging, delivery, catalysis, therapy and control of protein structure and activity. Therefore, characterizing the nanoparticle-protein interface is of great importance. A variety of covalent and non-covalent linking chemistries have been reported for nanoparticle attachment. Site-specific labeling is desirable in order to control the protein orientation on the nanoparticle, which is crucial in many applications such as fluorescence resonance energy transfer. We evaluate methods for successful site-specific attachment. Typically, a specific protein residue is linked directly to the nanoparticle core or to the ligand. As conjugation often affects the protein structure and function, techniques to probe structure and activity are assessed. We also examine how molecular dynamics simulations of conjugates would complete those experimental techniques in order to provide atomistic details on the effect of nanoparticle attachment. Characterization studies of nanoparticle-protein complexes show that the structure and function are influenced by the chemistry of the nanoparticle ligand, the nanoparticle size, the nanoparticle material, the stoichiometry of the conjugates, the labeling site on the protein and the nature of the linkage (covalent versus non-covalent)

  6. Assessment of genetic diversity among Indian potato (Solanum tuberosum L.) collection using microsatellite and retrotransposon based marker systems.

    Science.gov (United States)

    Sharma, Vishakha; Nandineni, Madhusudan R

    2014-04-01

    Potato (Solanum tuberosum) is an important non-cereal crop throughout the world and is highly recommended for ensuring global food security. Owing to the complexities in genetics and inheritance pattern of potato, the conventional method of cross breeding for developing improved varieties has been difficult. Identification and tagging of desirable traits with informative molecular markers would aid in the development of improved varieties. Insertional polymorphism of copia-like and gypsy-like long terminal repeat retrotransposons (RTN) were investigated among 47 potato varieties from India using Inter-Retrotransposon Amplified Polymorphism (IRAP) and Retrotransposon Microsatellite Amplified Polymorphism (REMAP) marker techniques and were compared with the DNA profiles obtained with simple sequence repeats (SSRs). The genetic polymorphism, efficiency of polymorphism and effectiveness of marker systems were evaluated to assess the extent of genetic diversity among Indian potato varieties. A total of 139 polymorphic SSR alleles, 270 IRAP and 98 REMAP polymorphic bands, showing polymorphism of 100%, 87.9% and 68.5%, respectively, were used for detailed characterization of the genetic relationships among potato varieties by using cluster analysis and principal coordinate analysis (PCoA). IRAP analysis resulted in the highest number of polymorphic bands with an average of 15 polymorphic bands per assay unit when compared to the other two marker systems. Based on pair-wise comparison, the genetic similarity was calculated using Dice similarity coefficient. The SSRs showed a wide range in genetic similarity values (0.485-0.971) as compared to IRAP (0.69-0.911) and REMAP (0.713-0.947). A Mantel's matrix correspondence test showed a high positive correlation (r=0.6) between IRAP and REMAP, an intermediate value (r=0.58) for IRAP and SSR and the lowest value (r=0.17) for SSR and REMAP. Statistically significant cophenetic correlation coefficient values, of 0.961, 0.941 and 0

  7. Phytochemicals perturb membranes and promiscuously alter protein function.

    Science.gov (United States)

    Ingólfsson, Helgi I; Thakur, Pratima; Herold, Karl F; Hobart, E Ashley; Ramsey, Nicole B; Periole, Xavier; de Jong, Djurre H; Zwama, Martijn; Yilmaz, Duygu; Hall, Katherine; Maretzky, Thorsten; Hemmings, Hugh C; Blobel, Carl; Marrink, Siewert J; Koçer, Armağan; Sack, Jon T; Andersen, Olaf S

    2014-08-15

    A wide variety of phytochemicals are consumed for their perceived health benefits. Many of these phytochemicals have been found to alter numerous cell functions, but the mechanisms underlying their biological activity tend to be poorly understood. Phenolic phytochemicals are particularly promiscuous modifiers of membrane protein function, suggesting that some of their actions may be due to a common, membrane bilayer-mediated mechanism. To test whether bilayer perturbation may underlie this diversity of actions, we examined five bioactive phenols reported to have medicinal value: capsaicin from chili peppers, curcumin from turmeric, EGCG from green tea, genistein from soybeans, and resveratrol from grapes. We find that each of these widely consumed phytochemicals alters lipid bilayer properties and the function of diverse membrane proteins. Molecular dynamics simulations show that these phytochemicals modify bilayer properties by localizing to the bilayer/solution interface. Bilayer-modifying propensity was verified using a gramicidin-based assay, and indiscriminate modulation of membrane protein function was demonstrated using four proteins: membrane-anchored metalloproteases, mechanosensitive ion channels, and voltage-dependent potassium and sodium channels. Each protein exhibited similar responses to multiple phytochemicals, consistent with a common, bilayer-mediated mechanism. Our results suggest that many effects of amphiphilic phytochemicals are due to cell membrane perturbations, rather than specific protein binding.

  8. DNA mimic proteins: functions, structures, and bioinformatic analysis.

    Science.gov (United States)

    Wang, Hao-Ching; Ho, Chun-Han; Hsu, Kai-Cheng; Yang, Jinn-Moon; Wang, Andrew H-J

    2014-05-13

    DNA mimic proteins have DNA-like negative surface charge distributions, and they function by occupying the DNA binding sites of DNA binding proteins to prevent these sites from being accessed by DNA. DNA mimic proteins control the activities of a variety of DNA binding proteins and are involved in a wide range of cellular mechanisms such as chromatin assembly, DNA repair, transcription regulation, and gene recombination. However, the sequences and structures of DNA mimic proteins are diverse, making them difficult to predict by bioinformatic search. To date, only a few DNA mimic proteins have been reported. These DNA mimics were not found by searching for functional motifs in their sequences but were revealed only by structural analysis of their charge distribution. This review highlights the biological roles and structures of 16 reported DNA mimic proteins. We also discuss approaches that might be used to discover new DNA mimic proteins.

  9. MOV10 RNA helicase is a potent inhibitor of retrotransposition in cells.

    Directory of Open Access Journals (Sweden)

    John L Goodier

    Full Text Available MOV10 protein, a putative RNA helicase and component of the RNA-induced silencing complex (RISC, inhibits retrovirus replication. We show that MOV10 also severely restricts human LINE1 (L1, Alu, and SVA retrotransposons. MOV10 associates with the L1 ribonucleoprotein particle, along with other RNA helicases including DDX5, DHX9, DDX17, DDX21, and DDX39A. However, unlike MOV10, these other helicases do not strongly inhibit retrotransposition, an activity dependent upon intact helicase domains. MOV10 association with retrotransposons is further supported by its colocalization with L1 ORF1 protein in stress granules, by cytoplasmic structures associated with RNA silencing, and by the ability of MOV10 to reduce endogenous and ectopic L1 expression. The majority of the human genome is repetitive DNA, most of which is the detritus of millions of years of accumulated retrotransposition. Retrotransposons remain active mutagens, and their insertion can disrupt gene function. Therefore, the host has evolved defense mechanisms to protect against retrotransposition, an arsenal we are only beginning to understand. With homologs in other vertebrates, insects, and plants, MOV10 may represent an ancient and innate form of immunity against both infective viruses and endogenous retroelements.

  10. Nutritional and functional properties of whey proteins concentrate and isolate

    OpenAIRE

    Zoran Herceg; Anet Režek

    2006-01-01

    Whey protein fractions represent 18 - 20 % of total milk nitrogen content. Nutritional value in addition to diverse physico - chemical and functional properties make whey proteins highly suitable for application in foodstuffs. In the most cases, whey proteins are used because of their functional properties. Whey proteins possess favourable functional characteristics such as gelling, water binding, emulsification and foaming ability. Due to application of new process techniques (membrane fract...

  11. Discovering functional interdependence relationship in PPI networks for protein complex identification.

    Science.gov (United States)

    Lam, Winnie W M; Chan, Keith C C

    2012-04-01

    Protein molecules interact with each other in protein complexes to perform many vital functions, and different computational techniques have been developed to identify protein complexes in protein-protein interaction (PPI) networks. These techniques are developed to search for subgraphs of high connectivity in PPI networks under the assumption that the proteins in a protein complex are highly interconnected. While these techniques have been shown to be quite effective, it is also possible that the matching rate between the protein complexes they discover and those that are previously determined experimentally be relatively low and the "false-alarm" rate can be relatively high. This is especially the case when the assumption of proteins in protein complexes being more highly interconnected be relatively invalid. To increase the matching rate and reduce the false-alarm rate, we have developed a technique that can work effectively without having to make this assumption. The name of the technique called protein complex identification by discovering functional interdependence (PCIFI) searches for protein complexes in PPI networks by taking into consideration both the functional interdependence relationship between protein molecules and the network topology of the network. The PCIFI works in several steps. The first step is to construct a multiple-function protein network graph by labeling each vertex with one or more of the molecular functions it performs. The second step is to filter out protein interactions between protein pairs that are not functionally interdependent of each other in the statistical sense. The third step is to make use of an information-theoretic measure to determine the strength of the functional interdependence between all remaining interacting protein pairs. Finally, the last step is to try to form protein complexes based on the measure of the strength of functional interdependence and the connectivity between proteins. For performance evaluation

  12. Automatically extracting functionally equivalent proteins from SwissProt

    Directory of Open Access Journals (Sweden)

    Martin Andrew CR

    2008-10-01

    Full Text Available Abstract Background There is a frequent need to obtain sets of functionally equivalent homologous proteins (FEPs from different species. While it is usually the case that orthology implies functional equivalence, this is not always true; therefore datasets of orthologous proteins are not appropriate. The information relevant to extracting FEPs is contained in databanks such as UniProtKB/Swiss-Prot and a manual analysis of these data allow FEPs to be extracted on a one-off basis. However there has been no resource allowing the easy, automatic extraction of groups of FEPs – for example, all instances of protein C. We have developed FOSTA, an automatically generated database of FEPs annotated as having the same function in UniProtKB/Swiss-Prot which can be used for large-scale analysis. The method builds a candidate list of homologues and filters out functionally diverged proteins on the basis of functional annotations using a simple text mining approach. Results Large scale evaluation of our FEP extraction method is difficult as there is no gold-standard dataset against which the method can be benchmarked. However, a manual analysis of five protein families confirmed a high level of performance. A more extensive comparison with two manually verified functional equivalence datasets also demonstrated very good performance. Conclusion In summary, FOSTA provides an automated analysis of annotations in UniProtKB/Swiss-Prot to enable groups of proteins already annotated as functionally equivalent, to be extracted. Our results demonstrate that the vast majority of UniProtKB/Swiss-Prot functional annotations are of high quality, and that FOSTA can interpret annotations successfully. Where FOSTA is not successful, we are able to highlight inconsistencies in UniProtKB/Swiss-Prot annotation. Most of these would have presented equal difficulties for manual interpretation of annotations. We discuss limitations and possible future extensions to FOSTA, and

  13. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions.

    Science.gov (United States)

    Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Uversky, Vladimir N; Obradovic, Zoran

    2007-05-01

    Identifying relationships between function, amino acid sequence, and protein structure represents a major challenge. In this study, we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from the Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins, and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our

  14. SitesIdentify: a protein functional site prediction tool

    Directory of Open Access Journals (Sweden)

    Doig Andrew J

    2009-11-01

    Full Text Available Abstract Background The rate of protein structures being deposited in the Protein Data Bank surpasses the capacity to experimentally characterise them and therefore computational methods to analyse these structures have become increasingly important. Identifying the region of the protein most likely to be involved in function is useful in order to gain information about its potential role. There are many available approaches to predict functional site, but many are not made available via a publicly-accessible application. Results Here we present a functional site prediction tool (SitesIdentify, based on combining sequence conservation information with geometry-based cleft identification, that is freely available via a web-server. We have shown that SitesIdentify compares favourably to other functional site prediction tools in a comparison of seven methods on a non-redundant set of 237 enzymes with annotated active sites. Conclusion SitesIdentify is able to produce comparable accuracy in predicting functional sites to its closest available counterpart, but in addition achieves improved accuracy for proteins with few characterised homologues. SitesIdentify is available via a webserver at http://www.manchester.ac.uk/bioinformatics/sitesidentify/

  15. Emerging functions of ribosomal proteins in gene-specific transcription and translation

    International Nuclear Information System (INIS)

    Lindstroem, Mikael S.

    2009-01-01

    Ribosomal proteins have remained highly conserved during evolution presumably reflecting often critical functions in ribosome biogenesis or mature ribosome function. In addition, several ribosomal proteins possess distinct extra-ribosomal functions in apoptosis, DNA repair and transcription. An increasing number of ribosomal proteins have been shown to modulate the trans-activation function of important regulatory proteins such as NF-κB, p53, c-Myc and nuclear receptors. Furthermore, a subset of ribosomal proteins can bind directly to untranslated regions of mRNA resulting in transcript-specific translational control outside of the ribosome itself. Collectively, these findings suggest that ribosomal proteins may have a wider functional repertoire within the cell than previously thought. The future challenge is to identify and validate these novel functions in the background of an often essential primary function in ribosome biogenesis and cell growth.

  16. The PANTHER database of protein families, subfamilies, functions and pathways

    OpenAIRE

    Mi, Huaiyu; Lazareva-Ulitsky, Betty; Loo, Rozina; Kejariwal, Anish; Vandergriff, Jody; Rabkin, Steven; Guo, Nan; Muruganujan, Anushya; Doremieux, Olivier; Campbell, Michael J.; Kitano, Hiroaki; Thomas, Paul D.

    2004-01-01

    PANTHER is a large collection of protein families that have been subdivided into functionally related subfamilies, using human expertise. These subfamilies model the divergence of specific functions within protein families, allowing more accurate association with function (ontology terms and pathways), as well as inference of amino acids important for functional specificity. Hidden Markov models (HMMs) are built for each family and subfamily for classifying additional protein sequences. The l...

  17. Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins.

    Science.gov (United States)

    Varadi, Mihaly; Zsolyomi, Fruzsina; Guharoy, Mainak; Tompa, Peter

    2015-01-01

    Proteins form large macromolecular assemblies with RNA that govern essential molecular processes. RNA-binding proteins have often been associated with conformational flexibility, yet the extent and functional implications of their intrinsic disorder have never been fully assessed. Here, through large-scale analysis of comprehensive protein sequence and structure datasets we demonstrate the prevalence of intrinsic structural disorder in RNA-binding proteins and domains. We addressed their functionality through a quantitative description of the evolutionary conservation of disordered segments involved in binding, and investigated the structural implications of flexibility in terms of conformational stability and interface formation. We conclude that the functional role of intrinsically disordered protein segments in RNA-binding is two-fold: first, these regions establish extended, conserved electrostatic interfaces with RNAs via induced fit. Second, conformational flexibility enables them to target different RNA partners, providing multi-functionality, while also ensuring specificity. These findings emphasize the functional importance of intrinsically disordered regions in RNA-binding proteins.

  18. Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins.

    Directory of Open Access Journals (Sweden)

    Mihaly Varadi

    Full Text Available Proteins form large macromolecular assemblies with RNA that govern essential molecular processes. RNA-binding proteins have often been associated with conformational flexibility, yet the extent and functional implications of their intrinsic disorder have never been fully assessed. Here, through large-scale analysis of comprehensive protein sequence and structure datasets we demonstrate the prevalence of intrinsic structural disorder in RNA-binding proteins and domains. We addressed their functionality through a quantitative description of the evolutionary conservation of disordered segments involved in binding, and investigated the structural implications of flexibility in terms of conformational stability and interface formation. We conclude that the functional role of intrinsically disordered protein segments in RNA-binding is two-fold: first, these regions establish extended, conserved electrostatic interfaces with RNAs via induced fit. Second, conformational flexibility enables them to target different RNA partners, providing multi-functionality, while also ensuring specificity. These findings emphasize the functional importance of intrinsically disordered regions in RNA-binding proteins.

  19. Functionality of alternative protein in gluten-free product development.

    Science.gov (United States)

    Deora, Navneet Singh; Deswal, Aastha; Mishra, Hari Niwas

    2015-07-01

    Celiac disease is an immune-mediated disease triggered in genetically susceptible individuals by ingested gluten from wheat, rye, barley, and other closely related cereal grains. The current treatment for celiac disease is life-long adherence to a strict gluten-exclusion diet. The replacement of gluten presents a significant technological challenge, as it is an essential structure-building protein, which is necessary for formulating high-quality baked goods. A major limitation in the production of gluten-free products is the lack of protein functionality in non-wheat cereals. Additionally, commercial gluten-free mixes usually contain only carbohydrates, which may significantly limit the amount of protein in the diet. In the recent past, various approaches are attempted to incorporate protein-based ingredients and to modify the functional properties for gluten-free product development. This review aims to the highlight functionality of the alternative protein-based ingredients, which can be utilized for gluten-free product development both functionally as well as nutritionally. © The Author(s) 2014.

  20. Exploring protein dynamics space: the dynasome as the missing link between protein structure and function.

    Directory of Open Access Journals (Sweden)

    Ulf Hensen

    Full Text Available Proteins are usually described and classified according to amino acid sequence, structure or function. Here, we develop a minimally biased scheme to compare and classify proteins according to their internal mobility patterns. This approach is based on the notion that proteins not only fold into recurring structural motifs but might also be carrying out only a limited set of recurring mobility motifs. The complete set of these patterns, which we tentatively call the dynasome, spans a multi-dimensional space with axes, the dynasome descriptors, characterizing different aspects of protein dynamics. The unique dynamic fingerprint of each protein is represented as a vector in the dynasome space. The difference between any two vectors, consequently, gives a reliable measure of the difference between the corresponding protein dynamics. We characterize the properties of the dynasome by comparing the dynamics fingerprints obtained from molecular dynamics simulations of 112 proteins but our approach is, in principle, not restricted to any specific source of data of protein dynamics. We conclude that: 1. the dynasome consists of a continuum of proteins, rather than well separated classes. 2. For the majority of proteins we observe strong correlations between structure and dynamics. 3. Proteins with similar function carry out similar dynamics, which suggests a new method to improve protein function annotation based on protein dynamics.

  1. Coiled-Coil Proteins Facilitated the Functional Expansion of the Centrosome

    Science.gov (United States)

    Kuhn, Michael; Hyman, Anthony A.; Beyer, Andreas

    2014-01-01

    Repurposing existing proteins for new cellular functions is recognized as a main mechanism of evolutionary innovation, but its role in organelle evolution is unclear. Here, we explore the mechanisms that led to the evolution of the centrosome, an ancestral eukaryotic organelle that expanded its functional repertoire through the course of evolution. We developed a refined sequence alignment technique that is more sensitive to coiled coil proteins, which are abundant in the centrosome. For proteins with high coiled-coil content, our algorithm identified 17% more reciprocal best hits than BLAST. Analyzing 108 eukaryotic genomes, we traced the evolutionary history of centrosome proteins. In order to assess how these proteins formed the centrosome and adopted new functions, we computationally emulated evolution by iteratively removing the most recently evolved proteins from the centrosomal protein interaction network. Coiled-coil proteins that first appeared in the animal–fungi ancestor act as scaffolds and recruit ancestral eukaryotic proteins such as kinases and phosphatases to the centrosome. This process created a signaling hub that is crucial for multicellular development. Our results demonstrate how ancient proteins can be co-opted to different cellular localizations, thereby becoming involved in novel functions. PMID:24901223

  2. Growing functional modules from a seed protein via integration of protein interaction and gene expression data

    Directory of Open Access Journals (Sweden)

    Dimitrakopoulou Konstantina

    2007-10-01

    Full Text Available Abstract Background Nowadays modern biology aims at unravelling the strands of complex biological structures such as the protein-protein interaction (PPI networks. A key concept in the organization of PPI networks is the existence of dense subnetworks (functional modules in them. In recent approaches clustering algorithms were applied at these networks and the resulting subnetworks were evaluated by estimating the coverage of well-established protein complexes they contained. However, most of these algorithms elaborate on an unweighted graph structure which in turn fails to elevate those interactions that would contribute to the construction of biologically more valid and coherent functional modules. Results In the current study, we present a method that corroborates the integration of protein interaction and microarray data via the discovery of biologically valid functional modules. Initially the gene expression information is overlaid as weights onto the PPI network and the enriched PPI graph allows us to exploit its topological aspects, while simultaneously highlights enhanced functional association in specific pairs of proteins. Then we present an algorithm that unveils the functional modules of the weighted graph by expanding a kernel protein set, which originates from a given 'seed' protein used as starting-point. Conclusion The integrated data and the concept of our approach provide reliable functional modules. We give proofs based on yeast data that our method manages to give accurate results in terms both of structural coherency, as well as functional consistency.

  3. Single proteins that serve linked functions in intracellular and extracellular microenvironments

    Energy Technology Data Exchange (ETDEWEB)

    Radisky, Derek C.; Stallings-Mann, Melody; Hirai, Yohei; Bissell, Mina J.

    2009-06-03

    Maintenance of organ homeostasis and control of appropriate response to environmental alterations requires intimate coordination of cellular function and tissue organization. An important component of this coordination may be provided by proteins that can serve distinct, but linked, functions on both sides of the plasma membrane. Here we present a novel hypothesis in which non-classical secretion can provide a mechanism through which single proteins can integrate complex tissue functions. Single genes can exert a complex, dynamic influence through a number of different processes that act to multiply the function of the gene product(s). Alternative splicing can create many different transcripts that encode proteins of diverse, even antagonistic, function from a single gene. Posttranslational modifications can alter the stability, activity, localization, and even basic function of proteins. A protein can exist in different subcellular localizations. More recently, it has become clear that single proteins can function both inside and outside the cell. These proteins often lack defined secretory signal sequences, and transit the plasma membrane by mechanisms separate from the classical ER/Golgi secretory process. When examples of such proteins are examined individually, the multifunctionality and lack of a signal sequence are puzzling - why should a protein with a well known function in one context function in such a distinct fashion in another? We propose that one reason for a single protein to perform intracellular and extracellular roles is to coordinate organization and maintenance of a global tissue function. Here, we describe in detail three specific examples of proteins that act in this fashion, outlining their specific functions in the extracellular space and in the intracellular space, and we discuss how these functions may be linked. We present epimorphin/syntaxin-2, which may coordinate morphogenesis of secretory organs (as epimorphin) with control of

  4. Proteins of unknown function in the Protein Data Bank (PDB): an inventory of true uncharacterized proteins and computational tools for their analysis.

    Science.gov (United States)

    Nadzirin, Nurul; Firdaus-Raih, Mohd

    2012-10-08

    Proteins of uncharacterized functions form a large part of many of the currently available biological databases and this situation exists even in the Protein Data Bank (PDB). Our analysis of recent PDB data revealed that only 42.53% of PDB entries (1084 coordinate files) that were categorized under "unknown function" are true examples of proteins of unknown function at this point in time. The remainder 1465 entries also annotated as such appear to be able to have their annotations re-assessed, based on the availability of direct functional characterization experiments for the protein itself, or for homologous sequences or structures thus enabling computational function inference.

  5. Protein mislocalization: mechanisms, functions and clinical applications in cancer

    Science.gov (United States)

    Wang, Xiaohong; Li, Shulin

    2014-01-01

    The changes from normal cells to cancer cells are primarily regulated by genome instability, which foster hallmark functions of cancer through multiple mechanisms including protein mislocalization. Mislocalization of these proteins, including oncoproteins, tumor suppressors, and other cancer-related proteins, can interfere with normal cellular function and cooperatively drive tumor development and metastasis. This review describes the cancer-related effects of protein subcellular mislocalization, the related mislocalization mechanisms, and the potential application of this knowledge to cancer diagnosis, prognosis, and therapy. PMID:24709009

  6. The structure and function of endophilin proteins

    DEFF Research Database (Denmark)

    Kjaerulff, Ole; Brodin, Lennart; Jung, Anita

    2011-01-01

    Members of the BAR domain protein superfamily are essential elements of cellular traffic. Endophilins are among the best studied BAR domain proteins. They have a prominent function in synaptic vesicle endocytosis (SVE), receptor trafficking and apoptosis, and in other processes that require...

  7. Collagen targeting using multivalent protein-functionalized dendrimers

    NARCIS (Netherlands)

    Breurken, M.; Lempens, E.H.M.; Temming, R.P.; Helms, B.A.; Meijer, E.W.; Merkx, M.

    2011-01-01

    Collagen is an attractive marker for tissue remodeling in a variety of common disease processes. Here we report the preparation of protein dendrimers as multivalent collagen targeting ligands by native chemical ligation of the collagen binding protein CNA35 to cysteine-functionalized dendritic

  8. Computational design of proteins with novel structure and functions

    International Nuclear Information System (INIS)

    Yang Wei; Lai Lu-Hua

    2016-01-01

    Computational design of proteins is a relatively new field, where scientists search the enormous sequence space for sequences that can fold into desired structure and perform desired functions. With the computational approach, proteins can be designed, for example, as regulators of biological processes, novel enzymes, or as biotherapeutics. These approaches not only provide valuable information for understanding of sequence–structure–function relations in proteins, but also hold promise for applications to protein engineering and biomedical research. In this review, we briefly introduce the rationale for computational protein design, then summarize the recent progress in this field, including de novo protein design, enzyme design, and design of protein–protein interactions. Challenges and future prospects of this field are also discussed. (topical review)

  9. Milk protein tailoring to improve functional and biological properties

    Directory of Open Access Journals (Sweden)

    JEAN-MARC CHOBERT

    2012-01-01

    Full Text Available Proteins are involved in every aspects of life: structure, motion, catalysis, recognition and regulation. Today's highly sophisticated science of the modifications of proteins has ancient roots. The tailoring of proteins for food and medical uses precedes the beginning of what is called biochemistry. Chemical modification of proteins was pursued early in the twentieth century as an analytical procedure for side-chain amino acids. Later, methods were developed for specific inactivation of biologically active proteins and titration of their essential groups. Enzymatic modifications were mainly developed in the seventies when many more enzymes became economically available. Protein engineering has become a valuable tool for creating or improving proteins for practical use and has provided new insights into protein structure and function. The actual and potential use of milk proteins as food ingredients has been a popular topic for research over the past 40 years. With today's sophisticated analytical, biochemical and biological research tools, the presence of compounds with biological activity has been demonstrated. Improvements in separation techniques and enzyme technology have enabled efficient and economic isolation and modification of milk proteins, which has made possible their use as functional foods, dietary supplements, nutraceuticals and medical foods. In this review, some chemical and enzymatic modifications of milk proteins are described, with particular focus on their functional and biological properties.

  10. Protein Function Prediction Based on Sequence and Structure Information

    KAUST Repository

    Smaili, Fatima Z.

    2016-01-01

    operate. In this master thesis project, we worked on inferring protein functions based on the primary protein sequence. In the approach we follow, 3D models are first constructed using I-TASSER. Functions are then deduced by structurally matching

  11. Combining modularity, conservation, and interactions of proteins significantly increases precision and coverage of protein function prediction

    Directory of Open Access Journals (Sweden)

    Sers Christine T

    2010-12-01

    Full Text Available Abstract Background While the number of newly sequenced genomes and genes is constantly increasing, elucidation of their function still is a laborious and time-consuming task. This has led to the development of a wide range of methods for predicting protein functions in silico. We report on a new method that predicts function based on a combination of information about protein interactions, orthology, and the conservation of protein networks in different species. Results We show that aggregation of these independent sources of evidence leads to a drastic increase in number and quality of predictions when compared to baselines and other methods reported in the literature. For instance, our method generates more than 12,000 novel protein functions for human with an estimated precision of ~76%, among which are 7,500 new functional annotations for 1,973 human proteins that previously had zero or only one function annotated. We also verified our predictions on a set of genes that play an important role in colorectal cancer (MLH1, PMS2, EPHB4 and could confirm more than 73% of them based on evidence in the literature. Conclusions The combination of different methods into a single, comprehensive prediction method infers thousands of protein functions for every species included in the analysis at varying, yet always high levels of precision and very good coverage.

  12. Divergence, recombination and retention of functionality during protein evolution

    Directory of Open Access Journals (Sweden)

    Xu Yanlong O

    2005-09-01

    Full Text Available Abstract We have only a vague idea of precisely how protein sequences evolve in the context of protein structure and function. This is primarily because structural and functional contexts are not easily predictable from the primary sequence, and evaluating patterns of evolution at individual residue positions is also difficult. As a result of increasing biodiversity in genomics studies, progress is being made in detecting context-dependent variation in substitution processes, but it remains unclear exactly what context-dependent patterns we should be looking for. To address this, we have been simulating protein evolution in the context of structure and function using lattice models of proteins and ligands (or substrates. These simulations include thermodynamic features of protein stability and population dynamics. We refer to this approach as 'ab initio evolution' to emphasise the fact that the equilibrium details of fitness distributions arise from the physical principles of the system and not from any preconceived notions or arbitrary mathematical distributions. Here, we present results on the retention of functionality in homologous recombinants following population divergence. A central result is that protein structure characteristics can strongly influence recombinant functionality. Exceptional structures with many sequence options evolve quickly and tend to retain functionality -- even in highly diverged recombinants. By contrast, the more common structures with fewer sequence options evolve more slowly, but the fitness of recombinants drops off rapidly as homologous proteins diverge. These results have implications for understanding viral evolution, speciation and directed evolutionary experiments. Our analysis of the divergence process can also guide improved methods for accurately approximating folding probabilities in more complex but realistic systems.

  13. Usher proteins in inner ear structure and function.

    Science.gov (United States)

    Ahmed, Zubair M; Frolenkov, Gregory I; Riazuddin, Saima

    2013-11-01

    Usher syndrome (USH) is a neurosensory disorder affecting both hearing and vision in humans. Linkage studies of families of USH patients, studies in animals, and characterization of purified proteins have provided insight into the molecular mechanisms of hearing. To date, 11 USH proteins have been identified, and evidence suggests that all of them are crucial for the function of the mechanosensory cells of the inner ear, the hair cells. Most USH proteins are localized to the stereocilia of the hair cells, where mechano-electrical transduction (MET) of sound-induced vibrations occurs. Therefore, elucidation of the functions of USH proteins in the stereocilia is a prerequisite to understanding the exact mechanisms of MET.

  14. Moonlighting microtubule-associated proteins: regulatory functions by day and pathological functions at night.

    Science.gov (United States)

    Oláh, J; Tőkési, N; Lehotzky, A; Orosz, F; Ovádi, J

    2013-11-01

    The sensing, integrating, and coordinating features of the eukaryotic cells are achieved by the complex ultrastructural arrays and multifarious functions of the cytoskeletal network. Cytoskeleton comprises fibrous protein networks of microtubules, actin, and intermediate filaments. These filamentous polymer structures are highly dynamic and undergo constant and rapid reorganization during cellular processes. The microtubular system plays a crucial role in the brain, as it is involved in an enormous number of cellular events including cell differentiation and pathological inclusion formation. These multifarious functions of microtubules can be achieved by their decoration with proteins/enzymes that exert specific effects on the dynamics and organization of the cytoskeleton and mediate distinct functions due to their moonlighting features. This mini-review focuses on two aspects of the microtubule cytoskeleton. On the one hand, we describe the heteroassociation of tubulin/microtubules with metabolic enzymes, which in addition to their catalytic activities stabilize microtubule structures via their cross-linking functions. On the other hand, we focus on the recently identified moonlighting tubulin polymerization promoting protein, TPPP/p25. TPPP/p25 is a microtubule-associated protein and it displays distinct physiological or pathological (aberrant) functions; thus it is a prototype of Neomorphic Moonlighting Proteins. The expression of TPPP/p25 is finely controlled in the human brain; this protein is indispensable for the development of projections of oligodendrocytes that are responsible for the ensheathment of axons. The nonphysiological, higher or lower TPPP/p25 level leads to distinct CNS diseases. Mechanisms contributing to the control of microtubule stability and dynamics by metabolic enzymes and TPPP/p25 will be discussed. Copyright © 2013 Wiley Periodicals, Inc.

  15. Functional Anthology of Intrinsic Disorder. I. Biological Processes and Functions of Proteins with Long Disordered Regions

    Science.gov (United States)

    Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Uversky, Vladimir N.; Obradovic, Zoran

    2008-01-01

    Identifying relationships between function, amino acid sequence and protein structure represents a major challenge. In this study we propose a bioinformatics approach that identifies functional keywords in the Swiss-Prot database that correlate with intrinsic disorder. A statistical evaluation is employed to rank the significance of these correlations. Protein sequence data redundancy and the relationship between protein length and protein structure were taken into consideration to ensure the quality of the statistical inferences. Over 200,000 proteins from Swiss-Prot database were analyzed using this approach. The predictions of intrinsic disorder were carried out using PONDR VL3E predictor of long disordered regions that achieves an accuracy of above 86%. Overall, out of the 710 Swiss-Prot functional keywords that were each associated with at least 20 proteins, 238 were found to be strongly positively correlated with predicted long intrinsically disordered regions, whereas 302 were strongly negatively correlated with such regions. The remaining 170 keywords were ambiguous without strong positive or negative correlation with the disorder predictions. These functions cover a large variety of biological activities and imply that disordered regions are characterized by a wide functional repertoire. Our results agree well with literature findings, as we were able to find at least one illustrative example of functional disorder or order shown experimentally for the vast majority of keywords showing the strongest positive or negative correlation with intrinsic disorder. This work opens a series of three papers, which enriches the current view of protein structure-function relationships, especially with regards to functionalities of intrinsically disordered proteins and provides researchers with a novel tool that could be used to improve the understanding of the relationships between protein structure and function. The first paper of the series describes our statistical

  16. Predicting Protein Function via Semantic Integration of Multiple Networks.

    Science.gov (United States)

    Yu, Guoxian; Fu, Guangyuan; Wang, Jun; Zhu, Hailong

    2016-01-01

    Determining the biological functions of proteins is one of the key challenges in the post-genomic era. The rapidly accumulated large volumes of proteomic and genomic data drives to develop computational models for automatically predicting protein function in large scale. Recent approaches focus on integrating multiple heterogeneous data sources and they often get better results than methods that use single data source alone. In this paper, we investigate how to integrate multiple biological data sources with the biological knowledge, i.e., Gene Ontology (GO), for protein function prediction. We propose a method, called SimNet, to Semantically integrate multiple functional association Networks derived from heterogenous data sources. SimNet firstly utilizes GO annotations of proteins to capture the semantic similarity between proteins and introduces a semantic kernel based on the similarity. Next, SimNet constructs a composite network, obtained as a weighted summation of individual networks, and aligns the network with the kernel to get the weights assigned to individual networks. Then, it applies a network-based classifier on the composite network to predict protein function. Experiment results on heterogenous proteomic data sources of Yeast, Human, Mouse, and Fly show that, SimNet not only achieves better (or comparable) results than other related competitive approaches, but also takes much less time. The Matlab codes of SimNet are available at https://sites.google.com/site/guoxian85/simnet.

  17. The function of communities in protein interaction networks at multiple scales

    Directory of Open Access Journals (Sweden)

    Jones Nick S

    2010-07-01

    Full Text Available Abstract Background If biology is modular then clusters, or communities, of proteins derived using only protein interaction network structure should define protein modules with similar biological roles. We investigate the link between biological modules and network communities in yeast and its relationship to the scale at which we probe the network. Results Our results demonstrate that the functional homogeneity of communities depends on the scale selected, and that almost all proteins lie in a functionally homogeneous community at some scale. We judge functional homogeneity using a novel test and three independent characterizations of protein function, and find a high degree of overlap between these measures. We show that a high mean clustering coefficient of a community can be used to identify those that are functionally homogeneous. By tracing the community membership of a protein through multiple scales we demonstrate how our approach could be useful to biologists focusing on a particular protein. Conclusions We show that there is no one scale of interest in the community structure of the yeast protein interaction network, but we can identify the range of resolution parameters that yield the most functionally coherent communities, and predict which communities are most likely to be functionally homogeneous.

  18. Identifying the molecular functions of electron transport proteins using radial basis function networks and biochemical properties.

    Science.gov (United States)

    Le, Nguyen-Quoc-Khanh; Nguyen, Trinh-Trung-Duong; Ou, Yu-Yen

    2017-05-01

    The electron transport proteins have an important role in storing and transferring electrons in cellular respiration, which is the most proficient process through which cells gather energy from consumed food. According to the molecular functions, the electron transport chain components could be formed with five complexes with several different electron carriers and functions. Therefore, identifying the molecular functions in the electron transport chain is vital for helping biologists understand the electron transport chain process and energy production in cells. This work includes two phases for discriminating electron transport proteins from transport proteins and classifying categories of five complexes in electron transport proteins. In the first phase, the performances from PSSM with AAIndex feature set were successful in identifying electron transport proteins in transport proteins with achieved sensitivity of 73.2%, specificity of 94.1%, and accuracy of 91.3%, with MCC of 0.64 for independent data set. With the second phase, our method can approach a precise model for identifying of five complexes with different molecular functions in electron transport proteins. The PSSM with AAIndex properties in five complexes achieved MCC of 0.51, 0.47, 0.42, 0.74, and 1.00 for independent data set, respectively. We suggest that our study could be a power model for determining new proteins that belongs into which molecular function of electron transport proteins. Copyright © 2017 Elsevier Inc. All rights reserved.

  19. An expanding universe of the non-coding genome in cancer biology.

    Science.gov (United States)

    Xue, Bin; He, Lin

    2014-06-01

    Neoplastic transformation is caused by accumulation of genetic and epigenetic alterations that ultimately convert normal cells into tumor cells with uncontrolled proliferation and survival, unlimited replicative potential and invasive growth [Hanahan,D. et al. (2011) Hallmarks of cancer: the next generation. Cell, 144, 646-674]. Although the majority of the cancer studies have focused on the functions of protein-coding genes, emerging evidence has started to reveal the importance of the vast non-coding genome, which constitutes more than 98% of the human genome. A number of non-coding RNAs (ncRNAs) derived from the 'dark matter' of the human genome exhibit cancer-specific differential expression and/or genomic alterations, and it is increasingly clear that ncRNAs, including small ncRNAs and long ncRNAs (lncRNAs), play an important role in cancer development by regulating protein-coding gene expression through diverse mechanisms. In addition to ncRNAs, nearly half of the mammalian genomes consist of transposable elements, particularly retrotransposons. Once depicted as selfish genomic parasites that propagate at the expense of host fitness, retrotransposon elements could also confer regulatory complexity to the host genomes during development and disease. Reactivation of retrotransposons in cancer, while capable of causing insertional mutagenesis and genome rearrangements to promote oncogenesis, could also alter host gene expression networks to favor tumor development. Taken together, the functional significance of non-coding genome in tumorigenesis has been previously underestimated, and diverse transcripts derived from the non-coding genome could act as integral functional components of the oncogene and tumor suppressor network. © The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

  20. Functional studies on the phosphatidychloride transfer protein

    NARCIS (Netherlands)

    Brouwer, A.P.M. de

    2002-01-01

    The phosphatidylcholine transfer protein (PC-TP) has been studied for over 30 years now. Despite extensive research concerning the biochemical, biophysical and structural properties of PC-TP, the function of this protein is still elusive. We have studied in vitro the folding and the mechanism of PC

  1. Functional dynamics of cell surface membrane proteins.

    Science.gov (United States)

    Nishida, Noritaka; Osawa, Masanori; Takeuchi, Koh; Imai, Shunsuke; Stampoulis, Pavlos; Kofuku, Yutaka; Ueda, Takumi; Shimada, Ichio

    2014-04-01

    Cell surface receptors are integral membrane proteins that receive external stimuli, and transmit signals across plasma membranes. In the conventional view of receptor activation, ligand binding to the extracellular side of the receptor induces conformational changes, which convert the structure of the receptor into an active conformation. However, recent NMR studies of cell surface membrane proteins have revealed that their structures are more dynamic than previously envisioned, and they fluctuate between multiple conformations in an equilibrium on various timescales. In addition, NMR analyses, along with biochemical and cell biological experiments indicated that such dynamical properties are critical for the proper functions of the receptors. In this review, we will describe several NMR studies that revealed direct linkage between the structural dynamics and the functions of the cell surface membrane proteins, such as G-protein coupled receptors (GPCRs), ion channels, membrane transporters, and cell adhesion molecules. Copyright © 2013 Elsevier Inc. All rights reserved.

  2. New insights into potential functions for the protein 4.1superfamily of proteins in kidney epithelium

    Energy Technology Data Exchange (ETDEWEB)

    Calinisan, Venice; Gravem, Dana; Chen, Ray Ping-Hsu; Brittin,Sachi; Mohandas, Narla; Lecomte, Marie-Christine; Gascard, Philippe

    2005-06-17

    Members of the protein 4.1 family of adapter proteins are expressed in a broad panel of tissues including various epithelia where they likely play an important role in maintenance of cell architecture and polarity and in control of cell proliferation. We have recently characterized the structure and distribution of three members of the protein 4.1 family, 4.1B, 4.1R and 4.1N, in mouse kidney. We describe here binding partners for renal 4.1 proteins, identified through the screening of a rat kidney yeast two-hybrid system cDNA library. The identification of putative protein 4.1-based complexes enables us to envision potential functions for 4.1 proteins in kidney: organization of signaling complexes, response to osmotic stress, protein trafficking, and control of cell proliferation. We discuss the relevance of these protein 4.1-based interactions in kidney physio-pathology in the context of their previously identified functions in other cells and tissues. Specifically, we will focus on renal 4.1 protein interactions with beta amyloid precursor protein (beta-APP), 14-3-3 proteins, and the cell swelling-activated chloride channel pICln. We also discuss the functional relevance of another member of the protein 4.1 superfamily, ezrin, in kidney physiopathology.

  3. Developing Novel Protein-based Materials using Ultrabithorax: Production, Characterization, and Functionalization

    Science.gov (United States)

    Huang, Zhao

    2011-12-01

    Compared to 'conventional' materials made from metal, glass, or ceramics, protein-based materials have unique mechanical properties. Furthermore, the morphology, mechanical properties, and functionality of protein-based materials may be optimized via sequence engineering for use in a variety of applications, including textile materials, biosensors, and tissue engineering scaffolds. The development of recombinant DNA technology has enabled the production and engineering of protein-based materials ex vivo. However, harsh production conditions can compromise the mechanical properties of protein-based materials and diminish their ability to incorporate functional proteins. Developing a new generation of protein-based materials is crucial to (i) improve materials assembly conditions, (ii) create novel mechanical properties, and (iii) expand the capacity to carry functional protein/peptide sequences. This thesis describes development of novel protein-based materials using Ultrabithorax, a member of the Hox family of proteins that regulate developmental pathways in Drosophila melanogaster. The experiments presented (i) establish the conditions required for the assembly of Ubx-based materials, (ii) generate a wide range of Ubx morphologies, (iii) examine the mechanical properties of Ubx fibers, (iv) incorporate protein functions to Ubx-based materials via gene fusion, (v) pattern protein functions within the Ubx materials, and (vi) examine the biocompatibility of Ubx materials in vitro. Ubx-based materials assemble at mild conditions compatible with protein folding and activity, which enables Ubx chimeric materials to retain the function of appended proteins in spatial patterns determined by materials assembly. Ubx-based materials also display mechanical properties comparable to existing protein-based materials and demonstrate good biocompatibility with living cells in vitro. Taken together, this research demonstrates the unique features and future potential of novel Ubx

  4. A three-way approach for protein function classification.

    Directory of Open Access Journals (Sweden)

    Hafeez Ur Rehman

    Full Text Available The knowledge of protein functions plays an essential role in understanding biological cells and has a significant impact on human life in areas such as personalized medicine, better crops and improved therapeutic interventions. Due to expense and inherent difficulty of biological experiments, intelligent methods are generally relied upon for automatic assignment of functions to proteins. The technological advancements in the field of biology are improving our understanding of biological processes and are regularly resulting in new features and characteristics that better describe the role of proteins. It is inevitable to neglect and overlook these anticipated features in designing more effective classification techniques. A key issue in this context, that is not being sufficiently addressed, is how to build effective classification models and approaches for protein function prediction by incorporating and taking advantage from the ever evolving biological information. In this article, we propose a three-way decision making approach which provides provisions for seeking and incorporating future information. We considered probabilistic rough sets based models such as Game-Theoretic Rough Sets (GTRS and Information-Theoretic Rough Sets (ITRS for inducing three-way decisions. An architecture of protein functions classification with probabilistic rough sets based three-way decisions is proposed and explained. Experiments are carried out on Saccharomyces cerevisiae species dataset obtained from Uniprot database with the corresponding functional classes extracted from the Gene Ontology (GO database. The results indicate that as the level of biological information increases, the number of deferred cases are reduced while maintaining similar level of accuracy.

  5. Proteins of Unknown Function in the Protein Data Bank (PDB: An Inventory of True Uncharacterized Proteins and Computational Tools for Their Analysis

    Directory of Open Access Journals (Sweden)

    Nurul Nadzirin

    2012-10-01

    Full Text Available Proteins of uncharacterized functions form a large part of many of the currently available biological databases and this situation exists even in the Protein Data Bank (PDB. Our analysis of recent PDB data revealed that only 42.53% of PDB entries (1084 coordinate files that were categorized under “unknown function” are true examples of proteins of unknown function at this point in time. The remainder 1465 entries also annotated as such appear to be able to have their annotations re-assessed, based on the availability of direct functional characterization experiments for the protein itself, or for homologous sequences or structures thus enabling computational function inference.

  6. Functions and structures of eukaryotic recombination proteins

    International Nuclear Information System (INIS)

    Ogawa, Tomoko

    1994-01-01

    We have found that Rad51 and RecA Proteins form strikingly similar structures together with dsDNA and ATP. Their right handed helical nucleoprotein filaments extend the B-form DNA double helixes to 1.5 times in length and wind the helix. The similarity and uniqueness of their structures must reflect functional homologies between these proteins. Therefore, it is highly probable that similar recombination proteins are present in various organisms of different evolutional states. We have succeeded to clone RAD51 genes from human, mouse, chicken and fission yeast genes, and found that the homologues are widely distributed in eukaryotes. The HsRad51 and MmRad51 or ChRad51 proteins consist of 339 amino acids differing only by 4 or 12 amino acids, respectively, and highly homologous to both yeast proteins, but less so to Dmcl. All of these proteins are homologous to the region from residues 33 to 240 of RecA which was named ''homologous core. The homologous core is likely to be responsible for functions common for all of them, such as the formation of helical nucleoprotein filament that is considered to be involved in homologous pairing in the recombination reaction. The mouse gene is transcribed at a high level in thymus, spleen, testis, and ovary, at lower level in brain and at a further lower level in some other tissues. It is transcribed efficiently in recombination active tissues. A clear functional difference of Rad51 homologues from RecA was suggested by the failure of heterologous genes to complement the deficiency of Scrad51 mutants. This failure seems to reflect the absence of a compatible partner, such as ScRad52 protein in the case of ScRad51 protein, between different species. Thus, these discoveries play a role of the starting point to understand the fundamental gene targeting in mammalian cells and in gene therapy. (J.P.N.)

  7. Functional similarities between the dictyostelium protein AprA and the human protein dipeptidyl-peptidase IV.

    Science.gov (United States)

    Herlihy, Sarah E; Tang, Yu; Phillips, Jonathan E; Gomer, Richard H

    2017-03-01

    Autocrine proliferation repressor protein A (AprA) is a protein secreted by Dictyostelium discoideum cells. Although there is very little sequence similarity between AprA and any human protein, AprA has a predicted structural similarity to the human protein dipeptidyl peptidase IV (DPPIV). AprA is a chemorepellent for Dictyostelium cells, and DPPIV is a chemorepellent for neutrophils. This led us to investigate if AprA and DPPIV have additional functional similarities. We find that like AprA, DPPIV is a chemorepellent for, and inhibits the proliferation of, D. discoideum cells, and that AprA binds some DPPIV binding partners such as fibronectin. Conversely, rAprA has DPPIV-like protease activity. These results indicate a functional similarity between two eukaryotic chemorepellent proteins with very little sequence similarity, and emphasize the usefulness of using a predicted protein structure to search a protein structure database, in addition to searching for proteins with similar sequences. © 2016 The Protein Society.

  8. An evaluation of a SVA retrotransposon in the FUS promoter as a transcriptional regulator and its association to ALS.

    Directory of Open Access Journals (Sweden)

    Abigail L Savage

    Full Text Available Genetic mutations of FUS have been linked to many diseases including Amyotrophic Lateral Sclerosis (ALS and Frontotemporal Lobar Degeneration. A primate specific and polymorphic retrotransposon of the SINE-VNTR-Alu (SVA family is present upstream of the FUS gene. Here we have demonstrated that this retrotransposon can act as a classical transcriptional regulatory domain in the context of a reporter gene construct both in vitro in the human SK-N-AS neuroblastoma cell line and in vivo in a chick embryo model. We have also demonstrated that the SVA is composed of multiple distinct regulatory domains, one of which is a variable number tandem repeat (VNTR. The ability of the SVA and its component parts to direct reporter gene expression supported a hypothesis that this region could direct differential FUS expression in vivo. The SVA may therefore contribute to the modulation of FUS expression exhibited in and associated with neurological disorders including ALS where FUS regulation may be an important parameter in progression of the disease. As VNTRs are often clinical associates for disease progression we determined the extent of polymorphism within the SVA. In total 2 variants of the SVA were identified based within a central VNTR. Preliminary analysis addressed the association of these SVA variants within a small sporadic ALS cohort but did not reach statistical significance, although we did not include other parameters such as SNPs within the SVA or an environmental factor in this analysis. The latter may be particularly important as the transcriptional and epigenetic properties of the SVA are likely to be directed by the environment of the cell.

  9. Production of functional protein hydrolysates from Egyptian breeds ...

    African Journals Online (AJOL)

    Production of functional protein hydrolysates from Egyptian breeds of soybean and lupin seeds. AA khalil, SS Mohamed, FS Taha, EN Karlsson. Abstract. Enzymatic hydrolysis is an agro-processing aid that can be utilized in order to improve nutritional quality of protein extracts from many sources. In this study, protein ...

  10. Membrane Protein Production in Lactococcus lactis for Functional Studies.

    Science.gov (United States)

    Seigneurin-Berny, Daphne; King, Martin S; Sautron, Emiline; Moyet, Lucas; Catty, Patrice; André, François; Rolland, Norbert; Kunji, Edmund R S; Frelet-Barrand, Annie

    2016-01-01

    Due to their unique properties, expression and study of membrane proteins in heterologous systems remains difficult. Among the bacterial systems available, the Gram-positive lactic bacterium, Lactococcus lactis, traditionally used in food fermentations, is nowadays widely used for large-scale production and functional characterization of bacterial and eukaryotic membrane proteins. The aim of this chapter is to describe the different possibilities for the functional characterization of peripheral or intrinsic membrane proteins expressed in Lactococcus lactis.

  11. Hierarchical partitioning of metazoan protein conservation profiles provides new functional insights.

    Directory of Open Access Journals (Sweden)

    Jonathan Witztum

    Full Text Available The availability of many complete, annotated proteomes enables the systematic study of the relationships between protein conservation and functionality. We explore this question based solely on the presence or absence of protein homologues (a.k.a. conservation profiles. We study 18 metazoans, from two distinct points of view: the human's and the fly's. Using the GOrilla gene ontology (GO analysis tool, we explore functional enrichment of the "universal proteins", those with homologues in all 17 other species, and of the "non-universal proteins". A large number of GO terms are strongly enriched in both human and fly universal proteins. Most of these functions are known to be essential. A smaller number of GO terms, exhibiting markedly different properties, are enriched in both human and fly non-universal proteins. We further explore the non-universal proteins, whose conservation profiles are consistent with the "tree of life" (TOL consistent, as well as the TOL inconsistent proteins. Finally, we applied Quantum Clustering to the conservation profiles of the TOL consistent proteins. Each cluster is strongly associated with one or a small number of specific monophyletic clades in the tree of life. The proteins in many of these clusters exhibit strong functional enrichment associated with the "life style" of the related clades. Most previous approaches for studying function and conservation are "bottom up", studying protein families one by one, and separately assessing the conservation of each. By way of contrast, our approach is "top down". We globally partition the set of all proteins hierarchically, as described above, and then identify protein families enriched within different subdivisions. While supporting previous findings, our approach also provides a tool for discovering novel relations between protein conservation profiles, functionality, and evolutionary history as represented by the tree of life.

  12. Diversity and functions of protein glycosylation in insects.

    Science.gov (United States)

    Walski, Tomasz; De Schutter, Kristof; Van Damme, Els J M; Smagghe, Guy

    2017-04-01

    The majority of proteins is modified with carbohydrate structures. This modification, called glycosylation, was shown to be crucial for protein folding, stability and subcellular location, as well as protein-protein interactions, recognition and signaling. Protein glycosylation is involved in multiple physiological processes, including embryonic development, growth, circadian rhythms, cell attachment as well as maintenance of organ structure, immunity and fertility. Although the general principles of glycosylation are similar among eukaryotic organisms, insects synthesize a distinct repertoire of glycan structures compared to plants and vertebrates. Consequently, a number of unique insect glycans mediate functions specific to this class of invertebrates. For instance, the core α1,3-fucosylation of N-glycans is absent in vertebrates, while in insects this modification is crucial for the development of wings and the nervous system. At present, most of the data on insect glycobiology comes from research in Drosophila. Yet, progressively more information on the glycan structures and the importance of glycosylation in other insects like beetles, caterpillars, aphids and bees is becoming available. This review gives a summary of the current knowledge and recent progress related to glycan diversity and function(s) of protein glycosylation in insects. We focus on N- and O-glycosylation, their synthesis, physiological role(s), as well as the molecular and biochemical basis of these processes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  13. Retrotransposons of the Tnt1B family are mobile in Nicotiana plumbaginifolia and can induce alternative splicing of the host gene upon insertion.

    Science.gov (United States)

    Leprinc, A S; Grandbastien, M A; Christian, M

    2001-11-01

    Active retrotransposons have been identified in Nicotiana plumbaginifolia by their ability to disrupt the nitrate reductase gene in chlorate-resistant mutants selected from protoplast-derived cultures. In mutants E23 and F97, two independent insertions of Tnp2, a new retrotransposon closely related to the tobacco Tnt1 elements, were detected in the nitrate reductase gene. These two Tnp2 elements are members of the Tnt1B subfamily which shows that Tnt1B elements can be active and mutagenic in the N. plumbaginifolia genome. Furthermore, these results suggest that Tnt1B is the most active family of Tntl elements in N. plumbaginifolia, whereas in tobacco only members of the Tnt1A subfamily were found inserted in the nitrate reductase gene. The transcriptional regulations of Tnp2 and Tnt1A elements are most probably different due to non-conserved U3 regions. Our results thus support the hypothesis that different Nicotiana species contain different active Tntl subfamilies and that only one active Tntl subfamily might be maintained in each of these species. The Tnp2 insertion found in the F97 mutant was found to be spliced out of the nitrate reductase mRNA by activation of cryptic donor and acceptor sites in the nitrate reductase and the Tnp2 sequences respectively.

  14. Wiki-pi: a web-server of annotated human protein-protein interactions to aid in discovery of protein function.

    Directory of Open Access Journals (Sweden)

    Naoki Orii

    Full Text Available Protein-protein interactions (PPIs are the basis of biological functions. Knowledge of the interactions of a protein can help understand its molecular function and its association with different biological processes and pathways. Several publicly available databases provide comprehensive information about individual proteins, such as their sequence, structure, and function. There also exist databases that are built exclusively to provide PPIs by curating them from published literature. The information provided in these web resources is protein-centric, and not PPI-centric. The PPIs are typically provided as lists of interactions of a given gene with links to interacting partners; they do not present a comprehensive view of the nature of both the proteins involved in the interactions. A web database that allows search and retrieval based on biomedical characteristics of PPIs is lacking, and is needed. We present Wiki-Pi (read Wiki-π, a web-based interface to a database of human PPIs, which allows users to retrieve interactions by their biomedical attributes such as their association to diseases, pathways, drugs and biological functions. Each retrieved PPI is shown with annotations of both of the participant proteins side-by-side, creating a basis to hypothesize the biological function facilitated by the interaction. Conceptually, it is a search engine for PPIs analogous to PubMed for scientific literature. Its usefulness in generating novel scientific hypotheses is demonstrated through the study of IGSF21, a little-known gene that was recently identified to be associated with diabetic retinopathy. Using Wiki-Pi, we infer that its association to diabetic retinopathy may be mediated through its interactions with the genes HSPB1, KRAS, TMSB4X and DGKD, and that it may be involved in cellular response to external stimuli, cytoskeletal organization and regulation of molecular activity. The website also provides a wiki-like capability allowing users

  15. Epigenetic regulation of transcription and possible functions of mammalian short interspersed elements, SINEs.

    Science.gov (United States)

    Ichiyanagi, Kenji

    2013-01-01

    Short interspersed elements (SINEs) are a class of retrotransposons, which amplify their copy numbers in their host genomes by retrotransposition. More than a million copies of SINEs are present in a mammalian genome, constituting over 10% of the total genomic sequence. In contrast to the other two classes of retrotransposons, long interspersed elements (LINEs) and long terminal repeat (LTR) elements, SINEs are transcribed by RNA polymerase III. However, like LINEs and LTR elements, the SINE transcription is likely regulated by epigenetic mechanisms such as DNA methylation, at least for human Alu and mouse B1. Whereas SINEs and other transposable elements have long been thought as selfish or junk DNA, recent studies have revealed that they play functional roles at their genomic locations, for example, as distal enhancers, chromatin boundaries and binding sites of many transcription factors. These activities imply that SINE retrotransposition has shaped the regulatory network and chromatin landscape of their hosts. Whereas it is thought that the epigenetic mechanisms were originated as a host defense system against proliferation of parasitic elements, this review discusses a possibility that the same mechanisms are also used to regulate the SINE-derived functions.

  16. Diversity, classification and function of the plant protein kinase superfamily

    OpenAIRE

    Lehti-Shiu, Melissa D.; Shiu, Shin-Han

    2012-01-01

    Eukaryotic protein kinases belong to a large superfamily with hundreds to thousands of copies and are components of essentially all cellular functions. The goals of this study are to classify protein kinases from 25 plant species and to assess their evolutionary history in conjunction with consideration of their molecular functions. The protein kinase superfamily has expanded in the flowering plant lineage, in part through recent duplications. As a result, the flowering plant protein kinase r...

  17. Functional similarities between the dictyostelium protein AprA and the human protein dipeptidyl‐peptidase IV

    Science.gov (United States)

    Herlihy, Sarah E.; Tang, Yu; Phillips, Jonathan E.

    2017-01-01

    Abstract Autocrine proliferation repressor protein A (AprA) is a protein secreted by Dictyostelium discoideum cells. Although there is very little sequence similarity between AprA and any human protein, AprA has a predicted structural similarity to the human protein dipeptidyl peptidase IV (DPPIV). AprA is a chemorepellent for Dictyostelium cells, and DPPIV is a chemorepellent for neutrophils. This led us to investigate if AprA and DPPIV have additional functional similarities. We find that like AprA, DPPIV is a chemorepellent for, and inhibits the proliferation of, D. discoideum cells, and that AprA binds some DPPIV binding partners such as fibronectin. Conversely, rAprA has DPPIV‐like protease activity. These results indicate a functional similarity between two eukaryotic chemorepellent proteins with very little sequence similarity, and emphasize the usefulness of using a predicted protein structure to search a protein structure database, in addition to searching for proteins with similar sequences. PMID:28028841

  18. Geometrical comparison of two protein structures using Wigner-D functions.

    Science.gov (United States)

    Saberi Fathi, S M; White, Diana T; Tuszynski, Jack A

    2014-10-01

    In this article, we develop a quantitative comparison method for two arbitrary protein structures. This method uses a root-mean-square deviation characterization and employs a series expansion of the protein's shape function in terms of the Wigner-D functions to define a new criterion, which is called a "similarity value." We further demonstrate that the expansion coefficients for the shape function obtained with the help of the Wigner-D functions correspond to structure factors. Our method addresses the common problem of comparing two proteins with different numbers of atoms. We illustrate it with a worked example. © 2014 Wiley Periodicals, Inc.

  19. HP1γ function is required for male germ cell survival and spermatogenesis

    Directory of Open Access Journals (Sweden)

    Brown Jeremy P

    2010-04-01

    Full Text Available Abstract Background HP1 proteins are conserved components of eukaryotic constitutive heterochromatin. In mammals, there are three genes that encode HP1-like proteins, termed HP1α, HP1β and HP1γ, which have a high degree of homology This paper describes for the first time, to our knowledge, the physiological function of HP1γ using a gene-targeted mouse. Results While targeting the Cbx3 gene (encoding the HP1γ protein with a conditional targeting vector, we generated a hypomorphic allele (Cbx3hypo, which resulted in much reduced (barely detectable levels of HP1γ protein. Homozygotes for the hypomorphic allele (Cbx3hypo/hypo are rare, with only 1% of Cbx3hypo/hypo animals reaching adulthood. Adult males exhibit a severe hypogonadism that is associated with a loss of germ cells, with some seminiferous tubules retaining only the supporting Sertoli cells (Sertoli cell-only phenotype. The percentage of seminiferous tubules that are positive for L1 ORF1 protein (ORF1p in Cbx3hypo/hypo testes is greater than that for wild-type testes, indicating that L1 retrotransposon silencing is reversed, leading to ectopic expression of ORF1p in Cbx3hypo/hypo germ cells. Conclusions The Cbx3 gene product (the HP1γ protein has a non-redundant function during spermatogenesis that cannot be compensated for by the other two HP1 isotypes. The Cbx3hypo/hypo spermatogenesis defect is similar to that found in Miwi2 and Dnmt3L mutants. The Cbx3 gene-targeted mice generated in this study provide an appropriate model for the study of HP1γ in transposon silencing and parental imprinting.

  20. The yeast Ty3 retrotransposon contains a 5'-3' bipartite primer-binding site and encodes nucleocapsid protein NCp9 functionally homologous to HIV-1 NCp7.

    Science.gov (United States)

    Gabus, C; Ficheux, D; Rau, M; Keith, G; Sandmeyer, S; Darlix, J L

    1998-08-17

    Retroviruses, including HIV-1 and the distantly related yeast retroelement Ty3, all encode a nucleoprotein required for virion structure and replication. During an in vitro comparison of HIV-1 and Ty3 nucleoprotein function in RNA dimerization and cDNA synthesis, we discovered a bipartite primer-binding site (PBS) for Ty3 composed of sequences located at opposite ends of the genome. Ty3 cDNA synthesis requires the 3' PBS for primer tRNAiMet annealing to the genomic RNA, and the 5' PBS, in cis or in trans, as the reverse transcription start site. Ty3 RNA alone is unable to dimerize, but formation of dimeric tRNAiMet bound to the PBS was found to direct dimerization of Ty3 RNA-tRNAiMet. Interestingly, HIV-1 nucleocapsid protein NCp7 and Ty3 NCp9 were interchangeable using HIV-1 and Ty3 RNA template-primer systems. Our findings impact on the understanding of non-canonical reverse transcription as well as on the use of Ty3 systems to screen for anti-NCp7 drugs.

  1. Non-coding RNAs enter mitosis: functions, conservation and implications

    OpenAIRE

    Pek, Jun Wei; Kai, Toshie

    2011-01-01

    Abstract Nuage (or commonly known as chromatoid body in mammals) is a conserved germline-specific organelle that has been linked to the Piwi-interacting RNA (piRNA) pathway. piRNAs are a class of gonadal-specific RNAs that are ~23-29 nucleotides in length and protect genome stability by repressing the expression of deleterious retrotransposons. More recent studies in Drosophila have implicated the piRNA pathway in other functions including canalization of embryonic development, regulation of ...

  2. Tet protein function during Drosophila development.

    Directory of Open Access Journals (Sweden)

    Fei Wang

    Full Text Available The TET (Ten-eleven translocation 1, 2 and 3 proteins have been shown to function as DNA hydroxymethylases in vertebrates and their requirements have been documented extensively. Recently, the Tet proteins have been shown to also hydroxylate 5-methylcytosine in RNA. 5-hydroxymethylcytosine (5hmrC is enriched in messenger RNA but the function of this modification has yet to be elucidated. Because Cytosine methylation in DNA is barely detectable in Drosophila, it serves as an ideal model to study the biological function of 5hmrC. Here, we characterized the temporal and spatial expression and requirement of Tet throughout Drosophila development. We show that Tet is essential for viability as Tet complete loss-of-function animals die at the late pupal stage. Tet is highly expressed in neuronal tissues and at more moderate levels in somatic muscle precursors in embryos and larvae. Depletion of Tet in muscle precursors at early embryonic stages leads to defects in larval locomotion and late pupal lethality. Although Tet knock-down in neuronal tissue does not cause lethality, it is essential for neuronal function during development through its affects upon locomotion in larvae and the circadian rhythm of adult flies. Further, we report the function of Tet in ovarian morphogenesis. Together, our findings provide basic insights into the biological function of Tet in Drosophila, and may illuminate observed neuronal and muscle phenotypes observed in vertebrates.

  3. A proteomics strategy to elucidate functional protein-protein interactions applied to EGF signaling

    DEFF Research Database (Denmark)

    Blagoev, B.; Kratchmarova, I.; Ong, S.E.

    2003-01-01

    Mass spectrometry-based proteomics can reveal protein-protein interactions on a large scale, but it has been difficult to separate background binding from functionally important interactions and still preserve weak binders. To investigate the epidermal growth factor receptor (EGFR) pathway, we em...

  4. Challenges in the Development of Functional Assays of Membrane Proteins

    Directory of Open Access Journals (Sweden)

    Sophie Demarche

    2012-11-01

    Full Text Available Lipid bilayers are natural barriers of biological cells and cellular compartments. Membrane proteins integrated in biological membranes enable vital cell functions such as signal transduction and the transport of ions or small molecules. In order to determine the activity of a protein of interest at defined conditions, the membrane protein has to be integrated into artificial lipid bilayers immobilized on a surface. For the fabrication of such biosensors expertise is required in material science, surface and analytical chemistry, molecular biology and biotechnology. Specifically, techniques are needed for structuring surfaces in the micro- and nanometer scale, chemical modification and analysis, lipid bilayer formation, protein expression, purification and solubilization, and most importantly, protein integration into engineered lipid bilayers. Electrochemical and optical methods are suitable to detect membrane activity-related signals. The importance of structural knowledge to understand membrane protein function is obvious. Presently only a few structures of membrane proteins are solved at atomic resolution. Functional assays together with known structures of individual membrane proteins will contribute to a better understanding of vital biological processes occurring at biological membranes. Such assays will be utilized in the discovery of drugs, since membrane proteins are major drug targets.

  5. Using RNA Interference to Study Protein Function

    OpenAIRE

    Curtis, Carol D.; Nardulli, Ann M.

    2009-01-01

    RNA interference can be extremely useful in determining the function of an endogenously-expressed protein in its normal cellular environment. In this chapter, we describe a method that uses small interfering RNA (siRNA) to knock down mRNA and protein expression in cultured cells so that the effect of a putative regulatory protein on gene expression can be delineated. Methods of assessing the effectiveness of the siRNA procedure using real time quantitative PCR and Western analysis are also in...

  6. Intricate knots in proteins: Function and evolution.

    Directory of Open Access Journals (Sweden)

    Peter Virnau

    2006-09-01

    Full Text Available Our investigation of knotted structures in the Protein Data Bank reveals the most complicated knot discovered to date. We suggest that the occurrence of this knot in a human ubiquitin hydrolase might be related to the role of the enzyme in protein degradation. While knots are usually preserved among homologues, we also identify an exception in a transcarbamylase. This allows us to exemplify the function of knots in proteins and to suggest how they may have been created.

  7. Exploring overlapping functional units with various structure in protein interaction networks.

    Directory of Open Access Journals (Sweden)

    Xiao-Fei Zhang

    Full Text Available Revealing functional units in protein-protein interaction (PPI networks are important for understanding cellular functional organization. Current algorithms for identifying functional units mainly focus on cohesive protein complexes which have more internal interactions than external interactions. Most of these approaches do not handle overlaps among complexes since they usually allow a protein to belong to only one complex. Moreover, recent studies have shown that other non-cohesive structural functional units beyond complexes also exist in PPI networks. Thus previous algorithms that just focus on non-overlapping cohesive complexes are not able to present the biological reality fully. Here, we develop a new regularized sparse random graph model (RSRGM to explore overlapping and various structural functional units in PPI networks. RSRGM is principally dominated by two model parameters. One is used to define the functional units as groups of proteins that have similar patterns of connections to others, which allows RSRGM to detect non-cohesive structural functional units. The other one is used to represent the degree of proteins belonging to the units, which supports a protein belonging to more than one revealed unit. We also propose a regularizer to control the smoothness between the estimators of these two parameters. Experimental results on four S. cerevisiae PPI networks show that the performance of RSRGM on detecting cohesive complexes and overlapping complexes is superior to that of previous competing algorithms. Moreover, RSRGM has the ability to discover biological significant functional units besides complexes.

  8. Automatic annotation of protein motif function with Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Gopalakrishnan Vanathi

    2004-09-01

    Full Text Available Abstract Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paperpresents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifsis viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association isfound to be a very useful feature. We take advantageof the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correctassociation. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about thefunctions of newly discovered candidate protein motifs.

  9. Crystallization of bi-functional ligand protein complexes.

    Science.gov (United States)

    Antoni, Claudia; Vera, Laura; Devel, Laurent; Catalani, Maria Pia; Czarny, Bertrand; Cassar-Lajeunesse, Evelyn; Nuti, Elisa; Rossello, Armando; Dive, Vincent; Stura, Enrico Adriano

    2013-06-01

    Homodimerization is important in signal transduction and can play a crucial role in many other biological systems. To obtaining structural information for the design of molecules able to control the signalization pathways, the proteins involved will have to be crystallized in complex with ligands that induce dimerization. Bi-functional drugs have been generated by linking two ligands together chemically and the relative crystallizability of complexes with mono-functional and bi-functional ligands has been evaluated. There are problems associated with crystallization with such ligands, but overall, the advantages appear to be greater than the drawbacks. The study involves two matrix metalloproteinases, MMP-12 and MMP-9. Using flexible and rigid linkers we show that it is possible to control the crystal packing and that by changing the ligand-enzyme stoichiometric ratio, one can toggle between having one bi-functional ligand binding to two enzymes and having the same ligand bound to each enzyme. The nature of linker and its point of attachment on the ligand can be varied to aid crystallization, and such variations can also provide valuable structural information about the interactions made by the linker with the protein. We report here the crystallization and structure determination of seven ligand-dimerized complexes. These results suggest that the use of bi-functional drugs can be extended beyond the realm of protein dimerization to include all drug design projects. Copyright © 2013 Elsevier Inc. All rights reserved.

  10. Post-translational processing targets functionally diverse proteins in Mycoplasma hyopneumoniae.

    Science.gov (United States)

    Tacchi, Jessica L; Raymond, Benjamin B A; Haynes, Paul A; Berry, Iain J; Widjaja, Michael; Bogema, Daniel R; Woolley, Lauren K; Jenkins, Cheryl; Minion, F Chris; Padula, Matthew P; Djordjevic, Steven P

    2016-02-01

    Mycoplasma hyopneumoniae is a genome-reduced, cell wall-less, bacterial pathogen with a predicted coding capacity of less than 700 proteins and is one of the smallest self-replicating pathogens. The cell surface of M. hyopneumoniae is extensively modified by processing events that target the P97 and P102 adhesin families. Here, we present analyses of the proteome of M. hyopneumoniae-type strain J using protein-centric approaches (one- and two-dimensional GeLC-MS/MS) that enabled us to focus on global processing events in this species. While these approaches only identified 52% of the predicted proteome (347 proteins), our analyses identified 35 surface-associated proteins with widely divergent functions that were targets of unusual endoproteolytic processing events, including cell adhesins, lipoproteins and proteins with canonical functions in the cytosol that moonlight on the cell surface. Affinity chromatography assays that separately used heparin, fibronectin, actin and host epithelial cell surface proteins as bait recovered cleavage products derived from these processed proteins, suggesting these fragments interact directly with the bait proteins and display previously unrecognized adhesive functions. We hypothesize that protein processing is underestimated as a post-translational modification in genome-reduced bacteria and prokaryotes more broadly, and represents an important mechanism for creating cell surface protein diversity. © 2016 The Authors.

  11. Role of AAA(+)-proteins in peroxisome biogenesis and function.

    Science.gov (United States)

    Grimm, Immanuel; Erdmann, Ralf; Girzalsky, Wolfgang

    2016-05-01

    Mutations in the PEX1 gene, which encodes a protein required for peroxisome biogenesis, are the most common cause of the Zellweger spectrum diseases. The recognition that Pex1p shares a conserved ATP-binding domain with p97 and NSF led to the discovery of the extended family of AAA+-type ATPases. So far, four AAA+-type ATPases are related to peroxisome function. Pex6p functions together with Pex1p in peroxisome biogenesis, ATAD1/Msp1p plays a role in membrane protein targeting and a member of the Lon-family of proteases is associated with peroxisomal quality control. This review summarizes the current knowledge on the AAA+-proteins involved in peroxisome biogenesis and function.

  12. Proteins with Novel Structure, Function and Dynamics

    Science.gov (United States)

    Pohorille, Andrew

    2014-01-01

    Recently, a small enzyme that ligates two RNA fragments with the rate of 10(exp 6) above background was evolved in vitro (Seelig and Szostak, Nature 448:828-831, 2007). This enzyme does not resemble any contemporary protein (Chao et al., Nature Chem. Biol. 9:81-83, 2013). It consists of a dynamic, catalytic loop, a small, rigid core containing two zinc ions coordinated by neighboring amino acids, and two highly flexible tails that might be unimportant for protein function. In contrast to other proteins, this enzyme does not contain ordered secondary structure elements, such as alpha-helix or beta-sheet. The loop is kept together by just two interactions of a charged residue and a histidine with a zinc ion, which they coordinate on the opposite side of the loop. Such structure appears to be very fragile. Surprisingly, computer simulations indicate otherwise. As the coordinating, charged residue is mutated to alanine, another, nearby charged residue takes its place, thus keeping the structure nearly intact. If this residue is also substituted by alanine a salt bridge involving two other, charged residues on the opposite sides of the loop keeps the loop in place. These adjustments are facilitated by high flexibility of the protein. Computational predictions have been confirmed experimentally, as both mutants retain full activity and overall structure. These results challenge our notions about what is required for protein activity and about the relationship between protein dynamics, stability and robustness. We hypothesize that small, highly dynamic proteins could be both active and fault tolerant in ways that many other proteins are not, i.e. they can adjust to retain their structure and activity even if subjected to mutations in structurally critical regions. This opens the doors for designing proteins with novel functions, structures and dynamics that have not been yet considered.

  13. Semantic integration to identify overlapping functional modules in protein interaction networks

    Directory of Open Access Journals (Sweden)

    Ramanathan Murali

    2007-07-01

    Full Text Available Abstract Background The systematic analysis of protein-protein interactions can enable a better understanding of cellular organization, processes and functions. Functional modules can be identified from the protein interaction networks derived from experimental data sets. However, these analyses are challenging because of the presence of unreliable interactions and the complex connectivity of the network. The integration of protein-protein interactions with the data from other sources can be leveraged for improving the effectiveness of functional module detection algorithms. Results We have developed novel metrics, called semantic similarity and semantic interactivity, which use Gene Ontology (GO annotations to measure the reliability of protein-protein interactions. The protein interaction networks can be converted into a weighted graph representation by assigning the reliability values to each interaction as a weight. We presented a flow-based modularization algorithm to efficiently identify overlapping modules in the weighted interaction networks. The experimental results show that the semantic similarity and semantic interactivity of interacting pairs were positively correlated with functional co-occurrence. The effectiveness of the algorithm for identifying modules was evaluated using functional categories from the MIPS database. We demonstrated that our algorithm had higher accuracy compared to other competing approaches. Conclusion The integration of protein interaction networks with GO annotation data and the capability of detecting overlapping modules substantially improve the accuracy of module identification.

  14. A novel linkage map of sugarcane with evidence for clustering of retrotransposon-based markers

    Directory of Open Access Journals (Sweden)

    Palhares Alessandra C

    2012-06-01

    Full Text Available Abstract Background The development of sugarcane as a sustainable crop has unlimited applications. The crop is one of the most economically viable for renewable energy production, and CO2 balance. Linkage maps are valuable tools for understanding genetic and genomic organization, particularly in sugarcane due to its complex polyploid genome of multispecific origins. The overall objective of our study was to construct a novel sugarcane linkage map, compiling AFLP and EST-SSR markers, and to generate data on the distribution of markers anchored to sequences of scIvana_1, a complete sugarcane transposable element, and member of the Copia superfamily. Results The mapping population parents (‘IAC66-6’ and ‘TUC71-7’ contributed equally to polymorphisms, independent of marker type, and generated markers that were distributed into nearly the same number of co-segregation groups (or CGs. Bi-parentally inherited alleles provided the integration of 19 CGs. The marker number per CG ranged from two to 39. The total map length was 4,843.19 cM, with a marker density of 8.87 cM. Markers were assembled into 92 CGs that ranged in length from 1.14 to 404.72 cM, with an estimated average length of 52.64 cM. The greatest distance between two adjacent markers was 48.25 cM. The scIvana_1-based markers (56 were positioned on 21 CGs, but were not regularly distributed. Interestingly, the distance between adjacent scIvana_1-based markers was less than 5 cM, and was observed on five CGs, suggesting a clustered organization. Conclusions Results indicated the use of a NBS-profiling technique was efficient to develop retrotransposon-based markers in sugarcane. The simultaneous maximum-likelihood estimates of linkage and linkage phase based strategies confirmed the suitability of its approach to estimate linkage, and construct the linkage map. Interestingly, using our genetic data it was possible to calculate the number of retrotransposon scIvana_1 (~60

  15. JAFA: a protein function annotation meta-server

    DEFF Research Database (Denmark)

    Friedberg, Iddo; Harder, Tim; Godzik, Adam

    2006-01-01

    Annotations, or JAFA server. JAFA queries several function prediction servers with a protein sequence and assembles the returned predictions in a legible, non-redundant format. In this manner, JAFA combines the predictions of several servers to provide a comprehensive view of what are the predicted functions...

  16. Sub-grouping and sub-functionalization of the RIFIN multi-copy protein family

    Directory of Open Access Journals (Sweden)

    Sonnhammer Erik L

    2008-01-01

    Full Text Available Abstract Background Parasitic protozoans possess many multicopy gene families which have central roles in parasite survival and virulence. The number and variability of members of these gene families often make it difficult to predict possible functions of the encoded proteins. The families of extra-cellular proteins that are exposed to a host immune response have been driven via immune selection to become antigenically variant, and thereby avoid immune recognition while maintaining protein function to establish a chronic infection. Results We have combined phylogenetic and function shift analyses to study the evolution of the RIFIN proteins, which are antigenically variant and are encoded by the largest multicopy gene family in Plasmodium falciparum. We show that this family can be subdivided into two major groups that we named A- and B-RIFIN proteins. This suggested sub-grouping is supported by a recently published study that showed that, despite the presence of the Plasmodium export (PEXEL motif in all RIFIN variants, proteins from each group have different cellular localizations during the intraerythrocytic life cycle of the parasite. In the present study we show that function shift analysis, a novel technique to predict functional divergence between sub-groups of a protein family, indicates that RIFINs have undergone neo- or sub-functionalization. Conclusion These results question the general trend of clustering large antigenically variant protein groups into homogenous families. Assigning functions to protein families requires their subdivision into meaningful groups such as we have shown for the RIFIN protein family. Using phylogenetic and function shift analysis methods, we identify new directions for the investigation of this broad and complex group of proteins.

  17. Designing sequence to control protein function in an EF-hand protein.

    Science.gov (United States)

    Bunick, Christopher G; Nelson, Melanie R; Mangahas, Sheryll; Hunter, Michael J; Sheehan, Jonathan H; Mizoue, Laura S; Bunick, Gerard J; Chazin, Walter J

    2004-05-19

    The extent of conformational change that calcium binding induces in EF-hand proteins is a key biochemical property specifying Ca(2+) sensor versus signal modulator function. To understand how differences in amino acid sequence lead to differences in the response to Ca(2+) binding, comparative analyses of sequence and structures, combined with model building, were used to develop hypotheses about which amino acid residues control Ca(2+)-induced conformational changes. These results were used to generate a first design of calbindomodulin (CBM-1), a calbindin D(9k) re-engineered with 15 mutations to respond to Ca(2+) binding with a conformational change similar to that of calmodulin. The gene for CBM-1 was synthesized, and the protein was expressed and purified. Remarkably, this protein did not exhibit any non-native-like molten globule properties despite the large number of mutations and the nonconservative nature of some of them. Ca(2+)-induced changes in CD intensity and in the binding of the hydrophobic probe, ANS, implied that CBM-1 does undergo Ca(2+) sensorlike conformational changes. The X-ray crystal structure of Ca(2+)-CBM-1 determined at 1.44 A resolution reveals the anticipated increase in hydrophobic surface area relative to the wild-type protein. A nascent calmodulin-like hydrophobic docking surface was also found, though it is occluded by the inter-EF-hand loop. The results from this first calbindomodulin design are discussed in terms of progress toward understanding the relationships between amino acid sequence, protein structure, and protein function for EF-hand CaBPs, as well as the additional mutations for the next CBM design.

  18. Structural and Function Prediction of Musa acuminata subsp. Malaccensis Protein

    Directory of Open Access Journals (Sweden)

    Anum Munir

    2016-03-01

    Full Text Available Hypothetical proteins (HPs are the proteins whose presence has been anticipated, yet in vivo function has not been built up. Illustrating the structural and functional privileged insights of these HPs might likewise prompt a superior comprehension of the protein-protein associations or networks in diverse types of life. Bananas (Musa acuminata spp., including sweet and cooking types, are giant perennial monocotyledonous herbs of the order Zingiberales, a sister grouped to the all-around considered Poales, which incorporate oats. Bananas are crucial for nourishment security in numerous tropical and subtropical nations and the most prominent organic product in industrialized nations. In the present study, the hypothetical protein of M. acuminata (Banana was chosen for analysis and modeling by distinctive bioinformatics apparatuses and databases. As indicated by primary and secondary structure analysis, XP_009393594.1 is a stable hydrophobic protein containing a noteworthy extent of α-helices; Homology modeling was done utilizing SWISS-MODEL server where the templates identity with XP_009393594.1 protein was less which demonstrated novelty of our protein. Ab initio strategy was conducted to produce its 3D structure. A few evaluations of quality assessment and validation parameters determined the generated protein model as stable with genuinely great quality. Functional analysis was completed by ProtFun 2.2, and KEGG (KAAS, recommended that the hypothetical protein is a transcription factor with cytoplasmic domain as zinc finger. The protein was observed to be vital for translation process, involved in metabolism, signaling and cellular processes, genetic information processing and Zinc ion binding. It is suggested that further test approval would help to anticipate the structures and functions of other uncharacterized proteins of different plants and living being.

  19. Functional characterization of Arabidopsis thaliana transthyretin-like protein.

    Science.gov (United States)

    Pessoa, João; Sárkány, Zsuzsa; Ferreira-da-Silva, Frederico; Martins, Sónia; Almeida, Maria R; Li, Jianming; Damas, Ana M

    2010-02-18

    Arabidopsis thaliana transthyretin-like (TTL) protein is a potential substrate in the brassinosteroid signalling cascade, having a role that moderates plant growth. Moreover, sequence homology revealed two sequence domains similar to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU) decarboxylase (N-terminal domain) and 5-hydroxyisourate (5-HIU) hydrolase (C-terminal domain). TTL is a member of the transthyretin-related protein family (TRP), which comprises a number of proteins with sequence homology to transthyretin (TTR) and the characteristic C-terminal sequence motif Tyr-Arg-Gly-Ser. TRPs are single domain proteins that form tetrameric structures with 5-HIU hydrolase activity. Experimental evidence is fundamental for knowing if TTL is a tetrameric protein, formed by the association of the 5-HIU hydrolase domains and, in this case, if the structural arrangement allows for OHCU decarboxylase activity. This work reports about the biochemical and functional characterization of TTL. The TTL gene was cloned and the protein expressed and purified for biochemical and functional characterization. The results show that TTL is composed of four subunits, with a moderately elongated shape. We also found evidence for 5-HIU hydrolase and OHCU decarboxylase activities in vitro, in the full-length protein. The Arabidopsis thaliana transthyretin-like (TTL) protein is a tetrameric bifunctional enzyme, since it has 5-HIU hydrolase and OHCU decarboxylase activities, which were simultaneously observed in vitro.

  20. Direct Capture of Functional Proteins from Mammalian Plasma Membranes into Nanodiscs.

    Science.gov (United States)

    Roy, Jahnabi; Pondenis, Holly; Fan, Timothy M; Das, Aditi

    2015-10-20

    Mammalian plasma membrane proteins make up the largest class of drug targets yet are difficult to study in a cell free system because of their intransigent nature. Herein, we perform direct encapsulation of plasma membrane proteins derived from mammalian cells into a functional nanodisc library. Peptide fingerprinting was used to analyze the proteome of the incorporated proteins in nanodiscs and to further demonstrate that the lipid composition of the nanodiscs directly affects the class of protein that is incorporated. Furthermore, the functionality of the incorporated membrane proteome was evaluated by measuring the activity of membrane proteins: Na(+)/K(+)-ATPase and receptor tyrosine kinases. This work is the first report of the successful establishment and characterization of a cell free functional library of mammalian membrane proteins into nanodiscs.

  1. Functional classification of protein structures by local structure matching in graph representation.

    Science.gov (United States)

    Mills, Caitlyn L; Garg, Rohan; Lee, Joslynn S; Tian, Liang; Suciu, Alexandru; Cooperman, Gene; Beuning, Penny J; Ondrechen, Mary Jo

    2018-03-31

    As a result of high-throughput protein structure initiatives, over 14,400 protein structures have been solved by structural genomics (SG) centers and participating research groups. While the totality of SG data represents a tremendous contribution to genomics and structural biology, reliable functional information for these proteins is generally lacking. Better functional predictions for SG proteins will add substantial value to the structural information already obtained. Our method described herein, Graph Representation of Active Sites for Prediction of Function (GRASP-Func), predicts quickly and accurately the biochemical function of proteins by representing residues at the predicted local active site as graphs rather than in Cartesian coordinates. We compare the GRASP-Func method to our previously reported method, structurally aligned local sites of activity (SALSA), using the ribulose phosphate binding barrel (RPBB), 6-hairpin glycosidase (6-HG), and Concanavalin A-like Lectins/Glucanase (CAL/G) superfamilies as test cases. In each of the superfamilies, SALSA and the much faster method GRASP-Func yield similar correct classification of previously characterized proteins, providing a validated benchmark for the new method. In addition, we analyzed SG proteins using our SALSA and GRASP-Func methods to predict function. Forty-one SG proteins in the RPBB superfamily, nine SG proteins in the 6-HG superfamily, and one SG protein in the CAL/G superfamily were successfully classified into one of the functional families in their respective superfamily by both methods. This improved, faster, validated computational method can yield more reliable predictions of function that can be used for a wide variety of applications by the community. © 2018 The Authors Protein Science published by Wiley Periodicals, Inc. on behalf of The Protein Society.

  2. SM30 protein function during sea urchin larval spicule formation.

    Science.gov (United States)

    Wilt, Fred; Killian, Christopher E; Croker, Lindsay; Hamilton, Patricia

    2013-08-01

    A central issue in better understanding the process of biomineralization is to elucidate the function of occluded matrix proteins present in mineralized tissues. A potent approach to addressing this issue utilizes specific inhibitors of expression of known genes. Application of antisense oligonucleotides that specifically suppress translation of a given mRNA are capable of causing aberrant biomineralization, thereby revealing, at least in part, a likely function of the protein and gene under investigation. We have applied this approach to study the possible function(s) of the SM30 family of proteins, which are found in spicules, teeth, spines, and tests of Strongylocentrotus purpuratus as well as other euechinoid sea urchins. It is possible using the anti-SM30 morpholino-oligonucleotides (MO's) to reduce the level of these proteins to very low levels, yet the development of skeletal spicules in the embryo shows little or no aberration. This surprising result requires re-thinking about the role of these, and possibly other occluded matrix proteins. Copyright © 2013 Elsevier Inc. All rights reserved.

  3. MM-ISMSA: An Ultrafast and Accurate Scoring Function for Protein-Protein Docking.

    Science.gov (United States)

    Klett, Javier; Núñez-Salgado, Alfonso; Dos Santos, Helena G; Cortés-Cabrera, Álvaro; Perona, Almudena; Gil-Redondo, Rubén; Abia, David; Gago, Federico; Morreale, Antonio

    2012-09-11

    An ultrafast and accurate scoring function for protein-protein docking is presented. It includes (1) a molecular mechanics (MM) part based on a 12-6 Lennard-Jones potential; (2) an electrostatic component based on an implicit solvent model (ISM) with individual desolvation penalties for each partner in the protein-protein complex plus a hydrogen bonding term; and (3) a surface area (SA) contribution to account for the loss of water contacts upon protein-protein complex formation. The accuracy and performance of the scoring function, termed MM-ISMSA, have been assessed by (1) comparing the total binding energies, the electrostatic term, and its components (charge-charge and individual desolvation energies), as well as the per residue contributions, to results obtained with well-established methods such as APBSA or MM-PB(GB)SA for a set of 1242 decoy protein-protein complexes and (2) testing its ability to recognize the docking solution closest to the experimental structure as that providing the most favorable total binding energy. For this purpose, a test set consisting of 15 protein-protein complexes with known 3D structure mixed with 10 decoys for each complex was used. The correlation between the values afforded by MM-ISMSA and those from the other methods is quite remarkable (r(2) ∼ 0.9), and only 0.2-5.0 s (depending on the number of residues) are spent on a single calculation including an all vs all pairwise energy decomposition. On the other hand, MM-ISMSA correctly identifies the best docking solution as that closest to the experimental structure in 80% of the cases. Finally, MM-ISMSA can process molecular dynamics trajectories and reports the results as averaged values with their standard deviations. MM-ISMSA has been implemented as a plugin to the widely used molecular graphics program PyMOL, although it can also be executed in command-line mode. MM-ISMSA is distributed free of charge to nonprofit organizations.

  4. Non-coding RNAs enter mitosis: functions, conservation and implications

    Directory of Open Access Journals (Sweden)

    Kai Toshie

    2011-02-01

    Full Text Available Abstract Nuage (or commonly known as chromatoid body in mammals is a conserved germline-specific organelle that has been linked to the Piwi-interacting RNA (piRNA pathway. piRNAs are a class of gonadal-specific RNAs that are ~23-29 nucleotides in length and protect genome stability by repressing the expression of deleterious retrotransposons. More recent studies in Drosophila have implicated the piRNA pathway in other functions including canalization of embryonic development, regulation of maternal gene expression and telomere protection. We have recently shown that Vasa (known as Mouse Vasa Homolog in mouse, a nuage component, plays a mitotic role in promoting chromosome condensation and segregation by facilitating robust chromosomal localization of condensin I in the Drosophila germline. Vasa functions together with Aubergine (a PIWI family protein and Spindle-E/mouse TDRD-9, two other nuage components that are involved in the piRNA pathway, therefore providing a link between the piRNA pathway and mitotic chromosome condensation. Here, we propose and discuss possible models for the role of Vasa and the piRNA pathway during mitosis. We also highlight relevant studies implicating mitotic roles for RNAs and/or nuage in other model systems and their implications for cancer development.

  5. Phospholipid liposomes functionalized by protein

    Science.gov (United States)

    Glukhova, O. E.; Savostyanov, G. V.; Grishina, O. A.

    2015-03-01

    Finding new ways to deliver neurotrophic drugs to the brain in newborns is one of the contemporary problems of medicine and pharmaceutical industry. Modern researches in this field indicate the promising prospects of supramolecular transport systems for targeted drug delivery to the brain which can overcome the blood-brain barrier (BBB). Thus, the solution of this problem is actual not only for medicine, but also for society as a whole because it determines the health of future generations. Phospholipid liposomes due to combination of lipo- and hydrophilic properties are considered as the main future objects in medicine for drug delivery through the BBB as well as increasing their bioavailability and toxicity. Liposomes functionalized by various proteins were used as transport systems for ease of liposomes use. Designing of modification oligosaccharide of liposomes surface is promising in the last decade because it enables the delivery of liposomes to specific receptor of human cells by selecting ligand and it is widely used in pharmacology for the treatment of several diseases. The purpose of this work is creation of a coarse-grained model of bilayer of phospholipid liposomes, functionalized by specific to the structural elements of the BBB proteins, as well as prediction of the most favorable orientation and position of the molecules in the generated complex by methods of molecular docking for the formation of the structure. Investigation of activity of the ligand molecule to protein receptor of human cells by the methods of molecular dynamics was carried out.

  6. Protein domain recurrence and order can enhance prediction of protein functions

    KAUST Repository

    Abdel Messih, Mario A.; Chitale, Meghana; Bajic, Vladimir B.; Kihara, Daisuke; Gao, Xin

    2012-01-01

    Motivation: Burgeoning sequencing technologies have generated massive amounts of genomic and proteomic data. Annotating the functions of proteins identified in this data has become a big and crucial problem. Various computational methods have been

  7. Prediction of human protein function from post-translational modifications and localization features

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Gupta, Ramneek; Blom, Nikolaj

    2002-01-01

    a number of functional attributes that are more directly related to the linear sequence of amino acids, and hence easier to predict, than protein structure. These attributes include features associated with post-translational modifications and protein sorting, but also much simpler aspects......We have developed an entirely sequence-based method that identifies and integrates relevant features that can be used to assign proteins of unknown function to functional classes, and enzyme categories for enzymes. We show that strategies for the elucidation of protein function may benefit from...

  8. Jatropha seed protein functional properties for technical applications

    NARCIS (Netherlands)

    Lestari, D.; Mulder, W.J.; Sanders, J.P.M.

    2011-01-01

    Jatropha press cake, by-product after oil expression from Jatropha seeds, contains 24–28% protein on dry basis. Objectives of this research were to investigate functional properties, such as solubility, emulsifying, foaming, film forming, and adhesive properties, of Jatropha press cake proteins and

  9. Stoichiometric balance of protein copy numbers is measurable and functionally significant in a protein-protein interaction network for yeast endocytosis.

    Science.gov (United States)

    Holland, David O; Johnson, Margaret E

    2018-03-01

    Stoichiometric balance, or dosage balance, implies that proteins that are subunits of obligate complexes (e.g. the ribosome) should have copy numbers expressed to match their stoichiometry in that complex. Establishing balance (or imbalance) is an important tool for inferring subunit function and assembly bottlenecks. We show here that these correlations in protein copy numbers can extend beyond complex subunits to larger protein-protein interactions networks (PPIN) involving a range of reversible binding interactions. We develop a simple method for quantifying balance in any interface-resolved PPINs based on network structure and experimentally observed protein copy numbers. By analyzing such a network for the clathrin-mediated endocytosis (CME) system in yeast, we found that the real protein copy numbers were significantly more balanced in relation to their binding partners compared to randomly sampled sets of yeast copy numbers. The observed balance is not perfect, highlighting both under and overexpressed proteins. We evaluate the potential cost and benefits of imbalance using two criteria. First, a potential cost to imbalance is that 'leftover' proteins without remaining functional partners are free to misinteract. We systematically quantify how this misinteraction cost is most dangerous for strong-binding protein interactions and for network topologies observed in biological PPINs. Second, a more direct consequence of imbalance is that the formation of specific functional complexes depends on relative copy numbers. We therefore construct simple kinetic models of two sub-networks in the CME network to assess multi-protein assembly of the ARP2/3 complex and a minimal, nine-protein clathrin-coated vesicle forming module. We find that the observed, imperfectly balanced copy numbers are less effective than balanced copy numbers in producing fast and complete multi-protein assemblies. However, we speculate that strategic imbalance in the vesicle forming module

  10. Analysis of hepatocellular carcinoma and metastatic hepatic carcinoma via functional modules in a protein-protein interaction network

    Directory of Open Access Journals (Sweden)

    Jun Pan

    2014-01-01

    Full Text Available Introduction: This study aims to identify protein clusters with potential functional relevance in the pathogenesis of hepatocellular carcinoma (HCC and metastatic hepatic carcinoma using network analysis. Materials and Methods: We used human protein interaction data to build a protein-protein interaction network with Cytoscape and then derived functional clusters using MCODE. Combining the gene expression profiles, we calculated the functional scores for the clusters and selected statistically significant clusters. Meanwhile, Gene Ontology was used to assess the functionality of these clusters. Finally, a support vector machine was trained on the gold standard data sets. Results: The differentially expressed genes of HCC were mainly involved in metabolic and signaling processes. We acquired 13 significant modules from the gene expression profiles. The area under the curve value based on the differentially expressed modules were 98.31%, which outweighed the classification with DEGs. Conclusions: Differentially expressed modules are valuable to screen biomarkers combined with functional modules.

  11. RACK1, A Multifaceted Scaffolding Protein: Structure and Function

    LENUS (Irish Health Repository)

    Adams, David R

    2011-10-06

    Abstract The Receptor for Activated C Kinase 1 (RACK1) is a member of the tryptophan-aspartate repeat (WD-repeat) family of proteins and shares significant homology to the β subunit of G-proteins (Gβ). RACK1 adopts a seven-bladed β-propeller structure which facilitates protein binding. RACK1 has a significant role to play in shuttling proteins around the cell, anchoring proteins at particular locations and in stabilising protein activity. It interacts with the ribosomal machinery, with several cell surface receptors and with proteins in the nucleus. As a result, RACK1 is a key mediator of various pathways and contributes to numerous aspects of cellular function. Here, we discuss RACK1 gene and structure and its role in specific signaling pathways, and address how posttranslational modifications facilitate subcellular location and translocation of RACK1. This review condenses several recent studies suggesting a role for RACK1 in physiological processes such as development, cell migration, central nervous system (CN) function and circadian rhythm as well as reviewing the role of RACK1 in disease.

  12. Functionality of extrusion--texturized whey proteins.

    Science.gov (United States)

    Onwulata, C I; Konstance, R P; Cooke, P H; Farrell, H M

    2003-11-01

    Whey, a byproduct of the cheesemaking process, is concentrated by processors to make whey protein concentrates (WPC) and isolates (WPI). Only 50% of whey proteins are used in foods. In order to increase their usage, texturizing WPC, WPI, and whey albumin is proposed to create ingredients with new functionality. Extrusion processing texturizes globular proteins by shearing and stretching them into aligned or entangled fibrous bundles. In this study, WPC, WPI, and whey albumin were extruded in a twin screw extruder at approximately 38% moisture content (15.2 ml/min, feed rate 25 g/min) and, at different extrusion cook temperatures, at the same temperature for the last four zones before the die (35, 50, 75, and 100 degrees C, respectively). Protein solubility, gelation, foaming, and digestibility were determined in extrudates. Degree of extrusion-induced insolubility (denaturation) or texturization, determined by lack of solubility at pH 7 for WPI, increased from 30 to 60, 85, and 95% for the four temperature conditions 35, 50, 75, and 100 degrees C, respectively. Gel strength of extruded isolates increased initially 115% (35 degrees C) and 145% (50 degrees C), but gel strength was lost at 75 and 100 degrees C. Denaturation at these melt temperatures had minimal effect on foaming and digestibility. Varying extrusion cook temperature allowed a new controlled rate of denaturation, indicating that a texturized ingredient with a predetermined functionality based on degree of denaturation can be created.

  13. ROLE OF TYROSINE-SULFATED PROTEINS IN RETINAL STRUCTURE AND FUNCTION

    Science.gov (United States)

    Kanan, Y.; Al-Ubaidi, M.R.

    2014-01-01

    The extracellular matrix (ECM) plays a significant role in cellular and retinal health. The study of retinal tyrosine-sulfated proteins is an important first step toward understanding the role of ECM in retinal health and diseases. These secreted proteins are members of the retinal ECM. Tyrosine sulfation was shown to be necessary for the development of proper retinal structure and function. The importance of tyrosine sulfation is further demonstrated by the evolutionary presence of tyrosylprotein sulfotransferases, enzymes that catalyze proteins’ tyrosine sulfation, and the compensatory abilities of these enzymes. Research has identified four tyrosine-sulfated retinal proteins: fibulin 2, vitronectin, complement factor H (CFH), and opticin. Vitronectin and CFH regulate the activation of the complement system and are involved in the etiology of some cases of age-related macular degeneration. Analysis of the role of tyrosine sulfation in fibulin function showed that sulfation influences the protein's ability to regulate growth and migration. Although opticin was recently shown to exhibit anti-angiogenic properties, it is not yet determined what role sulfation plays in that function. Future studies focusing on identifying all of the tyrosine-sulfated retinal proteins would be instrumental in determining the impact of sulfation on retinal protein function in retinal homeostasis and diseases. PMID:25819460

  14. Structure and function of homodomain-leucine zipper (HD-Zip) proteins.

    Science.gov (United States)

    Elhiti, Mohamed; Stasolla, Claudio

    2009-02-01

    Homeodomain-leucine zipper (HD-Zip) proteins are transcription factors unique to plants and are encoded by more than 25 genes in Arabidopsis thaliana. Based on sequence analyses these proteins have been classified into four distinct groups: HD-Zip I-IV. HD-Zip proteins are characterized by the presence of two functional domains; a homeodomain (HD) responsible for DNA binding and a leucine zipper domain (Zip) located immediately C-terminal to the homeodomain and involved in protein-protein interaction. Despite sequence similarities HD-ZIP proteins participate in a variety of processes during plant growth and development. HD-Zip I proteins are generally involved in responses related to abiotic stress, abscisic acid (ABA), blue light, de-etiolation and embryogenesis. HD-Zip II proteins participate in light response, shade avoidance and auxin signalling. Members of the third group (HD-Zip III) control embryogenesis, leaf polarity, lateral organ initiation and meristem function. HD-Zip IV proteins play significant roles during anthocyanin accumulation, differentiation of epidermal cells, trichome formation and root development.

  15. Functional characterization of Arabidopsis thaliana transthyretin-like protein

    Directory of Open Access Journals (Sweden)

    Almeida Maria R

    2010-02-01

    Full Text Available Abstract Background Arabidopsis thaliana transthyretin-like (TTL protein is a potential substrate in the brassinosteroid signalling cascade, having a role that moderates plant growth. Moreover, sequence homology revealed two sequence domains similar to 2-oxo-4-hydroxy-4-carboxy-5-ureidoimidazoline (OHCU decarboxylase (N-terminal domain and 5-hydroxyisourate (5-HIU hydrolase (C-terminal domain. TTL is a member of the transthyretin-related protein family (TRP, which comprises a number of proteins with sequence homology to transthyretin (TTR and the characteristic C-terminal sequence motif Tyr-Arg-Gly-Ser. TRPs are single domain proteins that form tetrameric structures with 5-HIU hydrolase activity. Experimental evidence is fundamental for knowing if TTL is a tetrameric protein, formed by the association of the 5-HIU hydrolase domains and, in this case, if the structural arrangement allows for OHCU decarboxylase activity. This work reports about the biochemical and functional characterization of TTL. Results The TTL gene was cloned and the protein expressed and purified for biochemical and functional characterization. The results show that TTL is composed of four subunits, with a moderately elongated shape. We also found evidence for 5-HIU hydrolase and OHCU decarboxylase activities in vitro, in the full-length protein. Conclusions The Arabidopsis thaliana transthyretin-like (TTL protein is a tetrameric bifunctional enzyme, since it has 5-HIU hydrolase and OHCU decarboxylase activities, which were simultaneously observed in vitro.

  16. Liver Function Status in some Nigerian Children with Protein Energy ...

    African Journals Online (AJOL)

    Objective: To ascertain functional status of the liver in Nigeria Children with Protein energy malnutrition. Materials and Methods: Liver function tests were performed on a total of 88 children with protein energy malnutrition (PEM). These were compared with 22 apparently well-nourished children who served as controls.

  17. Functionalization of protein-based nanocages for drug delivery applications.

    Science.gov (United States)

    Schoonen, Lise; van Hest, Jan C M

    2014-07-07

    Traditional drug delivery strategies involve drugs which are not targeted towards the desired tissue. This can lead to undesired side effects, as normal cells are affected by the drugs as well. Therefore, new systems are now being developed which combine targeting functionalities with encapsulation of drug cargo. Protein nanocages are highly promising drug delivery platforms due to their perfectly defined structures, biocompatibility, biodegradability and low toxicity. A variety of protein nanocages have been modified and functionalized for these types of applications. In this review, we aim to give an overview of different types of modifications of protein-based nanocontainers for drug delivery applications.

  18. Functional structural motifs for protein-ligand, protein-protein, and protein-nucleic acid interactions and their connection to supersecondary structures.

    Science.gov (United States)

    Kinjo, Akira R; Nakamura, Haruki

    2013-01-01

    Protein functions are mediated by interactions between proteins and other molecules. One useful approach to analyze protein functions is to compare and classify the structures of interaction interfaces of proteins. Here, we describe the procedures for compiling a database of interface structures and efficiently comparing the interface structures. To do so requires a good understanding of the data structures of the Protein Data Bank (PDB). Therefore, we also provide a detailed account of the PDB exchange dictionary necessary for extracting data that are relevant for analyzing interaction interfaces and secondary structures. We identify recurring structural motifs by classifying similar interface structures, and we define a coarse-grained representation of supersecondary structures (SSS) which represents a sequence of two or three secondary structure elements including their relative orientations as a string of four to seven letters. By examining the correspondence between structural motifs and SSS strings, we show that no SSS string has particularly high propensity to be found interaction interfaces in general, indicating any SSS can be used as a binding interface. When individual structural motifs are examined, there are some SSS strings that have high propensity for particular groups of structural motifs. In addition, it is shown that while the SSS strings found in particular structural motifs for nonpolymer and protein interfaces are as abundant as in other structural motifs that belong to the same subunit, structural motifs for nucleic acid interfaces exhibit somewhat stronger preference for SSS strings. In regard to protein folds, many motif-specific SSS strings were found across many folds, suggesting that SSS may be a useful description to investigate the universality of ligand binding modes.

  19. Functional equivalency inferred from "authoritative sources" in networks of homologous proteins.

    Science.gov (United States)

    Natarajan, Shreedhar; Jakobsson, Eric

    2009-06-12

    A one-on-one mapping of protein functionality across different species is a critical component of comparative analysis. This paper presents a heuristic algorithm for discovering the Most Likely Functional Counterparts (MoLFunCs) of a protein, based on simple concepts from network theory. A key feature of our algorithm is utilization of the user's knowledge to assign high confidence to selected functional identification. We show use of the algorithm to retrieve functional equivalents for 7 membrane proteins, from an exploration of almost 40 genomes form multiple online resources. We verify the functional equivalency of our dataset through a series of tests that include sequence, structure and function comparisons. Comparison is made to the OMA methodology, which also identifies one-on-one mapping between proteins from different species. Based on that comparison, we believe that incorporation of user's knowledge as a key aspect of the technique adds value to purely statistical formal methods.

  20. Density functional study of molecular interactions in secondary structures of proteins.

    Science.gov (United States)

    Takano, Yu; Kusaka, Ayumi; Nakamura, Haruki

    2016-01-01

    Proteins play diverse and vital roles in biology, which are dominated by their three-dimensional structures. The three-dimensional structure of a protein determines its functions and chemical properties. Protein secondary structures, including α-helices and β-sheets, are key components of the protein architecture. Molecular interactions, in particular hydrogen bonds, play significant roles in the formation of protein secondary structures. Precise and quantitative estimations of these interactions are required to understand the principles underlying the formation of three-dimensional protein structures. In the present study, we have investigated the molecular interactions in α-helices and β-sheets, using ab initio wave function-based methods, the Hartree-Fock method (HF) and the second-order Møller-Plesset perturbation theory (MP2), density functional theory, and molecular mechanics. The characteristic interactions essential for forming the secondary structures are discussed quantitatively.

  1. Lipid Bilayer Composition Affects Transmembrane Protein Orientation and Function

    Directory of Open Access Journals (Sweden)

    Katie D. Hickey

    2011-01-01

    Full Text Available Sperm membranes change in structure and composition upon ejaculation to undergo capacitation, a molecular transformation which enables spermatozoa to undergo the acrosome reaction and be capable of fertilization. Changes to the membrane environment including lipid composition, specifically lipid microdomains, may be responsible for enabling capacitation. To study the effect of lipid environment on proteins, liposomes were created using lipids extracted from bull sperm membranes, with or without a protein (Na+ K+-ATPase or -amylase. Protein incorporation, function, and orientation were determined. Fluorescence resonance energy transfer (FRET confirmed protein inclusion in the lipid bilayer, and protein function was confirmed using a colourometric assay of phosphate production from ATP cleavage. In the native lipid liposomes, ATPase was oriented with the subunit facing the outer leaflet, while changing the lipid composition to 50% native lipids and 50% exogenous lipids significantly altered this orientation of Na+ K+-ATPase within the membranes.

  2. Prediction of heterodimeric protein complexes from weighted protein-protein interaction networks using novel features and kernel functions.

    Directory of Open Access Journals (Sweden)

    Peiying Ruan

    Full Text Available Since many proteins express their functional activity by interacting with other proteins and forming protein complexes, it is very useful to identify sets of proteins that form complexes. For that purpose, many prediction methods for protein complexes from protein-protein interactions have been developed such as MCL, MCODE, RNSC, PCP, RRW, and NWE. These methods have dealt with only complexes with size of more than three because the methods often are based on some density of subgraphs. However, heterodimeric protein complexes that consist of two distinct proteins occupy a large part according to several comprehensive databases of known complexes. In this paper, we propose several feature space mappings from protein-protein interaction data, in which each interaction is weighted based on reliability. Furthermore, we make use of prior knowledge on protein domains to develop feature space mappings, domain composition kernel and its combination kernel with our proposed features. We perform ten-fold cross-validation computational experiments. These results suggest that our proposed kernel considerably outperforms the naive Bayes-based method, which is the best existing method for predicting heterodimeric protein complexes.

  3. Hypothesis: NDL proteins function in stress responses by regulating microtubule organization

    OpenAIRE

    Khatri, Nisha; Mudgil, Yashwanti

    2015-01-01

    N-MYC DOWNREGULATED-LIKE proteins (NDL), members of the alpha/beta hydrolase superfamily were recently rediscovered as interactors of G-protein signaling in Arabidopsis thaliana. Although the precise molecular function of NDL proteins is still elusive, in animals these proteins play protective role in hypoxia and expression is induced by hypoxia and nickel, indicating role in stress. Homology of NDL1 with animal counterpart N-MYC DOWNREGULATED GENE (NDRG) suggests similar functions in animals...

  4. The Rules and Functions of Nucleocytoplasmic Shuttling Proteins.

    Science.gov (United States)

    Fu, Xuekun; Liang, Chao; Li, Fangfei; Wang, Luyao; Wu, Xiaoqiu; Lu, Aiping; Xiao, Guozhi; Zhang, Ge

    2018-05-12

    Biological macromolecules are the basis of life activities. There is a separation of spatial dimension between DNA replication and RNA biogenesis, and protein synthesis, which is an interesting phenomenon. The former occurs in the cell nucleus, while the latter in the cytoplasm. The separation requires protein to transport across the nuclear envelope to realize a variety of biological functions. Nucleocytoplasmic transport of protein including import to the nucleus and export to the cytoplasm is a complicated process that requires involvement and interaction of many proteins. In recent years, many studies have found that proteins constantly shuttle between the cytoplasm and the nucleus. These shuttling proteins play a crucial role as transport carriers and signal transduction regulators within cells. In this review, we describe the mechanism of nucleocytoplasmic transport of shuttling proteins and summarize some important diseases related shuttling proteins.

  5. Integrative approaches to the prediction of protein functions based on the feature selection

    Directory of Open Access Journals (Sweden)

    Lee Hyunju

    2009-12-01

    Full Text Available Abstract Background Protein function prediction has been one of the most important issues in functional genomics. With the current availability of various genomic data sets, many researchers have attempted to develop integration models that combine all available genomic data for protein function prediction. These efforts have resulted in the improvement of prediction quality and the extension of prediction coverage. However, it has also been observed that integrating more data sources does not always increase the prediction quality. Therefore, selecting data sources that highly contribute to the protein function prediction has become an important issue. Results We present systematic feature selection methods that assess the contribution of genome-wide data sets to predict protein functions and then investigate the relationship between genomic data sources and protein functions. In this study, we use ten different genomic data sources in Mus musculus, including: protein-domains, protein-protein interactions, gene expressions, phenotype ontology, phylogenetic profiles and disease data sources to predict protein functions that are labelled with Gene Ontology (GO terms. We then apply two approaches to feature selection: exhaustive search feature selection using a kernel based logistic regression (KLR, and a kernel based L1-norm regularized logistic regression (KL1LR. In the first approach, we exhaustively measure the contribution of each data set for each function based on its prediction quality. In the second approach, we use the estimated coefficients of features as measures of contribution of data sources. Our results show that the proposed methods improve the prediction quality compared to the full integration of all data sources and other filter-based feature selection methods. We also show that contributing data sources can differ depending on the protein function. Furthermore, we observe that highly contributing data sets can be similar among

  6. Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.

    Science.gov (United States)

    Liu, Zhihai; Su, Minyi; Han, Li; Liu, Jie; Yang, Qifan; Li, Yan; Wang, Renxiao

    2017-02-21

    In structure-based drug design, scoring functions are widely used for fast evaluation of protein-ligand interactions. They are often applied in combination with molecular docking and de novo design methods. Since the early 1990s, a whole spectrum of protein-ligand interaction scoring functions have been developed. Regardless of their technical difference, scoring functions all need data sets combining protein-ligand complex structures and binding affinity data for parametrization and validation. However, data sets of this kind used to be rather limited in terms of size and quality. On the other hand, standard metrics for evaluating scoring function used to be ambiguous. Scoring functions are often tested in molecular docking or even virtual screening trials, which do not directly reflect the genuine quality of scoring functions. Collectively, these underlying obstacles have impeded the invention of more advanced scoring functions. In this Account, we describe our long-lasting efforts to overcome these obstacles, which involve two related projects. On the first project, we have created the PDBbind database. It is the first database that systematically annotates the protein-ligand complexes in the Protein Data Bank (PDB) with experimental binding data. This database has been updated annually since its first public release in 2004. The latest release (version 2016) provides binding data for 16 179 biomolecular complexes in PDB. Data sets provided by PDBbind have been applied to many computational and statistical studies on protein-ligand interaction and various subjects. In particular, it has become a major data resource for scoring function development. On the second project, we have established the Comparative Assessment of Scoring Functions (CASF) benchmark for scoring function evaluation. Our key idea is to decouple the "scoring" process from the "sampling" process, so scoring functions can be tested in a relatively pure context to reflect their quality. In our

  7. Printing Proteins as Microarrays for High-Throughput Function Determination

    Science.gov (United States)

    MacBeath, Gavin; Schreiber, Stuart L.

    2000-09-01

    Systematic efforts are currently under way to construct defined sets of cloned genes for high-throughput expression and purification of recombinant proteins. To facilitate subsequent studies of protein function, we have developed miniaturized assays that accommodate extremely low sample volumes and enable the rapid, simultaneous processing of thousands of proteins. A high-precision robot designed to manufacture complementary DNA microarrays was used to spot proteins onto chemically derivatized glass slides at extremely high spatial densities. The proteins attached covalently to the slide surface yet retained their ability to interact specifically with other proteins, or with small molecules, in solution. Three applications for protein microarrays were demonstrated: screening for protein-protein interactions, identifying the substrates of protein kinases, and identifying the protein targets of small molecules.

  8. Lipid-mediated protein functionalization of electrospun polycaprolactone fibers

    Directory of Open Access Journals (Sweden)

    C. Cohn

    2016-05-01

    Full Text Available In this study, electrospun polycaprolactone (PCL fibers are plasma-treated and chemically conjugated with cholesteryl succinyl silane (CSS. In addition to Raman spectroscopy, an immobilization study of DiO as a fluorescent probe of lipid membranes provides evidence supporting the CSS coating of plasma-treated PCL fibers. Further, anti-CD20 antibodies are used as a model protein to evaluate the potential of lipid-mediated protein immobilization as a mechanism to functionalize the CSS-PCL fiber scaffolds. Upon anti-CD20 functionalization, the CSS-PCL fiber scaffolds capture Granta-22 cells 2.4 times more than the PCL control does, although the two fiber scaffolds immobilize a comparable amount of anti-CD20. Taken together, results from the present study demonstrate that the CSS coating and CSS-mediated antibody immobilization offers an appealing strategy to functionalize electrospun synthetic polymer fibers and confer cell-specific functions on the fiber scaffolds, which can be mechanically robust but often lack biological functions.

  9. Structuring detergents for extracting and stabilizing functional membrane proteins.

    Directory of Open Access Journals (Sweden)

    Rima Matar-Merheb

    Full Text Available BACKGROUND: Membrane proteins are privileged pharmaceutical targets for which the development of structure-based drug design is challenging. One underlying reason is the fact that detergents do not stabilize membrane domains as efficiently as natural lipids in membranes, often leading to a partial to complete loss of activity/stability during protein extraction and purification and preventing crystallization in an active conformation. METHODOLOGY/PRINCIPAL FINDINGS: Anionic calix[4]arene based detergents (C4Cn, n=1-12 were designed to structure the membrane domains through hydrophobic interactions and a network of salt bridges with the basic residues found at the cytosol-membrane interface of membrane proteins. These compounds behave as surfactants, forming micelles of 5-24 nm, with the critical micellar concentration (CMC being as expected sensitive to pH ranging from 0.05 to 1.5 mM. Both by 1H NMR titration and Surface Tension titration experiments, the interaction of these molecules with the basic amino acids was confirmed. They extract membrane proteins from different origins behaving as mild detergents, leading to partial extraction in some cases. They also retain protein functionality, as shown for BmrA (Bacillus multidrug resistance ATP protein, a membrane multidrug-transporting ATPase, which is particularly sensitive to detergent extraction. These new detergents allow BmrA to bind daunorubicin with a Kd of 12 µM, a value similar to that observed after purification using dodecyl maltoside (DDM. They preserve the ATPase activity of BmrA (which resets the protein to its initial state after drug efflux much more efficiently than SDS (sodium dodecyl sulphate, FC12 (Foscholine 12 or DDM. They also maintain in a functional state the C4Cn-extracted protein upon detergent exchange with FC12. Finally, they promote 3D-crystallization of the membrane protein. CONCLUSION/SIGNIFICANCE: These compounds seem promising to extract in a functional state

  10. Functional discrimination of membrane proteins using machine learning techniques

    Directory of Open Access Journals (Sweden)

    Yabuki Yukimitsu

    2008-03-01

    Full Text Available Abstract Background Discriminating membrane proteins based on their functions is an important task in genome annotation. In this work, we have analyzed the characteristic features of amino acid residues in membrane proteins that perform major functions, such as channels/pores, electrochemical potential-driven transporters and primary active transporters. Results We observed that the residues Asp, Asn and Tyr are dominant in channels/pores whereas the composition of hydrophobic residues, Phe, Gly, Ile, Leu and Val is high in electrochemical potential-driven transporters. The composition of all the amino acids in primary active transporters lies in between other two classes of proteins. We have utilized different machine learning algorithms, such as, Bayes rule, Logistic function, Neural network, Support vector machine, Decision tree etc. for discriminating these classes of proteins. We observed that most of the algorithms have discriminated them with similar accuracy. The neural network method discriminated the channels/pores, electrochemical potential-driven transporters and active transporters with the 5-fold cross validation accuracy of 64% in a data set of 1718 membrane proteins. The application of amino acid occurrence improved the overall accuracy to 68%. In addition, we have discriminated transporters from other α-helical and β-barrel membrane proteins with the accuracy of 85% using k-nearest neighbor method. The classification of transporters and all other proteins (globular and membrane showed the accuracy of 82%. Conclusion The performance of discrimination with amino acid occurrence is better than that with amino acid composition. We suggest that this method could be effectively used to discriminate transporters from all other globular and membrane proteins, and classify them into channels/pores, electrochemical and active transporters.

  11. CHEMICAL COMPOSITION AND FUNCTIONAL PROPERTIES OF RICE PROTEIN CONCENTRATES

    Directory of Open Access Journals (Sweden)

    V. V. Kolpakova

    2015-01-01

    Full Text Available Traditionally rice and products of its processing are used to cook porridge, pilaf, lettuce, confectionery, fish, dairy and meat products. At the same time new ways of its processing with releasing of protein products for more effective using, including the use of a glutenfree diet, are developing. The task of this study was a comparative research of nutrition and biological value and functional properties of protein and protein-calcium concentrates produced from rice flour milled from white and brown rice. The traditional and special methods were used. Concentrates were isolated with enzyme preparations of xylanase and amylolytic activity with the next dissolution of protein in diluted hydrochloric acid. Concentrates differed in the content of mineral substances (calcium, zinc, iron and other elements, amino acids and functional properties. The values of the functional properties and indicators of the nutritional value of concentrates from white rice show the advisability of their using in food products, including gluten-free products prepared on the basis of the emulsion and foam systems, and concentrates from brown rice in food products prepared on the basis of using of the emulsion systems. Protein concentrates of brown rice have a low foaming capacity and there is no foam stability at all.

  12. Retrotransposons Are the Major Contributors to the Expansion of the Drosophila ananassae Muller F Element

    Directory of Open Access Journals (Sweden)

    Wilson Leung

    2017-08-01

    Full Text Available The discordance between genome size and the complexity of eukaryotes can partly be attributed to differences in repeat density. The Muller F element (∼5.2 Mb is the smallest chromosome in Drosophila melanogaster, but it is substantially larger (>18.7 Mb in D. ananassae. To identify the major contributors to the expansion of the F element and to assess their impact, we improved the genome sequence and annotated the genes in a 1.4-Mb region of the D. ananassae F element, and a 1.7-Mb region from the D element for comparison. We find that transposons (particularly LTR and LINE retrotransposons are major contributors to this expansion (78.6%, while Wolbachia sequences integrated into the D. ananassae genome are minor contributors (0.02%. Both D. melanogaster and D. ananassae F-element genes exhibit distinct characteristics compared to D-element genes (e.g., larger coding spans, larger introns, more coding exons, and lower codon bias, but these differences are exaggerated in D. ananassae. Compared to D. melanogaster, the codon bias observed in D. ananassae F-element genes can primarily be attributed to mutational biases instead of selection. The 5′ ends of F-element genes in both species are enriched in dimethylation of lysine 4 on histone 3 (H3K4me2, while the coding spans are enriched in H3K9me2. Despite differences in repeat density and gene characteristics, D. ananassae F-element genes show a similar range of expression levels compared to genes in euchromatic domains. This study improves our understanding of how transposons can affect genome size and how genes can function within highly repetitive domains.

  13. Retrotransposons Are the Major Contributors to the Expansion of the Drosophila ananassae Muller F Element

    Science.gov (United States)

    Shaffer, Christopher D.; Chen, Elizabeth J.; Quisenberry, Thomas J.; Ko, Kevin; Braverman, John M.; Giarla, Thomas C.; Mortimer, Nathan T.; Reed, Laura K.; Smith, Sheryl T.; Robic, Srebrenka; McCartha, Shannon R.; Perry, Danielle R.; Prescod, Lindsay M.; Sheppard, Zenyth A.; Saville, Ken J.; McClish, Allison; Morlock, Emily A.; Sochor, Victoria R.; Stanton, Brittney; Veysey-White, Isaac C.; Revie, Dennis; Jimenez, Luis A.; Palomino, Jennifer J.; Patao, Melissa D.; Patao, Shane M.; Himelblau, Edward T.; Campbell, Jaclyn D.; Hertz, Alexandra L.; McEvilly, Maddison F.; Wagner, Allison R.; Youngblom, James; Bedi, Baljit; Bettincourt, Jeffery; Duso, Erin; Her, Maiye; Hilton, William; House, Samantha; Karimi, Masud; Kumimoto, Kevin; Lee, Rebekah; Lopez, Darryl; Odisho, George; Prasad, Ricky; Robbins, Holly Lyn; Sandhu, Tanveer; Selfridge, Tracy; Tsukashima, Kara; Yosif, Hani; Kokan, Nighat P.; Britt, Latia; Zoellner, Alycia; Spana, Eric P.; Chlebina, Ben T.; Chong, Insun; Friedman, Harrison; Mammo, Danny A.; Ng, Chun L.; Nikam, Vinayak S.; Schwartz, Nicholas U.; Xu, Thomas Q.; Burg, Martin G.; Batten, Spencer M.; Corbeill, Lindsay M.; Enoch, Erica; Ensign, Jesse J.; Franks, Mary E.; Haiker, Breanna; Ingles, Judith A.; Kirkland, Lyndsay D.; Lorenz-Guertin, Joshua M.; Matthews, Jordan; Mittig, Cody M.; Monsma, Nicholaus; Olson, Katherine J.; Perez-Aragon, Guillermo; Ramic, Alen; Ramirez, Jordan R.; Scheiber, Christopher; Schneider, Patrick A.; Schultz, Devon E.; Simon, Matthew; Spencer, Eric; Wernette, Adam C.; Wykle, Maxine E.; Zavala-Arellano, Elizabeth; McDonald, Mitchell J.; Ostby, Kristine; Wendland, Peter; DiAngelo, Justin R.; Ceasrine, Alexis M.; Cox, Amanda H.; Docherty, James E.B.; Gingras, Robert M.; Grieb, Stephanie M.; Pavia, Michael J.; Personius, Casey L.; Polak, Grzegorz L.; Beach, Dale L.; Cerritos, Heaven L.; Horansky, Edward A.; Sharif, Karim A.; Moran, Ryan; Parrish, Susan; Bickford, Kirsten; Bland, Jennifer; Broussard, Juliana; Campbell, Kerry; Deibel, Katelynn E.; Forka, Richard; Lemke, Monika C.; Nelson, Marlee B.; O'Keeffe, Catherine; Ramey, S. Mariel; Schmidt, Luke; Villegas, Paola; Jones, Christopher J.; Christ, Stephanie L.; Mamari, Sami; Rinaldi, Adam S.; Stity, Ghazal; Hark, Amy T.; Scheuerman, Mark; Silver Key, S. Catherine; McRae, Briana D.; Haberman, Adam S.; Asinof, Sam; Carrington, Harriette; Drumm, Kelly; Embry, Terrance; McGuire, Richard; Miller-Foreman, Drew; Rosen, Stella; Safa, Nadia; Schultz, Darrin; Segal, Matt; Shevin, Yakov; Svoronos, Petros; Vuong, Tam; Skuse, Gary; Paetkau, Don W.; Bridgman, Rachael K.; Brown, Charlotte M.; Carroll, Alicia R.; Gifford, Francesca M.; Gillespie, Julie Beth; Herman, Susan E.; Holtcamp, Krystal L.; Host, Misha A.; Hussey, Gabrielle; Kramer, Danielle M.; Lawrence, Joan Q.; Martin, Madeline M.; Niemiec, Ellen N.; O'Reilly, Ashleigh P.; Pahl, Olivia A.; Quintana, Guadalupe; Rettie, Elizabeth A.S.; Richardson, Torie L.; Rodriguez, Arianne E.; Rodriguez, Mona O.; Schiraldi, Laura; Smith, Joanna J.; Sugrue, Kelsey F.; Suriano, Lindsey J.; Takach, Kaitlyn E.; Vasquez, Arielle M.; Velez, Ximena; Villafuerte, Elizabeth J.; Vives, Laura T.; Zellmer, Victoria R.; Hauke, Jeanette; Hauser, Charles R.; Barker, Karolyn; Cannon, Laurie; Parsamian, Perouza; Parsons, Samantha; Wichman, Zachariah; Bazinet, Christopher W.; Johnson, Diana E.; Bangura, Abubakarr; Black, Jordan A.; Chevee, Victoria; Einsteen, Sarah A.; Hilton, Sarah K.; Kollmer, Max; Nadendla, Rahul; Stamm, Joyce; Fafara-Thompson, Antoinette E.; Gygi, Amber M.; Ogawa, Emmy E.; Van Camp, Matt; Kocsisova, Zuzana; Leatherman, Judith L.; Modahl, Cassie M.; Rubin, Michael R.; Apiz-Saab, Susana S.; Arias-Mejias, Suzette M.; Carrion-Ortiz, Carlos F.; Claudio-Vazquez, Patricia N.; Espada-Green, Debbie M.; Feliciano-Camacho, Marium; Gonzalez-Bonilla, Karina M.; Taboas-Arroyo, Mariela; Vargas-Franco, Dorianmarie; Montañez-Gonzalez, Raquel; Perez-Otero, Joseph; Rivera-Burgos, Myrielis; Rivera-Rosario, Francisco J.; Eisler, Heather L.; Alexander, Jackie; Begley, Samatha K.; Gabbard, Deana; Allen, Robert J.; Aung, Wint Yan; Barshop, William D.; Boozalis, Amanda; Chu, Vanessa P.; Davis, Jeremy S.; Duggal, Ryan N.; Franklin, Robert; Gavinski, Katherine; Gebreyesus, Heran; Gong, Henry Z.; Greenstein, Rachel A.; Guo, Averill D.; Hanson, Casey; Homa, Kaitlin E.; Hsu, Simon C.; Huang, Yi; Huo, Lucy; Jacobs, Sarah; Jia, Sasha; Jung, Kyle L.; Wai-Chee Kong, Sarah; Kroll, Matthew R.; Lee, Brandon M.; Lee, Paul F.; Levine, Kevin M.; Li, Amy S.; Liu, Chengyu; Liu, Max Mian; Lousararian, Adam P.; Lowery, Peter B.; Mallya, Allyson P.; Marcus, Joseph E.; Ng, Patrick C.; Nguyen, Hien P.; Patel, Ruchik; Precht, Hashini; Rastogi, Suchita; Sarezky, Jonathan M.; Schefkind, Adam; Schultz, Michael B.; Shen, Delia; Skorupa, Tara; Spies, Nicholas C.; Stancu, Gabriel; Vivian Tsang, Hiu Man; Turski, Alice L.; Venkat, Rohit; Waldman, Leah E.; Wang, Kaidi; Wang, Tracy; Wei, Jeffrey W.; Wu, Dennis Y.; Xiong, David D.; Yu, Jack; Zhou, Karen; McNeil, Gerard P.; Fernandez, Robert W.; Menzies, Patrick Gomez; Gu, Tingting; Buhler, Jeremy; Mardis, Elaine R.; Elgin, Sarah C.R.

    2017-01-01

    The discordance between genome size and the complexity of eukaryotes can partly be attributed to differences in repeat density. The Muller F element (∼5.2 Mb) is the smallest chromosome in Drosophila melanogaster, but it is substantially larger (>18.7 Mb) in D. ananassae. To identify the major contributors to the expansion of the F element and to assess their impact, we improved the genome sequence and annotated the genes in a 1.4-Mb region of the D. ananassae F element, and a 1.7-Mb region from the D element for comparison. We find that transposons (particularly LTR and LINE retrotransposons) are major contributors to this expansion (78.6%), while Wolbachia sequences integrated into the D. ananassae genome are minor contributors (0.02%). Both D. melanogaster and D. ananassae F-element genes exhibit distinct characteristics compared to D-element genes (e.g., larger coding spans, larger introns, more coding exons, and lower codon bias), but these differences are exaggerated in D. ananassae. Compared to D. melanogaster, the codon bias observed in D. ananassae F-element genes can primarily be attributed to mutational biases instead of selection. The 5′ ends of F-element genes in both species are enriched in dimethylation of lysine 4 on histone 3 (H3K4me2), while the coding spans are enriched in H3K9me2. Despite differences in repeat density and gene characteristics, D. ananassae F-element genes show a similar range of expression levels compared to genes in euchromatic domains. This study improves our understanding of how transposons can affect genome size and how genes can function within highly repetitive domains. PMID:28667019

  14. Functional analysis of thermostable proteins involved in carbohydrate metabolism

    NARCIS (Netherlands)

    Akerboom, A.P.

    2007-01-01

    Thermostable proteins can resist temperature stress whilst keeping their integrity and functionality. In many cases, thermostable proteins originate from hyperthermophilic microorganisms that thrive in extreme environments. These systems are generally located close to geothermal (volcanic) activity,

  15. BRICHOS - a superfamily of multidomain proteins with diverse functions

    Directory of Open Access Journals (Sweden)

    Johansson Jan

    2009-09-01

    Full Text Available Abstract Background The BRICHOS domain has been found in 8 protein families with a wide range of functions and a variety of disease associations, such as respiratory distress syndrome, dementia and cancer. The domain itself is thought to have a chaperone function, and indeed three of the families are associated with amyloid formation, but its structure and many of its functional properties are still unknown. Findings The proteins in the BRICHOS superfamily have four regions with distinct properties. We have analysed the BRICHOS proteins focusing on sequence conservation, amino acid residue properties, native disorder and secondary structure predictions. Residue conservation shows large variations between the regions, and the spread of residue conservation between different families can vary greatly within the regions. The secondary structure predictions for the BRICHOS proteins show remarkable coherence even where sequence conservation is low, and there seems to be little native disorder. Conclusions The greatly variant rates of conservation indicates different functional constraints among the regions and among the families. We present three previously unknown BRICHOS families; group A, which may be ancestral to the ITM2 families; group B, which is a close relative to the gastrokine families, and group C, which appears to be a truly novel, disjoint BRICHOS family. The C-terminal region of group C has nearly identical sequences in all species ranging from fish to man and is seemingly unique to this family, indicating critical functional or structural properties.

  16. A comprehensive software suite for protein family construction and functional site prediction.

    Directory of Open Access Journals (Sweden)

    David Renfrew Haft

    Full Text Available In functionally diverse protein families, conservation in short signature regions may outperform full-length sequence comparisons for identifying proteins that belong to a subgroup within which one specific aspect of their function is conserved. The SIMBAL workflow (Sites Inferred by Metabolic Background Assertion Labeling is a data-mining procedure for finding such signature regions. It begins by using clues from genomic context, such as co-occurrence or conserved gene neighborhoods, to build a useful training set from a large number of uncharacterized but mutually homologous proteins. When training set construction is successful, the YES partition is enriched in proteins that share function with the user's query sequence, while the NO partition is depleted. A selected query sequence is then mined for short signature regions whose closest matches overwhelmingly favor proteins from the YES partition. High-scoring signature regions typically contain key residues critical to functional specificity, so proteins with the highest sequence similarity across these regions tend to share the same function. The SIMBAL algorithm was described previously, but significant manual effort, expertise, and a supporting software infrastructure were required to prepare the requisite training sets. Here, we describe a new, distributable software suite that speeds up and simplifies the process for using SIMBAL, most notably by providing tools that automate training set construction. These tools have broad utility for comparative genomics, allowing for flexible collection of proteins or protein domains based on genomic context as well as homology, a capability that can greatly assist in protein family construction. Armed with this new software suite, SIMBAL can serve as a fast and powerful in silico alternative to direct experimentation for characterizing proteins and their functional interactions.

  17. MASiVEdb: the Sirevirus Plant Retrotransposon Database

    Directory of Open Access Journals (Sweden)

    Bousios Alexandros

    2012-04-01

    Full Text Available Abstract Background Sireviruses are an ancient genus of the Copia superfamily of LTR retrotransposons, and the only one that has exclusively proliferated within plant genomes. Based on experimental data and phylogenetic analyses, Sireviruses have successfully infiltrated many branches of the plant kingdom, extensively colonizing the genomes of grass species. Notably, it was recently shown that they have been a major force in the make-up and evolution of the maize genome, where they currently occupy ~21% of the nuclear content and ~90% of the Copia population. It is highly likely, therefore, that their life dynamics have been fundamental in the genome composition and organization of a plethora of plant hosts. To assist studies into their impact on plant genome evolution and also facilitate accurate identification and annotation of transposable elements in sequencing projects, we developed MASiVEdb (Mapping and Analysis of SireVirus Elements Database, a collective and systematic resource of Sireviruses in plants. Description Taking advantage of the increasing availability of plant genomic sequences, and using an updated version of MASiVE, an algorithm specifically designed to identify Sireviruses based on their highly conserved genome structure, we populated MASiVEdb (http://bat.infspire.org/databases/masivedb/ with data on 16,243 intact Sireviruses (total length >158Mb discovered in 11 fully-sequenced plant genomes. MASiVEdb is unlike any other transposable element database, providing a multitude of highly curated and detailed information on a specific genus across its hosts, such as complete set of coordinates, insertion age, and an analytical breakdown of the structure and gene complement of each element. All data are readily available through basic and advanced query interfaces, batch retrieval, and downloadable files. A purpose-built system is also offered for detecting and visualizing similarity between user sequences and Sireviruses, as

  18. Identification of functional candidates amongst hypothetical proteins of Treponema pallidum ssp. pallidum.

    Science.gov (United States)

    Naqvi, Ahmad Abu Turab; Shahbaaz, Mohd; Ahmad, Faizan; Hassan, Md Imtaiyaz

    2015-01-01

    Syphilis is a globally occurring venereal disease, and its infection is propagated through sexual contact. The causative agent of syphilis, Treponema pallidum ssp. pallidum, a Gram-negative sphirochaete, is an obligate human parasite. Genome of T. pallidum ssp. pallidum SS14 strain (RefSeq NC_010741.1) encodes 1,027 proteins, of which 444 proteins are known as hypothetical proteins (HPs), i.e., proteins of unknown functions. Here, we performed functional annotation of HPs of T. pallidum ssp. pallidum using various database, domain architecture predictors, protein function annotators and clustering tools. We have analyzed the sequences of 444 HPs of T. pallidum ssp. pallidum and subsequently predicted the function of 207 HPs with a high level of confidence. However, functions of 237 HPs are predicted with less accuracy. We found various enzymes, transporters, binding proteins in the annotated group of HPs that may be possible molecular targets, facilitating for the survival of pathogen. Our comprehensive analysis helps to understand the mechanism of pathogenesis to provide many novel potential therapeutic interventions.

  19. Intronic L1 retrotransposons and nested genes cause transcriptional interference by inducing intron retention, exonization and cryptic polyadenylation.

    Directory of Open Access Journals (Sweden)

    Kristel Kaer

    Full Text Available Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown.Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3' ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals.Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression.

  20. Functional requirements of the yellow fever virus capsid protein.

    Science.gov (United States)

    Patkar, Chinmay G; Jones, Christopher T; Chang, Yu-hsuan; Warrier, Ranjit; Kuhn, Richard J

    2007-06-01

    Although it is known that the flavivirus capsid protein is essential for genome packaging and formation of infectious particles, the minimal requirements of the dimeric capsid protein for virus assembly/disassembly have not been characterized. By use of a trans-packaging system that involved packaging a yellow fever virus (YFV) replicon into pseudo-infectious particles by supplying the YFV structural proteins using a Sindbis virus helper construct, the functional elements within the YFV capsid protein (YFC) were characterized. Various N- and C-terminal truncations, internal deletions, and point mutations of YFC were analyzed for their ability to package the YFV replicon. Consistent with previous reports on the tick-borne encephalitis virus capsid protein, YFC demonstrates remarkable functional flexibility. Nearly 40 residues of YFC could be removed from the N terminus while the ability to package replicon RNA was retained. Additionally, YFC containing a deletion of approximately 27 residues of the C terminus, including a complete deletion of C-terminal helix 4, was functional. Internal deletions encompassing the internal hydrophobic sequence in YFC were, in general, tolerated to a lesser extent. Site-directed mutagenesis of helix 4 residues predicted to be involved in intermonomeric interactions were also analyzed, and although single mutations did not affect packaging, a YFC with the double mutation of leucine 81 and valine 88 was nonfunctional. The effects of mutations in YFC on the viability of YFV infection were also analyzed, and these results were similar to those obtained using the replicon packaging system, thus underscoring the flexibility of YFC with respect to the requirements for its functioning.

  1. Role of the MAGUK protein family in synapse formation and function.

    Science.gov (United States)

    Oliva, Carlos; Escobedo, Pía; Astorga, César; Molina, Claudia; Sierralta, Jimena

    2012-01-01

    Synaptic function is crucially dependent on the spatial organization of the presynaptic and postsynaptic apparatuses and the juxtaposition of both membrane compartments. This precise arrangement is achieved by a protein network at the submembrane region of each cell that is built around scaffold proteins. The membrane-associated guanylate kinase (MAGUK) family of proteins is a widely expressed and well-conserved group of proteins that plays an essential role in the formation and regulation of this scaffolding. Here, we review general features of this protein family, focusing on the discs large and calcium/calmodulin-dependent serine protein kinase subfamilies of MAGUKs in the formation, function, and plasticity of synapses. Copyright © 2011 Wiley Periodicals, Inc.

  2. Arabidopsis thaliana mTERF proteins: evolution and functional classification

    Directory of Open Access Journals (Sweden)

    Tatjana eKleine

    2012-10-01

    Full Text Available Organellar gene expression (OGE is crucial for plant development, photosynthesis and respiration, but our understanding of the mechanisms that control it is still relatively poor. Thus, OGE requires various nucleus-encoded proteins that promote transcription, splicing, trimming and editing of organellar RNAs, and regulate translation. In metazoans, proteins of the mitochondrial Transcription tERmination Factor (mTERF family interact with the mitochondrial chromosome and regulate transcriptional initiation and termination. Sequencing of the Arabidopsis thaliana genome led to the identification of a diversified MTERF gene family but, in contrast to mammalian mTERFs, knowledge about the function of these proteins in photosynthetic organisms is scarce. In this hypothesis article, I show that tandem duplications and one block duplication contributed to the large number of MTERF genes in A. thaliana, and propose that the expansion of the family is related to the evolution of land plants. The MTERF genes - especially the duplicated genes - display a number of distinct mRNA accumulation patterns, suggesting functional diversification of mTERF proteins to increase adaptability to environmental changes. Indeed, hypothetical functions for the different mTERF proteins can be predicted using co-expression analysis and gene ontology annotations. On this basis, mTERF proteins can be sorted into five groups. Members of the chloroplast and chloroplast-associated clusters are principally involved in chloroplast gene expression, embryogenesis and protein catabolism, while representatives of the mitochondrial cluster seem to participate in DNA and RNA metabolism in that organelle. Moreover, members of the mitochondrion-associated cluster and the low expression group may act in the nucleus and/or the cytosol. As proteins involved in OGE and presumably nuclear gene expression, mTERFs are ideal candidates for the coordination of the expression of organelle and nuclear

  3. Bioorthogonal fluorescent labeling of functional G-protein-coupled receptors

    DEFF Research Database (Denmark)

    Tian, He; Naganathan, Saranga; Kazmi, Manija A

    2014-01-01

    Novel methods are required for site-specific, quantitative fluorescent labeling of G-protein-coupled receptors (GPCRs) and other difficult-to-express membrane proteins. Ideally, fluorescent probes should perturb the native structure and function as little as possible. We evaluated bioorthogonal...

  4. Chaos game representation of functional protein sequences, and simulation and multifractal analysis of induced measures

    International Nuclear Information System (INIS)

    Zu-Guo, Yu; Qian-Jun, Xiao; Long, Shi; Jun-Wu, Yu; Anh, Vo

    2010-01-01

    Investigating the biological function of proteins is a key aspect of protein studies. Bioinformatic methods become important for studying the biological function of proteins. In this paper, we first give the chaos game representation (CGR) of randomly-linked functional protein sequences, then propose the use of the recurrent iterated function systems (RIFS) in fractal theory to simulate the measure based on their chaos game representations. This method helps to extract some features of functional protein sequences, and furthermore the biological functions of these proteins. Then multifractal analysis of the measures based on the CGRs of randomly-linked functional protein sequences are performed. We find that the CGRs have clear fractal patterns. The numerical results show that the RIFS can simulate the measure based on the CGR very well. The relative standard error and the estimated probability matrix in the RIFS do not depend on the order to link the functional protein sequences. The estimated probability matrices in the RIFS with different biological functions are evidently different. Hence the estimated probability matrices in the RIFS can be used to characterise the difference among linked functional protein sequences with different biological functions. From the values of the D q curves, one sees that these functional protein sequences are not completely random. The D q of all linked functional proteins studied are multifractal-like and sufficiently smooth for the C q (analogous to specific heat) curves to be meaningful. Furthermore, the D q curves of the measure μ based on their CGRs for different orders to link the functional protein sequences are almost identical if q ≥ 0. Finally, the C q curves of all linked functional proteins resemble a classical phase transition at a critical point. (cross-disciplinary physics and related areas of science and technology)

  5. Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis.

    Science.gov (United States)

    Du, Yushen; Wu, Nicholas C; Jiang, Lin; Zhang, Tianhao; Gong, Danyang; Shu, Sara; Wu, Ting-Ting; Sun, Ren

    2016-11-01

    Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp), we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available. To fully comprehend the diverse functions of a protein, it is essential to understand the functionality of individual residues. Current methods are highly dependent on evolutionary sequence conservation, which is

  6. PDZ-containing proteins: alternative splicing as a source of functional diversity.

    Science.gov (United States)

    Sierralta, Jimena; Mendoza, Carolina

    2004-12-01

    Scaffold proteins allow specific protein complexes to be assembled in particular regions of the cell at which they organize subcellular structures and signal transduction complexes. This characteristic is especially important for neurons, which are highly polarized cells. Among the domains contained by scaffold proteins, the PSD-95, Discs-large, ZO-1 (PDZ) domains are of particular relevance in signal transduction processes and maintenance of neuronal and epithelial polarity. These domains are specialized in the binding of the carboxyl termini of proteins allowing membrane proteins to be localized by the anchoring to the cytoskeleton mediated by PDZ-containing scaffold proteins. In vivo studies carried out in Drosophila have taught that the role of many scaffold proteins is not limited to a single process; thus, in many cases the same genes are expressed in different tissues and participate in apparently very diverse processes. In addition to the differential expression of interactors of scaffold proteins, the expression of variants of these molecular scaffolds as the result of the alternative processing of the genes that encode them is proving to be a very important source of variability and complexity on a main theme. Alternative splicing in the nervous system is well documented, where specific isoforms play roles in neurotransmission, ion channel function, neuronal cell recognition, and are developmentally regulated making it a major mechanism of functional diversity. Here we review the current state of knowledge about the diversity and the known function of PDZ-containing proteins in Drosophila with emphasis in the role played by alternatively processed forms in the diversity of functions attributed to this family of proteins.

  7. Structures and Corresponding Functions of Five Types of Picornaviral 2A Proteins

    Directory of Open Access Journals (Sweden)

    Xiaoyao Yang

    2017-07-01

    Full Text Available Among the few non-structural proteins encoded by the picornaviral genome, the 2A protein is particularly special, irrespective of structure or function. During the evolution of the Picornaviridae family, the 2A protein has been highly non-conserved. We believe that the 2A protein in this family can be classified into at least five distinct types according to previous studies. These five types are (A chymotrypsin-like 2A, (B Parechovirus-like 2A, (C hepatitis-A-virus-like 2A, (D Aphthovirus-like 2A, and (E 2A sequence of the genus Cardiovirus. We carried out a phylogenetic analysis and found that there was almost no homology between each type. Subsequently, we aligned the sequences within each type and found that the functional motifs in each type are highly conserved. These different motifs perform different functions. Therefore, in this review, we introduce the structures and functions of these five types of 2As separately. Based on the structures and functions, we provide suggestions to combat picornaviruses. The complexity and diversity of the 2A protein has caused great difficulties in functional and antiviral research. In this review, researchers can find useful information on the 2A protein and thus conduct improved antiviral research.

  8. Expanded explorations into the optimization of an energy function for protein design

    Science.gov (United States)

    Huang, Yao-ming; Bystroff, Christopher

    2014-01-01

    Nature possesses a secret formula for the energy as a function of the structure of a protein. In protein design, approximations are made to both the structural representation of the molecule and to the form of the energy equation, such that the existence of a general energy function for proteins is by no means guaranteed. Here we present new insights towards the application of machine learning to the problem of finding a general energy function for protein design. Machine learning requires the definition of an objective function, which carries with it the implied definition of success in protein design. We explored four functions, consisting of two functional forms, each with two criteria for success. Optimization was carried out by a Monte Carlo search through the space of all variable parameters. Cross-validation of the optimized energy function against a test set gave significantly different results depending on the choice of objective function, pointing to relative correctness of the built-in assumptions. Novel energy cross-terms correct for the observed non-additivity of energy terms and an imbalance in the distribution of predicted amino acids. This paper expands on the work presented at ACM-BCB, Orlando FL , October 2012. PMID:24384706

  9. Preparation of functional lupine protein fractions by dry separation

    NARCIS (Netherlands)

    Pelgrom, P.J.M.; Berghout, J.A.M.; Goot, van der A.J.; Boom, R.M.; Schutyser, M.A.I.

    2014-01-01

    Lupine protein concentrate is a promising ingredient that can be obtained by a combination of milling and air classification, generally called dry fractionation. This is a more sustainable route than conventional wet extraction and delivers a protein concentrate with native functional properties.

  10. Characterization and potential functional significance of human-chimpanzee large INDEL variation

    Directory of Open Access Journals (Sweden)

    Polavarapu Nalini

    2011-10-01

    Full Text Available Abstract Background Although humans and chimpanzees have accumulated significant differences in a number of phenotypic traits since diverging from a common ancestor about six million years ago, their genomes are more than 98.5% identical at protein-coding loci. This modest degree of nucleotide divergence is not sufficient to explain the extensive phenotypic differences between the two species. It has been hypothesized that the genetic basis of the phenotypic differences lies at the level of gene regulation and is associated with the extensive insertion and deletion (INDEL variation between the two species. To test the hypothesis that large INDELs (80 to 12,000 bp may have contributed significantly to differences in gene regulation between the two species, we categorized human-chimpanzee INDEL variation mapping in or around genes and determined whether this variation is significantly correlated with previously determined differences in gene expression. Results Extensive, large INDEL variation exists between the human and chimpanzee genomes. This variation is primarily attributable to retrotransposon insertions within the human lineage. There is a significant correlation between differences in gene expression and large human-chimpanzee INDEL variation mapping in genes or in proximity to them. Conclusions The results presented herein are consistent with the hypothesis that large INDELs, particularly those associated with retrotransposons, have played a significant role in human-chimpanzee regulatory evolution.

  11. Binding proteins of somatomedins and their functions

    International Nuclear Information System (INIS)

    Kostelecka, Z.; Blahovec, J.

    1998-01-01

    In this paper the functions of binding proteins are discussed. One variable that provides insulin-like growth factors (IGFs) control at the extracellular level is the presence of high-affinity, soluble insulin-like growth factor proteins (IGFBPs). IGFBP-1 inhibits IGF effect on human osteosarcoma cells. Increased concentration of IGFBP-3 inhibits the proliferation of breast cancer cell line MCF 7 either directly or by competition for IGF receptors. Maybe IGFBPs work as anti-mitogens and IGFs are potential promotors of cancer growth

  12. Intracellular Transport and Kinesin Superfamily Proteins: Structure, Function and Dynamics

    Science.gov (United States)

    Hirokawa, N.; Takemura, R.

    Using various molecular cell biological and molecular genetic approaches, we identified kinesin superfamily proteins (KIFs) and characterized their significant functions in intracellular transport, which is fundamental for cellular morphogenesis, functioning, and survival. We showed that KIFs not only transport various membranous organelles, proteins complexes and mRNAs fundamental for cellular functions but also play significant roles in higher brain functions such as memory and learning, determination of important developmental processes such as left-right asymmetry formation and brain wiring. We also elucidated that KIFs recognize and bind to their specific cargoes using scaffolding or adaptor protein complexes. Concerning the mechanism of motility, we discovered the simplest unique monomeric motor KIF1A and determined by molecular biophysics, cryoelectron microscopy and X-ray crystallography that KIF1A can move on a microtubule processively as a monomer by biased Brownian motion and by hydolyzing ATP.

  13. Hypothesis: NDL proteins function in stress responses by regulating microtubule organization.

    Science.gov (United States)

    Khatri, Nisha; Mudgil, Yashwanti

    2015-01-01

    N-MYC DOWNREGULATED-LIKE proteins (NDL), members of the alpha/beta hydrolase superfamily were recently rediscovered as interactors of G-protein signaling in Arabidopsis thaliana. Although the precise molecular function of NDL proteins is still elusive, in animals these proteins play protective role in hypoxia and expression is induced by hypoxia and nickel, indicating role in stress. Homology of NDL1 with animal counterpart N-MYC DOWNREGULATED GENE (NDRG) suggests similar functions in animals and plants. It is well established that stress responses leads to the microtubule depolymerization and reorganization which is crucial for stress tolerance. NDRG is a microtubule-associated protein which mediates the microtubule organization in animals by causing acetylation and increases the stability of α-tubulin. As NDL1 is highly homologous to NDRG, involvement of NDL1 in the microtubule organization during plant stress can also be expected. Discovery of interaction of NDL with protein kinesin light chain- related 1, enodomembrane family protein 70, syntaxin-23, tubulin alpha-2 chain, as a part of G protein interactome initiative encourages us to postulate microtubule stabilizing functions for NDL family in plants. Our search for NDL interactors in G protein interactome also predicts the role of NDL proteins in abiotic stress tolerance management. Based on published report in animals and predicted interacting partners for NDL in G protein interactome lead us to hypothesize involvement of NDL in the microtubule organization during abiotic stress management in plants.

  14. Using analyses of amino Acid coevolution to understand protein structure and function.

    Science.gov (United States)

    Ashenberg, Orr; Laub, Michael T

    2013-01-01

    Determining which residues of a protein contribute to a specific function is a difficult problem. Analyses of amino acid covariation within a protein family can serve as a useful guide by identifying residues that are functionally coupled. Covariation analyses have been successfully used on several different protein families to identify residues that work together to promote folding, enable protein-protein interactions, or contribute to an enzymatic activity. Covariation is a statistical signal that can be measured in a multiple sequence alignment of homologous proteins. As sequence databases have expanded dramatically, covariation analyses have become easier and more powerful. In this chapter, we describe how functional covariation arises during the evolution of proteins and how this signal can be distinguished from various background signals. We discuss the basic methodology for performing amino acid covariation analysis, using bacterial two-component signal transduction proteins as an example. We provide practical suggestions for each step of the process including assembly of protein sequences, construction of a multiple sequence alignment, measurement of covariation, and analysis of results. Copyright © 2013 Elsevier Inc. All rights reserved.

  15. Structural and Functional Annotation of Hypothetical Proteins of O139

    Directory of Open Access Journals (Sweden)

    Md. Saiful Islam

    2015-06-01

    Full Text Available In developing countries threat of cholera is a significant health concern whenever water purification and sewage disposal systems are inadequate. Vibrio cholerae is one of the responsible bacteria involved in cholera disease. The complete genome sequence of V. cholerae deciphers the presence of various genes and hypothetical proteins whose function are not yet understood. Hence analyzing and annotating the structure and function of hypothetical proteins is important for understanding the V. cholerae. V. cholerae O139 is the most common and pathogenic bacterial strain among various V. cholerae strains. In this study sequence of six hypothetical proteins of V. cholerae O139 has been annotated from NCBI. Various computational tools and databases have been used to determine domain family, protein-protein interaction, solubility of protein, ligand binding sites etc. The three dimensional structure of two proteins were modeled and their ligand binding sites were identified. We have found domains and families of only one protein. The analysis revealed that these proteins might have antibiotic resistance activity, DNA breaking-rejoining activity, integrase enzyme activity, restriction endonuclease, etc. Structural prediction of these proteins and detection of binding sites from this study would indicate a potential target aiding docking studies for therapeutic designing against cholera.

  16. Automatic discovery of cross-family sequence features associated with protein function

    Directory of Open Access Journals (Sweden)

    Krings Andrea

    2006-01-01

    Full Text Available Abstract Background Methods for predicting protein function directly from amino acid sequences are useful tools in the study of uncharacterised protein families and in comparative genomics. Until now, this problem has been approached using machine learning techniques that attempt to predict membership, or otherwise, to predefined functional categories or subcellular locations. A potential drawback of this approach is that the human-designated functional classes may not accurately reflect the underlying biology, and consequently important sequence-to-function relationships may be missed. Results We show that a self-supervised data mining approach is able to find relationships between sequence features and functional annotations. No preconceived ideas about functional categories are required, and the training data is simply a set of protein sequences and their UniProt/Swiss-Prot annotations. The main technical aspect of the approach is the co-evolution of amino acid-based regular expressions and keyword-based logical expressions with genetic programming. Our experiments on a strictly non-redundant set of eukaryotic proteins reveal that the strongest and most easily detected sequence-to-function relationships are concerned with targeting to various cellular compartments, which is an area already well studied both experimentally and computationally. Of more interest are a number of broad functional roles which can also be correlated with sequence features. These include inhibition, biosynthesis, transcription and defence against bacteria. Despite substantial overlaps between these functions and their corresponding cellular compartments, we find clear differences in the sequence motifs used to predict some of these functions. For example, the presence of polyglutamine repeats appears to be linked more strongly to the "transcription" function than to the general "nuclear" function/location. Conclusion We have developed a novel and useful approach for

  17. A semi-nonparametric mixture model for selecting functionally consistent proteins.

    Science.gov (United States)

    Yu, Lianbo; Doerge, Rw

    2010-09-28

    High-throughput technologies have led to a new era of proteomics. Although protein microarray experiments are becoming more common place there are a variety of experimental and statistical issues that have yet to be addressed, and that will carry over to new high-throughput technologies unless they are investigated. One of the largest of these challenges is the selection of functionally consistent proteins. We present a novel semi-nonparametric mixture model for classifying proteins as consistent or inconsistent while controlling the false discovery rate and the false non-discovery rate. The performance of the proposed approach is compared to current methods via simulation under a variety of experimental conditions. We provide a statistical method for selecting functionally consistent proteins in the context of protein microarray experiments, but the proposed semi-nonparametric mixture model method can certainly be generalized to solve other mixture data problems. The main advantage of this approach is that it provides the posterior probability of consistency for each protein.

  18. Dry fractionation for production of functional pea protein concentrates

    NARCIS (Netherlands)

    Pelgrom, P.J.M.; Vissers, A.M.; Boom, R.M.; Schutyser, M.A.I.

    2013-01-01

    Dry milling in combination with air classification was evaluated as an alternative to conventional wet extraction of protein from yellow field peas (Pisum sativum). Major advantages of dry fractionation are retention of native functionality of proteins and its lower energy and water use. Peas were

  19. A large-scale evaluation of computational protein function prediction

    NARCIS (Netherlands)

    Radivojac, P.; Clark, W.T.; Oron, T.R.; Schnoes, A.M.; Wittkop, T.; Kourmpetis, Y.A.I.; Dijk, van A.D.J.; Friedberg, I.

    2013-01-01

    Automated annotation of protein function is challenging. As the number of sequenced genomes rapidly grows, the overwhelming majority of protein products can only be annotated computationally. If computational predictions are to be relied upon, it is crucial that the accuracy of these methods be

  20. Evolved Escherichia coli strains for amplified, functional expression of membrane proteins.

    Science.gov (United States)

    Gul, Nadia; Linares, Daniel M; Ho, Franz Y; Poolman, Bert

    2014-01-09

    The major barrier to the physical characterization and structure determination of membrane proteins is low protein yield and/or low functionality in recombinant expression. The enteric bacterium Escherichia coli is the most widely employed organism for producing recombinant proteins. Beside several advantages of this expression host, one major drawback is that the protein of interest does not always adopt its native conformation and may end up in large insoluble aggregates. We describe a robust strategy to increase the likelihood of overexpressing membrane proteins in a functional state. The method involves fusion in tandem of green fluorescent protein and the erythromycin resistance protein (23S ribosomal RNA adenine N-6 methyltransferase, ErmC) to the C-terminus of a target membrane protein. The fluorescence of green fluorescent protein is used to report the folding state of the target protein, whereas ErmC is used to select for increased expression. By gradually increasing the erythromycin concentration of the medium and testing different membrane protein targets, we obtained a number of evolved strains of which four (NG2, NG3, NG5 and NG6) were characterized and their genome was fully sequenced. Strikingly, each of the strains carried a mutation in the hns gene, whose product is involved in genome organization and transcriptional silencing. The degree of expression of (membrane) proteins correlates with the severity of the hns mutation, but cells in which hns was deleted showed an intermediate expression performance. We propose that (partial) removal of the transcriptional silencing mechanism changes the levels of proteins essential for the functional overexpression of membrane proteins. © 2013.

  1. Computational structural and functional analysis of hypothetical proteins of Staphylococcus aureus

    OpenAIRE

    Mohan, Ramadevi; Venugopal, Subhashree

    2012-01-01

    Genome sequencing projects has led to an explosion of large amount of gene products in which many are of hypothetical proteins with unknown function. Analyzing and annotating the functions of hypothetical proteins is important in Staphylococcus aureus which is a pathogenic bacterium that cause multiple types of diseases by infecting various sites in humans and animals. In this study, ten hypothetical proteins of Staphylococcus aureus were retrieved from NCBI and analyzed for their structural ...

  2. Membrane proteins bind lipids selectively to modulate their structure and function.

    Science.gov (United States)

    Laganowsky, Arthur; Reading, Eamonn; Allison, Timothy M; Ulmschneider, Martin B; Degiacomi, Matteo T; Baldwin, Andrew J; Robinson, Carol V

    2014-06-05

    Previous studies have established that the folding, structure and function of membrane proteins are influenced by their lipid environments and that lipids can bind to specific sites, for example, in potassium channels. Fundamental questions remain however regarding the extent of membrane protein selectivity towards lipids. Here we report a mass spectrometry approach designed to determine the selectivity of lipid binding to membrane protein complexes. We investigate the mechanosensitive channel of large conductance (MscL) from Mycobacterium tuberculosis and aquaporin Z (AqpZ) and the ammonia channel (AmtB) from Escherichia coli, using ion mobility mass spectrometry (IM-MS), which reports gas-phase collision cross-sections. We demonstrate that folded conformations of membrane protein complexes can exist in the gas phase. By resolving lipid-bound states, we then rank bound lipids on the basis of their ability to resist gas phase unfolding and thereby stabilize membrane protein structure. Lipids bind non-selectively and with high avidity to MscL, all imparting comparable stability; however, the highest-ranking lipid is phosphatidylinositol phosphate, in line with its proposed functional role in mechanosensation. AqpZ is also stabilized by many lipids, with cardiolipin imparting the most significant resistance to unfolding. Subsequently, through functional assays we show that cardiolipin modulates AqpZ function. Similar experiments identify AmtB as being highly selective for phosphatidylglycerol, prompting us to obtain an X-ray structure in this lipid membrane-like environment. The 2.3 Å resolution structure, when compared with others obtained without lipid bound, reveals distinct conformational changes that re-position AmtB residues to interact with the lipid bilayer. Our results demonstrate that resistance to unfolding correlates with specific lipid-binding events, enabling a distinction to be made between lipids that merely bind from those that modulate membrane

  3. Functional impact of the human mobilome.

    Science.gov (United States)

    Babatz, Timothy D; Burns, Kathleen H

    2013-06-01

    The human genome is replete with interspersed repetitive sequences derived from the propagation of mobile DNA elements. Three families of human retrotransposons remain active today: LINE1, Alu, and SVA elements. Since 1988, de novo insertions at previously recognized disease loci have been shown to generate highly penetrant alleles in Mendelian disorders. Only recently has the extent of germline-transmitted retrotransposon insertion polymorphism (RIP) in human populations been fully realized. Also exciting are recent studies of somatic retrotransposition in human tissues and reports of tumor-specific insertions, suggesting roles in tissue heterogeneity and tumorigenesis. Here we discuss mobile elements in human disease with an emphasis on exciting developments from the last several years. Copyright © 2013 Elsevier Ltd. All rights reserved.

  4. Integration of relational and hierarchical network information for protein function prediction

    Directory of Open Access Journals (Sweden)

    Jiang Xiaoyu

    2008-08-01

    Full Text Available Abstract Background In the current climate of high-throughput computational biology, the inference of a protein's function from related measurements, such as protein-protein interaction relations, has become a canonical task. Most existing technologies pursue this task as a classification problem, on a term-by-term basis, for each term in a database, such as the Gene Ontology (GO database, a popular rigorous vocabulary for biological functions. However, ontology structures are essentially hierarchies, with certain top to bottom annotation rules which protein function predictions should in principle follow. Currently, the most common approach to imposing these hierarchical constraints on network-based classifiers is through the use of transitive closure to predictions. Results We propose a probabilistic framework to integrate information in relational data, in the form of a protein-protein interaction network, and a hierarchically structured database of terms, in the form of the GO database, for the purpose of protein function prediction. At the heart of our framework is a factorization of local neighborhood information in the protein-protein interaction network across successive ancestral terms in the GO hierarchy. We introduce a classifier within this framework, with computationally efficient implementation, that produces GO-term predictions that naturally obey a hierarchical 'true-path' consistency from root to leaves, without the need for further post-processing. Conclusion A cross-validation study, using data from the yeast Saccharomyces cerevisiae, shows our method offers substantial improvements over both standard 'guilt-by-association' (i.e., Nearest-Neighbor and more refined Markov random field methods, whether in their original form or when post-processed to artificially impose 'true-path' consistency. Further analysis of the results indicates that these improvements are associated with increased predictive capabilities (i.e., increased

  5. Optimization of functionalization conditions for protein analysis by AFM

    Energy Technology Data Exchange (ETDEWEB)

    Arroyo-Hernández, María, E-mail: maria.arroyo@ctb.upm.es [Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, 28223 Pozuelo de Alarcón, Madrid (Spain); Departamento de Ciencia de Materiales, ETSI Caminos, Canales y Puertos, Universidad Politécnica de Madrid, 28040 Madrid (Spain); Daza, Rafael; Pérez-Rigueiro, Jose; Elices, Manuel; Nieto-Márquez, Jorge; Guinea, Gustavo V. [Centro de Tecnología Biomédica, Universidad Politécnica de Madrid, 28223 Pozuelo de Alarcón, Madrid (Spain); Departamento de Ciencia de Materiales, ETSI Caminos, Canales y Puertos, Universidad Politécnica de Madrid, 28040 Madrid (Spain)

    2014-10-30

    Highlights: • Highest fluorescence is obtained for central conditions. • Largest primary amine contribution is obtained for central conditions. • RMS roughness is smaller than 1 nm for all functional films. • Selected deposition conditions lead to proper RMS and functionality values. • LDH proteins adsorbed on AVS-films were observed by AFM. - Abstract: Activated vapor silanization (AVS) is used to functionalize silicon surfaces through deposition of amine-containing thin films. AVS combines vapor silanization and chemical vapor deposition techniques and allows the properties of the functionalized layers (thickness, amine concentration and topography) to be controlled by tuning the deposition conditions. An accurate characterization is performed to correlate the deposition conditions and functional-film properties. In particular, it is shown that smooth surfaces with a sufficient surface density of amine groups may be obtained with this technique. These surfaces are suitable for the study of proteins with atomic force microscopy.

  6. Functional Advantages of Conserved Intrinsic Disorder in RNA-Binding Proteins

    OpenAIRE

    Varadi, Mihaly; Zsolyomi, Fruzsina; Guharoy, Mainak; Tompa, Peter

    2015-01-01

    Proteins form large macromolecular assemblies with RNA that govern essential molecular processes. RNA-binding proteins have often been associated with conformational flexibility, yet the extent and functional implications of their intrinsic disorder have never been fully assessed. Here, through large-scale analysis of comprehensive protein sequence and structure datasets we demonstrate the prevalence of intrinsic structural disorder in RNA-binding proteins and domains. We addressed their func...

  7. Surface dynamics in allosteric regulation of protein-protein interactions: modulation of calmodulin functions by Ca2+.

    Directory of Open Access Journals (Sweden)

    Yosef Y Kuttner

    2013-04-01

    Full Text Available Knowledge of the structural basis of protein-protein interactions (PPI is of fundamental importance for understanding the organization and functioning of biological networks and advancing the design of therapeutics which target PPI. Allosteric modulators play an important role in regulating such interactions by binding at site(s orthogonal to the complex interface and altering the protein's propensity for complex formation. In this work, we apply an approach recently developed by us for analyzing protein surfaces based on steered molecular dynamics simulation (SMD to the study of the dynamic properties of functionally distinct conformations of a model protein, calmodulin (CaM, whose ability to interact with target proteins is regulated by the presence of the allosteric modulator Ca(2+. Calmodulin is a regulatory protein that acts as an intracellular Ca(2+ sensor to control a wide variety of cellular processes. We demonstrate that SMD analysis is capable of pinpointing CaM surfaces implicated in the recognition of both the allosteric modulator Ca(2+ and target proteins. Our analysis of changes in the dynamic properties of the CaM backbone elicited by Ca(2+ binding yielded new insights into the molecular mechanism of allosteric regulation of CaM-target interactions.

  8. Insulator function and topological domain border strength scale with architectural protein occupancy

    Science.gov (United States)

    2014-01-01

    Background Chromosome conformation capture studies suggest that eukaryotic genomes are organized into structures called topologically associating domains. The borders of these domains are highly enriched for architectural proteins with characterized roles in insulator function. However, a majority of architectural protein binding sites localize within topological domains, suggesting sites associated with domain borders represent a functionally different subclass of these regulatory elements. How topologically associating domains are established and what differentiates border-associated from non-border architectural protein binding sites remain unanswered questions. Results By mapping the genome-wide target sites for several Drosophila architectural proteins, including previously uncharacterized profiles for TFIIIC and SMC-containing condensin complexes, we uncover an extensive pattern of colocalization in which architectural proteins establish dense clusters at the borders of topological domains. Reporter-based enhancer-blocking insulator activity as well as endogenous domain border strength scale with the occupancy level of architectural protein binding sites, suggesting co-binding by architectural proteins underlies the functional potential of these loci. Analyses in mouse and human stem cells suggest that clustering of architectural proteins is a general feature of genome organization, and conserved architectural protein binding sites may underlie the tissue-invariant nature of topologically associating domains observed in mammals. Conclusions We identify a spectrum of architectural protein occupancy that scales with the topological structure of chromosomes and the regulatory potential of these elements. Whereas high occupancy architectural protein binding sites associate with robust partitioning of topologically associating domains and robust insulator function, low occupancy sites appear reserved for gene-specific regulation within topological domains. PMID

  9. Experimental parameterization of an energy function for the simulation of unfolded proteins

    DEFF Research Database (Denmark)

    Norgaard, A.B.; Ferkinghoff-Borg, Jesper; Lindorff-Larsen, K.

    2008-01-01

    The determination of conformational preferences in unfolded and disordered proteins is an important challenge in structural biology. We here describe an algorithm to optimize energy functions for the simulation of unfolded proteins. The procedure is based on the maximum likelihood principle and e...... and can be applied to a range of experimental data and energy functions including the force fields used in molecular dynamics simulations.......The determination of conformational preferences in unfolded and disordered proteins is an important challenge in structural biology. We here describe an algorithm to optimize energy functions for the simulation of unfolded proteins. The procedure is based on the maximum likelihood principle...

  10. Functional advantages of dynamic protein disorder.

    Science.gov (United States)

    Berlow, Rebecca B; Dyson, H Jane; Wright, Peter E

    2015-09-14

    Intrinsically disordered proteins participate in many important cellular regulatory processes. The absence of a well-defined structure in the free state of a disordered domain, and even on occasion when it is bound to physiological partners, is fundamental to its function. Disordered domains are frequently the location of multiple sites for post-translational modification, the key element of metabolic control in the cell. When a disordered domain folds upon binding to a partner, the resulting complex buries a far greater surface area than in an interaction of comparably-sized folded proteins, thus maximizing specificity at modest protein size. Disorder also maintains accessibility of sites for post-translational modification. Because of their inherent plasticity, disordered domains frequently adopt entirely different structures when bound to different partners, increasing the repertoire of available interactions without the necessity for expression of many different proteins. This feature also adds to the faithfulness of cellular regulation, as the availability of a given disordered domain depends on competition between various partners relevant to different cellular processes. Copyright © 2015 Federation of European Biochemical Societies. Published by Elsevier B.V. All rights reserved.

  11. Nanodisc-Tm: Rapid functional assessment of nanodisc reconstituted membrane proteins by CPM assay.

    Science.gov (United States)

    Ashok, Yashwanth; Jaakola, Veli-Pekka

    2016-01-01

    Membrane proteins are generally unstable in detergents. Therefore, biochemical and biophysical studies of membrane proteins in lipidic environments provides a near native-like environment suitable for membrane proteins. However, manipulation of proteins embedded in lipid bilayer has remained difficult. Methods such as nanodiscs and lipid cubic phase have been developed for easy manipulation of membrane proteins and have yielded significant insights into membrane proteins. Traditionally functional reconstitution of receptors in nanodiscs has been studied with radioligands. We present a simple and faster method for studying the functionality of reconstituted membrane proteins for routine characterization of protein batches after initial optimization of suitable conditions using radioligands. The benefits of the method are •Faster and generic method to assess functional reconstitution of membrane proteins.•Adaptable in high throughput format (≥96 well format).•Stability measurement in near-native lipid environment and lipid dependent melting temperatures.

  12. Cost Function Network-based Design of Protein-Protein Interactions: predicting changes in binding affinity.

    Science.gov (United States)

    Viricel, Clément; de Givry, Simon; Schiex, Thomas; Barbe, Sophie

    2018-02-20

    Accurate and economic methods to predict change in protein binding free energy upon mutation are imperative to accelerate the design of proteins for a wide range of applications. Free energy is defined by enthalpic and entropic contributions. Following the recent progresses of Artificial Intelligence-based algorithms for guaranteed NP-hard energy optimization and partition function computation, it becomes possible to quickly compute minimum energy conformations and to reliably estimate the entropic contribution of side-chains in the change of free energy of large protein interfaces. Using guaranteed Cost Function Network algorithms, Rosetta energy functions and Dunbrack's rotamer library, we developed and assessed EasyE and JayZ, two methods for binding affinity estimation that ignore or include conformational entropic contributions on a large benchmark of binding affinity experimental measures. If both approaches outperform most established tools, we observe that side-chain conformational entropy brings little or no improvement on most systems but becomes crucial in some rare cases. as open-source Python/C ++ code at sourcesup.renater.fr/projects/easy-jayz. thomas.schiex@inra.fr and sophie.barbe@insa-toulouse.fr. Supplementary data are available at Bioinformatics online.

  13. Rheological and Functional Properties of Catfish Skin Protein Hydrolysates

    Science.gov (United States)

    Catfish skin is an abundant and underutilized resource that can be used as a unique protein source to make fish skin hydrolysates. The objectives of this study were to: isolating soluble and insoluble proteins from hydrolyzed catfish skin and study the chemical and functional properties of the prote...

  14. Rapid production of functionalized recombinant proteins: marrying ligation independent cloning and in vitro protein ligation.

    Science.gov (United States)

    Kushnir, Susanna; Marsac, Yoann; Breitling, Reinhard; Granovsky, Igor; Brok-Volchanskaya, Vera; Goody, Roger S; Becker, Christian F W; Alexandrov, Kirill

    2006-01-01

    Functional genomics and proteomics have been very active fields since the sequencing of several genomes was completed. To assign a physiological role to the newly discovered coding genes with unknown function, new generic methods for protein production, purification, and targeted functionalization are needed. This work presents a new vector, pCYSLIC, that allows rapid generation of Escherichia coli expression constructs via ligation-independent cloning (LIC). The vector is designed to facilitate protein purification by either Ni-NTA or GSH affinity chromatography. Subsequent proteolytic removal of affinity tags liberates an N-terminal cysteine residue that is then used for covalent modification of the target protein with different biophysical probes via protein ligation. The described system has been tested on 36 mammalian Rab GTPases, and it was demonstrated that recombinant GTPases produced with pCYSLIC could be efficiently modified with fluorescein or biotin in vitro. Finally, LIC was compared with the recently developed In-Fusion cloning method, and it was demonstrated that In-Fusion provides superior flexibility in choice of expression vector. By the application of In-Fusion cloning Cys-Rab6A GTPase with an N-terminal cysteine residue was generated employing unmodified pET30a vector and TVMV protease.

  15. Retrotransposons Are the Major Contributors to the Expansion of the Drosophila ananassae Muller F Element.

    Science.gov (United States)

    Leung, Wilson; Shaffer, Christopher D; Chen, Elizabeth J; Quisenberry, Thomas J; Ko, Kevin; Braverman, John M; Giarla, Thomas C; Mortimer, Nathan T; Reed, Laura K; Smith, Sheryl T; Robic, Srebrenka; McCartha, Shannon R; Perry, Danielle R; Prescod, Lindsay M; Sheppard, Zenyth A; Saville, Ken J; McClish, Allison; Morlock, Emily A; Sochor, Victoria R; Stanton, Brittney; Veysey-White, Isaac C; Revie, Dennis; Jimenez, Luis A; Palomino, Jennifer J; Patao, Melissa D; Patao, Shane M; Himelblau, Edward T; Campbell, Jaclyn D; Hertz, Alexandra L; McEvilly, Maddison F; Wagner, Allison R; Youngblom, James; Bedi, Baljit; Bettincourt, Jeffery; Duso, Erin; Her, Maiye; Hilton, William; House, Samantha; Karimi, Masud; Kumimoto, Kevin; Lee, Rebekah; Lopez, Darryl; Odisho, George; Prasad, Ricky; Robbins, Holly Lyn; Sandhu, Tanveer; Selfridge, Tracy; Tsukashima, Kara; Yosif, Hani; Kokan, Nighat P; Britt, Latia; Zoellner, Alycia; Spana, Eric P; Chlebina, Ben T; Chong, Insun; Friedman, Harrison; Mammo, Danny A; Ng, Chun L; Nikam, Vinayak S; Schwartz, Nicholas U; Xu, Thomas Q; Burg, Martin G; Batten, Spencer M; Corbeill, Lindsay M; Enoch, Erica; Ensign, Jesse J; Franks, Mary E; Haiker, Breanna; Ingles, Judith A; Kirkland, Lyndsay D; Lorenz-Guertin, Joshua M; Matthews, Jordan; Mittig, Cody M; Monsma, Nicholaus; Olson, Katherine J; Perez-Aragon, Guillermo; Ramic, Alen; Ramirez, Jordan R; Scheiber, Christopher; Schneider, Patrick A; Schultz, Devon E; Simon, Matthew; Spencer, Eric; Wernette, Adam C; Wykle, Maxine E; Zavala-Arellano, Elizabeth; McDonald, Mitchell J; Ostby, Kristine; Wendland, Peter; DiAngelo, Justin R; Ceasrine, Alexis M; Cox, Amanda H; Docherty, James E B; Gingras, Robert M; Grieb, Stephanie M; Pavia, Michael J; Personius, Casey L; Polak, Grzegorz L; Beach, Dale L; Cerritos, Heaven L; Horansky, Edward A; Sharif, Karim A; Moran, Ryan; Parrish, Susan; Bickford, Kirsten; Bland, Jennifer; Broussard, Juliana; Campbell, Kerry; Deibel, Katelynn E; Forka, Richard; Lemke, Monika C; Nelson, Marlee B; O'Keeffe, Catherine; Ramey, S Mariel; Schmidt, Luke; Villegas, Paola; Jones, Christopher J; Christ, Stephanie L; Mamari, Sami; Rinaldi, Adam S; Stity, Ghazal; Hark, Amy T; Scheuerman, Mark; Silver Key, S Catherine; McRae, Briana D; Haberman, Adam S; Asinof, Sam; Carrington, Harriette; Drumm, Kelly; Embry, Terrance; McGuire, Richard; Miller-Foreman, Drew; Rosen, Stella; Safa, Nadia; Schultz, Darrin; Segal, Matt; Shevin, Yakov; Svoronos, Petros; Vuong, Tam; Skuse, Gary; Paetkau, Don W; Bridgman, Rachael K; Brown, Charlotte M; Carroll, Alicia R; Gifford, Francesca M; Gillespie, Julie Beth; Herman, Susan E; Holtcamp, Krystal L; Host, Misha A; Hussey, Gabrielle; Kramer, Danielle M; Lawrence, Joan Q; Martin, Madeline M; Niemiec, Ellen N; O'Reilly, Ashleigh P; Pahl, Olivia A; Quintana, Guadalupe; Rettie, Elizabeth A S; Richardson, Torie L; Rodriguez, Arianne E; Rodriguez, Mona O; Schiraldi, Laura; Smith, Joanna J; Sugrue, Kelsey F; Suriano, Lindsey J; Takach, Kaitlyn E; Vasquez, Arielle M; Velez, Ximena; Villafuerte, Elizabeth J; Vives, Laura T; Zellmer, Victoria R; Hauke, Jeanette; Hauser, Charles R; Barker, Karolyn; Cannon, Laurie; Parsamian, Perouza; Parsons, Samantha; Wichman, Zachariah; Bazinet, Christopher W; Johnson, Diana E; Bangura, Abubakarr; Black, Jordan A; Chevee, Victoria; Einsteen, Sarah A; Hilton, Sarah K; Kollmer, Max; Nadendla, Rahul; Stamm, Joyce; Fafara-Thompson, Antoinette E; Gygi, Amber M; Ogawa, Emmy E; Van Camp, Matt; Kocsisova, Zuzana; Leatherman, Judith L; Modahl, Cassie M; Rubin, Michael R; Apiz-Saab, Susana S; Arias-Mejias, Suzette M; Carrion-Ortiz, Carlos F; Claudio-Vazquez, Patricia N; Espada-Green, Debbie M; Feliciano-Camacho, Marium; Gonzalez-Bonilla, Karina M; Taboas-Arroyo, Mariela; Vargas-Franco, Dorianmarie; Montañez-Gonzalez, Raquel; Perez-Otero, Joseph; Rivera-Burgos, Myrielis; Rivera-Rosario, Francisco J; Eisler, Heather L; Alexander, Jackie; Begley, Samatha K; Gabbard, Deana; Allen, Robert J; Aung, Wint Yan; Barshop, William D; Boozalis, Amanda; Chu, Vanessa P; Davis, Jeremy S; Duggal, Ryan N; Franklin, Robert; Gavinski, Katherine; Gebreyesus, Heran; Gong, Henry Z; Greenstein, Rachel A; Guo, Averill D; Hanson, Casey; Homa, Kaitlin E; Hsu, Simon C; Huang, Yi; Huo, Lucy; Jacobs, Sarah; Jia, Sasha; Jung, Kyle L; Wai-Chee Kong, Sarah; Kroll, Matthew R; Lee, Brandon M; Lee, Paul F; Levine, Kevin M; Li, Amy S; Liu, Chengyu; Liu, Max Mian; Lousararian, Adam P; Lowery, Peter B; Mallya, Allyson P; Marcus, Joseph E; Ng, Patrick C; Nguyen, Hien P; Patel, Ruchik; Precht, Hashini; Rastogi, Suchita; Sarezky, Jonathan M; Schefkind, Adam; Schultz, Michael B; Shen, Delia; Skorupa, Tara; Spies, Nicholas C; Stancu, Gabriel; Vivian Tsang, Hiu Man; Turski, Alice L; Venkat, Rohit; Waldman, Leah E; Wang, Kaidi; Wang, Tracy; Wei, Jeffrey W; Wu, Dennis Y; Xiong, David D; Yu, Jack; Zhou, Karen; McNeil, Gerard P; Fernandez, Robert W; Menzies, Patrick Gomez; Gu, Tingting; Buhler, Jeremy; Mardis, Elaine R; Elgin, Sarah C R

    2017-08-07

    The discordance between genome size and the complexity of eukaryotes can partly be attributed to differences in repeat density. The Muller F element (∼5.2 Mb) is the smallest chromosome in Drosophila melanogaster , but it is substantially larger (>18.7 Mb) in D. ananassae To identify the major contributors to the expansion of the F element and to assess their impact, we improved the genome sequence and annotated the genes in a 1.4-Mb region of the D. ananassae F element, and a 1.7-Mb region from the D element for comparison. We find that transposons (particularly LTR and LINE retrotransposons) are major contributors to this expansion (78.6%), while Wolbachia sequences integrated into the D. ananassae genome are minor contributors (0.02%). Both D. melanogaster and D. ananassae F-element genes exhibit distinct characteristics compared to D-element genes ( e.g. , larger coding spans, larger introns, more coding exons, and lower codon bias), but these differences are exaggerated in D. ananassae Compared to D. melanogaster , the codon bias observed in D. ananassae F-element genes can primarily be attributed to mutational biases instead of selection. The 5' ends of F-element genes in both species are enriched in dimethylation of lysine 4 on histone 3 (H3K4me2), while the coding spans are enriched in H3K9me2. Despite differences in repeat density and gene characteristics, D. ananassae F-element genes show a similar range of expression levels compared to genes in euchromatic domains. This study improves our understanding of how transposons can affect genome size and how genes can function within highly repetitive domains. Copyright © 2017 Leung et al.

  16. Outer membrane protein functions as integrator of protein import and DNA inheritance in mitochondria

    Science.gov (United States)

    Käser, Sandro; Oeljeklaus, Silke; Týč, Jiří; Vaughan, Sue; Warscheid, Bettina; Schneider, André

    2016-01-01

    Trypanosomatids are one of the earliest diverging eukaryotes that have fully functional mitochondria. pATOM36 is a trypanosomatid-specific essential mitochondrial outer membrane protein that has been implicated in protein import. Changes in the mitochondrial proteome induced by ablation of pATOM36 and in vitro assays show that pATOM36 is required for the assembly of the archaic translocase of the outer membrane (ATOM), the functional analog of the TOM complex in other organisms. Reciprocal pull-down experiments and immunofluorescence analyses demonstrate that a fraction of pATOM36 interacts and colocalizes with TAC65, a previously uncharacterized essential component of the tripartite attachment complex (TAC). The TAC links the single-unit mitochondrial genome to the basal body of the flagellum and mediates the segregation of the replicated mitochondrial genomes. RNAi experiments show that pATOM36, in line with its dual localization, is not only essential for ATOM complex assembly but also for segregation of the replicated mitochondrial genomes. However, the two functions are distinct, as a truncated version of pATOM36 lacking the 75 C-terminal amino acids can rescue kinetoplast DNA missegregation but not the lack of ATOM complex assembly. Thus, pATOM36 has a dual function and integrates mitochondrial protein import with mitochondrial DNA inheritance. PMID:27436903

  17. [Functional properties of mesquite bean protein (Prosopis juliflora)].

    Science.gov (United States)

    Holmquist-Donquis, I; Ruíz de Rey, G

    1997-12-01

    A protein concentrate was prepared from whole mesquite bean (Prosopis juliflora) to evaluate and characterize its functional properties; solubility index, effects of moist heat on its solubility, water sorption, fat absorption, foaming capability and foam stability, emulsifying capacity, viscosity and the effects of NaCl and temperature on some of these properties. These properties were evaluated by procedures used to determine its potential application as a food ingredient and its market potential as a new protein source. The protein isoelectric point ranged between pH 4.00-4.50. Maximum solubility was obtained at a pH 10.00 in a 0.75 M NaCl solution and under heat treatment at 112 degrees C for 5 min. Under the studied conditions the amount of water absorbed and the fat absorption capacity, strongly suggest the mesquite bean protein utilization in foods where both properties are important in order to enhances flavor retention and mouth-feel improvement. Although its foaming capability was larger than that of the egg albumin under similar pH conditions, the protein concentrate did not show a good stability, however, both properties could be improved. Emulsifying capacity as a pH function, showed a positive correlation (r = 0.8435 with a signification level of p = 0.004) with the solubility index but, decreased with NaCl even at low concentrations. For these reasons, the uses of mesquite bean protein for this property will be determined by the pH and ionic strength of the product to be processed.

  18. JNK Signaling: Regulation and Functions Based on Complex Protein-Protein Partnerships

    Science.gov (United States)

    Zeke, András; Misheva, Mariya

    2016-01-01

    SUMMARY The c-Jun N-terminal kinases (JNKs), as members of the mitogen-activated protein kinase (MAPK) family, mediate eukaryotic cell responses to a wide range of abiotic and biotic stress insults. JNKs also regulate important physiological processes, including neuronal functions, immunological actions, and embryonic development, via their impact on gene expression, cytoskeletal protein dynamics, and cell death/survival pathways. Although the JNK pathway has been under study for >20 years, its complexity is still perplexing, with multiple protein partners of JNKs underlying the diversity of actions. Here we review the current knowledge of JNK structure and isoforms as well as the partnerships of JNKs with a range of intracellular proteins. Many of these proteins are direct substrates of the JNKs. We analyzed almost 100 of these target proteins in detail within a framework of their classification based on their regulation by JNKs. Examples of these JNK substrates include a diverse assortment of nuclear transcription factors (Jun, ATF2, Myc, Elk1), cytoplasmic proteins involved in cytoskeleton regulation (DCX, Tau, WDR62) or vesicular transport (JIP1, JIP3), cell membrane receptors (BMPR2), and mitochondrial proteins (Mcl1, Bim). In addition, because upstream signaling components impact JNK activity, we critically assessed the involvement of signaling scaffolds and the roles of feedback mechanisms in the JNK pathway. Despite a clarification of many regulatory events in JNK-dependent signaling during the past decade, many other structural and mechanistic insights are just beginning to be revealed. These advances open new opportunities to understand the role of JNK signaling in diverse physiological and pathophysiological states. PMID:27466283

  19. PANTHER: A Library of Protein Families and Subfamilies Indexed by Function

    OpenAIRE

    Thomas, Paul D.; Campbell, Michael J.; Kejariwal, Anish; Mi, Huaiyu; Karlak, Brian; Daverman, Robin; Diemer, Karen; Muruganujan, Anushya; Narechania, Apurva

    2003-01-01

    In the genomic era, one of the fundamental goals is to characterize the function of proteins on a large scale. We describe a method, PANTHER, for relating protein sequence relationships to function relationships in a robust and accurate way. PANTHER is composed of two main components: the PANTHER library (PANTHER/LIB) and the PANTHER index (PANTHER/X). PANTHER/LIB is a collection of “books,” each representing a protein family as a multiple sequence alignment, a Hidden Markov Model (HMM)...

  20. The construction of an amino acid network for understanding protein structure and function.

    Science.gov (United States)

    Yan, Wenying; Zhou, Jianhong; Sun, Maomin; Chen, Jiajia; Hu, Guang; Shen, Bairong

    2014-06-01

    Amino acid networks (AANs) are undirected networks consisting of amino acid residues and their interactions in three-dimensional protein structures. The analysis of AANs provides novel insight into protein science, and several common amino acid network properties have revealed diverse classes of proteins. In this review, we first summarize methods for the construction and characterization of AANs. We then compare software tools for the construction and analysis of AANs. Finally, we review the application of AANs for understanding protein structure and function, including the identification of functional residues, the prediction of protein folding, analyzing protein stability and protein-protein interactions, and for understanding communication within and between proteins.

  1. Cysteine regulation of protein function--as exemplified by NMDA-receptor modulation.

    Science.gov (United States)

    Lipton, Stuart A; Choi, Yun-Beom; Takahashi, Hiroto; Zhang, Dongxian; Li, Weizhong; Godzik, Adam; Bankston, Laurie A

    2002-09-01

    Until recently cysteine residues, especially those located extracellularly, were thought to be important for metal coordination, catalysis and protein structure by forming disulfide bonds - but they were not thought to regulate protein function. However, this is not the case. Crucial cysteine residues can be involved in modulation of protein activity and signaling events via other reactions of their thiol (sulfhydryl; -SH) groups. These reactions can take several forms, such as redox events (chemical reduction or oxidation), chelation of transition metals (chiefly Zn(2+), Mn(2+) and Cu(2+)) or S-nitrosylation [the catalyzed transfer of a nitric oxide (NO) group to a thiol group]. In several cases, these disparate reactions can compete with one another for the same thiol group on a single cysteine residue, forming a molecular switch composed of a latticework of possible redox, NO or Zn(2+) modifications to control protein function. Thiol-mediated regulation of protein function can also involve reactions of cysteine residues that affect ligand binding allosterically. This article reviews the basis for these molecular cysteine switches, drawing on the NMDA receptor as an exemplary protein, and proposes a molecular model for the action of S-nitrosylation based on recently derived crystal structures.

  2. Annotating Protein Functional Residues by Coupling High-Throughput Fitness Profile and Homologous-Structure Analysis

    Directory of Open Access Journals (Sweden)

    Yushen Du

    2016-11-01

    Full Text Available Identification and annotation of functional residues are fundamental questions in protein sequence analysis. Sequence and structure conservation provides valuable information to tackle these questions. It is, however, limited by the incomplete sampling of sequence space in natural evolution. Moreover, proteins often have multiple functions, with overlapping sequences that present challenges to accurate annotation of the exact functions of individual residues by conservation-based methods. Using the influenza A virus PB1 protein as an example, we developed a method to systematically identify and annotate functional residues. We used saturation mutagenesis and high-throughput sequencing to measure the replication capacity of single nucleotide mutations across the entire PB1 protein. After predicting protein stability upon mutations, we identified functional PB1 residues that are essential for viral replication. To further annotate the functional residues important to the canonical or noncanonical functions of viral RNA-dependent RNA polymerase (vRdRp, we performed a homologous-structure analysis with 16 different vRdRp structures. We achieved high sensitivity in annotating the known canonical polymerase functional residues. Moreover, we identified a cluster of noncanonical functional residues located in the loop region of the PB1 β-ribbon. We further demonstrated that these residues were important for PB1 protein nuclear import through the interaction with Ran-binding protein 5. In summary, we developed a systematic and sensitive method to identify and annotate functional residues that are not restrained by sequence conservation. Importantly, this method is generally applicable to other proteins about which homologous-structure information is available.

  3. Ion Binding Energies Determining Functional Transport of ClC Proteins

    Science.gov (United States)

    Yu, Tao; Guo, Xu; Zou, Xian-Wu; Sang, Jian-Ping

    2014-06-01

    The ClC-type proteins, a large family of chloride transport proteins ubiquitously expressed in biological organisms, have been extensively studied for decades. Biological function of ClC proteins can be reflected by analyzing the binding situation of Cl- ions. We investigate ion binding properties of ClC-ec1 protein with the atomic molecular dynamics simulation approach. The calculated electrostatic binding energy results indicate that Cl- at the central binding site Scen has more binding stability than the internal binding site Sint. Quantitative comparison between the latest experimental heat release data isothermal titration calorimetry (ITC) and our calculated results demonstrates that chloride ions prefer to bind at Scen than Sint in the wild-type ClC-ec1 structure and prefer to bind at Sext and Scen than Sint in mutant E148A/E148Q structures. Even though the chloride ions make less contribution to heat release when binding to Sint and are relatively unstable in the Cl- pathway, they are still part contributors for the Cl- functional transport. This work provides a guide rule to estimate the importance of Cl- at the binding sites and how chloride ions have influences on the function of ClC proteins.

  4. Multivesicular Bodies in Neurons: Distribution, Protein Content, and Trafficking Functions

    Science.gov (United States)

    VON BARTHELD, CHRISTOPHER S.; ALTICK, AMY L.

    2011-01-01

    Summary Multivesicular bodies (MVBs) are intracellular endosomal organelles characterized by multiple internal vesicles that are enclosed within a single outer membrane. MVBs were initially regarded as purely prelysosomal structures along the degradative endosomal pathway of internalized proteins. MVBs are now known to be involved in numerous endocytic and trafficking functions, including protein sorting, recycling, transport, storage, and release. This review of neuronal MVBs summarizes their research history, morphology, distribution, accumulation of cargo and constitutive proteins, transport, and theories of functions of MVBs in neurons and glia. Due to their complex morphologies, neurons have expanded trafficking and signaling needs, beyond those of “geometrically simpler” cells, but it is not known whether neuronal MVBs perform additional transport and signaling functions. This review examines the concept of compartment-specific MVB functions in endosomal protein trafficking and signaling within synapses, axons, dendrites and cell bodies. We critically evaluate reports of the accumulation of neuronal MVBs based on evidence of stress-induced MVB formation. Furthermore, we discuss potential functions of neuronal and glial MVBs in development, in dystrophic neuritic syndromes, injury, disease, and aging. MVBs may play a role in Alzheimer’s, Huntington’s, and Niemann-Pick diseases, some types of frontotemporal dementia, prion and virus trafficking, as well as in adaptive responses of neurons to trauma and toxin or drug exposure. Functions of MVBs in neurons have been much neglected, and major gaps in knowledge currently exist. Developing truly MVB-specific markers would help to elucidate the roles of neuronal MVBs in intra- and intercellular signaling of normal and diseased neurons. PMID:21216273

  5. In silico functional elucidation of uncharacterized proteins of Chlamydia abortus strain LLG.

    Science.gov (United States)

    Singh, Gagandeep; Sharma, Dixit; Singh, Vikram; Rani, Jyoti; Marotta, Francessco; Kumar, Manoj; Mal, Gorakh; Singh, Birbal

    2017-03-01

    This study reports structural modeling, molecular dynamics profiling of hypothetical proteins in Chlamydia abortus genome database. The hypothetical protein sequences were extracted from C. abortus LLG Genome Database for functional elucidation using in silico methods. Fifty-one proteins with their roles in defense, binding and transporting other biomolecules were unraveled. Forty-five proteins were found to be nonhomologous to proteins present in hosts infected by C. abortus . Of these, 31 proteins were related to virulence. The structural modeling of two proteins, first, WP_006344020.1 (phosphorylase) and second, WP_006344325.1 (chlamydial protease/proteasome-like activity factor) were accomplished. The conserved active sites necessary for the catalytic function were analyzed. The finally concluded proteins are envisioned as possible targets for developing drugs to curtail chlamydial infections, however, and should be validated by molecular biological methods.

  6. Functionalized linear poly(amidoamine)s are efficient vectors for intracellular protein delivery

    NARCIS (Netherlands)

    Coué, G.M.J.P.C.; Engbersen, Johannes F.J.

    2011-01-01

    An effective intracellular protein delivery system was developed based on functionalized linear poly(amidoamine)s (PAAs) that form self-assembled cationic nanocomplexes with oppositely charged proteins. Three differently functionalized PAAs were synthesized, two of these having repetitive disulfide

  7. Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

    Science.gov (United States)

    Kinjo, Akira R.; Nakamura, Haruki

    2012-01-01

    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478

  8. Structure modification and functionality of whey proteins: quantitative structure-activity relationship approach.

    Science.gov (United States)

    Nakai, S; Li-Chan, E

    1985-10-01

    According to the original idea of quantitative structure-activity relationship, electric, hydrophobic, and structural parameters should be taken into consideration for elucidating functionality. Changes in these parameters are reflected in the property of protein solubility upon modification of whey proteins by heating. Although solubility is itself a functional property, it has been utilized to explain other functionalities of proteins. However, better correlations were obtained when hydrophobic parameters of the proteins were used in conjunction with solubility. Various treatments reported in the literature were applied to whey protein concentrate in an attempt to obtain whipping and gelling properties similar to those of egg white. Mapping simplex optimization was used to search for the best results. Improvement in whipping properties by pepsin hydrolysis may have been due to higher protein solubility, and good gelling properties resulting from polyphosphate treatment may have been due to an increase in exposable hydrophobicity. However, the results of angel food cake making were still unsatisfactory.

  9. Physicochemical and functional properties of protein isolate obtained from cottonseed meal.

    Science.gov (United States)

    Ma, Mengting; Ren, Yanjing; Xie, Wei; Zhou, Dayun; Tang, Shurong; Kuang, Meng; Wang, Yanqin; Du, Shuang-Kui

    2018-02-01

    To investigate the effect of preparation methods of cottonseed meals on protein properties, the physicochemical and functional properties of proteins isolated from hot-pressed solvent extraction cottonseed meal (HCM), cold-pressed solvent extraction cottonseed meal (CCM) and subcritical fluid extraction cottonseed meal (SCM) were investigated. Cottonseed proteins had two major bands (at about 45 and 50kD), two X-ray diffraction peaks (8.5° and 19.5°) and one endothermic peak (94.31°C-97.72°C). Proteins of HCM showed relatively more β-sheet (38.3%-40.5%), and less β-turn (22.2%-25.8%) and α-helix (15.8%-19.5%), indicating the presence of highly denatured protein molecules. Proteins of CCM and SCM exhibited high water/oil absorption capacity, emulsifying abilities, surface hydrophobicity and fluorescence intensity, suggesting that the proteins have potential as functional ingredients in the food industry. Copyright © 2017 Elsevier Ltd. All rights reserved.

  10. Integrative Identification of Arabidopsis Mitochondrial Proteome and Its Function Exploitation through Protein Interaction Network

    Science.gov (United States)

    Cui, Jian; Liu, Jinghua; Li, Yuhua; Shi, Tieliu

    2011-01-01

    Mitochondria are major players on the production of energy, and host several key reactions involved in basic metabolism and biosynthesis of essential molecules. Currently, the majority of nucleus-encoded mitochondrial proteins are unknown even for model plant Arabidopsis. We reported a computational framework for predicting Arabidopsis mitochondrial proteins based on a probabilistic model, called Naive Bayesian Network, which integrates disparate genomic data generated from eight bioinformatics tools, multiple orthologous mappings, protein domain properties and co-expression patterns using 1,027 microarray profiles. Through this approach, we predicted 2,311 candidate mitochondrial proteins with 84.67% accuracy and 2.53% FPR performances. Together with those experimental confirmed proteins, 2,585 mitochondria proteins (named CoreMitoP) were identified, we explored those proteins with unknown functions based on protein-protein interaction network (PIN) and annotated novel functions for 26.65% CoreMitoP proteins. Moreover, we found newly predicted mitochondrial proteins embedded in particular subnetworks of the PIN, mainly functioning in response to diverse environmental stresses, like salt, draught, cold, and wound etc. Candidate mitochondrial proteins involved in those physiological acitivites provide useful targets for further investigation. Assigned functions also provide comprehensive information for Arabidopsis mitochondrial proteome. PMID:21297957

  11. Evolved Escherichia coli Strains for Amplified, Functional Expression of Membrane Proteins

    NARCIS (Netherlands)

    Gul, Nadia; Linares, Daniel M.; Ho, Franz Y.; Poolman, Bert

    2014-01-01

    The major barrier to the physical characterization and structure determination of membrane proteins is low protein yield and/or low functionality in recombinant expression. The enteric bacterium Escherichia coli is the most widely employed organism for producing recombinant proteins. Beside several

  12. Watching proteins function with picosecond X-ray crystallography and molecular dynamics simulations.

    Science.gov (United States)

    Anfinrud, Philip

    2006-03-01

    Time-resolved electron density maps of myoglobin, a ligand-binding heme protein, have been stitched together into movies that unveil with molecular dynamics (MD) calculations and picosecond time-resolved X-ray structures provides single-molecule insights into mechanisms of protein function. Ensemble-averaged MD simulations of the L29F mutant of myoglobin following ligand dissociation reproduce the direction, amplitude, and timescales of crystallographically-determined structural changes. This close agreement with experiments at comparable resolution in space and time validates the individual MD trajectories, which identify and structurally characterize a conformational switch that directs dissociated ligands to one of two nearby protein cavities. This unique combination of simulation and experiment unveils functional protein motions and illustrates at an atomic level relationships among protein structure, dynamics, and function. In collaboration with Friedrich Schotte and Gerhard Hummer, NIH.

  13. The comprehensive native interactome of a fully functional tagged prion protein.

    Directory of Open Access Journals (Sweden)

    Dorothea Rutishauser

    Full Text Available The enumeration of the interaction partners of the cellular prion protein, PrP(C, may help clarifying its elusive molecular function. Here we added a carboxy proximal myc epitope tag to PrP(C. When expressed in transgenic mice, PrP(myc carried a GPI anchor, was targeted to lipid rafts, and was glycosylated similarly to PrP(C. PrP(myc antagonized the toxicity of truncated PrP, restored prion infectibility of PrP(C-deficient mice, and was physically incorporated into PrP(Sc aggregates, indicating that it possessed all functional characteristics of genuine PrP(C. We then immunopurified myc epitope-containing protein complexes from PrP(myc transgenic mouse brains. Gentle differential elution with epitope-mimetic decapeptides, or a scrambled version thereof, yielded 96 specifically released proteins. Quantitative mass spectrometry with isotope-coded tags identified seven proteins which co-eluted equimolarly with PrP(C and may represent component of a multiprotein complex. Selected PrP(C interactors were validated using independent methods. Several of these proteins appear to exert functions in axomyelinic maintenance.

  14. ProLanGO: Protein Function Prediction Using Neural Machine Translation Based on a Recurrent Neural Network.

    Science.gov (United States)

    Cao, Renzhi; Freitas, Colton; Chan, Leong; Sun, Miao; Jiang, Haiqing; Chen, Zhangxin

    2017-10-17

    With the development of next generation sequencing techniques, it is fast and cheap to determine protein sequences but relatively slow and expensive to extract useful information from protein sequences because of limitations of traditional biological experimental techniques. Protein function prediction has been a long standing challenge to fill the gap between the huge amount of protein sequences and the known function. In this paper, we propose a novel method to convert the protein function problem into a language translation problem by the new proposed protein sequence language "ProLan" to the protein function language "GOLan", and build a neural machine translation model based on recurrent neural networks to translate "ProLan" language to "GOLan" language. We blindly tested our method by attending the latest third Critical Assessment of Function Annotation (CAFA 3) in 2016, and also evaluate the performance of our methods on selected proteins whose function was released after CAFA competition. The good performance on the training and testing datasets demonstrates that our new proposed method is a promising direction for protein function prediction. In summary, we first time propose a method which converts the protein function prediction problem to a language translation problem and applies a neural machine translation model for protein function prediction.

  15. Nanobody Technology: A Versatile Toolkit for Microscopic Imaging, Protein-Protein Interaction Analysis, and Protein Function Exploration.

    Science.gov (United States)

    Beghein, Els; Gettemans, Jan

    2017-01-01

    Over the last two decades, nanobodies or single-domain antibodies have found their way in research, diagnostics, and therapy. These antigen-binding fragments, derived from Camelid heavy chain only antibodies, possess remarkable characteristics that favor their use over conventional antibodies or fragments thereof, in selected areas of research. In this review, we assess the current status of nanobodies as research tools in diverse aspects of fundamental research. We discuss the use of nanobodies as detection reagents in fluorescence microscopy and focus on recent advances in super-resolution microscopy. Second, application of nanobody technology in investigating protein-protein interactions is reviewed, with emphasis on possible uses in mass spectrometry. Finally, we discuss the potential value of nanobodies in studying protein function, and we focus on their recently reported application in targeted protein degradation. Throughout the review, we highlight state-of-the-art engineering strategies that could expand nanobody versatility and we suggest future applications of the technology in the selected areas of fundamental research.

  16. The Chern-Simons current in time series of knots and links in proteins

    Science.gov (United States)

    Capozziello, Salvatore; Pincak, Richard

    2018-06-01

    A superspace model of knots and links for DNA time series data is proposed to take into account the feedback loop from docking to undocking state of protein-protein interactions. In particular, the direction of interactions between the 8 hidden states of DNA is considered. It is a E8 ×E8 unified spin model where the genotype, from active and inactive side of DNA time data series, can be considered for any living organism. The mathematical model is borrowed from loop-quantum gravity and adapted to biology. It is used to derive equations for gene expression describing transitions from ground to excited states, and for the 8 coupling states between geneon and anti-geneon transposon and retrotransposon in trash DNA. Specifically, we adopt a modified Grothendieck cohomology and a modified Khovanov cohomology for biology. The result is a Chern-Simons current in (8 + 3) extradimensions of a given unoriented supermanifold with ghost fields of protein structures. The 8 dimensions come from the 8 hidden states of spinor field of genetic code. The extradimensions come from the 3 types of principle fiber bundle in the secondary protein.

  17. Mechanisms of EHD/RME-1 Protein Function in Endocytic Transport

    Science.gov (United States)

    Grant, Barth D.; Caplan, Steve

    2009-01-01

    The evolutionarily conserved Eps15 homology domain (EHD)/receptor-mediated endocytosis (RME)-1 family of C-terminal EH domain proteins has recently come under intense scrutiny because of its importance in intracellular membrane transport, especially with regard to the recycling of receptors from endosomes to the plasma membrane. Recent studies have shed new light on the mode by which these adenosine triphosphatases function on endosomal membranes in mammals and Caenorhabditis elegans. This review highlights our current understanding of the physiological roles of these proteins in vivo, discussing conserved features as well as emerging functional differences between individual mammalian paralogs. In addition, these findings are discussed in light of the identification of novel EHD/RME-1 protein and lipid interactions and new structural data for proteins in this family, indicating intriguing similarities to the Dynamin superfamily of large guanosine triphosphatases. PMID:18801062

  18. Functional and technological properties of camel milk proteins: a review

    DEFF Research Database (Denmark)

    Hailu, Yonas; Hansen, Egon Bech; Seifu, Eyassu

    2016-01-01

    This review summarises current knowledge on camel milk proteins, with focus on significant peculiarities in protein composition and molecular properties. Camel milk is traditionally consumed as a fresh or naturally fermented product. Within the last couple of years, an increasing quantity is being...... processed in dairy plants, and a number of consumer products have been marketed. A better understanding of the technological and functional properties, as required for product improvement, has been gained in the past years. Absence of the whey protein β-LG and a low proportion of к-casein cause differences...... in relation to dairy processing. In addition to the technological properties, there are also implications for human nutrition and camel milk proteins are of interest for applications in infant foods, for food preservation and in functional foods. Proposed health benefits include inhibition of the angiotensin...

  19. The functional range of heat shock proteins to combat environmental toxicity

    International Nuclear Information System (INIS)

    Mahmood, K.; Mahmood, Q.; Pervez, A.; Nasreen, S.

    2012-01-01

    Almost all the organisms possess a system to cope with the harsh physiochemical factors of environment. Such a system is based on a group of stress genes, which show rapid responses in form of stress proteins, especially heat shock proteins, when cells are confronted with insult. Heat shock proteins are now known to express in response to variety of toxic and stress conditions including diseases. As a molecular chaperone, against cytotoxicity, these ensure the functional ability of cells by repairing the denatured proteins, cellular structures like cytoskeleton and centrosomes and processes dealing with protein synthesis are stabilized or repaired during a second stress in stress tolerant cells and organisms. In unstressed cells these play an imperative role in the synthesis and transport of normal proteins. Their role in certain diseases reveals their potential application in medical field. Certain Hsp are helpful in coping carcinogenicity caused environmental pollutants and have been suggested to have anti-apoptotic, anti stress and anti-allergic function. Their expression is tissue and species specific with respect to type, intensity and duration of a toxicant. These are developmentally regulated and help in process of differentiation and thus their abnormal regulation impairs the normal development. However, their role as bio marker in risk assessment of environmental pollution warrants further research. Due to broad functional range, therefore, present review is embracing the functional aspects of smaller and Hsp 70 families expressing in animals under toxic conditions. (author)

  20. Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins.

    Science.gov (United States)

    Hu, Pingzhao; Janga, Sarath Chandra; Babu, Mohan; Díaz-Mejía, J Javier; Butland, Gareth; Yang, Wenhong; Pogoutse, Oxana; Guo, Xinghua; Phanse, Sadhna; Wong, Peter; Chandran, Shamanta; Christopoulos, Constantine; Nazarians-Armavil, Anaies; Nasseri, Negin Karimi; Musso, Gabriel; Ali, Mehrab; Nazemof, Nazila; Eroukova, Veronika; Golshani, Ashkan; Paccanaro, Alberto; Greenblatt, Jack F; Moreno-Hagelsieb, Gabriel; Emili, Andrew

    2009-04-28

    One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans). Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.

  1. Protein functional links in Trypanosoma brucei, identified by gene fusion analysis

    Directory of Open Access Journals (Sweden)

    Trimpalis Philip

    2011-07-01

    Full Text Available Abstract Background Domain or gene fusion analysis is a bioinformatics method for detecting gene fusions in one organism by comparing its genome to that of other organisms. The occurrence of gene fusions suggests that the two original genes that participated in the fusion are functionally linked, i.e. their gene products interact either as part of a multi-subunit protein complex, or in a metabolic pathway. Gene fusion analysis has been used to identify protein functional links in prokaryotes as well as in eukaryotic model organisms, such as yeast and Drosophila. Results In this study we have extended this approach to include a number of recently sequenced protists, four of which are pathogenic, to identify fusion linked proteins in Trypanosoma brucei, the causative agent of African sleeping sickness. We have also examined the evolution of the gene fusion events identified, to determine whether they can be attributed to fusion or fission, by looking at the conservation of the fused genes and of the individual component genes across the major eukaryotic and prokaryotic lineages. We find relatively limited occurrence of gene fusions/fissions within the protist lineages examined. Our results point to two trypanosome-specific gene fissions, which have recently been experimentally confirmed, one fusion involving proteins involved in the same metabolic pathway, as well as two novel putative functional links between fusion-linked protein pairs. Conclusions This is the first study of protein functional links in T. brucei identified by gene fusion analysis. We have used strict thresholds and only discuss results which are highly likely to be genuine and which either have already been or can be experimentally verified. We discuss the possible impact of the identification of these novel putative protein-protein interactions, to the development of new trypanosome therapeutic drugs.

  2. Protein kinase inhibitor peptide (PKI): a family of endogenous neuropeptides that modulate neuronal cAMP-dependent protein kinase function.

    Science.gov (United States)

    Dalton, George D; Dewey, William L

    2006-02-01

    Signal transduction cascades involving cAMP-dependent protein kinase are highly conserved among a wide variety of organisms. Given the universal nature of this enzyme it is not surprising that cAMP-dependent protein kinase plays a critical role in numerous cellular processes. This is particularly evident in the nervous system where cAMP-dependent protein kinase is involved in neurotransmitter release, gene transcription, and synaptic plasticity. Protein kinase inhibitor peptide (PKI) is an endogenous thermostable peptide that modulates cAMP-dependent protein kinase function. PKI contains two distinct functional domains within its amino acid sequence that allow it to: (1) potently and specifically inhibit the activity of the free catalytic subunit of cAMP-dependent protein kinase and (2) export the free catalytic subunit of cAMP-dependent protein kinase from the nucleus. Three distinct PKI isoforms (PKIalpha, PKIbeta, PKIgamma) have been identified and each isoform is expressed in the brain. PKI modulates neuronal synaptic activity, while PKI also is involved in morphogenesis and symmetrical left-right axis formation. In addition, PKI also plays a role in regulating gene expression induced by cAMP-dependent protein kinase. Future studies should identify novel physiological functions for endogenous PKI both in the nervous system and throughout the body. Most interesting will be the determination whether functional differences exist between individual PKI isoforms which is an intriguing possibility since these isoforms exhibit: (1) cell-type specific tissue expression patterns, (2) different potencies for the inhibition of cAMP-dependent protein kinase activity, and (3) expression patterns that are hormonally, developmentally and cell-cycle regulated. Finally, synthetic peptide analogs of endogenous PKI will continue to be invaluable tools that are used to elucidate the role of cAMP-dependent protein kinase in a variety of cellular processes throughout the nervous

  3. Prediction of human protein function according to Gene Ontology categories

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Gupta, Ramneek; Stærfeldt, Hans Henrik

    2003-01-01

    developed a method for prediction of protein function for a subset of classes from the Gene Ontology classification scheme. This subset includes several pharmaceutically interesting categories-transcription factors, receptors, ion channels, stress and immune response proteins, hormones and growth factors...

  4. Multiple structure-intrinsic disorder interactions regulate and coordinate Hox protein function

    Science.gov (United States)

    Bondos, Sarah

    During animal development, Hox transcription factors determine fate of developing tissues to generate diverse organs and appendages. Hox proteins are famous for their bizarre mutant phenotypes, such as replacing antennae with legs. Clearly, the functions of individual Hox proteins must be distinct and reliable in vivo, or the organism risks malformation or death. However, within the Hox protein family, the DNA-binding homeodomains are highly conserved and the amino acids that contact DNA are nearly invariant. These observations raise the question: How do different Hox proteins correctly identify their distinct target genes using a common DNA binding domain? One possible means to modulate DNA binding is through the influence of the non-homeodomain protein regions, which differ significantly among Hox proteins. However genetic approaches never detected intra-protein interactions, and early biochemical attempts were hindered because the special features of ``intrinsically disordered'' sequences were not appreciated. We propose the first-ever structural model of a Hox protein to explain how specific contacts between distant, intrinsically disordered regions of the protein and the homeodomain regulate DNA binding and coordinate this activity with other Hox molecular functions.

  5. Feature Selection and the Class Imbalance Problem in Predicting Protein Function from Sequence

    NARCIS (Netherlands)

    Al-Shahib, A.; Breitling, R.; Gilbert, D.

    2005-01-01

    Abstract: When the standard approach to predict protein function by sequence homology fails, other alternative methods can be used that require only the amino acid sequence for predicting function. One such approach uses machine learning to predict protein function directly from amino acid sequence

  6. Simplified Method for Predicting a Functional Class of Proteins in Transcription Factor Complexes

    KAUST Repository

    Piatek, Marek J.

    2013-07-12

    Background:Initiation of transcription is essential for most of the cellular responses to environmental conditions and for cell and tissue specificity. This process is regulated through numerous proteins, their ligands and mutual interactions, as well as interactions with DNA. The key such regulatory proteins are transcription factors (TFs) and transcription co-factors (TcoFs). TcoFs are important since they modulate the transcription initiation process through interaction with TFs. In eukaryotes, transcription requires that TFs form different protein complexes with various nuclear proteins. To better understand transcription regulation, it is important to know the functional class of proteins interacting with TFs during transcription initiation. Such information is not fully available, since not all proteins that act as TFs or TcoFs are yet annotated as such, due to generally partial functional annotation of proteins. In this study we have developed a method to predict, using only sequence composition of the interacting proteins, the functional class of human TF binding partners to be (i) TF, (ii) TcoF, or (iii) other nuclear protein. This allows for complementing the annotation of the currently known pool of nuclear proteins. Since only the knowledge of protein sequences is required in addition to protein interaction, the method should be easily applicable to many species.Results:Based on experimentally validated interactions between human TFs with different TFs, TcoFs and other nuclear proteins, our two classification systems (implemented as a web-based application) achieve high accuracies in distinguishing TFs and TcoFs from other nuclear proteins, and TFs from TcoFs respectively.Conclusion:As demonstrated, given the fact that two proteins are capable of forming direct physical interactions and using only information about their sequence composition, we have developed a completely new method for predicting a functional class of TF interacting protein partners

  7. Improved Functional Characteristics of Whey Protein Hydrolysates in Food Industry

    Science.gov (United States)

    Jeewanthi, Renda Kankanamge Chaturika; Lee, Na-Kyoung; Paik, Hyun-Dong

    2015-01-01

    This review focuses on the enhanced functional characteristics of enzymatic hydrolysates of whey proteins (WPHs) in food applications compared to intact whey proteins (WPs). WPs are applied in foods as whey protein concentrates (WPCs), whey protein isolates (WPIs), and WPHs. WPs are byproducts of cheese production, used in a wide range of food applications due to their nutritional validity, functional activities, and cost effectiveness. Enzymatic hydrolysis yields improved functional and nutritional benefits in contrast to heat denaturation or native applications. WPHs improve solubility over a wide range of pH, create viscosity through water binding, and promote cohesion, adhesion, and elasticity. WPHs form stronger but more flexible edible films than WPC or WPI. WPHs enhance emulsification, bind fat, and facilitate whipping, compared to intact WPs. Extensive hydrolyzed WPHs with proper heat applications are the best emulsifiers and addition of polysaccharides improves the emulsification ability of WPHs. Also, WPHs improve the sensorial properties like color, flavor, and texture but impart a bitter taste in case where extensive hydrolysis (degree of hydrolysis greater than 8%). It is important to consider the type of enzyme, hydrolysis conditions, and WPHs production method based on the nature of food application. PMID:26761849

  8. Functional properties of whey protein and its application in nanocomposite materials and functional foods

    Science.gov (United States)

    Walsh, Helen

    Whey is a byproduct of cheese making; whey proteins are globular proteins which can be modified and polymerized to add functional benefits, these benefits can be both nutritional and structural in foods. Modified proteins can be used in non-foods, being of particular interest in polymer films and coatings. Food packaging materials, including plastics, can linings, interior coatings of paper containers, and beverage cap sealing materials, are generally made of synthetic petroleum based compounds. These synthetic materials may pose a potential human health risk due to presence of certain chemicals such as Bisphenol A (BPA). They also add to environmental pollution, being difficult to degrade. Protein-based materials do not have the same issues as synthetics and so can be used as alternatives in many packaging types. As proteins are generally hydrophilic they must be modified structurally and their performance enhanced by the addition of waterproofing agents. Polymerization of whey proteins results in a network, adding both strength and flexibility. The most interesting of the food-safe waterproofing agents are the (large aspect ratio) nanoclays. Nanoclays are relatively inexpensive, widely available and have low environmental impact. The clay surface can be modified to make it organophilic and so compatible with organic polymers. The objective of this study is the use of polymerized whey protein (PWP), with reinforcing nanoclays, to produce flexible surface coatings which limit the transfer of contents while maintaining food safety. Four smectite and kaolin type clays, one treated and three natural were assessed for strengthening qualities and the potential waterproofing and plasticizing benefits of other additives were also analyzed. The nutritional benefits of whey proteins can also be used to enhance the protein content of various foodstuffs. Drinkable yogurt is a popular beverage in the US and other countries and is considered a functional food, especially when

  9. Experimental-confirmation and functional-annotation of predicted proteins in the chicken genome

    Directory of Open Access Journals (Sweden)

    McCarthy Fiona M

    2007-11-01

    Full Text Available Abstract Background The chicken genome was sequenced because of its phylogenetic position as a non-mammalian vertebrate, its use as a biomedical model especially to study embryology and development, its role as a source of human disease organisms and its importance as the major source of animal derived food protein. However, genomic sequence data is, in itself, of limited value; generally it is not equivalent to understanding biological function. The benefit of having a genome sequence is that it provides a basis for functional genomics. However, the sequence data currently available is poorly structurally and functionally annotated and many genes do not have standard nomenclature assigned. Results We analysed eight chicken tissues and improved the chicken genome structural annotation by providing experimental support for the in vivo expression of 7,809 computationally predicted proteins, including 30 chicken proteins that were only electronically predicted or hypothetical translations in human. To improve functional annotation (based on Gene Ontology, we mapped these identified proteins to their human and mouse orthologs and used this orthology to transfer Gene Ontology (GO functional annotations to the chicken proteins. The 8,213 orthology-based GO annotations that we produced represent an 8% increase in currently available chicken GO annotations. Orthologous chicken products were also assigned standardized nomenclature based on current chicken nomenclature guidelines. Conclusion We demonstrate the utility of high-throughput expression proteomics for rapid experimental structural annotation of a newly sequenced eukaryote genome. These experimentally-supported predicted proteins were further annotated by assigning the proteins with standardized nomenclature and functional annotation. This method is widely applicable to a diverse range of species. Moreover, information from one genome can be used to improve the annotation of other genomes and

  10. DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier

    KAUST Repository

    Kulmanov, Maxat

    2017-09-27

    Motivation A large number of protein sequences are becoming available through the application of novel high-throughput sequencing technologies. Experimental functional characterization of these proteins is time-consuming and expensive, and is often only done rigorously for few selected model organisms. Computational function prediction approaches have been suggested to fill this gap. The functions of proteins are classified using the Gene Ontology (GO), which contains over 40 000 classes. Additionally, proteins have multiple functions, making function prediction a large-scale, multi-class, multi-label problem. Results We have developed a novel method to predict protein function from sequence. We use deep learning to learn features from protein sequences as well as a cross-species protein–protein interaction network. Our approach specifically outputs information in the structure of the GO and utilizes the dependencies between GO classes as background information to construct a deep learning model. We evaluate our method using the standards established by the Computational Assessment of Function Annotation (CAFA) and demonstrate a significant improvement over baseline methods such as BLAST, in particular for predicting cellular locations.

  11. Genomic and phylogenetic evidence of VIPER retrotransposon domestication in trypanosomatids

    Directory of Open Access Journals (Sweden)

    Adriana Ludwig

    Full Text Available Transposable elements are important residents of eukaryotic genomes and eventually the host can domesticate them to serve cellular functions. We reported here a possible domestication event of the vestigial interposed retroelement (VIPER in trypanosomatids. We found a large gene in a syntenic location in Leishmania braziliensis, L. panamensis, Leptomanas pyrrhocoris, and Crithidia fasciculata whose products share similarity in the C-terminal portion with the third protein of VIPER. No remnants of other VIPER regions surrounding the gene sequence were found. We hypothesise that the domestication event occurred more than 50 mya and the conservation of this gene suggests it might perform some function in the host species.

  12. Dietary protein effects on irradiated rat kidney function

    International Nuclear Information System (INIS)

    Mahler, P.A.; Yatuin, M.B.

    1984-01-01

    The authors have previously reported that unilaterally nephrectomized, kidney irradiated young male S-D rats have an increased median survival when placed on a low (4%) protein diet, as compared to a normal (20%) or high (50%) protein diet (200, 103, and 59 days respectively for 14 Gy irradiation). They have expanded these studies to examine the effects of irradiation and dietary protein levels on kidney function, by examining the parameters of blood urea nitrogen, serum creatinine, urine urea nitrogen, urine creatinine, urine osmolarity, urine volume, and water consumption. Irradiated 20% protein diet animals show an increase in water consumption and urine production and also a decrease in urine osmolarity, urine urea concentration and urine creatinine concentration. These changes all support the hypothesis the kidney irradiated rats fed a normal protein diet have a reduced capability to concentrate urine compared to nonirradiated control rats. Evaluation of the same parameters in irradiated rats fed a 4% protein diet does not indicate a similar loss of concentrating capability. Whether this protection is due to the growth inhibition of the 4% protein diet or some other phenomena remains to be determined

  13. Dietary fatty acids and membrane protein function.

    Science.gov (United States)

    Murphy, M G

    1990-02-01

    In recent years, there has been growing public awareness of the potential health benefits of dietary fatty acids, and of the distinction between the effects of the omega6 and omega3 polyunsaturated fatty acids that are concentrated in vegetable and fish oils, respectively. A part of the biologic effectiveness of the two families of polyunsaturated fatty acids resides in their relative roles as precursors of the eicosanoids. However, we are also beginning to appreciate that as the major components of the hydrophobic core of the membrane bilayer, they can interact with and directly influence the functioning of select integral membrane proteins. Among the most important of these are the enzymes, receptors, and ion channels that are situated in the plasma membrane of the cell, since they carry out the communication and homeostatic processes that are necessary for normal cell function. This review examines current information regarding the effects of diet-induced changes in plasma membrane fatty acid composition on several specific enzymes (adenylate cyclase, 5'-nucleotidase, Na(+)/K(+)-ATPase) and cell-surface receptors (opiate, adrenergic, insulin). Dietary manipulation studies have demonstrated a sensitivity of each to a fatty acid environment that is variably dependent on the nature of the fatty acid(s) and/or source of the membrane. The molecular mechanisms appear to involve fatty acid-dependent effects on protein conformation, on the "fluidity" and/or thickness of the membrane, or on protein synthesis. Together, the results of these studies reinforce the concept that dietary fats have the potential to regulate physiologic function and to further our understanding of how this occurs at a membrane level.

  14. Non-equilibrium coupling of protein structure and function to translation-elongation kinetics.

    Science.gov (United States)

    Sharma, Ajeet K; O'Brien, Edward P

    2018-04-01

    Protein folding research has been dominated by the assumption that thermodynamics determines protein structure and function. And that when the folding process is compromised in vivo the proteostasis machinery-chaperones, deaggregases, the proteasome-work to restore proteins to their soluble, functional form or degrade them to maintain the cellular pool of proteins in a quasi-equilibrium state. During the past decade, however, more and more proteins have been identified for which altering only their speed of synthesis alters their structure and function, the efficiency of the down-stream processes they take part in, and cellular phenotype. Indeed, evidence has emerged that evolutionary selection pressures have encoded translation-rate information into mRNA molecules to coordinate diverse co-translational processes. Thus, non-equilibrium physics can play a fundamental role in influencing nascent protein behavior, mRNA sequence evolution, and disease. Here, we discuss how our understanding of this phenomenon is being advanced by the application of theoretical tools from the physical sciences. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. A transthyretin-related protein is functionally expressed in Herbaspirillum seropedicae.

    Science.gov (United States)

    Matiollo, Camila; Vernal, Javier; Ecco, Gabriela; Bertoldo, Jean Borges; Razzera, Guilherme; de Souza, Emanuel M; Pedrosa, Fábio O; Terenzi, Hernán

    2009-10-02

    Transthyretin-related proteins (TRPs) constitute a family of proteins structurally related to transthyretin (TTR) and are found in a large range of bacterial, fungal, plant, invertebrate, and vertebrate species. However, it was recently recognized that both prokaryotic and eukaryotic members of this family are not functionally related to transthyretins. TRPs are in fact involved in the purine catabolic pathway and function as hydroxyisourate hydrolases. An open reading frame encoding a protein similar to the Escherichia coli TRP was identified in Herbaspirillum seropedicae genome (Hs_TRP). It was cloned, overexpressed in E. coli, and purified to homogeneity. Mass spectrometry data confirmed the identity of this protein, and circular dichroism spectrum indicated a predominance of beta-sheet structure, as expected for a TRP. We have demonstrated that Hs_TRP is a 5-hydroxyisourate hydrolase and by site-directed mutagenesis the importance of three conserved catalytic residues for Hs_TRP activity was further confirmed. The production of large quantities of this recombinant protein opens up the possibility of obtaining its 3D-structure and will help further investigations into purine catabolism.

  16. Optimizing scoring function of protein-nucleic acid interactions with both affinity and specificity.

    Directory of Open Access Journals (Sweden)

    Zhiqiang Yan

    Full Text Available Protein-nucleic acid (protein-DNA and protein-RNA recognition is fundamental to the regulation of gene expression. Determination of the structures of the protein-nucleic acid recognition and insight into their interactions at molecular level are vital to understanding the regulation function. Recently, quantitative computational approach has been becoming an alternative of experimental technique for predicting the structures and interactions of biomolecular recognition. However, the progress of protein-nucleic acid structure prediction, especially protein-RNA, is far behind that of the protein-ligand and protein-protein structure predictions due to the lack of reliable and accurate scoring function for quantifying the protein-nucleic acid interactions. In this work, we developed an accurate scoring function (named as SPA-PN, SPecificity and Affinity of the Protein-Nucleic acid interactions for protein-nucleic acid interactions by incorporating both the specificity and affinity into the optimization strategy. Specificity and affinity are two requirements of highly efficient and specific biomolecular recognition. Previous quantitative descriptions of the biomolecular interactions considered the affinity, but often ignored the specificity owing to the challenge of specificity quantification. We applied our concept of intrinsic specificity to connect the conventional specificity, which circumvents the challenge of specificity quantification. In addition to the affinity optimization, we incorporated the quantified intrinsic specificity into the optimization strategy of SPA-PN. The testing results and comparisons with other scoring functions validated that SPA-PN performs well on both the prediction of binding affinity and identification of native conformation. In terms of its performance, SPA-PN can be widely used to predict the protein-nucleic acid structures and quantify their interactions.

  17. Investigation and identification of functional post-translational modification sites associated with drug binding and protein-protein interactions.

    Science.gov (United States)

    Su, Min-Gang; Weng, Julia Tzu-Ya; Hsu, Justin Bo-Kai; Huang, Kai-Yao; Chi, Yu-Hsiang; Lee, Tzong-Yi

    2017-12-21

    Protein post-translational modification (PTM) plays an essential role in various cellular processes that modulates the physical and chemical properties, folding, conformation, stability and activity of proteins, thereby modifying the functions of proteins. The improved throughput of mass spectrometry (MS) or MS/MS technology has not only brought about a surge in proteome-scale studies, but also contributed to a fruitful list of identified PTMs. However, with the increase in the number of identified PTMs, perhaps the more crucial question is what kind of biological mechanisms these PTMs are involved in. This is particularly important in light of the fact that most protein-based pharmaceuticals deliver their therapeutic effects through some form of PTM. Yet, our understanding is still limited with respect to the local effects and frequency of PTM sites near pharmaceutical binding sites and the interfaces of protein-protein interaction (PPI). Understanding PTM's function is critical to our ability to manipulate the biological mechanisms of protein. In this study, to understand the regulation of protein functions by PTMs, we mapped 25,835 PTM sites to proteins with available three-dimensional (3D) structural information in the Protein Data Bank (PDB), including 1785 modified PTM sites on the 3D structure. Based on the acquired structural PTM sites, we proposed to use five properties for the structural characterization of PTM substrate sites: the spatial composition of amino acids, residues and side-chain orientations surrounding the PTM substrate sites, as well as the secondary structure, division of acidity and alkaline residues, and solvent-accessible surface area. We further mapped the structural PTM sites to the structures of drug binding and PPI sites, identifying a total of 1917 PTM sites that may affect PPI and 3951 PTM sites associated with drug-target binding. An integrated analytical platform (CruxPTM), with a variety of methods and online molecular docking

  18. Patchwork structure-function analysis of the Sendai virus matrix protein.

    Science.gov (United States)

    Mottet-Osman, Geneviève; Miazza, Vincent; Vidalain, Pierre-Olivier; Roux, Laurent

    2014-09-01

    Paramyxoviruses contain a bi-lipidic envelope decorated by two transmembrane glycoproteins and carpeted on the inner surface with a layer of matrix proteins (M), thought to bridge the glycoproteins with the viral nucleocapsids. To characterize M structure-function features, a set of M domains were mutated or deleted. The genes encoding these modified M were incorporated into recombinant Sendai viruses and expressed as supplemental proteins. Using a method of integrated suppression complementation system (ISCS), the functions of these M mutants were analyzed in the context of the infection. Cellular membrane association, localization at the cell periphery, nucleocapsid binding, cellular protein interactions and promotion of viral particle formation were characterized in relation with the mutations. At the end, lack of nucleocapsid binding go together with lack of cell surface localization and both features definitely correlate with loss of M global function estimated by viral particle production. Copyright © 2014 Elsevier Inc. All rights reserved.

  19. The evolution of function in strictosidine synthase-like proteins.

    Science.gov (United States)

    Hicks, Michael A; Barber, Alan E; Giddings, Lesley-Ann; Caldwell, Jenna; O'Connor, Sarah E; Babbitt, Patricia C

    2011-11-01

    The exponential growth of sequence data provides abundant information for the discovery of new enzyme reactions. Correctly annotating the functions of highly diverse proteins can be difficult, however, hindering use of this information. Global analysis of large superfamilies of related proteins is a powerful strategy for understanding the evolution of reactions by identifying catalytic commonalities and differences in reaction and substrate specificity, even when only a few members have been biochemically or structurally characterized. A comparison of >2500 sequences sharing the six-bladed β-propeller fold establishes sequence, structural, and functional links among the three subgroups of the functionally diverse N6P superfamily: the arylesterase-like and senescence marker protein-30/gluconolactonase/luciferin-regenerating enzyme-like (SGL) subgroups, representing enzymes that catalyze lactonase and related hydrolytic reactions, and the so-called strictosidine synthase-like (SSL) subgroup. Metal-coordinating residues were identified as broadly conserved in the active sites of all three subgroups except for a few proteins from the SSL subgroup, which have been experimentally determined to catalyze the quite different strictosidine synthase (SS) reaction, a metal-independent condensation reaction. Despite these differences, comparison of conserved catalytic features of the arylesterase-like and SGL enzymes with the SSs identified similar structural and mechanistic attributes between the hydrolytic reactions catalyzed by the former and the condensation reaction catalyzed by SS. The results also suggest that despite their annotations, the great majority of these >500 SSL sequences do not catalyze the SS reaction; rather, they likely catalyze hydrolytic reactions typical of the other two subgroups instead. This prediction was confirmed experimentally for one of these proteins. Copyright © 2011 Wiley-Liss, Inc.

  20. Bioinformatic analysis of microRNA biogenesis and function related proteins in eleven animal genomes.

    Science.gov (United States)

    Liu, Xiuying; Luo, GuanZheng; Bai, Xiujuan; Wang, Xiu-Jie

    2009-10-01

    MicroRNAs are approximately 22 nt long small non-coding RNAs that play important regulatory roles in eukaryotes. The biogenesis and functional processes of microRNAs require the participation of many proteins, of which, the well studied ones are Dicer, Drosha, Argonaute and Exportin 5. To systematically study these four protein families, we screened 11 animal genomes to search for genes encoding above mentioned proteins, and identified some new members for each family. Domain analysis results revealed that most proteins within the same family share identical or similar domains. Alternative spliced transcript variants were found for some proteins. We also examined the expression patterns of these proteins in different human tissues and identified other proteins that could potentially interact with these proteins. These findings provided systematic information on the four key proteins involved in microRNA biogenesis and functional pathways in animals, and will shed light on further functional studies of these proteins.

  1. Xanthophylls as modulators of membrane protein function.

    Science.gov (United States)

    Ruban, Alexander V; Johnson, Matthew P

    2010-12-01

    This review discusses the structural aspect of the role of photosynthetic antenna xanthophylls. It argues that xanthophyll hydrophobicity/polarity could explain the reason for xanthophyll variety and help to understand their recently emerging function--control of membrane organization and the work of membrane proteins. The structure of a xanthophyll molecule is discussed in relation to other amphiphilic compounds like lipids, detergents, etc. Xanthophyll composition of membrane proteins, the role of their variety in protein function are discussed using as an example for the major light harvesting antenna complex of photosystem II, LHCII, from higher plants. A new empirical parameter, hydrophobicity parameter (H-parameter), has been introduced as an effective measure of the hydrophobicity of the xanthophyll complement of LHCII from different xanthophyll biosynthesis mutants of Arabidopsis. Photosystem II quantum efficiency was found to correlate well with the H-parameter of LHCII xanthophylls. PSII down-regulation by non-photochemical chlorophyll fluorescence quenching, NPQ, had optimum corresponding to the wild-type xanthophyll composition, where lutein occupies intrinsic sites, L1 and L2. Xanthophyll polarity/hydrophobicity alteration by the activity of the xanthophyll cycle explains the allosteric character of NPQ regulation, memory of illumination history and the hysteretic nature of the relationship between the triggering factor, ΔpH, and the energy dissipation process. Copyright © 2010 Elsevier Inc. All rights reserved.

  2. Predicting Structure and Function for Novel Proteins of an Extremophilic Iron Oxidizing Bacterium

    Science.gov (United States)

    Wheeler, K.; Zemla, A.; Banfield, J.; Thelen, M.

    2007-12-01

    Proteins isolated from uncultivated microbial populations represent the functional components of microbial processes and contribute directly to community fitness under natural conditions. Investigations into proteins in the environment are hindered by the lack of genome data, or where available, the high proportion of proteins of unknown function. We have identified thousands of proteins from biofilms in the extremely acidic drainage outflow of an iron mine ecosystem (1). With an extensive genomic and proteomic foundation, we have focused directly on the problem of several hundred proteins of unknown function within this well-defined model system. Here we describe the geobiological insights gained by using a high throughput computational approach for predicting structure and function of 421 novel proteins from the biofilm community. We used a homology based modeling system to compare these proteins to those of known structure (AS2TS) (2). This approach has resulted in the assignment of structures to 360 proteins (85%) and provided functional information for up to 75% of the modeled proteins. Detailed examination of the modeling results enables confident, high-throughput prediction of the roles of many of the novel proteins within the microbial community. For instance, one prediction places a protein in the phosphoenolpyruvate/pyruvate domain superfamily as a carboxylase that fills in a gap in an otherwise complete carbon cycle. Particularly important for a community in such a metal rich environment is the evolution of over 25% of the novel proteins that contain a metal cofactor; of these, one third are likely Fe containing proteins. Two of the most abundant proteins in biofilm samples are unusual c-type cytochromes. Both of these proteins catalyze iron- oxidation, a key metabolic reaction supporting the energy requirements of this community. Structural models of these cytochromes verify our experimental results on heme binding and electron transfer reactivity, and

  3. Broadening the functionality of a J-protein/Hsp70 molecular chaperone system.

    Science.gov (United States)

    Schilke, Brenda A; Ciesielski, Szymon J; Ziegelhoffer, Thomas; Kamiya, Erina; Tonelli, Marco; Lee, Woonghee; Cornilescu, Gabriel; Hines, Justin K; Markley, John L; Craig, Elizabeth A

    2017-10-01

    By binding to a multitude of polypeptide substrates, Hsp70-based molecular chaperone systems perform a range of cellular functions. All J-protein co-chaperones play the essential role, via action of their J-domains, of stimulating the ATPase activity of Hsp70, thereby stabilizing its interaction with substrate. In addition, J-proteins drive the functional diversity of Hsp70 chaperone systems through action of regions outside their J-domains. Targeting to specific locations within a cellular compartment and binding of specific substrates for delivery to Hsp70 have been identified as modes of J-protein specialization. To better understand J-protein specialization, we concentrated on Saccharomyces cerevisiae SIS1, which encodes an essential J-protein of the cytosol/nucleus. We selected suppressors that allowed cells lacking SIS1 to form colonies. Substitutions changing single residues in Ydj1, a J-protein, which, like Sis1, partners with Hsp70 Ssa1, were isolated. These gain-of-function substitutions were located at the end of the J-domain, suggesting that suppression was connected to interaction with its partner Hsp70, rather than substrate binding or subcellular localization. Reasoning that, if YDJ1 suppressors affect Ssa1 function, substitutions in Hsp70 itself might also be able to overcome the cellular requirement for Sis1, we carried out a selection for SSA1 suppressor mutations. Suppressing substitutions were isolated that altered sites in Ssa1 affecting the cycle of substrate interaction. Together, our results point to a third, additional means by which J-proteins can drive Hsp70's ability to function in a wide range of cellular processes-modulating the Hsp70-substrate interaction cycle.

  4. Application of empirical hydration distribution functions around polar atoms for assessing hydration structures of proteins

    International Nuclear Information System (INIS)

    Matsuoka, Daisuke; Nakasako, Masayoshi

    2013-01-01

    Highlights: ► Empirical distribution functions of water molecules in protein hydration are made. ► The functions measure how hydrogen-bond geometry in hydration deviate from ideal. ► The functions assess experimentally identified hydration structures of protein. - Abstract: To quantitatively characterize hydrogen-bond geometry in local hydration structures of proteins, we constructed a set of empirical hydration distribution functions (EHDFs) around polar protein atoms in the main and side chains of 11 types of hydrophilic amino acids (D. Matsuoka, M. Nakasako, Journal of Physical Chemistry B 113 (2009) 11274). The functions are the ensemble average of possible hydration patterns around the polar atoms, and describe the anisotropic deviations from ideal hydrogen bond geometry. In addition, we defined probability distribution function of hydration water molecules (PDFH) over the hydrophilic surface of a protein as the sum of EHDFs of solvent accessible polar protein atoms. The functions envelop most of hydration sites identified in crystal structures of proteins (D. Matsuoka, M. Nakasako, Journal of Physical Chemistry B 114 (2010) 4652). Here we propose the application of EHDFs and PDFHs for assessing crystallographically identified hydration structures of proteins. First, hydration water molecules are classified with respect to the geometry in hydrogen bonds in referring EHDFs. Difference Fourier electron density map weighted by PDFH of protein is proposed to identify easily density peaks as candidates of hydration water molecules. A computer program implementing those ideas was developed and used for assessing hydration structures of proteins

  5. A surprising role for conformational entropy in protein function

    Science.gov (United States)

    Wand, A. Joshua; Moorman, Veronica R.; Harpole, Kyle W.

    2014-01-01

    Formation of high-affinity complexes is critical for the majority of enzymatic reactions involving proteins. The creation of the family of Michaelis and other intermediate complexes during catalysis clearly involves a complicated manifold of interactions that are diverse and complex. Indeed, computing the energetics of interactions between proteins and small molecule ligands using molecular structure alone remains a grand challenge. One of the most difficult contributions to the free energy of protein-ligand complexes to experimentally access is that due to changes in protein conformational entropy. Fortunately, recent advances in solution nuclear magnetic resonance (NMR) relaxation methods have enabled the use of measures-of-motion between conformational states of a protein as a proxy for conformational entropy. This review briefly summarizes the experimental approaches currently employed to characterize fast internal motion in proteins, how this information is used to gain insight into conformational entropy, what has been learned and what the future may hold for this emerging view of protein function. PMID:23478875

  6. Multifarious Functions of the Fragile X Mental Retardation Protein.

    Science.gov (United States)

    Davis, Jenna K; Broadie, Kendal

    2017-10-01

    Fragile X syndrome (FXS), a heritable intellectual and autism spectrum disorder (ASD), results from the loss of Fragile X mental retardation protein (FMRP). This neurodevelopmental disease state exhibits neural circuit hyperconnectivity and hyperexcitability. Canonically, FMRP functions as an mRNA-binding translation suppressor, but recent findings have enormously expanded its proposed roles. Although connections between burgeoning FMRP functions remain unknown, recent advances have extended understanding of its involvement in RNA, channel, and protein binding that modulate calcium signaling, activity-dependent critical period development, and the excitation-inhibition (E/I) neural circuitry balance. In this review, we contextualize 3 years of FXS model research. Future directions extrapolated from recent advances focus on discovering links between FMRP roles to determine whether FMRP has a multitude of unrelated functions or whether combinatorial mechanisms can explain its multifaceted existence. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. Global functional atlas of Escherichia coli encompassing previously uncharacterized proteins.

    Directory of Open Access Journals (Sweden)

    Pingzhao Hu

    2009-04-01

    Full Text Available One-third of the 4,225 protein-coding genes of Escherichia coli K-12 remain functionally unannotated (orphans. Many map to distant clades such as Archaea, suggesting involvement in basic prokaryotic traits, whereas others appear restricted to E. coli, including pathogenic strains. To elucidate the orphans' biological roles, we performed an extensive proteomic survey using affinity-tagged E. coli strains and generated comprehensive genomic context inferences to derive a high-confidence compendium for virtually the entire proteome consisting of 5,993 putative physical interactions and 74,776 putative functional associations, most of which are novel. Clustering of the respective probabilistic networks revealed putative orphan membership in discrete multiprotein complexes and functional modules together with annotated gene products, whereas a machine-learning strategy based on network integration implicated the orphans in specific biological processes. We provide additional experimental evidence supporting orphan participation in protein synthesis, amino acid metabolism, biofilm formation, motility, and assembly of the bacterial cell envelope. This resource provides a "systems-wide" functional blueprint of a model microbe, with insights into the biological and evolutionary significance of previously uncharacterized proteins.

  8. Functional modules by relating protein interaction networks and gene expression.

    Science.gov (United States)

    Tornow, Sabine; Mewes, H W

    2003-11-01

    Genes and proteins are organized on the basis of their particular mutual relations or according to their interactions in cellular and genetic networks. These include metabolic or signaling pathways and protein interaction, regulatory or co-expression networks. Integrating the information from the different types of networks may lead to the notion of a functional network and functional modules. To find these modules, we propose a new technique which is based on collective, multi-body correlations in a genetic network. We calculated the correlation strength of a group of genes (e.g. in the co-expression network) which were identified as members of a module in a different network (e.g. in the protein interaction network) and estimated the probability that this correlation strength was found by chance. Groups of genes with a significant correlation strength in different networks have a high probability that they perform the same function. Here, we propose evaluating the multi-body correlations by applying the superparamagnetic approach. We compare our method to the presently applied mean Pearson correlations and show that our method is more sensitive in revealing functional relationships.

  9. Cognitive Function and Heat Shock Protein 70 in Children With Temporal Lobe Epilepsy.

    Science.gov (United States)

    Oraby, Azza M; Raouf, Ehab R Abdol; El-Saied, Mostafa M; Abou-Khadra, Maha K; Helal, Suzette I; Hashish, Adel F

    2017-01-01

    We conducted the present study to examine cognitive function and serum heat shock protein 70 levels among children with temporal lobe epilepsy. The Stanford-Binet Intelligence Test was carried out to examine cognitive function in 30 children with temporal lobe epilepsy and 30 controls. Serum heat shock protein 70 levels were determined with an enzyme-linked immunosorbent assay. The epilepsy group had significantly lower cognitive function testing scores and significantly higher serum heat shock protein 70 levels than the control group; there were significant negative correlations between serum heat shock protein 70 levels and short-term memory and composite scores. Children with uncontrolled seizures had significantly lower verbal reasoning scores and significantly higher serum heat shock protein 70 levels than children with controlled seizures. Children with temporal lobe epilepsy have cognitive dysfunction and elevated levels of serum heat shock protein 70, which may be considered a stress biomarker.

  10. Orientation-dependent backbone-only residue pair scoring functions for fixed backbone protein design

    Directory of Open Access Journals (Sweden)

    Bordner Andrew J

    2010-04-01

    Full Text Available Abstract Background Empirical scoring functions have proven useful in protein structure modeling. Most such scoring functions depend on protein side chain conformations. However, backbone-only scoring functions do not require computationally intensive structure optimization and so are well suited to protein design, which requires fast score evaluation. Furthermore, scoring functions that account for the distinctive relative position and orientation preferences of residue pairs are expected to be more accurate than those that depend only on the separation distance. Results Residue pair scoring functions for fixed backbone protein design were derived using only backbone geometry. Unlike previous studies that used spherical harmonics to fit 2D angular distributions, Gaussian Mixture Models were used to fit the full 3D (position only and 6D (position and orientation distributions of residue pairs. The performance of the 1D (residue separation only, 3D, and 6D scoring functions were compared by their ability to identify correct threading solutions for a non-redundant benchmark set of protein backbone structures. The threading accuracy was found to steadily increase with increasing dimension, with the 6D scoring function achieving the highest accuracy. Furthermore, the 3D and 6D scoring functions were shown to outperform side chain-dependent empirical potentials from three other studies. Next, two computational methods that take advantage of the speed and pairwise form of these new backbone-only scoring functions were investigated. The first is a procedure that exploits available sequence data by averaging scores over threading solutions for homologs. This was evaluated by applying it to the challenging problem of identifying interacting transmembrane alpha-helices and found to further improve prediction accuracy. The second is a protein design method for determining the optimal sequence for a backbone structure by applying Belief Propagation

  11. TTT and PIKK Complex Genes Reverted to Single Copy Following Polyploidization and Retain Function Despite Massive Retrotransposition in Maize.

    Science.gov (United States)

    Garcia, Nelson; Messing, Joachim

    2017-01-01

    The TEL2, TTI1, and TTI2 proteins are co-chaperones for heat shock protein 90 (HSP90) to regulate the protein folding and maturation of phosphatidylinositol 3-kinase-related kinases (PIKKs). Referred to as the TTT complex, the genes that encode them are highly conserved from man to maize. TTT complex and PIKK genes exist mostly as single copy genes in organisms where they have been characterized. Members of this interacting protein network in maize were identified and synteny analyses were performed to study their evolution. Similar to other species, there is only one copy of each of these genes in maize which was due to a loss of the duplicated copy created by ancient allotetraploidy. Moreover, the retained copies of the TTT complex and the PIKK genes tolerated extensive retrotransposon insertion in their introns that resulted in increased gene lengths and gene body methylation, without apparent effect in normal gene expression and function. The results raise an interesting question on whether the reversion to single copy was due to selection against deleterious unbalanced gene duplications between members of the complex as predicted by the gene balance hypothesis, or due to neutral loss of extra copies. Uneven alteration of dosage either by adding extra copies or modulating gene expression of complex members is being proposed as a means to investigate whether the data supports the gene balance hypothesis or not.

  12. TTT and PIKK Complex Genes Reverted to Single Copy Following Polyploidization and Retain Function Despite Massive Retrotransposition in Maize

    Directory of Open Access Journals (Sweden)

    Nelson Garcia

    2017-11-01

    Full Text Available The TEL2, TTI1, and TTI2 proteins are co-chaperones for heat shock protein 90 (HSP90 to regulate the protein folding and maturation of phosphatidylinositol 3-kinase-related kinases (PIKKs. Referred to as the TTT complex, the genes that encode them are highly conserved from man to maize. TTT complex and PIKK genes exist mostly as single copy genes in organisms where they have been characterized. Members of this interacting protein network in maize were identified and synteny analyses were performed to study their evolution. Similar to other species, there is only one copy of each of these genes in maize which was due to a loss of the duplicated copy created by ancient allotetraploidy. Moreover, the retained copies of the TTT complex and the PIKK genes tolerated extensive retrotransposon insertion in their introns that resulted in increased gene lengths and gene body methylation, without apparent effect in normal gene expression and function. The results raise an interesting question on whether the reversion to single copy was due to selection against deleterious unbalanced gene duplications between members of the complex as predicted by the gene balance hypothesis, or due to neutral loss of extra copies. Uneven alteration of dosage either by adding extra copies or modulating gene expression of complex members is being proposed as a means to investigate whether the data supports the gene balance hypothesis or not.

  13. Functional properties of tropical banded cricket (Gryllodes sigillatus) protein hydrolysates.

    Science.gov (United States)

    Hall, Felicia G; Jones, Owen G; O'Haire, Marguerite E; Liceaga, Andrea M

    2017-06-01

    Recently, the benefits of entomophagy have been widely discussed. Due to western cultures' reluctance, entomophagy practices are leaning more towards incorporating insects into food products. In this study, whole crickets (Gryllodes sigillatus) were hydrolyzed with alcalase at 0.5, 1.5, and 3.0% (w/w) for 30, 60, and 90min. Degree of hydrolysis (DH), amino acid composition, solubility, emulsion and foaming properties were evaluated. Hydrolysis produced peptides with 26-52% DH compared to the control containing no enzyme (5% DH). Protein solubility of hydrolysates improved (p30% soluble protein at pH 3 and 7 and 50-90% at alkaline pH, compared with the control. Emulsion activity index ranged from 7 to 32m 2 /g, while foamability ranged from 100 to 155% for all hydrolysates. These improved functional properties demonstrate the potential to develop cricket protein hydrolysates as a source of functional alternative protein in food ingredient formulations. Copyright © 2016 Elsevier Ltd. All rights reserved.

  14. Functional analysis of virion host shutoff protein of pseudorabies virus

    International Nuclear Information System (INIS)

    Lin, H.-W.; Chang, Y.-Y.; Wong, M.-L.; Lin, J.-W.; Chang, T.-J.

    2004-01-01

    During lytic infection, the virion host shutoff (vhs) protein of alphaherpesviruses causes the degradation of mRNAs nonspecifically. In this work, we cloned the vhs gene (UL41 open reading frame) of pseudorabies virus (PRV; TNL strain) by PCR, and its nucleotide sequences were determined. The PCR product of vhs gene was subcloned into the prokaryotic pET32b expression vector, and production of the recombinant vhs protein was examined by SDS-PAGE. Result of Western blotting demonstrated that our recombinant vhs protein reacted with antiserum against a synthetic peptide of 17 amino acids of the vhs protein. After purification with nickel-chelate affinity chromatography, the purified recombinant vhs protein exhibited in vitro ribonuclease activity as expected. We further cloned the vhs gene into eukaryotic expression vectors and investigated the intracellular function of vhs protein by DNA transfection. By transient trasfection and CAT assay, we found the CAT activity was reduced in the presence of vhs, indicating that degradation of mRNA of the CAT gene was caused by the vhs. Furthermore, our results showed that the plaque formation of pseudorabies virus was blocked by exogenous vhs. Taken together, we have cloned the vhs gene of pseudorabies virus (TNL strain) and conducted functional analysis of the recombinant vhs protein in vitro as well as in vivo

  15. NPPD: A Protein-Protein Docking Scoring Function Based on Dyadic Differences in Networks of Hydrophobic and Hydrophilic Amino Acid Residues

    Directory of Open Access Journals (Sweden)

    Edward S. C. Shih

    2015-03-01

    Full Text Available Protein-protein docking (PPD predictions usually rely on the use of a scoring function to rank docking models generated by exhaustive sampling. To rank good models higher than bad ones, a large number of scoring functions have been developed and evaluated, but the methods used for the computation of PPD predictions remain largely unsatisfactory. Here, we report a network-based PPD scoring function, the NPPD, in which the network consists of two types of network nodes, one for hydrophobic and the other for hydrophilic amino acid residues, and the nodes are connected when the residues they represent are within a certain contact distance. We showed that network parameters that compute dyadic interactions and those that compute heterophilic interactions of the amino acid networks thus constructed allowed NPPD to perform well in a benchmark evaluation of 115 PPD scoring functions, most of which, unlike NPPD, are based on some sort of protein-protein interaction energy. We also showed that NPPD was highly complementary to these energy-based scoring functions, suggesting that the combined use of conventional scoring functions and NPPD might significantly improve the accuracy of current PPD predictions.

  16. Overlapping functions of argonaute proteins in patterning and morphogenesis of Drosophila embryos.

    Directory of Open Access Journals (Sweden)

    Wibke J Meyer

    2006-08-01

    Full Text Available Argonaute proteins are essential components of the molecular machinery that drives RNA silencing. In Drosophila, different members of the Argonaute family of proteins have been assigned to distinct RNA silencing pathways. While Ago1 is required for microRNA function, Ago2 is a crucial component of the RNA-induced silencing complex in siRNA-triggered RNA interference. Drosophila Ago2 contains an unusual amino-terminus with two types of imperfect glutamine-rich repeats (GRRs of unknown function. Here we show that the GRRs of Ago2 are essential for the normal function of the protein. Alleles with reduced numbers of GRRs cause specific disruptions in two morphogenetic processes associated with the midblastula transition: membrane growth and microtubule-based organelle transport. These defects do not appear to result from disruption of siRNA-dependent processes but rather suggest an interference of the mutant Ago2 proteins in an Ago1-dependent pathway. Using loss-of-function alleles, we further demonstrate that Ago1 and Ago2 act in a partially redundant manner to control the expression of the segment-polarity gene wingless in the early embryo. Our findings argue against a strict separation of Ago1 and Ago2 functions and suggest that these proteins act in concert to control key steps of the midblastula transition and of segmental patterning.

  17. KARAKTERISTIK FUNGSIONAL PROTEIN MISELIUM JAMUR TIRAM MERAH MUDA DAN MERANG [Functional Characteristics of Protein Mycelium of Pink Oyster and Paddy Straw Mushrooms

    Directory of Open Access Journals (Sweden)

    Sukarno*

    2014-06-01

    Full Text Available Mycelium of mushroom contained high protein, which determined its functional characteristics such as water holding capacity (WHC, oil holding capacity (OAC, emulsion stability, and gel formation. This study aimed to determine the protein functional properties of Pleurotus flabellatus and Volvariella volvacea mycelia. Information obtained can be used to increase utilization of the mycelia as source of food. Mycelia biomass were obtained by growing the fungal cultures in Potato Dextrose Broth (PDB on shaker at 100-150 rpm. Mycelia were harvested three times at 7, 8, and 9-days after inoculation for measuring their protein contents by kjehdahl method. Functional properties of mycelium protein measured were WHC, OAC, emulsion stability, and gel formation by folding test method. Based on the analysis of protein content in dry weight basis, 8-day old P. flabellatus and V. volvacea mycelia produced the highest protein contents with the value were 31.72 and 19.98%, respectively. Further analysis of protein functional properties showed that P. flabellatus mycelium had 10.38% of WHC, 0.52 mL/g of OAC, 57.14% of emulsion stability and gel strength level with the valueof 2, whereas the V. volvacea mycelium had 15.89% of WHC, 0.80 mL/g of OAC, 48.69% of emulsion stability, and did not form a gel. Protein functional properties of P. flabellatus were better than that of V. volvacea mycelium in terms of protein content, emulsion stability, and gel formation.

  18. Nanoporous microbead supported bilayers: stability, physical characterization, and incorporation of functional transmembrane proteins.

    Energy Technology Data Exchange (ETDEWEB)

    Davis, Ryan W. (University of New Mexico, Albuquerque, NM); Brozik, James A. (University of New Mexico, Albuquerque, NM); Brozik, Susan Marie; Cox, Jason M. (University of New Mexico, Albuquerque, NM); Lopez, Gabriel P. (University of New Mexico, Albuquerque, NM); Barrick, Todd A. (University of New Mexico, Albuquerque, NM); Flores, Adrean (University of New Mexico, Albuquerque, NM)

    2007-03-01

    The introduction of functional transmembrane proteins into supported bilayer-based biomimetic systems presents a significant challenge for biophysics. Among the various methods for producing supported bilayers, liposomal fusion offers a versatile method for the introduction of membrane proteins into supported bilayers on a variety of substrates. In this study, the properties of protein containing unilamellar phosphocholine lipid bilayers on nanoporous silica microspheres are investigated. The effects of the silica substrate, pore structure, and the substrate curvature on the stability of the membrane and the functionality of the membrane protein are determined. Supported bilayers on porous silica microspheres show a significant increase in surface area on surfaces with structures in excess of 10 nm as well as an overall decrease in stability resulting from increasing pore size and curvature. Comparison of the liposomal and detergent-mediated introduction of purified bacteriorhodopsin (bR) and the human type 3 serotonin receptor (5HT3R) are investigated focusing on the resulting protein function, diffusion, orientation, and incorporation efficiency. In both cases, functional proteins are observed; however, the reconstitution efficiency and orientation selectivity are significantly enhanced through detergent-mediated protein reconstitution. The results of these experiments provide a basis for bulk ionic and fluorescent dye-based compartmentalization assays as well as single-molecule optical and single-channel electrochemical interrogation of transmembrane proteins in a biomimetic platform.

  19. Fundamental Characteristics of AAA+ Protein Family Structure and Function.

    Science.gov (United States)

    Miller, Justin M; Enemark, Eric J

    2016-01-01

    Many complex cellular events depend on multiprotein complexes known as molecular machines to efficiently couple the energy derived from adenosine triphosphate hydrolysis to the generation of mechanical force. Members of the AAA+ ATPase superfamily (ATPases Associated with various cellular Activities) are critical components of many molecular machines. AAA+ proteins are defined by conserved modules that precisely position the active site elements of two adjacent subunits to catalyze ATP hydrolysis. In many cases, AAA+ proteins form a ring structure that translocates a polymeric substrate through the central channel using specialized loops that project into the central channel. We discuss the major features of AAA+ protein structure and function with an emphasis on pivotal aspects elucidated with archaeal proteins.

  20. Regulation of membrane protein function by lipid bilayer elasticity-a single molecule technology to measure the bilayer properties experienced by an embedded protein

    International Nuclear Information System (INIS)

    Lundbaek, Jens August

    2006-01-01

    Membrane protein function is generally regulated by the molecular composition of the host lipid bilayer. The underlying mechanisms have long remained enigmatic. Some cases involve specific molecular interactions, but very often lipids and other amphiphiles, which are adsorbed to lipid bilayers, regulate a number of structurally unrelated proteins in an apparently non-specific manner. It is well known that changes in the physical properties of a lipid bilayer (e.g., thickness or monolayer spontaneous curvature) can affect the function of an embedded protein. However, the role of such changes, in the general regulation of membrane protein function, is unclear. This is to a large extent due to lack of a generally accepted framework in which to understand the many observations. The present review summarizes studies which have demonstrated that the hydrophobic interactions between a membrane protein and the host lipid bilayer provide an energetic coupling, whereby protein function can be regulated by the bilayer elasticity. The feasibility of this 'hydrophobic coupling mechanism' has been demonstrated using the gramicidin channel, a model membrane protein, in planar lipid bilayers. Using voltage-dependent sodium channels, N-type calcium channels and GABA A receptors, it has been shown that membrane protein function in living cells can be regulated by amphiphile induced changes in bilayer elasticity. Using the gramicidin channel as a molecular force transducer, a nanotechnology to measure the elastic properties experienced by an embedded protein has been developed. A theoretical and technological framework, to study the regulation of membrane protein function by lipid bilayer elasticity, has been established

  1. UTILIZATION OF PLANT PROTEINS IN FUNCTIONAL NUTRITION

    Directory of Open Access Journals (Sweden)

    V. G. Kulakov

    2017-01-01

    Full Text Available Development of functional food products technology is considered to be a prospect way for creating new food products. Such products are known to be popular among consumers. Utilization of plant proteins allows to widen and improve food assortment and quality. The article represents a review of plant proteins utilization in production of functional food. For optimization of flour confectionery chemical composition the authors utilized a method of receipts modeling. Simulation of combined products is based on the principles of food combinatorics and aims to create recipes of new types of food products on basis of methods of mathematical optimization by reasonable selection of the basic raw materials, ingredients, food additives and dietary supplements, totality of which ensures formation desired organoleptic, physical and chemical properties product as well as a predetermined level of food, biological and energy value. Modeling process of combined products recipes includes the following three stages: preparation of input data for the design, formalization requirements for the composition and properties of raw ingredients and quality final product, process modeling; product design with desired structural properties.

  2. STRING 8--a global view on proteins and their functional interactions in 630 organisms

    DEFF Research Database (Denmark)

    Jensen, Lars Juhl; Kuhn, Michael; Stark, Manuel

    2008-01-01

    Functional partnerships between proteins are at the core of complex cellular phenotypes, and the networks formed by interacting proteins provide researchers with crucial scaffolds for modeling, data reduction and annotation. STRING is a database and web resource dedicated to protein-protein inter......Functional partnerships between proteins are at the core of complex cellular phenotypes, and the networks formed by interacting proteins provide researchers with crucial scaffolds for modeling, data reduction and annotation. STRING is a database and web resource dedicated to protein......-protein interactions, including both physical and functional interactions. It weights and integrates information from numerous sources, including experimental repositories, computational prediction methods and public text collections, thus acting as a meta-database that maps all interaction evidence onto a common set...... of genomes and proteins. The most important new developments in STRING 8 over previous releases include a URL-based programming interface, which can be used to query STRING from other resources, improved interaction prediction via genomic neighborhood in prokaryotes, and the inclusion of protein structures...

  3. DeepGO: predicting protein functions from sequence and interactions using a deep ontology-aware classifier.

    Science.gov (United States)

    Kulmanov, Maxat; Khan, Mohammed Asif; Hoehndorf, Robert; Wren, Jonathan

    2018-02-15

    A large number of protein sequences are becoming available through the application of novel high-throughput sequencing technologies. Experimental functional characterization of these proteins is time-consuming and expensive, and is often only done rigorously for few selected model organisms. Computational function prediction approaches have been suggested to fill this gap. The functions of proteins are classified using the Gene Ontology (GO), which contains over 40 000 classes. Additionally, proteins have multiple functions, making function prediction a large-scale, multi-class, multi-label problem. We have developed a novel method to predict protein function from sequence. We use deep learning to learn features from protein sequences as well as a cross-species protein-protein interaction network. Our approach specifically outputs information in the structure of the GO and utilizes the dependencies between GO classes as background information to construct a deep learning model. We evaluate our method using the standards established by the Computational Assessment of Function Annotation (CAFA) and demonstrate a significant improvement over baseline methods such as BLAST, in particular for predicting cellular locations. Web server: http://deepgo.bio2vec.net, Source code: https://github.com/bio-ontology-research-group/deepgo. robert.hoehndorf@kaust.edu.sa. Supplementary data are available at Bioinformatics online. © The Author(s) 2017. Published by Oxford University Press.

  4. Small sets of interacting proteins suggest functional linkage mechanisms via Bayesian analogical reasoning.

    Science.gov (United States)

    Airoldi, Edoardo M; Heller, Katherine A; Silva, Ricardo

    2011-07-01

    Proteins and protein complexes coordinate their activity to execute cellular functions. In a number of experimental settings, including synthetic genetic arrays, genetic perturbations and RNAi screens, scientists identify a small set of protein interactions of interest. A working hypothesis is often that these interactions are the observable phenotypes of some functional process, which is not directly observable. Confirmatory analysis requires finding other pairs of proteins whose interaction may be additional phenotypical evidence about the same functional process. Extant methods for finding additional protein interactions rely heavily on the information in the newly identified set of interactions. For instance, these methods leverage the attributes of the individual proteins directly, in a supervised setting, in order to find relevant protein pairs. A small set of protein interactions provides a small sample to train parameters of prediction methods, thus leading to low confidence. We develop RBSets, a computational approach to ranking protein interactions rooted in analogical reasoning; that is, the ability to learn and generalize relations between objects. Our approach is tailored to situations where the training set of protein interactions is small, and leverages the attributes of the individual proteins indirectly, in a Bayesian ranking setting that is perhaps closest to propensity scoring in mathematical psychology. We find that RBSets leads to good performance in identifying additional interactions starting from a small evidence set of interacting proteins, for which an underlying biological logic in terms of functional processes and signaling pathways can be established with some confidence. Our approach is scalable and can be applied to large databases with minimal computational overhead. Our results suggest that analogical reasoning within a Bayesian ranking problem is a promising new approach for real-time biological discovery. Java code is available at

  5. Rift Valley fever virus NSs protein functions and the similarity to other bunyavirus NSs proteins.

    Science.gov (United States)

    Ly, Hoai J; Ikegami, Tetsuro

    2016-07-02

    Rift Valley fever is a mosquito-borne zoonotic disease that affects both ruminants and humans. The nonstructural (NS) protein, which is a major virulence factor for Rift Valley fever virus (RVFV), is encoded on the S-segment. Through the cullin 1-Skp1-Fbox E3 ligase complex, the NSs protein promotes the degradation of at least two host proteins, the TFIIH p62 and the PKR proteins. NSs protein bridges the Fbox protein with subsequent substrates, and facilitates the transfer of ubiquitin. The SAP30-YY1 complex also bridges the NSs protein with chromatin DNA, affecting cohesion and segregation of chromatin DNA as well as the activation of interferon-β promoter. The presence of NSs filaments in the nucleus induces DNA damage responses and causes cell-cycle arrest, p53 activation, and apoptosis. Despite the fact that NSs proteins have poor amino acid similarity among bunyaviruses, the strategy utilized to hijack host cells are similar. This review will provide and summarize an update of recent findings pertaining to the biological functions of the NSs protein of RVFV as well as the differences from those of other bunyaviruses.

  6. Missense mutation Lys18Asn in dystrophin that triggers X-linked dilated cardiomyopathy decreases protein stability, increases protein unfolding, and perturbs protein structure, but does not affect protein function.

    Directory of Open Access Journals (Sweden)

    Surinder M Singh

    Full Text Available Genetic mutations in a vital muscle protein dystrophin trigger X-linked dilated cardiomyopathy (XLDCM. However, disease mechanisms at the fundamental protein level are not understood. Such molecular knowledge is essential for developing therapies for XLDCM. Our main objective is to understand the effect of disease-causing mutations on the structure and function of dystrophin. This study is on a missense mutation K18N. The K18N mutation occurs in the N-terminal actin binding domain (N-ABD. We created and expressed the wild-type (WT N-ABD and its K18N mutant, and purified to homogeneity. Reversible folding experiments demonstrated that both mutant and WT did not aggregate upon refolding. Mutation did not affect the protein's overall secondary structure, as indicated by no changes in circular dichroism of the protein. However, the mutant is thermodynamically less stable than the WT (denaturant melts, and unfolds faster than the WT (stopped-flow kinetics. Despite having global secondary structure similar to that of the WT, mutant showed significant local structural changes at many amino acids when compared with the WT (heteronuclear NMR experiments. These structural changes indicate that the effect of mutation is propagated over long distances in the protein structure. Contrary to these structural and stability changes, the mutant had no significant effect on the actin-binding function as evident from co-sedimentation and depolymerization assays. These results summarize that the K18N mutation decreases thermodynamic stability, accelerates unfolding, perturbs protein structure, but does not affect the function. Therefore, K18N is a stability defect rather than a functional defect. Decrease in stability and increase in unfolding decrease the net population of dystrophin molecules available for function, which might trigger XLDCM. Consistently, XLDCM patients have decreased levels of dystrophin in cardiac muscle.

  7. Protein functional features are reflected in the patterns of mRNA translation speed.

    Science.gov (United States)

    López, Daniel; Pazos, Florencio

    2015-07-09

    The degeneracy of the genetic code makes it possible for the same amino acid string to be coded by different messenger RNA (mRNA) sequences. These "synonymous mRNAs" may differ largely in a number of aspects related to their overall translational efficiency, such as secondary structure content and availability of the encoded transfer RNAs (tRNAs). Consequently, they may render different yields of the translated polypeptides. These mRNA features related to translation efficiency are also playing a role locally, resulting in a non-uniform translation speed along the mRNA, which has been previously related to some protein structural features and also used to explain some dramatic effects of "silent" single-nucleotide-polymorphisms (SNPs). In this work we perform the first large scale analysis of the relationship between three experimental proxies of mRNA local translation efficiency and the local features of the corresponding encoded proteins. We found that a number of protein functional and structural features are reflected in the patterns of ribosome occupancy, secondary structure and tRNA availability along the mRNA. One or more of these proxies of translation speed have distinctive patterns around the mRNA regions coding for certain protein local features. In some cases the three patterns follow a similar trend. We also show specific examples where these patterns of translation speed point to the protein's important structural and functional features. This support the idea that the genome not only codes the protein functional features as sequences of amino acids, but also as subtle patterns of mRNA properties which, probably through local effects on the translation speed, have some consequence on the final polypeptide. These results open the possibility of predicting a protein's functional regions based on a single genomic sequence, and have implications for heterologous protein expression and fine-tuning protein function.

  8. Effects of Hydrolysed Whey Proteins on the Techno-Functional Characteristics of Whey Protein-Based Films

    Directory of Open Access Journals (Sweden)

    Klaus Noller

    2013-03-01

    Full Text Available Pure whey protein isolate (WPI-based cast films are very brittle due to its strong formation of protein cross-linking of disulphide bonding, hydrogen bonding as well as hydrophobic and electrostatic interactions. However, this strong cross-linking is the reason for its final barrier performance. To overcome film brittleness of whey protein layers, plasticisers like glycerol are used. It reduces intermolecular interactions, increases the mobility of polymer chains and thus film flexibility can be achieved. The objective of this study was to investigate the influence of hydrolysed whey protein isolate (WPI in whey protein isolate-based cast films on their techno-functional properties. Due to the fact, that the addition of glycerol is necessary but at the same time increases the free volume in the film leading to higher oxygen and water vapour permeability, the glycerol concentration was kept constant. Cast films with different ratios of hydrolysed and not hydrolysed WPI were produced. They were characterised in order to determine the influence of the lower molecular weight caused by the addition of hydrolysed WPI on the techno-functional properties. This study showed that increasing hydrolysed WPI concentrations significantly change the mechanical properties while maintaining the oxygen and water vapour permeability. The tensile and elastic film properties decreased significantly by reducing the average molecular weight whereas the yellowish coloration and the surface tension considerably increased. This study provided new data which put researchers and material developers in a position to tailor the characteristics of whey protein based films according to their intended application and further processing.

  9. The semenogelins: proteins with functions beyond reproduction?

    Science.gov (United States)

    Jonsson, M; Lundwall, A; Malm, J

    2006-12-01

    The coagulum proteins of human semen, semenogelins I and II, are secreted in abundance by the seminal vesicles. Their function in reproduction is poorly understood as they are rapidly degraded in ejaculated semen. However, more recent results indicate that it is time to put the semenogelins in a broader physiological perspective that goes beyond reproduction and fertility.

  10. The semenogelins: proteins with functions beyond reproduction?

    OpenAIRE

    Jonsson, Magnus; Lundwall, Åke; Malm, Johan

    2006-01-01

    The coagulum proteins of human semen, semenogelins I and II, are secreted in abundance by the seminal vesicles. Their function in reproduction is poorly understood as they are rapidly degraded in ejaculated semen. However, more recent results indicate that it is time to put the semenogelins in a broader physiological perspective that goes beyond reproduction and fertility.

  11. Parametric Bayesian priors and better choice of negative examples improve protein function prediction.

    Science.gov (United States)

    Youngs, Noah; Penfold-Brown, Duncan; Drew, Kevin; Shasha, Dennis; Bonneau, Richard

    2013-05-01

    Computational biologists have demonstrated the utility of using machine learning methods to predict protein function from an integration of multiple genome-wide data types. Yet, even the best performing function prediction algorithms rely on heuristics for important components of the algorithm, such as choosing negative examples (proteins without a given function) or determining key parameters. The improper choice of negative examples, in particular, can hamper the accuracy of protein function prediction. We present a novel approach for choosing negative examples, using a parameterizable Bayesian prior computed from all observed annotation data, which also generates priors used during function prediction. We incorporate this new method into the GeneMANIA function prediction algorithm and demonstrate improved accuracy of our algorithm over current top-performing function prediction methods on the yeast and mouse proteomes across all metrics tested. Code and Data are available at: http://bonneaulab.bio.nyu.edu/funcprop.html

  12. GalaxyDock BP2 score: a hybrid scoring function for accurate protein-ligand docking

    Science.gov (United States)

    Baek, Minkyung; Shin, Woong-Hee; Chung, Hwan Won; Seok, Chaok

    2017-07-01

    Protein-ligand docking is a useful tool for providing atomic-level understanding of protein functions in nature and design principles for artificial ligands or proteins with desired properties. The ability to identify the true binding pose of a ligand to a target protein among numerous possible candidate poses is an essential requirement for successful protein-ligand docking. Many previously developed docking scoring functions were trained to reproduce experimental binding affinities and were also used for scoring binding poses. However, in this study, we developed a new docking scoring function, called GalaxyDock BP2 Score, by directly training the scoring power of binding poses. This function is a hybrid of physics-based, empirical, and knowledge-based score terms that are balanced to strengthen the advantages of each component. The performance of the new scoring function exhibits significant improvement over existing scoring functions in decoy pose discrimination tests. In addition, when the score is used with the GalaxyDock2 protein-ligand docking program, it outperformed other state-of-the-art docking programs in docking tests on the Astex diverse set, the Cross2009 benchmark set, and the Astex non-native set. GalaxyDock BP2 Score and GalaxyDock2 with this score are freely available at http://galaxy.seoklab.org/softwares/galaxydock.html.

  13. Enzymatic functionalization of a nanobody using protein insertion technology.

    Science.gov (United States)

    Crasson, O; Rhazi, N; Jacquin, O; Freichels, A; Jérôme, C; Ruth, N; Galleni, M; Filée, P; Vandevenne, M

    2015-10-01

    Antibody-based products constitute one of the most attractive biological molecules for diagnostic, medical imagery and therapeutic purposes with very few side effects. Their development has become a major priority of biotech and pharmaceutical industries. Recently, a growing number of modified antibody-based products have emerged including fragments, multi-specific and conjugate antibodies. In this study, using protein engineering, we have functionalized the anti-hen egg-white lysozyme (HEWL) camelid VHH antibody fragment (cAb-Lys3), by insertion into a solvent-exposed loop of the Bacillus licheniformis β-lactamase BlaP. We showed that the generated hybrid protein conserved its enzymatic activity while the displayed nanobody retains its ability to inhibit HEWL with a nanomolar affinity range. Then, we successfully implemented the functionalized cAb-Lys3 in enzyme-linked immunosorbent assay, potentiometric biosensor and drug screening assays. The hybrid protein was also expressed on the surface of phage particles and, in this context, was able to interact specifically with HEWL while the β-lactamase activity was used to monitor phage interactions. Finally, using thrombin-cleavage sites surrounding the permissive insertion site in the β-lactamase, we reported an expression system in which the nanobody can be easily separated from its carrier protein. Altogether, our study shows that insertion into the BlaP β-lactamase constitutes a suitable technology to functionalize nanobodies and allows the creation of versatile tools that can be used in innovative biotechnological assays. © The Author 2015. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.

  14. The E4 protein; structure, function and patterns of expression

    Energy Technology Data Exchange (ETDEWEB)

    Doorbar, John, E-mail: jdoorba@nimr.mrc.ac.uk

    2013-10-15

    The papillomavirus E4 open reading frame (ORF) is contained within the E2 ORF, with the primary E4 gene-product (E1{sup ∧}E4) being translated from a spliced mRNA that includes the E1 initiation codon and adjacent sequences. E4 is located centrally within the E2 gene, in a region that encodes the E2 protein′s flexible hinge domain. Although a number of minor E4 transcripts have been reported, it is the product of the abundant E1{sup ∧}E4 mRNA that has been most extensively analysed. During the papillomavirus life cycle, the E1{sup ∧}E4 gene products generally become detectable at the onset of vegetative viral genome amplification as the late stages of infection begin. E4 contributes to genome amplification success and virus synthesis, with its high level of expression suggesting additional roles in virus release and/or transmission. In general, E4 is easily visualised in biopsy material by immunostaining, and can be detected in lesions caused by diverse papillomavirus types, including those of dogs, rabbits and cattle as well as humans. The E4 protein can serve as a biomarker of active virus infection, and in the case of high-risk human types also disease severity. In some cutaneous lesions, E4 can be expressed at higher levels than the virion coat proteins, and can account for as much as 30% of total lesional protein content. The E4 proteins of the Beta, Gamma and Mu HPV types assemble into distinctive cytoplasmic, and sometimes nuclear, inclusion granules. In general, the E4 proteins are expressed before L2 and L1, with their structure and function being modified, first by kinases as the infected cell progresses through the S and G2 cell cycle phases, but also by proteases as the cell exits the cell cycle and undergoes true terminal differentiation. The kinases that regulate E4 also affect other viral proteins simultaneously, and include protein kinase A, Cyclin-dependent kinase, members of the MAP Kinase family and protein kinase C. For HPV16 E1{sup

  15. Functional diversification of structurally alike NLR proteins in plants.

    Science.gov (United States)

    Chakraborty, Joydeep; Jain, Akansha; Mukherjee, Dibya; Ghosh, Suchismita; Das, Sampa

    2018-04-01

    In due course of evolution many pathogens alter their effector molecules to modulate the host plants' metabolism and immune responses triggered upon proper recognition by the intracellular nucleotide-binding oligomerization domain containing leucine-rich repeat (NLR) proteins. Likewise, host plants have also evolved with diversified NLR proteins as a survival strategy to win the battle against pathogen invasion. NLR protein indeed detects pathogen derived effector proteins leading to the activation of defense responses associated with programmed cell death (PCD). In this interactive process, genome structure and plasticity play pivotal role in the development of innate immunity. Despite being quite conserved with similar biological functions in all eukaryotes, the intracellular NLR immune receptor proteins happen to be structurally distinct. Recent studies have made progress in identifying transcriptional regulatory complexes activated by NLR proteins. In this review, we attempt to decipher the intracellular NLR proteins mediated surveillance across the evolutionarily diverse taxa, highlighting some of the recent updates on NLR protein compartmentalization, molecular interactions before and after activation along with insights into the finer role of these receptor proteins to combat invading pathogens upon their recognition. Latest information on NLR sensors, helpers and NLR proteins with integrated domains in the context of plant pathogen interactions are also discussed. Copyright © 2018 Elsevier B.V. All rights reserved.

  16. Sensory and Functionality Differences of Whey Protein Isolate Bleached by Hydrogen or Benzoyl Peroxide.

    Science.gov (United States)

    Smith, Tucker J; Foegeding, E Allen; Drake, MaryAnne

    2015-10-01

    Whey protein is a highly functional food ingredient used in a wide variety of applications. A large portion of fluid whey produced in the United States is derived from Cheddar cheese manufacture and contains annatto (norbixin), and therefore must be bleached. The objective of this study was to compare sensory and functionality differences between whey protein isolate (WPI) bleached by benzoyl peroxide (BP) or hydrogen peroxide (HP). HP and BP bleached WPI and unbleached controls were manufactured in triplicate. Descriptive sensory analysis and gas chromatography-mass spectrometry were conducted to determine flavor differences between treatments. Functionality differences were evaluated by measurement of foam stability, protein solubility, SDS-PAGE, and effect of NaCl concentration on gelation relative to an unbleached control. HP bleached WPI had higher concentrations of lipid oxidation and sulfur containing volatile compounds than both BP and unbleached WPI (P protein loss at pH 4.6 of WPI decreased by bleaching with either hydrogen peroxide or benzoyl peroxide (P whey with either BP or HP resulted in protein degradation, which likely contributed to functionality differences. These results demonstrate that bleaching has flavor effects as well as effects on many of the functionality characteristics of whey proteins. Whey protein isolate (WPI) is often used for its functional properties, but the effect of oxidative bleaching chemicals on the functional properties of WPI is not known. This study identifies the effects of hydrogen peroxide and benzoyl peroxide on functional and flavor characteristics of WPI bleached by hydrogen and benzoyl peroxide and provides insights for the product applications which may benefit from bleaching. © 2015 Institute of Food Technologists®

  17. Solid state protein monolayers: Morphological, conformational, and functional properties

    Science.gov (United States)

    Pompa, P. P.; Biasco, A.; Frascerra, V.; Calabi, F.; Cingolani, R.; Rinaldi, R.; Verbeet, M. Ph.; de Waal, E.; Canters, G. W.

    2004-12-01

    We have studied the morphological, conformational, and electron-transfer (ET) function of the metalloprotein azurin in the solid state, by a combination of physical investigation methods, namely atomic force microscopy, intrinsic fluorescence spectroscopy, and scanning tunneling microscopy. We demonstrate that a "solid state protein film" maintains its nativelike conformation and ET function, even after removal of the aqueous solvent.

  18. Mung bean proteins and peptides: nutritional, functional and bioactive properties

    Directory of Open Access Journals (Sweden)

    Zhu Yi-Shen

    2018-02-01

    Full Text Available To date, no extensive literature review exists regarding potential uses of mung bean proteins and peptides. As mung bean has long been widely used as a food source, early studies evaluated mung bean nutritional value against the Food and Agriculture Organization of the United Nations (FAO/the World Health Organization (WHO amino acids dietary recommendations. The comparison demonstrated mung bean to be a good protein source, except for deficiencies in sulphur-containing amino acids, methionine and cysteine. Methionine and cysteine residues have been introduced into the 8S globulin through protein engineering technology. Subsequently, purified mung bean proteins and peptides have facilitated the study of their structural and functional properties. Two main types of extraction methods have been reported for isolation of proteins and peptides from mung bean flours, permitting sequencing of major proteins present in mung bean, including albumins and globulins (notably 8S globulin. However, the sequence for albumin deposited in the UniProt database differs from other sequences reported in the literature. Meanwhile, a limited number of reports have revealed other useful bioactivities for proteins and hydrolysed peptides, including angiotensin-converting enzyme inhibitory activity, anti-fungal activity and trypsin inhibitory activity. Consequently, several mung bean hydrolysed peptides have served as effective food additives to prevent proteolysis during storage. Ultimately, further research will reveal other nutritional, functional and bioactive properties of mung bean for uses in diverse applications.

  19. Molecular design and nanoparticle-mediated intracellular delivery of functional proteins to target cellular pathways

    Science.gov (United States)

    Shah, Dhiral Ashwin

    Intracellular delivery of specific proteins and peptides represents a novel method to influence stem cells for gain-of-function and loss-of-function. Signaling control is vital in stem cells, wherein intricate control of and interplay among critical pathways directs the fate of these cells into either self-renewal or differentiation. The most common route to manipulate cellular function involves the introduction of genetic material such as full-length genes and shRNA into the cell to generate (or prevent formation of) the target protein, and thereby ultimately alter cell function. However, viral-mediated gene delivery may result in relatively slow expression of proteins and prevalence of oncogene insertion into the cell, which can alter cell function in an unpredictable fashion, and non-viral delivery may lead to low efficiency of genetic delivery. For example, the latter case plagues the generation of induced pluripotent stem cells (iPSCs) and hinders their use for in vivo applications. Alternatively, introducing proteins into cells that specifically recognize and influence target proteins, can result in immediate deactivation or activation of key signaling pathways within the cell. In this work, we demonstrate the cellular delivery of functional proteins attached to hydrophobically modified silica (SiNP) nanoparticles to manipulate specifically targeted cell signaling proteins. In the Wnt signaling pathway, we have targeted the phosphorylation activity of glycogen synthase kinase-3beta (GSK-3beta) by designing a chimeric protein and delivering it in neural stem cells. Confocal imaging indicates that the SiNP-chimeric protein conjugates were efficiently delivered to the cytosol of human embryonic kidney cells and rat neural stem cells, presumably via endocytosis. This uptake impacted the Wnt signaling cascade, indicated by the elevation of beta-catenin levels, and increased transcription of Wnt target genes, such as c-MYC. The results presented here suggest that

  20. Combining protein sequence, structure, and dynamics: A novel approach for functional evolution analysis of PAS domain superfamily.

    Science.gov (United States)

    Dong, Zheng; Zhou, Hongyu; Tao, Peng

    2018-02-01

    PAS domains are widespread in archaea, bacteria, and eukaryota, and play important roles in various functions. In this study, we aim to explore functional evolutionary relationship among proteins in the PAS domain superfamily in view of the sequence-structure-dynamics-function relationship. We collected protein sequences and crystal structure data from RCSB Protein Data Bank of the PAS domain superfamily belonging to three biological functions (nucleotide binding, photoreceptor activity, and transferase activity). Protein sequences were aligned and then used to select sequence-conserved residues and build phylogenetic tree. Three-dimensional structure alignment was also applied to obtain structure-conserved residues. The protein dynamics were analyzed using elastic network model (ENM) and validated by molecular dynamics (MD) simulation. The result showed that the proteins with same function could be grouped by sequence similarity, and proteins in different functional groups displayed statistically significant difference in their vibrational patterns. Interestingly, in all three functional groups, conserved amino acid residues identified by sequence and structure conservation analysis generally have a lower fluctuation than other residues. In addition, the fluctuation of conserved residues in each biological function group was strongly correlated with the corresponding biological function. This research suggested a direct connection in which the protein sequences were related to various functions through structural dynamics. This is a new attempt to delineate functional evolution of proteins using the integrated information of sequence, structure, and dynamics. © 2017 The Protein Society.

  1. Properties and Functions of the Dengue Virus Capsid Protein.

    Science.gov (United States)

    Byk, Laura A; Gamarnik, Andrea V

    2016-09-29

    Dengue virus affects hundreds of millions of people each year around the world, causing a tremendous social and economic impact on affected countries. The aim of this review is to summarize our current knowledge of the functions, structure, and interactions of the viral capsid protein. The primary role of capsid is to package the viral genome. There are two processes linked to this function: the recruitment of the viral RNA during assembly and the release of the genome during infection. Although particle assembly takes place on endoplasmic reticulum membranes, capsid localizes in nucleoli and lipid droplets. Why capsid accumulates in these locations during infection remains unknown. In this review, we describe available data and discuss new ideas on dengue virus capsid functions and interactions. We believe that a deeper understanding of how the capsid protein works during infection will create opportunities for novel antiviral strategies, which are urgently needed to control dengue virus infections.

  2. SITEX 2.0: Projections of protein functional sites on eukaryotic genes. Extension with orthologous genes.

    Science.gov (United States)

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2017-04-01

    Functional sites define the diversity of protein functions and are the central object of research of the structural and functional organization of proteins. The mechanisms underlying protein functional sites emergence and their variability during evolution are distinguished by duplication, shuffling, insertion and deletion of the exons in genes. The study of the correlation between a site structure and exon structure serves as the basis for the in-depth understanding of sites organization. In this regard, the development of programming resources that allow the realization of the mutual projection of exon structure of genes and primary and tertiary structures of encoded proteins is still the actual problem. Previously, we developed the SitEx system that provides information about protein and gene sequences with mapped exon borders and protein functional sites amino acid positions. The database included information on proteins with known 3D structure. However, data with respect to orthologs was not available. Therefore, we added the projection of sites positions to the exon structures of orthologs in SitEx 2.0. We implemented a search through database using site conservation variability and site discontinuity through exon structure. Inclusion of the information on orthologs allowed to expand the possibilities of SitEx usage for solving problems regarding the analysis of the structural and functional organization of proteins. Database URL: http://www-bionet.sscc.ru/sitex/ .

  3. Functional Anthology of Intrinsic Disorder. III. Ligands, Postranslational Modifications and Diseases Associated with Intrinsically Disordered Proteins

    Science.gov (United States)

    Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M.; Oldfield, Christopher J.; Dunker, A. Keith; Obradovic, Zoran; Uversky, Vladimir N.

    2008-01-01

    Currently, the understanding of the relationships between function, amino acid sequence and protein structure continues to represent one of the major challenges of the modern protein science. As much as 50% of eukaryotic proteins are likely to contain functionally important long disordered regions. Many proteins are wholly disordered but still possess numerous biologically important functions. However, the number of experimentally confirmed disordered proteins with known biological functions is substantially smaller than their actual number in nature. Therefore, there is a crucial need for novel bioinformatics approaches that allow projection of the current knowledge from a few experimentally verified examples to much larger groups of known and potential proteins. The elaboration of a bioinformatics tool for the analysis of functional diversity of intrinsically disordered proteins and application of this data mining tool to >200,000 proteins from Swiss-Prot database, each annotated with at least one of the 875 functional keywords was described in the first paper of this series (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic Z., Uversky V.N. (2006) Functional anthology of intrinsic disorder. I. Biological processes and functions of proteins with long disordered regions. J. Proteome Res.). Using this tool, we have found that out of the 711 Swiss-Prot functional keywords associated with at least 20 proteins, 262 were strongly positively correlated with long intrinsically disordered regions, and 302 were strongly negatively correlated. Illustrative examples of functional disorder or order were found for the vast majority of keywords showing strongest positive or negative correlation with intrinsic disorder, respectively. Some 80 Swiss-Prot keywords associated with disorder- and order-driven biological processes and protein functions were described in the first paper (Xie H., Vucetic S., Iakoucheva L.M., Oldfield C.J., Dunker A.K., Obradovic

  4. DMPD: G-protein-coupled receptor expression, function, and signaling in macrophages. [Dynamic Macrophage Pathway CSML Database

    Lifescience Database Archive (English)

    Full Text Available 17456803 G-protein-coupled receptor expression, function, and signaling in macropha...2007 Apr 24. (.png) (.svg) (.html) (.csml) Show G-protein-coupled receptor expression, function, and signali...ng in macrophages. PubmedID 17456803 Title G-protein-coupled receptor expression, function

  5. From Sequence and Forces to Structure, Function and Evolution of Intrinsically Disordered Proteins

    Science.gov (United States)

    Forman-Kay, Julie D.; Mittag, Tanja

    2015-01-01

    Intrinsically disordered proteins (IDPs), which lack persistent structure, are a challenge to structural biology due to the inapplicability of standard methods for characterization of folded proteins as well as their deviation from the dominant structure/function paradigm. Their widespread presence and involvement in biological function, however, has spurred the growing acceptance of the importance of IDPs and the development of new tools for studying their structure, dynamics and function. The interplay of folded and disordered domains or regions for function and the existence of a continuum of protein states with respect to conformational energetics, motional timescales and compactness is shaping a unified understanding of structure-dynamics-disorder/function relationships. On the 20th anniversary of this journal, Structure, we provide a historical perspective on the investigation of IDPs and summarize the sequence features and physical forces that underlie their unique structural, functional and evolutionary properties. PMID:24010708

  6. Protein profile of human hepatocarcinoma cell line SMMC-7721: Identification and functional analysis

    Institute of Scientific and Technical Information of China (English)

    Yi Feng; Zhong-Min Tian; Ming-Xi Wan; Zhao-Bin Zheng

    2007-01-01

    AIM: To investigate the protein profile of human hepatocarcinoma cell line SMMC-7721, to analyze the specific functions of abundant expressed proteins in the processes of hepatocarcinoma genesis, growth and metastasis, to identify the hepatocarcinoma-specific biomarkers for the early prediction in diagnosis, and to explore the new drug targets for liver cancer therapy.METHODS: Total proteins from human hepatocarcinomacell line SMMC-7721 were separated by two-dimensional electrophoresis (2DE). The silver-stained gel was analyzed by 2DE software Image Master 2D Elite.Interesting protein spots were identified by peptide mass fingerprinting based on matrix-assisted laser desorption/ionization time-of-flight mass spectrometry (MALDI-TOF-MS)and database searching.RESULTS: We obtained protein profile of human hepatocarcinoma cell line SMMC-7721. Among the twenty-one successfully identified proteins, mitofilin,endoplasmic reticulum protein ERp29, ubiquinol-cytochrome C reductase complex core protein Ⅰ,peroxisomal enoyl CoA hydratase, peroxiredoxin-4 and probable 3-oxoacid CoA transferase 1 precursor were the six novel proteins identified in human hepatocarcinoma cells or tissues. Specific functions of the identified heat-shock proteins were analyzed in detail, and the results suggested that these proteins might promote tumorigenesis via inhibiting cell death induced by several cancer-related stresses or via inhibiting apoptosis at multiple points in the apoptotic signal pathway. Other identified chaperones and cancer-related proteins were also analyzed.CONCLUSION: Based on the protein profile of SMMC-7721 cells, functional analysis suggests that the identified chaperones and cancer-related proteins have their own pathways to contribute to the tumorigenesis, tumor growth and metastasis of liver cancer. Furthermore, proteomic analysis is indicated to be feasible in the cancer study.

  7. Computer analysis of protein functional sites projection on exon structure of genes in Metazoa.

    Science.gov (United States)

    Medvedeva, Irina V; Demenkov, Pavel S; Ivanisenko, Vladimir A

    2015-01-01

    Study of the relationship between the structural and functional organization of proteins and their coding genes is necessary for an understanding of the evolution of molecular systems and can provide new knowledge for many applications for designing proteins with improved medical and biological properties. It is well known that the functional properties of proteins are determined by their functional sites. Functional sites are usually represented by a small number of amino acid residues that are distantly located from each other in the amino acid sequence. They are highly conserved within their functional group and vary significantly in structure between such groups. According to this facts analysis of the general properties of the structural organization of the functional sites at the protein level and, at the level of exon-intron structure of the coding gene is still an actual problem. One approach to this analysis is the projection of amino acid residue positions of the functional sites along with the exon boundaries to the gene structure. In this paper, we examined the discontinuity of the functional sites in the exon-intron structure of genes and the distribution of lengths and phases of the functional site encoding exons in vertebrate genes. We have shown that the DNA fragments coding the functional sites were in the same exons, or in close exons. The observed tendency to cluster the exons that code functional sites which could be considered as the unit of protein evolution. We studied the characteristics of the structure of the exon boundaries that code, and do not code, functional sites in 11 Metazoa species. This is accompanied by a reduced frequency of intercodon gaps (phase 0) in exons encoding the amino acid residue functional site, which may be evidence of the existence of evolutionary limitations to the exon shuffling. These results characterize the features of the coding exon-intron structure that affect the functionality of the encoded protein and

  8. Strategies for specifically directing metal functionalization of protein nanotubes: constructing protein coated silver nanowires

    International Nuclear Information System (INIS)

    Carreño-Fuentes, Liliana; Palomares, Laura A; Ramírez, Octavio T; Ascencio, Jorge A; Medina, Ariosto; Aguila, Sergio

    2013-01-01

    Biological molecules that self-assemble in the nanoscale range are useful multifunctional materials. Rotavirus VP6 protein self-assembles into tubular structures in the absence of other rotavirus proteins. Here, we present strategies for selectively directing metal functionalization to the lumen of VP6 nanotubes. The specific in situ metal reduction in the inner surface of nanotube walls was achieved by the simple modification of a method previously reported to functionalize the nanotube outer surface. Silver nanorods and nanowires as long as 1.5 μm were formed inside the nanotubes by coalescence of nanoparticles. Such one-dimensional structures were longer than others previously obtained using bioscaffolds. The interactions between silver ions and the nanotube were simulated to understand the conditions that allowed nanowire formation. Molecular docking showed that a naturally occurring arrangement of aspartate residues enabled the stabilization of silver ions on the internal surface of the VP6 nanotubes. This is the first time that such a spatial arrangement has been proposed for the nucleation of silver nanoparticles, opening the possibility of using such an array to direct functionalization of other biomolecules. These results demonstrate the natural capabilities of VP6 nanotubes to function as a versatile biotemplate for nanomaterials. (paper)

  9. C-Terminal Fluorescent Labeling Impairs Functionality of DNA Mismatch Repair Proteins

    Science.gov (United States)

    Brieger, Angela; Plotz, Guido; Hinrichsen, Inga; Passmann, Sandra; Adam, Ronja; Zeuzem, Stefan

    2012-01-01

    The human DNA mismatch repair (MMR) process is crucial to maintain the integrity of the genome and requires many different proteins which interact perfectly and coordinated. Germline mutations in MMR genes are responsible for the development of the hereditary form of colorectal cancer called Lynch syndrome. Various mutations mainly in two MMR proteins, MLH1 and MSH2, have been identified so far, whereas 55% are detected within MLH1, the essential component of the heterodimer MutLα (MLH1 and PMS2). Most of those MLH1 variants are pathogenic but the relevance of missense mutations often remains unclear. Many different recombinant systems are applied to filter out disease-associated proteins whereby fluorescent tagged proteins are frequently used. However, dye labeling might have deleterious effects on MutLα's functionality. Therefore, we analyzed the consequences of N- and C-terminal fluorescent labeling on expression level, cellular localization and MMR activity of MutLα. Besides significant influence of GFP- or Red-fusion on protein expression we detected incorrect shuttling of single expressed C-terminal GFP-tagged PMS2 into the nucleus and found that C-terminal dye labeling impaired MMR function of MutLα. In contrast, N-terminal tagged MutLαs retained correct functionality and can be recommended both for the analysis of cellular localization and MMR efficiency. PMID:22348133

  10. C-terminal fluorescent labeling impairs functionality of DNA mismatch repair proteins.

    Directory of Open Access Journals (Sweden)

    Angela Brieger

    Full Text Available The human DNA mismatch repair (MMR process is crucial to maintain the integrity of the genome and requires many different proteins which interact perfectly and coordinated. Germline mutations in MMR genes are responsible for the development of the hereditary form of colorectal cancer called Lynch syndrome. Various mutations mainly in two MMR proteins, MLH1 and MSH2, have been identified so far, whereas 55% are detected within MLH1, the essential component of the heterodimer MutLα (MLH1 and PMS2. Most of those MLH1 variants are pathogenic but the relevance of missense mutations often remains unclear. Many different recombinant systems are applied to filter out disease-associated proteins whereby fluorescent tagged proteins are frequently used. However, dye labeling might have deleterious effects on MutLα's functionality. Therefore, we analyzed the consequences of N- and C-terminal fluorescent labeling on expression level, cellular localization and MMR activity of MutLα. Besides significant influence of GFP- or Red-fusion on protein expression we detected incorrect shuttling of single expressed C-terminal GFP-tagged PMS2 into the nucleus and found that C-terminal dye labeling impaired MMR function of MutLα. In contrast, N-terminal tagged MutLαs retained correct functionality and can be recommended both for the analysis of cellular localization and MMR efficiency.

  11. Knowledge, perceptions and preferences of elderly regarding protein-enriched functional food.

    Science.gov (United States)

    van der Zanden, Lotte D T; van Kleef, Ellen; de Wijk, René A; van Trijp, Hans C M

    2014-09-01

    Promoting protein consumption in the elderly population may contribute to improving the quality of their later years in life. Our study aimed to explore knowledge, perceptions and preferences of elderly consumers regarding protein-enriched food. We conducted three focus groups with independently living (ID) elderly (N = 24, Mage = 67 years) and three with elderly living in a residential home (RH) (N = 18, Mage = 83 years). Both the ID and RH elderly were predominantly sceptical about functional food in general. Confusion, distrust and a perceived lack of personal relevance were main perceived barriers to purchasing and consuming these products, although a majority of the participants did report occasionally consuming at least one type of functional food. For the ID elderly, medical advice was an important facilitator that could overcome barriers to purchasing and consuming protein-enriched food, indicating the importance of personal relevance for this group. For the RH elderly, in contrast, sensory appeal of protein-enriched foods was a facilitator. Carrier preferences were similar for the two groups; the elderly preferred protein-enriched foods based on healthy products that they consumed frequently. Future studies should explore ways to deal with the confusion and distrust regarding functional food within the heterogeneous population of elderly. Copyright © 2014 Elsevier Ltd. All rights reserved.

  12. The hydroxyl-functionalized magnetic particles for purification of glycan-binding proteins.

    Science.gov (United States)

    Sun, Xiuxuan; Yang, Ganglong; Sun, Shisheng; Quan, Rui; Dai, Weiwei; Li, Bin; Chen, Chao; Li, Zheng

    2009-12-01

    Glycan-protein interactions play important biological roles in biological processes. Although there are some methods such as glycan arrays that may elucidate recognition events between carbohydrates and protein as well as screen the important glycan-binding proteins, there is a lack of simple effectively separate method to purify them from complex samples. In proteomics studies, fractionation of samples can help to reduce their complexity and to enrich specific classes of proteins for subsequent downstream analyses. Herein, a rapid simple method for purification of glycan-binding proteins from proteomic samples was developed using hydroxyl-coated magnetic particles coupled with underivatized carbohydrate. Firstly, the epoxy-coated magnetic particles were further hydroxyl functionalized with 4-hydroxybenzhydrazide, then the carbohydrates were efficiently immobilized on hydroxyl functionalized surface of magnetic particles by formation of glycosidic bond with the hemiacetal group at the reducing end of the suitable carbohydrates via condensation. All conditions of this method were optimized. The magnetic particle-carbohydrate conjugates were used to purify the glycan-binding proteins from human serum. The fractionated glycan-binding protein population was displayed by SDS-PAGE. The result showed that the amount of 1 mg magnetic particles coupled with mannose in acetate buffer (pH 5.4) was 10 micromol. The fractionated glycan-binding protein population in human serum could be eluted from the magnetic particle-mannose conjugates by 0.1% SDS. The methodology could work together with the glycan microarrays for screening and purification of the important GBPs from complex protein samples.

  13. Functional properties and Solubility of date seed proteins as ...

    African Journals Online (AJOL)

    Med ali

    2013-03-06

    Mar 6, 2013 ... Key words: Phoenix dactylifera L, date palm seed, fibre, protein, functional properties. INTRODUCTION. The date .... was employed to perform dynamic measurements. ... are likely to be composed of high-molecular weight.

  14. The role of oligomerization and cooperative regulation in protein function: the case of tryptophan synthase.

    Directory of Open Access Journals (Sweden)

    M Qaiser Fatmi

    Full Text Available The oligomerization/co-localization of protein complexes and their cooperative regulation in protein function is a key feature in many biological systems. The synergistic regulation in different subunits often enhances the functional properties of the multi-enzyme complex. The present study used molecular dynamics and Brownian dynamics simulations to study the effects of allostery, oligomerization and intermediate channeling on enhancing the protein function of tryptophan synthase (TRPS. TRPS uses a set of α/β-dimeric units to catalyze the last two steps of L-tryptophan biosynthesis, and the rate is remarkably slower in the isolated monomers. Our work shows that without their binding partner, the isolated monomers are stable and more rigid. The substrates can form fairly stable interactions with the protein in both forms when the protein reaches the final ligand-bound conformations. Our simulations also revealed that the α/β-dimeric unit stabilizes the substrate-protein conformation in the ligand binding process, which lowers the conformation transition barrier and helps the protein conformations shift from an open/inactive form to a closed/active form. Brownian dynamics simulations with a coarse-grained model illustrate how protein conformations affect substrate channeling. The results highlight the complex roles of protein oligomerization and the fine balance between rigidity and dynamics in protein function.

  15. Loss of function of cinnamyl alcohol dehydrogenase 1 leads to unconventional lignin and a temperature-sensitive growth defect in Medicago truncatula

    OpenAIRE

    Zhao, Qiao; Tobimatsu, Yuki; Zhou, Rui; Pattathil, Sivakumar; Gallego-Giraldo, Lina; Fu, Chunxiang; Jackson, Lisa A.; Hahn, Michael G.; Kim, Hoon; Chen, Fang; Ralph, John; Dixon, Richard A.

    2013-01-01

    There is considerable debate over the capacity of the cell wall polymer lignin to incorporate unnatural monomer units. We have identified Tnt1 retrotransposon insertion mutants of barrel medic (Medicago truncatula) that show reduced lignin autofluorescence under UV microscopy and red coloration in interfascicular fibers. The phenotype is caused by insertion of retrotransposons into a gene annotated as encoding cinnamyl alcohol dehydrogenase, here designated M. truncatula CAD1. NMR analysis in...

  16. Composition and functionality of whey protein phospholipid concentrate and delactosed permeate.

    Science.gov (United States)

    Levin, M A; Burrington, K J; Hartel, R W

    2016-09-01

    Whey protein phospholipid concentrate (WPPC) and delactosed permeate (DLP) are 2 coproducts of cheese whey processing that are currently underused. Past research has shown that WPPC and DLP can be used together as a functional dairy ingredient in foods such as ice cream, soup, and caramel. However, the scope of the research has been limited to 1 WPPC supplier. The objective of this research was to fully characterize a range of WPPC. Four WPPC samples and 1 DLP sample were analyzed for chemical composition and functionality. This analysis showed that WPPC composition was highly variable between suppliers and lots. In addition, the functionality of the WPPC varies depending on the supplier and testing pH, and cannot be correlated with fat or protein content because of differences in processing. The addition of DLP to WPPC affects functionality. In general, WPPC has a high water-holding capacity, is relatively heat stable, has low foamability, and does not aid in emulsion stability. The gel strength and texture are highly dependent on the amount of protein. To be able to use these 2 dairy products, the composition and functionality must be fully understood. Copyright © 2016 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  17. Functional dissection of the Hox protein Abdominal-B in Drosophila cell culture

    Energy Technology Data Exchange (ETDEWEB)

    Zhai, Zongzhao [Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beichen West Road, Chaoyang, Beijing 100101 (China); CellNetworks - Cluster of Excellence, Centre for Organismal Studies (COS) Heidelberg, University of Heidelberg, D-69120 Heidelberg (Germany); Graduate School of Chinese Academy of Sciences, Beijing 100039 (China); Yang, Xingke, E-mail: yangxk@ioz.ac.cn [Key Laboratory of the Zoological Systematics and Evolution, Institute of Zoology, Chinese Academy of Sciences, Beichen West Road, Chaoyang, Beijing 100101 (China); Lohmann, Ingrid, E-mail: ilohmann@flydev.org [CellNetworks - Cluster of Excellence, Centre for Organismal Studies (COS) Heidelberg, University of Heidelberg, D-69120 Heidelberg (Germany)

    2011-11-04

    Highlights: Black-Right-Pointing-Pointer ct340 CRM was identified to be the posterior spiracle enhancer of gene cut. Black-Right-Pointing-Pointer ct340 is under the direct transcriptional control of Hox protein Abd-B. Black-Right-Pointing-Pointer An efficient cloning system was developed to assay protein-DNA interaction. Black-Right-Pointing-Pointer New features of Abd-B dependent target gene regulation were detected. -- Abstract: Hox transcription factors regulate the morphogenesis along the anterior-posterior (A/P) body axis through the interaction with small cis-regulatory modules (CRMs) of their target gene, however so far very few Hox CRMs are known and have been analyzed in detail. In this study we have identified a new Hox CRM, ct340, which guides the expression of the cell type specification gene cut (ct) in the posterior spiracle under the direct control of the Hox protein Abdominal-B (Abd-B). Using the ct340 enhancer activity as readout, an efficient cloning system to generate VP16 activation domain fusion protein was developed to unambiguously test protein-DNA interaction in Drosophila cell culture. By functionally dissecting the Abd-B protein, new features of Abd-B dependent target gene regulation were detected. Due to its easy adaptability, this system can be generally used to map functional domains within sequence-specific transcriptional factors in Drosophila cell culture, and thus provide preliminary knowledge of the protein functional domain structure for further in vivo analysis.

  18. Bam35 tectivirus intraviral interaction map unveils new function and localization of phage ORFan proteins.

    Science.gov (United States)

    Berjón-Otero, Mónica; Lechuga, Ana; Mehla, Jitender; Uetz, Peter; Salas, Margarita; Redrejo-Rodríguez, Modesto

    2017-07-26

    Tectiviridae comprises a group of tail-less, icosahedral, membrane-containing bacteriophages that can be divided into two groups by their hosts, either Gram-negative or Gram-positive bacteria. While the first group is composed of PRD1 and nearly identical well characterized lytic viruses, the second one includes more variable temperate phages, like GIL16 or Bam35, whose hosts are Bacillus cereus and related Gram-positive bacteria.In the genome of Bam35, nearly half of the 32 annotated open reading frames (ORFs) have no homologs in databases (ORFans), being putative proteins of unknown function, which hinders the understanding of their biology. With the aim of increasing the knowledge of the viral proteome, we carried out a comprehensive yeast two-hybrid analysis among all the putative proteins encoded by the Bam35 genome. The resulting protein interactome comprises 76 unique interactions among 24 proteins, of which 12 have an unknown function. These results suggested that the P17 protein is the minor capsid protein of Bam35 and P24 is the penton protein, being the latter also supported by iterative threading protein modeling. Moreover, the inner membrane transglycosylase protein P26 could have an additional structural role. We also detected interactions involving non-structural proteins, such as the DNA binding protein P1 and the genome terminal protein (P4), which was confirmed by co-immunoprecipitation of recombinant proteins. Altogether, our results provide a functional view of the Bam35 viral proteome, with a focus on the composition and organization of the viral particle. IMPORTANCE Tail-less viruses of the family Tectiviridae can infect commensal and pathogenic Gram-positive and Gram-negative bacteria. Moreover, they have been proposed to be at the evolutionary origin of several groups of large eukaryotic DNA viruses and self-replicating plasmids. However, due to their ancient origin and complex diversity, many tectiviral proteins are ORFans of unknown

  19. Stapled Voltage-Gated Calcium Channel (CaV) α-Interaction Domain (AID) Peptides Act As Selective Protein-Protein Interaction Inhibitors of CaV Function.

    Science.gov (United States)

    Findeisen, Felix; Campiglio, Marta; Jo, Hyunil; Abderemane-Ali, Fayal; Rumpf, Christine H; Pope, Lianne; Rossen, Nathan D; Flucher, Bernhard E; DeGrado, William F; Minor, Daniel L

    2017-06-21

    For many voltage-gated ion channels (VGICs), creation of a properly functioning ion channel requires the formation of specific protein-protein interactions between the transmembrane pore-forming subunits and cystoplasmic accessory subunits. Despite the importance of such protein-protein interactions in VGIC function and assembly, their potential as sites for VGIC modulator development has been largely overlooked. Here, we develop meta-xylyl (m-xylyl) stapled peptides that target a prototypic VGIC high affinity protein-protein interaction, the interaction between the voltage-gated calcium channel (Ca V ) pore-forming subunit α-interaction domain (AID) and cytoplasmic β-subunit (Ca V β). We show using circular dichroism spectroscopy, X-ray crystallography, and isothermal titration calorimetry that the m-xylyl staples enhance AID helix formation are structurally compatible with native-like AID:Ca V β interactions and reduce the entropic penalty associated with AID binding to Ca V β. Importantly, electrophysiological studies reveal that stapled AID peptides act as effective inhibitors of the Ca V α 1 :Ca V β interaction that modulate Ca V function in an Ca V β isoform-selective manner. Together, our studies provide a proof-of-concept demonstration of the use of protein-protein interaction inhibitors to control VGIC function and point to strategies for improved AID-based Ca V modulator design.

  20. Identification and chromosomal distribution of copia-like retrotransposon sequences in the coffee (Coffea L. genome

    Directory of Open Access Journals (Sweden)

    Juan-Carlos Herrera

    2013-12-01

    Full Text Available The presence of copia-like transposable elements in seven coffee (Coffea sp. species, including the cultivated Coffea arabica, was investigated. The highly conserved domains of the reverse transcriptase (RT present in the copia retrotransposons were amplified by PCR using degenerated primers. Fragments of roughly 300 bp were obtained and the nucleotide sequence was determined for 36 clones, 19 of which showed good quality. The deduced amino acid sequences were compared by multiple alignment analysis. The data suggested two distinct coffee RT groups, designated as CRTG1 and CRTG2. The sequence identities among the groups ranged from 52 to 60% for CRTG1 and 74 to 85% for CRTG2. The multiple alignment analysis revealed that some of the clones in CRTG1 were closely related to the representative elements present in other plant species such as Brassica napus, Populus ciliata and Picea abis. Furthermore, the chromosomal localization of the RT domains in C. arabica and their putative ancestors was investigated by fluorescence in situ hybridization (FISH analysis. FISH signals were observed throughout the chromosomes following a similar dispersed pattern with some localized regions exhibiting higher concentrations of those elements, providing new evidence of their relative conservation and stability in the coffee genome

  1. Simplified Swarm Optimization-Based Function Module Detection in Protein–Protein Interaction Networks

    Directory of Open Access Journals (Sweden)

    Xianghan Zheng

    2017-04-01

    Full Text Available Proteomics research has become one of the most important topics in the field of life science and natural science. At present, research on protein–protein interaction networks (PPIN mainly focuses on detecting protein complexes or function modules. However, existing approaches are either ineffective or incomplete. In this paper, we investigate detection mechanisms of functional modules in PPIN, including open database, existing detection algorithms, and recent solutions. After that, we describe the proposed approach based on the simplified swarm optimization (SSO algorithm and the knowledge of Gene Ontology (GO. The proposed solution implements the SSO algorithm for clustering proteins with similar function, and imports biological gene ontology knowledge for further identifying function complexes and improving detection accuracy. Furthermore, we use four different categories of species datasets for experiment: fruitfly, mouse, scere, and human. The testing and analysis result show that the proposed solution is feasible, efficient, and could achieve a higher accuracy of prediction than existing approaches.

  2. Regulation of membrane protein function by lipid bilayer elasticity—a single molecule technology to measure the bilayer properties experienced by an embedded protein

    DEFF Research Database (Denmark)

    Lundbæk, Jens August

    2008-01-01

    , regulate a number of structurally unrelated proteins in an apparently non-specific manner. It is well known that changes in the physical properties of a lipid bilayer (e.g., thickness or monolayer spontaneous curvature) can affect the function of an embedded protein. However, the role of such changes......-dependent sodium channels, N-type calcium channels and GABAA receptors, it has been shown that membrane protein function in living cells can be regulated by amphiphile induced changes in bilayer elasticity. Using the gramicidin channel as a molecular force transducer, a nanotechnology to measure the elastic...... properties experienced by an embedded protein has been developed. A theoretical and technological framework, to study the regulation of membrane protein function by lipid bilayer elasticity, has been established....

  3. Genomic Enzymology: Web Tools for Leveraging Protein Family Sequence-Function Space and Genome Context to Discover Novel Functions.

    Science.gov (United States)

    Gerlt, John A

    2017-08-22

    The exponentially increasing number of protein and nucleic acid sequences provides opportunities to discover novel enzymes, metabolic pathways, and metabolites/natural products, thereby adding to our knowledge of biochemistry and biology. The challenge has evolved from generating sequence information to mining the databases to integrating and leveraging the available information, i.e., the availability of "genomic enzymology" web tools. Web tools that allow identification of biosynthetic gene clusters are widely used by the natural products/synthetic biology community, thereby facilitating the discovery of novel natural products and the enzymes responsible for their biosynthesis. However, many novel enzymes with interesting mechanisms participate in uncharacterized small-molecule metabolic pathways; their discovery and functional characterization also can be accomplished by leveraging information in protein and nucleic acid databases. This Perspective focuses on two genomic enzymology web tools that assist the discovery novel metabolic pathways: (1) Enzyme Function Initiative-Enzyme Similarity Tool (EFI-EST) for generating sequence similarity networks to visualize and analyze sequence-function space in protein families and (2) Enzyme Function Initiative-Genome Neighborhood Tool (EFI-GNT) for generating genome neighborhood networks to visualize and analyze the genome context in microbial and fungal genomes. Both tools have been adapted to other applications to facilitate target selection for enzyme discovery and functional characterization. As the natural products community has demonstrated, the enzymology community needs to embrace the essential role of web tools that allow the protein and genome sequence databases to be leveraged for novel insights into enzymological problems.

  4. Dynamic functional modules in co-expressed protein interaction networks of dilated cardiomyopathy

    Directory of Open Access Journals (Sweden)

    Oyang Yen-Jen

    2010-10-01

    Full Text Available Abstract Background Molecular networks represent the backbone of molecular activity within cells and provide opportunities for understanding the mechanism of diseases. While protein-protein interaction data constitute static network maps, integration of condition-specific co-expression information provides clues to the dynamic features of these networks. Dilated cardiomyopathy is a leading cause of heart failure. Although previous studies have identified putative biomarkers or therapeutic targets for heart failure, the underlying molecular mechanism of dilated cardiomyopathy remains unclear. Results We developed a network-based comparative analysis approach that integrates protein-protein interactions with gene expression profiles and biological function annotations to reveal dynamic functional modules under different biological states. We found that hub proteins in condition-specific co-expressed protein interaction networks tended to be differentially expressed between biological states. Applying this method to a cohort of heart failure patients, we identified two functional modules that significantly emerged from the interaction networks. The dynamics of these modules between normal and disease states further suggest a potential molecular model of dilated cardiomyopathy. Conclusions We propose a novel framework to analyze the interaction networks in different biological states. It successfully reveals network modules closely related to heart failure; more importantly, these network dynamics provide new insights into the cause of dilated cardiomyopathy. The revealed molecular modules might be used as potential drug targets and provide new directions for heart failure therapy.

  5. Tight junction-associated MARVEL proteins marveld3, tricellulin, and occludin have distinct but overlapping functions.

    Science.gov (United States)

    Raleigh, David R; Marchiando, Amanda M; Zhang, Yong; Shen, Le; Sasaki, Hiroyuki; Wang, Yingmin; Long, Manyuan; Turner, Jerrold R

    2010-04-01

    In vitro studies have demonstrated that occludin and tricellulin are important for tight junction barrier function, but in vivo data suggest that loss of these proteins can be overcome. The presence of a heretofore unknown, yet related, protein could explain these observations. Here, we report marvelD3, a novel tight junction protein that, like occludin and tricellulin, contains a conserved four-transmembrane MARVEL (MAL and related proteins for vesicle trafficking and membrane link) domain. Phylogenetic tree reconstruction; analysis of RNA and protein tissue distribution; immunofluorescent and electron microscopic examination of subcellular localization; characterization of intracellular trafficking, protein interactions, dynamic behavior, and siRNA knockdown effects; and description of remodeling after in vivo immune activation show that marvelD3, occludin, and tricellulin have distinct but overlapping functions at the tight junction. Although marvelD3 is able to partially compensate for occludin or tricellulin loss, it cannot fully restore function. We conclude that marvelD3, occludin, and tricellulin define the tight junction-associated MARVEL protein family. The data further suggest that these proteins are best considered as a group with both redundant and unique contributions to epithelial function and tight junction regulation.

  6. Research of the complex of functional and technological properties of animal protein

    Directory of Open Access Journals (Sweden)

    Олена Борисівна Дроменко

    2016-12-01

    Full Text Available The analysis of the results of analytical and practical research of the complex of functional and technological properties of animal protein Gelexcel A-95 as the basis for creation of complex functional additives is shown. The regularities of their changes are determined depending on technological factors. Rational parameters of animal protein rehydration, gelation conditions, emulsification for further use in the process of production of meat products are identified

  7. Functional NifD-K fusion protein in Azotobacter vinelandii is a homodimeric complex equivalent to the native heterotetrameric MoFe protein

    International Nuclear Information System (INIS)

    Lahiri, Surobhi; Pulakat, Lakshmi; Gavini, Nara

    2005-01-01

    The MoFe protein of the complex metalloenzyme nitrogenase folds as a heterotetramer containing two copies each of the homologous α and β subunits, encoded by the nifD and the nifK genes respectively. Recently, the functional expression of a fusion NifD-K protein of nitrogenase was demonstrated in Azotobacter vinelandii, strongly implying that the MoFe protein is flexible as it could accommodate major structural changes, yet remain functional [M.H. Suh, L. Pulakat, N. Gavini, J. Biol. Chem. 278 (2003) 5353-5360]. This finding led us to further explore the type of interaction between the fused MoFe protein units. We aimed to determine whether an interaction exists between the two fusion MoFe proteins to form a homodimer that is equivalent to native heterotetrameric MoFe protein. Using the Bacteriomatch Two-Hybrid System, translationally fused constructs of NifD-K (fusion) with the full-length λCI of the pBT bait vector and also NifD-K (fusion) with the N-terminal α-RNAP of the pTRG target vector were made. To compare the extent of interaction between the fused NifD-K proteins to that of the β-β interactions in the native MoFe protein, we proceeded to generate translationally fused constructs of NifK with the α-RNAP of the pTRG vector and λCI protein of the pBT vector. The strength of the interaction between the proteins in study was determined by measuring the β-galactosidase activity and extent of ampicillin resistance of the colonies expressing these proteins. This analysis demonstrated that direct protein-protein interaction exists between NifD-K fusion proteins, suggesting that they exist as homodimers. As the interaction takes place at the β-interfaces of the NifD-K fusion proteins, we propose that these homodimers of NifD-K fusion protein may function in a similar manner as that of the heterotetrameric native MoFe protein. The observation that the extent of protein-protein interaction between the β-subunits of the native MoFe protein in Bacterio

  8. Thick Filament Protein Network, Functions, and Disease Association.

    Science.gov (United States)

    Wang, Li; Geist, Janelle; Grogan, Alyssa; Hu, Li-Yen R; Kontrogianni-Konstantopoulos, Aikaterini

    2018-03-13

    Sarcomeres consist of highly ordered arrays of thick myosin and thin actin filaments along with accessory proteins. Thick filaments occupy the center of sarcomeres where they partially overlap with thin filaments. The sliding of thick filaments past thin filaments is a highly regulated process that occurs in an ATP-dependent manner driving muscle contraction. In addition to myosin that makes up the backbone of the thick filament, four other proteins which are intimately bound to the thick filament, myosin binding protein-C, titin, myomesin, and obscurin play important structural and regulatory roles. Consistent with this, mutations in the respective genes have been associated with idiopathic and congenital forms of skeletal and cardiac myopathies. In this review, we aim to summarize our current knowledge on the molecular structure, subcellular localization, interacting partners, function, modulation via posttranslational modifications, and disease involvement of these five major proteins that comprise the thick filament of striated muscle cells. © 2018 American Physiological Society. Compr Physiol 8:631-709, 2018. Copyright © 2018 American Physiological Society. All rights reserved.

  9. From Green to Blue: Site-Directed Mutagenesis of the Green Fluorescent Protein to Teach Protein Structure-Function Relationships

    Science.gov (United States)

    Giron, Maria D.; Salto, Rafael

    2011-01-01

    Structure-function relationship studies in proteins are essential in modern Cell Biology. Laboratory exercises that allow students to familiarize themselves with basic mutagenesis techniques are essential in all Genetic Engineering courses to teach the relevance of protein structure. We have implemented a laboratory course based on the…

  10. Mutagenesis Objective Search and Selection Tool (MOSST: an algorithm to predict structure-function related mutations in proteins

    Directory of Open Access Journals (Sweden)

    Asenjo Juan A

    2011-04-01

    Full Text Available Abstract Background Functionally relevant artificial or natural mutations are difficult to assess or predict if no structure-function information is available for a protein. This is especially important to correctly identify functionally significant non-synonymous single nucleotide polymorphisms (nsSNPs or to design a site-directed mutagenesis strategy for a target protein. A new and powerful methodology is proposed to guide these two decision strategies, based only on conservation rules of physicochemical properties of amino acids extracted from a multiple alignment of a protein family where the target protein belongs, with no need of explicit structure-function relationships. Results A statistical analysis is performed over each amino acid position in the multiple protein alignment, based on different amino acid physical or chemical characteristics, including hydrophobicity, side-chain volume, charge and protein conformational parameters. The variances of each of these properties at each position are combined to obtain a global statistical indicator of the conservation degree of each property. Different types of physicochemical conservation are defined to characterize relevant and irrelevant positions. The differences between statistical variances are taken together as the basis of hypothesis tests at each position to search for functionally significant mutable sites and to identify specific mutagenesis targets. The outcome is used to statistically predict physicochemical consensus sequences based on different properties and to calculate the amino acid propensities at each position in a given protein. Hence, amino acid positions are identified that are putatively responsible for function, specificity, stability or binding interactions in a family of proteins. Once these key functional positions are identified, position-specific statistical distributions are applied to divide the 20 common protein amino acids in each position of the protein

  11. Functional divergence outlines the evolution of novel protein ...

    Indian Academy of Sciences (India)

    2013-10-01

    Oct 1, 2013 ... identified a number of vital amino acid sites which contribute to predicted functional diversity. We have ... Taking this into account, in this study we looked into the possibility of ... to the structure of NifH protein and solubility accessibility of ..... ment through sequence weighting, position-specific gap penalties.

  12. Identification of functional candidates amongst hypothetical proteins of Mycobacterium leprae Br4923, a causative agent of leprosy.

    Science.gov (United States)

    Naqvi, Ahmad Abu Turab; Ahmad, Faizan; Hassan, Md Imtaiyaz

    2015-01-01

    Mycobacterium leprae is an intracellular obligate parasite that causes leprosy in humans, and it leads to the destruction of peripheral nerves and skin deformation. Here, we report an extensive analysis of the hypothetical proteins (HPs) from M. leprae strain Br4923, assigning their functions to better understand the mechanism of pathogenesis and to search for potential therapeutic interventions. The genome of M. leprae encodes 1604 proteins, of which the functions of 632 are not known (HPs). In this paper, we predicted the probable functions of 312 HPs. First, we classified all HPs into families and subfamilies on the basis of sequence similarity, followed by domain assignment, which provides many clues for their possible function. However, the functions of 320 proteins were not predicted because of low sequence similarity with proteins of known function. Annotated HPs were categorized into enzymes, binding proteins, transporters, and proteins involved in cellular processes. We found several novel proteins whose functions were unknown for M. leprae. These proteins have a requisite association with bacterial virulence and pathogenicity. Finally, our sequence-based analysis will be helpful for further validation and the search for potential drug targets while developing effective drugs to cure leprosy.

  13. AptRank: an adaptive PageRank model for protein function prediction on   bi-relational graphs.

    Science.gov (United States)

    Jiang, Biaobin; Kloster, Kyle; Gleich, David F; Gribskov, Michael

    2017-06-15

    Diffusion-based network models are widely used for protein function prediction using protein network data and have been shown to outperform neighborhood-based and module-based methods. Recent studies have shown that integrating the hierarchical structure of the Gene Ontology (GO) data dramatically improves prediction accuracy. However, previous methods usually either used the GO hierarchy to refine the prediction results of multiple classifiers, or flattened the hierarchy into a function-function similarity kernel. No study has taken the GO hierarchy into account together with the protein network as a two-layer network model. We first construct a Bi-relational graph (Birg) model comprised of both protein-protein association and function-function hierarchical networks. We then propose two diffusion-based methods, BirgRank and AptRank, both of which use PageRank to diffuse information on this two-layer graph model. BirgRank is a direct application of traditional PageRank with fixed decay parameters. In contrast, AptRank utilizes an adaptive diffusion mechanism to improve the performance of BirgRank. We evaluate the ability of both methods to predict protein function on yeast, fly and human protein datasets, and compare with four previous methods: GeneMANIA, TMC, ProteinRank and clusDCA. We design four different validation strategies: missing function prediction, de novo function prediction, guided function prediction and newly discovered function prediction to comprehensively evaluate predictability of all six methods. We find that both BirgRank and AptRank outperform the previous methods, especially in missing function prediction when using only 10% of the data for training. The MATLAB code is available at https://github.rcac.purdue.edu/mgribsko/aptrank . gribskov@purdue.edu. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  14. Functionalization of 3D scaffolds with protein-releasing biomaterials for intracellular delivery.

    Science.gov (United States)

    Seras-Franzoso, Joaquin; Steurer, Christoph; Roldán, Mònica; Vendrell, Meritxell; Vidaurre-Agut, Carla; Tarruella, Anna; Saldaña, Laura; Vilaboa, Nuria; Parera, Marc; Elizondo, Elisa; Ratera, Imma; Ventosa, Nora; Veciana, Jaume; Campillo-Fernández, Alberto J; García-Fruitós, Elena; Vázquez, Esther; Villaverde, Antonio

    2013-10-10

    Appropriate combinations of mechanical and biological stimuli are required to promote proper colonization of substrate materials in regenerative medicine. In this context, 3D scaffolds formed by compatible and biodegradable materials are under continuous development in an attempt to mimic the extracellular environment of mammalian cells. We have here explored how novel 3D porous scaffolds constructed by polylactic acid, polycaprolactone or chitosan can be decorated with bacterial inclusion bodies, submicron protein particles formed by releasable functional proteins. A simple dipping-based decoration method tested here specifically favors the penetration of the functional particles deeper than 300μm from the materials' surface. The functionalized surfaces support the intracellular delivery of biologically active proteins to up to more than 80% of the colonizing cells, a process that is slightly influenced by the chemical nature of the scaffold. The combination of 3D soft scaffolds and protein-based sustained release systems (Bioscaffolds) offers promise in the fabrication of bio-inspired hybrid matrices for multifactorial control of cell proliferation in tissue engineering under complex architectonic setting-ups. © 2013.

  15. Structure/Function of the Novel Proteins LCIB and LCIC in the Chlamydomonas CCM

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Spalding H. [Iowa State Univ., Ames, IA (United States). Dept. of Genetics, Development and Cell Biology

    2017-05-09

    The goal of this project was to investigate the function of two novel proteins, LCIB and LCIC, which together form an essential protein complex that is required for function of a carbon-dioxide-concentrating mechanism (CCM) required by microalgae to grow in environments where carbon dioxide levels are at or below air equilibration levels.

  16. An attempt to understand kidney's protein handling function by comparing plasma and urine proteomes.

    Directory of Open Access Journals (Sweden)

    Lulu Jia

    Full Text Available BACKGROUND: With the help of proteomics technology, the human plasma and urine proteomes, which closely represent the protein compositions of the input and output of the kidney, respectively, have been profiled in much greater detail by different research teams. Many datasets have been accumulated to form "reference profiles" of the plasma and urine proteomes. Comparing these two proteomes may help us understand the protein handling aspect of kidney function in a way, however, which has been unavailable until the recent advances in proteomics technology. METHODOLOGY/PRINCIPAL FINDINGS: After removing secreted proteins downstream of the kidney, 2611 proteins in plasma and 1522 in urine were identified with high confidence and compared based on available proteomic data to generate three subproteomes, the plasma-only subproteome, the plasma-and-urine subproteome, and the urine-only subproteome, and they correspond to three groups of proteins that are handled in three different ways by the kidney. The available experimental molecular weights of the proteins in the three subproteomes were collected and analyzed. Since the functions of the overrepresented proteins in the plasma-and-urine subproteome are probably the major functions that can be routinely regulated by excretion from the kidney in physiological conditions, Gene Ontology term enrichment in the plasma-and-urine subproteome versus the whole plasma proteome was analyzed. Protease activity, calcium and growth factor binding proteins, and coagulation and immune response-related proteins were found to be enriched. CONCLUSION/SIGNIFICANCE: The comparison method described in this paper provides an illustration of a new approach for studying organ functions with a proteomics methodology. Because of its distinctive input (plasma and output (urine, it is reasonable to predict that the kidney will be the first organ whose functions are further elucidated by proteomic methods in the near future. It

  17. Composition, structure and functional properties of protein concentrates and isolates produced from walnut (Juglans regia L.).

    Science.gov (United States)

    Mao, Xiaoying; Hua, Yufei

    2012-01-01

    In this study, composition, structure and the functional properties of protein concentrate (WPC) and protein isolate (WPI) produced from defatted walnut flour (DFWF) were investigated. The results showed that the composition and structure of walnut protein concentrate (WPC) and walnut protein isolate (WPI) were significantly different. The molecular weight distribution of WPI was uniform and the protein composition of DFWF and WPC was complex with the protein aggregation. H(0) of WPC was significantly higher (p structure of WPI was similar to WPC. WPI showed big flaky plate like structures; whereas WPC appeared as a small flaky and more compact structure. The most functional properties of WPI were better than WPC. In comparing most functional properties of WPI and WPC with soybean protein concentrate and isolate, WPI and WPC showed higher fat absorption capacity (FAC). Emulsifying properties and foam properties of WPC and WPI in alkaline pH were comparable with that of soybean protein concentrate and isolate. Walnut protein concentrates and isolates can be considered as potential functional food ingredients.

  18. Nck adapter proteins: functional versatility in T cells

    Directory of Open Access Journals (Sweden)

    Janssen Ottmar

    2009-02-01

    Full Text Available Abstract Nck is a ubiquitously expressed adapter protein that is almost exclusively built of one SH2 domain and three SH3 domains. The two isoproteins of Nck are functionally redundant in many aspects and differ in only few amino acids that are mostly located in the linker regions between the interaction modules. Nck proteins connect receptor and non-receptor tyrosine kinases to the machinery of actin reorganisation. Thereby, Nck regulates activation-dependent processes during cell polarisation and migration and plays a crucial role in the signal transduction of a variety of receptors including for instance PDGF-, HGF-, VEGF- and Ephrin receptors. In most cases, the SH2 domain mediates binding to the phosphorylated receptor or associated phosphoproteins, while SH3 domain interactions lead to the formation of larger protein complexes. In T lymphocytes, Nck plays a pivotal role in the T cell receptor (TCR-induced reorganisation of the actin cytoskeleton and the formation of the immunological synapse. However, in this context, two different mechanisms and adapter complexes are discussed. In the first scenario, dependent on an activation-induced conformational change in the CD3ε subunits, a direct binding of Nck to components of the TCR/CD3 complex was shown. In the second scenario, Nck is recruited to the TCR complex via phosphorylated Slp76, another central constituent of the membrane proximal activation complex. Over the past years, a large number of putative Nck interactors have been identified in different cellular systems that point to diverse additional functions of the adapter protein, e.g. in the control of gene expression and proliferation.

  19. Spatial separation and bidirectional trafficking of proteins using a multi-functional reporter

    Directory of Open Access Journals (Sweden)

    Klaubert Dieter H

    2008-04-01

    Full Text Available Abstract Background The ability to specifically label proteins within living cells can provide information about their dynamics and function. To study a membrane protein, we fused a multi-functional reporter protein, HaloTag®, to the extracellular domain of a truncated integrin. Results Using the HaloTag technology, we could study the localization, trafficking and processing of an integrin-HaloTag fusion, which we showed had cellular dynamics consistent with native integrins. By labeling live cells with different fluorescent impermeable and permeable ligands, we showed spatial separation of plasma membrane and internal pools of the integrin-HaloTag fusion, and followed these protein pools over time to study bi-directional trafficking. In addition to combining the HaloTag reporter protein with different fluorophores, we also employed an affinity tag to achieve cell capture. Conclusion The HaloTag technology was used successfully to study expression, trafficking, spatial separation and real-time translocation of an integrin-HaloTag fusion, thereby demonstrating that this technology can be a powerful tool to investigate membrane protein biology in live cells.

  20. Evaluation of epididymal function through specific protein on spermatozoa.

    Science.gov (United States)

    Del Río, A G; De Sánchez, L Z; Sirena, A

    1984-01-01

    Investigations were focused on the characterization of specific epididymal proteins on the human spermatozoa as a representative parameter for epididymal function. An easy and attainable method, suitable for investigators and clinical use, is proposed in this article.

  1. Roles of water in protein structure and function studied by molecular liquid theory.

    Science.gov (United States)

    Imai, Takashi

    2009-01-01

    The roles of water in the structure and function of proteins have not been completely elucidated. Although molecular simulation has been widely used for the investigation of protein structure and function, it is not always useful for elucidating the roles of water because the effect of water ranges from atomic to thermodynamic level. The three-dimensional reference interaction site model (3D-RISM) theory, which is a statistical-mechanical theory of molecular liquids, can yield the solvation structure at the atomic level and calculate the thermodynamic quantities from the intermolecular potentials. In the last few years, the author and coworkers have succeeded in applying the 3D-RISM theory to protein aqueous solution systems and demonstrated that the theory is useful for investigating the roles of water. This article reviews some of the recent applications and findings, which are concerned with molecular recognition by protein, protein folding, and the partial molar volume of protein which is related to the pressure effect on protein.

  2. Fast dynamics perturbation analysis for prediction of protein functional sites

    Directory of Open Access Journals (Sweden)

    Cohn Judith D

    2008-01-01

    Full Text Available Abstract Background We present a fast version of the dynamics perturbation analysis (DPA algorithm to predict functional sites in protein structures. The original DPA algorithm finds regions in proteins where interactions cause a large change in the protein conformational distribution, as measured using the relative entropy Dx. Such regions are associated with functional sites. Results The Fast DPA algorithm, which accelerates DPA calculations, is motivated by an empirical observation that Dx in a normal-modes model is highly correlated with an entropic term that only depends on the eigenvalues of the normal modes. The eigenvalues are accurately estimated using first-order perturbation theory, resulting in a N-fold reduction in the overall computational requirements of the algorithm, where N is the number of residues in the protein. The performance of the original and Fast DPA algorithms was compared using protein structures from a standard small-molecule docking test set. For nominal implementations of each algorithm, top-ranked Fast DPA predictions overlapped the true binding site 94% of the time, compared to 87% of the time for original DPA. In addition, per-protein recall statistics (fraction of binding-site residues that are among predicted residues were slightly better for Fast DPA. On the other hand, per-protein precision statistics (fraction of predicted residues that are among binding-site residues were slightly better using original DPA. Overall, the performance of Fast DPA in predicting ligand-binding-site residues was comparable to that of the original DPA algorithm. Conclusion Compared to the original DPA algorithm, the decreased run time with comparable performance makes Fast DPA well-suited for implementation on a web server and for high-throughput analysis.

  3. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94% appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1 a dimer interface motif found in voltage-gated chloride channels, (2 a proton transfer motif found in heme-copper oxidases, and (3 a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  4. SVM-Prot 2016: A Web-Server for Machine Learning Prediction of Protein Functional Families from Sequence Irrespective of Similarity.

    Science.gov (United States)

    Li, Ying Hong; Xu, Jing Yu; Tao, Lin; Li, Xiao Feng; Li, Shuang; Zeng, Xian; Chen, Shang Ying; Zhang, Peng; Qin, Chu; Zhang, Cheng; Chen, Zhe; Zhu, Feng; Chen, Yu Zong

    2016-01-01

    Knowledge of protein function is important for biological, medical and therapeutic studies, but many proteins are still unknown in function. There is a need for more improved functional prediction methods. Our SVM-Prot web-server employed a machine learning method for predicting protein functional families from protein sequences irrespective of similarity, which complemented those similarity-based and other methods in predicting diverse classes of proteins including the distantly-related proteins and homologous proteins of different functions. Since its publication in 2003, we made major improvements to SVM-Prot with (1) expanded coverage from 54 to 192 functional families, (2) more diverse protein descriptors protein representation, (3) improved predictive performances due to the use of more enriched training datasets and more variety of protein descriptors, (4) newly integrated BLAST analysis option for assessing proteins in the SVM-Prot predicted functional families that were similar in sequence to a query protein, and (5) newly added batch submission option for supporting the classification of multiple proteins. Moreover, 2 more machine learning approaches, K nearest neighbor and probabilistic neural networks, were added for facilitating collective assessment of protein functions by multiple methods. SVM-Prot can be accessed at http://bidd2.nus.edu.sg/cgi-bin/svmprot/svmprot.cgi.

  5. Chimeras taking shape: Potential functions of proteins encoded by chimeric RNA transcripts

    Science.gov (United States)

    Frenkel-Morgenstern, Milana; Lacroix, Vincent; Ezkurdia, Iakes; Levin, Yishai; Gabashvili, Alexandra; Prilusky, Jaime; del Pozo, Angela; Tress, Michael; Johnson, Rory; Guigo, Roderic; Valencia, Alfonso

    2012-01-01

    Chimeric RNAs comprise exons from two or more different genes and have the potential to encode novel proteins that alter cellular phenotypes. To date, numerous putative chimeric transcripts have been identified among the ESTs isolated from several organisms and using high throughput RNA sequencing. The few corresponding protein products that have been characterized mostly result from chromosomal translocations and are associated with cancer. Here, we systematically establish that some of the putative chimeric transcripts are genuinely expressed in human cells. Using high throughput RNA sequencing, mass spectrometry experimental data, and functional annotation, we studied 7424 putative human chimeric RNAs. We confirmed the expression of 175 chimeric RNAs in 16 human tissues, with an abundance varying from 0.06 to 17 RPKM (Reads Per Kilobase per Million mapped reads). We show that these chimeric RNAs are significantly more tissue-specific than non-chimeric transcripts. Moreover, we present evidence that chimeras tend to incorporate highly expressed genes. Despite the low expression level of most chimeric RNAs, we show that 12 novel chimeras are translated into proteins detectable in multiple shotgun mass spectrometry experiments. Furthermore, we confirm the expression of three novel chimeric proteins using targeted mass spectrometry. Finally, based on our functional annotation of exon organization and preserved domains, we discuss the potential features of chimeric proteins with illustrative examples and suggest that chimeras significantly exploit signal peptides and transmembrane domains, which can alter the cellular localization of cognate proteins. Taken together, these findings establish that some chimeric RNAs are translated into potentially functional proteins in humans. PMID:22588898

  6. Protein denaturation and functional properties of Lenient Steam Injection heat treated whey protein concentrate

    DEFF Research Database (Denmark)

    Dickow, Jonatan Ahrens; Kaufmann, Niels; Wiking, Lars

    2012-01-01

    Whey protein concentrate (WPC) was heat treated by use of the novel heat treatment method of Lenient Steam Injection (LSI) to elucidate new functional properties in relation to heat-induced gelation of heat treated WPC. Denaturation was measured by both DSC and FPLC, and the results of the two...... methods were highly correlated. Temperatures of up to 90 °C were applicable using LSI, whereas only 68 °C could be reached by plate heat exchange before coagulation/fouling. Denaturation of whey proteins increased with increasing heat treatment temperature up to a degree of 30–35% denaturation at 90 °C...

  7. Lacritin and other new proteins of the lacrimal functional unit.

    Science.gov (United States)

    McKown, Robert L; Wang, Ningning; Raab, Ronald W; Karnati, Roy; Zhang, Yinghui; Williams, Patricia B; Laurie, Gordon W

    2009-05-01

    The lacrimal functional unit (LFU) is defined by the 2007 International Dry Eye WorkShop as 'an integrated system comprising the lacrimal glands, ocular surface (cornea, conjunctiva and meibomian glands) and lids, and the sensory and motor nerves that connect them'. The LFU maintains a healthy ocular surface primarily through a properly functioning tear film that provides protection, lubrication, and an environment for corneal epithelial cell renewal. LFU cells express thousands of proteins. Over 200 new LFU proteins have been discovered in the last decade. Lacritin is a new LFU-specific growth factor in human tears that flows through ducts to target corneal epithelial cells on the ocular surface. When applied topically in rabbits, lacritin appears to increase the volume of basal tear secretion. Lacritin is one of only a handful of tear proteins preliminarily reported to be downregulated in blepharitis and in two dry eye syndromes. Computational analysis predicts an ordered C-terminal domain that binds the corneal epithelial cell surface proteoglycan syndecan-1 (SDC1) and is required for lacritin's low nanomolar mitogenic activity. The lacritin-binding site on the N-terminus of SDC1 is exposed by heparanase. Heparanase is constitutively expressed by the corneal epithelium and appears to be a normal constituent of tears. Binding triggers rapid signaling to downstream NFAT and mTOR. A wealth of other new proteins, originally designated as hypothetical when first identified by genomic sequencing, are expressed by the human LFU including: ALS2CL, ARHGEF19, KIAA1109, PLXNA1, POLG, WIPI1 and ZMIZ2. Their demonstrated or implied roles in human genetic disease or basic cellular functions are fuel for new investigation. Addressing topical areas in ocular surface physiology with new LFU proteins may reveal interesting new biological mechanisms and help get to the heart of ocular surface dysfunction.

  8. Production of Lupinus angustifolius protein hydrolysates with improved functional properties

    Directory of Open Access Journals (Sweden)

    Millán, Francisco

    2005-06-01

    Full Text Available Protein hydrolysates wer e obtained from lupin flour and from the purified globulin α -conglutin, and their functional properties were studied. Hydrolysis with alcalase for 60 minutes yielded degrees of hydrolysis ranging from 4 % to 11 % for lupin flour, and from 4 % to 13% for α -conglutin. Protein solubility, oil absorption, foam capacity and stability, emulsifying activity, and emulsion stability of hydrolysates with 6% degree of hydrolysis were determined and compared with the properties of the original flour. The protein hydrolysates showed better functional properties than the original proteins. Most importantly, the solubility of the α -conglutin and L. angustifolius flour hydrolysates was increased by 43 % and 52 %, respectively. Thus, lupin seed protein hydrolysates have improved functional properties and could be used in the elaboration of a variety of products such as breads, cakes, and salad dressings.Se obtuvieron hidrolizados proteicos de la harina del altramuz y de la globulina α - conglutina purificada y se estudiaron sus propiedades funcionales. La hidrólisis con alcalasa durante 60 minutos produjo hidrolizados con grados de hidrólisis entre el 4 % y el 11 % para la harina y entre el 4 % y el 13 % para la α - conglutina. Se estudió en un hidrolizado con un 6 % de grado de hidrólisis la solubilidad proteica, absorción de aceite, capacidad y estabilidad espumante y actividad y estabilidad emulsificante. Los hidrolizados proteicos mostraron mejores propiedades funcionales que las proteínas originales. Más aún, la solubilidad de los hidrolizados de α - conglutina y la harina se incrementó en un 43 % y 52 % respectivamente. Así pues, hidrolizados de proteínas de semilla de lupino presentan mejores propiedades funcionales y podrían usarse en la elaboración de productos como pan, dulces, salsas o cremas.

  9. Functional improvement of antibody fragments using a novel phage coat protein III fusion system

    DEFF Research Database (Denmark)

    Jensen, Kim Bak; Larsen, Martin; Pedersen, Jesper Søndergaard

    2002-01-01

    Functional expressions of proteins often depend on the presence of host specific factors. Frequently recombinant expression strategies of proteins in foreign hosts, such as bacteria, have been associated with poor yields or significant loss of functionality. Improvements in the performance of het......(s) of the filamentous phage coat protein III. Furthermore, it will be shown that the observed effect is neither due to improved stability nor increased avidity....

  10. Yellow Mealworm Protein for Food Purposes - Extraction and Functional Properties.

    Directory of Open Access Journals (Sweden)

    Xue Zhao

    Full Text Available A protocol for extraction of yellow mealworm larvae proteins was established, conditions were evaluated and the resulting protein extract was characterised. The freeze-dried yellow mealworm larvae contained around 33% fat, 51% crude protein and 43% true protein on a dry matter basis. The true protein content of the protein extract was about 75%, with an extraction rate of 70% under optimised extraction conditions using 0.25 M NaOH, a NaOH solution:ethanol defatted worm ratio of 15:1 mL/g, 40°C for 1 h and extraction twice. The protein extract was a good source of essential amino acids. The lowest protein solubility in distilled water solution was found between pH 4 and 5, and increased with either increasing or decreasing pH. Lower solubility was observed in 0.5 M NaCl solution compared with distilled water. The rheological tests indicated that temperature, sample concentration, addition of salt and enzyme, incubation time and pH alterations influenced the elastic modulus of yellow mealworm protein extract (YMPE. These results demonstrate that the functional properties of YMPE can be modified for different food applications.

  11. Yellow Mealworm Protein for Food Purposes - Extraction and Functional Properties

    Science.gov (United States)

    Zhao, Xue; Vázquez-Gutiérrez, José Luis; Johansson, Daniel P.; Landberg, Rikard; Langton, Maud

    2016-01-01

    A protocol for extraction of yellow mealworm larvae proteins was established, conditions were evaluated and the resulting protein extract was characterised. The freeze-dried yellow mealworm larvae contained around 33% fat, 51% crude protein and 43% true protein on a dry matter basis. The true protein content of the protein extract was about 75%, with an extraction rate of 70% under optimised extraction conditions using 0.25 M NaOH, a NaOH solution:ethanol defatted worm ratio of 15:1 mL/g, 40°C for 1 h and extraction twice. The protein extract was a good source of essential amino acids. The lowest protein solubility in distilled water solution was found between pH 4 and 5, and increased with either increasing or decreasing pH. Lower solubility was observed in 0.5 M NaCl solution compared with distilled water. The rheological tests indicated that temperature, sample concentration, addition of salt and enzyme, incubation time and pH alterations influenced the elastic modulus of yellow mealworm protein extract (YMPE). These results demonstrate that the functional properties of YMPE can be modified for different food applications. PMID:26840533

  12. Characterisation and functional properties of watermelon (Citrullus lanatus) seed proteins.

    Science.gov (United States)

    Wani, Ali Abas; Sogi, Dalbir Singh; Singh, Preeti; Wani, Idrees Ahmed; Shivhare, Uma S

    2011-01-15

    People in developing countries depend largely on non-conventional protein sources to augment the availability of proteins in their diets. Watermelon seed meal is reported to contain an adequate amount of nutritional proteins that could be extracted for use as nutritional ingredients in food products. Osborne classification showed that globulin was the major protein (≥500 g kg (-1)) present in watermelon seed meal, followed by albumin and glutelin. Sodium dodecyl sulfate polyacrylamide gel electrophoresis indicated that the polypeptides had low molecular weights ranging from 35 to 47 kDa. Isoelectric focusing revealed that the isoelectric point of most proteins was in the acidic range 4-6. These proteins are rich in aspartic acid, glutamic acid and serine. An increase in pH (5-9) significantly (P watermelon protein fractions respectively, while surface hydrophobicity ranged from 126.4 to 173.2 and from 125.8 to 169.3 respectively. The foaming and emulsifying properties of albumin were better than those of the other proteins studied. The good nutritional and functional properties of watermelon seed meal proteins suggest their potential use in food formulations. Copyright © 2010 Society of Chemical Industry.

  13. Programmable release of multiple protein drugs from aptamer-functionalized hydrogels via nucleic acid hybridization.

    Science.gov (United States)

    Battig, Mark R; Soontornworajit, Boonchoy; Wang, Yong

    2012-08-01

    Polymeric delivery systems have been extensively studied to achieve localized and controlled release of protein drugs. However, it is still challenging to control the release of multiple protein drugs in distinct stages according to the progress of disease or treatment. This study successfully demonstrates that multiple protein drugs can be released from aptamer-functionalized hydrogels with adjustable release rates at predetermined time points using complementary sequences (CSs) as biomolecular triggers. Because both aptamer-protein interactions and aptamer-CS hybridization are sequence-specific, aptamer-functionalized hydrogels constitute a promising polymeric delivery system for the programmable release of multiple protein drugs to treat complex human diseases.

  14. Purification and Initial Functions of Sex-Specific Storage Protein 2 in Bombyx mori.

    Science.gov (United States)

    Chen, Jianqing; Shu, Tejun; Chen, Jian; Ye, Man; Lv, Zhengbing; Nie, Zuoming; Gai, Qijing; Yu, Wei; Zhang, Yaozhou

    2015-08-01

    In this study, we identified a heat-resistant protein from the chrysalis stage of the silkworm which we named sex-specific storage protein 2 (SSP2). This protein was stable even at 80 °C, and has an amino acid sequence that is 90.65 % homologous to SP2. We utilized the heat-resistant characteristics of SSP2 to purify the protein and maintain its biological activity. In addition, using flow cytometry and the MTT assay, we found that SSP2 had anti-apoptotic effects on BmN cells, and that SSP2 could also inhibit cell apoptosis induced by chemical factors. These results suggest that SSP2 has a cell-protective function, and provides a basis for future work on the function of storage proteins in silkworm.

  15. Structure-based functional annotation of putative conserved proteins having lyase activity from Haemophilus influenzae.

    Science.gov (United States)

    Shahbaaz, Mohd; Ahmad, Faizan; Imtaiyaz Hassan, Md

    2015-06-01

    Haemophilus influenzae is a small pleomorphic Gram-negative bacteria which causes several chronic diseases, including bacteremia, meningitis, cellulitis, epiglottitis, septic arthritis, pneumonia, and empyema. Here we extensively analyzed the sequenced genome of H. influenzae strain Rd KW20 using protein family databases, protein structure prediction, pathways and genome context methods to assign a precise function to proteins whose functions are unknown. These proteins are termed as hypothetical proteins (HPs), for which no experimental information is available. Function prediction of these proteins would surely be supportive to precisely understand the biochemical pathways and mechanism of pathogenesis of Haemophilus influenzae. During the extensive analysis of H. influenzae genome, we found the presence of eight HPs showing lyase activity. Subsequently, we modeled and analyzed three-dimensional structure of all these HPs to determine their functions more precisely. We found these HPs possess cystathionine-β-synthase, cyclase, carboxymuconolactone decarboxylase, pseudouridine synthase A and C, D-tagatose-1,6-bisphosphate aldolase and aminodeoxychorismate lyase-like features, indicating their corresponding functions in the H. influenzae. Lyases are actively involved in the regulation of biosynthesis of various hormones, metabolic pathways, signal transduction, and DNA repair. Lyases are also considered as a key player for various biological processes. These enzymes are critically essential for the survival and pathogenesis of H. influenzae and, therefore, these enzymes may be considered as a potential target for structure-based rational drug design. Our structure-function relationship analysis will be useful to search and design potential lead molecules based on the structure of these lyases, for drug design and discovery.

  16. Chronic dietary supplementation with soy protein improves muscle function in rats.

    Directory of Open Access Journals (Sweden)

    Ramzi J Khairallah

    Full Text Available Athletes as well as elderly or hospitalized patients use dietary protein supplementation to maintain or grow skeletal muscle. It is recognized that high quality protein is needed for muscle accretion, and can be obtained from both animal and plant-based sources. There is interest to understand whether these sources differ in their ability to maintain or stimulate muscle growth and function. In this study, baseline muscle performance was assessed in 50 adult Sprague-Dawley rats after which they were assigned to one of five semi-purified "Western" diets (n = 10/group differing only in protein source, namely 19 kcal% protein from either milk protein isolate (MPI, whey protein isolate (WPI, soy protein isolate (SPI, soy protein concentrate (SPC or enzyme-treated soy protein (SPE. The diets were fed for 8 weeks at which point muscle performance testing was repeated and tissues were collected for analysis. There was no significant difference in food consumption or body weights over time between the diet groups nor were there differences in terminal organ and muscle weights or in serum lipids, creatinine or myostatin. Compared with MPI-fed rats, rats fed WPI and SPC displayed a greater maximum rate of contraction using the in vivo measure of muscle performance (p<0.05 with increases ranging from 13.3-27.5% and 22.8-29.5%, respectively at 60, 80, 100 and 150 Hz. When the maximum force was normalized to body weight, SPC-fed rats displayed increased force compared to MPI (p<0.05, whereas when normalized to gastrocnemius weight, WPI-fed rats displayed increased force compared to MPI (p<0.05. There was no difference between groups using in situ muscle performance. In conclusion, soy protein consumption, in high-fat diet, resulted in muscle function comparable to whey protein and improved compared to milk protein. The benefits seen with soy or whey protein were independent of changes in muscle mass or fiber cross-sectional area.

  17. The functional significance of the autolysis loop in protein C and activated protein C.

    Science.gov (United States)

    Yang, Likui; Manithody, Chandrashekhara; Rezaie, Alireza R

    2005-07-01

    The autolysis loop of activated protein C (APC) is five residues longer than the autolysis loop of other vitamin K-dependent coagulation proteases. To investigate the role of this loop in the zymogenic and anticoagulant properties of the molecule, a protein C mutant was constructed in which the autolysis loop of the protein was replaced with the corresponding loop of factor X. The protein C mutant was activated by thrombin with approximately 5-fold higher rate in the presence of Ca2+. Both kinetics and direct binding studies revealed that the Ca2+ affinity of the mutant has been impaired approximately 3-fold. The result of a factor Va degradation assay revealed that the anticoagulant function of the mutant has been improved 4-5-fold in the absence but not in the presence of protein S. The improvement was due to a better recognition of both the P1-Arg506 and P1-Arg306 cleavage sites by the mutant protease. However, the plasma half-life of the mutant was markedly shortened due to faster inactivation by plasma serpins. These results suggest that the autolysis loop of protein C is critical for the Ca(2+)-dependence of activation by thrombin. Moreover, a longer autolysis loop in APC is not optimal for interaction with factor Va in the absence of protein S, but it contributes to the lack of serpin reactivity and longer half-life of the protease in plasma.

  18. New in protein structure and function annotation: hotspots, single nucleotide polymorphisms and the 'Deep Web'.

    Science.gov (United States)

    Bromberg, Yana; Yachdav, Guy; Ofran, Yanay; Schneider, Reinhard; Rost, Burkhard

    2009-05-01

    The rapidly increasing quantity of protein sequence data continues to widen the gap between available sequences and annotations. Comparative modeling suggests some aspects of the 3D structures of approximately half of all known proteins; homology- and network-based inferences annotate some aspect of function for a similar fraction of the proteome. For most known protein sequences, however, there is detailed knowledge about neither their function nor their structure. Comprehensive efforts towards the expert curation of sequence annotations have failed to meet the demand of the rapidly increasing number of available sequences. Only the automated prediction of protein function in the absence of homology can close the gap between available sequences and annotations in the foreseeable future. This review focuses on two novel methods for automated annotation, and briefly presents an outlook on how modern web software may revolutionize the field of protein sequence annotation. First, predictions of protein binding sites and functional hotspots, and the evolution of these into the most successful type of prediction of protein function from sequence will be discussed. Second, a new tool, comprehensive in silico mutagenesis, which contributes important novel predictions of function and at the same time prepares for the onset of the next sequencing revolution, will be described. While these two new sub-fields of protein prediction represent the breakthroughs that have been achieved methodologically, it will then be argued that a different development might further change the way biomedical researchers benefit from annotations: modern web software can connect the worldwide web in any browser with the 'Deep Web' (ie, proprietary data resources). The availability of this direct connection, and the resulting access to a wealth of data, may impact drug discovery and development more than any existing method that contributes to protein annotation.

  19. A method for partitioning the information contained in a protein sequence between its structure and function.

    Science.gov (United States)

    Possenti, Andrea; Vendruscolo, Michele; Camilloni, Carlo; Tiana, Guido

    2018-05-23

    Proteins employ the information stored in the genetic code and translated into their sequences to carry out well-defined functions in the cellular environment. The possibility to encode for such functions is controlled by the balance between the amount of information supplied by the sequence and that left after that the protein has folded into its structure. We study the amount of information necessary to specify the protein structure, providing an estimate that keeps into account the thermodynamic properties of protein folding. We thus show that the information remaining in the protein sequence after encoding for its structure (the 'information gap') is very close to what needed to encode for its function and interactions. Then, by predicting the information gap directly from the protein sequence, we show that it may be possible to use these insights from information theory to discriminate between ordered and disordered proteins, to identify unknown functions, and to optimize artificially-designed protein sequences. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.

  20. Ubiquitin-like protein UBL5 promotes the functional integrity of the Fanconi anemia pathway.

    Science.gov (United States)

    Oka, Yasuyoshi; Bekker-Jensen, Simon; Mailand, Niels

    2015-05-12

    Ubiquitin and ubiquitin-like proteins (UBLs) function in a wide array of cellular processes. UBL5 is an atypical UBL that does not form covalent conjugates with cellular proteins and which has a known role in modulating pre-mRNA splicing. Here, we report an unexpected involvement of human UBL5 in promoting the function of the Fanconi anemia (FA) pathway for repair of DNA interstrand crosslinks (ICLs), mediated by a specific interaction with the central FA pathway component FANCI. UBL5-deficient cells display spliceosome-independent reduction of FANCI protein stability, defective FANCI function in response to DNA damage and hypersensitivity to ICLs. By mapping the sequence determinants underlying UBL5-FANCI binding, we generated separation-of-function mutants to demonstrate that key aspects of FA pathway function, including FANCI-FANCD2 heterodimerization, FANCD2 and FANCI monoubiquitylation and maintenance of chromosome stability after ICLs, are compromised when the UBL5-FANCI interaction is selectively inhibited by mutations in either protein. Together, our findings establish UBL5 as a factor that promotes the functionality of the FA DNA repair pathway. © 2015 The Authors.

  1. The Structure and Function of Non-Collagenous Bone Proteins

    Science.gov (United States)

    Hook, Magnus

    1997-01-01

    The long-term goal for this program is to determine the structural and functional relationships of bone proteins and proteins that interact with bone. This information will used to design useful pharmacological compounds that will have a beneficial effect in osteoporotic patients and in the osteoporotic-like effects experienced on long duration space missions. The first phase of this program, funded under a cooperative research agreement with NASA through the Texas Medical Center, aimed to develop powerful recombinant expression systems and purification methods for production of large amounts of target proteins. Proteins expressed in sufficient'amount and purity would be characterized by a variety of structural methods, and made available for crystallization studies. In order to increase the likelihood of crystallization and subsequent high resolution solution of structures, we undertook to develop expression of normal and mutant forms of proteins by bacterial and mammalian cells. In addition to the main goals of this program, we would also be able to provide reagents for other related studies, including development of anti-fibrotic and anti-metastatic therapeutics.

  2. Topological and functional properties of the small GTPases protein interaction network.

    Directory of Open Access Journals (Sweden)

    Anna Delprato

    Full Text Available Small GTP binding proteins of the Ras superfamily (Ras, Rho, Rab, Arf, and Ran regulate key cellular processes such as signal transduction, cell proliferation, cell motility, and vesicle transport. A great deal of experimental evidence supports the existence of signaling cascades and feedback loops within and among the small GTPase subfamilies suggesting that these proteins function in a coordinated and cooperative manner. The interplay occurs largely through association with bi-partite regulatory and effector proteins but can also occur through the active form of the small GTPases themselves. In order to understand the connectivity of the small GTPases signaling routes, a systems-level approach that analyzes data describing direct and indirect interactions was used to construct the small GTPases protein interaction network. The data were curated from the Search Tool for the Retrieval of Interacting Genes (STRING database and include only experimentally validated interactions. The network method enables the conceptualization of the overall structure as well as the underlying organization of the protein-protein interactions. The interaction network described here is comprised of 778 nodes and 1943 edges and has a scale-free topology. Rac1, Cdc42, RhoA, and HRas are identified as the hubs. Ten sub-network motifs are also identified in this study with themes in apoptosis, cell growth/proliferation, vesicle traffic, cell adhesion/junction dynamics, the nicotinamide adenine dinucleotide phosphate (NADPH oxidase response, transcription regulation, receptor-mediated endocytosis, gene silencing, and growth factor signaling. Bottleneck proteins that bridge signaling paths and proteins that overlap in multiple small GTPase networks are described along with the functional annotation of all proteins in the network.

  3. Versatile microsphere attachment of GFP-labeled motors and other tagged proteins with preserved functionality

    Directory of Open Access Journals (Sweden)

    Michael Bugiel

    2015-11-01

    Full Text Available Microspheres are often used as handles for protein purification or force spectroscopy. For example, optical tweezers apply forces on trapped particles to which motor proteins are attached. However, even though many attachment strategies exist, procedures are often limited to a particular biomolecule and prone to non-specific protein or surface attachment. Such interactions may lead to loss of protein functionality or microsphere clustering. Here, we describe a versatile coupling procedure for GFP-tagged proteins via a polyethylene glycol linker preserving the functionality of the coupled proteins. The procedure combines well-established protocols, is highly reproducible, reliable, and can be used for a large variety of proteins. The coupling is efficient and can be tuned to the desired microsphere-to-protein ratio. Moreover, microspheres hardly cluster or adhere to surfaces. Furthermore, the procedure can be adapted to different tags providing flexibility and a promising attachment strategy for any tagged protein.

  4. Functional anthology of intrinsic disorder. 3. Ligands, post-translational modifications, and diseases associated with intrinsically disordered proteins.

    Science.gov (United States)

    Xie, Hongbo; Vucetic, Slobodan; Iakoucheva, Lilia M; Oldfield, Christopher J; Dunker, A Keith; Obradovic, Zoran; Uversky, Vladimir N

    2007-05-01

    Currently, the understanding of the relationships between function, amino acid sequence, and protein structure continues to represent one of the major challenges of the modern protein science. As many as 50% of eukaryotic proteins are likely to contain functionally important long disordered regions. Many proteins are wholly disordered but still possess numerous biologically important functions. However, the number of experimentally confirmed disordered proteins with known biological functions is substantially smaller than their actual number in nature. Therefore, there is a crucial need for novel bionformatics approaches that allow projection of the current knowledge from a few experimentally verified examples to much larger groups of known and potential proteins. The elaboration of a bioinformatics tool for the analysis of functional diversity of intrinsically disordered proteins and application of this data mining tool to >200 000 proteins from the Swiss-Prot database, each annotated with at least one of the 875 functional keywords, was described in the first paper of this series (Xie, H.; Vucetic, S.; Iakoucheva, L. M.; Oldfield, C. J.; Dunker, A. K.; Obradovic, Z.; Uversky, V.N. Functional anthology of intrinsic disorder. 1. Biological processes and functions of proteins with long disordered regions. J. Proteome Res. 2007, 5, 1882-1898). Using this tool, we have found that out of the 710 Swiss-Prot functional keywords associated with at least 20 proteins, 262 were strongly positively correlated with long intrinsically disordered regions, and 302 were strongly negatively correlated. Illustrative examples of functional disorder or order were found for the vast majority of keywords showing strongest positive or negative correlation with intrinsic disorder, respectively. Some 80 Swiss-Prot keywords associated with disorder- and order-driven biological processes and protein functions were described in the first paper (see above). The second paper of the series was

  5. Protein aggregation in bacteria: the thin boundary between functionality and toxicity.

    Science.gov (United States)

    Bednarska, Natalia G; Schymkowitz, Joost; Rousseau, Frederic; Van Eldere, Johan

    2013-09-01

    Misfolding and aggregation of proteins have a negative impact on all living organisms. In recent years, aggregation has been studied in detail due to its involvement in neurodegenerative diseases, including Alzheimer's, Parkinson's and Huntington's diseases, and type II diabetes--all associated with accumulation of amyloid fibrils. This research highlighted the central importance of protein homeostasis, or proteostasis for short, defined as the cellular state in which the proteome is both stable and functional. It implicates an equilibrium between synthesis, folding, trafficking, aggregation, disaggregation and degradation. In accordance with the eukaryotic systems, it has been documented that protein aggregation also reduces fitness of bacterial cells, but although our understanding of the cellular protein quality control systems is perhaps most detailed in bacteria, the use of bacterial proteostasis as a drug target remains little explored. Here we describe protein aggregation as a normal physiological process and its role in bacterial virulence and we shed light on how bacteria defend themselves against the toxic threat of aggregates. We review the impact of aggregates on bacterial viability and look at the ways that bacteria use to maintain a balance between aggregation and functionality. The proteostasis in bacteria can be interrupted via overexpression of proteins, certain antibiotics such as aminoglycosides, as well as antimicrobial peptides--all leading to loss of cell viability. Therefore intracellular protein aggregation and disruption of proteostatic balance in bacteria open up another strategy that should be explored towards the discovery of new antimicrobials.

  6. Association of papillomavirus E6 proteins with either MAML1 or E6AP clusters E6 proteins by structure, function, and evolutionary relatedness.

    Directory of Open Access Journals (Sweden)

    Nicole Brimer

    2017-12-01

    Full Text Available Papillomavirus E6 proteins bind to LXXLL peptide motifs displayed on targeted cellular proteins. Alpha genus HPV E6 proteins associate with the cellular ubiquitin ligase E6AP (UBE3A, by binding to an LXXLL peptide (ELTLQELLGEE displayed by E6AP, thereby stimulating E6AP ubiquitin ligase activity. Beta, Gamma, and Delta genera E6 proteins bind a similar LXXLL peptide (WMSDLDDLLGS on the cellular transcriptional co-activator MAML1 and thereby repress Notch signaling. We expressed 45 different animal and human E6 proteins from diverse papillomavirus genera to ascertain the overall preference of E6 proteins for E6AP or MAML1. E6 proteins from all HPV genera except Alpha preferentially interacted with MAML1 over E6AP. Among animal papillomaviruses, E6 proteins from certain ungulate (SsPV1 from pigs and cetacean (porpoises and dolphins hosts functionally resembled Alpha genus HPV by binding and targeting the degradation of E6AP. Beta genus HPV E6 proteins functionally clustered with Delta, Pi, Tau, Gamma, Chi, Mu, Lambda, Iota, Dyokappa, Rho, and Dyolambda E6 proteins to bind and repress MAML1. None of the tested E6 proteins physically and functionally interacted with both MAML1 and E6AP, indicating an evolutionary split. Further, interaction of an E6 protein was insufficient to activate degradation of E6AP, indicating that E6 proteins that target E6AP co-evolved to separately acquire both binding and triggering of ubiquitin ligase activation. E6 proteins with similar biological function clustered together in phylogenetic trees and shared structural features. This suggests that the divergence of E6 proteins from either MAML1 or E6AP binding preference is a major event in papillomavirus evolution.

  7. Quality assessment of protein model-structures based on structural and functional similarities.

    Science.gov (United States)

    Konopka, Bogumil M; Nebel, Jean-Christophe; Kotulska, Malgorzata

    2012-09-21

    Experimental determination of protein 3D structures is expensive, time consuming and sometimes impossible. A gap between number of protein structures deposited in the World Wide Protein Data Bank and the number of sequenced proteins constantly broadens. Computational modeling is deemed to be one of the ways to deal with the problem. Although protein 3D structure prediction is a difficult task, many tools are available. These tools can model it from a sequence or partial structural information, e.g. contact maps. Consequently, biologists have the ability to generate automatically a putative 3D structure model of any protein. However, the main issue becomes evaluation of the model quality, which is one of the most important challenges of structural biology. GOBA--Gene Ontology-Based Assessment is a novel Protein Model Quality Assessment Program. It estimates the compatibility between a model-structure and its expected function. GOBA is based on the assumption that a high quality model is expected to be structurally similar to proteins functionally similar to the prediction target. Whereas DALI is used to measure structure similarity, protein functional similarity is quantified using standardized and hierarchical description of proteins provided by Gene Ontology combined with Wang's algorithm for calculating semantic similarity. Two approaches are proposed to express the quality of protein model-structures. One is a single model quality assessment method, the other is its modification, which provides a relative measure of model quality. Exhaustive evaluation is performed on data sets of model-structures submitted to the CASP8 and CASP9 contests. The validation shows that the method is able to discriminate between good and bad model-structures. The best of tested GOBA scores achieved 0.74 and 0.8 as a mean Pearson correlation to the observed quality of models in our CASP8 and CASP9-based validation sets. GOBA also obtained the best result for two targets of CASP8, and

  8. Conserved generation of short products at piRNA loci

    Directory of Open Access Journals (Sweden)

    Khorshid Mohsen

    2011-01-01

    Full Text Available Abstract Background The piRNA pathway operates in animal germ lines to ensure genome integrity through retrotransposon silencing. The Piwi protein-associated small RNAs (piRNAs guide Piwi proteins to retrotransposon transcripts, which are degraded and thereby post-transcriptionally silenced through a ping-pong amplification process. Cleavage of the retrotransposon transcript defines at the same time the 5' end of a secondary piRNA that will in turn guide a Piwi protein to a primary piRNA precursor, thereby amplifying primary piRNAs. Although several studies provided evidence that this mechanism is conserved among metazoa, how the process is initiated and what enzymatic activities are responsible for generating the primary and secondary piRNAs are not entirely clear. Results Here we analyzed small RNAs from three mammalian species, seeking to gain further insight into the mechanisms responsible for the piRNA amplification loop. We found that in all these species piRNA-directed targeting is accompanied by the generation of short sequences that have a very precisely defined length, 19 nucleotides, and a specific spatial relationship with the guide piRNAs. Conclusions This suggests that the processing of the 5' product of piRNA-guided cleavage occurs while the piRNA target is engaged by the Piwi protein. Although they are not stabilized through methylation of their 3' ends, the 19-mers are abundant not only in testes lysates but also in immunoprecipitates of Miwi and Mili proteins. They will enable more accurate identification of piRNA loci in deep sequencing data sets.

  9. The Link between Dietary Protein Intake, Skeletal Muscle Function and Health in Older Adults

    Directory of Open Access Journals (Sweden)

    Jamie I. Baum

    2015-07-01

    Full Text Available Skeletal muscle mass and function are progressively lost with age, a condition referred to as sarcopenia. By the age of 60, many older adults begin to be affected by muscle loss. There is a link between decreased muscle mass and strength and adverse health outcomes such as obesity, diabetes and cardiovascular disease. Data suggest that increasing dietary protein intake at meals may counterbalance muscle loss in older individuals due to the increased availability of amino acids, which stimulate muscle protein synthesis by activating the mammalian target of rapamycin (mTORC1. Increased muscle protein synthesis can lead to increased muscle mass, strength and function over time. This review aims to address the current recommended dietary allowance (RDA for protein and whether or not this value meets the needs for older adults based upon current scientific evidence. The current RDA for protein is 0.8 g/kg body weight/day. However, literature suggests that consuming protein in amounts greater than the RDA can improve muscle mass, strength and function in older adults.

  10. Acetylation of pregnane X receptor protein determines selective function independent of ligand activation

    International Nuclear Information System (INIS)

    Biswas, Arunima; Pasquel, Danielle; Tyagi, Rakesh Kumar; Mani, Sridhar

    2011-01-01

    Research highlights: → Pregnane X receptor (PXR), a major regulatory protein, is modified by acetylation. → PXR undergoes dynamic deacetylation upon ligand-mediated activation. → SIRT1 partially mediates PXR deacetylation. → PXR deacetylation per se induces lipogenesis mimicking ligand-mediated activation. -- Abstract: Pregnane X receptor (PXR), like other members of its class of nuclear receptors, undergoes post-translational modification [PTM] (e.g., phosphorylation). However, it is unknown if acetylation (a major and common form of protein PTM) is observed on PXR and, if it is, whether it is of functional consequence. PXR has recently emerged as an important regulatory protein with multiple ligand-dependent functions. In the present work we show that PXR is indeed acetylated in vivo. SIRT1 (Sirtuin 1), a NAD-dependent class III histone deacetylase and a member of the sirtuin family of proteins, partially mediates deacetylation of PXR. Most importantly, the acetylation status of PXR regulates its selective function independent of ligand activation.

  11. Nopaline-type Ti plasmid of Agrobacterium encodes a VirF-like functional F-box protein.

    Science.gov (United States)

    Lacroix, Benoît; Citovsky, Vitaly

    2015-11-20

    During Agrobacterium-mediated genetic transformation of plants, several bacterial virulence (Vir) proteins are translocated into the host cell to facilitate infection. One of the most important of such translocated factors is VirF, an F-box protein produced by octopine strains of Agrobacterium, which presumably facilitates proteasomal uncoating of the invading T-DNA from its associated proteins. The presence of VirF also is thought to be involved in differences in host specificity between octopine and nopaline strains of Agrobacterium, with the current dogma being that no functional VirF is encoded by nopaline strains. Here, we show that a protein with homology to octopine VirF is encoded by the Ti plasmid of the nopaline C58 strain of Agrobacterium. This protein, C58VirF, possesses the hallmarks of functional F-box proteins: it contains an active F-box domain and specifically interacts, via its F-box domain, with SKP1-like (ASK) protein components of the plant ubiquitin/proteasome system. Thus, our data suggest that nopaline strains of Agrobacterium have evolved to encode a functional F-box protein VirF.

  12. Evolutionary Conservation and Emerging Functional Diversity of the Cytosolic Hsp70:J Protein Chaperone Network of Arabidopsis thaliana.

    Science.gov (United States)

    Verma, Amit K; Diwan, Danish; Raut, Sandeep; Dobriyal, Neha; Brown, Rebecca E; Gowda, Vinita; Hines, Justin K; Sahi, Chandan

    2017-06-07

    Heat shock proteins of 70 kDa (Hsp70s) partner with structurally diverse Hsp40s (J proteins), generating distinct chaperone networks in various cellular compartments that perform myriad housekeeping and stress-associated functions in all organisms. Plants, being sessile, need to constantly maintain their cellular proteostasis in response to external environmental cues. In these situations, the Hsp70:J protein machines may play an important role in fine-tuning cellular protein quality control. Although ubiquitous, the functional specificity and complexity of the plant Hsp70:J protein network has not been studied. Here, we analyzed the J protein network in the cytosol of Arabidopsis thaliana and, using yeast genetics, show that the functional specificities of most plant J proteins in fundamental chaperone functions are conserved across long evolutionary timescales. Detailed phylogenetic and functional analysis revealed that increased number, regulatory differences, and neofunctionalization in J proteins together contribute to the emerging functional diversity and complexity in the Hsp70:J protein network in higher plants. Based on the data presented, we propose that higher plants have orchestrated their "chaperome," especially their J protein complement, according to their specialized cellular and physiological stipulations. Copyright © 2017 Verma et al.

  13. Functional characterization of fidgetin, an AAA-family protein mutated in fidget mice

    International Nuclear Information System (INIS)

    Yang Yan; Mahaffey, Connie L.; Berube, Nathalie; Nystuen, Arne; Frankel, Wayne N.

    2005-01-01

    The mouse fidget mutation is an autosomal recessive mutation that renders reduced or absent semicircular canals, microphthalmia, and various skeletal abnormalities to affected mice. We previously identified the defective gene which encodes fidgetin, a new member of the ATPases associated with diverse cellular activities (AAA proteins). Here, we report on the subcellular localization of fidgetin as well as that of two closely related proteins, fidgetin-like 1 and fidgetin-like 2. Epitope-tagging and immunostaining revealed that both fidgetin and fidgetin-like 2 were predominantly localized to the nucleus, whereas fidgetin-like 1 was both nuclear and cytoplasmic. Furthermore, deletion studies identified a putative bipartite nuclear localization signal in the middle portion of the fidgetin protein. Since AAA proteins are known to form functional hetero- or homo-hexamers, we used reciprocal immunoprecipitation to examine the potential interaction among these proteins. We found that fidgetin interacted with itself and this specific interaction was abolished when either the N- or C-terminus of the protein was truncated. Taken together, our results suggest that fidgetin is a nuclear AAA-family protein with the potential to form homo-oligomers, thus representing the first step towards the elucidation of fidgetin's cellular function and the disease mechanism in fidget mutant mice

  14. Comparison of structure, function and regulation of plant cold shock domain proteins to bacterial and animal cold shock domain proteins.

    Science.gov (United States)

    Chaikam, Vijay; Karlson, Dale T

    2010-01-01

    The cold shock domain (CSD) is among the most ancient and well conserved nucleic acid binding domains from bacteria to higher animals and plants. The CSD facilitates binding to RNA, ssDNA and dsDNA and most functions attributed to cold shock domain proteins are mediated by this nucleic acid binding activity. In prokaryotes, cold shock domain proteins only contain a single CSD and are termed cold shock proteins (Csps). In animal model systems, various auxiliary domains are present in addition to the CSD and are commonly named Y-box proteins. Similar to animal CSPs, plant CSPs contain auxiliary C-terminal domains in addition to their N-terminal CSD. Cold shock domain proteins have been shown to play important roles in development and stress adaptation in wide variety of organisms. In this review, the structure, function and regulation of plant CSPs are compared and contrasted to the characteristics of bacterial and animal CSPs. [BMB reports 2010; 43(1): 1-8].

  15. MODexplorer: an integrated tool for exploring protein sequence, structure and function relationships.

    KAUST Repository

    Kosinski, Jan; Barbato, Alessandro; Tramontano, Anna

    2013-01-01

    SUMMARY: MODexplorer is an integrated tool aimed at exploring the sequence, structural and functional diversity in protein families useful in homology modeling and in analyzing protein families in general. It takes as input either the sequence or the structure of a protein and provides alignments with its homologs along with a variety of structural and functional annotations through an interactive interface. The annotations include sequence conservation, similarity scores, ligand-, DNA- and RNA-binding sites, secondary structure, disorder, crystallographic structure resolution and quality scores of models implied by the alignments to the homologs of known structure. MODexplorer can be used to analyze sequence and structural conservation among the structures of similar proteins, to find structures of homologs solved in different conformational state or with different ligands and to transfer functional annotations. Furthermore, if the structure of the query is not known, MODexplorer can be used to select the modeling templates taking all this information into account and to build a comparative model. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://modorama.biocomputing.it/modexplorer. Website implemented in HTML and JavaScript with all major browsers supported. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

  16. MODexplorer: an integrated tool for exploring protein sequence, structure and function relationships.

    KAUST Repository

    Kosinski, Jan

    2013-02-08

    SUMMARY: MODexplorer is an integrated tool aimed at exploring the sequence, structural and functional diversity in protein families useful in homology modeling and in analyzing protein families in general. It takes as input either the sequence or the structure of a protein and provides alignments with its homologs along with a variety of structural and functional annotations through an interactive interface. The annotations include sequence conservation, similarity scores, ligand-, DNA- and RNA-binding sites, secondary structure, disorder, crystallographic structure resolution and quality scores of models implied by the alignments to the homologs of known structure. MODexplorer can be used to analyze sequence and structural conservation among the structures of similar proteins, to find structures of homologs solved in different conformational state or with different ligands and to transfer functional annotations. Furthermore, if the structure of the query is not known, MODexplorer can be used to select the modeling templates taking all this information into account and to build a comparative model. AVAILABILITY AND IMPLEMENTATION: Freely available on the web at http://modorama.biocomputing.it/modexplorer. Website implemented in HTML and JavaScript with all major browsers supported. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

  17. Molecular cloning, functional expression and subcellular localization of two putative vacuolar voltage-gated chloride channels in rice (Oryza sativa L.).

    Science.gov (United States)

    Nakamura, Atsuko; Fukuda, Atsunori; Sakai, Shingo; Tanaka, Yoshiyuki

    2006-01-01

    We isolated two cDNA clones (OsCLC-1 and OsCLC-2) homologous to tobacco CLC-Nt1, which encodes a voltage-gated chloride channel, from rice (Oryza sativa L. ssp. japonica, cv. Nipponbare). The deduced amino acid sequences were highly conserved (87.9% identity with each other). Southern blot analysis of the rice genomic DNA revealed that OsCLC-1 and OsCLC-2 were single-copy genes on chromosomes 4 and 2, respectively. OsCLC-1 was expressed in most tissues, whereas OsCLC-2 was expressed only in the roots, nodes, internodes and leaf sheaths. The level of expression of OsCLC-1, but not of OsCLC-2, was increased by treatment with NaCl. Both genes could partly substitute for GEF1, which encodes the sole chloride channel in yeast, by restoring growth under ionic stress. These results indicate that both genes are chloride channel genes. The proteins from both genes were immunochemically detected in the tonoplast fraction. Tagged synthetic green fluorescent protein which was fused to OsCLC-1 or OsCLC-2 localized in the vacuolar membranes. These results indicate that the proteins may play a role in the transport of chloride ions across the vacuolar membrane. We isolated loss-of-function mutants of both genes from a panel of rice mutants produced by the insertion of a retrotransposon, Tos17, in the exon region, and found inhibition of growth at all life stages.

  18. Artificial receptor-functionalized nanoshell: facile preparation, fast separation and specific protein recognition

    Science.gov (United States)

    Ouyang, Ruizhuo; Lei, Jianping; Ju, Huangxian

    2010-05-01

    This work combined molecular imprinting technology with superparamagnetic nanospheres as the core to prepare artificial receptor-functionalized magnetic nanoparticles for separation of homologous proteins. Using dopamine as a functional monomer, novel surface protein-imprinted superparamagnetic polydopamine (PDA) core-shell nanoparticles were successfully prepared in physiological conditions, which could maintain the natural structure of a protein template and achieved the development of molecularly imprinted polymers (MIPs) from one dimension to zero dimension for efficient recognition towards large biomolecules. The resultant nanoparticles could be used for convenient magnetic separation of homologous proteins with high specificity. The nanoparticles possessed good monodispersibility, uniform surface morphology and high saturation magnetization value. The bound amounts of template proteins measured by both indirect and direct methods were in good agreement. The maximum number of imprinted cavities on the surface of the bovine hemoglobin (Hb)-imprinted nanoshell was 2.21 × 1018 g - 1, which well matched their maximum binding capacity toward bovine Hb. Both the simple method for preparation of MIPs and the magnetic nanospheres showed good application potential in fast separation, effective concentration and selective biosensing of large protein molecules.

  19. Turkish Tombul hazelnut (Corylus avellana L.) protein concentrates: functional and rheological properties.

    Science.gov (United States)

    Tatar, F; Tunç, M T; Kahyaoglu, T

    2015-02-01

    Turkish Tombul hazelnut consumed as natural or processed forms were evaluated to obtain protein concentrate. Defatted hazelnut flour protein (DHFP) and defatted hazelnut cake protein (DHCP) were produced from defatted hazelnut flour (DHF) and defatted hazelnut cake (DHC), respectively. The functional properties (protein solubility, emulsifying properties, foaming capacity, and colour), and dynamic rheological characteristics of protein concentrates were measured. The protein contents of samples varied in the range of 35-48 % (w/w, db) and 91-92 % (w/w, db) for DHF/DHC and DHFP/DHCP samples, respectively. The significant difference for water/fat absorption capacity, emulsion stability between DHF and DHC were determined. On the other hand, the solubility and emulsion activity of DHF and DHC were not significantly different (p > 0.05). Emulsion stability of DHFP (%46) was higher than that of DHCP (%35) but other functional properties were found similar. According to these results, the DHCP could be used as DHFP in food product formulations. The DHFP and DHCP samples showed different apparent viscosity at the same temperature and concentration, the elastic modulus (G' value) of DHPC was also found higher than that of DHFP samples.

  20. Artificial receptor-functionalized nanoshell: facile preparation, fast separation and specific protein recognition

    Energy Technology Data Exchange (ETDEWEB)

    Ouyang, Ruizhuo; Lei Jianping; Ju Huangxian, E-mail: jpl@nju.edu.cn, E-mail: hxju@nju.edu.cn [Key Laboratory of Analytical Chemistry for Life Science (Education Ministry of China), Department of Chemistry, Nanjing University, Nanjing 210093 (China)

    2010-05-07

    This work combined molecular imprinting technology with superparamagnetic nanospheres as the core to prepare artificial receptor-functionalized magnetic nanoparticles for separation of homologous proteins. Using dopamine as a functional monomer, novel surface protein-imprinted superparamagnetic polydopamine (PDA) core-shell nanoparticles were successfully prepared in physiological conditions, which could maintain the natural structure of a protein template and achieved the development of molecularly imprinted polymers (MIPs) from one dimension to zero dimension for efficient recognition towards large biomolecules. The resultant nanoparticles could be used for convenient magnetic separation of homologous proteins with high specificity. The nanoparticles possessed good monodispersibility, uniform surface morphology and high saturation magnetization value. The bound amounts of template proteins measured by both indirect and direct methods were in good agreement. The maximum number of imprinted cavities on the surface of the bovine hemoglobin (Hb)-imprinted nanoshell was 2.21 x 10{sup 18} g{sup -1}, which well matched their maximum binding capacity toward bovine Hb. Both the simple method for preparation of MIPs and the magnetic nanospheres showed good application potential in fast separation, effective concentration and selective biosensing of large protein molecules.