WorldWideScience

Sample records for eukaryotic linear motif

  1. Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.

    Science.gov (United States)

    Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique

    2015-06-01

    Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.

  2. iELM—a web server to explore short linear motif-mediated interactions

    Science.gov (United States)

    Weatheritt, Robert J.; Jehl, Peter; Dinkel, Holger; Gibson, Toby J.

    2012-01-01

    The recent expansion in our knowledge of protein–protein interactions (PPIs) has allowed the annotation and prediction of hundreds of thousands of interactions. However, the function of many of these interactions remains elusive. The interactions of Eukaryotic Linear Motif (iELM) web server provides a resource for predicting the function and positional interface for a subset of interactions mediated by short linear motifs (SLiMs). The iELM prediction algorithm is based on the annotated SLiM classes from the Eukaryotic Linear Motif (ELM) resource and allows users to explore both annotated and user-generated PPI networks for SLiM-mediated interactions. By incorporating the annotated information from the ELM resource, iELM provides functional details of PPIs. This can be used in proteomic analysis, for example, to infer whether an interaction promotes complex formation or degradation. Furthermore, details of the molecular interface of the SLiM-mediated interactions are also predicted. This information is displayed in a fully searchable table, as well as graphically with the modular architecture of the participating proteins extracted from the UniProt and Phospho.ELM resources. A network figure is also presented to aid the interpretation of results. The iELM server supports single protein queries as well as large-scale proteomic submissions and is freely available at http://i.elm.eu.org. PMID:22638578

  3. How pathogens use linear motifs to perturb host cell networks

    KAUST Repository

    Via, Allegra; Uyar, Bora; Brun, Christine; Zanzoni, Andreas

    2015-01-01

    Molecular mimicry is one of the powerful stratagems that pathogens employ to colonise their hosts and take advantage of host cell functions to guarantee their replication and dissemination. In particular, several viruses have evolved the ability to interact with host cell components through protein short linear motifs (SLiMs) that mimic host SLiMs, thus facilitating their internalisation and the manipulation of a wide range of cellular networks. Here we present convincing evidence from the literature that motif mimicry also represents an effective, widespread hijacking strategy in prokaryotic and eukaryotic parasites. Further insights into host motif mimicry would be of great help in the elucidation of the molecular mechanisms behind host cell invasion and the development of anti-infective therapeutic strategies.

  4. C-terminal motif prediction in eukaryotic proteomes using comparative genomics and statistical over-representation across protein families

    Directory of Open Access Journals (Sweden)

    Cutler Sean R

    2007-06-01

    Full Text Available Abstract Background The carboxy termini of proteins are a frequent site of activity for a variety of biologically important functions, ranging from post-translational modification to protein targeting. Several short peptide motifs involved in protein sorting roles and dependent upon their proximity to the C-terminus for proper function have already been characterized. As a limited number of such motifs have been identified, the potential exists for genome-wide statistical analysis and comparative genomics to reveal novel peptide signatures functioning in a C-terminal dependent manner. We have applied a novel methodology to the prediction of C-terminal-anchored peptide motifs involving a simple z-statistic and several techniques for improving the signal-to-noise ratio. Results We examined the statistical over-representation of position-specific C-terminal tripeptides in 7 eukaryotic proteomes. Sequence randomization models and simple-sequence masking were applied to the successful reduction of background noise. Similarly, as C-terminal homology among members of large protein families may artificially inflate tripeptide counts in an irrelevant and obfuscating manner, gene-family clustering was performed prior to the analysis in order to assess tripeptide over-representation across protein families as opposed to across all proteins. Finally, comparative genomics was used to identify tripeptides significantly occurring in multiple species. This approach has been able to predict, to our knowledge, all C-terminally anchored targeting motifs present in the literature. These include the PTS1 peroxisomal targeting signal (SKL*, the ER-retention signal (K/HDEL*, the ER-retrieval signal for membrane bound proteins (KKxx*, the prenylation signal (CC* and the CaaX box prenylation motif. In addition to a high statistical over-representation of these known motifs, a collection of significant tripeptides with a high propensity for biological function exists

  5. The ARTT motif and a unified structural understanding of substraterecognition in ADP ribosylating bacterial toxins and eukaryotic ADPribosyltransferases

    Energy Technology Data Exchange (ETDEWEB)

    Han, S.; Tainer, J.A.

    2001-08-01

    ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing NAD-binding pocket formed by the two perpendicular b-sheet core has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransfeases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT

  6. Discovery of candidate KEN-box motifs using cell cycle keyword enrichment combined with native disorder prediction and motif conservation.

    Science.gov (United States)

    Michael, Sushama; Travé, Gilles; Ramu, Chenna; Chica, Claudia; Gibson, Toby J

    2008-02-15

    KEN-box-mediated target selection is one of the mechanisms used in the proteasomal destruction of mitotic cell cycle proteins via the APC/C complex. While annotating the Eukaryotic Linear Motif resource (ELM, http://elm.eu.org/), we found that KEN motifs were significantly enriched in human protein entries with cell cycle keywords in the UniProt/Swiss-Prot database-implying that KEN-boxes might be more common than reported. Matches to short linear motifs in protein database searches are not, per se, significant. KEN-box enrichment with cell cycle Gene Ontology terms suggests that collectively these motifs are functional but does not prove that any given instance is so. Candidates were surveyed for native disorder prediction using GlobPlot and IUPred and for motif conservation in homologues. Among >25 strong new candidates, the most notable are human HIPK2, CHFR, CDC27, Dab2, Upf2, kinesin Eg5, DNA Topoisomerase 1 and yeast Cdc5 and Swi5. A similar number of weaker candidates were present. These proteins have yet to be tested for APC/C targeted destruction, providing potential new avenues of research.

  7. Role of NH2-terminal hydrophobic motif in the subcellular localization of ATP-binding cassette protein subfamily D: Common features in eukaryotic organisms

    International Nuclear Information System (INIS)

    Lee, Asaka; Asahina, Kota; Okamoto, Takumi; Kawaguchi, Kosuke; Kostsin, Dzmitry G.; Kashiwayama, Yoshinori; Takanashi, Kojiro; Yazaki, Kazufumi; Imanaka, Tsuneo; Morita, Masashi

    2014-01-01

    Highlights: • ABCD proteins classifies based on with or without NH 2 -terminal hydrophobic segment. • The ABCD proteins with the segment are targeted peroxisomes. • The ABCD proteins without the segment are targeted to the endoplasmic reticulum. • The role of the segment in organelle targeting is conserved in eukaryotic organisms. - Abstract: In mammals, four ATP-binding cassette (ABC) proteins belonging to subfamily D have been identified. ABCD1–3 possesses the NH 2 -terminal hydrophobic region and are targeted to peroxisomes, while ABCD4 lacking the region is targeted to the endoplasmic reticulum (ER). Based on hydropathy plot analysis, we found that several eukaryotes have ABCD protein homologs lacking the NH 2 -terminal hydrophobic segment (H0 motif). To investigate whether the role of the NH 2 -terminal H0 motif in subcellular localization is conserved across species, we expressed ABCD proteins from several species (metazoan, plant and fungi) in fusion with GFP in CHO cells and examined their subcellular localization. ABCD proteins possessing the NH 2 -terminal H0 motif were localized to peroxisomes, while ABCD proteins lacking this region lost this capacity. In addition, the deletion of the NH 2 -terminal H0 motif of ABCD protein resulted in their localization to the ER. These results suggest that the role of the NH 2 -terminal H0 motif in organelle targeting is widely conserved in living organisms

  8. Interaction of the RNP1 motif in PRT1 with HCR1 promotes 40S binding of eukaryotic initiation factor 3 in yeast

    DEFF Research Database (Denmark)

    Nielsen, Klaus H; Valásek, Leos; Sykes, Caroah

    2006-01-01

    We found that mutating the RNP1 motif in the predicted RRM domain in yeast eukaryotic initiation factor 3 (eIF3) subunit b/PRT1 (prt1-rnp1) impairs its direct interactions in vitro with both eIF3a/TIF32 and eIF3j/HCR1. The rnp1 mutation in PRT1 confers temperature-sensitive translation initiation...

  9. Core signalling motif displaying multistability through multi-state enzymes

    DEFF Research Database (Denmark)

    Feng, Song; Saez Cornellana, Meritxell; Wiuf, Carsten Henrik

    2016-01-01

    Bistability, and more generally multistability, is a key system dynamics feature enabling decision-making and memory in cells. Deciphering the molecular determinants of multistability is thus crucial for a better understanding of cellular pathways and their (re)engineering in synthetic biology....... Here, we show that a key motif found predominantly in eukaryotic signalling systems, namely a futile signalling cycle, can display bistability when featuring a two-state kinase. We provide necessary and sufficient mathematical conditions on the kinetic parameters of this motif that guarantee...... the existence of multiple steady states. These conditions foster the intuition that bistability arises as a consequence of competition between the two states of the kinase. Extending from this result, we find that increasing the number of kinase states linearly translates into an increase in the number...

  10. Discrepancy variation of dinucleotide microsatellite repeats in eukaryotic genomes

    Directory of Open Access Journals (Sweden)

    HUAN GAO

    2009-01-01

    Full Text Available To address whether there are differences of variation among repeat motif types and among taxonomic groups, we present here an analysis of variation and correlation of dinucleotide microsatellite repeats in eukaryotic genomes. Ten taxonomic groups were compared, those being primates, mammalia (excluding primates and rodentia, rodentia, birds, fish, amphibians and reptiles, insects, molluscs, plants and fungi, respectively. The data used in the analysis is from the literature published in the Journal of Molecular Ecology Notes. Analysis of variation reveals that there are no significant differences between AC and AG repeat motif types. Moreover, the number of alleles correlates positively with the copy number in both AG and AC repeats. Similar conclusions can be obtained from each taxonomic group. These results strongly suggest that the increase of SSR variation is almost linear with the increase of the copy number of each repeat motif. As well, the results suggest that the variability of SSR in the genomes of low-ranking species seem to be more than that of high-ranking species, excluding primates and fungi.

  11. Phyloproteomic Analysis of 11780 Six-Residue-Long Motifs Occurrences

    Directory of Open Access Journals (Sweden)

    O. V. Galzitskaya

    2015-01-01

    Full Text Available How is it possible to find good traits for phylogenetic reconstructions? Here, we present a new phyloproteomic criterion that is an occurrence of simple motifs which can be imprints of evolution history. We studied the occurrences of 11780 six-residue-long motifs consisting of two randomly located amino acids in 97 eukaryotic and 25 bacterial proteomes. For all eukaryotic proteomes, with the exception of the Amoebozoa, Stramenopiles, and Diplomonadida kingdoms, the number of proteins containing the motifs from the first group (one of the two amino acids occurs once at the terminal position made about 20%; in the case of motifs from the second (one of two amino acids occurs one time within the pattern and third (the two amino acids occur randomly groups, 30% and 50%, respectively. For bacterial proteomes, this relationship was 10%, 27%, and 63%, respectively. The matrices of correlation coefficients between numbers of proteins where a motif from the set of 11780 motifs appears at least once in 9 kingdoms and 5 phyla of bacteria were calculated. Among the correlation coefficients for eukaryotic proteomes, the correlation between the animal and fungi kingdoms (0.62 is higher than between fungi and plants (0.54. Our study provides support that animals and fungi are sibling kingdoms. Comparison of the frequencies of six-residue-long motifs in different proteomes allows obtaining phylogenetic relationships based on similarities between these frequencies: the Diplomonadida kingdoms are more close to Bacteria than to Eukaryota; Stramenopiles and Amoebozoa are more close to each other than to other kingdoms of Eukaryota.

  12. MPN+, a putative catalytic motif found in a subset of MPN domain proteins from eukaryotes and prokaryotes, is critical for Rpn11 function

    Directory of Open Access Journals (Sweden)

    Hofmann Kay

    2002-09-01

    Full Text Available Abstract Background Three macromolecular assemblages, the lid complex of the proteasome, the COP9-Signalosome (CSN and the eIF3 complex, all consist of multiple proteins harboring MPN and PCI domains. Up to now, no specific function for any of these proteins has been defined, nor has the importance of these motifs been elucidated. In particular Rpn11, a lid subunit, serves as the paradigm for MPN-containing proteins as it is highly conserved and important for proteasome function. Results We have identified a sequence motif, termed the MPN+ motif, which is highly conserved in a subset of MPN domain proteins such as Rpn11 and Csn5/Jab1, but is not present outside of this subfamily. The MPN+ motif consists of five polar residues that resemble the active site residues of hydrolytic enzyme classes, particularly that of metalloproteases. By using site-directed mutagenesis, we show that the MPN+ residues are important for the function of Rpn11, while a highly conserved Cys residue outside of the MPN+ motif is not essential. Single amino acid substitutions in MPN+ residues all show similar phenotypes, including slow growth, sensitivity to temperature and amino acid analogs, and general proteasome-dependent proteolysis defects. Conclusions The MPN+ motif is abundant in certain MPN-domain proteins, including newly identified proteins of eukaryotes, bacteria and archaea thought to act outside of the traditional large PCI/MPN complexes. The putative catalytic nature of the MPN+ motif makes it a good candidate for a pivotal enzymatic function, possibly a proteasome-associated deubiquitinating activity and a CSN-associated Nedd8/Rub1-removing activity.

  13. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed; Briganti, Leonardo; Tramontano, Anna; Via, Allegra

    2011-01-01

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length

  14. Large-scale analysis of phosphorylation site occupancy in eukaryotic proteins

    DEFF Research Database (Denmark)

    Rao, R Shyama Prasad; Møller, Ian Max

    2012-01-01

    in proteins is currently lacking. We have therefore analyzed the occurrence and occupancy of phosphorylated sites (~ 100,281) in a large set of eukaryotic proteins (~ 22,995). Phosphorylation probability was found to be much higher in both the  termini of protein sequences and this is much pronounced...... maximum randomness. An analysis of phosphorylation motifs indicated that just 40 motifs and a much lower number of associated kinases might account for nearly 50% of the known phosphorylations in eukaryotic proteins. Our results provide a broad picture of the phosphorylation sites in eukaryotic proteins.......Many recent high throughput technologies have enabled large-scale discoveries of new phosphorylation sites and phosphoproteins. Although they have provided a number of insights into protein phosphorylation and the related processes, an inclusive analysis on the nature of phosphorylated sites...

  15. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed

    2011-07-20

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length of the motifs and their variable degree of conservation makes their identification hard since it is difficult to correctly estimate the statistical significance of their occurrence. Consequently, only a small fraction of them have been discovered so far. We describe here an approach for the discovery of SLiMs based on their occurrence in evolutionarily unrelated proteins belonging to the same biological, signalling or metabolic pathway and give specific examples of its effectiveness in both rediscovering known motifs and in discovering novel ones. An automatic implementation of the procedure, available for download, allows significant motifs to be identified, automatically annotated with functional, evolutionary and structural information and organized in a database that can be inspected and queried. An instance of the database populated with pre-computed data on seven organisms is accessible through a publicly available server and we believe it constitutes by itself a useful resource for the life sciences (http://www.biocomputing.it/modipath).

  16. A systems wide mass spectrometric based linear motif screen to identify dominant in-vivo interacting proteins for the ubiquitin ligase MDM2.

    Science.gov (United States)

    Nicholson, Judith; Scherl, Alex; Way, Luke; Blackburn, Elizabeth A; Walkinshaw, Malcolm D; Ball, Kathryn L; Hupp, Ted R

    2014-06-01

    Linear motifs mediate protein-protein interactions (PPI) that allow expansion of a target protein interactome at a systems level. This study uses a proteomics approach and linear motif sub-stratifications to expand on PPIs of MDM2. MDM2 is a multi-functional protein with over one hundred known binding partners not stratified by hierarchy or function. A new linear motif based on a MDM2 interaction consensus is used to select novel MDM2 interactors based on Nutlin-3 responsiveness in a cell-based proteomics screen. MDM2 binds a subset of peptide motifs corresponding to real proteins with a range of allosteric responses to MDM2 ligands. We validate cyclophilin B as a novel protein with a consensus MDM2 binding motif that is stabilised by Nutlin-3 in vivo, thus identifying one of the few known interactors of MDM2 that is stabilised by Nutlin-3. These data invoke two modes of peptide binding at the MDM2 N-terminus that rely on a consensus core motif to control the equilibrium between MDM2 binding proteins. This approach stratifies MDM2 interacting proteins based on the linear motif feature and provides a new biomarker assay to define clinically relevant Nutlin-3 responsive MDM2 interactors. Copyright © 2014 Elsevier Inc. All rights reserved.

  17. [Structure and evolution of the eukaryotic FANCJ-like proteins].

    Science.gov (United States)

    Wuhe, Jike; Zefeng, Wu; Sanhong, Fan; Xuguang, Xi

    2015-02-01

    The FANCJ-like protein family is a class of ATP-dependent helicases that can catalytically unwind duplex DNA along the 5'-3' direction. It is involved in the processes of DNA damage repair, homologous recombination and G-quadruplex DNA unwinding, and plays a critical role in maintaining genome integrity. In this study, we systemically analyzed FNACJ-like proteins from 47 eukaryotic species and discussed their sequences diversity, origin and evolution, motif organization patterns and spatial structure differences. Four members of FNACJ-like proteins, including XPD, CHL1, RTEL1 and FANCJ, were found in eukaryotes, but some of them were seriously deficient in most fungi and some insects. For example, the Zygomycota fungi lost RTEL1, Basidiomycota and Ascomycota fungi lost RTEL1 and FANCJ, and Diptera insect lost FANCJ. FANCJ-like proteins contain canonical motor domains HD1 and HD2, and the HD1 domain further integrates with three unique domains Fe-S, Arch and Extra-D. Fe-S and Arch domains are relatively conservative in all members of the family, but the Extra-D domain is lost in XPD and differs from one another in rest members. There are 7, 10 and 2 specific motifs found from the three unique domains respectively, while 5 and 12 specific motifs are found from HD1 and HD2 domains except the conserved motifs reported previously. By analyzing the arrangement pattern of these specific motifs, we found that RTEL1 and FANCJ are more closer and share two specific motifs Vb2 and Vc in HD2 domain, which are likely related with their G-quadruplex DNA unwinding activity. The evidence of evolution showed that FACNJ-like proteins were originated from a helicase, which has a HD1 domain inserted by extra Fe-S domain and Arch domain. By three continuous gene duplication events and followed specialization, eukaryotes finally possessed the current four members of FANCJ-like proteins.

  18. Identity and functions of CxxC-derived motifs.

    Science.gov (United States)

    Fomenko, Dmitri E; Gladyshev, Vadim N

    2003-09-30

    Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.

  19. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Science.gov (United States)

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs

  20. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

  1. Detecting remote sequence homology in disordered proteins: discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins.

    Directory of Open Access Journals (Sweden)

    David Karlin

    Full Text Available Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second viral protein alternatively expressed, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11-16aa, several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains that could be detected simply by comparing orthologous proteins.

  2. Nucleophosmin integrates within the nucleolus via multi-modal interactions with proteins displaying R-rich linear motifs and rRNA.

    Science.gov (United States)

    Mitrea, Diana M; Cika, Jaclyn A; Guy, Clifford S; Ban, David; Banerjee, Priya R; Stanley, Christopher B; Nourse, Amanda; Deniz, Ashok A; Kriwacki, Richard W

    2016-02-02

    The nucleolus is a membrane-less organelle formed through liquid-liquid phase separation of its components from the surrounding nucleoplasm. Here, we show that nucleophosmin (NPM1) integrates within the nucleolus via a multi-modal mechanism involving multivalent interactions with proteins containing arginine-rich linear motifs (R-motifs) and ribosomal RNA (rRNA). Importantly, these R-motifs are found in canonical nucleolar localization signals. Based on a novel combination of biophysical approaches, we propose a model for the molecular organization within liquid-like droplets formed by the N-terminal domain of NPM1 and R-motif peptides, thus providing insights into the structural organization of the nucleolus. We identify multivalency of acidic tracts and folded nucleic acid binding domains, mediated by N-terminal domain oligomerization, as structural features required for phase separation of NPM1 with other nucleolar components in vitro and for localization within mammalian nucleoli. We propose that one mechanism of nucleolar localization involves phase separation of proteins within the nucleolus.

  3. Monitoring lysin motif-ligand interactions via tryptophan analog fluorescence spectroscopy

    NARCIS (Netherlands)

    Petrovic, Dejan M.; Leenhouts, Kees; van Roosmalen, Maarten L.; KleinJan, Fenneke; Broos, Jaap

    2012-01-01

    The lysin motif (LysM) is a peptidoglycan binding protein domain found in a wide range of prokaryotes and eukaryotes. Various techniques have been used to study the LysM-ligand interaction, but a sensitive spectroscopic method to directly monitor this interaction has not been reported. Here a

  4. Solution structure of an archaeal DNA binding protein with an eukaryotic zinc finger fold.

    Directory of Open Access Journals (Sweden)

    Florence Guillière

    Full Text Available While the basal transcription machinery in archaea is eukaryal-like, transcription factors in archaea and their viruses are usually related to bacterial transcription factors. Nevertheless, some of these organisms show predicted classical zinc fingers motifs of the C2H2 type, which are almost exclusively found in proteins of eukaryotes and most often associated with transcription regulators. In this work, we focused on the protein AFV1p06 from the hyperthermophilic archaeal virus AFV1. The sequence of the protein consists of the classical eukaryotic C2H2 motif with the fourth histidine coordinating zinc missing, as well as of N- and C-terminal extensions. We showed that the protein AFV1p06 binds zinc and solved its solution structure by NMR. AFV1p06 displays a zinc finger fold with a novel structure extension and disordered N- and C-termini. Structure calculations show that a glutamic acid residue that coordinates zinc replaces the fourth histidine of the C2H2 motif. Electromobility gel shift assays indicate that the protein binds to DNA with different affinities depending on the DNA sequence. AFV1p06 is the first experimentally characterised archaeal zinc finger protein with a DNA binding activity. The AFV1p06 protein family has homologues in diverse viruses of hyperthermophilic archaea. A phylogenetic analysis points out a common origin of archaeal and eukaryotic C2H2 zinc fingers.

  5. Linear motif atlas for phosphorylation-dependent signaling

    DEFF Research Database (Denmark)

    Miller, Martin Lee; Jensen, LJ; Diella, F

    2008-01-01

    bind to them remains a challenge. NetPhorest is an atlas of consensus sequence motifs that covers 179 kinases and 104 phosphorylation-dependent binding domains [Src homology 2 (SH2), phosphotyrosine binding (PTB), BRCA1 C-terminal (BRCT), WW, and 14-3-3]. The atlas reveals new aspects of signaling...

  6. Motif statistics and spike correlations in neuronal networks

    International Nuclear Information System (INIS)

    Hu, Yu; Shea-Brown, Eric; Trousdale, James; Josić, Krešimir

    2013-01-01

    Motifs are patterns of subgraphs of complex networks. We studied the impact of such patterns of connectivity on the level of correlated, or synchronized, spiking activity among pairs of cells in a recurrent network of integrate and fire neurons. For a range of network architectures, we find that the pairwise correlation coefficients, averaged across the network, can be closely approximated using only three statistics of network connectivity. These are the overall network connection probability and the frequencies of two second order motifs: diverging motifs, in which one cell provides input to two others, and chain motifs, in which two cells are connected via a third intermediary cell. Specifically, the prevalence of diverging and chain motifs tends to increase correlation. Our method is based on linear response theory, which enables us to express spiking statistics using linear algebra, and a resumming technique, which extrapolates from second order motifs to predict the overall effect of coupling on network correlation. Our motif-based results seek to isolate the effect of network architecture perturbatively from a known network state. (paper)

  7. Annotating RNA motifs in sequences and alignments.

    Science.gov (United States)

    Gardner, Paul P; Eldai, Hisham

    2015-01-01

    RNA performs a diverse array of important functions across all cellular life. These functions include important roles in translation, building translational machinery and maturing messenger RNA. More recent discoveries include the miRNAs and bacterial sRNAs that regulate gene expression, the thermosensors, riboswitches and other cis-regulatory elements that help prokaryotes sense their environment and eukaryotic piRNAs that suppress transposition. However, there can be a long period between the initial discovery of a RNA and determining its function. We present a bioinformatic approach to characterize RNA motifs, which are critical components of many RNA structure-function relationships. These motifs can, in some instances, provide researchers with functional hypotheses for uncharacterized RNAs. Moreover, we introduce a new profile-based database of RNA motifs--RMfam--and illustrate some applications for investigating the evolution and functional characterization of RNA. All the data and scripts associated with this work are available from: https://github.com/ppgardne/RMfam. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  8. SiteBinder: an improved approach for comparing multiple protein structural motifs.

    Science.gov (United States)

    Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav

    2012-02-27

    There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.

  9. An SVD-based comparison of nine whole eukaryotic genomes supports a coelomate rather than ecdysozoan lineage

    Directory of Open Access Journals (Sweden)

    Stuart Gary W

    2004-12-01

    Full Text Available Abstract Background Eukaryotic whole genome sequences are accumulating at an impressive rate. Effective methods for comparing multiple whole eukaryotic genomes on a large scale are needed. Most attempted solutions involve the production of large scale alignments, and many of these require a high stringency pre-screen for putative orthologs in order to reduce the effective size of the dataset and provide a reasonably high but unknown fraction of correctly aligned homologous sites for comparison. As an alternative, highly efficient methods that do not require the pre-alignment of operationally defined orthologs are also being explored. Results A non-alignment method based on the Singular Value Decomposition (SVD was used to compare the predicted protein complement of nine whole eukaryotic genomes ranging from yeast to man. This analysis resulted in the simultaneous identification and definition of a large number of well conserved motifs and gene families, and produced a species tree supporting one of two conflicting hypotheses of metazoan relationships. Conclusions Our SVD-based analysis of the entire protein complement of nine whole eukaryotic genomes suggests that highly conserved motifs and gene families can be identified and effectively compared in a single coherent definition space for the easy extraction of gene and species trees. While this occurs without the explicit definition of orthologs or homologous sites, the analysis can provide a basis for these definitions.

  10. Structural modelling and phylogenetic analyses of PgeIF4A2 (Eukaryotic translation initiation factor) from Pennisetum glaucum reveal signature motifs with a role in stress tolerance and development.

    Science.gov (United States)

    Agarwal, Aakrati; Mudgil, Yashwanti; Pandey, Saurabh; Fartyal, Dhirendra; Reddy, Malireddy K

    2016-01-01

    Eukaryotic translation initiation factor 4A (eIF4A) is an indispensable component of the translation machinery and also play a role in developmental processes and stress alleviation in plants and animals. Different eIF4A isoforms are present in the cytosol of the cell, namely, eIF4A1, eIF4A2, and eIF4A3 and their expression is tightly regulated in cap-dependent translation. We revealed the structural model of PgeIF4A2 protein using the crystal structure of Homo sapiens eIF4A3 (PDB ID: 2J0S) as template by Modeller 9.12. The resultant PgeIF4A2 model structure was refined by PROCHECK, ProSA, Verify3D and RMSD that showed the model structure is reliable with 77 % amino acid sequence identity with template. Investigation revealed two conserved signatures for ATP-dependent RNA Helicase DEAD-box conserved site (VLDEADEML) and RNA helicase DEAD-box type, Q-motif in sheet-turn-helix and α-helical region respectively. All these conserved motifs are responsible for response during developmental stages and stress tolerance in plants.

  11. Novel core promoter elements and a cognate transcription factor in the divergent unicellular eukaryote Trichomonas vaginalis.

    Science.gov (United States)

    Smith, Alias J; Chudnovsky, Lorissa; Simoes-Barbosa, Augusto; Delgadillo-Correa, Maria G; Jonsson, Zophonias O; Wohlschlegel, James A; Johnson, Patricia J

    2011-04-01

    A highly conserved DNA initiator (Inr) element has been the only core promoter element described in the divergent unicellular eukaryote Trichomonas vaginalis, although genome analyses reveal that only ∼75% of protein-coding genes appear to contain an Inr. In search of another core promoter element(s), a nonredundant database containing 5' untranslated regions of expressed T. vaginalis genes was searched for overrepresented DNA motifs and known eukaryotic core promoter elements. In addition to identifying the Inr, two elements that lack sequence similarity to the known protein-coding gene core promoter, motif 3 (M3) and motif 5 (M5), were identified. Mutational and functional analyses demonstrate that both are novel core promoter elements. M3 [(A/G/T)(A/G)C(G/C)G(T/C)T(T/A/G)] resembles a Myb recognition element (MRE) and is bound specifically by a unique protein with a Myb-like DNA binding domain. The M5 element (CCTTT) overlaps the transcription start site and replaces the Inr as an alternative, gene-specific initiator element. Transcription specifically initiates at the second cytosine within M5, in contrast to characteristic initiation by RNA polymerase II at an adenosine. In promoters that combine M3 with either M5 or Inr, transcription initiation is regulated by the M3 motif.

  12. Novel Core Promoter Elements and a Cognate Transcription Factor in the Divergent Unicellular Eukaryote Trichomonas vaginalis▿

    Science.gov (United States)

    Smith, Alias J.; Chudnovsky, Lorissa; Simoes-Barbosa, Augusto; Delgadillo-Correa, Maria G.; Jonsson, Zophonias O.; Wohlschlegel, James A.; Johnson, Patricia J.

    2011-01-01

    A highly conserved DNA initiator (Inr) element has been the only core promoter element described in the divergent unicellular eukaryote Trichomonas vaginalis, although genome analyses reveal that only ∼75% of protein-coding genes appear to contain an Inr. In search of another core promoter element(s), a nonredundant database containing 5′ untranslated regions of expressed T. vaginalis genes was searched for overrepresented DNA motifs and known eukaryotic core promoter elements. In addition to identifying the Inr, two elements that lack sequence similarity to the known protein-coding gene core promoter, motif 3 (M3) and motif 5 (M5), were identified. Mutational and functional analyses demonstrate that both are novel core promoter elements. M3 [(A/G/T)(A/G)C(G/C)G(T/C)T(T/A/G)] resembles a Myb recognition element (MRE) and is bound specifically by a unique protein with a Myb-like DNA binding domain. The M5 element (CCTTT) overlaps the transcription start site and replaces the Inr as an alternative, gene-specific initiator element. Transcription specifically initiates at the second cytosine within M5, in contrast to characteristic initiation by RNA polymerase II at an adenosine. In promoters that combine M3 with either M5 or Inr, transcription initiation is regulated by the M3 motif. PMID:21245378

  13. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    Science.gov (United States)

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with tandem CYR motifs, has endo- and exonuclease activities against abasic site and other types of base damage. PALF accumulates rapidly at single-strand breaks in a poly(ADP-ribose) polymerase 1 (PARP1)-dependent manner in human cells. Indeed, PALF interacts directly with PARP1 and is required for its activation and for cellular resistance to methyl-methane sulfonate. PALF also interacts directly with KU86, LIGASEIV and phosphorylated XRCC4 proteins and possesses endo/exonuclease activity at protruding DNA ends. Various treatments that produce double-strand breaks induce formation of PALF foci, which fully coincide with γH2AX foci. Thus, PALF and the CYR motif may play important roles in DNA repair of higher eukaryotes. PMID:17396150

  14. Deciphering functional glycosaminoglycan motifs in development.

    Science.gov (United States)

    Townley, Robert A; Bülow, Hannes E

    2018-03-23

    Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby repsonsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.

  15. Constraints and consequences of the emergence of amino acid repeats in eukaryotic proteins.

    Science.gov (United States)

    Chavali, Sreenivas; Chavali, Pavithra L; Chalancon, Guilhem; de Groot, Natalia Sanchez; Gemayel, Rita; Latysheva, Natasha S; Ing-Simmons, Elizabeth; Verstrepen, Kevin J; Balaji, Santhanam; Babu, M Madan

    2017-09-01

    Proteins with amino acid homorepeats have the potential to be detrimental to cells and are often associated with human diseases. Why, then, are homorepeats prevalent in eukaryotic proteomes? In yeast, homorepeats are enriched in proteins that are essential and pleiotropic and that buffer environmental insults. The presence of homorepeats increases the functional versatility of proteins by mediating protein interactions and facilitating spatial organization in a repeat-dependent manner. During evolution, homorepeats are preferentially retained in proteins with stringent proteostasis, which might minimize repeat-associated detrimental effects such as unregulated phase separation and protein aggregation. Their presence facilitates rapid protein divergence through accumulation of amino acid substitutions, which often affect linear motifs and post-translational-modification sites. These substitutions may result in rewiring protein interaction and signaling networks. Thus, homorepeats are distinct modules that are often retained in stringently regulated proteins. Their presence facilitates rapid exploration of the genotype-phenotype landscape of a population, thereby contributing to adaptation and fitness.

  16. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    OpenAIRE

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with...

  17. Distinct gene number-genome size relationships for eukaryotes and non-eukaryotes: gene content estimation for dinoflagellate genomes.

    Directory of Open Access Journals (Sweden)

    Yubo Hou

    Full Text Available The ability to predict gene content is highly desirable for characterization of not-yet sequenced genomes like those of dinoflagellates. Using data from completely sequenced and annotated genomes from phylogenetically diverse lineages, we investigated the relationship between gene content and genome size using regression analyses. Distinct relationships between log(10-transformed protein-coding gene number (Y' versus log(10-transformed genome size (X', genome size in kbp were found for eukaryotes and non-eukaryotes. Eukaryotes best fit a logarithmic model, Y' = ln(-46.200+22.678X', whereas non-eukaryotes a linear model, Y' = 0.045+0.977X', both with high significance (p0.91. Total gene number shows similar trends in both groups to their respective protein coding regressions. The distinct correlations reflect lower and decreasing gene-coding percentages as genome size increases in eukaryotes (82%-1% compared to higher and relatively stable percentages in prokaryotes and viruses (97%-47%. The eukaryotic regression models project that the smallest dinoflagellate genome (3x10(6 kbp contains 38,188 protein-coding (40,086 total genes and the largest (245x10(6 kbp 87,688 protein-coding (92,013 total genes, corresponding to 1.8% and 0.05% gene-coding percentages. These estimates do not likely represent extraordinarily high functional diversity of the encoded proteome but rather highly redundant genomes as evidenced by high gene copy numbers documented for various dinoflagellate species.

  18. Pol II promoter prediction using characteristic 4-mer motifs: a machine learning approach

    Directory of Open Access Journals (Sweden)

    Shoyaib Mohammad

    2008-10-01

    Full Text Available Abstract Background Eukaryotic promoter prediction using computational analysis techniques is one of the most difficult jobs in computational genomics that is essential for constructing and understanding genetic regulatory networks. The increased availability of sequence data for various eukaryotic organisms in recent years has necessitated for better tools and techniques for the prediction and analysis of promoters in eukaryotic sequences. Many promoter prediction methods and tools have been developed to date but they have yet to provide acceptable predictive performance. One obvious criteria to improve on current methods is to devise a better system for selecting appropriate features of promoters that distinguish them from non-promoters. Secondly improved performance can be achieved by enhancing the predictive ability of the machine learning algorithms used. Results In this paper, a novel approach is presented in which 128 4-mer motifs in conjunction with a non-linear machine-learning algorithm utilising a Support Vector Machine (SVM are used to distinguish between promoter and non-promoter DNA sequences. By applying this approach to plant, Drosophila, human, mouse and rat sequences, the classification model has showed 7-fold cross-validation percentage accuracies of 83.81%, 94.82%, 91.25%, 90.77% and 82.35% respectively. The high sensitivity and specificity value of 0.86 and 0.90 for plant; 0.96 and 0.92 for Drosophila; 0.88 and 0.92 for human; 0.78 and 0.84 for mouse and 0.82 and 0.80 for rat demonstrate that this technique is less prone to false positive results and exhibits better performance than many other tools. Moreover, this model successfully identifies location of promoter using TATA weight matrix. Conclusion The high sensitivity and specificity indicate that 4-mer frequencies in conjunction with supervised machine-learning methods can be beneficial in the identification of RNA pol II promoters comparative to other methods. This

  19. CompariMotif: quick and easy comparisons of sequence motifs.

    Science.gov (United States)

    Edwards, Richard J; Davey, Norman E; Shields, Denis C

    2008-05-15

    CompariMotif is a novel tool for making motif-motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs. CompariMotif can be run online at http://bioware.ucd.ie/ and is freely available for academic use as a set of open source Python modules under a GNU General Public License from http://bioinformatics.ucd.ie/shields/software/comparimotif/

  20. The conservation pattern of short linear motifs is highly correlated with the function of interacting protein domains

    Directory of Open Access Journals (Sweden)

    Wang Yiguo

    2008-10-01

    Full Text Available Abstract Background Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs. Accurate prediction of SLiMs has been difficult because they are short (often Results Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extra-cellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. Conclusion The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. Combining scoring matrixes derived from peptide libraries and conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains.

  1. The B7-1 cytoplasmic tail enhances intracellular transport and mammalian cell surface display of chimeric proteins in the absence of a linear ER export motif.

    Directory of Open Access Journals (Sweden)

    Yi-Chieh Lin

    Full Text Available Membrane-tethered proteins (mammalian surface display are increasingly being used for novel therapeutic and biotechnology applications. Maximizing surface expression of chimeric proteins on mammalian cells is important for these applications. We show that the cytoplasmic domain from the B7-1 antigen, a commonly used element for mammalian surface display, can enhance the intracellular transport and surface display of chimeric proteins in a Sar1 and Rab1 dependent fashion. However, mutational, alanine scanning and deletion analysis demonstrate the absence of linear ER export motifs in the B7 cytoplasmic domain. Rather, efficient intracellular transport correlated with the presence of predicted secondary structure in the cytoplasmic tail. Examination of the cytoplasmic domains of 984 human and 782 mouse type I transmembrane proteins revealed that many previously identified ER export motifs are rarely found in the cytoplasmic tail of type I transmembrane proteins. Our results suggest that efficient intracellular transport of B7 chimeric proteins is associated with the structure rather than to the presence of a linear ER export motif in the cytoplasmic tail, and indicate that short (less than ~ 10-20 amino acids and unstructured cytoplasmic tails should be avoided to express high levels of chimeric proteins on mammalian cells.

  2. Phylogenetic analysis of ferlin genes reveals ancient eukaryotic origins

    Directory of Open Access Journals (Sweden)

    Lek Monkol

    2010-07-01

    Full Text Available Abstract Background The ferlin gene family possesses a rare and identifying feature consisting of multiple tandem C2 domains and a C-terminal transmembrane domain. Much currently remains unknown about the fundamental function of this gene family, however, mutations in its two most well-characterised members, dysferlin and otoferlin, have been implicated in human disease. The availability of genome sequences from a wide range of species makes it possible to explore the evolution of the ferlin family, providing contextual insight into characteristic features that define the ferlin gene family in its present form in humans. Results Ferlin genes were detected from all species of representative phyla, with two ferlin subgroups partitioned within the ferlin phylogenetic tree based on the presence or absence of a DysF domain. Invertebrates generally possessed two ferlin genes (one with DysF and one without, with six ferlin genes in most vertebrates (three DysF, three non-DysF. Expansion of the ferlin gene family is evident between the divergence of lamprey (jawless vertebrates and shark (cartilaginous fish. Common to almost all ferlins is an N-terminal C2-FerI-C2 sandwich, a FerB motif, and two C-terminal C2 domains (C2E and C2F adjacent to the transmembrane domain. Preservation of these structural elements throughout eukaryotic evolution suggests a fundamental role of these motifs for ferlin function. In contrast, DysF, C2DE, and FerA are optional, giving rise to subtle differences in domain topologies of ferlin genes. Despite conservation of multiple C2 domains in all ferlins, the C-terminal C2 domains (C2E and C2F displayed higher sequence conservation and greater conservation of putative calcium binding residues across paralogs and orthologs. Interestingly, the two most studied non-mammalian ferlins (Fer-1 and Misfire in model organisms C. elegans and D. melanogaster, present as outgroups in the phylogenetic analysis, with results suggesting

  3. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Christian J. Michel

    2017-12-01

    Full Text Available A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C 3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X , using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X , in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level, and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R . We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions. This property is true for all cardinalities of X motifs (from 4 to 20 and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non- X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together

  4. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

    2017-12-03

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first

  5. Characterizing Motif Dynamics of Electric Brain Activity Using Symbolic Analysis

    Directory of Open Access Journals (Sweden)

    Massimiliano Zanin

    2014-10-01

    Full Text Available Motifs are small recurring circuits of interactions which constitute the backbone of networked systems. Characterizing motif dynamics is therefore key to understanding the functioning of such systems. Here we propose a method to define and quantify the temporal variability and time scales of electroencephalogram (EEG motifs of resting brain activity. Given a triplet of EEG sensors, links between them are calculated by means of linear correlation; each pattern of links (i.e., each motif is then associated to a symbol, and its appearance frequency is analyzed by means of Shannon entropy. Our results show that each motif becomes observable with different coupling thresholds and evolves at its own time scale, with fronto-temporal sensors emerging at high thresholds and changing at fast time scales, and parietal ones at low thresholds and changing at slower rates. Finally, while motif dynamics differed across individuals, for each subject, it showed robustness across experimental conditions, indicating that it could represent an individual dynamical signature.

  6. Use of Host-like Peptide Motifs in Viral Proteins Is a Prevalent Strategy in Host-Virus Interactions

    Directory of Open Access Journals (Sweden)

    Tzachi Hagai

    2014-06-01

    Full Text Available Viruses interact extensively with host proteins, but the mechanisms controlling these interactions are not well understood. We present a comprehensive analysis of eukaryotic linear motifs (ELMs in 2,208 viral genomes and reveal that viruses exploit molecular mimicry of host-like ELMs to possibly assist in host-virus interactions. Using a statistical genomics approach, we identify a large number of potentially functional ELMs and observe that the occurrence of ELMs is often evolutionarily conserved but not uniform across virus families. Some viral proteins contain multiple types of ELMs, in striking similarity to complex regulatory modules in host proteins, suggesting that ELMs may act combinatorially to assist viral replication. Furthermore, a simple evolutionary model suggests that the inherent structural simplicity of ELMs often enables them to tolerate mutations and evolve quickly. Our findings suggest that ELMs may allow fast rewiring of host-virus interactions, which likely assists rapid viral evolution and adaptation to diverse environments.

  7. MotifNet: a web-server for network motif analysis.

    Science.gov (United States)

    Smoly, Ilan Y; Lerman, Eugene; Ziv-Ukelson, Michal; Yeger-Lotem, Esti

    2017-06-15

    Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. MotifNet is freely available at http://netbio.bgu.ac.il/motifnet . The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  8. The independent prokaryotic origins of eukaryotic fructose-1, 6-bisphosphatase and sedoheptulose-1, 7-bisphosphatase and the implications of their origins for the evolution of eukaryotic Calvin cycle

    Directory of Open Access Journals (Sweden)

    Jiang Yong-Hai

    2012-10-01

    Full Text Available Abstract Background In the Calvin cycle of eubacteria, the dephosphorylations of both fructose-1, 6-bisphosphate (FBP and sedoheptulose-1, 7-bisphosphate (SBP are catalyzed by the same bifunctional enzyme: fructose-1, 6-bisphosphatase/sedoheptulose-1, 7-bisphosphatase (F/SBPase, while in that of eukaryotic chloroplasts by two distinct enzymes: chloroplastic fructose-1, 6-bisphosphatase (FBPase and sedoheptulose-1, 7-bisphosphatase (SBPase, respectively. It was proposed that these two eukaryotic enzymes arose from the divergence of a common ancestral eubacterial bifunctional F/SBPase of mitochondrial origin. However, no specific affinity between SBPase and eubacterial FBPase or F/SBPase can be observed in the previous phylogenetic analyses, and it is hard to explain why SBPase and/or F/SBPase are/is absent from most extant nonphotosynthetic eukaryotes according to this scenario. Results Domain analysis indicated that eubacterial F/SBPase of two different resources contain distinct domains: proteobacterial F/SBPases contain typical FBPase domain, while cyanobacterial F/SBPases possess FBPase_glpX domain. Therefore, like prokaryotic FBPase, eubacterial F/SBPase can also be divided into two evolutionarily distant classes (Class I and II. Phylogenetic analysis based on a much larger taxonomic sampling than previous work revealed that all eukaryotic SBPase cluster together and form a close sister group to the clade of epsilon-proteobacterial Class I FBPase which are gluconeogenesis-specific enzymes, while all eukaryotic chloroplast FBPase group together with eukaryotic cytosolic FBPase and form another distinct clade which then groups with the Class I FBPase of diverse eubacteria. Motif analysis of these enzymes also supports these phylogenetic correlations. Conclusions There are two evolutionarily distant classes of eubacterial bifunctional F/SBPase. Eukaryotic FBPase and SBPase do not diverge from either of them but have two independent origins

  9. Feedback loops and reciprocal regulation: recurring motifs in the systems biology of the cell cycle

    OpenAIRE

    Ferrell, James E.

    2013-01-01

    The study of eukaryotic cell cycle regulation over the last several decades has led to a remarkably detailed understanding of the complex regulatory system that drives this fundamental process. This allows us to now look for recurring motifs in the regulatory system. Among these are negative feedback loops, which underpin checkpoints and generate cell cycle oscillations; positive feedback loops, which promote oscillations and make cell cycle transitions switch-like and unidirectional; and rec...

  10. Mutational analysis of the RecJ exonuclease of Escherichia coli: identification of phosphoesterase motifs.

    Science.gov (United States)

    Sutera, V A; Han, E S; Rajman, L A; Lovett, S T

    1999-10-01

    The recJ gene, identified in Escherichia coli, encodes a Mg(+2)-dependent 5'-to-3' exonuclease with high specificity for single-strand DNA. Genetic and biochemical experiments implicate RecJ exonuclease in homologous recombination, base excision, and methyl-directed mismatch repair. Genes encoding proteins with strong similarities to RecJ have been found in every eubacterial genome sequenced to date, with the exception of Mycoplasma and Mycobacterium tuberculosis. Multiple genes encoding proteins similar to RecJ are found in some eubacteria, including Bacillus and Helicobacter, and in the archaea. Among this divergent set of sequences, seven conserved motifs emerge. We demonstrate here that amino acids within six of these motifs are essential for both the biochemical and genetic functions of E. coli RecJ. These motifs may define interactions with Mg(2+) ions or substrate DNA. A large family of proteins more distantly related to RecJ is present in archaea, eubacteria, and eukaryotes, including a hypothetical protein in the MgPa adhesin operon of Mycoplasma, a domain of putative polyA polymerases in Synechocystis and Aquifex, PRUNE of Drosophila, and an exopolyphosphatase (PPX1) of Saccharomyces cereviseae. Because these six RecJ motifs are shared between exonucleases and exopolyphosphatases, they may constitute an ancient phosphoesterase domain now found in all kingdoms of life.

  11. Identification and characterization of a selenoprotein family containing a diselenide bond in a redox motif

    Science.gov (United States)

    Shchedrina, Valentina A.; Novoselov, Sergey V.; Malinouski, Mikalai Yu.; Gladyshev, Vadim N.

    2007-01-01

    Selenocysteine (Sec, U) insertion into proteins is directed by translational recoding of specific UGA codons located upstream of a stem-loop structure known as Sec insertion sequence (SECIS) element. Selenoproteins with known functions are oxidoreductases containing a single redox-active Sec in their active sites. In this work, we identified a family of selenoproteins, designated SelL, containing two Sec separated by two other residues to form a UxxU motif. SelL proteins show an unusual occurrence, being present in diverse aquatic organisms, including fish, invertebrates, and marine bacteria. Both eukaryotic and bacterial SelL genes use single SECIS elements for insertion of two Sec. In eukaryotes, the SECIS is located in the 3′ UTR, whereas the bacterial SelL SECIS is within a coding region and positioned at a distance that supports the insertion of either of the two Sec or both of these residues. SelL proteins possess a thioredoxin-like fold wherein the UxxU motif corresponds to the catalytic CxxC motif in thioredoxins, suggesting a redox function of SelL proteins. Distantly related SelL-like proteins were also identified in a variety of organisms that had either one or both Sec replaced with Cys. Danio rerio SelL, transiently expressed in mammalian cells, incorporated two Sec and localized to the cytosol. In these cells, it occurred in an oxidized form and was not reducible by DTT. In a bacterial expression system, we directly demonstrated the formation of a diselenide bond between the two Sec, establishing it as the first diselenide bond found in a natural protein. PMID:17715293

  12. BayesMotif: de novo protein sorting motif discovery from impure datasets.

    Science.gov (United States)

    Hu, Jianjun; Zhang, Fan

    2010-01-18

    Protein sorting is the process that newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals is needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with a less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence dataset. It also shows that the false positive removal procedure can help to identify true motifs even when there is only 20% of the input sequences containing true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of

  13. The reduced kinome of Ostreococcus tauri: core eukaryotic signalling components in a tractable model species.

    Science.gov (United States)

    Hindle, Matthew M; Martin, Sarah F; Noordally, Zeenat B; van Ooijen, Gerben; Barrios-Llerena, Martin E; Simpson, T Ian; Le Bihan, Thierry; Millar, Andrew J

    2014-08-02

    The current knowledge of eukaryote signalling originates from phenotypically diverse organisms. There is a pressing need to identify conserved signalling components among eukaryotes, which will lead to the transfer of knowledge across kingdoms. Two useful properties of a eukaryote model for signalling are (1) reduced signalling complexity, and (2) conservation of signalling components. The alga Ostreococcus tauri is described as the smallest free-living eukaryote. With less than 8,000 genes, it represents a highly constrained genomic palette. Our survey revealed 133 protein kinases and 34 protein phosphatases (1.7% and 0.4% of the proteome). We conducted phosphoproteomic experiments and constructed domain structures and phylogenies for the catalytic protein-kinases. For each of the major kinases families we review the completeness and divergence of O. tauri representatives in comparison to the well-studied kinomes of the laboratory models Arabidopsis thaliana and Saccharomyces cerevisiae, and of Homo sapiens. Many kinase clades in O. tauri were reduced to a single member, in preference to the loss of family diversity, whereas TKL and ABC1 clades were expanded. We also identified kinases that have been lost in A. thaliana but retained in O. tauri. For three, contrasting eukaryotic pathways - TOR, MAPK, and the circadian clock - we established the subset of conserved components and demonstrate conserved sites of substrate phosphorylation and kinase motifs. We conclude that O. tauri satisfies our two central requirements. Several of its kinases are more closely related to H. sapiens orthologs than S. cerevisiae is to H. sapiens. The greatly reduced kinome of O. tauri is therefore a suitable model for signalling in free-living eukaryotes.

  14. The Verrucomicrobia LexA-binding Motif: Insights into the Evolutionary Dynamics of the SOS Response

    Directory of Open Access Journals (Sweden)

    Ivan Erill

    2016-07-01

    Full Text Available The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  15. The Verrucomicrobia LexA-Binding Motif: Insights into the Evolutionary Dynamics of the SOS Response.

    Science.gov (United States)

    Erill, Ivan; Campoy, Susana; Kılıç, Sefa; Barbé, Jordi

    2016-01-01

    The SOS response is the primary bacterial mechanism to address DNA damage, coordinating multiple cellular processes that include DNA repair, cell division, and translesion synthesis. In contrast to other regulatory systems, the composition of the SOS genetic network and the binding motif of its transcriptional repressor, LexA, have been shown to vary greatly across bacterial clades, making it an ideal system to study the co-evolution of transcription factors and their regulons. Leveraging comparative genomics approaches and prior knowledge on the core SOS regulon, here we define the binding motif of the Verrucomicrobia, a recently described phylum of emerging interest due to its association with eukaryotic hosts. Site directed mutagenesis of the Verrucomicrobium spinosum recA promoter confirms that LexA binds a 14 bp palindromic motif with consensus sequence TGTTC-N4-GAACA. Computational analyses suggest that recognition of this novel motif is determined primarily by changes in base-contacting residues of the third alpha helix of the LexA helix-turn-helix DNA binding motif. In conjunction with comparative genomics analysis of the LexA regulon in the Verrucomicrobia phylum, electrophoretic shift assays reveal that LexA binds to operators in the promoter region of DNA repair genes and a mutagenesis cassette in this organism, and identify previously unreported components of the SOS response. The identification of tandem LexA-binding sites generating instances of other LexA-binding motifs in the lexA gene promoter of Verrucomicrobia species leads us to postulate a novel mechanism for LexA-binding motif evolution. This model, based on gene duplication, successfully addresses outstanding questions in the intricate co-evolution of the LexA protein, its binding motif and the regulatory network it controls.

  16. Structures and short linear motif of disordered transcription factor regions provide clues to the interactome of the cellular hub radical-induced cell death1

    DEFF Research Database (Denmark)

    O'Shea, Charlotte; Staby, Lasse; Bendsen, Sidsel Krogh

    2017-01-01

    Intrinsically disordered protein regions (IDRs) lack a well-defined three-dimensional structure, but often facilitate key protein functions. Some interactions between IDRs and folded protein domains rely on short linear motifs (SLiMs). These motifs are challenging to identify, but once found can...... point to larger networks of interactions, such as with proteins that serve as hubs for essential cellular functions. The stress-associated plant protein Radical-Induced Cell Death1 (RCD1) is one such hub, interacting with many transcription factors via their flexible IDRs. To identify the SLiM bound......046 formed different structures or were fuzzy in the complexes. These findings allow us to present a model of the stress-associated RCD1-transcription factor interactome and to contribute to the emerging understanding of the interactions between folded hubs and their intrinsically disordered partners....

  17. MotifMark: Finding regulatory motifs in DNA sequences.

    Science.gov (United States)

    Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D

    2017-07-01

    The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.

  18. Identification of the divergent calmodulin binding motif in yeast Ssb1/Hsp75 protein and in other HSP70 family members.

    Science.gov (United States)

    Heinen, R C; Diniz-Mendes, L; Silva, J T; Paschoalin, V M F

    2006-11-01

    Yeast soluble proteins were fractionated by calmodulin-agarose affinity chromatography and the Ca2+/calmodulin-binding proteins were analyzed by SDS-PAGE. One prominent protein of 66 kDa was excised from the gel, digested with trypsin and the masses of the resultant fragments were determined by MALDI/MS. Twenty-one of 38 monoisotopic peptide masses obtained after tryptic digestion were matched to the heat shock protein Ssb1/Hsp75, covering 37% of its sequence. Computational analysis of the primary structure of Ssb1/Hsp75 identified a unique potential amphipathic alpha-helix in its N-terminal ATPase domain with features of target regions for Ca2+/calmodulin binding. This region, which shares 89% similarity to the experimentally determined calmodulin-binding domain from mouse, Hsc70, is conserved in near half of the 113 members of the HSP70 family investigated, from yeast to plant and animals. Based on the sequence of this region, phylogenetic analysis grouped the HSP70s in three distinct branches. Two of them comprise the non-calmodulin binding Hsp70s BIP/GR78, a subfamily of eukaryotic HSP70 localized in the endoplasmic reticulum, and DnaK, a subfamily of prokaryotic HSP70. A third heterogeneous group is formed by eukaryotic cytosolic HSP70s containing the new calmodulin-binding motif and other cytosolic HSP70s whose sequences do not conform to those conserved motif, indicating that not all eukaryotic cytosolic Hsp70s are target for calmodulin regulation. Furthermore, the calmodulin-binding domain found in eukaryotic HSP70s is also the target for binding of Bag-1 - an enhancer of ADP/ATP exchange activity of Hsp70s. A model in which calmodulin displaces Bag-1 and modulates Ssb1/Hsp75 chaperone activity is discussed.

  19. Identification of the divergent calmodulin binding motif in yeast Ssb1/Hsp75 protein and in other HSP70 family members

    Directory of Open Access Journals (Sweden)

    R.C. Heinen

    2006-11-01

    Full Text Available Yeast soluble proteins were fractionated by calmodulin-agarose affinity chromatography and the Ca2+/calmodulin-binding proteins were analyzed by SDS-PAGE. One prominent protein of 66 kDa was excised from the gel, digested with trypsin and the masses of the resultant fragments were determined by MALDI/MS. Twenty-one of 38 monoisotopic peptide masses obtained after tryptic digestion were matched to the heat shock protein Ssb1/Hsp75, covering 37% of its sequence. Computational analysis of the primary structure of Ssb1/Hsp75 identified a unique potential amphipathic alpha-helix in its N-terminal ATPase domain with features of target regions for Ca2+/calmodulin binding. This region, which shares 89% similarity to the experimentally determined calmodulin-binding domain from mouse, Hsc70, is conserved in near half of the 113 members of the HSP70 family investigated, from yeast to plant and animals. Based on the sequence of this region, phylogenetic analysis grouped the HSP70s in three distinct branches. Two of them comprise the non-calmodulin binding Hsp70s BIP/GR78, a subfamily of eukaryotic HSP70 localized in the endoplasmic reticulum, and DnaK, a subfamily of prokaryotic HSP70. A third heterogeneous group is formed by eukaryotic cytosolic HSP70s containing the new calmodulin-binding motif and other cytosolic HSP70s whose sequences do not conform to those conserved motif, indicating that not all eukaryotic cytosolic Hsp70s are target for calmodulin regulation. Furthermore, the calmodulin-binding domain found in eukaryotic HSP70s is also the target for binding of Bag-1 - an enhancer of ADP/ATP exchange activity of Hsp70s. A model in which calmodulin displaces Bag-1 and modulates Ssb1/Hsp75 chaperone activity is discussed.

  20. Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2) predicts novel potential therapeutic epitopes

    DEFF Research Database (Denmark)

    Deng, Xiaohong; Zheng, Xuxu; Yang, Huanming

    2014-01-01

    druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our...

  1. Identification of multiple distinct Snf2 subfamilies with conserved structural motifs.

    Science.gov (United States)

    Flaus, Andrew; Martin, David M A; Barton, Geoffrey J; Owen-Hughes, Tom

    2006-01-01

    The Snf2 family of helicase-related proteins includes the catalytic subunits of ATP-dependent chromatin remodelling complexes found in all eukaryotes. These act to regulate the structure and dynamic properties of chromatin and so influence a broad range of nuclear processes. We have exploited progress in genome sequencing to assemble a comprehensive catalogue of over 1300 Snf2 family members. Multiple sequence alignment of the helicase-related regions enables 24 distinct subfamilies to be identified, a considerable expansion over earlier surveys. Where information is known, there is a good correlation between biological or biochemical function and these assignments, suggesting Snf2 family motor domains are tuned for specific tasks. Scanning of complete genomes reveals all eukaryotes contain members of multiple subfamilies, whereas they are less common and not ubiquitous in eubacteria or archaea. The large sample of Snf2 proteins enables additional distinguishing conserved sequence blocks within the helicase-like motor to be identified. The establishment of a phylogeny for Snf2 proteins provides an opportunity to make informed assignments of function, and the identification of conserved motifs provides a framework for understanding the mechanisms by which these proteins function.

  2. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2018-03-11

    Short Linear Motifs (SLiMs) contribute to almost every cellular function by connecting appropriate protein partners. Accurate prediction of SLiMs is difficult due to their shortness and sequence degeneracy. Leucine-aspartic acid (LD) motifs are SLiMs that link paxillin family proteins to factors controlling (cancer) cell adhesion, motility and survival. The existence and importance of LD motifs beyond the paxillin family is poorly understood. To enable a proteome-wide assessment of these motifs, we developed an active-learning based framework that iteratively integrates computational predictions with experimental validation. Our analysis of the human proteome identified a dozen proteins that contain LD motifs, all being involved in cell adhesion and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter-species comparison revealed a conserved LD signalling core, and reveals the emergence of species-specific adaptive connections, while maintaining a strong functional focus of the LD motif interactome. Collectively, our data elucidate the mechanisms underlying the origin and adaptation of an ancestral SLiM.

  3. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Directory of Open Access Journals (Sweden)

    Jie Zhu

    Full Text Available DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  4. Accurate Quantification of microRNA via Single Strand Displacement Reaction on DNA Origami Motif

    Science.gov (United States)

    Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs. PMID:23990889

  5. Accurate quantification of microRNA via single strand displacement reaction on DNA origami motif.

    Science.gov (United States)

    Zhu, Jie; Feng, Xiaolu; Lou, Jingyu; Li, Weidong; Li, Sheng; Zhu, Hongxin; Yang, Lun; Zhang, Aiping; He, Lin; Li, Can

    2013-01-01

    DNA origami is an emerging technology that assembles hundreds of staple strands and one single-strand DNA into certain nanopattern. It has been widely used in various fields including detection of biological molecules such as DNA, RNA and proteins. MicroRNAs (miRNAs) play important roles in post-transcriptional gene repression as well as many other biological processes such as cell growth and differentiation. Alterations of miRNAs' expression contribute to many human diseases. However, it is still a challenge to quantitatively detect miRNAs by origami technology. In this study, we developed a novel approach based on streptavidin and quantum dots binding complex (STV-QDs) labeled single strand displacement reaction on DNA origami to quantitatively detect the concentration of miRNAs. We illustrated a linear relationship between the concentration of an exemplary miRNA as miRNA-133 and the STV-QDs hybridization efficiency; the results demonstrated that it is an accurate nano-scale miRNA quantifier motif. In addition, both symmetrical rectangular motif and asymmetrical China-map motif were tested. With significant linearity in both motifs, our experiments suggested that DNA Origami motif with arbitrary shape can be utilized in this method. Since this DNA origami-based method we developed owns the unique advantages of simple, time-and-material-saving, potentially multi-targets testing in one motif and relatively accurate for certain impurity samples as counted directly by atomic force microscopy rather than fluorescence signal detection, it may be widely used in quantification of miRNAs.

  6. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

    Science.gov (United States)

    Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

    2013-01-01

    The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545

  7. A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.

    Science.gov (United States)

    Tran, Ngoc Tam L; Huang, Chun-Hsi

    2014-02-20

    ChIP-Seq (chromatin immunoprecipitation sequencing) has provided the advantage for finding motifs as ChIP-Seq experiments narrow down the motif finding to binding site locations. Recent motif finding tools facilitate the motif detection by providing user-friendly Web interface. In this work, we reviewed nine motif finding Web tools that are capable for detecting binding site motifs in ChIP-Seq data. We showed each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommended the users to use multiple motif finding Web tools that implement different algorithms for obtaining significant motifs, overlapping resemble motifs, and non-overlapping motifs. Finally, we provided our suggestions for future development of motif finding Web tool that better assists researchers for finding motifs in ChIP-Seq data.

  8. Communities of microbial eukaryotes in the mammalian gut within the context of environmental eukaryotic diversity

    Energy Technology Data Exchange (ETDEWEB)

    Parfrey, Laura Wegener; Walters, William A.; Lauber, Christian L.; Clemente, Jose C.; Berg-Lyons, Donna; Teiling, Clotilde; Kodira, Chinnappa; Mohiuddin, Mohammed; Brunelle, Julie; Driscoll, Mark; Fierer, Noah; Gilbert, Jack A.; Knight, Rob

    2014-06-19

    Eukaryotic microbes (protists) residing in the vertebrate gut influence host health and disease, but their diversity and distribution in healthy hosts is poorly understood. Protists found in the gut are typically considered parasites, but many are commensal and some are beneficial. Further, the hygiene hypothesis predicts that association with our co-evolved microbial symbionts may be important to overall health. It is therefore imperative that we understand the normal diversity of our eukaryotic gut microbiota to test for such effects and avoid eliminating commensal organisms. We assembled a dataset of healthy individuals from two populations, one with traditional, agrarian lifestyles and a second with modern, westernized lifestyles, and characterized the human eukaryotic microbiota via high-throughput sequencing. To place the human gut microbiota within a broader context our dataset also includes gut samples from diverse mammals and samples from other aquatic and terrestrial environments. We curated the SILVA ribosomal database to reflect current knowledge of eukaryotic taxonomy and employ it as a phylogenetic framework to compare eukaryotic diversity across environment. We show that adults from the non-western population harbor a diverse community of protists, and diversity in the human gut is comparable to that in other mammals. However, the eukaryotic microbiota of the western population appears depauperate. The distribution of symbionts found in mammals reflects both host phylogeny and diet. Eukaryotic microbiota in the gut are less diverse and more patchily distributed than bacteria. More broadly, we show that eukaryotic communities in the gut are less diverse than in aquatic and terrestrial habitats, and few taxa are shared across habitat types, and diversity patterns of eukaryotes are correlated with those observed for bacteria. These results outline the distribution and diversity of microbial eukaryotic communities in the mammalian gut and across

  9. Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

  10. Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets.

    Science.gov (United States)

    Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon

    2012-01-01

    To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.

  11. Motif-role-fingerprints: the building-blocks of motifs, clustering-coefficients and transitivities in directed networks.

    Directory of Open Access Journals (Sweden)

    Mark D McDonnell

    Full Text Available Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs and 'functional' (partial subgraphs. Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.

  12. MicroRNA categorization using sequence motifs and k-mers.

    Science.gov (United States)

    Yousef, Malik; Khalifa, Waleed; Acar, İlhan Erkin; Allmer, Jens

    2017-03-14

    Post-transcriptional gene dysregulation can be a hallmark of diseases like cancer and microRNAs (miRNAs) play a key role in the modulation of translation efficiency. Known pre-miRNAs are listed in miRBase, and they have been discovered in a variety of organisms ranging from viruses and microbes to eukaryotic organisms. The computational detection of pre-miRNAs is of great interest, and such approaches usually employ machine learning to discriminate between miRNAs and other sequences. Many features have been proposed describing pre-miRNAs, and we have previously introduced the use of sequence motifs and k-mers as useful ones. There have been reports of xeno-miRNAs detected via next generation sequencing. However, they may be contaminations and to aid that important decision-making process, we aimed to establish a means to differentiate pre-miRNAs from different species. To achieve distinction into species, we used one species' pre-miRNAs as the positive and another species' pre-miRNAs as the negative training and test data for the establishment of machine learned models based on sequence motifs and k-mers as features. This approach resulted in higher accuracy values between distantly related species while species with closer relation produced lower accuracy values. We were able to differentiate among species with increasing success when the evolutionary distance increases. This conclusion is supported by previous reports of fast evolutionary changes in miRNAs since even in relatively closely related species a fairly good discrimination was possible.

  13. The MHC motif viewer: a visualization tool for MHC binding motifs

    DEFF Research Database (Denmark)

    Rapin, Nicolas; Hoof, Ilka; Lund, Ole

    2010-01-01

    is hampered by the lack of tools for browsing and comparing specificity of these molecules. We have developed a Web server, MHC Motif Viewer, which allows the display of the binding motif for MHC class I proteins for human, chimpanzee, rhesus monkey, mouse, and swine, as well as HLA-DR protein sequences...

  14. Structural insight into RNA recognition motifs: versatile molecular Lego building blocks for biological systems.

    Science.gov (United States)

    Muto, Yutaka; Yokoyama, Shigeyuki

    2012-01-01

    'RNA recognition motifs (RRMs)' are common domain-folds composed of 80-90 amino-acid residues in eukaryotes, and have been identified in many cellular proteins. At first they were known as RNA binding domains. Through discoveries over the past 20 years, however, the RRMs have been shown to exhibit versatile molecular recognition activities and to behave as molecular Lego building blocks to construct biological systems. Novel RNA/protein recognition modes by RRMs are being identified, and more information about the molecular recognition by RRMs is becoming available. These RNA/protein recognition modes are strongly correlated with their biological significance. In this review, we would like to survey the recent progress on these versatile molecular recognition modules. Copyright © 2012 John Wiley & Sons, Ltd.

  15. Kopi dan Kakao dalam Kreasi Motif Batik Khas Jember

    Directory of Open Access Journals (Sweden)

    Irfa'ina Rohana Salma

    2015-06-01

    Full Text Available ABSTRAK Batik Jember selama ini identik dengan motif daun tembakau. Visualisasi daun tembakau dalam motif Batik Jember cukup lemah, yaitu kurang berkarakter karena motif yang muncul adalah seperti gambar daun pada umumnya. Oleh karena itu perlu diciptakan desain motif batik khas Jember yang sumber inspirasinya digali dari kekayaan alam lainnya dari Jember yang mempunyai bentuk spesifik dan karakteristik sehingga identitas motif bisa didapatkan dengan lebih kuat. Hasil alam khas Jember tersebut adalah kopi dan kakao. Tujuan penciptaan seni ini adalah untuk menghasilkan motif batik  baru yang mempunyai ciri khas Jember. Metode yang digunakan yaitu pengumpulan data, pengamatan mendalam terhadap objek penciptaan, pengkajian sumber inspirasi, pembuatan desain motif, dan perwujudan menjadi batik. Dari penciptaan seni ini berhasil dikreasikan 6 (enam motif batik yaitu: (1 Motif Uwoh Kopi; (2 Motif Godong Kopi;  (3 Motif Ceplok Kakao; (4 Motif Kakao Raja; (5 Motif Kakao Biru; dan (6 Motif Wiji Mukti. Berdasarkan hasil penilaian “Selera Estetika” diketahui bahwa motif yang paling banyak disukai adalah Motif Uwoh Kopi dan Motif Kakao Raja. Kata kunci: Motif Woh Kopi, Motif Godong Kopi, Motif Ceplok Kakao, Motif Kakao Raja, Motif Kakao Biru, Motif Wiji Mukti ABSTRACTBatik Jember is synonymous with tobacco leaf motif. Tobacco leaf shape is quite weak in the visual appearance characterized as that motif emerges like a picture of leaves in general. Therefore, it is necessary to create a distinctive design motif extracted from other natural resources of Jember that have specific shapes and characteristics that can be obtained as the stronger motif identity. The typical natural resources from Jember are coffee and cocoa. The purpose of the creation of this art is to produce the unique, creative and innovative batik and have specific characteristics of Jember. The method used are data collection, observation of the object, reviewing inspiration sources

  16. CombiMotif: A new algorithm for network motifs discovery in protein-protein interaction networks

    Science.gov (United States)

    Luo, Jiawei; Li, Guanghui; Song, Dan; Liang, Cheng

    2014-12-01

    Discovering motifs in protein-protein interaction networks is becoming a current major challenge in computational biology, since the distribution of the number of network motifs can reveal significant systemic differences among species. However, this task can be computationally expensive because of the involvement of graph isomorphic detection. In this paper, we present a new algorithm (CombiMotif) that incorporates combinatorial techniques to count non-induced occurrences of subgraph topologies in the form of trees. The efficiency of our algorithm is demonstrated by comparing the obtained results with the current state-of-the art subgraph counting algorithms. We also show major differences between unicellular and multicellular organisms. The datasets and source code of CombiMotif are freely available upon request.

  17. MSDmotif: exploring protein sites and motifs

    Directory of Open Access Journals (Sweden)

    Henrick Kim

    2008-07-01

    Full Text Available Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB is the source of this information however present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the distributed Annotation System (DAS protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.

  18. The calmodulin-binding, short linear motif, NSCaTE is conserved in L-type channel ancestors of vertebrate Cav1.2 and Cav1.3 channels.

    Directory of Open Access Journals (Sweden)

    Valentina Taiakina

    Full Text Available NSCaTE is a short linear motif of (xWxxx(I or Lxxxx, composed of residues with a high helix-forming propensity within a mostly disordered N-terminus that is conserved in L-type calcium channels from protostome invertebrates to humans. NSCaTE is an optional, lower affinity and calcium-sensitive binding site for calmodulin (CaM which competes for CaM binding with a more ancient, C-terminal IQ domain on L-type channels. CaM bound to N- and C- terminal tails serve as dual detectors to changing intracellular Ca(2+ concentrations, promoting calcium-dependent inactivation of L-type calcium channels. NSCaTE is absent in some arthropod species, and is also lacking in vertebrate L-type isoforms, Cav1.1 and Cav1.4 channels. The pervasiveness of a methionine just downstream from NSCaTE suggests that L-type channels could generate alternative N-termini lacking NSCaTE through the choice of translational start sites. Long N-terminus with an NSCaTE motif in L-type calcium channel homolog LCav1 from pond snail Lymnaea stagnalis has a faster calcium-dependent inactivation than a shortened N-termini lacking NSCaTE. NSCaTE effects are present in low concentrations of internal buffer (0.5 mM EGTA, but disappears in high buffer conditions (10 mM EGTA. Snail and mammalian NSCaTE have an alpha-helical propensity upon binding Ca(2+-CaM and can saturate both CaM N-terminal and C-terminal domains in the absence of a competing IQ motif. NSCaTE evolved in ancestors of the first animals with internal organs for promoting a more rapid, calcium-sensitive inactivation of L-type channels.

  19. The COG database: an updated version includes eukaryotes

    Directory of Open Access Journals (Sweden)

    Sverdlov Alexander V

    2003-09-01

    Full Text Available Abstract Background The availability of multiple, essentially complete genome sequences of prokaryotes and eukaryotes spurred both the demand and the opportunity for the construction of an evolutionary classification of genes from these genomes. Such a classification system based on orthologous relationships between genes appears to be a natural framework for comparative genomics and should facilitate both functional annotation of genomes and large-scale evolutionary studies. Results We describe here a major update of the previously developed system for delineation of Clusters of Orthologous Groups of proteins (COGs from the sequenced genomes of prokaryotes and unicellular eukaryotes and the construction of clusters of predicted orthologs for 7 eukaryotic genomes, which we named KOGs after eukaryotic orthologous groups. The COG collection currently consists of 138,458 proteins, which form 4873 COGs and comprise 75% of the 185,505 (predicted proteins encoded in 66 genomes of unicellular organisms. The eukaryotic orthologous groups (KOGs include proteins from 7 eukaryotic genomes: three animals (the nematode Caenorhabditis elegans, the fruit fly Drosophila melanogaster and Homo sapiens, one plant, Arabidopsis thaliana, two fungi (Saccharomyces cerevisiae and Schizosaccharomyces pombe, and the intracellular microsporidian parasite Encephalitozoon cuniculi. The current KOG set consists of 4852 clusters of orthologs, which include 59,838 proteins, or ~54% of the analyzed eukaryotic 110,655 gene products. Compared to the coverage of the prokaryotic genomes with COGs, a considerably smaller fraction of eukaryotic genes could be included into the KOGs; addition of new eukaryotic genomes is expected to result in substantial increase in the coverage of eukaryotic genomes with KOGs. Examination of the phyletic patterns of KOGs reveals a conserved core represented in all analyzed species and consisting of ~20% of the KOG set. This conserved portion of the

  20. Large variability of bathypelagic microbial eukaryotic communities across the world's oceans.

    Science.gov (United States)

    Pernice, Massimo C; Giner, Caterina R; Logares, Ramiro; Perera-Bel, Júlia; Acinas, Silvia G; Duarte, Carlos M; Gasol, Josep M; Massana, Ramon

    2016-04-01

    In this work, we study the diversity of bathypelagic microbial eukaryotes (0.8-20 μm) in the global ocean. Seawater samples from 3000 to 4000 m depth from 27 stations in the Atlantic, Pacific and Indian Oceans were analyzed by pyrosequencing the V4 region of the 18S ribosomal DNA. The relative abundance of the most abundant operational taxonomic units agreed with the results of a parallel metagenomic analysis, suggesting limited PCR biases in the tag approach. Although rarefaction curves for single stations were seldom saturated, the global analysis of all sequences together suggested an adequate recovery of bathypelagic diversity. Community composition presented a large variability among samples, which was poorly explained by linear geographic distance. In fact, the similarity between communities was better explained by water mass composition (26% of the variability) and the ratio in cell abundance between prokaryotes and microbial eukaryotes (21%). Deep diversity appeared dominated by four taxonomic groups (Collodaria, Chrysophytes, Basidiomycota and MALV-II) appearing in different proportions in each sample. Novel diversity amounted to 1% of the pyrotags and was lower than expected. Our study represents an essential step in the investigation of bathypelagic microbial eukaryotes, indicating dominating taxonomic groups and suggesting idiosyncratic assemblages in distinct oceanic regions.

  1. An approach to evaluate the topological significance of motifs and other patterns in regulatory networks

    Directory of Open Access Journals (Sweden)

    Wingender Edgar

    2009-05-01

    Full Text Available Abstract Background The identification of network motifs as statistically over-represented topological patterns has become one of the most promising topics in the analysis of complex networks. The main focus is commonly made on how they operate by means of their internal organization. Yet, their contribution to a network's global architecture is poorly understood. However, this requires switching from the abstract view of a topological pattern to the level of its instances. Here, we show how a recently proposed metric, the pairwise disconnectivity index, can be adapted to survey if and which kind of topological patterns and their instances are most important for sustaining the connectivity within a network. Results The pairwise disconnectivity index of a pattern instance quantifies the dependency of the pairwise connections between vertices in a network on the presence of this pattern instance. Thereby, it particularly considers how the coherence between the unique constituents of a pattern instance relates to the rest of a network. We have applied the method exemplarily to the analysis of 3-vertex topological pattern instances in the transcription networks of a bacteria (E. coli, a unicellular eukaryote (S. cerevisiae and higher eukaryotes (human, mouse, rat. We found that in these networks only very few pattern instances break lots of the pairwise connections between vertices upon the removal of an instance. Among them network motifs do not prevail. Rather, those patterns that are shared by the three networks exhibit a conspicuously enhanced pairwise disconnectivity index. Additionally, these are often located in close vicinity to each other or are even overlapping, since only a small number of genes are repeatedly present in most of them. Moreover, evidence has gathered that the importance of these pattern instances is due to synergistic rather than merely additive effects between their constituents. Conclusion A new method has been proposed

  2. Characterization of the free state ensemble of the CoRNR box motif by molecular dynamics simulations

    NARCIS (Netherlands)

    Cino, E.A.; Choy, W.Y.; Karttunen, M.E.J.

    2016-01-01

    Intrinsically disordered proteins (IDPs) and regions are highly prevalent in eukaryotic proteomes, and like folded proteins, they perform essential biological functions. Interaction sites in folded proteins are generally formed by tertiary structures, whereas IDPs use short segments called linear

  3. Statistical tests to compare motif count exceptionalities

    Directory of Open Access Journals (Sweden)

    Vandewalle Vincent

    2007-03-01

    Full Text Available Abstract Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use.

  4. Transfer of DNA from Bacteria to Eukaryotes

    Directory of Open Access Journals (Sweden)

    Benoît Lacroix

    2016-07-01

    Full Text Available Historically, the members of the Agrobacterium genus have been considered the only bacterial species naturally able to transfer and integrate DNA into the genomes of their eukaryotic hosts. Yet, increasing evidence suggests that this ability to genetically transform eukaryotic host cells might be more widespread in the bacterial world. Indeed, analyses of accumulating genomic data reveal cases of horizontal gene transfer from bacteria to eukaryotes and suggest that it represents a significant force in adaptive evolution of eukaryotic species. Specifically, recent reports indicate that bacteria other than Agrobacterium, such as Bartonella henselae (a zoonotic pathogen, Rhizobium etli (a plant-symbiotic bacterium related to Agrobacterium, or even Escherichia coli, have the ability to genetically transform their host cells under laboratory conditions. This DNA transfer relies on type IV secretion systems (T4SSs, the molecular machines that transport macromolecules during conjugative plasmid transfer and also during transport of proteins and/or DNA to the eukaryotic recipient cells. In this review article, we explore the extent of possible transfer of genetic information from bacteria to eukaryotic cells as well as the evolutionary implications and potential applications of this transfer.

  5. Motif decomposition of the phosphotyrosine proteome reveals a new N-terminal binding motif for SHIP2

    DEFF Research Database (Denmark)

    Miller, Martin Lee; Hanke, S.; Hinsby, A. M.

    2008-01-01

    set of 481 unique phosphotyrosine (Tyr(P)) peptides by sequence similarity to known ligands of the Src homology 2 (SH2) and the phosphotyrosine binding (PTB) domains. From 20 clusters we extracted 16 known and four new interaction motifs. Using quantitative mass spectrometry we pulled down Tyr......(P)-specific binding partners for peptides corresponding to the extracted motifs. We confirmed numerous previously known interaction motifs and found 15 new interactions mediated by phosphosites not previously known to bind SH2 or PTB. Remarkably, a novel hydrophobic N-terminal motif ((L/V/I)(L/V/I)pY) was identified...

  6. Design and evaluation of antimalarial peptides derived from prediction of short linear motifs in proteins related to erythrocyte invasion.

    Directory of Open Access Journals (Sweden)

    Alessandra Bianchin

    Full Text Available The purpose of this study was to investigate the blood stage of the malaria causing parasite, Plasmodium falciparum, to predict potential protein interactions between the parasite merozoite and the host erythrocyte and design peptides that could interrupt these predicted interactions. We screened the P. falciparum and human proteomes for computationally predicted short linear motifs (SLiMs in cytoplasmic portions of transmembrane proteins that could play roles in the invasion of the erythrocyte by the merozoite, an essential step in malarial pathogenesis. We tested thirteen peptides predicted to contain SLiMs, twelve of them palmitoylated to enhance membrane targeting, and found three that blocked parasite growth in culture by inhibiting the initiation of new infections in erythrocytes. Scrambled peptides for two of the most promising peptides suggested that their activity may be reflective of amino acid properties, in particular, positive charge. However, one peptide showed effects which were stronger than those of scrambled peptides. This was derived from human red blood cell glycophorin-B. We concluded that proteome-wide computational screening of the intracellular regions of both host and pathogen adhesion proteins provides potential lead peptides for the development of anti-malarial compounds.

  7. Competitive inhibition can linearize dose-response and generate a linear rectifier.

    Science.gov (United States)

    Savir, Yonatan; Tu, Benjamin P; Springer, Michael

    2015-09-23

    Many biological responses require a dynamic range that is larger than standard bi-molecular interactions allow, yet the also ability to remain off at low input. Here we mathematically show that an enzyme reaction system involving a combination of competitive inhibition, conservation of the total level of substrate and inhibitor, and positive feedback can behave like a linear rectifier-that is, a network motif with an input-output relationship that is linearly sensitive to substrate above a threshold but unresponsive below the threshold. We propose that the evolutionarily conserved yeast SAGA histone acetylation complex may possess the proper physiological response characteristics and molecular interactions needed to perform as a linear rectifier, and we suggest potential experiments to test this hypothesis. One implication of this work is that linear responses and linear rectifiers might be easier to evolve or synthetically construct than is currently appreciated.

  8. [Personal motif in art].

    Science.gov (United States)

    Gerevich, József

    2015-01-01

    One of the basic questions of the art psychology is whether a personal motif is to be found behind works of art and if so, how openly or indirectly it appears in the work itself. Analysis of examples and documents from the fine arts and literature allow us to conclude that the personal motif that can be identified by the viewer through symbols, at times easily at others with more difficulty, gives an emotional plus to the artistic product. The personal motif may be found in traumatic experiences, in communication to the model or with other emotionally important persons (mourning, disappointment, revenge, hatred, rivalry, revolt etc.), in self-searching, or self-analysis. The emotions are expressed in artistic activity either directly or indirectly. The intention nourished by the artist's identity (Kunstwollen) may stand in the way of spontaneous self-expression, channelling it into hidden paths. Under the influence of certain circumstances, the artist may arouse in the viewer, consciously or unconsciously, an illusionary, misleading image of himself. An examination of the personal motif is one of the important research areas of art therapy.

  9. Temporal motifs in time-dependent networks

    International Nuclear Information System (INIS)

    Kovanen, Lauri; Karsai, Márton; Kaski, Kimmo; Kertész, János; Saramäki, Jari

    2011-01-01

    Temporal networks are commonly used to represent systems where connections between elements are active only for restricted periods of time, such as telecommunication, neural signal processing, biochemical reaction and human social interaction networks. We introduce the framework of temporal motifs to study the mesoscale topological–temporal structure of temporal networks in which the events of nodes do not overlap in time. Temporal motifs are classes of similar event sequences, where the similarity refers not only to topology but also to the temporal order of the events. We provide a mapping from event sequences to coloured directed graphs that enables an efficient algorithm for identifying temporal motifs. We discuss some aspects of temporal motifs, including causality and null models, and present basic statistics of temporal motifs in a large mobile call network

  10. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. Role of the Box C/D Motif in Localization of Small Nucleolar RNAs to Coiled Bodies and Nucleoli

    Science.gov (United States)

    Narayanan, Aarthi; Speckmann, Wayne; Terns, Rebecca; Terns, Michael P.

    1999-01-01

    Small nucleolar RNAs (snoRNAs) are a large family of eukaryotic RNAs that function within the nucleolus in the biogenesis of ribosomes. One major class of snoRNAs is the box C/D snoRNAs named for their conserved box C and box D sequence elements. We have investigated the involvement of cis-acting sequences and intranuclear structures in the localization of box C/D snoRNAs to the nucleolus by assaying the intranuclear distribution of fluorescently labeled U3, U8, and U14 snoRNAs injected into Xenopus oocyte nuclei. Analysis of an extensive panel of U3 RNA variants showed that the box C/D motif, comprised of box C′, box D, and the 3′ terminal stem of U3, is necessary and sufficient for the nucleolar localization of U3 snoRNA. Disruption of the elements of the box C/D motif of U8 and U14 snoRNAs also prevented nucleolar localization, indicating that all box C/D snoRNAs use a common nucleolar-targeting mechanism. Finally, we found that wild-type box C/D snoRNAs transiently associate with coiled bodies before they localize to nucleoli and that variant RNAs that lack an intact box C/D motif are detained within coiled bodies. These results suggest that coiled bodies play a role in the biogenesis and/or intranuclear transport of box C/D snoRNAs. PMID:10397754

  12. How natural a kind is "eukaryote?".

    Science.gov (United States)

    Doolittle, W Ford

    2014-06-02

    Systematics balances uneasily between realism and nominalism, uncommitted as to whether biological taxa are discoveries or inventions. If the former, they might be taken as natural kinds. I briefly review some philosophers' concepts of natural kinds and then argue that several of these apply well enough to "eukaryote." Although there are some sticky issues around genomic chimerism and when eukaryotes first appeared, if we allow for degrees in the naturalness of kinds, existing eukaryotes rank highly, higher than prokaryotes. Most biologists feel this intuitively: All I attempt to do here is provide some conceptual justification. Copyright © 2014 Cold Spring Harbor Laboratory Press; all rights reserved.

  13. Comparative genomics and evolution of eukaryotic phospholipidbiosynthesis

    Energy Technology Data Exchange (ETDEWEB)

    Lykidis, Athanasios

    2006-12-01

    Phospholipid biosynthetic enzymes produce diverse molecular structures and are often present in multiple forms encoded by different genes. This work utilizes comparative genomics and phylogenetics for exploring the distribution, structure and evolution of phospholipid biosynthetic genes and pathways in 26 eukaryotic genomes. Although the basic structure of the pathways was formed early in eukaryotic evolution, the emerging picture indicates that individual enzyme families followed unique evolutionary courses. For example, choline and ethanolamine kinases and cytidylyltransferases emerged in ancestral eukaryotes, whereas, multiple forms of the corresponding phosphatidyltransferases evolved mainly in a lineage specific manner. Furthermore, several unicellular eukaryotes maintain bacterial-type enzymes and reactions for the synthesis of phosphatidylglycerol and cardiolipin. Also, base-exchange phosphatidylserine synthases are widespread and ancestral enzymes. The multiplicity of phospholipid biosynthetic enzymes has been largely generated by gene expansion in a lineage specific manner. Thus, these observations suggest that phospholipid biosynthesis has been an actively evolving system. Finally, comparative genomic analysis indicates the existence of novel phosphatidyltransferases and provides a candidate for the uncharacterized eukaryotic phosphatidylglycerol phosphate phosphatase.

  14. Beauvericin synthetase contains a calmodulin binding motif in the entomopathogenic fungus Beauveria bassiana.

    Science.gov (United States)

    Kim, Jiyoung; Sung, Gi-Ho

    2018-03-19

    Beauvericin is a mycotoxin which has insecticidal, anti-microbial, anti-viral and anti-cancer activities. Beauvericin biosynthesis is rapidly catalyzed by the beauvericin synthetase (BEAS) in Beauveria bassiana. Ca 2+ plays crucial roles in multiple signaling pathways in eukaryotic cells. These Ca 2+ signals are partially decoded by Ca 2+ sensor calmodulin (CaM). In this report, we describe that B. bassiana BEAS (BbBEAS) can interact with CaM in a Ca 2+ -dependent manner. A synthetic BbBEAS peptide, corresponding to the putative CaM-binding motif, formed a stable complex with CaM in the presence of Ca 2+ . In addition, in vitro CaM-binding assay revealed that the His-tagged BbBEAS (amino acids 2421-2538) binds to CaM in a Ca 2+ -dependent manner. Therefore, this work suggests that BbBEAS is a novel CaM-binding protein in B. bassiana.

  15. UKIRAN KERAWANG ACEH GAYO SEBAGAI INSPIRASI PENCIPTAAN MOTIF BATIK KHAS GAYO

    Directory of Open Access Journals (Sweden)

    Irfa ina Rohana Salma

    2016-12-01

    Full Text Available ABSTRAK Industri batik mulai berkembang di Gayo, tetapi belum memiliki motif batik khas daerah. Oleh karena itu perlu diciptakan motif batik khas Gayo, dengan mengambil inspirasi dari ukiran yang terdapat pada rumah tradisional yang biasa disebut ukiran kerawang Gayo. Tujuan penciptaan seni ini adalah untuk menciptakan motif batik yang memiliki ciri khas Gayo. Metode yang digunakan yaitu eksplorasi ide, perancangan, dan perwujudan menjadi motif batik. Dalam kegiatan ini telah diciptakan enam motif batik khas Gayo yaitu: (1 Motif Ceplok Gayo; (2 Motif Gayo Tegak; (3 Motif Gayo Lurus; (4 Motif Parang Gayo; (5 Motif Gayo Lembut; dan (6 Motif Geometris Gayo. Hasil uji kesukaan terhadap motif kepada lima puluh responden menunjukkan bahwa Motif Ceplok Gayo paling banyak dipilih oleh responden yaitu sebesar 19%, sedangkan Motif Parang Gayo 18%, Motif Gayo Lembut 17%, Motif Geometris Gayo 17%, Motif Gayo Lurus 15% dan Motif Gayo Tegak 14%. Rata-rata motif yang dihasilkan mendapatkan apresiasi yang baik dari responden, sehingga semua motif layak diproduksi sebagai batik khas Gayo.Kata kunci: batik Gayo, Motif Ceplok Gayo, Motif Parang Gayo.ABSTRACTBatik industry began to develop in Gayo, but have not had a typical batik motif itself. Therefore, it is necessary to create batik motifs of Gayo, by taking inspiration from the carvings found in traditional houses commonly called kerawang Gayo. The purpose of this art is to create motifs those have a Gayo characteristic. The method used are the idea exploration, design, and motifs embodiment. In this activity has created six Gayo batik motifs, namely: (1 Motif Ceplok Gayo; (2 Motif Gayo Tegak; (3 Motif GayoLurus; (4 Motif Parang Gayo; (5 Motif Gayo Lembut; dan (6 Motif Geometris Gayo. The test results fondness of the motives to fifty respondents indicated that the Motif Ceplok Gayo most preferred by respondents ie 19%, while Motif Parang Gayo 18%, Motif Gayo Lembut 17%, Motif Geometris Gayo 17%, Motif Gayo

  16. Patterns of prokaryotic lateral gene transfers affecting parasitic microbial eukaryotes

    DEFF Research Database (Denmark)

    Alsmark, Cecilia; Foster, Peter G; Sicheritz-Pontén, Thomas

    2013-01-01

    BACKGROUND: The influence of lateral gene transfer on gene origins and biology in eukaryotes is poorly understood compared with those of prokaryotes. A number of independent investigations focusing on specific genes, individual genomes, or specific functional categories from various eukaryotes have...... approach to systematically investigate lateral gene transfer affecting the proteomes of thirteen, mainly parasitic, microbial eukaryotes, representing four of the six eukaryotic super-groups. All of the genomes investigated have been significantly affected by prokaryote-to-eukaryote lateral gene transfers...... indicated that lateral gene transfer does indeed affect eukaryotic genomes. However, the lack of common methodology and criteria in these studies makes it difficult to assess the general importance and influence of lateral gene transfer on eukaryotic genome evolution. RESULTS: We used a phylogenomic...

  17. Morphological and ecological complexity in early eukaryotic ecosystems.

    Science.gov (United States)

    Javaux, E J; Knoll, A H; Walter, M R

    2001-07-05

    Molecular phylogeny and biogeochemistry indicate that eukaryotes differentiated early in Earth history. Sequence comparisons of small-subunit ribosomal RNA genes suggest a deep evolutionary divergence of Eukarya and Archaea; C27-C29 steranes (derived from sterols synthesized by eukaryotes) and strong depletion of 13C (a biogeochemical signature of methanogenic Archaea) in 2,700 Myr old kerogens independently place a minimum age on this split. Steranes, large spheroidal microfossils, and rare macrofossils of possible eukaryotic origin occur in Palaeoproterozoic rocks. Until now, however, evidence for morphological and taxonomic diversification within the domain has generally been restricted to very late Mesoproterozoic and Neoproterozoic successions. Here we show that the cytoskeletal and ecological prerequisites for eukaryotic diversification were already established in eukaryotic microorganisms fossilized nearly 1,500 Myr ago in shales of the early Mesoproterozoic Roper Group in northern Australia.

  18. Large-scale discovery of promoter motifs in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Thomas A Down

    2007-01-01

    Full Text Available A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.

  19. Graph theoretic analysis of protein interaction networks of eukaryotes

    Science.gov (United States)

    Goh, K.-I.; Kahng, B.; Kim, D.

    2005-11-01

    Owing to the recent progress in high-throughput experimental techniques, the datasets of large-scale protein interactions of prototypical multicellular species, the nematode worm Caenorhabditis elegans and the fruit fly Drosophila melanogaster, have been assayed. The datasets are obtained mainly by using the yeast hybrid method, which contains false-positive and false-negative simultaneously. Accordingly, while it is desirable to test such datasets through further wet experiments, here we invoke recent developed network theory to test such high-throughput datasets in a simple way. Based on the fact that the key biological processes indispensable to maintaining life are conserved across eukaryotic species, and the comparison of structural properties of the protein interaction networks (PINs) of the two species with those of the yeast PIN, we find that while the worm and yeast PIN datasets exhibit similar structural properties, the current fly dataset, though most comprehensively screened ever, does not reflect generic structural properties correctly as it is. The modularity is suppressed and the connectivity correlation is lacking. Addition of interologs to the current fly dataset increases the modularity and enhances the occurrence of triangular motifs as well. The connectivity correlation function of the fly, however, remains distinct under such interolog additions, for which we present a possible scenario through an in silico modeling.

  20. MHC motif viewer

    DEFF Research Database (Denmark)

    Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole

    2008-01-01

    . Algorithms that predict which peptides MHC molecules bind have recently been developed and cover many different alleles, but the utility of these algorithms is hampered by the lack of tools for browsing and comparing the specificity of these molecules. We have, therefore, developed a web server, MHC motif....... A special viewing feature, MHC fight, allows for display of the specificity of two different MHC molecules side by side. We show how the web server can be used to discover and display surprising similarities as well as differences between MHC molecules within and between different species. The MHC motif...

  1. Large variability of bathypelagic microbial eukaryotic communities across the world’s oceans

    KAUST Repository

    Pernice, Massimo C.

    2015-10-09

    In this work, we study the diversity of bathypelagic microbial eukaryotes (0.8–20 μm) in the global ocean. Seawater samples from 3000 to 4000 m depth from 27 stations in the Atlantic, Pacific and Indian Oceans were analyzed by pyrosequencing the V4 region of the 18S ribosomal DNA. The relative abundance of the most abundant operational taxonomic units agreed with the results of a parallel metagenomic analysis, suggesting limited PCR biases in the tag approach. Although rarefaction curves for single stations were seldom saturated, the global analysis of all sequences together suggested an adequate recovery of bathypelagic diversity. Community composition presented a large variability among samples, which was poorly explained by linear geographic distance. In fact, the similarity between communities was better explained by water mass composition (26% of the variability) and the ratio in cell abundance between prokaryotes and microbial eukaryotes (21%). Deep diversity appeared dominated by four taxonomic groups (Collodaria, Chrysophytes, Basidiomycota and MALV-II) appearing in different proportions in each sample. Novel diversity amounted to 1% of the pyrotags and was lower than expected. Our study represents an essential step in the investigation of bathypelagic microbial eukaryotes, indicating dominating taxonomic groups and suggesting idiosyncratic assemblages in distinct oceanic regions.

    The ISME Journal advance online publication, 9 October 2015; doi:10.1038/ismej.2015.170

  2. Atypical mitochondrial inheritance patterns in eukaryotes.

    Science.gov (United States)

    Breton, Sophie; Stewart, Donald T

    2015-10-01

    Mitochondrial DNA (mtDNA) is predominantly maternally inherited in eukaryotes. Diverse molecular mechanisms underlying the phenomenon of strict maternal inheritance (SMI) of mtDNA have been described, but the evolutionary forces responsible for its predominance in eukaryotes remain to be elucidated. Exceptions to SMI have been reported in diverse eukaryotic taxa, leading to the prediction that several distinct molecular mechanisms controlling mtDNA transmission are present among the eukaryotes. We propose that these mechanisms will be better understood by studying the deviations from the predominating pattern of SMI. This minireview summarizes studies on eukaryote species with unusual or rare mitochondrial inheritance patterns, i.e., other than the predominant SMI pattern, such as maternal inheritance of stable heteroplasmy, paternal leakage of mtDNA, biparental and strictly paternal inheritance, and doubly uniparental inheritance of mtDNA. The potential genes and mechanisms involved in controlling mitochondrial inheritance in these organisms are discussed. The linkage between mitochondrial inheritance and sex determination is also discussed, given that the atypical systems of mtDNA inheritance examined in this minireview are frequently found in organisms with uncommon sexual systems such as gynodioecy, monoecy, or andromonoecy. The potential of deviations from SMI for facilitating a better understanding of a number of fundamental questions in biology, such as the evolution of mtDNA inheritance, the coevolution of nuclear and mitochondrial genomes, and, perhaps, the role of mitochondria in sex determination, is considerable.

  3. Motif discovery in ranked lists of sequences

    DEFF Research Database (Denmark)

    Nielsen, Morten Muhlig; Tataru, Paula; Madsen, Tobias

    2016-01-01

    Motif analysis has long been an important method to characterize biological functionality and the current growth of sequencing-based genomics experiments further extends its potential. These diverse experiments often generate sequence lists ranked by some functional property. There is therefore...... advantage of the regular expression feature, including enrichments for combinations of different microRNA seed sites. The method is implemented and made publicly available as an R package and supports high parallelization on multi-core machinery....... a growing need for motif analysis methods that can exploit this coupled data structure and be tailored for specific biological questions. Here, we present an exploratory motif analysis tool, Regmex (REGular expression Motif EXplorer), which offers several methods to evaluate the correlation of motifs...

  4. Eukaryotes first: how could that be?

    Science.gov (United States)

    Mariscal, Carlos; Doolittle, W Ford

    2015-09-26

    In the half century since the formulation of the prokaryote : eukaryote dichotomy, many authors have proposed that the former evolved from something resembling the latter, in defiance of common (and possibly common sense) views. In such 'eukaryotes first' (EF) scenarios, the last universal common ancestor is imagined to have possessed significantly many of the complex characteristics of contemporary eukaryotes, as relics of an earlier 'progenotic' period or RNA world. Bacteria and Archaea thus must have lost these complex features secondarily, through 'streamlining'. If the canonical three-domain tree in which Archaea and Eukarya are sisters is accepted, EF entails that Bacteria and Archaea are convergently prokaryotic. We ask what this means and how it might be tested. © 2015 The Author(s).

  5. Fitness for synchronization of network motifs

    DEFF Research Database (Denmark)

    Vega, Y.M.; Vázquez-Prada, M.; Pacheco, A.F.

    2004-01-01

    We study the synchronization of Kuramoto's oscillators in small parts of networks known as motifs. We first report on the system dynamics for the case of a scale-free network and show the existence of a non-trivial critical point. We compute the probability that network motifs synchronize, and fi...... that the fitness for synchronization correlates well with motifs interconnectedness and structural complexity. Possible implications for present debates about network evolution in biological and other systems are discussed....

  6. The Genome of Naegleria gruberi Illuminates Early Eukaryotic Versatility

    Energy Technology Data Exchange (ETDEWEB)

    Fritz-Laylin, Lillian K.; Prochnik, Simon E.; Ginger, Michael L.; Dacks, Joel; Carpenter, Meredith L.; Field, Mark C.; Kuo, Alan; Paredez, Alex; Chapman, Jarrod; Pham, Jonathan; Shu, Shengqiang; Neupane, Rochak; Cipriano, Michael; Mancuso, Joel; Tu, Hank; Salamov, Asaf; Lindquist, Erika; Shapiro, Harris; Lucas, Susan; Grigoriev, Igor V.; Cande, W. Zacheus; Fulton, Chandler; Rokhsar, Daniel S.; Dawson, Scott C.

    2010-03-01

    Genome sequences of diverse free-living protists are essential for understanding eukaryotic evolution and molecular and cell biology. The free-living amoeboflagellate Naegleria gruberi belongs to a varied and ubiquitous protist clade (Heterolobosea) that diverged from other eukaryotic lineages over a billion years ago. Analysis of the 15,727 protein-coding genes encoded by Naegleria's 41 Mb nuclear genome indicates a capacity for both aerobic respiration and anaerobic metabolism with concomitant hydrogen production, with fundamental implications for the evolution of organelle metabolism. The Naegleria genome facilitates substantially broader phylogenomic comparisons of free-living eukaryotes than previously possible, allowing us to identify thousands of genes likely present in the pan-eukaryotic ancestor, with 40% likely eukaryotic inventions. Moreover, we construct a comprehensive catalog of amoeboid-motility genes. The Naegleria genome, analyzed in the context of other protists, reveals a remarkably complex ancestral eukaryote with a rich repertoire of cytoskeletal, sexual, signaling, and metabolic modules.

  7. The origin of the eukaryotic cell

    Science.gov (United States)

    Hartman, H.

    1984-01-01

    The endosymbiotic hypothesis for the origin of the eukaryotic cell has been applied to the origin of the mitochondria and chloroplasts. However as has been pointed out by Mereschowsky in 1905, it should also be applied to the nucleus as well. If the nucleus, mitochondria and chloroplasts are endosymbionts, then it is likely that the organism that did the engulfing was not a DNA-based organism. In fact, it is useful to postulate that this organism was a primitive RNA-based organism. This hypothesis would explain the preponderance of RNA viruses found in eukaryotic cells. The centriole and basal body do not have a double membrane or DNA. Like all MTOCs (microtubule organising centres), they have a structural or morphic RNA implicated in their formation. This would argue for their origin in the early RNA-based organism rather than in an endosymbiotic event involving bacteria. Finally, the eukaryotic cell uses RNA in ways quite unlike bacteria, thus pointing to a greater emphasis of RNA in both control and structure in the cell. The origin of the eukaryotic cell may tell us why it rather than its prokaryotic relative evolved into the metazoans who are reading this paper.

  8. Single Cell Genomics and Transcriptomics for Unicellular Eukaryotes

    Energy Technology Data Exchange (ETDEWEB)

    Ciobanu, Doina; Clum, Alicia; Singh, Vasanth; Salamov, Asaf; Han, James; Copeland, Alex; Grigoriev, Igor; James, Timothy; Singer, Steven; Woyke, Tanja; Malmstrom, Rex; Cheng, Jan-Fang

    2014-03-14

    Despite their small size, unicellular eukaryotes have complex genomes with a high degree of plasticity that allow them to adapt quickly to environmental changes. Unicellular eukaryotes live with prokaryotes and higher eukaryotes, frequently in symbiotic or parasitic niches. To this day their contribution to the dynamics of the environmental communities remains to be understood. Unfortunately, the vast majority of eukaryotic microorganisms are either uncultured or unculturable, making genome sequencing impossible using traditional approaches. We have developed an approach to isolate unicellular eukaryotes of interest from environmental samples, and to sequence and analyze their genomes and transcriptomes. We have tested our methods with six species: an uncharacterized protist from cellulose-enriched compost identified as Platyophrya, a close relative of P. vorax; the fungus Metschnikowia bicuspidate, a parasite of water flea Daphnia; the mycoparasitic fungi Piptocephalis cylindrospora, a parasite of Cokeromyces and Mucor; Caulochytrium protosteloides, a parasite of Sordaria; Rozella allomycis, a parasite of the water mold Allomyces; and the microalgae Chlamydomonas reinhardtii. Here, we present the four components of our approach: pre-sequencing methods, sequence analysis for single cell genome assembly, sequence analysis of single cell transcriptomes, and genome annotation. This technology has the potential to uncover the complexity of single cell eukaryotes and their role in the environmental samples.

  9. Aplikasi Ornamen Khas Maluku untuk Pengembangan Desain Motif Batik

    Directory of Open Access Journals (Sweden)

    Masiswo Masiswo

    2016-04-01

    Full Text Available ABSTRAKMaluku memiliki banyak ragam hias budaya warisan nilai leluhur berupa ornamen etnis yang merupakan kesenian dan keterampilan kerajinan. Hasil warisan tersebut sampai saat ini masih lestari hidup serta dapat dinikmati sebagai konsumsi rohani yang memuaskan manusia. Berkaitan dengan keberlangsungan nilai-nilai tradisi etnis yang berwujud pada ornamen-ornamen daerah Maluku, maka dikembangkan untuk kebutuhan manusia berupa motif batik pada kain. Pengembangan ornamen ini lebih menekankan pada representasi akan bentuk-bentuk ornamen yang diterapkan pada kerajinan batik berupa motif khas Maluku. Pengembangan alternatif desain motif batik dibuat tiga variasi yang bersumber dari ornamen khas Maluku dibuat prototipe produknya dan diuji ketahanan luntur warnanya. Hasil uji ketahanan luntur warna terhadap gosokan basah dari tiga prototipe produk berpredikat baik sekali terdapat pada “Motif Siwa” dan predikat baik pada motif “Siwa Talang” dan motif “Matahari Siwa Talang”.Kata kunci: desain, Maluku, motif batik, ornamenABSTRACTMaluku has much decorative ancestral cultural heritage value in the form of ornament ethnic arts and crafts skills. The result of the legacy is still sustainable living can be enjoyed as well as satisfying spiritual human consumption.Related to the sustainability of traditional values in the form of ethnic ornaments Maluku, it was developed for human needs in the form of batik cloth . The development of these ornaments will be more emphasis on the representation forms of ornamentation that is applied to a batik motif Maluku. Development of alternative design motif made three variations. The development of three alternative design motifs derived from the Maluku ornaments made and tested a prototype product color fastness. The test results of color fastness to wet rubbing of the three prototypes are excellent products predicated on the "Motif Siwa" and a good rating on the motif "Siwa Talang" and motif "Matahari Siwa

  10. Translational Control of Host Gene Expression by a Cys-Motif Protein Encoded in a Bracovirus.

    Directory of Open Access Journals (Sweden)

    Eunseong Kim

    Full Text Available Translational control is a strategy that various viruses use to manipulate their hosts to suppress acute antiviral response. Polydnaviruses, a group of insect double-stranded DNA viruses symbiotic to some endoparasitoid wasps, are divided into two genera: ichnovirus (IV and bracovirus (BV. In IV, some Cys-motif genes are known as host translation-inhibitory factors (HTIF. The genome of endoparasitoid wasp Cotesia plutellae contains a Cys-motif gene (Cp-TSP13 homologous to an HTIF known as teratocyte-secretory protein 14 (TSP14 of Microplitis croceipes. Cp-TSP13 consists of 129 amino acid residues with a predicted molecular weight of 13.987 kDa and pI value of 7.928. Genomic DNA region encoding its open reading frame has three introns. Cp-TSP13 possesses six conserved cysteine residues as other Cys-motif genes functioning as HTIF. Cp-TSP13 was expressed in Plutella xylostella larvae parasitized by C. plutellae. C. plutellae bracovirus (CpBV was purified and injected into non-parasitized P. xylostella that expressed Cp-TSP13. Cp-TSP13 was cloned into a eukaryotic expression vector and used to infect Sf9 cells to transiently express Cp-TSP13. The synthesized Cp-TSP13 protein was detected in culture broth. An overlaying experiment showed that the purified Cp-TSP13 entered hemocytes. It was localized in the cytosol. Recombinant Cp-TSP13 significantly inhibited protein synthesis of secretory proteins when it was added to in vitro cultured fat body. In addition, the recombinant Cp-TSP13 directly inhibited the translation of fat body mRNAs in in vitro translation assay using rabbit reticulocyte lysate. Moreover, the recombinant Cp-TSP13 significantly suppressed cellular immune responses by inhibiting hemocyte-spreading behavior. It also exhibited significant insecticidal activities by both injection and feeding routes. These results indicate that Cp-TSP13 is a viral HTIF.

  11. The limits of de novo DNA motif discovery.

    Directory of Open Access Journals (Sweden)

    David Simcha

    Full Text Available A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify "motifs" that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery-searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA "background" sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are "too null," resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where "ground truth" is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced "over-fitting" in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary. An implementation of

  12. Parole, Sintagmatik, dan Paradigmatik Motif Batik Mega Mendung

    Directory of Open Access Journals (Sweden)

    Rudi - Nababan

    2012-04-01

    Full Text Available ABSTRACT   Discussing traditional batik is related a lot to the organization system of fine arts element ac- companying it, either the pattern of the motif or the technique of the making. In this case, the motif of Mega Mendung Cirebon certainly has patterns and rules which are traditionally different from the other motifs in other areas. Through  semiotics analysis especially with Saussure and Pierce concept, it can be traced that batik with Cirebon motif, in this case Mega Mendung motif, has parole and langue system, as unique fine arts language in batik, and structure of visual syntagmatic and paradigmatic. In the context of batik motif as fine arts language, it is surely related to sign system as symbol and icon.       Keywords: visual semiotic, Cirebon’s batik.

  13. Potential of industrial biotechnology with cyanobacteria and eukaryotic microalgae.

    Science.gov (United States)

    Wijffels, René H; Kruse, Olaf; Hellingwerf, Klaas J

    2013-06-01

    Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments. Cyanobacteria are promising host organisms for the production of small molecules that can be secreted such as ethanol, butanol, fatty acids and other organic acids. Eukaryotic microalgae are interesting for products for which cellular storage is important such as proteins, lipids, starch and alkanes. For the development of new and promising lines of production, strains of both cyanobacteria and eukaryotic microalgae have to be improved. Transformation systems have been much better developed in cyanobacteria. However, several products would be preferably produced with eukaryotic microalgae. In the case of cyanobacteria a synthetic-systems biology approach has a great potential to exploit cyanobacteria as cell factories. For eukaryotic microalgae transformation systems need to be further developed. A promising strategy is transformation of heterologous (prokaryotic and eukaryotic) genes in established eukaryotic hosts such as Chlamydomonas reinhardtii. Experimental outdoor pilots under containment for the production of genetically modified cyanobacteria and microalgae are in progress. For full scale production risks of release of genetically modified organisms need to be assessed. Copyright © 2013. Published by Elsevier Ltd.

  14. OSR1 regulates a subset of inward rectifier potassium channels via a binding motif variant.

    Science.gov (United States)

    Taylor, Clinton A; An, Sung-Wan; Kankanamalage, Sachith Gallolu; Stippec, Steve; Earnest, Svetlana; Trivedi, Ashesh T; Yang, Jonathan Zijiang; Mirzaei, Hamid; Huang, Chou-Long; Cobb, Melanie H

    2018-04-10

    The with-no-lysine (K) (WNK) signaling pathway to STE20/SPS1-related proline- and alanine-rich kinase (SPAK) and oxidative stress-responsive 1 (OSR1) kinase is an important mediator of cell volume and ion transport. SPAK and OSR1 associate with upstream kinases WNK 1-4, substrates, and other proteins through their C-terminal domains which interact with linear R-F-x-V/I sequence motifs. In this study we find that SPAK and OSR1 also interact with similar affinity with a motif variant, R-x-F-x-V/I. Eight of 16 human inward rectifier K + channels have an R-x-F-x-V motif. We demonstrate that two of these channels, Kir2.1 and Kir2.3, are activated by OSR1, while Kir4.1, which does not contain the motif, is not sensitive to changes in OSR1 or WNK activity. Mutation of the motif prevents activation of Kir2.3 by OSR1. Both siRNA knockdown of OSR1 and chemical inhibition of WNK activity disrupt NaCl-induced plasma membrane localization of Kir2.3. Our results suggest a mechanism by which WNK-OSR1 enhance Kir2.1 and Kir2.3 channel activity by increasing their plasma membrane localization. Regulation of members of the inward rectifier K + channel family adds functional and mechanistic insight into the physiological impact of the WNK pathway.

  15. Origins and evolution of viruses of eukaryotes: The ultimate modularity

    International Nuclear Information System (INIS)

    Koonin, Eugene V.; Dolja, Valerian V.; Krupovic, Mart

    2015-01-01

    Viruses and other selfish genetic elements are dominant entities in the biosphere, with respect to both physical abundance and genetic diversity. Various selfish elements parasitize on all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In prokaryotes, the great majority of viruses possess double-stranded (ds) DNA genomes, with a substantial minority of single-stranded (ss) DNA viruses and only limited presence of RNA viruses. In contrast, in eukaryotes, RNA viruses account for the majority of the virome diversity although ssDNA and dsDNA viruses are common as well. Phylogenomic analysis yields tangible clues for the origins of major classes of eukaryotic viruses and in particular their likely roots in prokaryotes. Specifically, the ancestral genome of positive-strand RNA viruses of eukaryotes might have been assembled de novo from genes derived from prokaryotic retroelements and bacteria although a primordial origin of this class of viruses cannot be ruled out. Different groups of double-stranded RNA viruses derive either from dsRNA bacteriophages or from positive-strand RNA viruses. The eukaryotic ssDNA viruses apparently evolved via a fusion of genes from prokaryotic rolling circle-replicating plasmids and positive-strand RNA viruses. Different families of eukaryotic dsDNA viruses appear to have originated from specific groups of bacteriophages on at least two independent occasions. Polintons, the largest known eukaryotic transposons, predicted to also form virus particles, most likely, were the evolutionary intermediates between bacterial tectiviruses and several groups of eukaryotic dsDNA viruses including the proposed order “Megavirales” that unites diverse families of large and giant viruses. Strikingly, evolution of all classes of eukaryotic viruses appears to have involved fusion between structural and replicative gene modules derived from different sources

  16. Origins and evolution of viruses of eukaryotes: The ultimate modularity

    Energy Technology Data Exchange (ETDEWEB)

    Koonin, Eugene V., E-mail: koonin@ncbi.nlm.nih.gov [National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, Bethesda, MD 20894 (United States); Dolja, Valerian V., E-mail: doljav@science.oregonstate.edu [Department of Botany and Plant Pathology, Oregon State University, Corvallis, OR 97331 (United States); Krupovic, Mart, E-mail: krupovic@pasteur.fr [Institut Pasteur, Unité Biologie Moléculaire du Gène chez les Extrêmophiles, Department of Microbiology, Paris 75015 (France)

    2015-05-15

    Viruses and other selfish genetic elements are dominant entities in the biosphere, with respect to both physical abundance and genetic diversity. Various selfish elements parasitize on all cellular life forms. The relative abundances of different classes of viruses are dramatically different between prokaryotes and eukaryotes. In prokaryotes, the great majority of viruses possess double-stranded (ds) DNA genomes, with a substantial minority of single-stranded (ss) DNA viruses and only limited presence of RNA viruses. In contrast, in eukaryotes, RNA viruses account for the majority of the virome diversity although ssDNA and dsDNA viruses are common as well. Phylogenomic analysis yields tangible clues for the origins of major classes of eukaryotic viruses and in particular their likely roots in prokaryotes. Specifically, the ancestral genome of positive-strand RNA viruses of eukaryotes might have been assembled de novo from genes derived from prokaryotic retroelements and bacteria although a primordial origin of this class of viruses cannot be ruled out. Different groups of double-stranded RNA viruses derive either from dsRNA bacteriophages or from positive-strand RNA viruses. The eukaryotic ssDNA viruses apparently evolved via a fusion of genes from prokaryotic rolling circle-replicating plasmids and positive-strand RNA viruses. Different families of eukaryotic dsDNA viruses appear to have originated from specific groups of bacteriophages on at least two independent occasions. Polintons, the largest known eukaryotic transposons, predicted to also form virus particles, most likely, were the evolutionary intermediates between bacterial tectiviruses and several groups of eukaryotic dsDNA viruses including the proposed order “Megavirales” that unites diverse families of large and giant viruses. Strikingly, evolution of all classes of eukaryotic viruses appears to have involved fusion between structural and replicative gene modules derived from different sources

  17. Exhaustive search of linear information encoding protein-peptide recognition.

    Science.gov (United States)

    Kelil, Abdellali; Dubreuil, Benjamin; Levy, Emmanuel D; Michnick, Stephen W

    2017-04-01

    High-throughput in vitro methods have been extensively applied to identify linear information that encodes peptide recognition. However, these methods are limited in number of peptides, sequence variation, and length of peptides that can be explored, and often produce solutions that are not found in the cell. Despite the large number of methods developed to attempt addressing these issues, the exhaustive search of linear information encoding protein-peptide recognition has been so far physically unfeasible. Here, we describe a strategy, called DALEL, for the exhaustive search of linear sequence information encoded in proteins that bind to a common partner. We applied DALEL to explore binding specificity of SH3 domains in the budding yeast Saccharomyces cerevisiae. Using only the polypeptide sequences of SH3 domain binding proteins, we succeeded in identifying the majority of known SH3 binding sites previously discovered either in vitro or in vivo. Moreover, we discovered a number of sites with both non-canonical sequences and distinct properties that may serve ancillary roles in peptide recognition. We compared DALEL to a variety of state-of-the-art algorithms in the blind identification of known binding sites of the human Grb2 SH3 domain. We also benchmarked DALEL on curated biological motifs derived from the ELM database to evaluate the effect of increasing/decreasing the enrichment of the motifs. Our strategy can be applied in conjunction with experimental data of proteins interacting with a common partner to identify binding sites among them. Yet, our strategy can also be applied to any group of proteins of interest to identify enriched linear motifs or to exhaustively explore the space of linear information encoded in a polypeptide sequence. Finally, we have developed a webserver located at http://michnick.bcm.umontreal.ca/dalel, offering user-friendly interface and providing different scenarios utilizing DALEL.

  18. Phylogenetic analysis of the core histone doublet and DNA topo II genes of Marseilleviridae: evidence of proto-eukaryotic provenance.

    Science.gov (United States)

    Erives, Albert J

    2017-11-28

    While the genomes of eukaryotes and Archaea both encode the histone-fold domain, only eukaryotes encode the core histone paralogs H2A, H2B, H3, and H4. With DNA, these core histones assemble into the nucleosomal octamer underlying eukaryotic chromatin. Importantly, core histones for H2A and H3 are maintained as neofunctionalized paralogs adapted for general bulk chromatin (canonical H2 and H3) or specialized chromatin (H2A.Z enriched at gene promoters and cenH3s enriched at centromeres). In this context, the identification of core histone-like "doublets" in the cytoplasmic replication factories of the Marseilleviridae (MV) is a novel finding with possible relevance to understanding the origin of eukaryotic chromatin. Here, we analyze and compare the core histone doublet genes from all known MV genomes as well as other MV genes relevant to the origin of the eukaryotic replisome. Using different phylogenetic approaches, we show that MV histone domains encode obligate H2B-H2A and H4-H3 dimers of possible proto-eukaryotic origin. MV core histone moieties form sister clades to each of the four eukaryotic clades of canonical and variant core histones. This suggests that MV core histone moieties diverged prior to eukaryotic neofunctionalizations associated with paired linear chromosomes and variant histone octamer assembly. We also show that MV genomes encode a proto-eukaryotic DNA topoisomerase II enzyme that forms a sister clade to eukaryotes. This is a relevant finding given that DNA topo II influences histone deposition and chromatin compaction and is the second most abundant nuclear protein after histones. The combined domain architecture and phylogenomic analyses presented here suggest that a primitive origin for MV histone genes is a more parsimonious explanation than horizontal gene transfers + gene fusions + sufficient divergence to eliminate relatedness to eukaryotic neofunctionalizations within the H2A and H3 clades without loss of relatedness to each of

  19. SLiMScape 3.x: a Cytoscape 3 app for discovery of Short Linear Motifs in protein interaction networks [version 1; referees: 2 approved

    Directory of Open Access Journals (Sweden)

    Emily Olorin

    2015-08-01

    Full Text Available Short linear motifs (SLiMs are small protein sequence patterns that mediate a large number of critical protein-protein interactions, involved in processes such as complex formation, signal transduction, localisation and stabilisation. SLiMs show rapid evolutionary dynamics and are frequently the targets of molecular mimicry by pathogens. Identifying enriched sequence patterns due to convergent evolution in non-homologous proteins has proven to be a successful strategy for computational SLiM prediction. Tools of the SLiMSuite package use this strategy, using a statistical model to identify SLiM enrichment based on the evolutionary relationships, amino acid composition and predicted disorder of the input proteins. The quality of input data is critical for successful SLiM prediction. Cytoscape provides a user-friendly, interactive environment to explore interaction networks and select proteins based on common features, such as shared interaction partners. SLiMScape embeds tools of the SLiMSuite package for de novo SLiM discovery (SLiMFinder and QSLiMFinder and identifying occurrences/enrichment of known SLiMs (SLiMProb within this interactive framework. SLiMScape makes it easier to (1 generate high quality hypothesis-driven datasets for these tools, and (2 visualise predicted SLiM occurrences within the context of the network. To generate new predictions, users can select nodes from a protein network or provide a set of Uniprot identifiers. SLiMProb also requires additional query motif input. Jobs are then run remotely on the SLiMSuite server (http://rest.slimsuite.unsw.edu.au for subsequent retrieval and visualisation. SLiMScape can also be used to retrieve and visualise results from jobs run directly on the server. SLiMScape and SLiMSuite are open source and freely available via GitHub under GNU licenses.

  20. Bayesian centroid estimation for motif discovery.

    Science.gov (United States)

    Carvalho, Luis

    2013-01-01

    Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  1. Bayesian centroid estimation for motif discovery.

    Directory of Open Access Journals (Sweden)

    Luis Carvalho

    Full Text Available Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  2. Compositional patterns in the genomes of unicellular eukaryotes.

    Science.gov (United States)

    Costantini, Maria; Alvarez-Valin, Fernando; Costantini, Susan; Cammarano, Rosalia; Bernardi, Giorgio

    2013-11-05

    The genomes of multicellular eukaryotes are compartmentalized in mosaics of isochores, large and fairly homogeneous stretches of DNA that belong to a small number of families characterized by different average GC levels, by different gene concentration (that increase with GC), different chromatin structures, different replication timing in the cell cycle, and other different properties. A question raised by these basic results concerns how far back in evolution the compartmentalized organization of the eukaryotic genomes arose. In the present work we approached this problem by studying the compositional organization of the genomes from the unicellular eukaryotes for which full sequences are available, the sample used being representative. The average GC levels of the genomes from unicellular eukaryotes cover an extremely wide range (19%-60% GC) and the compositional patterns of individual genomes are extremely different but all genomes tested show a compositional compartmentalization. The average GC range of the genomes of unicellular eukaryotes is very broad (as broad as that of prokaryotes) and individual compositional patterns cover a very broad range from very narrow to very complex. Both features are not surprising for organisms that are very far from each other both in terms of phylogenetic distances and of environmental life conditions. Most importantly, all genomes tested, a representative sample of all supergroups of unicellular eukaryotes, are compositionally compartmentalized, a major difference with prokaryotes.

  3. The RNA recognition motif of eukaryotic translation initiation factor 3g (eIF3g) is required for resumption of scanning of posttermination ribosomes for reinitiation on GCN4 and together with eIF3i stimulates linear scanning.

    Science.gov (United States)

    Cuchalová, Lucie; Kouba, Tomás; Herrmannová, Anna; Dányi, István; Chiu, Wen-Ling; Valásek, Leos

    2010-10-01

    Recent reports have begun unraveling the details of various roles of individual eukaryotic translation initiation factor 3 (eIF3) subunits in translation initiation. Here we describe functional characterization of two essential Saccharomyces cerevisiae eIF3 subunits, g/Tif35 and i/Tif34, previously suggested to be dispensable for formation of the 48S preinitiation complexes (PICs) in vitro. A triple-Ala substitution of conserved residues in the RRM of g/Tif35 (g/tif35-KLF) or a single-point mutation in the WD40 repeat 6 of i/Tif34 (i/tif34-Q258R) produces severe growth defects and decreases the rate of translation initiation in vivo without affecting the integrity of eIF3 and formation of the 43S PICs in vivo. Both mutations also diminish induction of GCN4 expression, which occurs upon starvation via reinitiation. Whereas g/tif35-KLF impedes resumption of scanning for downstream reinitiation by 40S ribosomes terminating at upstream open reading frame 1 (uORF1) in the GCN4 mRNA leader, i/tif34-Q258R prevents full GCN4 derepression by impairing the rate of scanning of posttermination 40S ribosomes moving downstream from uORF1. In addition, g/tif35-KLF reduces processivity of scanning through stable secondary structures, and g/Tif35 specifically interacts with Rps3 and Rps20 located near the ribosomal mRNA entry channel. Together these results implicate g/Tif35 and i/Tif34 in stimulation of linear scanning and, specifically in the case of g/Tif35, also in proper regulation of the GCN4 reinitiation mechanism.

  4. Conservation and Variability of Meiosis Across the Eukaryotes.

    Science.gov (United States)

    Loidl, Josef

    2016-11-23

    Comparisons among a variety of eukaryotes have revealed considerable variability in the structures and processes involved in their meiosis. Nevertheless, conventional forms of meiosis occur in all major groups of eukaryotes, including early-branching protists. This finding confirms that meiosis originated in the common ancestor of all eukaryotes and suggests that primordial meiosis may have had many characteristics in common with conventional extant meiosis. However, it is possible that the synaptonemal complex and the delicate crossover control related to its presence were later acquisitions. Later still, modifications to meiotic processes occurred within different groups of eukaryotes. Better knowledge on the spectrum of derived and uncommon forms of meiosis will improve our understanding of many still mysterious aspects of the meiotic process and help to explain the evolutionary basis of functional adaptations to the meiotic program.

  5. CONTEMPORARY USAGE OF TRADITIONAL TURKISH MOTIFS IN PRODUCT DESIGNS

    Directory of Open Access Journals (Sweden)

    Tulay Gumuser

    2012-12-01

    Full Text Available The aim of this study is to identify the traditional Turkish motifs and its relations among present industrial designs. Traditional Turkish motifs played a very important role in 16th century onwards. The arts of the Ottoman Empire were used because of their symbolic meanings and unique styles. When we examine these motifs we encounter; Tiger Stripe, Three Spot (Çintemani, Rumi, Hatayi, Penç, Cloud, Crescent, Star, Crown, Hyacinth, Tulip and Carnation motifs. Nowadays, Turkish designers have begun to use these traditional Turkish motifs in their designs so as to create differences and awareness in the world design. The examples of these industrial designs, using the Turkish motifs, have survived and have Ottoman heritage and historical value. In this study, the Turkish motifs will be examined along with their focus on contemporary Turkish industrial designs used today.

  6. RNA motif search with data-driven element ordering.

    Science.gov (United States)

    Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

    2016-05-18

    In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .

  7. Eukaryotic Cell Panorama

    Science.gov (United States)

    Goodsell, David S.

    2011-01-01

    Diverse biological data may be used to create illustrations of molecules in their cellular context. This report describes the scientific results that support an illustration of a eukaryotic cell, enlarged by one million times to show the distribution and arrangement of macromolecules. The panoramic cross section includes eight panels that extend…

  8. Interaction of tRNA with Eukaryotic Ribosome

    Directory of Open Access Journals (Sweden)

    Dmitri Graifer

    2015-03-01

    Full Text Available This paper is a review of currently available data concerning interactions of tRNAs with the eukaryotic ribosome at various stages of translation. These data include the results obtained by means of cryo-electron microscopy and X-ray crystallography applied to various model ribosomal complexes, site-directed cross-linking with the use of tRNA derivatives bearing chemically or photochemically reactive groups in the CCA-terminal fragment and chemical probing of 28S rRNA in the region of the peptidyl transferase center. Similarities and differences in the interactions of tRNAs with prokaryotic and eukaryotic ribosomes are discussed with concomitant consideration of the extent of resemblance between molecular mechanisms of translation in eukaryotes and bacteria.

  9. Identification of sequence motifs significantly associated with antisense activity

    Directory of Open Access Journals (Sweden)

    Peek Andrew S

    2007-06-01

    Full Text Available Abstract Background Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. Results We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequences antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. Conclusion The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic

  10. Genome-reconstruction for eukaryotes from complex natural microbial communities.

    Science.gov (United States)

    West, Patrick T; Probst, Alexander J; Grigoriev, Igor V; Thomas, Brian C; Banfield, Jillian F

    2018-04-01

    Microbial eukaryotes are integral components of natural microbial communities, and their inclusion is critical for many ecosystem studies, yet the majority of published metagenome analyses ignore eukaryotes. In order to include eukaryotes in environmental studies, we propose a method to recover eukaryotic genomes from complex metagenomic samples. A key step for genome recovery is separation of eukaryotic and prokaryotic fragments. We developed a k -mer-based strategy, EukRep, for eukaryotic sequence identification and applied it to environmental samples to show that it enables genome recovery, genome completeness evaluation, and prediction of metabolic potential. We used this approach to test the effect of addition of organic carbon on a geyser-associated microbial community and detected a substantial change of the community metabolism, with selection against almost all candidate phyla bacteria and archaea and for eukaryotes. Near complete genomes were reconstructed for three fungi placed within the Eurotiomycetes and an arthropod. While carbon fixation and sulfur oxidation were important functions in the geyser community prior to carbon addition, the organic carbon-impacted community showed enrichment for secreted proteases, secreted lipases, cellulose targeting CAZymes, and methanol oxidation. We demonstrate the broader utility of EukRep by reconstructing and evaluating relatively high-quality fungal, protist, and rotifer genomes from complex environmental samples. This approach opens the way for cultivation-independent analyses of whole microbial communities. © 2018 West et al.; Published by Cold Spring Harbor Laboratory Press.

  11. [Structure-functional organization of eukaryotic high-affinity copper importer CTR1 determines its ability to transport copper, silver and cisplatin].

    Science.gov (United States)

    Skvortsov, A N; Zatulovskiĭ, E A; Puchkova, L V

    2012-01-01

    It was shown recently, that high affinity Cu(I) importer eukaryotic protein CTR1 can also transport in vitro abiogenic Ag(I) ions and anticancer drug cisplatin. At present there is no rational explanation how CTR1 can transfer platinum group, which is different by coordination properties from highly similar Cu(I) and Ag(I). To understand this phenomenon we analyzed 25 sequences of chordate CTR1 proteins, and found out conserved patterns of organization of N-terminal extracellular part of CTR1 which correspond to initial metal binding. Extracellular copper-binding motifs were qualified by their coordination properties. It was shown that relative position of Met- and His-rich copper-binding motifs in CTR1 predisposes the extracellular CTR1 part to binding of copper, silver and cisplatin. Relation between tissue-specific expression of CTR1 gene, steady-state copper concentration, and silver and platinum accumulation in organs of mice in vivo was analyzed. Significant positive but incomplete correlation exists between these variables. Basing on structural and functional peculiarities of N-terminal part of CTR1 a hypothesis of coupled transport of copper and cisplatin has been suggested, which avoids the disagreement between CTR1-mediated cisplatin transport in vitro, and irreversible binding of platinum to Met-rich peptides.

  12. Analisis Unsur Matematika pada Motif Sulam Usus

    Directory of Open Access Journals (Sweden)

    Fredi Ganda Putra

    2017-12-01

    Full Text Available Based on interviews with researchers sources said that the beginning of the intestine embroidery is an art of genuine crafts. Called the intestine embroidery because this technique is a technique of combining a strand of cloth resembling the intestine formed according to the pattern by means of embroidered using a thread. Intestinal embroidery techniques were originally used to create a cover of the women's customary wardrobe of Lampung or often referred to as bebe. But not many people in Lampung, especially people who live in Lampung are still many who do not know and recognize the intestine embroidery because most only know tapis only characteristic of Lampung, besides that there are other cultural results that is embroidered intestine. There are still many who do not know that the intestine motif there is a knowledge of mathematics. The researcher's problem formulation is whether there are mathematical elements contained in the intestine embroidery motif based on the concept of geometry. The purpose of this study is to determine whether there are elements of mathematics contained in the intestine motif based on the concept of geometry. Subjects in this study consisted of 4 people obtained by purposive sampling technique. From the results of data analysis conducted by using descriptive analysis and discussion as follows: (1 Intestinal embroidery motif contains the meaning of mathematics and culture or often called Etnomatematika. On the meaning of culture there is a link between the embroidery intestine with a culture that has been there before as the existence of cultural linkage between Hindu belief Buddhism and there are similarities of motifs and decorative patterns contained in the motif embroidery intestine with ornamental variety in Indonesia. (2 The relationship between the intestine with mathematical motifs there are elements of mathematics such as geometry elements in the form of geometry of dimension one and dimension two, and the

  13. Insight into structure and assembly of the nuclear pore complex by utilizing the genome of a eukaryotic thermophile

    DEFF Research Database (Denmark)

    Amlacher, Stefan; Sarges, Phillip; Flemming, Dirk

    2011-01-01

    is composed of two large Nups, Nup192 and Nup170, which are flexibly bridged by short linear motifs made up of linker Nups, Nic96 and Nup53. This assembly illustrates how Nup interactions can generate structural plasticity within the NPC scaffold. Our findings therefore demonstrate the utility of the genome...

  14. Insights into the molecular evolution of the PDZ/LIM family and identification of a novel conserved protein motif.

    Directory of Open Access Journals (Sweden)

    Aartjan J W Te Velthuis

    Full Text Available The PDZ and LIM domain-containing protein family is encoded by a diverse group of genes whose phylogeny has currently not been analyzed. In mammals, ten genes are found that encode both a PDZ- and one or several LIM-domains. These genes are: ALP, RIL, Elfin (CLP36, Mystique, Enigma (LMP-1, Enigma homologue (ENH, ZASP (Cypher, Oracle, LMO7 and the two LIM domain kinases (LIMK1 and LIMK2. As conventional alignment and phylogenetic procedures of full-length sequences fell short of elucidating the evolutionary history of these genes, we started to analyze the PDZ and LIM domain sequences themselves. Using information from most sequenced eukaryotic lineages, our phylogenetic analysis is based on full-length cDNA-, EST-derived- and genomic- PDZ and LIM domain sequences of over 25 species, ranging from yeast to humans. Plant and protozoan homologs were not found. Our phylogenetic analysis identifies a number of domain duplication and rearrangement events, and shows a single convergent event during evolution of the PDZ/LIM family. Further, we describe the separation of the ALP and Enigma subfamilies in lower vertebrates and identify a novel consensus motif, which we call 'ALP-like motif' (AM. This motif is highly-conserved between ALP subfamily proteins of diverse organisms. We used here a combinatorial approach to define the relation of the PDZ and LIM domain encoding genes and to reconstruct their phylogeny. This analysis allowed us to classify the PDZ/LIM family and to suggest a meaningful model for the molecular evolution of the diverse gene architectures found in this multi-domain family.

  15. Beyond Agrobacterium-Mediated Transformation: Horizontal Gene Transfer from Bacteria to Eukaryotes.

    Science.gov (United States)

    Lacroix, Benoît; Citovsky, Vitaly

    2018-03-03

    Besides the massive gene transfer from organelles to the nuclear genomes, which occurred during the early evolution of eukaryote lineages, the importance of horizontal gene transfer (HGT) in eukaryotes remains controversial. Yet, increasing amounts of genomic data reveal many cases of bacterium-to-eukaryote HGT that likely represent a significant force in adaptive evolution of eukaryotic species. However, DNA transfer involved in genetic transformation of plants by Agrobacterium species has traditionally been considered as the unique example of natural DNA transfer and integration into eukaryotic genomes. Recent discoveries indicate that the repertoire of donor bacterial species and of recipient eukaryotic hosts potentially are much wider than previously thought, including donor bacterial species, such as plant symbiotic nitrogen-fixing bacteria (e.g., Rhizobium etli) and animal bacterial pathogens (e.g., Bartonella henselae, Helicobacter pylori), and recipient species from virtually all eukaryotic clades. Here, we review the molecular pathways and potential mechanisms of these trans-kingdom HGT events and discuss their utilization in biotechnology and research.

  16. On the Diversification of the Translation Apparatus across Eukaryotes

    Directory of Open Access Journals (Sweden)

    Greco Hernández

    2012-01-01

    Full Text Available Diversity is one of the most remarkable features of living organisms. Current assessments of eukaryote biodiversity reaches 1.5 million species, but the true figure could be several times that number. Diversity is ingrained in all stages and echelons of life, namely, the occupancy of ecological niches, behavioral patterns, body plans and organismal complexity, as well as metabolic needs and genetics. In this review, we will discuss that diversity also exists in a key biochemical process, translation, across eukaryotes. Translation is a fundamental process for all forms of life, and the basic components and mechanisms of translation in eukaryotes have been largely established upon the study of traditional, so-called model organisms. By using modern genome-wide, high-throughput technologies, recent studies of many nonmodel eukaryotes have unveiled a surprising diversity in the configuration of the translation apparatus across eukaryotes, showing that this apparatus is far from being evolutionarily static. For some of the components of this machinery, functional differences between different species have also been found. The recent research reviewed in this article highlights the molecular and functional diversification the translational machinery has undergone during eukaryotic evolution. A better understanding of all aspects of organismal diversity is key to a more profound knowledge of life.

  17. An Evolutionary Framework for Understanding the Origin of Eukaryotes

    Directory of Open Access Journals (Sweden)

    Neil W. Blackstone

    2016-04-01

    Full Text Available Two major obstacles hinder the application of evolutionary theory to the origin of eukaryotes. The first is more apparent than real—the endosymbiosis that led to the mitochondrion is often described as “non-Darwinian” because it deviates from the incremental evolution championed by the modern synthesis. Nevertheless, endosymbiosis can be accommodated by a multi-level generalization of evolutionary theory, which Darwin himself pioneered. The second obstacle is more serious—all of the major features of eukaryotes were likely present in the last eukaryotic common ancestor thus rendering comparative methods ineffective. In addition to a multi-level theory, the development of rigorous, sequence-based phylogenetic and comparative methods represents the greatest achievement of modern evolutionary theory. Nevertheless, the rapid evolution of major features in the eukaryotic stem group requires the consideration of an alternative framework. Such a framework, based on the contingent nature of these evolutionary events, is developed and illustrated with three examples: the putative intron proliferation leading to the nucleus and the cell cycle; conflict and cooperation in the origin of eukaryotic bioenergetics; and the inter-relationship between aerobic metabolism, sterol synthesis, membranes, and sex. The modern synthesis thus provides sufficient scope to develop an evolutionary framework to understand the origin of eukaryotes.

  18. Motif signatures of transcribed enhancers

    KAUST Repository

    Kleftogiannis, Dimitrios

    2017-09-14

    In mammalian cells, transcribed enhancers (TrEn) play important roles in the initiation of gene expression and maintenance of gene expression levels in spatiotemporal manner. One of the most challenging questions in biology today is how the genomic characteristics of enhancers relate to enhancer activities. This is particularly critical, as several recent studies have linked enhancer sequence motifs to specific functional roles. To date, only a limited number of enhancer sequence characteristics have been investigated, leaving space for exploring the enhancers genomic code in a more systematic way. To address this problem, we developed a novel computational method, TELS, aimed at identifying predictive cell type/tissue specific motif signatures. We used TELS to compile a comprehensive catalog of motif signatures for all known TrEn identified by the FANTOM5 consortium across 112 human primary cells and tissues. Our results confirm that distinct cell type/tissue specific motif signatures characterize TrEn. These signatures allow discriminating successfully a) TrEn from random controls, proxy of non-enhancer activity, and b) cell type/tissue specific TrEn from enhancers expressed and transcribed in different cell types/tissues. TELS codes and datasets are publicly available at http://www.cbrc.kaust.edu.sa/TELS.

  19. [MiRNA system in unicellular eukaryotes and its evolutionary implications].

    Science.gov (United States)

    Zhang, Yan-Qiong; Wen, Jian-Fan

    2010-02-01

    microRNAs (miRNAs) in higher multicellular eukaryotes have been extensively studied in recent years. Great progresses have also been achieved for miRNAs in unicellular eukaryotes. All these studies not only enrich our knowledge about the complex expression regulation system in diverse organisms, but also have evolutionary significance for understanding the origin of this system. In this review, Authors summarize the recent advance in the studies of miRNA in unicellular eukaryotes, including that on the most primitive unicellular eukaryote--Giardia. The origin and evolution of miRNA system is also discussed.

  20. RNase MRP and the RNA processing cascade in the eukaryotic ancestor.

    Science.gov (United States)

    Woodhams, Michael D; Stadler, Peter F; Penny, David; Collins, Lesley J

    2007-02-08

    Within eukaryotes there is a complex cascade of RNA-based macromolecules that process other RNA molecules, especially mRNA, tRNA and rRNA. An example is RNase MRP processing ribosomal RNA (rRNA) in ribosome biogenesis. One hypothesis is that this complexity was present early in eukaryotic evolution; an alternative is that an initial simpler network later gained complexity by gene duplication in lineages that led to animals, fungi and plants. Recently there has been a rapid increase in support for the complexity-early theory because the vast majority of these RNA-processing reactions are found throughout eukaryotes, and thus were likely to be present in the last common ancestor of living eukaryotes, herein called the Eukaryotic Ancestor. We present an overview of the RNA processing cascade in the Eukaryotic Ancestor and investigate in particular, RNase MRP which was previously thought to have evolved later in eukaryotes due to its apparent limited distribution in fungi and animals and plants. Recent publications, as well as our own genomic searches, find previously unknown RNase MRP RNAs, indicating that RNase MRP has a wide distribution in eukaryotes. Combining secondary structure and promoter region analysis of RNAs for RNase MRP, along with analysis of the target substrate (rRNA), allows us to discuss this distribution in the light of eukaryotic evolution. We conclude that RNase MRP can now be placed in the RNA-processing cascade of the Eukaryotic Ancestor, highlighting the complexity of RNA-processing in early eukaryotes. Promoter analyses of MRP-RNA suggest that regulation of the critical processes of rRNA cleavage can vary, showing that even these key cellular processes (for which we expect high conservation) show some species-specific variability. We present our consensus MRP-RNA secondary structure as a useful model for further searches.

  1. Mechanism of Diphtheria Toxin Catalytic Domain Delivery to the Eukaryotic Cell Cytosol and the Cellular Factors that Directly Participate in the Process

    Science.gov (United States)

    Murphy, John R.

    2011-01-01

    Research on diphtheria and anthrax toxins over the past three decades has culminated in a detailed understanding of their structure function relationships (e.g., catalytic (C), transmembrane (T), and receptor binding (R) domains), as well as the identification of their eukaryotic cell surface receptor, an understanding of the molecular events leading to the receptor-mediated internalization of the toxin into an endosomal compartment, and the pH triggered conformational changes required for pore formation in the vesicle membrane. Recently, a major research effort has been focused on the development of a detailed understanding of the molecular interactions between each of these toxins and eukaryotic cell factors that play an essential role in the efficient translocation of their respective catalytic domains through the trans-endosomal vesicle membrane pore and delivery into the cell cytosol. In this review, I shall focus on recent findings that have led to a more detailed understanding of the mechanism by which the diphtheria toxin catalytic domain is delivered to the eukaryotic cell cytosol. While much work remains, it is becoming increasingly clear that the entry process is facilitated by specific interactions with a number of cellular factors in an ordered sequential fashion. In addition, since diphtheria, anthrax lethal factor and anthrax edema factor all carry multiple coatomer I complex binding motifs and COPI complex has been shown to play an essential role in entry process, it is likely that the initial steps in catalytic domain entry of these divergent toxins follow a common mechanism. PMID:22069710

  2. Genome-wide prediction and functional validation of promoter motifs regulating gene expression in spore and infection stages of Phytophthora infestans.

    Directory of Open Access Journals (Sweden)

    Sourav Roy

    2013-03-01

    Full Text Available Most eukaryotic pathogens have complex life cycles in which gene expression networks orchestrate the formation of cells specialized for dissemination or host colonization. In the oomycete Phytophthora infestans, the potato late blight pathogen, major shifts in mRNA profiles during developmental transitions were identified using microarrays. We used those data with search algorithms to discover about 100 motifs that are over-represented in promoters of genes up-regulated in hyphae, sporangia, sporangia undergoing zoosporogenesis, swimming zoospores, or germinated cysts forming appressoria (infection structures. Most of the putative stage-specific transcription factor binding sites (TFBSs thus identified had features typical of TFBSs such as position or orientation bias, palindromy, and conservation in related species. Each of six motifs tested in P. infestans transformants using the GUS reporter gene conferred the expected stage-specific expression pattern, and several were shown to bind nuclear proteins in gel-shift assays. Motifs linked to the appressoria-forming stage, including a functionally validated TFBS, were over-represented in promoters of genes encoding effectors and other pathogenesis-related proteins. To understand how promoter and genome architecture influence expression, we also mapped transcription patterns to the P. infestans genome assembly. Adjacent genes were not typically induced in the same stage, including genes transcribed in opposite directions from small intergenic regions, but co-regulated gene pairs occurred more than expected by random chance. These data help illuminate the processes regulating development and pathogenesis, and will enable future attempts to purify the cognate transcription factors.

  3. Eukaryotic cell flattening

    Science.gov (United States)

    Bae, Albert; Westendorf, Christian; Erlenkamper, Christoph; Galland, Edouard; Franck, Carl; Bodenschatz, Eberhard; Beta, Carsten

    2010-03-01

    Eukaryotic cell flattening is valuable for improving microscopic observations, ranging from bright field to total internal reflection fluorescence microscopy. In this talk, we will discuss traditional overlay techniques, and more modern, microfluidic based flattening, which provides a greater level of control. We demonstrate these techniques on the social amoebae Dictyostelium discoideum, comparing the advantages and disadvantages of each method.

  4. Autophagy in unicellular eukaryotes

    NARCIS (Netherlands)

    Kiel, J.A.K.W.

    2010-01-01

    Cells need a constant supply of precursors to enable the production of macromolecules to sustain growth and survival. Unlike metazoans, unicellular eukaryotes depend exclusively on the extracellular medium for this supply. When environmental nutrients become depleted, existing cytoplasmic components

  5. Nitrate storage and dissimilatory nitrate reduction by eukaryotic microbes

    DEFF Research Database (Denmark)

    Kamp, Anja; Høgslund, Signe; Risgaard-Petersen, Nils

    2015-01-01

    The microbial nitrogen cycle is one of the most complex and environmentally important element cycles on Earth and has long been thought to be mediated exclusively by prokaryotic microbes. Rather recently, it was discovered that certain eukaryotic microbes are able to store nitrate intracellularly......, suggesting that eukaryotes may rival prokaryotes in terms of dissimilatory nitrate reduction. Finally, this review article sketches some evolutionary perspectives of eukaryotic nitrate metabolism and identifies open questions that need to be addressed in future investigations....... and use it for dissimilatory nitrate reduction in the absence of oxygen. The paradigm shift that this entailed is ecologically significant because the eukaryotes in question comprise global players like diatoms, foraminifers, and fungi. This review article provides an unprecedented overview of nitrate...

  6. Comparative Genomics of Eukaryotes.

    NARCIS (Netherlands)

    Noort, V. van

    2007-01-01

    This thesis focuses on developing comparative genomics methods in eukaryotes, with an emphasis on applications for gene function prediction and regulatory element detection. In the past, methods have been developed to predict functional associations between gene pairs in prokaryotes. The challenge

  7. Triadic motifs in the dependence networks of virtual societies

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-06-01

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.

  8. Triadic motifs in the dependence networks of virtual societies.

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-06-10

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.

  9. Direct AUC optimization of regulatory motifs.

    Science.gov (United States)

    Zhu, Lin; Zhang, Hong-Bo; Huang, De-Shuang

    2017-07-15

    The discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the huge amount of computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have been proven to be promising for harnessing the discovery power of accumulated huge amount of high-throughput binding data. However, they have to sacrifice accuracy for speed and could fail to fully utilize the information of the input sequences. We propose a novel algorithm called CDAUC for optimizing DML-learned motifs based on the area under the receiver-operating characteristic curve (AUC) criterion, which has been widely used in the literature to evaluate the significance of extracted motifs. We show that when the considered AUC loss function is optimized in a coordinate-wise manner, the cost function of each resultant sub-problem is a piece-wise constant function, whose optimal value can be found exactly and efficiently. Further, a key step of each iteration of CDAUC can be efficiently solved as a computational geometry problem. Experimental results on real world high-throughput datasets illustrate that CDAUC outperforms competing methods for refining DML motifs, while being one order of magnitude faster. Meanwhile, preliminary results also show that CDAUC may also be useful for improving the interpretability of convolutional kernels generated by the emerging deep learning approaches for predicting TF sequences specificities. CDAUC is available at: https://drive.google.com/drive/folders/0BxOW5MtIZbJjNFpCeHlBVWJHeW8 . dshuang@tongji.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  10. DMINDA: an integrated web server for DNA motif identification and analyses.

    Science.gov (United States)

    Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

    2014-07-01

    DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. A Conserved EAR Motif Is Required for Avirulence and Stability of the Ralstonia solanacearum Effector PopP2 In Planta

    Directory of Open Access Journals (Sweden)

    Cécile Segonzac

    2017-08-01

    Full Text Available Ralstonia solanacearum is the causal agent of the devastating bacterial wilt disease in many high value Solanaceae crops. R. solanacearum secretes around 70 effectors into host cells in order to promote infection. Plants have, however, evolved specialized immune receptors that recognize corresponding effectors and confer qualitative disease resistance. In the model species Arabidopsis thaliana, the paired immune receptors RRS1 (resistance to Ralstonia solanacearum 1 and RPS4 (resistance to Pseudomonas syringae 4 cooperatively recognize the R. solanacearum effector PopP2 in the nuclei of infected cells. PopP2 is an acetyltransferase that binds to and acetylates the RRS1 WRKY DNA-binding domain resulting in reduced RRS1-DNA association thereby activating plant immunity. Here, we surveyed the naturally occurring variation in PopP2 sequence among the R. solanacearum strains isolated from diseased tomato and pepper fields across the Republic of Korea. Our analysis revealed high conservation of popP2 sequence with only three polymorphic alleles present amongst 17 strains. Only one variation (a premature stop codon caused the loss of RPS4/RRS1-dependent recognition in Arabidopsis. We also found that PopP2 harbors a putative eukaryotic transcriptional repressor motif (ethylene-responsive element binding factor-associated amphiphilic repression or EAR, which is known to be involved in the recruitment of transcriptional co-repressors. Remarkably, mutation of the EAR motif disabled PopP2 avirulence function as measured by the development of hypersensitive response, electrolyte leakage, defense marker gene expression and bacterial growth in Arabidopsis. This lack of recognition was partially but significantly reverted by the C-terminal addition of a synthetic EAR motif. We show that the EAR motif-dependent gain of avirulence correlated with the stability of the PopP2 protein. Furthermore, we demonstrated the requirement of the PopP2 EAR motif for PTI

  12. Efficient motif finding algorithms for large-alphabet inputs

    Directory of Open Access Journals (Sweden)

    Pavlovic Vladimir

    2010-10-01

    Full Text Available Abstract Background We consider the problem of identifying motifs, recurring or conserved patterns, in the biological sequence data sets. To solve this task, we present a new deterministic algorithm for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. Results The proposed algorithm (1 improves search efficiency compared to existing algorithms, and (2 scales well with the size of alphabet. On a synthetic planted DNA motif finding problem our algorithm is over 10× more efficient than MITRA, PMSPrune, and RISOTTO for long motifs. Improvements are orders of magnitude higher in the same setting with large alphabets. On benchmark TF-binding site problems (FNP, CRP, LexA we observed reduction in running time of over 12×, with high detection accuracy. The algorithm was also successful in rapidly identifying protein motifs in Lipocalin, Zinc metallopeptidase, and supersecondary structure motifs for Cadherin and Immunoglobin families. Conclusions Our algorithm reduces computational complexity of the current motif finding algorithms and demonstrate strong running time improvements over existing exact algorithms, especially in important and difficult cases of large-alphabet sequences.

  13. AUG is the only initiation codon in eukaryotes

    Energy Technology Data Exchange (ETDEWEB)

    Sherman, F; McKnight, G; Stewart, J W

    1980-01-01

    An analysis of mutants of the yeast Saccharomyces cerevisiae indicates that AUG is the sole codon capable of initiating translation of iso-1-cytochrome c. This result with yeast and the sequence results of numerous eukaryotic genes indicate that AUG is the only initiation codon in eukaryotes; in contrast, results with Escherichia colia and bacteriophages indicate that both AUG and GUG are initiation codons in prokaryotes. The difference can be explained by the lack of the t/sup 6/ A hypermodified nucleoside (N-(9-(..beta..-D-ribofuranosyl)purin-6-ylcarbamoyl)threonine) in prokaryotic initiator tRNA and its presence in eukaryotic initiator tRNA.

  14. RMOD: a tool for regulatory motif detection in signaling network.

    Directory of Open Access Journals (Sweden)

    Jinki Kim

    Full Text Available Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present a RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it into a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways and it showed that the speed and scalability of this algorithm outperforms those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manipulate the whole processes from building signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and mechanism underlying their regulatory motif activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.

  15. Genome-wide analyses and functional classification of proline repeat-rich proteins: potential role of eIF5A in eukaryotic evolution.

    Directory of Open Access Journals (Sweden)

    Ajeet Mandal

    Full Text Available The eukaryotic translation factor, eIF5A has been recently reported as a sequence-specific elongation factor that facilitates peptide bond formation at consecutive prolines in Saccharomyces cerevisiae, as its ortholog elongation factor P (EF-P does in bacteria. We have searched the genome databases of 35 representative organisms from six kingdoms of life for PPP (Pro-Pro-Pro and/or PPG (Pro-Pro-Gly-encoding genes whose expression is expected to depend on eIF5A. We have made detailed analyses of proteome data of 5 selected species, Escherichia coli, Saccharomyces cerevisiae, Drosophila melanogaster, Mus musculus and Homo sapiens. The PPP and PPG motifs are low in the prokaryotic proteomes. However, their frequencies markedly increase with the biological complexity of eukaryotic organisms, and are higher in newly derived proteins than in those orthologous proteins commonly shared in all species. Ontology classifications of S. cerevisiae and human genes encoding the highest level of polyprolines reveal their strong association with several specific biological processes, including actin/cytoskeletal associated functions, RNA splicing/turnover, DNA binding/transcription and cell signaling. Previously reported phenotypic defects in actin polarity and mRNA decay of eIF5A mutant strains are consistent with the proposed role for eIF5A in the translation of the polyproline-containing proteins. Of all the amino acid tandem repeats (≥3 amino acids, only the proline repeat frequency correlates with functional complexity of the five organisms examined. Taken together, these findings suggest the importance of proline repeat-rich proteins and a potential role for eIF5A and its hypusine modification pathway in the course of eukaryotic evolution.

  16. Anaerobic energy metabolism in unicellular photosynthetic eukaryotes.

    Science.gov (United States)

    Atteia, Ariane; van Lis, Robert; Tielens, Aloysius G M; Martin, William F

    2013-02-01

    Anaerobic metabolic pathways allow unicellular organisms to tolerate or colonize anoxic environments. Over the past ten years, genome sequencing projects have brought a new light on the extent of anaerobic metabolism in eukaryotes. A surprising development has been that free-living unicellular algae capable of photoautotrophic lifestyle are, in terms of their enzymatic repertoire, among the best equipped eukaryotes known when it comes to anaerobic energy metabolism. Some of these algae are marine organisms, common in the oceans, others are more typically soil inhabitants. All these species are important from the ecological (O(2)/CO(2) budget), biotechnological, and evolutionary perspectives. In the unicellular algae surveyed here, mixed-acid type fermentations are widespread while anaerobic respiration, which is more typical of eukaryotic heterotrophs, appears to be rare. The presence of a core anaerobic metabolism among the algae provides insights into its evolutionary origin, which traces to the eukaryote common ancestor. The predicted fermentative enzymes often exhibit an amino acid extension at the N-terminus, suggesting that these proteins might be compartmentalized in the cell, likely in the chloroplast or the mitochondrion. The green algae Chlamydomonas reinhardtii and Chlorella NC64 have the most extended set of fermentative enzymes reported so far. Among the eukaryotes with secondary plastids, the diatom Thalassiosira pseudonana has the most pronounced anaerobic capabilities as yet. From the standpoints of genomic, transcriptomic, and biochemical studies, anaerobic energy metabolism in C. reinhardtii remains the best characterized among photosynthetic protists. This article is part of a Special Issue entitled: The evolutionary aspects of bioenergetic systems. Copyright © 2012 Elsevier B.V. All rights reserved.

  17. Étude structurale de l'assemblage du complexe télomérique humain TRF2/RAP1

    OpenAIRE

    Gaullier , Guillaume

    2015-01-01

    Telomeres are the ends of eukaryotic linear chromosomes. They are made oftandem repeats of a short guanine-rich motif and bound by specific proteins.In vertebrates, these proteins form a complex called shelterin, theintegrity of which is critical to ensure proper replication of chromosomeends and to protect them against illicit targeting by DNA double-strandbreak repair pathways. Telomere dysfunctions lead to genome instability,which can ultimately cause senescence or cancer. Telomeres are a ...

  18. DNA motif alignment by evolving a population of Markov chains.

    Science.gov (United States)

    Bi, Chengpeng

    2009-01-30

    Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.

  19. David and Goliath: chemical perturbation of eukaryotes by bacteria.

    Science.gov (United States)

    Ho, Louis K; Nodwell, Justin R

    2016-03-01

    Environmental microbes produce biologically active small molecules that have been mined extensively as antibiotics and a smaller number of drugs that act on eukaryotic cells. It is known that there are additional bioactives to be discovered from this source. While the discovery of new antibiotics is challenged by the frequent discovery of known compounds, we contend that the eukaryote-active compounds may be less saturated. Indeed, despite there being far fewer eukaryotic-active natural products these molecules interact with a far richer diversity of molecular and cellular targets.

  20. Reproduction, symbiosis, and the eukaryotic cell

    Science.gov (United States)

    Godfrey-Smith, Peter

    2015-01-01

    This paper develops a conceptual framework for addressing questions about reproduction, individuality, and the units of selection in symbiotic associations, with special attention to the origin of the eukaryotic cell. Three kinds of reproduction are distinguished, and a possible evolutionary sequence giving rise to a mitochondrion-containing eukaryotic cell from an endosymbiotic partnership is analyzed as a series of transitions between each of the three forms of reproduction. The sequence of changes seen in this “egalitarian” evolutionary transition is compared with those that apply in “fraternal” transitions, such as the evolution of multicellularity in animals. PMID:26286983

  1. Horizontal transfer of a eukaryotic plastid-targeted protein gene to cyanobacteria

    Directory of Open Access Journals (Sweden)

    Keeling Patrick J

    2007-06-01

    Full Text Available Abstract Background Horizontal or lateral transfer of genetic material between distantly related prokaryotes has been shown to play a major role in the evolution of bacterial and archaeal genomes, but exchange of genes between prokaryotes and eukaryotes is not as well understood. In particular, gene flow from eukaryotes to prokaryotes is rarely documented with strong support, which is unusual since prokaryotic genomes appear to readily accept foreign genes. Results Here, we show that abundant marine cyanobacteria in the related genera Synechococcus and Prochlorococcus acquired a key Calvin cycle/glycolytic enzyme from a eukaryote. Two non-homologous forms of fructose bisphosphate aldolase (FBA are characteristic of eukaryotes and prokaryotes respectively. However, a eukaryotic gene has been inserted immediately upstream of the ancestral prokaryotic gene in several strains (ecotypes of Synechococcus and Prochlorococcus. In one lineage this new gene has replaced the ancestral gene altogether. The eukaryotic gene is most closely related to the plastid-targeted FBA from red algae. This eukaryotic-type FBA once replaced the plastid/cyanobacterial type in photosynthetic eukaryotes, hinting at a possible functional advantage in Calvin cycle reactions. The strains that now possess this eukaryotic FBA are scattered across the tree of Synechococcus and Prochlorococcus, perhaps because the gene has been transferred multiple times among cyanobacteria, or more likely because it has been selectively retained only in certain lineages. Conclusion A gene for plastid-targeted FBA has been transferred from red algae to cyanobacteria, where it has inserted itself beside its non-homologous, functional analogue. Its current distribution in Prochlorococcus and Synechococcus is punctate, suggesting a complex history since its introduction to this group.

  2. Massive expansion of the calpain gene family in unicellular eukaryotes

    Directory of Open Access Journals (Sweden)

    Zhao Sen

    2012-09-01

    Full Text Available Abstract Background Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists. Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Results Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. Conclusions The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.

  3. Massive expansion of the calpain gene family in unicellular eukaryotes.

    Science.gov (United States)

    Zhao, Sen; Liang, Zhe; Demko, Viktor; Wilson, Robert; Johansen, Wenche; Olsen, Odd-Arne; Shalchian-Tabrizi, Kamran

    2012-09-29

    Calpains are Ca2+-dependent cysteine proteases that participate in a range of crucial cellular processes. Dysfunction of these enzymes may cause, for instance, life-threatening diseases in humans, the loss of sex determination in nematodes and embryo lethality in plants. Although the calpain family is well characterized in animal and plant model organisms, there is a great lack of knowledge about these genes in unicellular eukaryote species (i.e. protists). Here, we study the distribution and evolution of calpain genes in a wide range of eukaryote genomes from major branches in the tree of life. Our investigations reveal 24 types of protein domains that are combined with the calpain-specific catalytic domain CysPc. In total we identify 41 different calpain domain architectures, 28 of these domain combinations have not been previously described. Based on our phylogenetic inferences, we propose that at least four calpain variants were established in the early evolution of eukaryotes, most likely before the radiation of all the major supergroups of eukaryotes. Many domains associated with eukaryotic calpain genes can be found among eubacteria or archaebacteria but never in combination with the CysPc domain. The analyses presented here show that ancient modules present in prokaryotes, and a few de novo eukaryote domains, have been assembled into many novel domain combinations along the evolutionary history of eukaryotes. Some of the new calpain genes show a narrow distribution in a few branches in the tree of life, likely representing lineage-specific innovations. Hence, the functionally important classical calpain genes found among humans and vertebrates make up only a tiny fraction of the calpain family. In fact, a massive expansion of the calpain family occurred by domain shuffling among unicellular eukaryotes and contributed to a wealth of functionally different genes.

  4. DNA motif elucidation using belief propagation.

    Science.gov (United States)

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-09-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k=8∼10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/∼wkc/kmerHMM.

  5. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-01-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  6. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun

    2013-06-29

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors\\' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  7. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.

  8. Arsenic and Antimony Transporters in Eukaryotes

    Directory of Open Access Journals (Sweden)

    Ewa Maciaszczyk-Dziubinska

    2012-03-01

    Full Text Available Arsenic and antimony are toxic metalloids, naturally present in the environment and all organisms have developed pathways for their detoxification. The most effective metalloid tolerance systems in eukaryotes include downregulation of metalloid uptake, efflux out of the cell, and complexation with phytochelatin or glutathione followed by sequestration into the vacuole. Understanding of arsenic and antimony transport system is of high importance due to the increasing usage of arsenic-based drugs in the treatment of certain types of cancer and diseases caused by protozoan parasites as well as for the development of bio- and phytoremediation strategies for metalloid polluted areas. However, in contrast to prokaryotes, the knowledge about specific transporters of arsenic and antimony and the mechanisms of metalloid transport in eukaryotes has been very limited for a long time. Here, we review the recent advances in understanding of arsenic and antimony transport pathways in eukaryotes, including a dual role of aquaglyceroporins in uptake and efflux of metalloids, elucidation of arsenic transport mechanism by the yeast Acr3 transporter and its role in arsenic hyperaccumulation in ferns, identification of vacuolar transporters of arsenic-phytochelatin complexes in plants and forms of arsenic substrates recognized by mammalian ABC transporters.

  9. Arsenic and Antimony Transporters in Eukaryotes

    Science.gov (United States)

    Maciaszczyk-Dziubinska, Ewa; Wawrzycka, Donata; Wysocki, Robert

    2012-01-01

    Arsenic and antimony are toxic metalloids, naturally present in the environment and all organisms have developed pathways for their detoxification. The most effective metalloid tolerance systems in eukaryotes include downregulation of metalloid uptake, efflux out of the cell, and complexation with phytochelatin or glutathione followed by sequestration into the vacuole. Understanding of arsenic and antimony transport system is of high importance due to the increasing usage of arsenic-based drugs in the treatment of certain types of cancer and diseases caused by protozoan parasites as well as for the development of bio- and phytoremediation strategies for metalloid polluted areas. However, in contrast to prokaryotes, the knowledge about specific transporters of arsenic and antimony and the mechanisms of metalloid transport in eukaryotes has been very limited for a long time. Here, we review the recent advances in understanding of arsenic and antimony transport pathways in eukaryotes, including a dual role of aquaglyceroporins in uptake and efflux of metalloids, elucidation of arsenic transport mechanism by the yeast Acr3 transporter and its role in arsenic hyperaccumulation in ferns, identification of vacuolar transporters of arsenic-phytochelatin complexes in plants and forms of arsenic substrates recognized by mammalian ABC transporters. PMID:22489166

  10. Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.

    Science.gov (United States)

    Gade, Chandrasekhar Reddy; Sharma, Nagendra K

    2017-12-15

    This report describes the synthesis of C-rich sequence, cytosine pentamer, of aep-PNA and its biophysical studies for the formation of hybrid DNA:aep-PNAi-motif structure with DNA cytosine pentamer (dC 5 ) under acidic pH conditions. Herein, the CD/UV/NMR/ESI-Mass studies strongly support the formation of stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence aep-PNA C-rich sequence cytosine could be considered as potential DNA i-motif stabilizing agents in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  11. Automatic annotation of protein motif function with Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Gopalakrishnan Vanathi

    2004-09-01

    Full Text Available Abstract Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paperpresents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifsis viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association isfound to be a very useful feature. We take advantageof the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correctassociation. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about thefunctions of newly discovered candidate protein motifs.

  12. Verification of the MOTIF code version 3.0

    International Nuclear Information System (INIS)

    Chan, T.; Guvanasen, V.; Nakka, B.W.; Reid, J.A.K.; Scheier, N.W.; Stanchell, F.W.

    1996-12-01

    As part of the Canadian Nuclear Fuel Waste Management Program (CNFWMP), AECL has developed a three-dimensional finite-element code, MOTIF (Model Of Transport In Fractured/ porous media), for detailed modelling of groundwater flow, heat transport and solute transport in a fractured rock mass. The code solves the transient and steady-state equations of groundwater flow, solute (including one-species radionuclide) transport, and heat transport in variably saturated fractured/porous media. The initial development was completed in 1985 (Guvanasen 1985) and version 3.0 was completed in 1986. This version is documented in detail in Guvanasen and Chan (in preparation). This report describes a series of fourteen verification cases which has been used to test the numerical solution techniques and coding of MOTIF, as well as demonstrate some of the MOTIF analysis capabilities. For each case the MOTIF solution has been compared with a corresponding analytical or independently developed alternate numerical solution. Several of the verification cases were included in Level 1 of the International Hydrologic Code Intercomparison Project (HYDROCOIN). The MOTIF results for these cases were also described in the HYDROCOIN Secretariat's compilation and comparison of results submitted by the various project teams (Swedish Nuclear Power Inspectorate 1988). It is evident from the graphical comparisons presented that the MOTIF solutions for the fourteen verification cases are generally in excellent agreement with known analytical or numerical solutions obtained from independent sources. This series of verification studies has established the ability of the MOTIF finite-element code to accurately model the groundwater flow and solute and heat transport phenomena for which it is intended. (author). 20 refs., 14 tabs., 32 figs

  13. Identification of amino acid residues in protein SRP72 required for binding to a kinked 5e motif of the human signal recognition particle RNA.

    Science.gov (United States)

    Iakhiaeva, Elena; Iakhiaev, Alexei; Zwieb, Christian

    2010-11-13

    Human cells depend critically on the signal recognition particle (SRP) for the sorting and delivery of their proteins. The SRP is a ribonucleoprotein complex which binds to signal sequences of secretory polypeptides as they emerge from the ribosome. Among the six proteins of the eukaryotic SRP, the largest protein, SRP72, is essential for protein targeting and possesses a poorly characterized RNA binding domain. We delineated the minimal region of SRP72 capable of forming a stable complex with an SRP RNA fragment. The region encompassed residues 545 to 585 of the full-length human SRP72 and contained a lysine-rich cluster (KKKKKKKKGK) at postions 552 to 561 as well as a conserved Pfam motif with the sequence PDPXRWLPXXER at positions 572 to 583. We demonstrated by site-directed mutagenesis that both regions participated in the formation of a complex with the RNA. In agreement with biochemical data and results from chymotryptic digestion experiments, molecular modeling of SRP72 implied that the invariant W577 was located inside the predicted structure of an RNA binding domain. The 11-nucleotide 5e motif contained within the SRP RNA fragment was shown by comparative electrophoresis on native polyacrylamide gels to conform to an RNA kink-turn. The model of the complex suggested that the conserved A240 of the K-turn, previously identified as being essential for the binding to SRP72, could protrude into a groove of the SRP72 RNA binding domain, similar but not identical to how other K-turn recognizing proteins interact with RNA. The results from the presented experiments provided insights into the molecular details of a functionally important and structurally interesting RNA-protein interaction. A model for how a ligand binding pocket of SRP72 can accommodate a new RNA K-turn in the 5e region of the eukaryotic SRP RNA is proposed.

  14. Purification and functional motifs of the recombinant ATPase of orf virus.

    Science.gov (United States)

    Lin, Fong-Yuan; Chan, Kun-Wei; Wang, Chi-Young; Wong, Min-Liang; Hsu, Wei-Li

    2011-10-01

    Our previous study showed that the recombinant ATPase encoded by the A32L gene of orf virus displayed ATP hydrolysis activity as predicted from its amino acids sequence. This viral ATPase contains four known functional motifs (motifs I-IV) and a novel AYDG motif; they are essential for ATP hydrolysis reaction by binding ATP and magnesium ions. The motifs I and II correspond with the Walker A and B motifs of the typical ATPase, respectively. To examine the biochemical roles of these five conserved motifs, recombinant ATPases of five deletion mutants derived from the Taiping strain were expressed and purified. Their ATPase functions were assayed and compared with those of two wild type strains, Taiping and Nantou isolated in Taiwan. Our results showed that deletions at motifs I-III or IV exhibited lower activity than that of the wild type. Interestingly, deletion of AYDG motif decreased the ATPase activity more significantly than those of motifs I-IV deletions. Divalent ions such as magnesium and calcium were essential for ATPase activity. Moreover, our recombinant proteins of orf virus also demonstrated GTPase activity, though weaker than the original ATPase activity. Copyright © 2011 Elsevier Inc. All rights reserved.

  15. A Synthetic Biology Framework for Programming Eukaryotic Transcription Functions

    Science.gov (United States)

    Khalil, Ahmad S.; Lu, Timothy K.; Bashor, Caleb J.; Ramirez, Cherie L.; Pyenson, Nora C.; Joung, J. Keith; Collins, James J.

    2013-01-01

    SUMMARY Eukaryotic transcription factors (TFs) perform complex and combinatorial functions within transcriptional networks. Here, we present a synthetic framework for systematically constructing eukaryotic transcription functions using artificial zinc fingers, modular DNA-binding domains found within many eukaryotic TFs. Utilizing this platform, we construct a library of orthogonal synthetic transcription factors (sTFs) and use these to wire synthetic transcriptional circuits in yeast. We engineer complex functions, such as tunable output strength and transcriptional cooperativity, by rationally adjusting a decomposed set of key component properties, e.g., DNA specificity, affinity, promoter design, protein-protein interactions. We show that subtle perturbations to these properties can transform an individual sTF between distinct roles (activator, cooperative factor, inhibitory factor) within a transcriptional complex, thus drastically altering the signal processing behavior of multi-input systems. This platform provides new genetic components for synthetic biology and enables bottom-up approaches to understanding the design principles of eukaryotic transcriptional complexes and networks. PMID:22863014

  16. Structure and Mechanism of a Eukaryotic FMN Adenylyltransferase

    OpenAIRE

    Huerta, Carlos; Borek, Dominika; Machius, Mischa; Grishin, Nick V.; Zhang, Hong

    2009-01-01

    Flavin mononucleotide adenylyltransferase (FMNAT) catalyzes the formation of the essential flavocoenzyme FAD and plays an important role in flavocoenzyme homeostasis regulation. By sequence comparison, bacterial and eukaryotic FMNAT enzymes belong to two different protein superfamilies and apparently utilize different set of active site residues to accomplish the same chemistry. Here we report the first structural characterization of a eukaryotic FMNAT from a pathogenic yeast Candida glabrata...

  17. An experimental test of a fundamental food web motif.

    Science.gov (United States)

    Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia

    2010-06-07

    Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure-the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths, is one such motif. This motif amazingly occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and through clear, experimental manipulation, identifies the fundamental stabilizing consequences of this structure for ecological communities.

  18. Defensins: antifungal lessons from eukaryotes

    Directory of Open Access Journals (Sweden)

    Patrícia M. Silva

    2014-03-01

    Full Text Available Over the last years, antimicrobial peptides (AMPs have been the focus of intense research towards the finding of a viable alternative to current antifungal drugs. Defensins are one of the major families of AMPs and the most represented among all eukaryotic groups, providing an important first line of host defense against pathogenic microorganisms. Several of these cysteine-stabilized peptides present a relevant effect against fungi. Defensins are the AMPs with the broader distribution across all eukaryotic kingdoms, namely, Fungi, Plantæ and Animalia, and were recently shown to have an ancestor in a bacterial organism. As a part of the host defense, defensins act as an important vehicle of information between innate and adaptive immune system and have a role in immunomodulation. This multidimensionality represents a powerful host shield, hard for microorganisms to overcome using single approach resistance strategies. Pathogenic fungi resistance to conventional antimycotic drugs is becoming a major problem. Defensins, as other AMPs, have shown to be an effective alternative to the current antimycotic therapies, demonstrating potential as novel therapeutic agents or drug leads. In this review, we summarize the current knowledge on some eukaryotic defensins with antifungal action. An overview of the main targets in the fungal cell and the mechanism of action of these AMPs (namely, the selectivity for some fungal membrane components are presented. Additionally, recent works on antifungal defensins structure, activity and citotoxicity are also reviewed.

  19. Highly scalable Ab initio genomic motif identification

    KAUST Repository

    Marchand, Benoit; Bajic, Vladimir B.; Kaushik, Dinesh

    2011-01-01

    We present results of scaling an ab initio motif family identification system, Dragon Motif Finder (DMF), to 65,536 processor cores of IBM Blue Gene/P. DMF seeks groups of mutually similar polynucleotide patterns within a set of genomic sequences and builds various motif families from them. Such information is of relevance to many problems in life sciences. Prior attempts to scale such ab initio motif-finding algorithms achieved limited success. We solve the scalability issues using a combination of mixed-mode MPI-OpenMP parallel programming, master-slave work assignment, multi-level workload distribution, multi-level MPI collectives, and serial optimizations. While the scalability of our algorithm was excellent (94% parallel efficiency on 65,536 cores relative to 256 cores on a modest-size problem), the final speedup with respect to the original serial code exceeded 250,000 when serial optimizations are included. This enabled us to carry out many large-scale ab initio motiffinding simulations in a few hours while the original serial code would have needed decades of execution time. Copyright 2011 ACM.

  20. HIV-1 Replication and the Cellular Eukaryotic Translation Apparatus

    Directory of Open Access Journals (Sweden)

    Santiago Guerrero

    2015-01-01

    Full Text Available Eukaryotic translation is a complex process composed of three main steps: initiation, elongation, and termination. During infections by RNA- and DNA-viruses, the eukaryotic translation machinery is used to assure optimal viral protein synthesis. Human immunodeficiency virus type I (HIV-1 uses several non-canonical pathways to translate its own proteins, such as leaky scanning, frameshifting, shunt, and cap-independent mechanisms. Moreover, HIV-1 modulates the host translation machinery by targeting key translation factors and overcomes different cellular obstacles that affect protein translation. In this review, we describe how HIV-1 proteins target several components of the eukaryotic translation machinery, which consequently improves viral translation and replication.

  1. Mechanisms of zero-lag synchronization in cortical motifs.

    Directory of Open Access Journals (Sweden)

    Leonardo L Gollo

    2014-04-01

    Full Text Available Zero-lag synchronization between distant cortical areas has been observed in a diversity of experimental data sets and between many different regions of the brain. Several computational mechanisms have been proposed to account for such isochronous synchronization in the presence of long conduction delays: Of these, the phenomenon of "dynamical relaying"--a mechanism that relies on a specific network motif--has proven to be the most robust with respect to parameter mismatch and system noise. Surprisingly, despite a contrary belief in the community, the common driving motif is an unreliable means of establishing zero-lag synchrony. Although dynamical relaying has been validated in empirical and computational studies, the deeper dynamical mechanisms and comparison to dynamics on other motifs is lacking. By systematically comparing synchronization on a variety of small motifs, we establish that the presence of a single reciprocally connected pair--a "resonance pair"--plays a crucial role in disambiguating those motifs that foster zero-lag synchrony in the presence of conduction delays (such as dynamical relaying from those that do not (such as the common driving triad. Remarkably, minor structural changes to the common driving motif that incorporate a reciprocal pair recover robust zero-lag synchrony. The findings are observed in computational models of spiking neurons, populations of spiking neurons and neural mass models, and arise whether the oscillatory systems are periodic, chaotic, noise-free or driven by stochastic inputs. The influence of the resonance pair is also robust to parameter mismatch and asymmetrical time delays amongst the elements of the motif. We call this manner of facilitating zero-lag synchrony resonance-induced synchronization, outline the conditions for its occurrence, and propose that it may be a general mechanism to promote zero-lag synchrony in the brain.

  2. A speedup technique for (l, d-motif finding algorithms

    Directory of Open Access Journals (Sweden)

    Dinh Hieu

    2011-03-01

    Full Text Available Abstract Background The discovery of patterns in DNA, RNA, and protein sequences has led to the solution of many vital biological problems. For instance, the identification of patterns in nucleic acid sequences has resulted in the determination of open reading frames, identification of promoter elements of genes, identification of intron/exon splicing sites, identification of SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have proven to be extremely helpful in domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, etc. Motifs are important patterns that are helpful in finding transcriptional regulatory elements, transcription factor binding sites, functional genomics, drug design, etc. As a result, numerous papers have been written to solve the motif search problem. Results Three versions of the motif search problem have been proposed in the literature: Simple Motif Search (SMS, (l, d-motif search (or Planted Motif Search (PMS, and Edit-distance-based Motif Search (EMS. In this paper we focus on PMS. Two kinds of algorithms can be found in the literature for solving the PMS problem: exact and approximate. An exact algorithm identifies the motifs always and an approximate algorithm may fail to identify some or all of the motifs. The exact version of PMS problem has been shown to be NP-hard. Exact algorithms proposed in the literature for PMS take time that is exponential in some of the underlying parameters. In this paper we propose a generic technique that can be used to speedup PMS algorithms. Conclusions We present a speedup technique that can be used on any PMS algorithm. We have tested our speedup technique on a number of algorithms. These experimental results show that our speedup technique is indeed very

  3. Transduction motif analysis of gastric cancer based on a human signaling network

    Energy Technology Data Exchange (ETDEWEB)

    Liu, G.; Li, D.Z.; Jiang, C.S.; Wang, W. [Fuzhou General Hospital of Nanjing Command, Department of Gastroenterology, Fuzhou, China, Department of Gastroenterology, Fuzhou General Hospital of Nanjing Command, Fuzhou (China)

    2014-04-04

    To investigate signal regulation models of gastric cancer, databases and literature were used to construct the signaling network in humans. Topological characteristics of the network were analyzed by CytoScape. After marking gastric cancer-related genes extracted from the CancerResource, GeneRIF, and COSMIC databases, the FANMOD software was used for the mining of gastric cancer-related motifs in a network with three vertices. The significant motif difference method was adopted to identify significantly different motifs in the normal and cancer states. Finally, we conducted a series of analyses of the significantly different motifs, including gene ontology, function annotation of genes, and model classification. A human signaling network was constructed, with 1643 nodes and 5089 regulating interactions. The network was configured to have the characteristics of other biological networks. There were 57,942 motifs marked with gastric cancer-related genes out of a total of 69,492 motifs, and 264 motifs were selected as significantly different motifs by calculating the significant motif difference (SMD) scores. Genes in significantly different motifs were mainly enriched in functions associated with cancer genesis, such as regulation of cell death, amino acid phosphorylation of proteins, and intracellular signaling cascades. The top five significantly different motifs were mainly cascade and positive feedback types. Almost all genes in the five motifs were cancer related, including EPOR, MAPK14, BCL2L1, KRT18, PTPN6, CASP3, TGFBR2, AR, and CASP7. The development of cancer might be curbed by inhibiting signal transductions upstream and downstream of the selected motifs.

  4. Archaeal “Dark Matter” and the Origin of Eukaryotes

    Science.gov (United States)

    Williams, Tom A.; Embley, T. Martin

    2014-01-01

    Current hypotheses about the history of cellular life are mainly based on analyses of cultivated organisms, but these represent only a small fraction of extant biodiversity. The sequencing of new environmental lineages therefore provides an opportunity to test, revise, or reject existing ideas about the tree of life and the origin of eukaryotes. According to the textbook three domains hypothesis, the eukaryotes emerge as the sister group to a monophyletic Archaea. However, recent analyses incorporating better phylogenetic models and an improved sampling of the archaeal domain have generally supported the competing eocyte hypothesis, in which core genes of eukaryotic cells originated from within the Archaea, with important implications for eukaryogenesis. Given this trend, it was surprising that a recent analysis incorporating new genomes from uncultivated Archaea recovered a strongly supported three domains tree. Here, we show that this result was due in part to the use of a poorly fitting phylogenetic model and also to the inclusion by an automated pipeline of genes of putative bacterial origin rather than nucleocytosolic versions for some of the eukaryotes analyzed. When these issues were resolved, analyses including the new archaeal lineages placed core eukaryotic genes within the Archaea. These results are consistent with a number of recent studies in which improved archaeal sampling and better phylogenetic models agree in supporting the eocyte tree over the three domains hypothesis. PMID:24532674

  5. Armadillo motifs involved in vesicular transport.

    Directory of Open Access Journals (Sweden)

    Harald Striegl

    Full Text Available Armadillo (ARM repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM-motifs for reasons of clarity and nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and show lack of evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.

  6. Discriminative motif discovery via simulated evolution and random under-sampling.

    Directory of Open Access Journals (Sweden)

    Tao Song

    Full Text Available Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  7. Discriminative motif discovery via simulated evolution and random under-sampling.

    Science.gov (United States)

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  8. Improved i-motif thermal stability by insertion of anthraquinone monomers

    DEFF Research Database (Denmark)

    Gouda, Alaa S; Amine, Mahasen S.; Pedersen, Erik Bjerregaard

    2017-01-01

    In order to gain insight into how to improve thermal stability of i-motifs when used in the context of biomedical and nanotechnological applications, novel anthraquinone-modified i-motifs were synthesized by insertion of 1,8-, 1,4-, 1,5- and 2,6-disubstituted anthraquinone monomers into the TAA...... loops of a 22mer cytosine-rich human telomeric DNA sequence. The influence of the four anthraquinone linkers on the i-motif thermal stability was investigated at 295 nm and pH 5.5. Anthraquinone monomers modulate the i-motif stability in a position-depending manner and the modulation also depends...... unlocked nucleic acid monomers or twisted intercalating nucleic acid. The 2,6-disubstituted anthraquinone linker replacing T10 enabled a significant increase of i-motif thermal melting by 8.2 °C. A substantial increase of 5.0 °C in i-motif thermal melting was recorded when both A6 and T16 were modified...

  9. DNA mismatch repair and its many roles in eukaryotic cells

    DEFF Research Database (Denmark)

    Liu, Dekang; Keijzers, Guido; Rasmussen, Lene Juel

    2017-01-01

    in the clinic, and as a biomarker of cancer susceptibility in animal model systems. Prokaryotic MMR is well-characterized at the molecular and mechanistic level; however, MMR is considerably more complex in eukaryotic cells than in prokaryotic cells, and in recent years, it has become evident that MMR plays...... novel roles in eukaryotic cells, several of which are not yet well-defined or understood. Many MMR-deficient human cancer cells lack mutations in known human MMR genes, which strongly suggests that essential eukaryotic MMR components/cofactors remain unidentified and uncharacterized. Furthermore......, the mechanism by which the eukaryotic MMR machinery discriminates between the parental (template) and the daughter (nascent) DNA strand is incompletely understood and how cells choose between the EXO1-dependent and the EXO1–independent subpathways of MMR is not known. This review summarizes recent literature...

  10. Genome-wide Purification of Extrachromosomal Circular DNA from Eukaryotic Cells

    DEFF Research Database (Denmark)

    Møller, Henrik D.; Bojsen, Rasmus Kenneth; Tachibana, Chris

    2016-01-01

    Extrachromosomal circular DNAs (eccDNAs) are common genetic elements in Saccharomyces cerevisiae and are reported in other eukaryotes as well. EccDNAs contribute to genetic variation among somatic cells in multicellular organisms and to evolution of unicellular eukaryotes. Sensitive methods...

  11. Positional bias of general and tissue-specific regulatory motifs in mouse gene promoters

    Directory of Open Access Journals (Sweden)

    Farré Domènec

    2007-12-01

    Full Text Available Abstract Background The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes. Results We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters. Conclusion The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.

  12. DNA to DNA transcription might exist in eukaryotic cells

    OpenAIRE

    Li, Gao-De

    2016-01-01

    Till now, in biological sciences, the term, transcription, mainly refers to DNA to RNA transcription. But our recently published experimental findings obtained from Plasmodium falciparum strongly suggest the existence of DNA to DNA transcription in the genome of eukaryotic cells, which could shed some light on the functions of certain noncoding DNA in the human and other eukaryotic genomes.

  13. Computational analyses of synergism in small molecular network motifs.

    Directory of Open Access Journals (Sweden)

    Yili Zhang

    2014-03-01

    Full Text Available Cellular functions and responses to stimuli are controlled by complex regulatory networks that comprise a large diversity of molecular components and their interactions. However, achieving an intuitive understanding of the dynamical properties and responses to stimuli of these networks is hampered by their large scale and complexity. To address this issue, analyses of regulatory networks often focus on reduced models that depict distinct, reoccurring connectivity patterns referred to as motifs. Previous modeling studies have begun to characterize the dynamics of small motifs, and to describe ways in which variations in parameters affect their responses to stimuli. The present study investigates how variations in pairs of parameters affect responses in a series of ten common network motifs, identifying concurrent variations that act synergistically (or antagonistically to alter the responses of the motifs to stimuli. Synergism (or antagonism was quantified using degrees of nonlinear blending and additive synergism. Simulations identified concurrent variations that maximized synergism, and examined the ways in which it was affected by stimulus protocols and the architecture of a motif. Only a subset of architectures exhibited synergism following paired changes in parameters. The approach was then applied to a model describing interlocked feedback loops governing the synthesis of the CREB1 and CREB2 transcription factors. The effects of motifs on synergism for this biologically realistic model were consistent with those for the abstract models of single motifs. These results have implications for the rational design of combination drug therapies with the potential for synergistic interactions.

  14. Genome-wide Purification of Extrachromosomal Circular DNA from Eukaryotic Cells

    DEFF Research Database (Denmark)

    Møller, Henrik D.; Bojsen, Rasmus Kenneth; Tachibana, Chris

    2016-01-01

    Extrachromosomal circular DNAs (eccDNAs) are common genetic elements in Saccharomyces cerevisiae and are reported in other eukaryotes as well. EccDNAs contribute to genetic variation among somatic cells in multicellular organisms and to evolution of unicellular eukaryotes. Sensitive methods for d...

  15. Methods and statistics for combining motif match scores.

    Science.gov (United States)

    Bailey, T L; Gribskov, M

    1998-01-01

    Position-specific scoring matrices are useful for representing and searching for protein sequence motifs. A sequence family can often be described by a group of one or more motifs, and an effective search must combine the scores for matching a sequence to each of the motifs in the group. We describe three methods for combining match scores and estimating the statistical significance of the combined scores and evaluate the search quality (classification accuracy) and the accuracy of the estimate of statistical significance of each. The three methods are: 1) sum of scores, 2) sum of reduced variates, 3) product of score p-values. We show that method 3) is superior to the other two methods in both regards, and that combining motif scores indeed gives better search accuracy. The MAST sequence homology search algorithm utilizing the product of p-values scoring method is available for interactive use and downloading at URL http:/(/)www.sdsc.edu/MEME.

  16. MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

    Science.gov (United States)

    Ozaki, Haruka; Iwasaki, Wataru

    2016-08-01

    As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  17. Eukaryotic systematics: a user's guide for cell biologists and parasitologists.

    Science.gov (United States)

    Walker, Giselle; Dorrell, Richard G; Schlacht, Alexander; Dacks, Joel B

    2011-11-01

    Single-celled parasites like Entamoeba, Trypanosoma, Phytophthora and Plasmodium wreak untold havoc on human habitat and health. Understanding the position of the various protistan pathogens in the larger context of eukaryotic diversity informs our study of how these parasites operate on a cellular level, as well as how they have evolved. Here, we review the literature that has brought our understanding of eukaryotic relationships from an idea of parasites as primitive cells to a crystallized view of diversity that encompasses 6 major divisions, or supergroups, of eukaryotes. We provide an updated taxonomic scheme (for 2011), based on extensive genomic, ultrastructural and phylogenetic evidence, with three differing levels of taxonomic detail for ease of referencing and accessibility (see supplementary material at Cambridge Journals On-line). Two of the most pressing issues in cellular evolution, the root of the eukaryotic tree and the evolution of photosynthesis in complex algae, are also discussed along with ideas about what the new generation of genome sequencing technologies may contribute to the field of eukaryotic systematics. We hope that, armed with this user's guide, cell biologists and parasitologists will be encouraged about taking an increasingly evolutionary point of view in the battle against parasites representing real dangers to our livelihoods and lives.

  18. Structural studies demonstrating a bacteriophage-like replication cycle of the eukaryote-infecting Paramecium bursaria chlorella virus-1.

    Directory of Open Access Journals (Sweden)

    Elad Milrot

    2017-08-01

    Full Text Available A fundamental stage in viral infection is the internalization of viral genomes in host cells. Although extensively studied, the mechanisms and factors responsible for the genome internalization process remain poorly understood. Here we report our observations, derived from diverse imaging methods on genome internalization of the large dsDNA Paramecium bursaria chlorella virus-1 (PBCV-1. Our studies reveal that early infection stages of this eukaryotic-infecting virus occurs by a bacteriophage-like pathway, whereby PBCV-1 generates a hole in the host cell wall and ejects its dsDNA genome in a linear, base-pair-by-base-pair process, through a membrane tunnel generated by the fusion of the virus internal membrane with the host membrane. Furthermore, our results imply that PBCV-1 DNA condensation that occurs shortly after infection probably plays a role in genome internalization, as hypothesized for the infection of some bacteriophages. The subsequent perforation of the host photosynthetic membranes presumably enables trafficking of viral genomes towards host nuclei. Previous studies established that at late infection stages PBCV-1 generates cytoplasmic organelles, termed viral factories, where viral assembly takes place, a feature characteristic of many large dsDNA viruses that infect eukaryotic organisms. PBCV-1 thus appears to combine a bacteriophage-like mechanism during early infection stages with a eukaryotic-like infection pathway in its late replication cycle.

  19. Origin and evolution of the self-organizing cytoskeleton in the network of eukaryotic organelles.

    Science.gov (United States)

    Jékely, Gáspár

    2014-09-02

    The eukaryotic cytoskeleton evolved from prokaryotic cytomotive filaments. Prokaryotic filament systems show bewildering structural and dynamic complexity and, in many aspects, prefigure the self-organizing properties of the eukaryotic cytoskeleton. Here, the dynamic properties of the prokaryotic and eukaryotic cytoskeleton are compared, and how these relate to function and evolution of organellar networks is discussed. The evolution of new aspects of filament dynamics in eukaryotes, including severing and branching, and the advent of molecular motors converted the eukaryotic cytoskeleton into a self-organizing "active gel," the dynamics of which can only be described with computational models. Advances in modeling and comparative genomics hold promise of a better understanding of the evolution of the self-organizing cytoskeleton in early eukaryotes, and its role in the evolution of novel eukaryotic functions, such as amoeboid motility, mitosis, and ciliary swimming. Copyright © 2014 Cold Spring Harbor Laboratory Press; all rights reserved.

  20. Gonococcal attachment to eukaryotic cells

    International Nuclear Information System (INIS)

    James, J.F.; Lammel, C.J.; Draper, D.L.; Brown, D.A.; Sweet, R.L.; Brooks, G.F.

    1983-01-01

    The attachment of Neisseria gonorrhoeae to eukaryotic cells grown in tissue culture was analyzed by use of light and electron microscopy and by labeling of the bacteria with [ 3 H]- and [ 14 C]adenine. Isogenic piliated and nonpiliated N. gonorrhoeae from opaque and transparent colonies were studied. The results of light microscopy studies showed that the gonococci attached to cells of human origin, including Flow 2000, HeLa 229, and HEp 2. Studies using radiolabeled gonococci gave comparable results. Piliated N. gonorrhoeae usually attached in larger numbers than nonpiliated organisms, and those from opaque colonies attached more often than isogenic variants from transparent colonies. Day-to-day variation in rate of attachment was observed. Scanning electron microscopy studies showed the gonococcal attachment to be specific for microvilli of the host cells. It is concluded that more N. gonorrhoeae from opaque colonies, as compared with isogenic variants from transparent colonies, attach to eukaryotic cells grown in tissue culture

  1. Low-dimensional morphospace of topological motifs in human fMRI brain networks

    Directory of Open Access Journals (Sweden)

    Sarah E. Morgan

    2018-06-01

    Full Text Available We present a low-dimensional morphospace of fMRI brain networks, where axes are defined in a data-driven manner based on the network motifs. The morphospace allows us to identify the key variations in healthy fMRI networks in terms of their underlying motifs, and we observe that two principal components (PCs can account for 97% of the motif variability. The first PC of the motif distribution is correlated with efficiency and inversely correlated with transitivity. Hence this axis approximately conforms to the well-known economical small-world trade-off between integration and segregation in brain networks. Finally, we show that the economical clustering generative model proposed by Vértes et al. (2012 can approximately reproduce the motif morphospace of the real fMRI brain networks, in contrast to other generative models. Overall, the motif morphospace provides a powerful way to visualize the relationships between network properties and to investigate generative or constraining factors in the formation of complex human brain functional networks. Motifs have been described as the building blocks of complex networks. Meanwhile, a morphospace allows networks to be placed in a common space and can reveal the relationships between different network properties and elucidate the driving forces behind network topology. We combine the concepts of motifs and morphospaces to create the first motif morphospace of fMRI brain networks. Crucially, the morphospace axes are defined by the motifs, in a data-driven manner. We observe strong correlations between the networks’ positions in morphospace and their global topological properties, suggesting that motif morphospaces are a powerful way to capture the topology of networks in a low-dimensional space and to compare generative models of brain networks. Motif morphospaces could also be used to study other complex networks’ topologies.

  2. Memetic algorithms for de novo motif-finding in biomedical sequences.

    Science.gov (United States)

    Bi, Chengpeng

    2012-09-01

    The objectives of this study are to design and implement a new memetic algorithm for de novo motif discovery, which is then applied to detect important signals hidden in various biomedical molecular sequences. In this paper, memetic algorithms are developed and tested in de novo motif-finding problems. Several strategies in the algorithm design are employed that are to not only efficiently explore the multiple sequence local alignment space, but also effectively uncover the molecular signals. As a result, there are a number of key features in the implementation of the memetic motif-finding algorithm (MaMotif), including a chromosome replacement operator, a chromosome alteration-aware local search operator, a truncated local search strategy, and a stochastic operation of local search imposed on individual learning. To test the new algorithm, we compare MaMotif with a few of other similar algorithms using simulated and experimental data including genomic DNA, primary microRNA sequences (let-7 family), and transmembrane protein sequences. The new memetic motif-finding algorithm is successfully implemented in C++, and exhaustively tested with various simulated and real biological sequences. In the simulation, it shows that MaMotif is the most time-efficient algorithm compared with others, that is, it runs 2 times faster than the expectation maximization (EM) method and 16 times faster than the genetic algorithm-based EM hybrid. In both simulated and experimental testing, results show that the new algorithm is compared favorably or superior to other algorithms. Notably, MaMotif is able to successfully discover the transcription factors' binding sites in the chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) data, correctly uncover the RNA splicing signals in gene expression, and precisely find the highly conserved helix motif in the transmembrane protein sequences, as well as rightly detect the palindromic segments in the primary micro

  3. PDZ binding motif of HTLV-1 Tax promotes virus-mediated T-cell proliferation in vitro and persistence in vivo.

    Science.gov (United States)

    Xie, Li; Yamamoto, Brenda; Haoudi, Abdelali; Semmes, O John; Green, Patrick L

    2006-03-01

    HTLV-1 cellular transformation and disease induction is dependent on expression of the viral Tax oncoprotein. PDZ is a modular protein interaction domain used in organizing signaling complexes in eukaryotic cells through recognition of a specific binding motif in partner proteins. Tax-1, but not Tax-2, contains a PDZ-binding domain motif (PBM) that promotes the interaction with several cellular PDZ proteins. Herein, we investigate the contribution of the Tax-1 PBM in HTLV-induced proliferation and immortalization of primary T cells in vitro and viral survival in an infectious rabbit animal model. We generated several HTLV-1 and HTLV-2 Tax viral mutants, including HTLV-1deltaPBM, HTLV-2+C22(+PBM), and HTLV-2+ C18(deltaPBM). All Tax mutants maintained the ability to significantly activate the CREB/ATF or NFkappaB signaling pathways. Microtiter proliferation assays revealed that the Tax-1 PBM significantly increases both HTLV-1- and HTLV-2-induced primary T-cell proliferation. In addition, Tax-1 PBM was responsible for the micronuclei induction activity of Tax-1 relative to that of Tax-2. Viral infection and persistence were severely attenuated in rabbits inoculated with HTLV-1deltaPBM. Our results provide the first direct evidence suggesting that PBM-mediated associations between Tax-1 and cellular proteins play a key role in HTLV-induced cell proliferation and genetic instability in vitro and facilitate viral persistence in vivo.

  4. Gene Transfer in Eukaryotic Cells Using Activated Dendrimers

    Science.gov (United States)

    Dennig, Jörg

    Gene transfer into eukaryotic cells plays an important role in cell biology. Over the last 30 years a number of transfection methods have been developed to mediate gene transfer into eukaryotic cells. Classical methods include co-precipitation of DNA with calcium phosphate, charge-dependent precipitation of DNA with DEAE-dextran, electroporation of nucleic acids, and formation of transfection complexes between DNA and cationic liposomes. Gene transfer technologies based on activated PAMAM-dendrimers provide another class of transfection reagents. PAMAM-dendrimers are highly branched, spherical molecules. Activation of newly synthesized dendrimers involves hydrolytic removal of some of the branches, and results in a molecule with a higher degree of flexibility. Activated dendrimers assemble DNA into compact structures via charge interactions. Activated dendrimer - DNA complexes bind to the cell membrane of eukaryotic cells, and are transported into the cell by non-specific endocytosis. A structural model of the activated dendrimer - DNA complex and a potential mechanism for its uptake into cells will be discussed.

  5. Discovering Motifs in Biological Sequences Using the Micron Automata Processor.

    Science.gov (United States)

    Roy, Indranil; Aluru, Srinivas

    2016-01-01

    Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26,11). We propose a novel algorithm for the (l,d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the micron automata processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39,18) and (40,17). The paper serves as a useful guide to solving problems using this new accelerator technology.

  6. PISMA: A Visual Representation of Motif Distribution in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Rogelio Alcántara-Silva

    2017-03-01

    Full Text Available Background: Because the graphical presentation and analysis of motif distribution can provide insights for experimental hypothesis, PISMA aims at identifying motifs on DNA sequences, counting and showing them graphically. The motif length ranges from 2 to 10 bases, and the DNA sequences range up to 10 kb. The motif distribution is shown as a bar-code–like, as a gene-map–like, and as a transcript scheme. Results: We obtained graphical schemes of the CpG site distribution from 91 human papillomavirus genomes. Also, we present 2 analyses: one of DNA motifs associated with either methylation-resistant or methylation-sensitive CpG islands and another analysis of motifs associated with exosome RNA secretion. Availability and Implementation: PISMA is developed in Java; it is executable in any type of hardware and in diverse operating systems. PISMA is freely available to noncommercial users. The English version and the User Manual are provided in Supplementary Files 1 and 2, and a Spanish version is available at www.biomedicas.unam.mx/wp-content/software/pisma.zip and www.biomedicas.unam.mx/wp-content/pdf/manual/pisma.pdf .

  7. Aggregation of topological motifs in the Escherichia coli transcriptional regulatory network

    Directory of Open Access Journals (Sweden)

    Barabási Albert-László

    2004-01-01

    Full Text Available Abstract Background Transcriptional regulation of cellular functions is carried out through a complex network of interactions among transcription factors and the promoter regions of genes and operons regulated by them.To better understand the system-level function of such networks simplification of their architecture was previously achieved by identifying the motifs present in the network, which are small, overrepresented, topologically distinct regulatory interaction patterns (subgraphs. However, the interaction of such motifs with each other, and their form of integration into the full network has not been previously examined. Results By studying the transcriptional regulatory network of the bacterium, Escherichia coli, we demonstrate that the two previously identified motif types in the network (i.e., feed-forward loops and bi-fan motifs do not exist in isolation, but rather aggregate into homologous motif clusters that largely overlap with known biological functions. Moreover, these clusters further coalesce into a supercluster, thus establishing distinct topological hierarchies that show global statistical properties similar to the whole network. Targeted removal of motif links disintegrates the network into small, isolated clusters, while random disruptions of equal number of links do not cause such an effect. Conclusion Individual motifs aggregate into homologous motif clusters and a supercluster forming the backbone of the E. coli transcriptional regulatory network and play a central role in defining its global topological organization.

  8. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16, 384 cores on a supercomputer. Copyright is held by the owner/author(s).

  9. MODA: an efficient algorithm for network motif discovery in biological networks.

    Science.gov (United States)

    Omidi, Saeed; Schreiber, Falk; Masoudi-Nejad, Ali

    2009-10-01

    In recent years, interest has been growing in the study of complex networks. Since Erdös and Rényi (1960) proposed their random graph model about 50 years ago, many researchers have investigated and shaped this field. Many indicators have been proposed to assess the global features of networks. Recently, an active research area has developed in studying local features named motifs as the building blocks of networks. Unfortunately, network motif discovery is a computationally hard problem and finding rather large motifs (larger than 8 nodes) by means of current algorithms is impractical as it demands too much computational effort. In this paper, we present a new algorithm (MODA) that incorporates techniques such as a pattern growth approach for extracting larger motifs efficiently. We have tested our algorithm and found it able to identify larger motifs with more than 8 nodes more efficiently than most of the current state-of-the-art motif discovery algorithms. While most of the algorithms rely on induced subgraphs as motifs of the networks, MODA is able to extract both induced and non-induced subgraphs simultaneously. The MODA source code is freely available at: http://LBB.ut.ac.ir/Download/LBBsoft/MODA/

  10. Patterns of intron gain and conservation in eukaryotic genes

    Directory of Open Access Journals (Sweden)

    Wolf Yuri I

    2007-10-01

    Full Text Available Abstract Background: The presence of introns in protein-coding genes is a universal feature of eukaryotic genome organization, and the genes of multicellular eukaryotes, typically, contain multiple introns, a substantial fraction of which share position in distant taxa, such as plants and animals. Depending on the methods and data sets used, researchers have reached opposite conclusions on the causes of the high fraction of shared introns in orthologous genes from distant eukaryotes. Some studies conclude that shared intron positions reflect, almost entirely, a remarkable evolutionary conservation, whereas others attribute it to parallel gain of introns. To resolve these contradictions, it is crucial to analyze the evolution of introns by using a model that minimally relies on arbitrary assumptions. Results: We developed a probabilistic model of evolution that allows for variability of intron gain and loss rates over branches of the phylogenetic tree, individual genes, and individual sites. Applying this model to an extended set of conserved eukaryotic genes, we find that parallel gain, on average, accounts for only ~8% of the shared intron positions. However, the distribution of parallel gains over the phylogenetic tree of eukaryotes is highly non-uniform. There are, practically, no parallel gains in closely related lineages, whereas for distant lineages, such as animals and plants, parallel gains appear to contribute up to 20% of the shared intron positions. In accord with these findings, we estimated that ancestral introns have a high probability to be retained in extant genomes, and conversely, that a substantial fraction of extant introns have retained their positions since the early stages of eukaryotic evolution. In addition, the density of sites that are available for intron insertion is estimated to be, approximately, one in seven basepairs. Conclusion: We obtained robust estimates of the contribution of parallel gain to the observed

  11. Dynamic motifs in socio-economic networks

    Science.gov (United States)

    Zhang, Xin; Shao, Shuai; Stanley, H. Eugene; Havlin, Shlomo

    2014-12-01

    Socio-economic networks are of central importance in economic life. We develop a method of identifying and studying motifs in socio-economic networks by focusing on “dynamic motifs,” i.e., evolutionary connection patterns that, because of “node acquaintances” in the network, occur much more frequently than random patterns. We examine two evolving bi-partite networks: i) the world-wide commercial ship chartering market and ii) the ship build-to-order market. We find similar dynamic motifs in both bipartite networks, even though they describe different economic activities. We also find that “influence” and “persistence” are strong factors in the interaction behavior of organizations. When two companies are doing business with the same customer, it is highly probable that another customer who currently only has business relationship with one of these two companies, will become customer of the second in the future. This is the effect of influence. Persistence means that companies with close business ties to customers tend to maintain their relationships over a long period of time.

  12. Assessing local structure motifs using order parameters for motif recognition, interstitial identification, and diffusion path characterization

    Science.gov (United States)

    Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; Haranczyk, Maciej

    2017-11-01

    Structure-property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal closed packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  13. Eelgrass Leaf Surface Microbiomes Are Locally Variable and Highly Correlated with Epibiotic Eukaryotes

    Directory of Open Access Journals (Sweden)

    Mia M. Bengtsson

    2017-07-01

    Full Text Available Eelgrass (Zostera marina is a marine foundation species essential for coastal ecosystem services around the northern hemisphere. Like all macroscopic organisms, it possesses a microbiome (here defined as an associated prokaryotic community which may play critical roles in modulating the interaction of eelgrass with its environment. For example, its leaf surface microbiome could inhibit or attract eukaryotic epibionts which may overgrow the eelgrass leading to reduced primary productivity and subsequent eelgrass meadow decline. We used amplicon sequencing of the 16S and 18S rRNA genes of prokaryotes and eukaryotes to assess the leaf surface microbiome (prokaryotes as well as eukaryotic epibionts in- and outside lagoons on the German Baltic Sea coast. Prokaryote microbiomes varied substantially both between sites inside lagoons and between open coastal and lagoon sites. Water depth, leaf area and biofilm chlorophyll a concentration explained a large amount of variation in both prokaryotic and eukaryotic community composition. The prokaryotic microbiome and eukaryotic epibiont communities were highly correlated, and network analysis revealed disproportionate co-occurrence between a limited number of eukaryotic taxa and several bacterial taxa. This suggests that eelgrass leaf surfaces are home to a mosaic of microbiomes of several epibiotic eukaryotes, in addition to the microbiome of the eelgrass itself. Our findings thereby underline that eukaryotic diversity should be taken into account in order to explain prokaryotic microbiome assembly and dynamics in aquatic environments.

  14. Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

    Science.gov (United States)

    Kinjo, Akira R.; Nakamura, Haruki

    2012-01-01

    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478

  15. Probing structural changes of self assembled i-motif DNA

    KAUST Repository

    Lee, Iljoon; Patil, Sachin; Fhayli, Karim; Alsaiari, Shahad K.; Khashab, Niveen M.

    2015-01-01

    We report an i-motif structural probing system based on Thioflavin T (ThT) as a fluorescent sensor. This probe can discriminate the structural changes of RET and Rb i-motif sequences according to pH change. This journal is

  16. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    Directory of Open Access Journals (Sweden)

    Down Thomas A

    2010-09-01

    Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.

  17. Use of prokaryotic transcriptional activators as metabolite biosensors in eukaryotic cells

    DEFF Research Database (Denmark)

    2018-01-01

    The present invention relates to the use of transcriptional activators from prokaryotic organisms for use in eukaryotic cells, such as yeast as sensors of intracellular and extracellular accumulation of a ligand or metabolite specifically activating this transcriptional activator in a eukaryot...

  18. BlockLogo: Visualization of peptide and sequence motif conservation

    DEFF Research Database (Denmark)

    Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian

    2013-01-01

    BlockLogo is a web-server application for the visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, se...

  19. Inorganic phosphate uptake in unicellular eukaryotes.

    Science.gov (United States)

    Dick, Claudia F; Dos-Santos, André L A; Meyer-Fernandes, José R

    2014-07-01

    Inorganic phosphate (Pi) is an essential nutrient for all organisms. The route of Pi utilization begins with Pi transport across the plasma membrane. Here, we analyzed the gene sequences and compared the biochemical profiles, including kinetic and modulator parameters, of Pi transporters in unicellular eukaryotes. The objective of this review is to evaluate the recent findings regarding Pi uptake mechanisms in microorganisms, such as the fungi Neurospora crassa and Saccharomyces cerevisiae and the parasite protozoans Trypanosoma cruzi, Trypanosoma rangeli, Leishmania infantum and Plasmodium falciparum. Pi uptake is the key step of Pi homeostasis and in the subsequent signaling event in eukaryotic microorganisms. Biochemical and structural studies are important for clarifying mechanisms of Pi homeostasis, as well as Pi sensor and downstream pathways, and raise possibilities for future studies in this field. Copyright © 2014 Elsevier B.V. All rights reserved.

  20. Identification of the Raptor-binding motif on Arabidopsis S6 kinase and its use as a TOR signaling suppressor.

    Science.gov (United States)

    Son, Ora; Kim, Sunghan; Hur, Yoon-Sun; Cheon, Choong-Ill

    2016-03-25

    TOR (target of rapamycin) kinase signaling plays central role as a regulator of growth and proliferation in all eukaryotic cells and its key signaling components and effectors are also conserved in plants. Unlike the mammalian and yeast counterparts, however, we found through yeast two-hybrid analysis that multiple regions of the Arabidopsis Raptor (regulatory associated protein of TOR) are required for binding to its substrate. We also identified that a 44-amino acid region at the N-terminal end of Arabidopsis ribosomal S6 kinase 1 (AtS6K1) specifically interacted with AtRaptor1, indicating that this region may contain a functional equivalent of the TOS (TOR-Signaling) motif present in the mammalian TOR substrates. Transient over-expression of this 44-amino acid fragment in Arabidopsis protoplasts resulted in significant decrease in rDNA transcription, demonstrating a feasibility of developing a new plant-specific TOR signaling inhibitor based upon perturbation of the Raptor-substrate interaction. Copyright © 2016 Elsevier Inc. All rights reserved.

  1. Genome Analysis of Conserved Dehydrin Motifs in Vascular Plants

    Directory of Open Access Journals (Sweden)

    Ahmad A. Malik

    2017-05-01

    Full Text Available Dehydrins, a large family of abiotic stress proteins, are defined by the presence of a mostly conserved motif known as the K-segment, and may also contain two other conserved motifs known as the Y-segment and S-segment. Using the dehydrin literature, we developed a sequence motif definition of the K-segment, which we used to create a large dataset of dehydrin sequences by searching the Pfam00257 dehydrin dataset and the Phytozome 10 sequences of vascular plants. A comprehensive analysis of these sequences reveals that lysine residues are highly conserved in the K-segment, while the amino acid type is often conserved at other positions. Despite the Y-segment name, the central tyrosine is somewhat conserved, but can be substituted with two other small aromatic amino acids (phenylalanine or histidine. The S-segment contains a series of serine residues, but in some proteins is also preceded by a conserved LHR sequence. In many dehydrins containing all three of these motifs the S-segment is linked to the K-segment by a GXGGRRKK motif (where X can be any amino acid, suggesting a functional linkage between these two motifs. An analysis of the sequences shows that the dehydrin architecture and several biochemical properties (isoelectric point, molecular mass, and hydrophobicity score are dependent on each other, and that some dehydrin architectures are overexpressed during certain abiotic stress, suggesting that they may be optimized for a specific abiotic stress while others are involved in all forms of dehydration stress (drought, cold, and salinity.

  2. Uniting sex and eukaryote origins in an emerging oxygenic world.

    Science.gov (United States)

    Gross, Jeferson; Bhattacharya, Debashish

    2010-08-23

    Theories about eukaryote origins (eukaryogenesis) need to provide unified explanations for the emergence of diverse complex features that define this lineage. Models that propose a prokaryote-to-eukaryote transition are gridlocked between the opposing "phagocytosis first" and "mitochondria as seed" paradigms, neither of which fully explain the origins of eukaryote cell complexity. Sex (outcrossing with meiosis) is an example of an elaborate trait not yet satisfactorily addressed in theories about eukaryogenesis. The ancestral nature of meiosis and its dependence on eukaryote cell biology suggest that the emergence of sex and eukaryogenesis were simultaneous and synergic and may be explained by a common selective pressure. We propose that a local rise in oxygen levels, due to cyanobacterial photosynthesis in ancient Archean microenvironments, was highly toxic to the surrounding biota. This selective pressure drove the transformation of an archaeal (archaebacterial) lineage into the first eukaryotes. Key is that oxygen might have acted in synergy with environmental stresses such as ultraviolet (UV) radiation and/or desiccation that resulted in the accumulation of reactive oxygen species (ROS). The emergence of eukaryote features such as the endomembrane system and acquisition of the mitochondrion are posited as strategies to cope with a metabolic crisis in the cell plasma membrane and the accumulation of ROS, respectively. Selective pressure for efficient repair of ROS/UV-damaged DNA drove the evolution of sex, which required cell-cell fusions, cytoskeleton-mediated chromosome movement, and emergence of the nuclear envelope. Our model implies that evolution of sex and eukaryogenesis were inseparable processes. Several types of data can be used to test our hypothesis. These include paleontological predictions, simulation of ancient oxygenic microenvironments, and cell biological experiments with Archaea exposed to ROS and UV stresses. Studies of archaeal conjugation

  3. RegRNA: an integrated web server for identifying regulatory RNA motifs and elements

    OpenAIRE

    Huang, Hsi-Yuan; Chien, Chia-Hung; Jen, Kuan-Hua; Huang, Hsien-Da

    2006-01-01

    Numerous regulatory structural motifs have been identified as playing essential roles in transcriptional and post-transcriptional regulation of gene expression. RegRNA is an integrated web server for identifying the homologs of regulatory RNA motifs and elements against an input mRNA sequence. Both sequence homologs and structural homologs of regulatory RNA motifs can be recognized. The regulatory RNA motifs supported in RegRNA are categorized into several classes: (i) motifs in mRNA 5′-untra...

  4. Sex is a ubiquitous, ancient, and inherent attribute of eukaryotic life

    NARCIS (Netherlands)

    Speijer, Dave; Lukeš, Julius; Eliáš, Marek

    2015-01-01

    Sexual reproduction and clonality in eukaryotes are mostly seen as exclusive, the latter being rather exceptional. This view might be biased by focusing almost exclusively on metazoans. We analyze and discuss reproduction in the context of extant eukaryotic diversity, paying special attention to

  5. A novel RNA-recognition-motif protein is required for premeiotic G1/S-phase transition in rice (Oryza sativa L..

    Directory of Open Access Journals (Sweden)

    Ken-Ichi Nonomura

    2011-01-01

    Full Text Available The molecular mechanism for meiotic entry remains largely elusive in flowering plants. Only Arabidopsis SWI1/DYAD and maize AM1, both of which are the coiled-coil protein, are known to be required for the initiation of plant meiosis. The mechanism underlying the synchrony of male meiosis, characteristic to flowering plants, has also been unclear in the plant kingdom. In other eukaryotes, RNA-recognition-motif (RRM proteins are known to play essential roles in germ-cell development and meiosis progression. Rice MEL2 protein discovered in this study shows partial similarity with human proline-rich RRM protein, deleted in Azoospermia-Associated Protein1 (DAZAP1, though MEL2 also possesses ankyrin repeats and a RING finger motif. Expression analyses of several cell-cycle markers revealed that, in mel2 mutant anthers, most germ cells failed to enter premeiotic S-phase and meiosis, and a part escaped from the defect and underwent meiosis with a significant delay or continued mitotic cycles. Immunofluorescent detection revealed that T7 peptide-tagged MEL2 localized at cytoplasmic perinuclear region of germ cells during premeiotic interphase in transgenic rice plants. This study is the first report of the plant RRM protein, which is required for regulating the premeiotic G1/S-phase transition of male and female germ cells and also establishing synchrony of male meiosis. This study will contribute to elucidation of similarities and diversities in reproduction system between plants and other species.

  6. Enzymes from Higher Eukaryotes for Industrial Biocatalysis

    Directory of Open Access Journals (Sweden)

    Zhibin Liu

    2004-01-01

    Full Text Available The industrial production of fine chemicals, feed and food ingredients, pharmaceuticals, agrochemicals and their respective intermediates relies on an increasing application of biocatalysis, i.e. on enzyme or whole-cell catalyzed conversions of molecules. Simple procedures for discovery, cloning and over-expression as well as fast growth favour fungi, yeasts and especially bacteria as sources of biocatalysts. Higher eukaryotes also harbour an almost unlimited number of potential biocatalysts, although to date the limited supply of enzymes, the high heterogeneity of enzyme preparations and the hazard of infectious contaminants keep some interesting candidates out of reach for industrial bioprocesses. In the past only a few animal and plant enzymes from agricultural waste materials were employed in food processing. The use of bacterial expression strains or non-conventional yeasts for the heterologous production of efficient eukaryotic enzymes can overcome the bottleneck in enzyme supply and provide sufficient amounts of homogenous enzyme preparations for reliable and economically feasible applications at large scale. Ideal enzymatic processes represent an environmentally friendly, »near-to-completion« conversion of (mostly non-natural substrates to pure products. Recent developments demonstrate the commercial feasibility of large-scale biocatalytic processes employing enzymes from higher eukaryotes (e.g. plants, animals and also their usefulness in some small-scale industrial applications.

  7. Assessing Local Structure Motifs Using Order Parameters for Motif Recognition, Interstitial Identification, and Diffusion Path Characterization

    Directory of Open Access Journals (Sweden)

    Nils E. R. Zimmermann

    2017-11-01

    Full Text Available Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP database (61,422 compounds for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  8. Evolution of DNA replication protein complexes in eukaryotes and Archaea.

    Directory of Open Access Journals (Sweden)

    Nicholas Chia

    Full Text Available BACKGROUND: The replication of DNA in Archaea and eukaryotes requires several ancillary complexes, including proliferating cell nuclear antigen (PCNA, replication factor C (RFC, and the minichromosome maintenance (MCM complex. Bacterial DNA replication utilizes comparable proteins, but these are distantly related phylogenetically to their archaeal and eukaryotic counterparts at best. METHODOLOGY/PRINCIPAL FINDINGS: While the structures of each of the complexes do not differ significantly between the archaeal and eukaryotic versions thereof, the evolutionary dynamic in the two cases does. The number of subunits in each complex is constant across all taxa. However, they vary subtly with regard to composition. In some taxa the subunits are all identical in sequence, while in others some are homologous rather than identical. In the case of eukaryotes, there is no phylogenetic variation in the makeup of each complex-all appear to derive from a common eukaryotic ancestor. This is not the case in Archaea, where the relationship between the subunits within each complex varies taxon-to-taxon. We have performed a detailed phylogenetic analysis of these relationships in order to better understand the gene duplications and divergences that gave rise to the homologous subunits in Archaea. CONCLUSION/SIGNIFICANCE: This domain level difference in evolution suggests that different forces have driven the evolution of DNA replication proteins in each of these two domains. In addition, the phylogenies of all three gene families support the distinctiveness of the proposed archaeal phylum Thaumarchaeota.

  9. Initiation of translation in bacteria by a structured eukaryotic IRES RNA.

    Science.gov (United States)

    Colussi, Timothy M; Costantino, David A; Zhu, Jianyu; Donohue, John Paul; Korostelev, Andrei A; Jaafar, Zane A; Plank, Terra-Dawn M; Noller, Harry F; Kieft, Jeffrey S

    2015-03-05

    The central dogma of gene expression (DNA to RNA to protein) is universal, but in different domains of life there are fundamental mechanistic differences within this pathway. For example, the canonical molecular signals used to initiate protein synthesis in bacteria and eukaryotes are mutually exclusive. However, the core structures and conformational dynamics of ribosomes that are responsible for the translation steps that take place after initiation are ancient and conserved across the domains of life. We wanted to explore whether an undiscovered RNA-based signal might be able to use these conserved features, bypassing mechanisms specific to each domain of life, and initiate protein synthesis in both bacteria and eukaryotes. Although structured internal ribosome entry site (IRES) RNAs can manipulate ribosomes to initiate translation in eukaryotic cells, an analogous RNA structure-based mechanism has not been observed in bacteria. Here we report our discovery that a eukaryotic viral IRES can initiate translation in live bacteria. We solved the crystal structure of this IRES bound to a bacterial ribosome to 3.8 Å resolution, revealing that despite differences between bacterial and eukaryotic ribosomes this IRES binds directly to both and occupies the space normally used by transfer RNAs. Initiation in both bacteria and eukaryotes depends on the structure of the IRES RNA, but in bacteria this RNA uses a different mechanism that includes a form of ribosome repositioning after initial recruitment. This IRES RNA bridges billions of years of evolutionary divergence and provides an example of an RNA structure-based translation initiation signal capable of operating in two domains of life.

  10. Metabarcoding analysis of eukaryotic microbiota in the gut of HIV-infected patients.

    Directory of Open Access Journals (Sweden)

    Ibrahim Hamad

    Full Text Available Research on the relationship between changes in the gut microbiota and human disease, including AIDS, is a growing field. However, studies on the eukaryotic component of the intestinal microbiota have just begun and have not yet been conducted in HIV-infected patients. Moreover, eukaryotic community profiling is influenced by the use of different methodologies at each step of culture-independent techniques. Herein, initially, four DNA extraction protocols were compared to test the efficiency of each method in recovering eukaryotic DNA from fecal samples. Our results revealed that recovering eukaryotic components from fecal samples differs significantly among DNA extraction methods. Subsequently, the composition of the intestinal eukaryotic microbiota was evaluated in HIV-infected patients and healthy volunteers through clone sequencing, high-throughput sequencing of nuclear ribosomal internal transcribed spacers 1 (ITS1 and 2 (ITS2 amplicons and real-time PCRs. Our results revealed that not only richness (Chao-1 index and alpha diversity (Shannon diversity differ between HIV-infected patients and healthy volunteers, depending on the molecular strategy used, but also the global eukaryotic community composition, with little overlapping taxa found between techniques. Moreover, our results based on cloning libraries and ITS1/ITS2 metabarcoding sequencing showed significant differences in fungal composition between HIV-infected patients and healthy volunteers, but without distinct clusters separating the two groups. Malassezia restricta was significantly more prevalent in fecal samples of HIV-infected patients, according to cloning libraries, whereas operational taxonomic units (OTUs belonging to Candida albicans and Candida tropicalis were significantly more abundant in fecal samples of HIV-infected patients compared to healthy subjects in both ITS subregions. Finally, real-time PCR showed the presence of Microsporidia, Giardia lamblia, Blastocystis

  11. Identification of amino acid residues in protein SRP72 required for binding to a kinked 5e motif of the human signal recognition particle RNA

    Directory of Open Access Journals (Sweden)

    Zwieb Christian

    2010-11-01

    Full Text Available Abstract Background Human cells depend critically on the signal recognition particle (SRP for the sorting and delivery of their proteins. The SRP is a ribonucleoprotein complex which binds to signal sequences of secretory polypeptides as they emerge from the ribosome. Among the six proteins of the eukaryotic SRP, the largest protein, SRP72, is essential for protein targeting and possesses a poorly characterized RNA binding domain. Results We delineated the minimal region of SRP72 capable of forming a stable complex with an SRP RNA fragment. The region encompassed residues 545 to 585 of the full-length human SRP72 and contained a lysine-rich cluster (KKKKKKKKGK at postions 552 to 561 as well as a conserved Pfam motif with the sequence PDPXRWLPXXER at positions 572 to 583. We demonstrated by site-directed mutagenesis that both regions participated in the formation of a complex with the RNA. In agreement with biochemical data and results from chymotryptic digestion experiments, molecular modeling of SRP72 implied that the invariant W577 was located inside the predicted structure of an RNA binding domain. The 11-nucleotide 5e motif contained within the SRP RNA fragment was shown by comparative electrophoresis on native polyacrylamide gels to conform to an RNA kink-turn. The model of the complex suggested that the conserved A240 of the K-turn, previously identified as being essential for the binding to SRP72, could protrude into a groove of the SRP72 RNA binding domain, similar but not identical to how other K-turn recognizing proteins interact with RNA. Conclusions The results from the presented experiments provided insights into the molecular details of a functionally important and structurally interesting RNA-protein interaction. A model for how a ligand binding pocket of SRP72 can accommodate a new RNA K-turn in the 5e region of the eukaryotic SRP RNA is proposed.

  12. RNA recognition motif (RRM)-containing proteins in Bombyx mori

    African Journals Online (AJOL)

    STORAGESEVER

    2009-03-20

    Mar 20, 2009 ... Recognition Motif (RRM), sometimes referred to as. RNP1, is one of the first identified domains for RNA interaction. RRM is very common ..... Apart from the RRM motif, eIF3-S9 has a Trp-Asp. (WD) repeat domain, Poly (A) ...

  13. Genetic exchange in eukaryotes through horizontal transfer: connected by the mobilome.

    Science.gov (United States)

    Wallau, Gabriel Luz; Vieira, Cristina; Loreto, Élgion Lúcio Silva

    2018-01-01

    All living species contain genetic information that was once shared by their common ancestor. DNA is being inherited through generations by vertical transmission (VT) from parents to offspring and from ancestor to descendant species. This process was considered the sole pathway by which biological entities exchange inheritable information. However, Horizontal Transfer (HT), the exchange of genetic information by other means than parents to offspring, was discovered in prokaryotes along with strong evidence showing that it is a very important process by which prokaryotes acquire new genes. For some time now, it has been a scientific consensus that HT events were rare and non-relevant for evolution of eukaryotic species, but there is growing evidence supporting that HT is an important and frequent phenomenon in eukaryotes as well. Here, we will discuss the latest findings regarding HT among eukaryotes, mainly HT of transposons (HTT), establishing HTT once and for all as an important phenomenon that should be taken into consideration to fully understand eukaryotes genome evolution. In addition, we will discuss the latest development methods to detect such events in a broader scale and highlight the new approaches which should be pursued by researchers to fill the knowledge gaps regarding HTT among eukaryotes.

  14. One motif to bind them: A small-XXX-small motif affects transmembrane domain 1 oligomerization, function, localization, and cross-talk between two yeast GPCRs.

    Science.gov (United States)

    Lock, Antonia; Forfar, Rachel; Weston, Cathryn; Bowsher, Leo; Upton, Graham J G; Reynolds, Christopher A; Ladds, Graham; Dixon, Ann M

    2014-12-01

    G protein-coupled receptors (GPCRs) are the largest family of cell-surface receptors in mammals and facilitate a range of physiological responses triggered by a variety of ligands. GPCRs were thought to function as monomers, however it is now accepted that GPCR homo- and hetero-oligomers also exist and influence receptor properties. The Schizosaccharomyces pombe GPCR Mam2 is a pheromone-sensing receptor involved in mating and has previously been shown to form oligomers in vivo. The first transmembrane domain (TMD) of Mam2 contains a small-XXX-small motif, overrepresented in membrane proteins and well-known for promoting helix-helix interactions. An ortholog of Mam2 in Saccharomyces cerevisiae, Ste2, contains an analogous small-XXX-small motif which has been shown to contribute to receptor homo-oligomerization, localization and function. Here we have used experimental and computational techniques to characterize the role of the small-XXX-small motif in function and assembly of Mam2 for the first time. We find that disruption of the motif via mutagenesis leads to reduction of Mam2 TMD1 homo-oligomerization and pheromone-responsive cellular signaling of the full-length protein. It also impairs correct targeting to the plasma membrane. Mutation of the analogous motif in Ste2 yielded similar results, suggesting a conserved mechanism for assembly. Using co-expression of the two fungal receptors in conjunction with computational models, we demonstrate a functional change in G protein specificity and propose that this is brought about through hetero-dimeric interactions of Mam2 with Ste2 via the complementary small-XXX-small motifs. This highlights the potential of these motifs to affect a range of properties that can be investigated in other GPCRs. Copyright © 2014. Published by Elsevier B.V.

  15. Eukaryotic ribosome display with in situ DNA recovery.

    Science.gov (United States)

    He, Mingyue; Edwards, Bryan M; Kastelic, Damjana; Taussig, Michael J

    2012-01-01

    Ribosome display is a cell-free display technology for in vitro selection and optimisation of proteins from large diversified libraries. It operates through the formation of stable protein-ribosome-mRNA (PRM) complexes and selection of ligand-binding proteins, followed by DNA recovery from the selected genetic information. Both prokaryotic and eukaryotic ribosome display systems have been developed. In this chapter, we describe the eukaryotic rabbit reticulocyte method in which a distinct in situ single-primer RT-PCR procedure is used to recover DNA from the selected PRM complexes without the need for prior disruption of the ribosome.

  16. Evolutionarily conserved bias of amino-acid usage refines the definition of PDZ-binding motif

    Directory of Open Access Journals (Sweden)

    Launey Thomas

    2011-06-01

    Full Text Available Abstract Background The interactions between PDZ (PSD-95, Dlg, ZO-1 domains and PDZ-binding motifs play central roles in signal transductions within cells. Proteins with PDZ domains bind to PDZ-binding motifs almost exclusively when the motifs are located at the carboxyl (C- terminal ends of their binding partners. However, it remains little explored whether PDZ-binding motifs show any preferential location at the C-terminal ends of proteins, at genome-level. Results Here, we examined the distribution of the type-I (x-x-S/T-x-I/L/V or type-II (x-x-V-x-I/V PDZ-binding motifs in proteins encoded in the genomes of five different species (human, mouse, zebrafish, fruit fly and nematode. We first established that these PDZ-binding motifs are indeed preferentially present at their C-terminal ends. Moreover, we found specific amino acid (AA bias for the 'x' positions in the motifs at the C-terminal ends. In general, hydrophilic AAs were favored. Our genomics-based findings confirm and largely extend the results of previous interaction-based studies, allowing us to propose refined consensus sequences for all of the examined PDZ-binding motifs. An ontological analysis revealed that the refined motifs are functionally relevant since a large fraction of the proteins bearing the motif appear to be involved in signal transduction. Furthermore, co-precipitation experiments confirmed two new protein interactions predicted by our genomics-based approach. Finally, we show that influenza virus pathogenicity can be correlated with PDZ-binding motif, with high-virulence viral proteins bearing a refined PDZ-binding motif. Conclusions Our refined definition of PDZ-binding motifs should provide important clues for identifying functional PDZ-binding motifs and proteins involved in signal transduction.

  17. Distinct configurations of protein complexes and biochemical pathways revealed by epistatic interaction network motifs

    LENUS (Irish Health Repository)

    Casey, Fergal

    2011-08-22

    Abstract Background Gene and protein interactions are commonly represented as networks, with the genes or proteins comprising the nodes and the relationship between them as edges. Motifs, or small local configurations of edges and nodes that arise repeatedly, can be used to simplify the interpretation of networks. Results We examined triplet motifs in a network of quantitative epistatic genetic relationships, and found a non-random distribution of particular motif classes. Individual motif classes were found to be associated with different functional properties, suggestive of an underlying biological significance. These associations were apparent not only for motif classes, but for individual positions within the motifs. As expected, NNN (all negative) motifs were strongly associated with previously reported genetic (i.e. synthetic lethal) interactions, while PPP (all positive) motifs were associated with protein complexes. The two other motif classes (NNP: a positive interaction spanned by two negative interactions, and NPP: a negative spanned by two positives) showed very distinct functional associations, with physical interactions dominating for the former but alternative enrichments, typical of biochemical pathways, dominating for the latter. Conclusion We present a model showing how NNP motifs can be used to recognize supportive relationships between protein complexes, while NPP motifs often identify opposing or regulatory behaviour between a gene and an associated pathway. The ability to use motifs to point toward underlying biological organizational themes is likely to be increasingly important as more extensive epistasis mapping projects in higher organisms begin.

  18. Fingerprint motifs of phytases | Fan | African Journal of Biotechnology

    African Journals Online (AJOL)

    Among the total of potential 173 phytases gained in 11 plant genomes through MAST, PAPhys are the major phytases, and HAPhys are the minor, and other phytase groups are not found in planta. Keywords: Phytase, fingerprint motif, multiple EM for motif elicitation (MEME), MAST African Journal of Biotechnology Vol.

  19. Mitochondrial uncoupling proteins in unicellular eukaryotes.

    Science.gov (United States)

    Jarmuszkiewicz, Wieslawa; Woyda-Ploszczyca, Andrzej; Antos-Krzeminska, Nina; Sluse, Francis E

    2010-01-01

    Uncoupling proteins (UCPs) are members of the mitochondrial anion carrier protein family that are present in the mitochondrial inner membrane and mediate free fatty acid (FFA)-activated, purine nucleotide (PN)-inhibited proton conductance. Since 1999, the presence of UCPs has been demonstrated in some non-photosynthesising unicellular eukaryotes, including amoeboid and parasite protists, as well as in non-fermentative yeast and filamentous fungi. In the mitochondria of these organisms, UCP activity is revealed upon FFA-induced, PN-inhibited stimulation of resting respiration and a decrease in membrane potential, which are accompanied by a decrease in membranous ubiquinone (Q) reduction level. UCPs in unicellular eukaryotes are able to divert energy from oxidative phosphorylation and thus compete for a proton electrochemical gradient with ATP synthase. Our recent work indicates that membranous Q is a metabolic sensor that might utilise its redox state to release the PN inhibition of UCP-mediated mitochondrial uncoupling under conditions of phosphorylation and resting respiration. The action of reduced Q (QH2) could allow higher or complete activation of UCP. As this regulatory feature was demonstrated for microorganism UCPs (A. castellanii UCP), plant and mammalian UCP1 analogues, and UCP1 in brown adipose tissue, the process could involve all UCPs. Here, we discuss the functional connection and physiological role of UCP and alternative oxidase, two main energy-dissipating systems in the plant-type mitochondrial respiratory chain of unicellular eukaryotes, including the control of cellular energy balance as well as preventive action against the production of reactive oxygen species. Copyright © 2009 Elsevier B.V. All rights reserved.

  20. Towards New Antifolates Targeting Eukaryotic Opportunistic Infections

    Energy Technology Data Exchange (ETDEWEB)

    Liu, J.; Bolstad, D; Bolstad, E; Wright, D; Anderson, A

    2009-01-01

    Trimethoprim, an antifolate commonly prescribed in combination with sulfamethoxazole, potently inhibits several prokaryotic species of dihydrofolate reductase (DHFR). However, several eukaryotic pathogenic organisms are resistant to trimethoprim, preventing its effective use as a therapeutic for those infections. We have been building a program to reengineer trimethoprim to more potently and selectively inhibit eukaryotic species of DHFR as a viable strategy for new drug discovery targeting several opportunistic pathogens. We have developed a series of compounds that exhibit potent and selective inhibition of DHFR from the parasitic protozoa Cryptosporidium and Toxoplasma as well as the fungus Candida glabrata. A comparison of the structures of DHFR from the fungal species Candida glabrata and Pneumocystis suggests that the compounds may also potently inhibit Pneumocystis DHFR.

  1. Short Arginine Motifs Drive Protein Stickiness in the Escherichia coli Cytoplasm.

    Science.gov (United States)

    Kyne, Ciara; Crowley, Peter B

    2017-09-19

    Although essential to numerous biotech applications, knowledge of molecular recognition by arginine-rich motifs in live cells remains limited. 1 H, 15 N HSQC and 19 F NMR spectroscopies were used to investigate the effects of C-terminal -GR n (n = 1-5) motifs on GB1 interactions in Escherichia coli cells and cell extracts. While the "biologically inert" GB1 yields high-quality in-cell spectra, the -GR n fusions with n = 4 or 5 were undetectable. This result suggests that a tetra-arginine motif is sufficient to drive interactions between a test protein and macromolecules in the E. coli cytoplasm. The inclusion of a 12 residue flexible linker between GB1 and the -GR 5 motif did not improve detection of the "inert" domain. In contrast, all of the constructs were detectable in cell lysates and extracts, suggesting that the arginine-mediated complexes were weak. Together these data reveal the significance of weak interactions between short arginine-rich motifs and the E. coli cytoplasm and demonstrate the potential of such motifs to modify protein interactions in living cells. These interactions must be considered in the design of (in vivo) nanoscale assemblies that rely on arginine-rich sequences.

  2. Eukaryotic acquisition of a bacterial operon

    Science.gov (United States)

    The yeast Saccharomyces cerevisiae is one of the champions of basic biomedical research due to its compact eukaryotic genome and ease of experimental manipulation. Despite these immense strengths, its impact on understanding the genetic basis of natural phenotypic variation has been limited by strai...

  3. Quantitative prediction of shrimp disease incidence via the profiles of gut eukaryotic microbiota.

    Science.gov (United States)

    Xiong, Jinbo; Yu, Weina; Dai, Wenfang; Zhang, Jinjie; Qiu, Qiongfen; Ou, Changrong

    2018-04-01

    One common notion is emerging that gut eukaryotes are commensal or beneficial, rather than detrimental. To date, however, surprisingly few studies have been taken to discern the factors that govern the assembly of gut eukaryotes, despite growing interest in the dysbiosis of gut microbiota-disease relationship. Herein, we firstly explored how the gut eukaryotic microbiotas were assembled over shrimp postlarval to adult stages and a disease progression. The gut eukaryotic communities changed markedly as healthy shrimp aged, and converged toward an adult-microbiota configuration. However, the adult-like stability was distorted by disease exacerbation. A null model untangled that the deterministic processes that governed the gut eukaryotic assembly tended to be more important over healthy shrimp development, whereas this trend was inverted as the disease progressed. After ruling out the baseline of gut eukaryotes over shrimp ages, we identified disease-discriminatory taxa (species level afforded the highest accuracy of prediction) that characteristic of shrimp health status. The profiles of these taxa contributed an overall 92.4% accuracy in predicting shrimp health status. Notably, this model can accurately diagnose the onset of shrimp disease. Interspecies interaction analysis depicted how the disease-discriminatory taxa interacted with one another in sustaining shrimp health. Taken together, our findings offer novel insights into the underlying ecological processes that govern the assembly of gut eukaryotes over shrimp postlarval to adult stages and a disease progression. Intriguingly, the established model can quantitatively and accurately predict the incidences of shrimp disease.

  4. Anion induced conformational preference of Cα NN motif residues in functional proteins.

    Science.gov (United States)

    Patra, Piya; Ghosh, Mahua; Banerjee, Raja; Chakrabarti, Jaydeb

    2017-12-01

    Among different ligand binding motifs, anion binding C α NN motif consisting of peptide backbone atoms of three consecutive residues are observed to be important for recognition of free anions, like sulphate or biphosphate and participate in different key functions. Here we study the interaction of sulphate and biphosphate with C α NN motif present in different proteins. Instead of total protein, a peptide fragment has been studied keeping C α NN motif flanked in between other residues. We use classical force field based molecular dynamics simulations to understand the stability of this motif. Our data indicate fluctuations in conformational preferences of the motif residues in absence of the anion. The anion gives stability to one of these conformations. However, the anion induced conformational preferences are highly sequence dependent and specific to the type of anion. In particular, the polar residues are more favourable compared to the other residues for recognising the anion. © 2017 Wiley Periodicals, Inc.

  5. Gene regulatory and signaling networks exhibit distinct topological distributions of motifs

    Science.gov (United States)

    Ferreira, Gustavo Rodrigues; Nakaya, Helder Imoto; Costa, Luciano da Fontoura

    2018-04-01

    The biological processes of cellular decision making and differentiation involve a plethora of signaling pathways and gene regulatory circuits. These networks in turn exhibit a multitude of motifs playing crucial parts in regulating network activity. Here we compare the topological placement of motifs in gene regulatory and signaling networks and observe that it suggests different evolutionary strategies in motif distribution for distinct cellular subnetworks.

  6. Finding a Leucine in a Haystack: Searching the Proteome for ambigous Leucine-Aspartic Acid motifs

    KAUST Repository

    Arold, Stefan T.

    2016-01-25

    Leucine-aspartic acid (LD) motifs are short helical protein-protein interaction motifs involved in cell motility, survival and communication. LD motif interactions are also implicated in cancer metastasis and are targeted by several viruses. LD motifs are notoriously difficult to detect because sequence pattern searches lead to an excessively high number of false positives. Hence, despite 20 years of research, only six LD motif–containing proteins are known in humans, three of which are close homologues of the paxillin family. To enable the proteome-wide discovery of LD motifs, we developed LD Motif Finder (LDMF), a web tool based on machine learning that combines sequence information with structural predictions to detect LD motifs with high accuracy. LDMF predicted 13 new LD motifs in humans. Using biophysical assays, we experimentally confirmed in vitro interactions for four novel LD motif proteins. Thus, LDMF allows proteome-wide discovery of LD motifs, despite a highly ambiguous sequence pattern. Functional implications will be discussed.

  7. A proposed vestigial translation initiation motif in VP1 of hepatitis A virus.

    Science.gov (United States)

    Kang, Jeong-Ah; Funkhouser, Ann W

    2002-07-01

    The internal ribosome entry site (IRES) of picornaviruses has a 3' polypyrimidine tract (PPT) 16-24 bases upstream of an AUG triplet (PPT/AUG motif). This motif is critical in determining the efficiency of cap-independent translation. HAV has a conserved PPT/AUG motif consisting of a nine base sequence (AGGUUUUUC) 23 bases upstream of the preferred AUG start codon. This HAV-specific PPT/AUG motif is repeated and conserved in VP1 of HAV, but not of other picornaviruses. We proposed that the PPT/AUG motif in the open reading frame initiated translation and/or had an impact on the life cycle of the virus. In vitro translation of mutant bicistronic mRNAs and growth in cell culture of mutant viruses provided no evidence that the VP1 PPT/AUG motif had any impact on either translation or growth. HAV differs from other picornaviruses in its inefficient growth in cell culture. Since the HAV-specific PPT/AUG motif is found in only 1 in 300,000 reported viral sequences outside the hepatovirus genus, this motif may be a vestigial translation initiation element and may have played a role in determining the unusual phenotype of HAV.

  8. Mechanisms and regulation of DNA replication initiation in eukaryotes.

    Science.gov (United States)

    Parker, Matthew W; Botchan, Michael R; Berger, James M

    2017-04-01

    Cellular DNA replication is initiated through the action of multiprotein complexes that recognize replication start sites in the chromosome (termed origins) and facilitate duplex DNA melting within these regions. In a typical cell cycle, initiation occurs only once per origin and each round of replication is tightly coupled to cell division. To avoid aberrant origin firing and re-replication, eukaryotes tightly regulate two events in the initiation process: loading of the replicative helicase, MCM2-7, onto chromatin by the origin recognition complex (ORC), and subsequent activation of the helicase by its incorporation into a complex known as the CMG. Recent work has begun to reveal the details of an orchestrated and sequential exchange of initiation factors on DNA that give rise to a replication-competent complex, the replisome. Here, we review the molecular mechanisms that underpin eukaryotic DNA replication initiation - from selecting replication start sites to replicative helicase loading and activation - and describe how these events are often distinctly regulated across different eukaryotic model organisms.

  9. CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures

    Directory of Open Access Journals (Sweden)

    Hamed Bostan

    2012-01-01

    Full Text Available Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition.

  10. BEAM web server: a tool for structural RNA motif discovery.

    Science.gov (United States)

    Pietrosanto, Marco; Adinolfi, Marta; Casula, Riccardo; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2018-03-15

    RNA structural motif finding is a relevant problem that becomes computationally hard when working on high-throughput data (e.g. eCLIP, PAR-CLIP), often represented by thousands of RNA molecules. Currently, the BEAM server is the only web tool capable to handle tens of thousands of RNA in input with a motif discovery procedure that is only limited by the current secondary structure prediction accuracies. The recently developed method BEAM (BEAr Motifs finder) can analyze tens of thousands of RNA molecules and identify RNA secondary structure motifs associated to a measure of their statistical significance. BEAM is extremely fast thanks to the BEAR encoding that transforms each RNA secondary structure in a string of characters. BEAM also exploits the evolutionary knowledge contained in a substitution matrix of secondary structure elements, extracted from the RFAM database of families of homologous RNAs. The BEAM web server has been designed to streamline data pre-processing by automatically handling folding and encoding of RNA sequences, giving users a choice for the preferred folding program. The server provides an intuitive and informative results page with the list of secondary structure motifs identified, the logo of each motif, its significance, graphic representation and information about its position in the RNA molecules sharing it. The web server is freely available at http://beam.uniroma2.it/ and it is implemented in NodeJS and Python with all major browsers supported. marco.pietrosanto@uniroma2.it. Supplementary data are available at Bioinformatics online.

  11. On the Archaeal Origins of Eukaryotes and the Challenges of Inferring Phenotype from Genotype.

    Science.gov (United States)

    Dey, Gautam; Thattai, Mukund; Baum, Buzz

    2016-07-01

    If eukaryotes arose through a merger between archaea and bacteria, what did the first true eukaryotic cell look like? A major step toward an answer came with the discovery of Lokiarchaeum, an archaeon whose genome encodes small GTPases related to those used by eukaryotes to regulate membrane traffic. Although 'Loki' cells have yet to be seen, their existence has prompted the suggestion that the archaeal ancestor of eukaryotes engulfed the future mitochondrion by phagocytosis. We propose instead that the archaeal ancestor was a relatively simple cell, and that eukaryotic cellular organization arose as the result of a gradual transfer of bacterial genes and membranes driven by an ever-closer symbiotic partnership between a bacterium and an archaeon. Copyright © 2016 The Authors. Published by Elsevier Ltd.. All rights reserved.

  12. Review article: The mountain motif in the plot of Matthew

    Directory of Open Access Journals (Sweden)

    Gert J. Volschenk

    2010-09-01

    Full Text Available This article reviewed T.L. Donaldson’s book, Jesus on the mountain: A study in Matthean theology, published in 1985 by JSOT Press, Sheffield, and focused on the mountain motif in the structure and plot of the Gospel of Matthew, in addition to the work of Donaldson on the mountain motif as a literary motif and as theological symbol. The mountain is a primary theological setting for Jesus’ ministry and thus is an important setting, serving as one of the literary devices by which Matthew structured and progressed his narrative. The Zion theological and eschatological significance and Second Temple Judaism serve as the historical and theological background for the mountain motif. The last mountain setting (Mt 28:16–20 is the culmination of the three theological themes in the plot of Matthew, namely Christology, ecclesiology and salvation history.

  13. On the origin of distribution patterns of motifs in biological networks

    Directory of Open Access Journals (Sweden)

    Lesk Arthur M

    2008-08-01

    Full Text Available Abstract Background Inventories of small subgraphs in biological networks have identified commonly-recurring patterns, called motifs. The inference that these motifs have been selected for function rests on the idea that their occurrences are significantly more frequent than random. Results Our analysis of several large biological networks suggests, in contrast, that the frequencies of appearance of common subgraphs are similar in natural and corresponding random networks. Conclusion Indeed, certain topological features of biological networks give rise naturally to the common appearance of the motifs. We therefore question whether frequencies of occurrences are reasonable evidence that the structures of motifs have been selected for their functional contribution to the operation of networks.

  14. Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

    Science.gov (United States)

    König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

    2013-01-01

    G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations do destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141

  15. Systematic comparison of the response properties of protein and RNA mediated gene regulatory motifs.

    Science.gov (United States)

    Iyengar, Bharat Ravi; Pillai, Beena; Venkatesh, K V; Gadgil, Chetan J

    2017-05-30

    We present a framework enabling the dissection of the effects of motif structure (feedback or feedforward), the nature of the controller (RNA or protein), and the regulation mode (transcriptional, post-transcriptional or translational) on the response to a step change in the input. We have used a common model framework for gene expression where both motif structures have an activating input and repressing regulator, with the same set of parameters, to enable a comparison of the responses. We studied the global sensitivity of the system properties, such as steady-state gain, overshoot, peak time, and peak duration, to parameters. We find that, in all motifs, overshoot correlated negatively whereas peak duration varied concavely with peak time. Differences in the other system properties were found to be mainly dependent on the nature of the controller rather than the motif structure. Protein mediated motifs showed a higher degree of adaptation i.e. a tendency to return to baseline levels; in particular, feedforward motifs exhibited perfect adaptation. RNA mediated motifs had a mild regulatory effect; they also exhibited a lower peaking tendency and mean overshoot. Protein mediated feedforward motifs showed higher overshoot and lower peak time compared to the corresponding feedback motifs.

  16. Fast social-like learning of complex behaviors based on motor motifs

    Science.gov (United States)

    Calvo Tapia, Carlos; Tyukin, Ivan Y.; Makarov, Valeri A.

    2018-05-01

    Social learning is widely observed in many species. Less experienced agents copy successful behaviors exhibited by more experienced individuals. Nevertheless, the dynamical mechanisms behind this process remain largely unknown. Here we assume that a complex behavior can be decomposed into a sequence of n motor motifs. Then a neural network capable of activating motor motifs in a given sequence can drive an agent. To account for (n -1 )! possible sequences of motifs in a neural network, we employ the winnerless competition approach. We then consider a teacher-learner situation: one agent exhibits a complex movement, while another one aims at mimicking the teacher's behavior. Despite the huge variety of possible motif sequences we show that the learner, equipped with the provided learning model, can rewire "on the fly" its synaptic couplings in no more than (n -1 ) learning cycles and converge exponentially to the durations of the teacher's motifs. We validate the learning model on mobile robots. Experimental results show that the learner is indeed capable of copying the teacher's behavior composed of six motor motifs in a few learning cycles. The reported mechanism of learning is general and can be used for replicating different functions, including, for example, sound patterns or speech.

  17. Symbiosis and the origin of eukaryotic motility

    Science.gov (United States)

    Margulis, L.; Hinkle, G.

    1991-01-01

    Ongoing work to test the hypothesis of the origin of eukaryotic cell organelles by microbial symbioses is discussed. Because of the widespread acceptance of the serial endosymbiotic theory (SET) of the origin of plastids and mitochondria, the idea of the symbiotic origin of the centrioles and axonemes for spirochete bacteria motility symbiosis was tested. Intracellular microtubular systems are purported to derive from symbiotic associations between ancestral eukaryotic cells and motile bacteria. Four lines of approach to this problem are being pursued: (1) cloning the gene of a tubulin-like protein discovered in Spirocheata bajacaliforniesis; (2) seeking axoneme proteins in spirochets by antibody cross-reaction; (3) attempting to cultivate larger, free-living spirochetes; and (4) studying in detail spirochetes (e.g., Cristispira) symbiotic with marine animals. Other aspects of the investigation are presented.

  18. Thermal Stability of Modified i-Motif Oligonucleotides with Naphthalimide Intercalating Nucleic Acids

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed Ali; Pedersen, Erik B.; Khaireldin, Nahid Y.

    2016-01-01

    In continuation of our investigation of characteristics and thermodynamic properties of the i-motif 5′-d[(CCCTAA)3CCCT)] upon insertion of intercalating nucleotides into the cytosine-rich oligonucleotide, this article evaluates the stabilities of i-motif oligonucleotides upon insertion of naphtha......In continuation of our investigation of characteristics and thermodynamic properties of the i-motif 5′-d[(CCCTAA)3CCCT)] upon insertion of intercalating nucleotides into the cytosine-rich oligonucleotide, this article evaluates the stabilities of i-motif oligonucleotides upon insertion...... of naphthalimide (1H-benzo[de]isoquinoline-1,3(2H)-dione) as the intercalating nucleic acid. The stabilities of i-motif structures with inserted naphthalimide intercalating nucleotides were studied using UV melting temperatures (Tm) and circular dichroism spectra at different pH values and conditions (crowding...

  19. The long non-coding RNA GAS5 cooperates with the eukaryotic translation initiation factor 4E to regulate c-Myc translation.

    Directory of Open Access Journals (Sweden)

    Guangzhen Hu

    Full Text Available Long noncoding RNAs (lncRNAs are important regulators of transcription; however, their involvement in protein translation is not well known. Here we explored whether the lncRNA GAS5 is associated with translation initiation machinery and regulates translation. GAS5 was enriched with eukaryotic translation initiation factor-4E (eIF4E in an RNA-immunoprecipitation assay using lymphoma cell lines. We identified two RNA binding motifs within eIF4E protein and the deletion of each motif inhibited the binding of GAS5 with eIF4E. To confirm the role of GAS5 in translation regulation, GAS5 siRNA and in vitro transcribed GAS5 RNA were used to knock down or overexpress GAS5, respectively. GAS5 siRNA had no effect on global protein translation but did specifically increase c-Myc protein level without an effect on c-Myc mRNA. The mechanism of this increase in c-Myc protein was enhanced association of c-Myc mRNA with the polysome without any effect on protein stability. In contrast, overexpression of in vitro transcribed GAS5 RNA suppressed c-Myc protein without affecting c-Myc mRNA. Interestingly, GAS5 was found to be bound with c-Myc mRNA, suggesting that GAS5 regulates c-Myc translation through lncRNA-mRNA interaction. Our findings have uncovered a role of GAS5 lncRNA in translation regulation through its interactions with eIF4E and c-Myc mRNA.

  20. WildSpan: mining structured motifs from protein sequences

    Directory of Open Access Journals (Sweden)

    Chen Chien-Yu

    2011-03-01

    Full Text Available Abstract Background Automatic extraction of motifs from biological sequences is an important research problem in study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually largely separated in sequences. Discovering such patterns is time-consuming because abundant combinations exist when long gaps (a gap consists of one or more successive wildcards are considered. Mining algorithms often employ constraints to narrow down the search space in order to increase efficiency. However, improper constraint models might degrade the sensitivity and specificity of the motifs discovered by computational methods. We previously proposed a new constraint model to handle large wildcard regions for discovering functional motifs of proteins. The patterns that satisfy the proposed constraint model are called W-patterns. A W-pattern is a structured motif that groups motif symbols into pattern blocks interleaved with large irregular gaps. Considering large gaps reflects the fact that functional residues are not always from a single region of protein sequences, and restricting motif symbols into clusters corresponds to the observation that short motifs are frequently present within protein families. To efficiently discover W-patterns for large-scale sequence annotation and function prediction, this paper first formally introduces the problem to solve and proposes an algorithm named WildSpan (sequential pattern mining across large wildcard regions that incorporates several pruning strategies to largely reduce the mining cost. Results WildSpan is shown to efficiently find W-patterns containing conserved residues that are far separated in sequences. We conducted experiments with two mining strategies, protein-based and family-based mining, to evaluate the usefulness of W-patterns and performance of WildSpan. The protein-based mining mode

  1. Motif formation and industry specific topologies in the Japanese business firm network

    Science.gov (United States)

    Maluck, Julian; Donner, Reik V.; Takayasu, Hideki; Takayasu, Misako

    2017-05-01

    Motifs and roles are basic quantities for the characterization of interactions among 3-node subsets in complex networks. In this work, we investigate how the distribution of 3-node motifs can be influenced by modifying the rules of an evolving network model while keeping the statistics of simpler network characteristics, such as the link density and the degree distribution, invariant. We exemplify this problem for the special case of the Japanese Business Firm Network, where a well-studied and relatively simple yet realistic evolving network model is available, and compare the resulting motif distribution in the real-world and simulated networks. To better approximate the motif distribution of the real-world network in the model, we introduce both subgraph dependent and global additional rules. We find that a specific rule that allows only for the merging process between nodes with similar link directionality patterns reduces the observed excess of densely connected motifs with bidirectional links. Our study improves the mechanistic understanding of motif formation in evolving network models to better describe the characteristic features of real-world networks with a scale-free topology.

  2. Binding properties of SUMO-interacting motifs (SIMs) in yeast.

    Science.gov (United States)

    Jardin, Christophe; Horn, Anselm H C; Sticht, Heinrich

    2015-03-01

    Small ubiquitin-like modifier (SUMO) conjugation and interaction play an essential role in many cellular processes. A large number of yeast proteins is known to interact non-covalently with SUMO via short SUMO-interacting motifs (SIMs), but the structural details of this interaction are yet poorly characterized. In the present work, sequence analysis of a large dataset of 148 yeast SIMs revealed the existence of a hydrophobic core binding motif and a preference for acidic residues either within or adjacent to the core motif. Thus the sequence properties of yeast SIMs are highly similar to those described for human. Molecular dynamics simulations were performed to investigate the binding preferences for four representative SIM peptides differing in the number and distribution of acidic residues. Furthermore, the relative stability of two previously observed alternative binding orientations (parallel, antiparallel) was assessed. For all SIMs investigated, the antiparallel binding mode remained stable in the simulations and the SIMs were tightly bound via their hydrophobic core residues supplemented by polar interactions of the acidic residues. In contrary, the stability of the parallel binding mode is more dependent on the sequence features of the SIM motif like the number and position of acidic residues or the presence of additional adjacent interaction motifs. This information should be helpful to enhance the prediction of SIMs and their binding properties in different organisms to facilitate the reconstruction of the SUMO interactome.

  3. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin

    2015-01-01

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  4. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun

    2015-09-27

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  5. Characterization of prokaryotic and eukaryotic promoters usinghidden Markov models

    DEFF Research Database (Denmark)

    Pedersen, Anders Gorm; Baldi, Pierre; Brunak, Søren

    1996-01-01

    In this paper we utilize hidden Markov models (HMMs) and information theory to analyze prokaryotic and eukaryotic promoters. We perform this analysis with special emphasis on the fact that promoters are divided into a number of different classes, depending on which polymerase-associated factors...... that bind to them. We find that HMMs trained on such subclasses of Escherichia coli promoters (specifically, the so-called sigma-70 and sigma-54 classes) give an excellent classification of unknown promoters with respect to sigma-class. HMMs trained on eukaryotic sequences from human genes also model nicely...

  6. Fragment-based modelling of single stranded RNA bound to RNA recognition motif containing proteins

    Science.gov (United States)

    de Beauchene, Isaure Chauvot; de Vries, Sjoerd J.; Zacharias, Martin

    2016-01-01

    Abstract Protein-RNA complexes are important for many biological processes. However, structural modeling of such complexes is hampered by the high flexibility of RNA. Particularly challenging is the docking of single-stranded RNA (ssRNA). We have developed a fragment-based approach to model the structure of ssRNA bound to a protein, based on only the protein structure, the RNA sequence and conserved contacts. The conformational diversity of each RNA fragment is sampled by an exhaustive library of trinucleotides extracted from all known experimental protein–RNA complexes. The method was applied to ssRNA with up to 12 nucleotides which bind to dimers of the RNA recognition motifs (RRMs), a highly abundant eukaryotic RNA-binding domain. The fragment based docking allows a precise de novo atomic modeling of protein-bound ssRNA chains. On a benchmark of seven experimental ssRNA–RRM complexes, near-native models (with a mean heavy-atom deviation of <3 Å from experiment) were generated for six out of seven bound RNA chains, and even more precise models (deviation < 2 Å) were obtained for five out of seven cases, a significant improvement compared to the state of the art. The method is not restricted to RRMs but was also successfully applied to Pumilio RNA binding proteins. PMID:27131381

  7. Efficient sequential and parallel algorithms for finding edit distance based motifs.

    Science.gov (United States)

    Pal, Soumitra; Xiao, Peng; Rajasekaran, Sanguthevar

    2016-08-18

    Motif search is an important step in extracting meaningful patterns from biological data. The general problem of motif search is intractable and there is a pressing need to develop efficient, exact and approximation algorithms to solve this problem. In this paper, we present several novel, exact, sequential and parallel algorithms for solving the (l,d) Edit-distance-based Motif Search (EMS) problem: given two integers l,d and n biological strings, find all strings of length l that appear in each input string with atmost d errors of types substitution, insertion and deletion. One popular technique to solve the problem is to explore for each input string the set of all possible l-mers that belong to the d-neighborhood of any substring of the input string and output those which are common for all input strings. We introduce a novel and provably efficient neighborhood exploration technique. We show that it is enough to consider the candidates in neighborhood which are at a distance exactly d. We compactly represent these candidate motifs using wildcard characters and efficiently explore them with very few repetitions. Our sequential algorithm uses a trie based data structure to efficiently store and sort the candidate motifs. Our parallel algorithm in a multi-core shared memory setting uses arrays for storing and a novel modification of radix-sort for sorting the candidate motifs. The algorithms for EMS are customarily evaluated on several challenging instances such as (8,1), (12,2), (16,3), (20,4), and so on. The best previously known algorithm, EMS1, is sequential and in estimated 3 days solves up to instance (16,3). Our sequential algorithms are more than 20 times faster on (16,3). On other hard instances such as (9,2), (11,3), (13,4), our algorithms are much faster. Our parallel algorithm has more than 600 % scaling performance while using 16 threads. Our algorithms have pushed up the state-of-the-art of EMS solvers and we believe that the techniques introduced in

  8. Physical-chemical property based sequence motifs and methods regarding same

    Science.gov (United States)

    Braun, Werner [Friendswood, TX; Mathura, Venkatarajan S [Sarasota, FL; Schein, Catherine H [Friendswood, TX

    2008-09-09

    A data analysis system, program, and/or method, e.g., a data mining/data exploration method, using physical-chemical property motifs. For example, a sequence database may be searched for identifying segments thereof having physical-chemical properties similar to the physical-chemical property motifs.

  9. Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.

    Science.gov (United States)

    Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude

    2011-06-20

    One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.

  10. Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs

    Directory of Open Access Journals (Sweden)

    Martin Juliette

    2011-06-01

    Full Text Available Abstract Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet, which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i ubiquitous motifs, shared by several superfamilies and (ii superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.

  11. Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks.

    Science.gov (United States)

    Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario

    2018-03-01

    Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli . Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution.

  12. RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.

    Science.gov (United States)

    Castro-Mondragon, Jaime Abraham; Jaeger, Sébastien; Thieffry, Denis; Thomas-Chollier, Morgane; van Helden, Jacques

    2017-07-27

    Transcription factor (TF) databases contain multitudes of binding motifs (TFBMs) from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant TFBM collections. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools, and highlights biologically relevant variations of similar motifs. We also ran a large-scale application to cluster ∼11 000 motifs from 24 entire databases, showing that matrix-clustering correctly groups motifs belonging to the same TF families, and drastically reduced motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Secretory TAT-peptide-mediated protein transduction of LIF receptor α-chain distal cytoplasmic motifs into human myeloid HL-60 cells

    Directory of Open Access Journals (Sweden)

    Q. Sun

    2012-10-01

    Full Text Available The distal cytoplasmic motifs of leukemia inhibitory factor receptor α-chain (LIFRα-CT3 can independently induce intracellular myeloid differentiation in acute myeloid leukemia (AML cells by gene transfection; however, there are significant limitations in the potential clinical use of these motifs due to liposome-derived genetic modifications. To produce a potentially therapeutic LIFRα-CT3 with cell-permeable activity, we constructed a eukaryotic expression pcDNA3.0-TAT-CT3-cMyc plasmid with a signal peptide (ss inserted into the N-terminal that codes for an ss-TAT-CT3-cMyc fusion protein. The stable transfection of Chinese hamster ovary (CHO cells via this vector and subsequent selection by Geneticin resulted in cell lines that express and secrete TAT-CT3-cMyc. The spent medium of pcDNA3.0-TAT-CT3-cMyc-transfected CHO cells could be purified using a cMyc-epitope-tag agarose affinity chromatography column and could be detected via SDS-PAGE, with antibodies against cMyc-tag. The direct administration of TAT-CT3-cMyc to HL-60 cell culture media caused the enrichment of CT3-cMyc in the cytoplasm and nucleus within 30 min and led to a significant reduction of viable cells (P < 0.05 8 h after exposure. The advantages of using this mammalian expression system include the ease of generating TAT fusion proteins that are adequately transcripted and the potential for a sustained production of such proteins in vitro for future AML therapy.

  14. Secretory TAT-peptide-mediated protein transduction of LIF receptor α-chain distal cytoplasmic motifs into human myeloid HL-60 cells

    International Nuclear Information System (INIS)

    Sun, Q.; Xiong, J.; Lu, J.; Xu, S.; Li, Y.; Zhong, X.P.; Gao, G.K.; Liu, H.Q.

    2012-01-01

    The distal cytoplasmic motifs of leukemia inhibitory factor receptor α-chain (LIFRα-CT3) can independently induce intracellular myeloid differentiation in acute myeloid leukemia (AML) cells by gene transfection; however, there are significant limitations in the potential clinical use of these motifs due to liposome-derived genetic modifications. To produce a potentially therapeutic LIFRα-CT3 with cell-permeable activity, we constructed a eukaryotic expression pcDNA3.0-TAT-CT3-cMyc plasmid with a signal peptide (ss) inserted into the N-terminal that codes for an ss-TAT-CT3-cMyc fusion protein. The stable transfection of Chinese hamster ovary (CHO) cells via this vector and subsequent selection by Geneticin resulted in cell lines that express and secrete TAT-CT3-cMyc. The spent medium of pcDNA3.0-TAT-CT3-cMyc-transfected CHO cells could be purified using a cMyc-epitope-tag agarose affinity chromatography column and could be detected via SDS-PAGE, with antibodies against cMyc-tag. The direct administration of TAT-CT3-cMyc to HL-60 cell culture media caused the enrichment of CT3-cMyc in the cytoplasm and nucleus within 30 min and led to a significant reduction of viable cells (P < 0.05) 8 h after exposure. The advantages of using this mammalian expression system include the ease of generating TAT fusion proteins that are adequately transcripted and the potential for a sustained production of such proteins in vitro for future AML therapy

  15. Secretory TAT-peptide-mediated protein transduction of LIF receptor α-chain distal cytoplasmic motifs into human myeloid HL-60 cells

    Energy Technology Data Exchange (ETDEWEB)

    Sun, Q. [Department of Hyperbaric Medicine, No. 401 Hospital of PLA, Qingdao (China); Department of Histology and Embryology, Faculty of Basic Medical Sciences, Second Military Medical University, Shanghai (China); Xiong, J. [Department of Histology and Embryology, Faculty of Basic Medical Sciences, Second Military Medical University, Shanghai (China); Lu, J. [Office of Medical Education, Training Department, Second Military Medical University, Shanghai (China); Xu, S. [Department of Histology and Embryology, Faculty of Basic Medical Sciences, Second Military Medical University, Shanghai (China); Li, Y. [State Food and Drug Administration of China,Huangdao Branch, Qingdao (China); Zhong, X.P.; Gao, G.K. [Department of Hyperbaric Medicine, No. 401 Hospital of PLA, Qingdao (China); Liu, H.Q. [2Department of Histology and Embryology, Faculty of Basic Medical Sciences, Second Military Medical University, Shanghai (China)

    2012-06-22

    The distal cytoplasmic motifs of leukemia inhibitory factor receptor α-chain (LIFRα-CT3) can independently induce intracellular myeloid differentiation in acute myeloid leukemia (AML) cells by gene transfection; however, there are significant limitations in the potential clinical use of these motifs due to liposome-derived genetic modifications. To produce a potentially therapeutic LIFRα-CT3 with cell-permeable activity, we constructed a eukaryotic expression pcDNA3.0-TAT-CT3-cMyc plasmid with a signal peptide (ss) inserted into the N-terminal that codes for an ss-TAT-CT3-cMyc fusion protein. The stable transfection of Chinese hamster ovary (CHO) cells via this vector and subsequent selection by Geneticin resulted in cell lines that express and secrete TAT-CT3-cMyc. The spent medium of pcDNA3.0-TAT-CT3-cMyc-transfected CHO cells could be purified using a cMyc-epitope-tag agarose affinity chromatography column and could be detected via SDS-PAGE, with antibodies against cMyc-tag. The direct administration of TAT-CT3-cMyc to HL-60 cell culture media caused the enrichment of CT3-cMyc in the cytoplasm and nucleus within 30 min and led to a significant reduction of viable cells (P < 0.05) 8 h after exposure. The advantages of using this mammalian expression system include the ease of generating TAT fusion proteins that are adequately transcripted and the potential for a sustained production of such proteins in vitro for future AML therapy.

  16. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94% appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1 a dimer interface motif found in voltage-gated chloride channels, (2 a proton transfer motif found in heme-copper oxidases, and (3 a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  17. Arabinogalactan proteins have deep roots in eukaryotes

    DEFF Research Database (Denmark)

    Hervé, Cécile; Siméon, Amandine; Jam, Murielle

    2016-01-01

    Arabinogalactan proteins (AGPs) are highly glycosylated, hydroxyproline-rich proteins found at the cell surface of plants, where they play key roles in developmental processes. Brown algae are marine, multicellular, photosynthetic eukaryotes. They belong to the phylum Stramenopiles, which...

  18. Enzymes involved in organellar DNA replication in photosynthetic eukaryotes.

    Science.gov (United States)

    Moriyama, Takashi; Sato, Naoki

    2014-01-01

    Plastids and mitochondria possess their own genomes. Although the replication mechanisms of these organellar genomes remain unclear in photosynthetic eukaryotes, several organelle-localized enzymes related to genome replication, including DNA polymerase, DNA primase, DNA helicase, DNA topoisomerase, single-stranded DNA maintenance protein, DNA ligase, primer removal enzyme, and several DNA recombination-related enzymes, have been identified. In the reference Eudicot plant Arabidopsis thaliana, the replication-related enzymes of plastids and mitochondria are similar because many of them are dual targeted to both organelles, whereas in the red alga Cyanidioschyzon merolae, plastids and mitochondria contain different replication machinery components. The enzymes involved in organellar genome replication in green plants and red algae were derived from different origins, including proteobacterial, cyanobacterial, and eukaryotic lineages. In the present review, we summarize the available data for enzymes related to organellar genome replication in green plants and red algae. In addition, based on the type and distribution of replication enzymes in photosynthetic eukaryotes, we discuss the transitional history of replication enzymes in the organelles of plants.

  19. Ubiquitination dynamics in the early-branching eukaryote Giardia intestinalis

    Science.gov (United States)

    Niño, Carlos A; Chaparro, Jenny; Soffientini, Paolo; Polo, Simona; Wasserman, Moises

    2013-01-01

    Ubiquitination is a highly dynamic and versatile posttranslational modification that regulates protein function, stability, and interactions. To investigate the roles of ubiquitination in a primitive eukaryotic lineage, we utilized the early-branching eukaryote Giardia intestinalis. Using a combination of biochemical, immunofluorescence-based, and proteomics approaches, we assessed the ubiquitination status during the process of differentiation in Giardia. We observed that different types of ubiquitin modifications present specific cellular and temporal distribution throughout the Giardia life cycle from trophozoites to cyst maturation. Ubiquitin signal was detected in the wall of mature cysts, and enzymes implicated in cyst wall biogenesis were identified as substrates for ubiquitination. Interestingly, inhibition of proteasome activity did not affect trophozoite replication and differentiation, while it caused a decrease in cyst viability, arguing for proteasome involvement in cyst wall maturation. Using a proteomics approach, we identified around 200 high-confidence ubiquitinated candidates that vary their ubiquitination status during differentiation. Our results indicate that ubiquitination is critical for several cellular processes in this primitive eukaryote. PMID:23613346

  20. What can we infer about the origin of sex in early eukaryotes?

    NARCIS (Netherlands)

    Speijer, Dave

    2016-01-01

    Current analysis shows that the last eukaryotic common ancestor (LECA) was capable of full meiotic sex. The original eukaryotic life cycle can probably be described as clonal, interrupted by episodic sex triggered by external or internal stressors. The cycle could have started in a highly flexible

  1. Deletion of the Sm1 encoding motif in the lsm gene results in distinct changes in the transcriptome and enhanced swarming activity of Haloferax cells.

    Science.gov (United States)

    Maier, Lisa-Katharina; Benz, Juliane; Fischer, Susan; Alstetter, Martina; Jaschinski, Katharina; Hilker, Rolf; Becker, Anke; Allers, Thorsten; Soppa, Jörg; Marchfelder, Anita

    2015-10-01

    Members of the Sm protein family are important for the cellular RNA metabolism in all three domains of life. The family includes archaeal and eukaryotic Lsm proteins, eukaryotic Sm proteins and archaeal and bacterial Hfq proteins. While several studies concerning the bacterial and eukaryotic family members have been published, little is known about the archaeal Lsm proteins. Although structures for several archaeal Lsm proteins have been solved already more than ten years ago, we still do not know much about their biological function, however one can confidently propose that the archaeal Lsm proteins will also be involved in RNA metabolism. Therefore, we investigated this protein in the halophilic archaeon Haloferax volcanii. The Haloferax genome encodes a single Lsm protein, the lsm gene overlaps and is co-transcribed with the gene for the ribosomal L37.eR protein. Here, we show that the reading frame of the lsm gene contains a promoter which regulates expression of the overlapping rpl37R gene. This rpl37R specific promoter ensures high expression of the rpl37R gene in exponential growth phase. To investigate the biological function of the Lsm protein we generated a lsm deletion mutant that had the coding sequence for the Sm1 motif removed but still contained the internal promoter for the downstream rpl37R gene. The transcriptome of this deletion mutant was compared to the wild type transcriptome, revealing that several genes are down-regulated and many genes are up-regulated in the deletion strain. Northern blot analyses confirmed down-regulation of two genes. In addition, the deletion strain showed a gain of function in swarming, in congruence with the up-regulation of transcripts encoding proteins required for motility. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.

  2. Metabolism in anoxic permeable sediments is dominated by eukaryotic dark fermentation

    DEFF Research Database (Denmark)

    Bourke, Michael F.; Marriott, Philip J.; Glud, Ronnie N.

    2017-01-01

    Permeable sediments are common across continental shelves and are critical contributors to marine biogeochemical cycling. Organic matter in permeable sediments is dominated by microalgae, which as eukaryotes have different anaerobic metabolic pathways to prokaryotes such as bacteria and archaea....... Here we present analyses of flow-through reactor experiments showing that dissolved inorganic carbon is produced predominantly as a result of anaerobic eukaryotic metabolic activity. In our experiments, anaerobic production of dissolved inorganic carbon was consistently accompanied by large dissolved H....../hydrogenase pathway of fermentative eukaryotic H2 production, suggesting that pathway as the source of H2 and dissolved inorganic carbon production. Metabolomic analysis showed large increases in lipid production at the onset of anoxia, consistent with documented pathways of anoxic dark fermentation in microalgae...

  3. [Cover motifs of the Tidsskrift. A 14-year cavalcade].

    Science.gov (United States)

    Nylenna, M

    1998-12-10

    In 1985 the Journal of the Norwegian Medical Association changed its cover policy, moving the table of contents inside the Journal and introducing cover illustrations. This article provides an analysis of all cover illustrations published over this 14-year period, 420 covers in all. There is a great variation in cover motifs and designs and a development towards more general motifs. The initial emphasis on historical and medical aspects is now less pronounced, while the use of works of art and nature motifs has increased, and the cover now more often has a direct bearing on the specific contents of the issue. Professor of medical history Oivind Larsen has photographed two thirds of the covers and contributed 95% of the inside essay-style reflections on the cover motif. Over the years, he has expanded the role of the historian of medicine disseminating knowledge to include that of the raconteur with a personal tone of voice. The Journal's covers are now one of its most characteristic features, emblematic of the Journal's ambition of standing for quality and timelessness vis-à-vis the news media, and of its aim of bridging the gap between medicine and the humanities.

  4. Insights into the motif preference of APOBEC3 enzymes.

    Directory of Open Access Journals (Sweden)

    Diako Ebrahimi

    Full Text Available We used a multivariate data analysis approach to identify motifs associated with HIV hypermutation by different APOBEC3 enzymes. The analysis showed that APOBEC3G targets G mainly within GG, TG, TGG, GGG, TGGG and also GGGT. The G nucleotides flanked by a C at the 3' end (in +1 and +2 positions were indicated as disfavoured targets by APOBEC3G. The G nucleotides within GGGG were found to be targeted at a frequency much less than what is expected. We found that the infrequent G-to-A mutation within GGGG is not limited to the inaccessibility, to APOBEC3, of poly Gs in the central and 3'polypurine tracts (PPTs which remain double stranded during the HIV reverse transcription. GGGG motifs outside the PPTs were also disfavoured. The motifs GGAG and GAGG were also found to be disfavoured targets for APOBEC3. The motif-dependent mutation of G within the HIV genome by members of the APOBEC3 family other than APOBEC3G was limited to GA→AA changes. The results did not show evidence of other types of context dependent G-to-A changes in the HIV genome.

  5. HupB Is a Bacterial Nucleoid-Associated Protein with an Indispensable Eukaryotic-Like Tail

    Directory of Open Access Journals (Sweden)

    Joanna Hołówka

    2017-11-01

    Full Text Available In bacteria, chromosomal DNA must be efficiently compacted to fit inside the small cell compartment while remaining available for the proteins involved in replication, segregation, and transcription. Among the nucleoid-associated proteins (NAPs responsible for maintaining this highly organized and yet dynamic chromosome structure, the HU protein is one of the most conserved and highly abundant. HupB, a homologue of HU, was recently identified in mycobacteria. This intriguing mycobacterial NAP is composed of two domains: an N-terminal domain that resembles bacterial HU, and a long and distinctive C-terminal domain that contains several PAKK/KAAK motifs, which are characteristic of the H1/H5 family of eukaryotic histones. In this study, we analyzed the in vivo binding of HupB on the chromosome scale. By using PALM (photoactivated localization microscopy and ChIP-Seq (chromatin immunoprecipitation followed by deep sequencing, we observed that the C-terminal domain is indispensable for the association of HupB with the nucleoid. Strikingly, the in vivo binding of HupB displayed a bias from the origin (oriC to the terminus (ter of the mycobacterial chromosome (numbers of binding sites decreased toward ter. We hypothesized that this binding mode reflects a role for HupB in organizing newly replicated oriC regions. Thus, HupB may be involved in coordinating replication with chromosome segregation.

  6. I-motif DNA structures are formed in the nuclei of human cells

    Science.gov (United States)

    Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel

    2018-06-01

    Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.

  7. Avian leukosis virus is a versatile eukaryotic platform for polypeptide display

    International Nuclear Information System (INIS)

    Khare, Pranay D.; Russell, Stephen J.; Federspiel, Mark J.

    2003-01-01

    Display technology refers to methods of generating libraries of modularly coded biomolecules and screening them for particular properties. Retroviruses are good candidates to be a eukaryotic viral platform for the display of polypeptides synthesized in eukaryotic cells. Here we demonstrate that avian leukosis virus (ALV) provides an ideal platform for display of nonviral polyaeptides expressed in a eukaryotic cell substrate. Different sizes of polypeptides were genetically fused to the extreme N-terminus of the ALV envelope glycoprotein in an ALV infectious clone containing an alkaline phosphatase reporter gene. The chimeric envelope glycoproteins were efficiently incorporated into virions and were stably displayed on the surface of the virions through multiple virus replication cycles. The foreign polypeptides did not interfere with the attachment and entry functions of the underlying ALV envelope glycoproteins. The displayed polypeptides were fully functional and could efficiently mediate attachment of the recombinant viruses to their respective cognate receptors. This study demonstrates that ALV is an ideal display platform for the generation and selection of libraries of polypeptides where there is a need for expression, folding, and posttranslational modification in the endoplasmic reticulum of eukaryotic cells

  8. QuadBase2: web server for multiplexed guanine quadruplex mining and visualization

    Science.gov (United States)

    Dhapola, Parashar; Chowdhury, Shantanu

    2016-01-01

    DNA guanine quadruplexes or G4s are non-canonical DNA secondary structures which affect genomic processes like replication, transcription and recombination. G4s are computationally identified by specific nucleotide motifs which are also called putative G4 (PG4) motifs. Despite the general relevance of these structures, there is currently no tool available that can allow batch queries and genome-wide analysis of these motifs in a user-friendly interface. QuadBase2 (quadbase.igib.res.in) presents a completely reinvented web server version of previously published QuadBase database. QuadBase2 enables users to mine PG4 motifs in up to 178 eukaryotes through the EuQuad module. This module interfaces with Ensembl Compara database, to allow users mine PG4 motifs in the orthologues of genes of interest across eukaryotes. PG4 motifs can be mined across genes and their promoter sequences in 1719 prokaryotes through ProQuad module. This module includes a feature that allows genome-wide mining of PG4 motifs and their visualization as circular histograms. TetraplexFinder, the module for mining PG4 motifs in user-provided sequences is now capable of handling up to 20 MB of data. QuadBase2 is a comprehensive PG4 motif mining tool that further expands the configurations and algorithms for mining PG4 motifs in a user-friendly way. PMID:27185890

  9. Arabidopsis transcription factors: genome-wide comparative analysis among eukaryotes.

    Science.gov (United States)

    Riechmann, J L; Heard, J; Martin, G; Reuber, L; Jiang, C; Keddie, J; Adam, L; Pineda, O; Ratcliffe, O J; Samaha, R R; Creelman, R; Pilgrim, M; Broun, P; Zhang, J Z; Ghandehari, D; Sherman, B K; Yu, G

    2000-12-15

    The completion of the Arabidopsis thaliana genome sequence allows a comparative analysis of transcriptional regulators across the three eukaryotic kingdoms. Arabidopsis dedicates over 5% of its genome to code for more than 1500 transcription factors, about 45% of which are from families specific to plants. Arabidopsis transcription factors that belong to families common to all eukaryotes do not share significant similarity with those of the other kingdoms beyond the conserved DNA binding domains, many of which have been arranged in combinations specific to each lineage. The genome-wide comparison reveals the evolutionary generation of diversity in the regulation of transcription.

  10. Identification of a putative nuclear export signal motif in human NANOG homeobox domain

    International Nuclear Information System (INIS)

    Park, Sung-Won; Do, Hyun-Jin; Huh, Sun-Hyung; Sung, Boreum; Uhm, Sang-Jun; Song, Hyuk; Kim, Nam-Hyung; Kim, Jae-Hwan

    2012-01-01

    Highlights: ► We found the putative nuclear export signal motif within human NANOG homeodomain. ► Leucine-rich residues are important for human NANOG homeodomain nuclear export. ► CRM1-specific inhibitor LMB blocked the potent human NANOG NES-mediated nuclear export. -- Abstract: NANOG is a homeobox-containing transcription factor that plays an important role in pluripotent stem cells and tumorigenic cells. To understand how nuclear localization of human NANOG is regulated, the NANOG sequence was examined and a leucine-rich nuclear export signal (NES) motif ( 125 MQELSNILNL 134 ) was found in the homeodomain (HD). To functionally validate the putative NES motif, deletion and site-directed mutants were fused to an EGFP expression vector and transfected into COS-7 cells, and the localization of the proteins was examined. While hNANOG HD exclusively localized to the nucleus, a mutant with both NLSs deleted and only the putative NES motif contained (hNANOG HD-ΔNLSs) was predominantly cytoplasmic, as observed by nucleo/cytoplasmic fractionation and Western blot analysis as well as confocal microscopy. Furthermore, site-directed mutagenesis of the putative NES motif in a partial hNANOG HD only containing either one of the two NLS motifs led to localization in the nucleus, suggesting that the NES motif may play a functional role in nuclear export. Furthermore, CRM1-specific nuclear export inhibitor LMB blocked the hNANOG potent NES-mediated export, suggesting that the leucine-rich motif may function in CRM1-mediated nuclear export of hNANOG. Collectively, a NES motif is present in the hNANOG HD and may be functionally involved in CRM1-mediated nuclear export pathway.

  11. Three distinct modes of intron dynamics in the evolution of eukaryotes.

    Science.gov (United States)

    Carmel, Liran; Wolf, Yuri I; Rogozin, Igor B; Koonin, Eugene V

    2007-07-01

    Several contrasting scenarios have been proposed for the origin and evolution of spliceosomal introns, a hallmark of eukaryotic genes. A comprehensive probabilistic model to obtain a definitive reconstruction of intron evolution was developed and applied to 391 sets of conserved genes from 19 eukaryotic species. It is inferred that a relatively high intron density was reached early, i.e., the last common ancestor of eukaryotes contained >2.15 introns/kilobase, and the last common ancestor of multicellular life forms harbored approximately 3.4 introns/kilobase, a greater intron density than in most of the extant fungi and in some animals. The rates of intron gain and intron loss appear to have been dropping during the last approximately 1.3 billion years, with the decline in the gain rate being much steeper. Eukaryotic lineages exhibit three distinct modes of evolution of the intron-exon structure. The primary, balanced mode, apparently, operates in all lineages. In this mode, intron gain and loss are strongly and positively correlated, in contrast to previous reports on inverse correlation between these processes. The second mode involves an elevated rate of intron loss and is prevalent in several lineages, such as fungi and insects. The third mode, characterized by elevated rate of intron gain, is seen only in deep branches of the tree, indicating that bursts of intron invasion occurred at key points in eukaryotic evolution, such as the origin of animals. Intron dynamics could depend on multiple mechanisms, and in the balanced mode, gain and loss of introns might share common mechanistic features.

  12. Arginine deiminase pathway enzymes: evolutionary history in metamonads and other eukaryotes.

    Science.gov (United States)

    Novák, Lukáš; Zubáčová, Zuzana; Karnkowska, Anna; Kolisko, Martin; Hroudová, Miluše; Stairs, Courtney W; Simpson, Alastair G B; Keeling, Patrick J; Roger, Andrew J; Čepička, Ivan; Hampl, Vladimír

    2016-10-06

    Multiple prokaryotic lineages use the arginine deiminase (ADI) pathway for anaerobic energy production by arginine degradation. The distribution of this pathway among eukaryotes has been thought to be very limited, with only two specialized groups living in low oxygen environments (Parabasalia and Diplomonadida) known to possess the complete set of all three enzymes. We have performed an extensive survey of available sequence data in order to map the distribution of these enzymes among eukaryotes and to reconstruct their phylogenies. We have found genes for the complete pathway in almost all examined representatives of Metamonada, the anaerobic protist group that includes parabasalids and diplomonads. Phylogenetic analyses indicate the presence of the complete pathway in the last common ancestor of metamonads and heterologous transformation experiments suggest its cytosolic localization in the metamonad ancestor. Outside Metamonada, the complete pathway occurs rarely, nevertheless, it was found in representatives of most major eukaryotic clades. Phylogenetic relationships of complete pathways are consistent with the presence of the Archaea-derived ADI pathway in the last common ancestor of all eukaryotes, although other evolutionary scenarios remain possible. The presence of the incomplete set of enzymes is relatively common among eukaryotes and it may be related to the fact that these enzymes are involved in other cellular processes, such as the ornithine-urea cycle. Single protein phylogenies suggest that the evolutionary history of all three enzymes has been shaped by frequent gene losses and horizontal transfers, which may sometimes be connected with their diverse roles in cellular metabolism.

  13. Leucine-based receptor sorting motifs are dependent on the spacing relative to the plasma membrane

    DEFF Research Database (Denmark)

    Geisler, C; Dietrich, J; Nielsen, B L

    1998-01-01

    Many integral membrane proteins contain leucine-based motifs within their cytoplasmic domains that mediate internalization and intracellular sorting. Two types of leucine-based motifs have been identified. One type is dependent on phosphorylation, whereas the other type, which includes an acidic...... amino acid, is constitutively active. In this study, we have investigated how the spacing relative to the plasma membrane affects the function of both types of leucine-based motifs. For phosphorylation-dependent leucine-based motifs, a minimal spacing of 7 residues between the plasma membrane...... and the phospho-acceptor was required for phosphorylation and thereby activation of the motifs. For constitutively active leucine-based motifs, a minimal spacing of 6 residues between the plasma membrane and the acidic residue was required for optimal activity of the motifs. In addition, we found that the acidic...

  14. Diversity of Eukaryotic Translational Initiation Factor eIF4E in Protists.

    Science.gov (United States)

    Jagus, Rosemary; Bachvaroff, Tsvetan R; Joshi, Bhavesh; Place, Allen R

    2012-01-01

    The greatest diversity of eukaryotic species is within the microbial eukaryotes, the protists, with plants and fungi/metazoa representing just two of the estimated seventy five lineages of eukaryotes. Protists are a diverse group characterized by unusual genome features and a wide range of genome sizes from 8.2 Mb in the apicomplexan parasite Babesia bovis to 112,000-220,050 Mb in the dinoflagellate Prorocentrum micans. Protists possess numerous cellular, molecular and biochemical traits not observed in "text-book" model organisms. These features challenge some of the concepts and assumptions about the regulation of gene expression in eukaryotes. Like multicellular eukaryotes, many protists encode multiple eIF4Es, but few functional studies have been undertaken except in parasitic species. An earlier phylogenetic analysis of protist eIF4Es indicated that they cannot be grouped within the three classes that describe eIF4E family members from multicellular organisms. Many more protist sequences are now available from which three clades can be recognized that are distinct from the plant/fungi/metazoan classes. Understanding of the protist eIF4Es will be facilitated as more sequences become available particularly for the under-represented opisthokonts and amoebozoa. Similarly, a better understanding of eIF4Es within each clade will develop as more functional studies of protist eIF4Es are completed.

  15. Rice MEL2, the RNA recognition motif (RRM) protein, binds in vitro to meiosis-expressed genes containing U-rich RNA consensus sequences in the 3'-UTR.

    Science.gov (United States)

    Miyazaki, Saori; Sato, Yutaka; Asano, Tomoya; Nagamura, Yoshiaki; Nonomura, Ken-Ichi

    2015-10-01

    Post-transcriptional gene regulation by RNA recognition motif (RRM) proteins through binding to cis-elements in the 3'-untranslated region (3'-UTR) is widely used in eukaryotes to complete various biological processes. Rice MEIOSIS ARRESTED AT LEPTOTENE2 (MEL2) is the RRM protein that functions in the transition to meiosis in proper timing. The MEL2 RRM preferentially associated with the U-rich RNA consensus, UUAGUU[U/A][U/G][A/U/G]U, dependently on sequences and proportionally to MEL2 protein amounts in vitro. The consensus sequences were located in the putative looped structures of the RNA ligand. A genome-wide survey revealed a tendency of MEL2-binding consensus appearing in 3'-UTR of rice genes. Of 249 genes that conserved the consensus in their 3'-UTR, 13 genes spatiotemporally co-expressed with MEL2 in meiotic flowers, and included several genes whose function was supposed in meiosis; such as Replication protein A and OsMADS3. The proteome analysis revealed that the amounts of small ubiquitin-related modifier-like protein and eukaryotic translation initiation factor3-like protein were dramatically altered in mel2 mutant anthers. Taken together with transcriptome and gene ontology results, we propose that the rice MEL2 is involved in the translational regulation of key meiotic genes on 3'-UTRs to achieve the faithful transition of germ cells to meiosis.

  16. DistAMo: A web-based tool to characterize DNA-motif distribution on bacterial chromosomes

    Directory of Open Access Journals (Sweden)

    Patrick eSobetzko

    2016-03-01

    Full Text Available Short DNA motifs are involved in a multitude of functions such as for example chromosome segregation, DNA replication or mismatch repair. Distribution of such motifs is often not random and the specific chromosomal pattern relates to the respective motif function. Computational approaches which quantitatively assess such chromosomal motif patterns are necessary. Here we present a new computer tool DistAMo (Distribution Analysis of DNA Motifs. The algorithm uses codon redundancy to calculate the relative abundance of short DNA motifs from single genes to entire chromosomes. Comparative genomics analyses of the GATC-motif distribution in γ-proteobacterial genomes using DistAMo revealed that (i genes beside the replication origin are enriched in GATCs, (ii genome-wide GATC distribution follows a distinct pattern and (iii genes involved in DNA replication and repair are enriched in GATCs. These features are specific for bacterial chromosomes encoding a Dam methyltransferase. The new software is available as a stand-alone or as an easy-to-use web-based server version at http://www.computational.bio.uni-giessen.de/distamo.

  17. Selection against spurious promoter motifs correlates withtranslational efficiency across bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Froula, Jeffrey L.; Francino, M. Pilar

    2007-05-01

    Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the {sigma}{sup 70} subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory sequences while eliminating them from the nonfunctional and, in most cases, from the protein coding regions. In some genomes, however, -10 sites are over-represented in the coding sequences; these sites could induce pauses effecting regulatory roles throughout the length of a transcriptional unit. For nonfunctional sequences, the extent of motif under-representation varies across genomes in a manner that broadly correlates with the number of tRNA genes, a good indicator of translational speed and growth rate. This suggests that minimizing the time invested in gene transcription is an important selective pressure against spurious binding. However, selection against spurious binding is detectable in the reduced genomes of host-restricted bacteria that grow at slow rates, indicating that components of efficiency other than speed may also be important. Minimizing the number of RNAP molecules per cell required for transcription, and the corresponding energetic expense, may be most relevant in slow growers. These results indicate that genome-level properties affecting the efficiency of transcription and translation can respond in an integrated manner to optimize gene expression. The detection of selection against promoter motifs in nonfunctional regions also implies that no sequence may evolve free of selective constraints, at least in the relatively small and unstructured genomes of bacteria.

  18. Origin of phagotrophic eukaryotes as social cheaters in microbial biofilms

    Directory of Open Access Journals (Sweden)

    Jékely Gáspár

    2007-01-01

    Full Text Available Abstract Background The origin of eukaryotic cells was one of the most dramatic evolutionary transitions in the history of life. It is generally assumed that eukaryotes evolved later then prokaryotes by the transformation or fusion of prokaryotic lineages. However, as yet there is no consensus regarding the nature of the prokaryotic group(s ancestral to eukaryotes. Regardless of this, a hardly debatable fundamental novel characteristic of the last eukaryotic common ancestor was the ability to exploit prokaryotic biomass by the ingestion of entire cells, i.e. phagocytosis. The recent advances in our understanding of the social life of prokaryotes may help to explain the origin of this form of total exploitation. Presentation of the hypothesis Here I propose that eukaryotic cells originated in a social environment, a differentiated microbial mat or biofilm that was maintained by the cooperative action of its members. Cooperation was costly (e.g. the production of developmental signals or an extracellular matrix but yielded benefits that increased the overall fitness of the social group. I propose that eukaryotes originated as selfish cheaters that enjoyed the benefits of social aggregation but did not contribute to it themselves. The cheaters later evolved into predators that lysed other cells and eventually became professional phagotrophs. During several cycles of social aggregation and dispersal the number of cheaters was contained by a chicken game situation, i.e. reproductive success of cheaters was high when they were in low abundance but was reduced when they were over-represented. Radical changes in cell structure, including the loss of the rigid prokaryotic cell wall and the development of endomembranes, allowed the protoeukaryotes to avoid cheater control and to exploit nutrients more efficiently. Cellular changes were buffered by both the social benefits and the protective physico-chemical milieu of the interior of biofilms. Symbiosis

  19. An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.

    Science.gov (United States)

    Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin

    2016-08-09

    Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance

  20. Convergent use of RhoGAP toxins by eukaryotic parasites and bacterial pathogens.

    Directory of Open Access Journals (Sweden)

    Dominique Colinet

    2007-12-01

    Full Text Available Inactivation of host Rho GTPases is a widespread strategy employed by bacterial pathogens to manipulate mammalian cellular functions and avoid immune defenses. Some bacterial toxins mimic eukaryotic Rho GTPase-activating proteins (GAPs to inactivate mammalian GTPases, probably as a result of evolutionary convergence. An intriguing question remains whether eukaryotic pathogens or parasites may use endogenous GAPs as immune-suppressive toxins to target the same key genes as bacterial pathogens. Interestingly, a RhoGAP domain-containing protein, LbGAP, was recently characterized from the parasitoid wasp Leptopilina boulardi, and shown to protect parasitoid eggs from the immune response of Drosophila host larvae. We demonstrate here that LbGAP has structural characteristics of eukaryotic RhoGAPs but that it acts similarly to bacterial RhoGAP toxins in mammals. First, we show by immunocytochemistry that LbGAP enters Drosophila immune cells, plasmatocytes and lamellocytes, and that morphological changes in lamellocytes are correlated with the quantity of LbGAP they contain. Demonstration that LbGAP displays a GAP activity and specifically interacts with the active, GTP-bound form of the two Drosophila Rho GTPases Rac1 and Rac2, both required for successful encapsulation of Leptopilina eggs, was then achieved using biochemical tests, yeast two-hybrid analysis, and GST pull-down assays. In addition, we show that the overall structure of LbGAP is similar to that of eukaryotic RhoGAP domains, and we identify distinct residues involved in its interaction with Rac GTPases. Altogether, these results show that eukaryotic parasites can use endogenous RhoGAPs as virulence factors and that despite their differences in sequence and structure, eukaryotic and bacterial RhoGAP toxins are similarly used to target the same immune pathways in insects and mammals.

  1. Characterization of a eukaryotic translation initiation factor 5A homolog from Tamarix androssowii involved in plant abiotic stress tolerance

    Directory of Open Access Journals (Sweden)

    Wang Liuqiang

    2012-07-01

    Full Text Available Abstract Background The eukaryotic translation initiation factor 5A (eIF5A promotes formation of the first peptide bond at the onset of protein synthesis. However, the function of eIF5A in plants is not well understood. Results In this study, we characterized the function of eIF5A (TaeIF5A1 from Tamarix androssowii. The promoter of TaeIF5A1 with 1,486 bp in length was isolated, and the cis-elements in the promoter were identified. A WRKY (TaWRKY and RAV (TaRAV protein can specifically bind to a W-box motif in the promoter of TaeIF5A1 and activate the expression of TaeIF5A1. Furthermore, TaeIF5A1, TaWRKY and TaRAV share very similar expression pattern and are all stress-responsive gene that functions in the abscisic acid (ABA signaling pathway, indicating that they are components of a single regulatory pathway. Transgenic yeast and poplar expressing TaeIF5A1 showed elevated protein levels combined with improved abiotic stresses tolerance. Furthermore, TaeIF5A1-transformed plants exhibited enhanced superoxide dismutase (SOD and peroxidase (POD activities, lower electrolyte leakage and higher chlorophyll content under salt stress. Conclusions These results suggested that TaeIF5A1 is involved in abiotic stress tolerance, and is likely regulated by transcription factors TaWRKY and TaRAV both of which can bind to the W-box motif. In addition, TaeIF5A1 may mediate stress tolerance by increasing protein synthesis, enhancing ROS scavenging by improving SOD and POD activities, and preventing chlorophyll loss and membrane damage. Therefore, eIF5A may play an important role in plant adaptation to changing environmental conditions.

  2. Characterization of a eukaryotic translation initiation factor 5A homolog from Tamarix androssowii involved in plant abiotic stress tolerance.

    Science.gov (United States)

    Wang, Liuqiang; Xu, Chenxi; Wang, Chao; Wang, Yucheng

    2012-07-26

    The eukaryotic translation initiation factor 5A (eIF5A) promotes formation of the first peptide bond at the onset of protein synthesis. However, the function of eIF5A in plants is not well understood. In this study, we characterized the function of eIF5A (TaeIF5A1) from Tamarix androssowii. The promoter of TaeIF5A1 with 1,486 bp in length was isolated, and the cis-elements in the promoter were identified. A WRKY (TaWRKY) and RAV (TaRAV) protein can specifically bind to a W-box motif in the promoter of TaeIF5A1 and activate the expression of TaeIF5A1. Furthermore, TaeIF5A1, TaWRKY and TaRAV share very similar expression pattern and are all stress-responsive gene that functions in the abscisic acid (ABA) signaling pathway, indicating that they are components of a single regulatory pathway. Transgenic yeast and poplar expressing TaeIF5A1 showed elevated protein levels combined with improved abiotic stresses tolerance. Furthermore, TaeIF5A1-transformed plants exhibited enhanced superoxide dismutase (SOD) and peroxidase (POD) activities, lower electrolyte leakage and higher chlorophyll content under salt stress. These results suggested that TaeIF5A1 is involved in abiotic stress tolerance, and is likely regulated by transcription factors TaWRKY and TaRAV both of which can bind to the W-box motif. In addition, TaeIF5A1 may mediate stress tolerance by increasing protein synthesis, enhancing ROS scavenging by improving SOD and POD activities, and preventing chlorophyll loss and membrane damage. Therefore, eIF5A may play an important role in plant adaptation to changing environmental conditions.

  3. Fragile DNA Motifs Trigger Mutagenesis at Distant Chromosomal Loci in Saccharomyces cerevisiae

    Science.gov (United States)

    Saini, Natalie; Zhang, Yu; Nishida, Yuri; Sheng, Ziwei; Choudhury, Shilpa; Mieczkowski, Piotr; Lobachev, Kirill S.

    2013-01-01

    DNA sequences capable of adopting non-canonical secondary structures have been associated with gross-chromosomal rearrangements in humans and model organisms. Previously, we have shown that long inverted repeats that form hairpin and cruciform structures and triplex-forming GAA/TTC repeats induce the formation of double-strand breaks which trigger genome instability in yeast. In this study, we demonstrate that breakage at both inverted repeats and GAA/TTC repeats is augmented by defects in DNA replication. Increased fragility is associated with increased mutation levels in the reporter genes located as far as 8 kb from both sides of the repeats. The increase in mutations was dependent on the presence of inverted or GAA/TTC repeats and activity of the translesion polymerase Polζ. Mutagenesis induced by inverted repeats also required Sae2 which opens hairpin-capped breaks and initiates end resection. The amount of breakage at the repeats is an important determinant of mutations as a perfect palindromic sequence with inherently increased fragility was also found to elevate mutation rates even in replication-proficient strains. We hypothesize that the underlying mechanism for mutagenesis induced by fragile motifs involves the formation of long single-stranded regions in the broken chromosome, invasion of the undamaged sister chromatid for repair, and faulty DNA synthesis employing Polζ. These data demonstrate that repeat-mediated breaks pose a dual threat to eukaryotic genome integrity by inducing chromosomal aberrations as well as mutations in flanking genes. PMID:23785298

  4. Gram-Negative Bacterial Sensors for Eukaryotic Signal Molecules

    Directory of Open Access Journals (Sweden)

    Olivier Lesouhaitier

    2009-09-01

    Full Text Available Ample evidence exists showing that eukaryotic signal molecules synthesized and released by the host can activate the virulence of opportunistic pathogens. The sensitivity of prokaryotes to host signal molecules requires the presence of bacterial sensors. These prokaryotic sensors, or receptors, have a double function: stereospecific recognition in a complex environment and transduction of the message in order to initiate bacterial physiological modifications. As messengers are generally unable to freely cross the bacterial membrane, they require either the presence of sensors anchored in the membrane or transporters allowing direct recognition inside the bacterial cytoplasm. Since the discovery of quorum sensing, it was established that the production of virulence factors by bacteria is tightly growth-phase regulated. It is now obvious that expression of bacterial virulence is also controlled by detection of the eukaryotic messengers released in the micro-environment as endocrine or neuro-endocrine modulators. In the presence of host physiological stress many eukaryotic factors are released and detected by Gram-negative bacteria which in return rapidly adapt their physiology. For instance, Pseudomonas aeruginosa can bind elements of the host immune system such as interferon-γ and dynorphin and then through quorum sensing circuitry enhance its virulence. Escherichia coli sensitivity to the neurohormones of the catecholamines family appears relayed by a recently identified bacterial adrenergic receptor. In the present review, we will describe the mechanisms by which various eukaryotic signal molecules produced by host may activate Gram-negative bacteria virulence. Particular attention will be paid to Pseudomonas, a genus whose representative species, P. aeruginosa, is a common opportunistic pathogen. The discussion will be particularly focused on the pivotal role played by these new types of pathogen sensors from the sensing to the transduction

  5. Argo_CUDA: Exhaustive GPU based approach for motif discovery in large DNA datasets.

    Science.gov (United States)

    Vishnevsky, Oleg V; Bocharnikov, Andrey V; Kolchanov, Nikolay A

    2018-02-01

    The development of chromatin immunoprecipitation sequencing (ChIP-seq) technology has revolutionized the genetic analysis of the basic mechanisms underlying transcription regulation and led to accumulation of information about a huge amount of DNA sequences. There are a lot of web services which are currently available for de novo motif discovery in datasets containing information about DNA/protein binding. An enormous motif diversity makes their finding challenging. In order to avoid the difficulties, researchers use different stochastic approaches. Unfortunately, the efficiency of the motif discovery programs dramatically declines with the query set size increase. This leads to the fact that only a fraction of top "peak" ChIP-Seq segments can be analyzed or the area of analysis should be narrowed. Thus, the motif discovery in massive datasets remains a challenging issue. Argo_Compute Unified Device Architecture (CUDA) web service is designed to process the massive DNA data. It is a program for the detection of degenerate oligonucleotide motifs of fixed length written in 15-letter IUPAC code. Argo_CUDA is a full-exhaustive approach based on the high-performance GPU technologies. Compared with the existing motif discovery web services, Argo_CUDA shows good prediction quality on simulated sets. The analysis of ChIP-Seq sequences revealed the motifs which correspond to known transcription factor binding sites.

  6. iFORM: Incorporating Find Occurrence of Regulatory Motifs.

    Science.gov (United States)

    Ren, Chao; Chen, Hebing; Yang, Bite; Liu, Feng; Ouyang, Zhangyi; Bo, Xiaochen; Shu, Wenjie

    2016-01-01

    Accurately identifying the binding sites of transcription factors (TFs) is crucial to understanding the mechanisms of transcriptional regulation and human disease. We present incorporating Find Occurrence of Regulatory Motifs (iFORM), an easy-to-use and efficient tool for scanning DNA sequences with TF motifs described as position weight matrices (PWMs). Both performance assessment with a receiver operating characteristic (ROC) curve and a correlation-based approach demonstrated that iFORM achieves higher accuracy and sensitivity by integrating five classical motif discovery programs using Fisher's combined probability test. We have used iFORM to provide accurate results on a variety of data in the ENCODE Project and the NIH Roadmap Epigenomics Project, and the tool has demonstrated its utility in further elucidating individual roles of functional elements. Both the source and binary codes for iFORM can be freely accessed at https://github.com/wenjiegroup/iFORM. The identified TF binding sites across human cell and tissue types using iFORM have been deposited in the Gene Expression Omnibus under the accession ID GSE53962.

  7. Lucky Motifs in Chinese Folk Art: Interpreting Paper-cut from Chinese Shaanxi

    OpenAIRE

    Xuxiao WANG

    2013-01-01

    Paper-cut is not simply a form of traditional Chinese folk art. Lucky motifs developed in paper-cut certainly acquired profound cultural connotations. As paper-cut is a time-honoured skill across the nation, interpreting those motifs requires cultural receptiveness and anthropological sensitivity. The author of this article analyzes examples of paper-cut from Northern Shaanxi, China, to identify the cohesive motifs and explore the auspiciousness of the specific concepts of Fu, Lu, Shou, Xi. T...

  8. Diversity patterns of microbial eukaryotes mirror those of bacteria in Antarctic cryoconite holes.

    Science.gov (United States)

    Sommers, Pacifica; Darcy, John L; Gendron, Eli M S; Stanish, Lee F; Bagshaw, Elizabeth A; Porazinska, Dorota L; Schmidt, Steven K

    2018-01-01

    Ice-lidded cryoconite holes on glaciers in the Taylor Valley, Antarctica, provide a unique system of natural mesocosms for studying community structure and assembly. We used high-throughput DNA sequencing to characterize both microbial eukaryotic communities and bacterial communities within cryoconite holes across three glaciers to study similarities in their spatial patterns. We expected that the alpha (phylogenetic diversity) and beta (pairwise community dissimilarity) diversity patterns of eukaryotes in cryoconite holes would be related to those of bacteria, and that they would be related to the biogeochemical gradient within the Taylor Valley. We found that eukaryotic alpha and beta diversity were strongly related to those of bacteria across scales ranging from 140 m to 41 km apart. Alpha diversity of both was significantly related to position in the valley and surface area of the cryoconite hole, with pH also significantly correlated with the eukaryotic diversity. Beta diversity for both bacteria and eukaryotes was significantly related to position in the valley, with bacterial beta diversity also related to nitrate. These results are consistent with transport of sediments onto glaciers occurring primarily at local scales relative to the size of the valley, thus creating feedbacks in local chemistry and diversity. © FEMS 2017. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  9. Characterization of prokaryotic and eukaryotic promoters using hidden Markov models

    DEFF Research Database (Denmark)

    Pedersen, Anders Gorm; Baldi, P.; Chauvin, Y.

    1996-01-01

    In this paper we utilize hidden Markov models (HMMs) and information theory to analyze prokaryotic and eukaryotic promoters. We perform this analysis with special emphasis on the fact that promoters are divided into a number of different classes, depending on which polymerase-associated factors...... that bind to them. We find that HMMs trained on such subclasses of Escherichia coli promoters (specifically, the so-called sigma 70 and sigma 54 classes) give an excellent classification of unknown promoters with respect to sigma-class. HMMs trained on eukaryotic sequences from human genes also model nicely...

  10. MOMFER: A Search Engine of Thompson's Motif-Index of Folk Literature

    NARCIS (Netherlands)

    Karsdorp, F.B.; van der Meulen, Marten; Meder, Theo; van den Bosch, Antal

    2015-01-01

    More than fifty years after the first edition of Thompson's seminal Motif-Indexof Folk Literature, we present an online search engine tailored to fully disclose the index digitally. This search engine, called MOMFER, greatly enhances the searchability of the Motif-Index and provides exciting new

  11. LDsplit: screening for cis-regulatory motifs stimulating meiotic recombination hotspots by analysis of DNA sequence polymorphisms.

    Science.gov (United States)

    Yang, Peng; Wu, Min; Guo, Jing; Kwoh, Chee Keong; Przytycka, Teresa M; Zheng, Jie

    2014-02-17

    As a fundamental genomic element, meiotic recombination hotspot plays important roles in life sciences. Thus uncovering its regulatory mechanisms has broad impact on biomedical research. Despite the recent identification of the zinc finger protein PRDM9 and its 13-mer binding motif as major regulators for meiotic recombination hotspots, other regulators remain to be discovered. Existing methods for finding DNA sequence motifs of recombination hotspots often rely on the enrichment of co-localizations between hotspots and short DNA patterns, which ignore the cross-individual variation of recombination rates and sequence polymorphisms in the population. Our objective in this paper is to capture signals encoded in genetic variations for the discovery of recombination-associated DNA motifs. Recently, an algorithm called "LDsplit" has been designed to detect the association between single nucleotide polymorphisms (SNPs) and proximal meiotic recombination hotspots. The association is measured by the difference of population recombination rates at a hotspot between two alleles of a candidate SNP. Here we present an open source software tool of LDsplit, with integrative data visualization for recombination hotspots and their proximal SNPs. Applying LDsplit on SNPs inside an established 7-mer motif bound by PRDM9 we observed that SNP alleles preserving the original motif tend to have higher recombination rates than the opposite alleles that disrupt the motif. Running on SNP windows around hotspots each containing an occurrence of the 7-mer motif, LDsplit is able to guide the established motif finding algorithm of MEME to recover the 7-mer motif. In contrast, without LDsplit the 7-mer motif could not be identified. LDsplit is a software tool for the discovery of cis-regulatory DNA sequence motifs stimulating meiotic recombination hotspots by screening and narrowing down to hotspot associated SNPs. It is the first computational method that utilizes the genetic variation of

  12. BayesMD: flexible biological modeling for motif discovery

    DEFF Research Database (Denmark)

    Tang, Man-Hung Eric; Krogh, Anders; Winther, Ole

    2008-01-01

    We present BayesMD, a Bayesian Motif Discovery model with several new features. Three different types of biological a priori knowledge are built into the framework in a modular fashion. A mixture of Dirichlets is used as prior over nucleotide probabilities in binding sites. It is trained on trans......We present BayesMD, a Bayesian Motif Discovery model with several new features. Three different types of biological a priori knowledge are built into the framework in a modular fashion. A mixture of Dirichlets is used as prior over nucleotide probabilities in binding sites. It is trained...

  13. Lucky Motifs in Chinese Folk Art: Interpreting Paper-cut from Chinese Shaanxi

    Directory of Open Access Journals (Sweden)

    Xuxiao WANG

    2013-11-01

    Full Text Available Paper-cut is not simply a form of traditional Chinese folk art. Lucky motifs developed in paper-cut certainly acquired profound cultural connotations. As paper-cut is a time-honoured skill across the nation, interpreting those motifs requires cultural receptiveness and anthropological sensitivity. The author of this article analyzes examples of paper-cut from Northern Shaanxi, China, to identify the cohesive motifs and explore the auspiciousness of the specific concepts of Fu, Lu, Shou, Xi. The paper-cut of Northern Shaanxi is an ideal representative of the craft as a whole because of the relative stability of this region in history, in terms of both art and culture. Furthermore, its straightforward style provides a clear demonstration of motifs regarding folk understanding of expectations for life.

  14. Eukaryotic transcription factors

    DEFF Research Database (Denmark)

    Staby, Lasse; O'Shea, Charlotte; Willemoës, Martin

    2017-01-01

    Gene-specific transcription factors (TFs) are key regulatory components of signaling pathways, controlling, for example, cell growth, development, and stress responses. Their biological functions are determined by their molecular structures, as exemplified by their structured DNA-binding domains...... regions with function-related, short sequence motifs and molecular recognition features with structural propensities. This review focuses on molecular aspects of TFs, which represent paradigms of ID-related features. Through specific examples, we review how the ID-associated flexibility of TFs enables....... It is furthermore emphasized how classic biochemical concepts like allostery, conformational selection, induced fit, and feedback regulation are undergoing a revival with the appreciation of ID. The review also describes the most recent advances based on computational simulations of ID-based interaction mechanisms...

  15. Eukaryotic ribonucleases P/MRP: the crystal structure of the P3 domain.

    Science.gov (United States)

    Perederina, Anna; Esakova, Olga; Quan, Chao; Khanova, Elena; Krasilnikov, Andrey S

    2010-02-17

    Ribonuclease (RNase) P is a site-specific endoribonuclease found in all kingdoms of life. Typical RNase P consists of a catalytic RNA component and a protein moiety. In the eukaryotes, the RNase P lineage has split into two, giving rise to a closely related enzyme, RNase MRP, which has similar components but has evolved to have different specificities. The eukaryotic RNases P/MRP have acquired an essential helix-loop-helix protein-binding RNA domain P3 that has an important function in eukaryotic enzymes and distinguishes them from bacterial and archaeal RNases P. Here, we present a crystal structure of the P3 RNA domain from Saccharomyces cerevisiae RNase MRP in a complex with RNase P/MRP proteins Pop6 and Pop7 solved to 2.7 A. The structure suggests similar structural organization of the P3 RNA domains in RNases P/MRP and possible functions of the P3 domains and proteins bound to them in the stabilization of the holoenzymes' structures as well as in interactions with substrates. It provides the first insight into the structural organization of the eukaryotic enzymes of the RNase P/MRP family.

  16. Sequence alignment reveals possible MAPK docking motifs on HIV proteins.

    Directory of Open Access Journals (Sweden)

    Perry Evans

    Full Text Available Over the course of HIV infection, virus replication is facilitated by the phosphorylation of HIV proteins by human ERK1 and ERK2 mitogen-activated protein kinases (MAPKs. MAPKs are known to phosphorylate their substrates by first binding with them at a docking site. Docking site interactions could be viable drug targets because the sequences guiding them are more specific than phosphorylation consensus sites. In this study we use multiple bioinformatics tools to discover candidate MAPK docking site motifs on HIV proteins known to be phosphorylated by MAPKs, and we discuss the possibility of targeting docking sites with drugs. Using sequence alignments of HIV proteins of different subtypes, we show that MAPK docking patterns previously described for human proteins appear on the HIV matrix, Tat, and Vif proteins in a strain dependent manner, but are absent from HIV Rev and appear on all HIV Nef strains. We revise the regular expressions of previously annotated MAPK docking patterns in order to provide a subtype independent motif that annotates all HIV proteins. One revision is based on a documented human variant of one of the substrate docking motifs, and the other reduces the number of required basic amino acids in the standard docking motifs from two to one. The proposed patterns are shown to be consistent with in silico docking between ERK1 and the HIV matrix protein. The motif usage on HIV proteins is sufficiently different from human proteins in amino acid sequence similarity to allow for HIV specific targeting using small-molecule drugs.

  17. Stochastic Resonance in Neuronal Network Motifs with Ornstein-Uhlenbeck Colored Noise

    Directory of Open Access Journals (Sweden)

    Xuyang Lou

    2014-01-01

    Full Text Available We consider here the effect of the Ornstein-Uhlenbeck colored noise on the stochastic resonance of the feed-forward-loop (FFL network motif. The FFL motif is modeled through the FitzHugh-Nagumo neuron model as well as the chemical coupling. Our results show that the noise intensity and the correlation time of the noise process serve as the control parameters, which have great impacts on the stochastic dynamics of the FFL motif. We find that, with a proper choice of noise intensities and the correlation time of the noise process, the signal-to-noise ratio (SNR can display more than one peak.

  18. Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise

    Science.gov (United States)

    Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San

    2018-01-01

    Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…

  19. Wayward Warriors: The Viking Motif in Swedish and English Children's Literature

    Science.gov (United States)

    Sundmark, Björn

    2014-01-01

    In this article the Viking motif in children's literature is explored--from its roots in (adult) nationalist and antiquarian discourse, over pedagogical and historical texts for children, to the eventual diversification (or dissolution) of the motif into different genres and forms. The focus is on Swedish Viking narratives, but points of…

  20. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

    Science.gov (United States)

    Catania, Francesco; Lynch, Michael

    2010-05-04

    In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  1. SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.

    Science.gov (United States)

    Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude

    2011-07-01

    The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.

  2. Visualizing Patterns of Marine Eukaryotic Diversity from Metabarcoding Data Using QIIME.

    Science.gov (United States)

    Leray, Matthieu; Knowlton, Nancy

    2016-01-01

    PCR amplification followed by deep sequencing of homologous gene regions is increasingly used to characterize the diversity and taxonomic composition of marine eukaryotic communities. This approach may generate millions of sequences for hundreds of samples simultaneously. Therefore, tools that researchers can use to visualize complex patterns of diversity for these massive datasets are essential. Efforts by microbiologists to understand the Earth and human microbiomes using high-throughput sequencing of the 16S rRNA gene has led to the development of several user-friendly, open-source software packages that can be similarly used to analyze eukaryotic datasets. Quantitative Insights Into Microbial Ecology (QIIME) offers some of the most helpful data visualization tools. Here, we describe functionalities to import OTU tables generated with any molecular marker (e.g., 18S, COI, ITS) and associated metadata into QIIME. We then present a range of analytical tools implemented within QIIME that can be used to obtain insights about patterns of alpha and beta diversity for marine eukaryotes.

  3. A second pathway to degrade pyrimidine nucleic acid precursors in eukaryotes

    DEFF Research Database (Denmark)

    Andersen, Gorm; Bjornberg, Olof; Polakova, Silvia

    2008-01-01

    Pyrimidine bases are the central precursors for RNA and DNA, and their intracellular pools are determined by de novo, salvage and catabolic pathways. In eukaryotes, degradation of uracil has been believed to proceed only via the reduction to dihydrouracil. Using a yeast model, Saccharomyces kluyv...... of the eukaryotic or prokaryotic genes involved in pyrimidine degradation described to date.......Pyrimidine bases are the central precursors for RNA and DNA, and their intracellular pools are determined by de novo, salvage and catabolic pathways. In eukaryotes, degradation of uracil has been believed to proceed only via the reduction to dihydrouracil. Using a yeast model, Saccharomyces......, respectively. The gene products of URC1 and URC4 are highly conserved proteins with so far unknown functions and they are present in a variety of prokaryotes and fungi. In bacteria and in some fungi, URC1 and URC4 are linked on the genome together with the gene for uracil phosphoribosyltransferase (URC6). Urc1...

  4. Finding a Leucine in a Haystack: Searching the Proteome for ambigous Leucine-Aspartic Acid motifs

    KAUST Repository

    Arold, Stefan T.

    2016-01-01

    LDMF predicted 13 new LD motifs in humans. Using biophysical assays, we experimentally confirmed in vitro interactions for four novel LD motif proteins. Thus, LDMF allows proteome-wide discovery of LD motifs, despite a highly ambiguous sequence pattern. Functional implications will be discussed.

  5. Discovery of cell-type specific DNA motif grammar in cis-regulatory elements using random Forest.

    Science.gov (United States)

    Wang, Xin; Lin, Peijie; Ho, Joshua W K

    2018-01-19

    It has been observed that many transcription factors (TFs) can bind to different genomic loci depending on the cell type in which a TF is expressed in, even though the individual TF usually binds to the same core motif in different cell types. How a TF can bind to the genome in such a highly cell-type specific manner, is a critical research question. One hypothesis is that a TF requires co-binding of different TFs in different cell types. If this is the case, it may be possible to observe different combinations of TF motifs - a motif grammar - located at the TF binding sites in different cell types. In this study, we develop a bioinformatics method to systematically identify DNA motifs in TF binding sites across multiple cell types based on published ChIP-seq data, and address two questions: (1) can we build a machine learning classifier to predict cell-type specificity based on motif combinations alone, and (2) can we extract meaningful cell-type specific motif grammars from this classifier model. We present a Random Forest (RF) based approach to build a multi-class classifier to predict the cell-type specificity of a TF binding site given its motif content. We applied this RF classifier to two published ChIP-seq datasets of TF (TCF7L2 and MAX) across multiple cell types. Using cross-validation, we show that motif combinations alone are indeed predictive of cell types. Furthermore, we present a rule mining approach to extract the most discriminatory rules in the RF classifier, thus allowing us to discover the underlying cell-type specific motif grammar. Our bioinformatics analysis supports the hypothesis that combinatorial TF motif patterns are cell-type specific.

  6. POWRS: position-sensitive motif discovery.

    Directory of Open Access Journals (Sweden)

    Ian W Davis

    Full Text Available Transcription factors and the short, often degenerate DNA sequences they recognize are central regulators of gene expression, but their regulatory code is challenging to dissect experimentally. Thus, computational approaches have long been used to identify putative regulatory elements from the patterns in promoter sequences. Here we present a new algorithm "POWRS" (POsition-sensitive WoRd Set for identifying regulatory sequence motifs, specifically developed to address two common shortcomings of existing algorithms. First, POWRS uses the position-specific enrichment of regulatory elements near transcription start sites to significantly increase sensitivity, while providing new information about the preferred localization of those elements. Second, POWRS forgoes position weight matrices for a discrete motif representation that appears more resistant to over-generalization. We apply this algorithm to discover sequences related to constitutive, high-level gene expression in the model plant Arabidopsis thaliana, and then experimentally validate the importance of those elements by systematically mutating two endogenous promoters and measuring the effect on gene expression levels. This provides a foundation for future efforts to rationally engineer gene expression in plants, a problem of great importance in developing biotech crop varieties.BSD-licensed Python code at http://grassrootsbio.com/papers/powrs/.

  7. Prokaryotes versus Eukaryotes: Who is hosting whom?

    Directory of Open Access Journals (Sweden)

    Guillermo eTellez

    2014-10-01

    Full Text Available Microorganisms represent the largest component of biodiversity in our world. For millions of years, prokaryotic microorganisms have functioned as a major selective force shaping eukaryotic evolution. Microbes that live inside and on animals outnumber the animals’ actual somatic and germ cells by an estimated 10-fold. Collectively, the intestinal microbiome represents a ‘forgotten organ’, functioning as an organ inside another that can execute many physiological responsibilities. The nature of primitive eukaryotes was drastically changed due to the association with symbiotic prokaryotes facilitating mutual coevolution of host and microbe. Phytophagous insects have long been used to test theories of evolutionary diversification; moreover, the diversification of a number of phytophagous insect lineages has been linked to mutualisms with microbes. From termites and honey bees to ruminants and mammals, depending on novel biochemistries provided by the prokaryotic microbiome, the association helps to metabolize several nutrients that the host cannot digest and converting these into useful end products (such as short chain fatty acids, a process which has huge impact on the biology and homeostasis of metazoans. More importantly, in a direct and/or indirect way, the intestinal microbiota influences the assembly of gut-associated lymphoid tissue, helps to educate immune system, affects the integrity of the intestinal mucosal barrier, modulates proliferation and differentiation of its epithelial lineages, regulates angiogenesis, and modifies the activity of enteric as well as the central nervous system,. Despite these important effects, the mechanisms by which the gut microbial community influences the host’s biology remains almost entirely unknown. Our aim here is to encourage empirical inquiry into the relationship between mutualism and evolutionary diversification between prokaryotes and eukaryotes which encourage us to postulate: Who is

  8. APOCALYPTIC MOTIFS IN THE CYCLE OF STORIES BY M.A. BULGAKOV «NOTES OF A YOUNG DOCTOR»

    Directory of Open Access Journals (Sweden)

    Evgeniy Igorevich Erokhov

    2015-10-01

    Full Text Available The motif analysis of a cycle of stories by M.A. Bulgakov «Notes of a Young Doctor» from the point of view of their apocalyptic problematics was first performed in this article. To identify apocalyptic motifs the method of motif analysis, developed by B.M. Gasparov, was used which will also help to prove the interpenetration of motifs in the cycle of stories. The result of the research work is the identification of apocalyptic motifs which are manifested in the experiences of the main character and the events taking place around him and passing through the prism of physician’s perception of the world. Our identified motifs show that the stories in the cycle are united not only thematically and with the help of the image of the main character, but with the help of the motifs which reflect interpenetration of apocalyptic motifs in the stories of one cycle. There are the following apocalyptic motifs in the cycle of stories by Bulgakov: diseases, darkness (as part of the landscape, resurrection from the dead and beast. They all belong to the biblical type which is allocated on the basis of the associative bond of these motifs with the biblical texts.

  9. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo; Jankovic, Boris R.; Bajic, Vladimir B.; Song, Le; Gao, Xin

    2013-01-01

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  10. Poly(A) motif prediction using spectral latent features from human DNA sequences

    KAUST Repository

    Xie, Bo

    2013-06-21

    Motivation: Polyadenylation is the addition of a poly(A) tail to an RNA molecule. Identifying DNA sequence motifs that signal the addition of poly(A) tails is essential to improved genome annotation and better understanding of the regulatory mechanisms and stability of mRNA.Existing poly(A) motif predictors demonstrate that information extracted from the surrounding nucleotide sequences of candidate poly(A) motifs can differentiate true motifs from the false ones to a great extent. A variety of sophisticated features has been explored, including sequential, structural, statistical, thermodynamic and evolutionary properties. However, most of these methods involve extensive manual feature engineering, which can be time-consuming and can require in-depth domain knowledge.Results: We propose a novel machine-learning method for poly(A) motif prediction by marrying generative learning (hidden Markov models) and discriminative learning (support vector machines). Generative learning provides a rich palette on which the uncertainty and diversity of sequence information can be handled, while discriminative learning allows the performance of the classification task to be directly optimized. Here, we used hidden Markov models for fitting the DNA sequence dynamics, and developed an efficient spectral algorithm for extracting latent variable information from these models. These spectral latent features were then fed into support vector machines to fine-tune the classification performance.We evaluated our proposed method on a comprehensive human poly(A) dataset that consists of 14 740 samples from 12 of the most abundant variants of human poly(A) motifs. Compared with one of the previous state-of-the-art methods in the literature (the random forest model with expert-crafted features), our method reduces the average error rate, false-negative rate and false-positive rate by 26, 15 and 35%, respectively. Meanwhile, our method makes ?30% fewer error predictions relative to the other

  11. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

    Directory of Open Access Journals (Sweden)

    Lynch Michael

    2010-05-01

    Full Text Available Abstract Background In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa remains a virtually unexplored issue. Results By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions Our observations 1 shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2 are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3 reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  12. Linear side chains in benzo[1,2-b:4,5-b′]dithiophene-thieno[3,4-c] pyrrole-4,6-dione polymers direct self-assembly and solar cell performance

    KAUST Repository

    Cabanetos, Clement

    2013-03-27

    While varying the size and branching of solubilizing side chains in π-conjugated polymers impacts their self-assembling properties in thin-film devices, these structural changes remain difficult to anticipate. This report emphasizes the determining role that linear side-chain substituents play in poly(benzo[1,2-b:4,5-b′]dithiophene-thieno[3,4-c]pyrrole-4,6-dione) (PBDTTPD) polymers for bulk heterojunction (BHJ) solar cell applications. We show that replacing branched side chains by linear ones in the BDT motifs induces a critical change in polymer self-assembly and backbone orientation in thin films that correlates with a dramatic drop in solar cell efficiency. In contrast, we show that for polymers with branched alkyl-substituted BDT motifs, controlling the number of aliphatic carbons in the linear N-alkyl-substituted TPD motifs is a major contributor to improved material performance. With this approach, PBDTTPD polymers were found to reach power conversion efficiencies of 8.5% and open-circuit voltages of 0.97 V in BHJ devices with PC71BM, making PBDTTPD one of the best polymer donors for use in the high-band-gap cell of tandem solar cells. © 2013 American Chemical Society.

  13. Uncoupling of Sister Replisomes during Eukaryotic DNA Replication

    NARCIS (Netherlands)

    Yardimci, Hasan; Loveland, Anna B.; Habuchi, Satoshi; van Oijen, Antoine M.; Walter, Johannes C.

    2010-01-01

    The duplication of eukaryotic genomes involves the replication of DNA from multiple origins of replication. In S phase, two sister replisomes assemble at each active origin, and they replicate DNA in opposite directions. Little is known about the functional relationship between sister replisomes.

  14. Molecular detection of eukaryotes in a single human stool sample from Senegal.

    Directory of Open Access Journals (Sweden)

    Ibrahim Hamad

    Full Text Available BACKGROUND: Microbial eukaryotes represent an important component of the human gut microbiome, with different beneficial or harmful roles; some species are commensal or mutualistic, whereas others are opportunistic or parasitic. The diversity of eukaryotes inhabiting humans remains relatively unexplored because of either the low abundance of these organisms in human gut or because they have received limited attention from a whole-community perspective. METHODOLOGY/PRINCIPAL FINDING: In this study, a single fecal sample from a healthy African male was studied using both culture-dependent methods and extended molecular methods targeting the 18S rRNA and ITS sequences. Our results revealed that very few fungi, including Candida spp., Galactomyces spp., and Trichosporon asahii, could be isolated using culture-based methods. In contrast, a relatively a high number of eukaryotic species could be identified in this fecal sample when culture-independent methods based on various primer sets were used. A total of 27 species from one sample were found among the 977 analyzed clones. The clone libraries were dominated by fungi (716 clones/977, 73.3%, corresponding to 16 different species. In addition, 187 sequences out of 977 (19.2% corresponded to 9 different species of plants; 59 sequences (6% belonged to other micro-eukaryotes in the gut, including Entamoeba hartmanni and Blastocystis sp; and only 15 clones/977 (1.5% were related to human 18S rRNA sequences. CONCLUSION: Our results revealed a complex eukaryotic community in the volunteer's gut, with fungi being the most abundant species in the stool sample. Larger investigations are needed to assess the generality of these results and to understand their roles in human health and disease.

  15. Discriminative Motif Discovery via Simulated Evolution and Random Under-Sampling

    OpenAIRE

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the sta...

  16. Metabolic profiles of prokaryotic and eukaryotic communities in deep-sea sponge Neamphius huxleyi indicated by metagenomics

    Science.gov (United States)

    Li, Zhi-Yong; Wang, Yue-Zhu; He, Li-Ming; Zheng, Hua-Jun

    2014-01-01

    The whole metabolism of a sponge holobiont and the respective contributions of prokaryotic and eukaryotic symbionts and their associations with the sponge host remain largely unclear. Meanwhile, compared with shallow water sponges, deep-sea sponges are rarely understood. Here we report the metagenomic exploration of deep-sea sponge Neamphius huxleyi at the whole community level. Metagenomic data showed phylogenetically diverse prokaryotes and eukaryotes in Neamphius huxleyi. MEGAN and gene enrichment analyses indicated different metabolic potentials of prokaryotic symbionts from eukaryotic symbionts, especially in nitrogen and carbon metabolisms, and their molecular interactions with the sponge host. These results supported the hypothesis that prokaryotic and eukaryotic symbionts have different ecological roles and relationships with sponge host. Moreover, vigorous denitrification, and CO2 fixation by chemoautotrophic prokaryotes were suggested for this deep-sea sponge. The study provided novel insights into the respective potentials of prokaryotic and eukaryotic symbionts and their associations with deep-sea sponge Neamphius huxleyi. PMID:24463735

  17. Metabolic profiles of prokaryotic and eukaryotic communities in deep-sea sponge Lamellomorpha sp. indicated by metagenomics

    Science.gov (United States)

    Li, Zhi-Yong; Wang, Yue-Zhu; He, Li-Ming; Zheng, Hua-Jun

    2014-01-01

    The whole metabolism of a sponge holobiont and the respective contributions of prokaryotic and eukaryotic symbionts and their associations with the sponge host remain largely unclear. Meanwhile, compared with shallow water sponges, deep-sea sponges are rarely understood. Here we report the metagenomic exploration of deep-sea sponge Lamellomorpha sp. at the whole community level. Metagenomic data showed phylogenetically diverse prokaryotes and eukaryotes in Lamellomorpha sp.. MEGAN and gene enrichment analyses indicated different metabolic potentials of prokaryotic symbionts from eukaryotic symbionts, especially in nitrogen and carbon metabolisms, and their molecular interactions with the sponge host. These results supported the hypothesis that prokaryotic and eukaryotic symbionts have different ecological roles and relationships with sponge host. Moreover, vigorous denitrification, and CO2 fixation by chemoautotrophic prokaryotes were suggested for this deep-sea sponge. The study provided novel insights into the respective potentials of prokaryotic and eukaryotic symbionts and their associations with deep-sea sponge Lamellomorpha sp..

  18. Insights into the Initiation of Eukaryotic DNA Replication.

    Science.gov (United States)

    Bruck, Irina; Perez-Arnaiz, Patricia; Colbert, Max K; Kaplan, Daniel L

    2015-01-01

    The initiation of DNA replication is a highly regulated event in eukaryotic cells to ensure that the entire genome is copied once and only once during S phase. The primary target of cellular regulation of eukaryotic DNA replication initiation is the assembly and activation of the replication fork helicase, the 11-subunit assembly that unwinds DNA at a replication fork. The replication fork helicase, called CMG for Cdc45-Mcm2-7, and GINS, assembles in S phase from the constituent Cdc45, Mcm2-7, and GINS proteins. The assembly and activation of the CMG replication fork helicase during S phase is governed by 2 S-phase specific kinases, CDK and DDK. CDK stimulates the interaction between Sld2, Sld3, and Dpb11, 3 initiation factors that are each required for the initiation of DNA replication. DDK, on the other hand, phosphorylates the Mcm2, Mcm4, and Mcm6 subunits of the Mcm2-7 complex. Sld3 recruits Cdc45 to Mcm2-7 in a manner that depends on DDK, and recent work suggests that Sld3 binds directly to Mcm2-7 and also to single-stranded DNA. Furthermore, recent work demonstrates that Sld3 and its human homolog Treslin substantially stimulate DDK phosphorylation of Mcm2. These data suggest that the initiation factor Sld3/Treslin coordinates the assembly and activation of the eukaryotic replication fork helicase by recruiting Cdc45 to Mcm2-7, stimulating DDK phosphorylation of Mcm2, and binding directly to single-stranded DNA as the origin is melted.

  19. Identification of helix capping and {beta}-turn motifs from NMR chemical shifts

    Energy Technology Data Exchange (ETDEWEB)

    Shen Yang; Bax, Ad, E-mail: bax@nih.gov [National Institutes of Health, Laboratory of Chemical Physics, National Institute of Diabetes and Digestive and Kidney Diseases (United States)

    2012-03-15

    We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and {sup 13}C{sup {beta}} chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of {beta}-turns: I, II, I Prime , II Prime and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and {beta}-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7-0.9 for the Matthews correlation coefficient of its predictions far exceed those attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures.

  20. Identification of helix capping and β-turn motifs from NMR chemical shifts

    International Nuclear Information System (INIS)

    Shen Yang; Bax, Ad

    2012-01-01

    We present an empirical method for identification of distinct structural motifs in proteins on the basis of experimentally determined backbone and 13 C β chemical shifts. Elements identified include the N-terminal and C-terminal helix capping motifs and five types of β-turns: I, II, I′, II′ and VIII. Using a database of proteins of known structure, the NMR chemical shifts, together with the PDB-extracted amino acid preference of the helix capping and β-turn motifs are used as input data for training an artificial neural network algorithm, which outputs the statistical probability of finding each motif at any given position in the protein. The trained neural networks, contained in the MICS (motif identification from chemical shifts) program, also provide a confidence level for each of their predictions, and values ranging from ca 0.7–0.9 for the Matthews correlation coefficient of its predictions far exceed those attainable by sequence analysis. MICS is anticipated to be useful both in the conventional NMR structure determination process and for enhancing on-going efforts to determine protein structures solely on the basis of chemical shift information, where it can aid in identifying protein database fragments suitable for use in building such structures.

  1. A novel k-mer set memory (KSM) motif representation improves regulatory variant prediction.

    Science.gov (United States)

    Guo, Yuchun; Tian, Kevin; Zeng, Haoyang; Guo, Xiaoyun; Gifford, David Kenneth

    2018-04-13

    The representation and discovery of transcription factor (TF) sequence binding specificities is critical for understanding gene regulatory networks and interpreting the impact of disease-associated noncoding genetic variants. We present a novel TF binding motif representation, the k -mer set memory (KSM), which consists of a set of aligned k -mers that are overrepresented at TF binding sites, and a new method called KMAC for de novo discovery of KSMs. We find that KSMs more accurately predict in vivo binding sites than position weight matrix (PWM) models and other more complex motif models across a large set of ChIP-seq experiments. Furthermore, KSMs outperform PWMs and more complex motif models in predicting in vitro binding sites. KMAC also identifies correct motifs in more experiments than five state-of-the-art motif discovery methods. In addition, KSM-derived features outperform both PWM and deep learning model derived sequence features in predicting differential regulatory activities of expression quantitative trait loci (eQTL) alleles. Finally, we have applied KMAC to 1600 ENCODE TF ChIP-seq data sets and created a public resource of KSM and PWM motifs. We expect that the KSM representation and KMAC method will be valuable in characterizing TF binding specificities and in interpreting the effects of noncoding genetic variations. © 2018 Guo et al.; Published by Cold Spring Harbor Laboratory Press.

  2. Pipeline for the Analysis of ChIP-seq Data and New Motif Ranking Procedure

    KAUST Repository

    Ashoor, Haitham

    2011-06-01

    This thesis presents a computational methodology for ab-initio identification of transcription factor binding sites based on ChIP-seq data. This method consists of three main steps, namely ChIP-seq data processing, motif discovery and models selection. A novel method for ranking the models of motifs identified in this process is proposed. This method combines multiple factors in order to rank the provided candidate motifs. It combines the model coverage of the ChIP-seq fragments that contain motifs from which that model is built, the suitable background data made up of shuffled ChIP-seq fragments, and the p-value that resulted from evaluating the model on actual and background data. Two ChIP-seq datasets retrieved from ENCODE project are used to evaluate and demonstrate the ability of the method to predict correct TFBSs with high precision. The first dataset relates to neuron-restrictive silencer factor, NRSF, while the second one corresponds to growth-associated binding protein, GABP. The pipeline system shows high precision prediction for both datasets, as in both cases the top ranked motif closely resembles the known motifs for the respective transcription factors.

  3. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Science.gov (United States)

    Zandevakili, Pooya; Hu, Ming; Qin, Zhaohui

    2012-01-01

    Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS) that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU)-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  4. GPUmotif: an ultra-fast and energy-efficient motif analysis program using graphics processing units.

    Directory of Open Access Journals (Sweden)

    Pooya Zandevakili

    Full Text Available Computational detection of TF binding patterns has become an indispensable tool in functional genomics research. With the rapid advance of new sequencing technologies, large amounts of protein-DNA interaction data have been produced. Analyzing this data can provide substantial insight into the mechanisms of transcriptional regulation. However, the massive amount of sequence data presents daunting challenges. In our previous work, we have developed a novel algorithm called Hybrid Motif Sampler (HMS that enables more scalable and accurate motif analysis. Despite much improvement, HMS is still time-consuming due to the requirement to calculate matching probabilities position-by-position. Using the NVIDIA CUDA toolkit, we developed a graphics processing unit (GPU-accelerated motif analysis program named GPUmotif. We proposed a "fragmentation" technique to hide data transfer time between memories. Performance comparison studies showed that commonly-used model-based motif scan and de novo motif finding procedures such as HMS can be dramatically accelerated when running GPUmotif on NVIDIA graphics cards. As a result, energy consumption can also be greatly reduced when running motif analysis using GPUmotif. The GPUmotif program is freely available at http://sourceforge.net/projects/gpumotif/

  5. diArk – a resource for eukaryotic genome research

    Directory of Open Access Journals (Sweden)

    Kollmar Martin

    2007-04-01

    Full Text Available Abstract Background The number of completed eukaryotic genome sequences and cDNA projects has increased exponentially in the past few years although most of them have not been published yet. In addition, many microarray analyses yielded thousands of sequenced EST and cDNA clones. For the researcher interested in single gene analyses (from a phylogenetic, a structural biology or other perspective it is therefore important to have up-to-date knowledge about the various resources providing primary data. Description The database is built around 3 central tables: species, sequencing projects and publications. The species table contains commonly and alternatively used scientific names, common names and the complete taxonomic information. For projects the sequence type and links to species project web-sites and species homepages are stored. All publications are linked to projects. The web-interface provides comprehensive search modules with detailed options and three different views of the selected data. We have especially focused on developing an elaborate taxonomic tree search tool that allows the user to instantaneously identify e.g. the closest relative to the organism of interest. Conclusion We have developed a database, called diArk, to store, organize, and present the most relevant information about completed genome projects and EST/cDNA data from eukaryotes. Currently, diArk provides information about 415 eukaryotes, 823 sequencing projects, and 248 publications.

  6. Identification of putative regulatory motifs in the upstream regions of co-expressed functional groups of genes in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Joshi NV

    2009-01-01

    Full Text Available Abstract Background Regulation of gene expression in Plasmodium falciparum (Pf remains poorly understood. While over half the genes are estimated to be regulated at the transcriptional level, few regulatory motifs and transcription regulators have been found. Results The study seeks to identify putative regulatory motifs in the upstream regions of 13 functional groups of genes expressed in the intraerythrocytic developmental cycle of Pf. Three motif-discovery programs were used for the purpose, and motifs were searched for only on the gene coding strand. Four motifs – the 'G-rich', the 'C-rich', the 'TGTG' and the 'CACA' motifs – were identified, and zero to all four of these occur in the 13 sets of upstream regions. The 'CACA motif' was absent in functional groups expressed during the ring to early trophozoite transition. For functional groups expressed in each transition, the motifs tended to be similar. Upstream motifs in some functional groups showed 'positional conservation' by occurring at similar positions relative to the translational start site (TLS; this increases their significance as regulatory motifs. In the ribonucleotide synthesis, mitochondrial, proteasome and organellar translation machinery genes, G-rich, C-rich, CACA and TGTG motifs, respectively, occur with striking positional conservation. In the organellar translation machinery group, G-rich motifs occur close to the TLS. The same motifs were sometimes identified for multiple functional groups; differences in location and abundance of the motifs appear to ensure different modes of action. Conclusion The identification of positionally conserved over-represented upstream motifs throws light on putative regulatory elements for transcription in Pf.

  7. Intermediary metabolism in protists: a sequence-based view of facultative anaerobic metabolism in evolutionarily diverse eukaryotes.

    Science.gov (United States)

    Ginger, Michael L; Fritz-Laylin, Lillian K; Fulton, Chandler; Cande, W Zacheus; Dawson, Scott C

    2010-12-01

    Protists account for the bulk of eukaryotic diversity. Through studies of gene and especially genome sequences the molecular basis for this diversity can be determined. Evident from genome sequencing are examples of versatile metabolism that go far beyond the canonical pathways described for eukaryotes in textbooks. In the last 2-3 years, genome sequencing and transcript profiling has unveiled several examples of heterotrophic and phototrophic protists that are unexpectedly well-equipped for ATP production using a facultative anaerobic metabolism, including some protists that can (Chlamydomonas reinhardtii) or are predicted (Naegleria gruberi, Acanthamoeba castellanii, Amoebidium parasiticum) to produce H(2) in their metabolism. It is possible that some enzymes of anaerobic metabolism were acquired and distributed among eukaryotes by lateral transfer, but it is also likely that the common ancestor of eukaryotes already had far more metabolic versatility than was widely thought a few years ago. The discussion of core energy metabolism in unicellular eukaryotes is the subject of this review. Since genomic sequencing has so far only touched the surface of protist diversity, it is anticipated that sequences of additional protists may reveal an even wider range of metabolic capabilities, while simultaneously enriching our understanding of the early evolution of eukaryotes. Copyright © 2010 Elsevier GmbH. All rights reserved.

  8. Overlapping ETS and CRE Motifs (G/CCGGAAGTGACGTCA) Preferentially Bound by GABPα and CREB Proteins

    Science.gov (United States)

    Chatterjee, Raghunath; Zhao, Jianfei; He, Ximiao; Shlyakhtenko, Andrey; Mann, Ishminder; Waterfall, Joshua J.; Meltzer, Paul; Sathyanarayana, B. K.; FitzGerald, Peter C.; Vinson, Charles

    2012-01-01

    Previously, we identified 8-bps long DNA sequences (8-mers) that localize in human proximal promoters and grouped them into known transcription factor binding sites (TFBS). We now examine split 8-mers consisting of two 4-mers separated by 1-bp to 30-bps (X4-N1-30-X4) to identify pairs of TFBS that localize in proximal promoters at a precise distance. These include two overlapping TFBS: the ETS⇔ETS motif (C/GCCGGAAGCGGAA) and the ETS⇔CRE motif (C/GCGGAAGTGACGTCAC). The nucleotides in bold are part of both TFBS. Molecular modeling shows that the ETS⇔CRE motif can be bound simultaneously by both the ETS and the B-ZIP domains without protein-protein clashes. The electrophoretic mobility shift assay (EMSA) shows that the ETS protein GABPα and the B-ZIP protein CREB preferentially bind to the ETS⇔CRE motif only when the two TFBS overlap precisely. In contrast, the ETS domain of ETV5 and CREB interfere with each other for binding the ETS⇔CRE. The 11-mer (CGGAAGTGACG), the conserved part of the ETS⇔CRE motif, occurs 226 times in the human genome and 83% are in known regulatory regions. In vivo GABPα and CREB ChIP-seq peaks identified the ETS⇔CRE as the most enriched motif occurring in promoters of genes involved in mRNA processing, cellular catabolic processes, and stress response, suggesting that a specific class of genes is regulated by this composite motif. PMID:23050235

  9. Mapping and characterizing N6-methyladenine in eukaryotic genomes using single molecule real-time sequencing.

    Science.gov (United States)

    Zhu, Shijia; Beaulaurier, John; Deikus, Gintaras; Wu, Tao; Strahl, Maya; Hao, Ziyang; Luo, Guanzheng; Gregory, James A; Chess, Andrew; He, Chuan; Xiao, Andrew; Sebra, Robert; Schadt, Eric E; Fang, Gang

    2018-05-15

    N6-methyladenine (m6dA) has been discovered as a novel form of DNA methylation prevalent in eukaryotes, however, methods for high resolution mapping of m6dA events are still lacking. Single-molecule real-time (SMRT) sequencing has enabled the detection of m6dA events at single-nucleotide resolution in prokaryotic genomes, but its application to detecting m6dA in eukaryotic genomes has not been rigorously examined. Herein, we identified unique characteristics of eukaryotic m6dA methylomes that fundamentally differ from those of prokaryotes. Based on these differences, we describe the first approach for mapping m6dA events using SMRT sequencing specifically designed for the study of eukaryotic genomes, and provide appropriate strategies for designing experiments and carrying out sequencing in future studies. We apply the novel approach to study two eukaryotic genomes. For green algae, we construct the first complete genome-wide map of m6dA at single nucleotide and single molecule resolution. For human lymphoblastoid cells (hLCLs), joint analyses of SMRT sequencing and independent sequencing data suggest that putative m6dA events are enriched in the promoters of young, full length LINE-1 elements (L1s). These analyses demonstrate a general method for rigorous mapping and characterization of m6dA events in eukaryotic genomes. Published by Cold Spring Harbor Laboratory Press.

  10. ROMANIAN TRADITIONAL MOTIF ELEMENT OF MODERNITY IN CLOTHING

    Directory of Open Access Journals (Sweden)

    ŞUTEU Marius Darius

    2017-05-01

    Full Text Available In this paper are presented the phases for improving from an aesthetic point of view a clothing item, the T-shirt for women using software design patterns, computerised graphics and textile different modern technologies including: industrial embroidery, digital printing, sublimation. In the first phase a documentation was prepared in the University of Oradea and traditional motif was selected from a collection comprising a number of Romanian traditional motifs from different parts of the country and were reintepreted and stylized whilst preserving the symbolism and color range specified to the area. For the styling phase was used CorelDraw vector graphics program that allows changing the shape, size and color of the drawings without affecting the identity of the pattern. The embroidery was done using BERNINA Embroidery Software Designer Plus Software. This software allows you to export the model to any domestic or industrial embroidery machine regardless of brand. Finally we observed the resistance of the printed and embroided model to various: elasticity, resistance to abrasion and a sensory analysis on the preservation of color. After testing we noticed the imprint resistance applied to the fabric, resulting in a quality that makes possible to keep the Romanian traditional motif from generation to generation.

  11. A Novel Protein Interaction between Nucleotide Binding Domain of Hsp70 and p53 Motif

    Directory of Open Access Journals (Sweden)

    Asita Elengoe

    2015-01-01

    Full Text Available Currently, protein interaction of Homo sapiens nucleotide binding domain (NBD of heat shock 70 kDa protein (PDB: 1HJO with p53 motif remains to be elucidated. The NBD-p53 motif complex enhances the p53 stabilization, thereby increasing the tumor suppression activity in cancer treatment. Therefore, we identified the interaction between NBD and p53 using STRING version 9.1 program. Then, we modeled the three-dimensional structure of p53 motif through homology modeling and determined the binding affinity and stability of NBD-p53 motif complex structure via molecular docking and dynamics (MD simulation. Human DNA binding domain of p53 motif (SCMGGMNR retrieved from UniProt (UniProtKB: P04637 was docked with the NBD protein, using the Autodock version 4.2 program. The binding energy and intermolecular energy for the NBD-p53 motif complex were −0.44 Kcal/mol and −9.90 Kcal/mol, respectively. Moreover, RMSD, RMSF, hydrogen bonds, salt bridge, and secondary structure analyses revealed that the NBD protein had a strong bond with p53 motif and the protein-ligand complex was stable. Thus, the current data would be highly encouraging for designing Hsp70 structure based drug in cancer therapy.

  12. Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family

    Science.gov (United States)

    Soufari, Heddy

    2017-01-01

    Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515

  13. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed; Mansour, Essam; Kalnis, Panos

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern

  14. Efficient sequential and parallel algorithms for planted motif search.

    Science.gov (United States)

    Nicolae, Marius; Rajasekaran, Sanguthevar

    2014-01-31

    Motif searching is an important step in the detection of rare events occurring in a set of DNA or protein sequences. One formulation of the problem is known as (l,d)-motif search or Planted Motif Search (PMS). In PMS we are given two integers l and d and n biological sequences. We want to find all sequences of length l that appear in each of the input sequences with at most d mismatches. The PMS problem is NP-complete. PMS algorithms are typically evaluated on certain instances considered challenging. Despite ample research in the area, a considerable performance gap exists because many state of the art algorithms have large runtimes even for moderately challenging instances. This paper presents a fast exact parallel PMS algorithm called PMS8. PMS8 is the first algorithm to solve the challenging (l,d) instances (25,10) and (26,11). PMS8 is also efficient on instances with larger l and d such as (50,21). We include a comparison of PMS8 with several state of the art algorithms on multiple problem instances. This paper also presents necessary and sufficient conditions for 3 l-mers to have a common d-neighbor. The program is freely available at http://engr.uconn.edu/~man09004/PMS8/. We present PMS8, an efficient exact algorithm for Planted Motif Search. PMS8 introduces novel ideas for generating common neighborhoods. We have also implemented a parallel version for this algorithm. PMS8 can solve instances not solved by any previous algorithms.

  15. Motif finding in DNA sequences based on skipping nonconserved positions in background Markov chains.

    Science.gov (United States)

    Zhao, Xiaoyan; Sze, Sing-Hoi

    2011-05-01

    One strategy to identify transcription factor binding sites is through motif finding in upstream DNA sequences of potentially co-regulated genes. Despite extensive efforts, none of the existing algorithms perform very well. We consider a string representation that allows arbitrary ignored positions within the nonconserved portion of single motifs, and use O(2(l)) Markov chains to model the background distributions of motifs of length l while skipping these positions within each Markov chain. By focusing initially on positions that have fixed nucleotides to define core occurrences, we develop an algorithm to identify motifs of moderate lengths. We compare the performance of our algorithm to other motif finding algorithms on a few benchmark data sets, and show that significant improvement in accuracy can be obtained when the sites are sufficiently conserved within a given sample, while comparable performance is obtained when the site conservation rate is low. A software program (PosMotif ) and detailed results are available online at http://faculty.cse.tamu.edu/shsze/posmotif.

  16. Insights into the diversity of eukaryotes in acid mine drainage biofilm communities.

    Science.gov (United States)

    Baker, Brett J; Tyson, Gene W; Goosherst, Lindsey; Banfield, Jillian F

    2009-04-01

    Microscopic eukaryotes are known to have important ecosystem functions, but their diversity in most environments remains vastly unexplored. Here we analyzed an 18S rRNA gene library from a subsurface iron- and sulfur-oxidizing microbial community growing in highly acidic (pH morphological characterization. Results revealed that the populations vary significantly with the habitat and no group is ubiquitous. Surprisingly, many of the eukaryotic lineages (with the exception of the APC) are closely related to neutrophiles, suggesting that they recently adapted to this extreme environment. Molecular analyses presented here confirm that the number of eukaryotic species associated with the acid mine drainage (AMD) communities is low. This finding is consistent with previous results showing a limited diversity of archaea, bacteria, and viruses in AMD environments and suggests that the environmental pressures and interplay between the members of these communities limit species diversity at all trophic levels.

  17. Oxygenation of the Mesoproterozoic ocean and the evolution of complex eukaryotes

    Science.gov (United States)

    Zhang, Kan; Zhu, Xiangkun; Wood, Rachel A.; Shi, Yao; Gao, Zhaofu; Poulton, Simon W.

    2018-05-01

    The Mesoproterozoic era (1,600-1,000 million years ago (Ma)) has long been considered a period of relative environmental stasis, with persistently low levels of atmospheric oxygen. There remains much uncertainty, however, over the evolution of ocean chemistry during this period, which may have been of profound significance for the early evolution of eukaryotic life. Here we present rare earth element, iron-speciation and inorganic carbon isotope data to investigate the redox evolution of the 1,600-1,550 Ma Yanliao Basin, North China Craton. These data confirm that the ocean at the start of the Mesoproterozoic was dominantly anoxic and ferruginous. Significantly, however, we find evidence for a progressive oxygenation event starting at 1,570 Ma, immediately prior to the occurrence of complex multicellular eukaryotes in shelf areas of the Yanliao Basin. Our study thus demonstrates that oxygenation of the Mesoproterozoic environment was far more dynamic and intense than previously envisaged, and establishes an important link between rising oxygen and the emerging record of diverse, multicellular eukaryotic life in the early Mesoproterozoic.

  18. Tracking the rise of eukaryotes to ecological dominance with zinc isotopes.

    Science.gov (United States)

    Isson, Terry T; Love, Gordon D; Dupont, Christopher L; Reinhard, Christopher T; Zumberge, Alex J; Asael, Dan; Gueguen, Bleuenn; McCrow, John; Gill, Ben C; Owens, Jeremy; Rainbird, Robert H; Rooney, Alan D; Zhao, Ming-Yu; Stueeken, Eva E; Konhauser, Kurt O; John, Seth G; Lyons, Timothy W; Planavsky, Noah J

    2018-06-05

    The biogeochemical cycling of zinc (Zn) is intimately coupled with organic carbon in the ocean. Based on an extensive new sedimentary Zn isotope record across Earth's history, we provide evidence for a fundamental shift in the marine Zn cycle ~800 million years ago. We discuss a wide range of potential drivers for this transition and propose that, within available constraints, a restructuring of marine ecosystems is the most parsimonious explanation for this shift. Using a global isotope mass balance approach, we show that a change in the organic Zn/C ratio is required to account for observed Zn isotope trends through time. Given the higher affinity of eukaryotes for Zn relative to prokaryotes, we suggest that a shift toward a more eukaryote-rich ecosystem could have provided a means of more efficiently sequestering organic-derived Zn. Despite the much earlier appearance of eukaryotes in the microfossil record (~1700 to 1600 million years ago), our data suggest a delayed rise to ecological prominence during the Neoproterozoic, consistent with the currently accepted organic biomarker records. © 2018 John Wiley & Sons Ltd.

  19. Proton-pumping rhodopsins are abundantly expressed by microbial eukaryotes in a high-Arctic fjord.

    Science.gov (United States)

    Vader, Anna; Laughinghouse, Haywood D; Griffiths, Colin; Jakobsen, Kjetill S; Gabrielsen, Tove M

    2018-02-01

    Proton-pumping rhodopsins provide an alternative pathway to photosynthesis by which solar energy can enter the marine food web. Rhodopsin genes are widely found in marine bacteria, also in the Arctic, and were recently reported from several eukaryotic lineages. So far, little is known about rhodopsin expression in Arctic eukaryotes. In this study, we used metatranscriptomics and 18S rDNA tag sequencing to examine the mid-summer function and composition of marine protists (size 0.45-10 µm) in the high-Arctic Billefjorden (Spitsbergen), especially focussing on the expression of microbial proton-pumping rhodopsins. Rhodopsin transcripts were highly abundant, at a level similar to that of genes involved in photosynthesis. Phylogenetic analyses placed the environmental rhodopsins within disparate eukaryotic lineages, including dinoflagellates, stramenopiles, haptophytes and cryptophytes. Sequence comparison indicated the presence of several functional types, including xanthorhodopsins and a eukaryotic clade of proteorhodopsin. Transcripts belonging to the proteorhodopsin clade were also abundant in published metatranscriptomes from other oceanic regions, suggesting a global distribution. The diversity and abundance of rhodopsins show that these light-driven proton pumps play an important role in Arctic microbial eukaryotes. Understanding this role is imperative to predicting the future of the Arctic marine ecosystem faced by a changing light climate due to diminishing sea-ice. © 2017 Society for Applied Microbiology and John Wiley & Sons Ltd.

  20. Disparate requirements for the Walker A and B ATPase motifs ofhuman RAD51D in homologous recombination

    Energy Technology Data Exchange (ETDEWEB)

    Wiese, Claudia; Hinz, John M.; Tebbs, Robert S.; Nham, Peter B.; Urbin, Salustra S.; Collins, David W.; Thompson, Larry H.; Schild, David

    2006-04-21

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C, and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks. Ectopic expression of wild type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  1. Temporal motifs reveal collaboration patterns in online task-oriented networks

    Science.gov (United States)

    Xuan, Qi; Fang, Huiting; Fu, Chenbo; Filkov, Vladimir

    2015-05-01

    Real networks feature layers of interactions and complexity. In them, different types of nodes can interact with each other via a variety of events. Examples of this complexity are task-oriented social networks (TOSNs), where teams of people share tasks towards creating a quality artifact, such as academic research papers or software development in commercial or open source environments. Accomplishing those tasks involves both work, e.g., writing the papers or code, and communication, to discuss and coordinate. Taking into account the different types of activities and how they alternate over time can result in much more precise understanding of the TOSNs behaviors and outcomes. That calls for modeling techniques that can accommodate both node and link heterogeneity as well as temporal change. In this paper, we report on methodology for finding temporal motifs in TOSNs, limited to a system of two people and an artifact. We apply the methods to publicly available data of TOSNs from 31 Open Source Software projects. We find that these temporal motifs are enriched in the observed data. When applied to software development outcome, temporal motifs reveal a distinct dependency between collaboration and communication in the code writing process. Moreover, we show that models based on temporal motifs can be used to more precisely relate both individual developer centrality and team cohesion to programmer productivity than models based on aggregated TOSNs.

  2. Salt-bridge Swapping in the EXXERFXYY Motif of Proton Coupled Oligopeptide Transporters

    DEFF Research Database (Denmark)

    Aduri, Nanda G; Prabhala, Bala K; Ernst, Heidi A

    2015-01-01

    to as E1XXE2R), located on Helix I, in interactions with the proton. In this study we investigated the intracellular substrate accumulation by motif variants with all possible combinations of glutamate residues changed to glutamine and arginine changed to a tyrosine; the latter being a natural variant......-motif salt bridge, i.e. R-E2 to R-E1, which is consistent with previous structural studies. Molecular dynamics simulations of the motif variants E1XXE2R and E1XXQ2R support this mechanism. The simulations showed that upon changing conformation, arginine pushes Helix V, through interactions with the highly...

  3. Systematic discovery of regulatory motifs in Fusarium graminearum by comparing four Fusarium genomes

    Directory of Open Access Journals (Sweden)

    Kistler Corby

    2010-03-01

    Full Text Available Abstract Background Fusarium graminearum (Fg, a major fungal pathogen of cultivated cereals, is responsible for billions of dollars in agriculture losses. There is a growing interest in understanding the transcriptional regulation of this organism, especially the regulation of genes underlying its pathogenicity. The generation of whole genome sequence assemblies for Fg and three closely related Fusarium species provides a unique opportunity for such a study. Results Applying comparative genomics approaches, we developed a computational pipeline to systematically discover evolutionarily conserved regulatory motifs in the promoter, downstream and the intronic regions of Fg genes, based on the multiple alignments of sequenced Fusarium genomes. Using this method, we discovered 73 candidate regulatory motifs in the promoter regions. Nearly 30% of these motifs are highly enriched in promoter regions of Fg genes that are associated with a specific functional category. Through comparison to Saccharomyces cerevisiae (Sc and Schizosaccharomyces pombe (Sp, we observed conservation of transcription factors (TFs, their binding sites and the target genes regulated by these TFs related to pathways known to respond to stress conditions or phosphate metabolism. In addition, this study revealed 69 and 39 conserved motifs in the downstream regions and the intronic regions, respectively, of Fg genes. The top intronic motif is the splice donor site. For the downstream regions, we noticed an intriguing absence of the mammalian and Sc poly-adenylation signals among the list of conserved motifs. Conclusion This study provides the first comprehensive list of candidate regulatory motifs in Fg, and underscores the power of comparative genomics in revealing functional elements among related genomes. The conservation of regulatory pathways among the Fusarium genomes and the two yeast species reveals their functional significance, and provides new insights in their

  4. Through the Portal: Viking Motifs Incorporated in the Romanesque Style in Telemark, Norway

    Directory of Open Access Journals (Sweden)

    Kristine Ødeby

    2013-09-01

    Full Text Available This paper presents the results of an analysis of motifs identified on six carved wooden Romanesque portal panels from the Norwegian county of Telemark. The findings suggest that animal motifs in the Late Viking style survived long into the Late Medieval period and were reused on these medieval portals. Stylistically, late expressions of Viking animal art do not differ a great deal from those of the subsequent Romanesque style. However, their symbolical differences are considered to be significant. The motifs themselves, and the issue of whether the Romanesque style adopted motifs from pre-Christian art, have attracted less attention. The motif portraying Sigurd slaying the dragon is considered in depth. It will be suggested that Sigurd, serving as a mediator between the old and the new beliefs when he appeared in late Viking contexts, was given a new role when portrayed in Christian art. Metaphor and liminality are a central part of this paper, and the theories of Alfred Gell and Margrete Andås suggest that the portal itself affects those who pass through it, and that the iconography is meaningful from a liminal perspective.

  5. Dragon polya spotter: Predictor of poly(A) motifs within human genomic DNA sequences

    KAUST Repository

    Kalkatawi, Manal M.

    2011-11-15

    Motivation: Recognition of poly(A) signals in mRNA is relatively straightforward due to the presence of easily recognizable polyadenylic acid tail. However, the task of identifying poly(A) motifs in the primary genomic DNA sequence that correspond to poly(A) signals in mRNA is a far more challenging problem. Recognition of poly(A) signals is important for better gene annotation and understanding of the gene regulation mechanisms. In this work, we present one such poly(A) motif prediction method based on properties of human genomic DNA sequence surrounding a poly(A) motif. These properties include thermodynamic, physico-chemical and statistical characteristics. For predictions, we developed Artificial Neural Network and Random Forest models. These models are trained to recognize 12 most common poly(A) motifs in human DNA. Our predictors are available as a free web-based tool accessible at http://cbrc.kaust.edu.sa/dps. Compared with other reported predictors, our models achieve higher sensitivity and specificity and furthermore provide a consistent level of accuracy for 12 poly(A) motif variants. The Author(s) 2011. Published by Oxford University Press. All rights reserved.

  6. An intracellular motif of GLUT4 regulates fusion of GLUT4-containing vesicles.

    Science.gov (United States)

    Heyward, Catherine A; Pettitt, Trevor R; Leney, Sophie E; Welsh, Gavin I; Tavaré, Jeremy M; Wakelam, Michael J O

    2008-05-20

    Insulin stimulates glucose uptake by adipocytes through increasing translocation of the glucose transporter GLUT4 from an intracellular compartment to the plasma membrane. Fusion of GLUT4-containing vesicles at the cell surface is thought to involve phospholipase D activity, generating the signalling lipid phosphatidic acid, although the mechanism of action is not yet clear. Here we report the identification of a putative phosphatidic acid-binding motif in a GLUT4 intracellular loop. Mutation of this motif causes a decrease in the insulin-induced exposure of GLUT4 at the cell surface of 3T3-L1 adipocytes via an effect on vesicle fusion. The potential phosphatidic acid-binding motif identified in this study is unique to GLUT4 among the sugar transporters, therefore this motif may provide a unique mechanism for regulating insulin-induced translocation by phospholipase D signalling.

  7. qPMS7: a fast algorithm for finding (ℓ, d-motifs in DNA and protein sequences.

    Directory of Open Access Journals (Sweden)

    Hieu Dinh

    Full Text Available Detection of rare events happening in a set of DNA/protein sequences could lead to new biological discoveries. One kind of such rare events is the presence of patterns called motifs in DNA/protein sequences. Finding motifs is a challenging problem since the general version of motif search has been proven to be intractable. Motifs discovery is an important problem in biology. For example, it is useful in the detection of transcription factor binding sites and transcriptional regulatory elements that are very crucial in understanding gene function, human disease, drug design, etc. Many versions of the motif search problem have been proposed in the literature. One such is the (ℓ, d-motif search (or Planted Motif Search (PMS. A generalized version of the PMS problem, namely, Quorum Planted Motif Search (qPMS, is shown to accurately model motifs in real data. However, solving the qPMS problem is an extremely difficult task because a special case of it, the PMS Problem, is already NP-hard, which means that any algorithm solving it can be expected to take exponential time in the worse case scenario. In this paper, we propose a novel algorithm named qPMS7 that tackles the qPMS problem on real data as well as challenging instances. Experimental results show that our Algorithm qPMS7 is on an average 5 times faster than the state-of-art algorithm. The executable program of Algorithm qPMS7 is freely available on the web at http://pms.engr.uconn.edu/downloads/qPMS7.zip. Our online motif discovery tools that use Algorithm qPMS7 are freely available at http://pms.engr.uconn.edu or http://motifsearch.com.

  8. How to find a leucine in a haystack? Structure, ligand recognition and regulation of leucine-aspartic acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2014-05-29

    LD motifs (leucine-aspartic acidmotifs) are short helical protein-protein interaction motifs that have emerged as key players in connecting cell adhesion with cell motility and survival. LD motifs are required for embryogenesis, wound healing and the evolution of multicellularity. LD motifs also play roles in disease, such as in cancer metastasis or viral infection. First described in the paxillin family of scaffolding proteins, LD motifs and similar acidic LXXLL interaction motifs have been discovered in several other proteins, whereas 16 proteins have been reported to contain LDBDs (LD motif-binding domains). Collectively, structural and functional analyses have revealed a surprising multivalency in LD motif interactions and a wide diversity in LDBD architectures. In the present review, we summarize the molecular basis for function, regulation and selectivity of LD motif interactions that has emerged from more than a decade of research. This overview highlights the intricate multi-level regulation and the inherently noisy and heterogeneous nature of signalling through short protein-protein interaction motifs. © 2014 Biochemical Society.

  9. How to find a leucine in a haystack? Structure, ligand recognition and regulation of leucine-aspartic acid (LD) motifs

    KAUST Repository

    Alam, Tanvir; Alazmi, Meshari; Gao, Xin; Arold, Stefan T.

    2014-01-01

    LD motifs (leucine-aspartic acidmotifs) are short helical protein-protein interaction motifs that have emerged as key players in connecting cell adhesion with cell motility and survival. LD motifs are required for embryogenesis, wound healing and the evolution of multicellularity. LD motifs also play roles in disease, such as in cancer metastasis or viral infection. First described in the paxillin family of scaffolding proteins, LD motifs and similar acidic LXXLL interaction motifs have been discovered in several other proteins, whereas 16 proteins have been reported to contain LDBDs (LD motif-binding domains). Collectively, structural and functional analyses have revealed a surprising multivalency in LD motif interactions and a wide diversity in LDBD architectures. In the present review, we summarize the molecular basis for function, regulation and selectivity of LD motif interactions that has emerged from more than a decade of research. This overview highlights the intricate multi-level regulation and the inherently noisy and heterogeneous nature of signalling through short protein-protein interaction motifs. © 2014 Biochemical Society.

  10. Identification of group specific motifs in Beta-lactamase family of proteins

    Directory of Open Access Journals (Sweden)

    Saxena Akansha

    2009-12-01

    Full Text Available Abstract Background Beta-lactamases are one of the most serious threats to public health. In order to combat this threat we need to study the molecular and functional diversity of these enzymes and identify signatures specific to these enzymes. These signatures will enable us to develop inhibitors and diagnostic probes specific to lactamases. The existing classification of beta-lactamases was developed nearly 30 years ago when few lactamases were available. DLact database contain more than 2000 beta-lactamase, which can be used to study the molecular diversity and to identify signatures specific to this family. Methods A set of 2020 beta-lactamase proteins available in the DLact database http://59.160.102.202/DLact were classified using graph-based clustering of Best Bi-Directional Hits. Non-redundant (> 90 percent identical protein sequences from each group were aligned using T-Coffee and annotated using information available in literature. Motifs specific to each group were predicted using PRATT program. Results The graph-based classification of beta-lactamase proteins resulted in the formation of six groups (Four major groups containing 191, 726, 774 and 73 proteins while two minor groups containing 50 and 8 proteins. Based on the information available in literature, we found that each of the four major groups correspond to the four classes proposed by Ambler. The two minor groups were novel and do not contain molecular signatures of beta-lactamase proteins reported in literature. The group-specific motifs showed high sensitivity (> 70% and very high specificity (> 90%. The motifs from three groups (corresponding to class A, C and D had a high level of conservation at DNA as well as protein level whereas the motifs from the fourth group (corresponding to class B showed conservation at only protein level. Conclusion The graph-based classification of beta-lactamase proteins corresponds with the classification proposed by Ambler, thus there is

  11. I-Ad-binding peptides derived from unrelated protein antigens share a common structural motif

    DEFF Research Database (Denmark)

    Sette, A; Buus, S; Colon, S

    1988-01-01

    on the I-Ad binding of the immunogenic peptide OVA 323-339. The results obtained demonstrated the very permissive nature of Ag-Ia interaction. We also showed that unrelated peptides that are good I-Ad binders share a common structural motif and speculated that recognition of such motifs could represent...... that I-Ad molecules recognize a large library of Ag by virtue of common structural motifs present in peptides derived from phylogenetically unrelated proteins....

  12. Phylogenetic analysis of P5 P-type ATPases, a eukaryotic lineage of secretory pathway pumps

    DEFF Research Database (Denmark)

    Møller, Annette; Asp, Torben; Holm, Preben Bach

    2008-01-01

    prokaryotic genome. Based on a protein alignment we could group the P5 ATPases into two subfamilies, P5A and P5B that, based on the number of negative charges in conserved trans-membrane segment 4, are likely to have different ion specificities. P5A ATPases are present in all eukaryotic genomes sequenced so......Eukaryotes encompass a remarkable variety of organisms and unresolved lineages. Different phylogenetic analyses have lead to conflicting conclusions as to the origin and associations between lineages and species. In this work, we investigated evolutionary relationship of a family of cation pumps...... exclusive for the secretory pathway of eukaryotes by combining the identification of lineage-specific genes with phylogenetic evolution of common genes. Sequences of P5 ATPases, which are regarded to be cation pumps in the endoplasmic reticulum (ER), were identified in all eukaryotic lineages but not in any...

  13. GenColors-based comparative genome databases for small eukaryotic genomes.

    Science.gov (United States)

    Felder, Marius; Romualdi, Alessandro; Petzold, Andreas; Platzer, Matthias; Sühnel, Jürgen; Glöckner, Gernot

    2013-01-01

    Many sequence data repositories can give a quick and easily accessible overview on genomes and their annotations. Less widespread is the possibility to compare related genomes with each other in a common database environment. We have previously described the GenColors database system (http://gencolors.fli-leibniz.de) and its applications to a number of bacterial genomes such as Borrelia, Legionella, Leptospira and Treponema. This system has an emphasis on genome comparison. It combines data from related genomes and provides the user with an extensive set of visualization and analysis tools. Eukaryote genomes are normally larger than prokaryote genomes and thus pose additional challenges for such a system. We have, therefore, adapted GenColors to also handle larger datasets of small eukaryotic genomes and to display eukaryotic gene structures. Further recent developments include whole genome views, genome list options and, for bacterial genome browsers, the display of horizontal gene transfer predictions. Two new GenColors-based databases for two fungal species (http://fgb.fli-leibniz.de) and for four social amoebas (http://sacgb.fli-leibniz.de) were set up. Both new resources open up a single entry point for related genomes for the amoebozoa and fungal research communities and other interested users. Comparative genomics approaches are greatly facilitated by these resources.

  14. Consistent mutational paths predict eukaryotic thermostability

    Directory of Open Access Journals (Sweden)

    van Noort Vera

    2013-01-01

    Full Text Available Abstract Background Proteomes of thermophilic prokaryotes have been instrumental in structural biology and successfully exploited in biotechnology, however many proteins required for eukaryotic cell function are absent from bacteria or archaea. With Chaetomium thermophilum, Thielavia terrestris and Thielavia heterothallica three genome sequences of thermophilic eukaryotes have been published. Results Studying the genomes and proteomes of these thermophilic fungi, we found common strategies of thermal adaptation across the different kingdoms of Life, including amino acid biases and a reduced genome size. A phylogenetics-guided comparison of thermophilic proteomes with those of other, mesophilic Sordariomycetes revealed consistent amino acid substitutions associated to thermophily that were also present in an independent lineage of thermophilic fungi. The most consistent pattern is the substitution of lysine by arginine, which we could find in almost all lineages but has not been extensively used in protein stability engineering. By exploiting mutational paths towards the thermophiles, we could predict particular amino acid residues in individual proteins that contribute to thermostability and validated some of them experimentally. By determining the three-dimensional structure of an exemplar protein from C. thermophilum (Arx1, we could also characterise the molecular consequences of some of these mutations. Conclusions The comparative analysis of these three genomes not only enhances our understanding of the evolution of thermophily, but also provides new ways to engineer protein stability.

  15. Signaling mechanisms of apoptosis-like programmed cell death in unicellular eukaryotes.

    Science.gov (United States)

    Shemarova, Irina V

    2010-04-01

    In unicellular eukaryotes, apoptosis-like cell death occurs during development, aging and reproduction, and can be induced by environmental stresses and exposure to toxic agents. The essence of the apoptotic machinery in unicellular organisms is similar to that in mammals, but the apoptotic signal network is less complex and of more ancient origin. The review summarizes current data about key apoptotic proteins and mechanisms of the transduction of apoptotic signals by caspase-like proteases and mitochondrial apoptogenic proteins in unicellular eukaryotes. The roles of receptor-dependent and receptor-independent caspase cascades are reviewed. 2010 Elsevier Inc. All rights reserved.

  16. Glycosyltransferase family 43 is also found in early eukaryotes and has three subfamilies in Charophycean green algae.

    Directory of Open Access Journals (Sweden)

    Rahil Taujale

    Full Text Available The glycosyltransferase family 43 (GT43 has been suggested to be involved in the synthesis of xylans in plant cell walls and proteoglycans in animals. Very recently GT43 family was also found in Charophycean green algae (CGA, the closest relatives of extant land plants. Here we present evidence that non-plant and non-animal early eukaryotes such as fungi, Haptophyceae, Choanoflagellida, Ichthyosporea and Haptophyceae also have GT43-like genes, which are phylogenetically close to animal GT43 genes. By mining RNA sequencing data (RNA-Seq of selected plants, we showed that CGA have evolved three major groups of GT43 genes, one orthologous to IRX14 (IRREGULAR XYLEM14, one orthologous to IRX9/IRX9L and the third one ancestral to all land plant GT43 genes. We confirmed that land plant GT43 has two major clades A and B, while in angiosperms, clade A further evolved into three subclades and the expression and motif pattern of A3 (containing IRX9 are fairly different from the other two clades likely due to rapid evolution. Our in-depth sequence analysis contributed to our overall understanding of the early evolution of GT43 family and could serve as an example for the study of other plant cell wall-related enzyme families.

  17. Biotransformation of arsenic by a Yellowstone thermoacidophilic eukaryotic alga.

    Science.gov (United States)

    Qin, Jie; Lehr, Corinne R; Yuan, Chungang; Le, X Chris; McDermott, Timothy R; Rosen, Barry P

    2009-03-31

    Arsenic is the most common toxic substance in the environment, ranking first on the Superfund list of hazardous substances. It is introduced primarily from geochemical sources and is acted on biologically, creating an arsenic biogeocycle. Geothermal environments are known for their elevated arsenic content and thus provide an excellent setting in which to study microbial redox transformations of arsenic. To date, most studies of microbial communities in geothermal environments have focused on Bacteria and Archaea, with little attention to eukaryotic microorganisms. Here, we show the potential of an extremophilic eukaryotic alga of the order Cyanidiales to influence arsenic cycling at elevated temperatures. Cyanidioschyzon sp. isolate 5508 oxidized arsenite [As(III)] to arsenate [As(V)], reduced As(V) to As(III), and methylated As(III) to form trimethylarsine oxide (TMAO) and dimethylarsenate [DMAs(V)]. Two arsenic methyltransferase genes, CmarsM7 and CmarsM8, were cloned from this organism and demonstrated to confer resistance to As(III) in an arsenite hypersensitive strain of Escherichia coli. The 2 recombinant CmArsMs were purified and shown to transform As(III) into monomethylarsenite, DMAs(V), TMAO, and trimethylarsine gas, with a T(opt) of 60-70 degrees C. These studies illustrate the importance of eukaryotic microorganisms to the biogeochemical cycling of arsenic in geothermal systems, offer a molecular explanation for how these algae tolerate arsenic in their environment, and provide the characterization of algal methyltransferases.

  18. Methyl labeling and TROSY NMR spectroscopy of proteins expressed in the eukaryote Pichia pastoris

    International Nuclear Information System (INIS)

    Clark, Lindsay; Zahm, Jacob A.; Ali, Rustam; Kukula, Maciej; Bian, Liangqiao; Patrie, Steven M.; Gardner, Kevin H.; Rosen, Michael K.; Rosenbaum, Daniel M.

    2015-01-01

    13 C Methyl TROSY NMR spectroscopy has emerged as a powerful method for studying the dynamics of large systems such as macromolecular assemblies and membrane proteins. Specific 13 C labeling of aliphatic methyl groups and perdeuteration has been limited primarily to proteins expressed in E. coli, preventing studies of many eukaryotic proteins of physiological and biomedical significance. We demonstrate the feasibility of efficient 13 C isoleucine δ1-methyl labeling in a deuterated background in an established eukaryotic expression host, Pichia pastoris, and show that this method can be used to label the eukaryotic protein actin, which cannot be expressed in bacteria. This approach will enable NMR studies of previously intractable targets

  19. Evolution of an intricate J-protein network driving protein disaggregation in eukaryotes.

    Science.gov (United States)

    Nillegoda, Nadinath B; Stank, Antonia; Malinverni, Duccio; Alberts, Niels; Szlachcic, Anna; Barducci, Alessandro; De Los Rios, Paolo; Wade, Rebecca C; Bukau, Bernd

    2017-05-15

    Hsp70 participates in a broad spectrum of protein folding processes extending from nascent chain folding to protein disaggregation. This versatility in function is achieved through a diverse family of J-protein cochaperones that select substrates for Hsp70. Substrate selection is further tuned by transient complexation between different classes of J-proteins, which expands the range of protein aggregates targeted by metazoan Hsp70 for disaggregation. We assessed the prevalence and evolutionary conservation of J-protein complexation and cooperation in disaggregation. We find the emergence of a eukaryote-specific signature for interclass complexation of canonical J-proteins. Consistently, complexes exist in yeast and human cells, but not in bacteria, and correlate with cooperative action in disaggregation in vitro. Signature alterations exclude some J-proteins from networking, which ensures correct J-protein pairing, functional network integrity and J-protein specialization. This fundamental change in J-protein biology during the prokaryote-to-eukaryote transition allows for increased fine-tuning and broadening of Hsp70 function in eukaryotes.

  20. An algorithm for detecting eukaryotic sequences in metagenomic ...

    Indian Academy of Sciences (India)

    species but also from accidental contamination from the genome of eukaryotic host cells. The latter scenario generally occurs in the case of host-associated metagenomes, e.g. microbes living in human gut. In such cases, one needs to identify and remove contaminating host DNA sequences, since the latter sequences will ...

  1. Comprehensive human transcription factor binding site map for combinatory binding motifs discovery.

    Directory of Open Access Journals (Sweden)

    Arnoldo J Müller-Molina

    Full Text Available To know the map between transcription factors (TFs and their binding sites is essential to reverse engineer the regulation process. Only about 10%-20% of the transcription factor binding motifs (TFBMs have been reported. This lack of data hinders understanding gene regulation. To address this drawback, we propose a computational method that exploits never used TF properties to discover the missing TFBMs and their sites in all human gene promoters. The method starts by predicting a dictionary of regulatory "DNA words." From this dictionary, it distills 4098 novel predictions. To disclose the crosstalk between motifs, an additional algorithm extracts TF combinatorial binding patterns creating a collection of TF regulatory syntactic rules. Using these rules, we narrowed down a list of 504 novel motifs that appear frequently in syntax patterns. We tested the predictions against 509 known motifs confirming that our system can reliably predict ab initio motifs with an accuracy of 81%-far higher than previous approaches. We found that on average, 90% of the discovered combinatorial binding patterns target at least 10 genes, suggesting that to control in an independent manner smaller gene sets, supplementary regulatory mechanisms are required. Additionally, we discovered that the new TFBMs and their combinatorial patterns convey biological meaning, targeting TFs and genes related to developmental functions. Thus, among all the possible available targets in the genome, the TFs tend to regulate other TFs and genes involved in developmental functions. We provide a comprehensive resource for regulation analysis that includes a dictionary of "DNA words," newly predicted motifs and their corresponding combinatorial patterns. Combinatorial patterns are a useful filter to discover TFBMs that play a major role in orchestrating other factors and thus, are likely to lock/unlock cellular functional clusters.

  2. Memfasilitasi Penalaran Geometri Transformasi Siswa Melalui Eksplorasi Motif Melayu dengan Bantuan Grid

    Directory of Open Access Journals (Sweden)

    Febrian Febrian

    2017-10-01

    Full Text Available Geometri transformasi merupakan pengetahuan yang krusial dalam geometri yang dapat membangun banyak kemampuan lainnya seperti penalaran matematis. Oleh karena itu, geometri transformasi disarankan untuk diberikan pada pebelajar mulai dari usia dini. Penelitian terdahulu menunjukkan bahwa anak-anak memiliki sense untuk melihat karakteristik kedinamisan pada benda, oleh karena itu memfasilitasi pembelajaran yang dapat memanfaatkan sense ini menjadi sangat penting untuk membangun pemahaman geometri transformasi. Penelitian design research ini bertujuan untuk memfasilitasi siswa sekolah dasar untuk dapat mengembangkan pengetahuan awal mereka mengenai komposisi transformasi. Subjek penelitian adalah siswa kelas IV Sekolah Dasar Negeri 001 Toapaya, Kabupaten Bintan, Kepulauan Riau. Pendekatan pembelajaran yang digunakan adalah PMRI dengan konteks motif melayu itik pulang petang dengan bantuan grid. Hasil menunjukkan bahwa setting pembelajaran dapat memfasilitasi penalaran geometri transformasi melalui kegiatan eksplorasi motif dengan bantuan grid. Kata Kunci: komposisi transformasi, penalaran, motif melayu, grid, PMRI Transformation geometry is a crucial knowledge in geometry that can emerge many skills especially mathematical reasoning. Therefore, transformation geometry is suggested to be taught to children especially the young learners. Existing research implies that children have particular sense to see dynamic characteristic of an object or others. On the behalf of this statement, facilitating students in learning process that makes use of this students sense becomes important to undertake to help develop students reasoning of transformation geometry. The subtopic being highlighted is the composition of transformation. This design research aims to facilitate this situation. The subject of the research is fourth graders of the State Elementary School of 001 at Toapaya, Kabupaten Bintan, Kepulauan Riau. The learning approach used was PMRI by using

  3. Metabolic profiles of prokaryotic and eukaryotic communities in deep-sea sponge Neamphius huxleyi [corrected]. indicated by metagenomics.

    Science.gov (United States)

    Li, Zhi-Yong; Wang, Yue-Zhu; He, Li-Ming; Zheng, Hua-Jun

    2014-01-27

    The whole metabolism of a sponge holobiont and the respective contributions of prokaryotic and eukaryotic symbionts and their associations with the sponge host remain largely unclear. Meanwhile, compared with shallow water sponges, deep-sea sponges are rarely understood. Here we report the metagenomic exploration of deep-sea sponge Neamphius huxleyi [corrected] . at the whole community level. Metagenomic data showed phylogenetically diverse prokaryotes and eukaryotes in Neamphius huxleyi [corrected]. MEGAN and gene enrichment analyses indicated different metabolic potentials of prokaryotic symbionts from eukaryotic symbionts, especially in nitrogen and carbon metabolisms, and their molecular interactions with the sponge host. These results supported the hypothesis that prokaryotic and eukaryotic symbionts have different ecological roles and relationships with sponge host. Moreover, vigorous denitrification, and CO2 fixation by chemoautotrophic prokaryotes were suggested for this deep-sea sponge. The study provided novel insights into the respective potentials of prokaryotic and eukaryotic symbionts and their associations with deep-sea sponge Neamphius huxleyi [corrected].

  4. Lateral transfer of tetrahymanol-synthesizing genes has allowed multiple diverse eukaryote lineages to independently adapt to environments without oxygen

    Directory of Open Access Journals (Sweden)

    Takishita Kiyotaka

    2012-02-01

    Full Text Available Abstract Sterols are key components of eukaryotic cellular membranes that are synthesized by multi-enzyme pathways that require molecular oxygen. Because prokaryotes fundamentally lack sterols, it is unclear how the vast diversity of bacterivorous eukaryotes that inhabit hypoxic environments obtain, or synthesize, sterols. Here we show that tetrahymanol, a triterpenoid that does not require molecular oxygen for its biosynthesis, likely functions as a surrogate of sterol in eukaryotes inhabiting oxygen-poor environments. Genes encoding the tetrahymanol synthesizing enzyme squalene-tetrahymanol cyclase were found from several phylogenetically diverged eukaryotes that live in oxygen-poor environments and appear to have been laterally transferred among such eukaryotes. Reviewers This article was reviewed by Eric Bapteste and Eugene Koonin.

  5. Potential of industrial biotechnology with cyanobacteria and eukaryotic microalgae.

    NARCIS (Netherlands)

    Wijffels, R.H.; Kruse, O.; Hellingwerf, K.J.

    2013-01-01

    Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments. Cyanobacteria are promising host organisms

  6. An intracellular motif of GLUT4 regulates fusion of GLUT4-containing vesicles

    Directory of Open Access Journals (Sweden)

    Welsh Gavin I

    2008-05-01

    Full Text Available Abstract Background Insulin stimulates glucose uptake by adipocytes through increasing translocation of the glucose transporter GLUT4 from an intracellular compartment to the plasma membrane. Fusion of GLUT4-containing vesicles at the cell surface is thought to involve phospholipase D activity, generating the signalling lipid phosphatidic acid, although the mechanism of action is not yet clear. Results Here we report the identification of a putative phosphatidic acid-binding motif in a GLUT4 intracellular loop. Mutation of this motif causes a decrease in the insulin-induced exposure of GLUT4 at the cell surface of 3T3-L1 adipocytes via an effect on vesicle fusion. Conclusion The potential phosphatidic acid-binding motif identified in this study is unique to GLUT4 among the sugar transporters, therefore this motif may provide a unique mechanism for regulating insulin-induced translocation by phospholipase D signalling.

  7. Multiple TPR motifs characterize the Fanconi anemia FANCG protein.

    Science.gov (United States)

    Blom, Eric; van de Vrugt, Henri J; de Vries, Yne; de Winter, Johan P; Arwert, Fré; Joenje, Hans

    2004-01-05

    The genome protection pathway that is defective in patients with Fanconi anemia (FA) is controlled by at least eight genes, including BRCA2. A key step in the pathway involves the monoubiquitylation of FANCD2, which critically depends on a multi-subunit nuclear 'core complex' of at least six FANC proteins (FANCA, -C, -E, -F, -G, and -L). Except for FANCL, which has WD40 repeats and a RING finger domain, no significant domain structure has so far been recognized in any of the core complex proteins. By using a homology search strategy comparing the human FANCG protein sequence with its ortholog sequences in Oryzias latipes (Japanese rice fish) and Danio rerio (zebrafish) we identified at least seven tetratricopeptide repeat motifs (TPRs) covering a major part of this protein. TPRs are degenerate 34-amino acid repeat motifs which function as scaffolds mediating protein-protein interactions, often found in multiprotein complexes. In four out of five TPR motifs tested (TPR1, -2, -5, and -6), targeted missense mutagenesis disrupting the motifs at the critical position 8 of each TPR caused complete or partial loss of FANCG function. Loss of function was evident from failure of the mutant proteins to complement the cellular FA phenotype in FA-G lymphoblasts, which was correlated with loss of binding to FANCA. Although the TPR4 mutant fully complemented the cells, it showed a reduced interaction with FANCA, suggesting that this TPR may also be of functional importance. The recognition of FANCG as a typical TPR protein predicts this protein to play a key role in the assembly and/or stabilization of the nuclear FA protein core complex.

  8. The city as a motif in Slovene youth literature

    Directory of Open Access Journals (Sweden)

    Milena Mileva Blažić

    2003-01-01

    Full Text Available The article presents the city as motif of Slovenian youth literature in four different periods, beginning in the first period of original Slovenian youth literature in the second half of the 19th century, second period in the first half of the 20th century, third period in the second half of the 20th century and after 1950, when significant books were produced in the field of short modern stories, emphasising on picture books and realistic narrative prose, and the fourth period after 1990. A discernable shift can be observed in the thirties of the 20th century, during the times of socialist realism. The most significant change occurred after 1960, when massive migration from rural to urban environments caused by industrialisation began. The motif of urban environment especially marked modern realistic narrative, coined problematic narrative after 1990, with its focus on issues of growing up in such environments. The city as motif or theme doesn’t appear only in realistic narrative, but since the early 20th century also in fantastic narrative, thus it dichotomically presents the image of real world in Slovenian youth realistic narrative.

  9. Structural and functional aspects of winged-helix domains at the core of transcription initiation complexes.

    Science.gov (United States)

    Teichmann, Martin; Dumay-Odelot, Hélène; Fribourg, Sébastien

    2012-01-01

    The winged helix (WH) domain is found in core components of transcription systems in eukaryotes and prokaryotes. It represents a sub-class of the helix-turn-helix motif. The WH domain participates in establishing protein-DNA and protein-protein-interactions. Here, we discuss possible explanations for the enrichment of this motif in transcription systems.

  10. Genome-wide mapping reveals single-origin chromosome replication in Leishmania, a eukaryotic microbe.

    Science.gov (United States)

    Marques, Catarina A; Dickens, Nicholas J; Paape, Daniel; Campbell, Samantha J; McCulloch, Richard

    2015-10-19

    DNA replication initiates on defined genome sites, termed origins. Origin usage appears to follow common rules in the eukaryotic organisms examined to date: all chromosomes are replicated from multiple origins, which display variations in firing efficiency and are selected from a larger pool of potential origins. To ask if these features of DNA replication are true of all eukaryotes, we describe genome-wide origin mapping in the parasite Leishmania. Origin mapping in Leishmania suggests a striking divergence in origin usage relative to characterized eukaryotes, since each chromosome appears to be replicated from a single origin. By comparing two species of Leishmania, we find evidence that such origin singularity is maintained in the face of chromosome fusion or fission events during evolution. Mapping Leishmania origins suggests that all origins fire with equal efficiency, and that the genomic sites occupied by origins differ from related non-origins sites. Finally, we provide evidence that origin location in Leishmania displays striking conservation with Trypanosoma brucei, despite the latter parasite replicating its chromosomes from multiple, variable strength origins. The demonstration of chromosome replication for a single origin in Leishmania, a microbial eukaryote, has implications for the evolution of origin multiplicity and associated controls, and may explain the pervasive aneuploidy that characterizes Leishmania chromosome architecture.

  11. Peptide-binding motifs of two common equine class I MHC molecules in Thoroughbred horses.

    Science.gov (United States)

    Bergmann, Tobias; Lindvall, Mikaela; Moore, Erin; Moore, Eugene; Sidney, John; Miller, Donald; Tallmadge, Rebecca L; Myers, Paisley T; Malaker, Stacy A; Shabanowitz, Jeffrey; Osterrieder, Nikolaus; Peters, Bjoern; Hunt, Donald F; Antczak, Douglas F; Sette, Alessandro

    2017-05-01

    Quantitative peptide-binding motifs of MHC class I alleles provide a valuable tool to efficiently identify putative T cell epitopes. Detailed information on equine MHC class I alleles is still very limited, and to date, only a single equine MHC class I allele, Eqca-1*00101 (ELA-A3 haplotype), has been characterized. The present study extends the number of characterized ELA class I specificities in two additional haplotypes found commonly in the Thoroughbred breed. Accordingly, we here report quantitative binding motifs for the ELA-A2 allele Eqca-16*00101 and the ELA-A9 allele Eqca-1*00201. Utilizing analyses of endogenously bound and eluted ligands and the screening of positional scanning combinatorial libraries, detailed and quantitative peptide-binding motifs were derived for both alleles. Eqca-16*00101 preferentially binds peptides with aliphatic/hydrophobic residues in position 2 and at the C-terminus, and Eqca-1*00201 has a preference for peptides with arginine in position 2 and hydrophobic/aliphatic residues at the C-terminus. Interestingly, the Eqca-16*00101 motif resembles that of the human HLA A02-supertype, while the Eqca-1*00201 motif resembles that of the HLA B27-supertype and two macaque class I alleles. It is expected that the identified motifs will facilitate the selection of candidate epitopes for the study of immune responses in horses.

  12. Potential of industrial biotechnology with cyanobacteria and eukaryotic microalgae

    NARCIS (Netherlands)

    Wijffels, R.H.; Kruse, O.; Hellingwerf, K.J.

    2013-01-01

    Both cyanobacteria and eukaryotic microalgae are promising organisms for sustainable production of bulk products such as food, feed, materials, chemicals and fuels. In this review we will summarize the potential and current biotechnological developments.Cyanobacteria are promising host organisms for

  13. High affinity recognition of a Phytophthora protein by Arabidopsis via an RGD motif

    NARCIS (Netherlands)

    Senchou, V.; Weide, R.L.; Carrasco, A.; Bouyssou, H.; Pont-Lezica, R.; Govers, F.; Canut, H.

    2004-01-01

    The RGD tripeptide sequence, a cell adhesion motif present in several extracellular matrix proteins of mammalians, is involved in numerous plant processes. In plant-pathogen interactions, the RGD motif is believed to reduce plant defence responses by disrupting adhesions between the cell wall and

  14. DXD Motif-Dependent and -Independent Effects of the Chlamydia trachomatis Cytotoxin CT166

    Directory of Open Access Journals (Sweden)

    Miriam Bothe

    2015-02-01

    Full Text Available The Gram-negative, intracellular bacterium Chlamydia trachomatis causes acute and chronic urogenital tract infection, potentially leading to infertility and ectopic pregnancy. The only partially characterized cytotoxin CT166 of serovar D exhibits a DXD motif, which is important for the enzymatic activity of many bacterial and mammalian type A glycosyltransferases, leading to the hypothesis that CT166 possess glycosyltransferase activity. CT166-expressing HeLa cells exhibit actin reorganization, including cell rounding, which has been attributed to the inhibition of the Rho-GTPases Rac/Cdc42. Exploiting the glycosylation-sensitive Ras(27H5 antibody, we here show that CT166 induces an epitope change in Ras, resulting in inhibited ERK and PI3K signaling and delayed cell cycle progression. Consistent with the hypothesis that these effects strictly depend on the DXD motif, CT166 with the mutated DXD motif causes neither Ras-ERK inhibition nor delayed cell cycle progression. In contrast, CT166 with the mutated DXD motif is still capable of inhibiting cell migration, suggesting that CT166 with the mutated DXD motif cannot be regarded as inactive in any case. Taken together, CT166 affects various fundamental cellular processes, strongly suggesting its importance for the intracellular survival of chlamydia.

  15. Regulation of TCF ETS-domain transcription factors by helix-loop-helix motifs.

    Science.gov (United States)

    Stinson, Julie; Inoue, Toshiaki; Yates, Paula; Clancy, Anne; Norton, John D; Sharrocks, Andrew D

    2003-08-15

    DNA binding by the ternary complex factor (TCF) subfamily of ETS-domain transcription factors is tightly regulated by intramolecular and intermolecular interactions. The helix-loop-helix (HLH)-containing Id proteins are trans-acting negative regulators of DNA binding by the TCFs. In the TCF, SAP-2/Net/ERP, intramolecular inhibition of DNA binding is promoted by the cis-acting NID region that also contains an HLH-like motif. The NID also acts as a transcriptional repression domain. Here, we have studied the role of HLH motifs in regulating DNA binding and transcription by the TCF protein SAP-1 and how Cdk-mediated phosphorylation affects the inhibitory activity of the Id proteins towards the TCFs. We demonstrate that the NID region of SAP-1 is an autoinhibitory motif that acts to inhibit DNA binding and also functions as a transcription repression domain. This region can be functionally replaced by fusion of Id proteins to SAP-1, whereby the Id moiety then acts to repress DNA binding in cis. Phosphorylation of the Ids by cyclin-Cdk complexes results in reduction in protein-protein interactions between the Ids and TCFs and relief of their DNA-binding inhibitory activity. In revealing distinct mechanisms through which HLH motifs modulate the activity of TCFs, our results therefore provide further insight into the role of HLH motifs in regulating TCF function and how the inhibitory properties of the trans-acting Id HLH proteins are themselves regulated by phosphorylation.

  16. Alanine substitutions in the GXXXG motif alter C99 cleavage by γ-secretase but not its dimerization.

    Science.gov (United States)

    Higashide, Hidekazu; Ishihara, Seiko; Nobuhara, Mika; Ihara, Yasuo; Funamoto, Satoru

    2017-03-01

    The amyloid β (Aβ) protein is a major component of senile plaques, one of the neuropathological hallmarks of Alzheimer's disease. Amyloidogenic processing of amyloid precursor protein (APP) by β- and γ-secretases leads to production of Aβ. APP contains tandem triple repeats of the GXXXG motif in its extracellular juxtamembrane and transmembrane regions. It is reported that the GXXXG motif is related to protein-protein interactions, but it remains controversial whether the GXXXG motif in APP is involved in substrate dimerization and whether dimerization affects γ-secretase-dependent cleavage. Therefore, the relationship between the GXXXG motifs, substrate dimerization, and γ-secretase-dependent cleavage sites remains unclear. Here, we applied blue native poly acrylamide gel electrophoresis to examine the effect of alanine substitutions within the GXXXG motifs of APP carboxyl terminal fragment (C99) on its dimerization and Aβ production. Surprisingly, alanine substitutions in the motif failed to alter C99 dimerization in detergent soluble state. Cell-based and solubilized γ-secretase assays demonstrated that increasing alanine substitutions in the motif tended to decrease long Aβ species such as Aβ42 and Aβ43 and to increase in short Aβ species concomitantly. Our data suggest that the GXXXG motif is crucial for Aβ production, but not for C99 dimerization. © 2016 International Society for Neurochemistry.

  17. Disparate requirements for the Walker A and B ATPase motifs of human RAD51D in homologous recombination.

    Science.gov (United States)

    Wiese, Claudia; Hinz, John M; Tebbs, Robert S; Nham, Peter B; Urbin, Salustra S; Collins, David W; Thompson, Larry H; Schild, David

    2006-01-01

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks (ICLs). Ectopic expression of wild-type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  18. Eukaryotic DNA Replication Fork.

    Science.gov (United States)

    Burgers, Peter M J; Kunkel, Thomas A

    2017-06-20

    This review focuses on the biogenesis and composition of the eukaryotic DNA replication fork, with an emphasis on the enzymes that synthesize DNA and repair discontinuities on the lagging strand of the replication fork. Physical and genetic methodologies aimed at understanding these processes are discussed. The preponderance of evidence supports a model in which DNA polymerase ε (Pol ε) carries out the bulk of leading strand DNA synthesis at an undisturbed replication fork. DNA polymerases α and δ carry out the initiation of Okazaki fragment synthesis and its elongation and maturation, respectively. This review also discusses alternative proposals, including cellular processes during which alternative forks may be utilized, and new biochemical studies with purified proteins that are aimed at reconstituting leading and lagging strand DNA synthesis separately and as an integrated replication fork.

  19. Blocking Modification of Eukaryotic Initiation 5A2 Antagonizes Cervical Carcinoma via Inhibition of RhoA/ROCK Signal Transduction Pathway.

    Science.gov (United States)

    Liu, Xiaojun; Chen, Dong; Liu, Jiamei; Chu, Zhangtao; Liu, Dongli

    2017-10-01

    Cervical carcinoma is one of the leading causes of cancer-related death for female worldwide. Eukaryotic initiation factor 5A2 belongs to the eukaryotic initiation factor 5A family and is proposed to be a key factor involved in the development of diverse cancers. In the current study, a series of in vivo and in vitro investigations were performed to characterize the role of eukaryotic initiation factor 5A2 in oncogenesis and metastasis of cervical carcinoma. The expression status of eukaryotic initiation factor 5A2 in 15 cervical carcinoma patients was quantified. Then, the effect of eukaryotic initiation factor 5A2 knockdown on in vivo tumorigenicity ability, cell proliferation, cell cycle distribution, and cell mobility of HeLa cells was measured. To uncover the mechanism driving the function of eukaryotic initiation factor 5A2 in cervical carcinoma, expression of members within RhoA/ROCK pathway was detected, and the results were further verified with an RhoA overexpression modification. The level of eukaryotic initiation factor 5A2 in cervical carcinoma samples was significantly higher than that in paired paratumor tissues ( P cycle arrest ( P ROCK I, and ROCK II were downregulated. The above-mentioned changes in eukaryotic initiation factor 5A2 knockdown cells were alleviated by the overexpression of RhoA. The major findings outlined in the current study confirmed the potential of eukaryotic initiation factor 5A2 as a promising prognosis predictor and therapeutic target for cervical carcinoma treatment. Also, our data inferred that eukaryotic initiation factor 5A2 might function in carcinogenesis of cervical carcinoma through an RhoA/ROCK-dependent manner.

  20. Revisiting the Relationship between Transposable Elements and the Eukaryotic Stress Response.

    Science.gov (United States)

    Horváth, Vivien; Merenciano, Miriam; González, Josefa

    2017-11-01

    A relationship between transposable elements (TEs) and the eukaryotic stress response was suggested in the first publications describing TEs. Since then, it has often been assumed that TEs are activated by stress, and that this activation is often beneficial for the organism. In recent years, the availability of new high-throughput experimental techniques has allowed further interrogation of the relationship between TEs and stress. By reviewing the recent literature, we conclude that although there is evidence for a beneficial effect of TE activation under stress conditions, the relationship between TEs and the eukaryotic stress response is quite complex. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. Geminin: a major DNA replication safeguard in higher eukaryotes

    DEFF Research Database (Denmark)

    Melixetian, Marina; Helin, Kristian

    2004-01-01

    Eukaryotes have evolved multiple mechanisms to restrict DNA replication to once per cell cycle. These mechanisms prevent relicensing of origins of replication after initiation of DNA replication in S phase until the end of mitosis. Most of our knowledge of mechanisms controlling prereplication...

  2. Recognition of prokaryotic and eukaryotic promoters using convolutional deep learning neural networks

    KAUST Repository

    Umarov, Ramzan

    2017-02-03

    Accurate computational identification of promoters remains a challenge as these key DNA regulatory regions have variable structures composed of functional motifs that provide gene-specific initiation of transcription. In this paper we utilize Convolutional Neural Networks (CNN) to analyze sequence characteristics of prokaryotic and eukaryotic promoters and build their predictive models. We trained a similar CNN architecture on promoters of five distant organisms: human, mouse, plant (Arabidopsis), and two bacteria (Escherichia coli and Bacillus subtilis). We found that CNN trained on sigma70 subclass of Escherichia coli promoter gives an excellent classification of promoters and non-promoter sequences (Sn = 0.90, Sp = 0.96, CC = 0.84). The Bacillus subtilis promoters identification CNN model achieves Sn = 0.91, Sp = 0.95, and CC = 0.86. For human, mouse and Arabidopsis promoters we employed CNNs for identification of two well-known promoter classes (TATA and non-TATA promoters). CNN models nicely recognize these complex functional regions. For human promoters Sn/Sp/CC accuracy of prediction reached 0.95/0.98/0,90 on TATA and 0.90/0.98/0.89 for non-TATA promoter sequences, respectively. For Arabidopsis we observed Sn/Sp/CC 0.95/0.97/0.91 (TATA) and 0.94/0.94/0.86 (non-TATA) promoters. Thus, the developed CNN models, implemented in CNNProm program, demonstrated the ability of deep learning approach to grasp complex promoter sequence characteristics and achieve significantly higher accuracy compared to the previously developed promoter prediction programs. We also propose random substitution procedure to discover positionally conserved promoter functional elements. As the suggested approach does not require knowledge of any specific promoter features, it can be easily extended to identify promoters and other complex functional regions in sequences of many other and especially newly sequenced genomes. The CNNProm program is available to run at web server http://www.softberry.com.

  3. Crystal structure and novel recognition motif of rho ADP-ribosylating C3 exoenzyme from Clostridium botulinum: structural insights for recognition specificity and catalysis.

    Science.gov (United States)

    Han, S; Arvai, A S; Clancy, S B; Tainer, J A

    2001-01-05

    Clostridium botulinum C3 exoenzyme inactivates the small GTP-binding protein family Rho by ADP-ribosylating asparagine 41, which depolymerizes the actin cytoskeleton. C3 thus represents a major family of the bacterial toxins that transfer the ADP-ribose moiety of NAD to specific amino acids in acceptor proteins to modify key biological activities in eukaryotic cells, including protein synthesis, differentiation, transformation, and intracellular signaling. The 1.7 A resolution C3 exoenzyme structure establishes the conserved features of the core NAD-binding beta-sandwich fold with other ADP-ribosylating toxins despite little sequence conservation. Importantly, the central core of the C3 exoenzyme structure is distinguished by the absence of an active site loop observed in many other ADP-ribosylating toxins. Unlike the ADP-ribosylating toxins that possess the active site loop near the central core, the C3 exoenzyme replaces the active site loop with an alpha-helix, alpha3. Moreover, structural and sequence similarities with the catalytic domain of vegetative insecticidal protein 2 (VIP2), an actin ADP-ribosyltransferase, unexpectedly implicates two adjacent, protruding turns, which join beta5 and beta6 of the toxin core fold, as a novel recognition specificity motif for this newly defined toxin family. Turn 1 evidently positions the solvent-exposed, aromatic side-chain of Phe209 to interact with the hydrophobic region of Rho adjacent to its GTP-binding site. Turn 2 evidently both places the Gln212 side-chain for hydrogen bonding to recognize Rho Asn41 for nucleophilic attack on the anomeric carbon of NAD ribose and holds the key Glu214 catalytic side-chain in the adjacent catalytic pocket. This proposed bipartite ADP-ribosylating toxin turn-turn (ARTT) motif places the VIP2 and C3 toxin classes into a single ARTT family characterized by analogous target protein recognition via turn 1 aromatic and turn 2 hydrogen-bonding side-chain moieties. Turn 2 centrally anchors

  4. Challenges in Whole-Genome Annotation of Pyrosequenced Eukaryotic Genomes

    Energy Technology Data Exchange (ETDEWEB)

    Kuo, Alan; Grigoriev, Igor

    2009-04-17

    Pyrosequencing technologies such as 454/Roche and Solexa/Illumina vastly lower the cost of nucleotide sequencing compared to the traditional Sanger method, and thus promise to greatly expand the number of sequenced eukaryotic genomes. However, the new technologies also bring new challenges such as shorter reads and new kinds and higher rates of sequencing errors, which complicate genome assembly and gene prediction. At JGI we are deploying 454 technology for the sequencing and assembly of ever-larger eukaryotic genomes. Here we describe our first whole-genome annotation of a purely 454-sequenced fungal genome that is larger than a yeast (>30 Mbp). The pezizomycotine (filamentous ascomycote) Aspergillus carbonarius belongs to the Aspergillus section Nigri species complex, members of which are significant as platforms for bioenergy and bioindustrial technology, as members of soil microbial communities and players in the global carbon cycle, and as agricultural toxigens. Application of a modified version of the standard JGI Annotation Pipeline has so far predicted ~;;10k genes. ~;;12percent of these preliminary annotations suffer a potential frameshift error, which is somewhat higher than the ~;;9percent rate in the Sanger-sequenced and conventionally assembled and annotated genome of fellow Aspergillus section Nigri member A. niger. Also,>90percent of A. niger genes have potential homologs in the A. carbonarius preliminary annotation. Weconclude, and with further annotation and comparative analysis expect to confirm, that 454 sequencing strategies provide a promising substrate for annotation of modestly sized eukaryotic genomes. We will also present results of annotation of a number of other pyrosequenced fungal genomes of bioenergy interest.

  5. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir; Alazmi, Meshari; Naser, Rayan Mohammad Mahmoud; Huser, Franceline; Momin, Afaque Ahmad Imtiyaz; Walkiewicz, Katarzyna Wiktoria; Canlas, Christian; Huser, Raphaë l; Ali, Amal J.; Merzaban, Jasmeen; Bajic, Vladimir B.; Gao, Xin; Arold, Stefan T.

    2018-01-01

    and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter

  6. TOPDOM: database of conservatively located domains and motifs in proteins.

    Science.gov (United States)

    Varga, Julia; Dobson, László; Tusnády, Gábor E

    2016-09-01

    The TOPDOM database-originally created as a collection of domains and motifs located consistently on the same side of the membranes in α-helical transmembrane proteins-has been updated and extended by taking into consideration consistently localized domains and motifs in globular proteins, too. By taking advantage of the recently developed CCTOP algorithm to determine the type of a protein and predict topology in case of transmembrane proteins, and by applying a thorough search for domains and motifs as well as utilizing the most up-to-date version of all source databases, we managed to reach a 6-fold increase in the size of the whole database and a 2-fold increase in the number of transmembrane proteins. TOPDOM database is available at http://topdom.enzim.hu The webpage utilizes the common Apache, PHP5 and MySQL software to provide the user interface for accessing and searching the database. The database itself is generated on a high performance computer. tusnady.gabor@ttk.mta.hu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  7. Critical analysis of eukaryotic phylogeny: a case study based on the HSP70 family.

    Science.gov (United States)

    Germot, A; Philippe, H

    1999-01-01

    Trichomonads, together with diplomonads and microsporidia, emerge at the base of the eukaryotic tree, on the basis of the small subunit rRNA phylogeny. However, phylogenies based on protein sequences such as tubulin are markedly different with these protists emerging much later. We have investigated 70 kDa heat-shock protein (HSP70), which could be a reliable phylogenetic marker. In eukaryotes, HSP70s are found in cytosol, endoplasmic reticulum, and organelles (mitochondria and chloroplasts). In Trichomonas vaginalis we identified nine different HSP70-encoding genes and sequenced three nearly complete cDNAs corresponding to cytosolic, endoplasmic reticulum, and mitochondrial-type HSP70. Phylogenies of eukaryotes were reconstructed using the classical methods while varying the number of species and characters considered. Almost all the undoubtedly monophyletic groups, defined by ultrastructural characters, were recovered. However, due to the long branch attraction phenomenon, the evolutionary rates were the main factor determining the position of species, even with the use of a close outgroup, which is an important advantage of HSP70 with respect to many other markers. Numerous variable sites are peculiar to Trichomonas and probably generated the artefactual placement of this species at the base of the eukaryotes or as the sister group of fast-evolving species. The inter-phyla relationships were not well supported and were sensitive to the reconstruction method, the number of species; and the quantity of information used. This lack of resolution could be explained by the very rapid diversification of eukaryotes, likely after the mitochondrial endosymbiosis.

  8. Evidence for the additions of clustered interacting nodes during the evolution of protein interaction networks from network motifs

    Directory of Open Access Journals (Sweden)

    Guo Hao

    2011-05-01

    Full Text Available Abstract Background High-throughput screens have revealed large-scale protein interaction networks defining most cellular functions. How the proteins were added to the protein interaction network during its growth is a basic and important issue. Network motifs represent the simplest building blocks of cellular machines and are of biological significance. Results Here we study the evolution of protein interaction networks from the perspective of network motifs. We find that in current protein interaction networks, proteins of the same age class tend to form motifs and such co-origins of motif constituents are affected by their topologies and biological functions. Further, we find that the proteins within motifs whose constituents are of the same age class tend to be densely interconnected, co-evolve and share the same biological functions, and these motifs tend to be within protein complexes. Conclusions Our findings provide novel evidence for the hypothesis of the additions of clustered interacting nodes and point out network motifs, especially the motifs with the dense topology and specific function may play important roles during this process. Our results suggest functional constraints may be the underlying driving force for such additions of clustered interacting nodes.

  9. Extreme Diversity of Diplonemid Eukaryotes in the Ocean

    Czech Academy of Sciences Publication Activity Database

    Flegontova, Olga; Flegontov, Pavel; Malviya, S.; Audic, S.; Wincker, P.; de Vargas, C.; Bowler, C.; Lukeš, Julius; Horák, Aleš

    2016-01-01

    Roč. 26, č. 22 (2016), s. 3060-3065 ISSN 0960-9822 R&D Projects: GA ČR GPP506/12/P931; GA ČR(CZ) GA14-23986S Institutional support: RVO:60077344 Keywords : virus-sized particles * microbial eukaryotes * sea-floor * phytoplankton * communities * euglenozoa * dispersal * ecosystem Subject RIV: EG - Zoology Impact factor: 8.851, year: 2016

  10. Strong eukaryotic IRESs have weak secondary structure.

    Directory of Open Access Journals (Sweden)

    Xuhua Xia

    Full Text Available BACKGROUND: The objective of this work was to investigate the hypothesis that eukaryotic Internal Ribosome Entry Sites (IRES lack secondary structure and to examine the generality of the hypothesis. METHODOLOGY/PRINCIPAL FINDINGS: IRESs of the yeast and the fruit fly are located in the 5'UTR immediately upstream of the initiation codon. The minimum folding energy (MFE of 60 nt RNA segments immediately upstream of the initiation codons was calculated as a proxy of secondary structure stability. MFE of the reverse complements of these 60 nt segments was also calculated. The relationship between MFE and empirically determined IRES activity was investigated to test the hypothesis that strong IRES activity is associated with weak secondary structure. We show that IRES activity in the yeast and the fruit fly correlates strongly with the structural stability, with highest IRES activity found in RNA segments that exhibit the weakest secondary structure. CONCLUSIONS: We found that a subset of eukaryotic IRESs exhibits very low secondary structure in the 5'-UTR sequences immediately upstream of the initiation codon. The consistency in results between the yeast and the fruit fly suggests a possible shared mechanism of cap-independent translation initiation that relies on an unstructured RNA segment.

  11. RNA Export through the NPC in Eukaryotes.

    Science.gov (United States)

    Okamura, Masumi; Inose, Haruko; Masuda, Seiji

    2015-03-20

    In eukaryotic cells, RNAs are transcribed in the nucleus and exported to the cytoplasm through the nuclear pore complex. The RNA molecules that are exported from the nucleus into the cytoplasm include messenger RNAs (mRNAs), ribosomal RNAs (rRNAs), transfer RNAs (tRNAs), small nuclear RNAs (snRNAs), micro RNAs (miRNAs), and viral mRNAs. Each RNA is transported by a specific nuclear export receptor. It is believed that most of the mRNAs are exported by Nxf1 (Mex67 in yeast), whereas rRNAs, snRNAs, and a certain subset of mRNAs are exported in a Crm1/Xpo1-dependent manner. tRNAs and miRNAs are exported by Xpot and Xpo5. However, multiple export receptors are involved in the export of some RNAs, such as 60S ribosomal subunit. In addition to these export receptors, some adapter proteins are required to export RNAs. The RNA export system of eukaryotic cells is also used by several types of RNA virus that depend on the machineries of the host cell in the nucleus for replication of their genome, therefore this review describes the RNA export system of two representative viruses. We also discuss the NPC anchoring-dependent mRNA export factors that directly recruit specific genes to the NPC.

  12. FTZ-Factor1 and Fushi tarazu interact via conserved nuclear receptor and coactivator motifs

    Science.gov (United States)

    Schwartz, Carol J.E.; Sampson, Heidi M.; Hlousek, Daniela; Percival-Smith, Anthony; Copeland, John W.R.; Simmonds, Andrew J.; Krause, Henry M.

    2001-01-01

    To activate transcription, most nuclear receptor proteins require coactivators that bind to their ligand-binding domains (LBDs). The Drosophila FTZ-Factor1 (FTZ-F1) protein is a conserved member of the nuclear receptor superfamily, but was previously thought to lack an AF2 motif, a motif that is required for ligand and coactivator binding. Here we show that FTZ-F1 does have an AF2 motif and that it is required to bind a coactivator, the homeodomain-containing protein Fushi tarazu (FTZ). We also show that FTZ contains an AF2-interacting nuclear receptor box, the first to be found in a homeodomain protein. Both interaction motifs are shown to be necessary for physical interactions in vitro and for functional interactions in developing embryos. These unexpected findings have important implications for the conserved homologs of the two proteins. PMID:11157757

  13. Next-Generation Sequencing Assessment of Eukaryotic Diversity in Oil Sands Tailings Ponds Sediments and Surface Water.

    Science.gov (United States)

    Aguilar, Maria; Richardson, Elisabeth; Tan, BoonFei; Walker, Giselle; Dunfield, Peter F; Bass, David; Nesbø, Camilla; Foght, Julia; Dacks, Joel B

    2016-11-01

    Tailings ponds in the Athabasca oil sands (Canada) contain fluid wastes, generated by the extraction of bitumen from oil sands ores. Although the autochthonous prokaryotic communities have been relatively well characterized, almost nothing is known about microbial eukaryotes living in the anoxic soft sediments of tailings ponds or in the thin oxic layer of water that covers them. We carried out the first next-generation sequencing study of microbial eukaryotic diversity in oil sands tailings ponds. In metagenomes prepared from tailings sediment and surface water, we detected very low numbers of sequences encoding eukaryotic small subunit ribosomal RNA representing seven major taxonomic groups of protists. We also produced and analysed three amplicon-based 18S rRNA libraries prepared from sediment samples. These revealed a more diverse set of taxa, 169 different OTUs encompassing up to eleven higher order groups of eukaryotes, according to detailed classification using homology searching and phylogenetic methods. The 10 most abundant OTUs accounted for > 90% of the total of reads, vs. large numbers of rare OTUs (< 1% abundance). Despite the anoxic and hydrocarbon-enriched nature of the environment, the tailings ponds harbour complex communities of microbial eukaryotes indicating that these organisms should be taken into account when studying the microbiology of the oil sands. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.

  14. Motifs in triadic random graphs based on Steiner triple systems

    Science.gov (United States)

    Winkler, Marco; Reichardt, Jörg

    2013-08-01

    Conventionally, pairwise relationships between nodes are considered to be the fundamental building blocks of complex networks. However, over the last decade, the overabundance of certain subnetwork patterns, i.e., the so-called motifs, has attracted much attention. It has been hypothesized that these motifs, instead of links, serve as the building blocks of network structures. Although the relation between a network's topology and the general properties of the system, such as its function, its robustness against perturbations, or its efficiency in spreading information, is the central theme of network science, there is still a lack of sound generative models needed for testing the functional role of subgraph motifs. Our work aims to overcome this limitation. We employ the framework of exponential random graph models (ERGMs) to define models based on triadic substructures. The fact that only a small portion of triads can actually be set independently poses a challenge for the formulation of such models. To overcome this obstacle, we use Steiner triple systems (STSs). These are partitions of sets of nodes into pair-disjoint triads, which thus can be specified independently. Combining the concepts of ERGMs and STSs, we suggest generative models capable of generating ensembles of networks with nontrivial triadic Z-score profiles. Further, we discover inevitable correlations between the abundance of triad patterns, which occur solely for statistical reasons and need to be taken into account when discussing the functional implications of motif statistics. Moreover, we calculate the degree distributions of our triadic random graphs analytically.

  15. Exon silencing by UAGG motifs in response to neuronal excitation.

    Directory of Open Access Journals (Sweden)

    Ping An

    2007-02-01

    Full Text Available Alternative pre-mRNA splicing plays fundamental roles in neurons by generating functional diversity in proteins associated with the communication and connectivity of the synapse. The CI cassette of the NMDA R1 receptor is one of a variety of exons that show an increase in exon skipping in response to cell excitation, but the molecular nature of this splicing responsiveness is not yet understood. Here we investigate the molecular basis for the induced changes in splicing of the CI cassette exon in primary rat cortical cultures in response to KCl-induced depolarization using an expression assay with a tight neuron-specific readout. In this system, exon silencing in response to neuronal excitation was mediated by multiple UAGG-type silencing motifs, and transfer of the motifs to a constitutive exon conferred a similar responsiveness by gain of function. Biochemical analysis of protein binding to UAGG motifs in extracts prepared from treated and mock-treated cortical cultures showed an increase in nuclear hnRNP A1-RNA binding activity in parallel with excitation. Evidence for the role of the NMDA receptor and calcium signaling in the induced splicing response was shown by the use of specific antagonists, as well as cell-permeable inhibitors of signaling pathways. Finally, a wider role for exon-skipping responsiveness is shown to involve additional exons with UAGG-related silencing motifs, and transcripts involved in synaptic functions. These results suggest that, at the post-transcriptional level, excitable exons such as the CI cassette may be involved in strategies by which neurons mount adaptive responses to hyperstimulation.

  16. A Conserved GPG-Motif in the HIV-1 Nef Core Is Required for Principal Nef-Activities.

    Directory of Open Access Journals (Sweden)

    Marta Martínez-Bonet

    Full Text Available To find out new determinants required for Nef activity we performed a functional alanine scanning analysis along a discrete but highly conserved region at the core of HIV-1 Nef. We identified the GPG-motif, located at the 121-137 region of HIV-1 NL4.3 Nef, as a novel protein signature strictly required for the p56Lck dependent Nef-induced CD4-downregulation in T-cells. Since the Nef-GPG motif was dispensable for CD4-downregulation in HeLa-CD4 cells, Nef/AP-1 interaction and Nef-dependent effects on Tf-R trafficking, the observed effects on CD4 downregulation cannot be attributed to structure constraints or to alterations on general protein trafficking. Besides, we found that the GPG-motif was also required for Nef-dependent inhibition of ring actin re-organization upon TCR triggering and MHCI downregulation, suggesting that the GPG-motif could actively cooperate with the Nef PxxP motif for these HIV-1 Nef-related effects. Finally, we observed that the Nef-GPG motif was required for optimal infectivity of those viruses produced in T-cells. According to these findings, we propose the conserved GPG-motif in HIV-1 Nef as functional region required for HIV-1 infectivity and therefore with a potential interest for the interference of Nef activity during HIV-1 infection.

  17. Codon based co-occurrence network motifs in human mitochondria

    Directory of Open Access Journals (Sweden)

    Pramod Shinde

    2017-10-01

    Full Text Available The nucleotide polymorphism in human mitochondrial genome (mtDNA tolled by codon position bias plays an indispensable role in human population dispersion and expansion. Herein, we constructed genome-wide nucleotide co-occurrence networks using a massive data consisting of five different geographical regions and around 3000 samples for each region. We developed a powerful network model to describe complex mitochondrial evolutionary patterns between codon and non-codon positions. It was interesting to report a different evolution of Asian genomes than those of the rest which is divulged by network motifs. We found evidence that mtDNA undergoes substantial amounts of adaptive evolution, a finding which was supported by a number of previous studies. The dominance of higher order motifs indicated the importance of long-range nucleotide co-occurrence in genomic diversity. Most notably, codon motifs apparently underpinned the preferences among codon positions for co-evolution which is probably highly biased during the origin of the genetic code. Our analyses manifested that codon position co-evolution is very well conserved across human sub-populations and independently maintained within human sub-populations implying the selective role of evolutionary processes on codon position co-evolution. Ergo, this study provided a framework to investigate cooperative genomic interactions which are critical in underlying complex mitochondrial evolution.

  18. A Repeating Sulfated Galactan Motif Resuscitates Dormant Micrococcus luteus Bacteria.

    Science.gov (United States)

    Böttcher, Thomas; Szamosvári, Dávid; Clardy, Jon

    2018-07-01

    Only a small fraction of bacteria can autonomously initiate growth on agar plates. Nongrowing bacteria typically enter a metabolically inactive dormant state and require specific chemical trigger factors or signals to exit this state and to resume growth. Micrococcus luteus has become a model organism for this important yet poorly understood phenomenon. Only a few resuscitation signals have been described to date, and all of them are produced endogenously by bacterial species. We report the discovery of a novel type of resuscitation signal that allows M. luteus to grow on agar but not agarose plates. Fractionation of the agar polysaccharide complex and sulfation of agarose allowed us to identify the signal as highly sulfated saccharides found in agar or carrageenans. Purification of hydrolyzed κ-carrageenan ultimately led to the identification of the signal as a small fragment of a large linear polysaccharide, i.e., an oligosaccharide of five or more sugars with a repeating disaccharide motif containing d-galactose-4-sulfate (G4S) 1,4-linked to 3,6-anhydro-α-d-galactose (DA), G4S-(DA-G4S) n ≥2 IMPORTANCE Most environmental bacteria cannot initiate growth on agar plates, but they can flourish on the same plates once growth is initiated. While there are a number of names for and manifestations of this phenomenon, the underlying cause appears to be the requirement for a molecular signal indicating safe growing conditions. Micrococcus luteus has become a model organism for studying this growth initiation process, often called resuscitation, because of its apparent connection with the persistent or dormant form of Mycobacterium tuberculosis , an important human pathogen. In this report, we identify a highly sulfated saccharide from agar or carrageenans that robustly resuscitates dormant M. luteus on agarose plates. We identified and characterized the signal as a small repeating disaccharide motif. Our results indicate that signals inherent in or absent from the

  19. The conjugal-bed motif in the Alcestis Barcinonensis: two notes

    Directory of Open Access Journals (Sweden)

    Rosario Moreno Soldevila

    2011-06-01

    Full Text Available This paper focuses on the centrality occupied by the conjugal-bed motif in the anonymous poem known as Alcestis Barcinonensis, in the light of which two new interpretations of lines 21-22 and 83-85 are provided. In the first passage, beato … toro should be read as a subtle allusion to marital love, one of the central themes of the poem; in the second, uestigia alludes to a well-known literary motif related to the bed of love, thus providing a more accurate interpretation of the post mortem fidelity which Alcestis demands from her husband.

  20. Evolution of glutamate dehydrogenase genes: evidence for lateral gene transfer within and between prokaryotes and eukaryotes

    Directory of Open Access Journals (Sweden)

    Roger Andrew J

    2003-06-01

    Full Text Available Abstract Background Lateral gene transfer can introduce genes with novel functions into genomes or replace genes with functionally similar orthologs or paralogs. Here we present a study of the occurrence of the latter gene replacement phenomenon in the four gene families encoding different classes of glutamate dehydrogenase (GDH, to evaluate and compare the patterns and rates of lateral gene transfer (LGT in prokaryotes and eukaryotes. Results We extend the taxon sampling of gdh genes with nine new eukaryotic sequences and examine the phylogenetic distribution pattern of the various GDH classes in combination with maximum likelihood phylogenetic analyses. The distribution pattern analyses indicate that LGT has played a significant role in the evolution of the four gdh gene families. Indeed, a number of gene transfer events are identified by phylogenetic analyses, including numerous prokaryotic intra-domain transfers, some prokaryotic inter-domain transfers and several inter-domain transfers between prokaryotes and microbial eukaryotes (protists. Conclusion LGT has apparently affected eukaryotes and prokaryotes to a similar extent within the gdh gene families. In the absence of indications that the evolution of the gdh gene families is radically different from other families, these results suggest that gene transfer might be an important evolutionary mechanism in microbial eukaryote genome evolution.

  1. Three-dimensional structure of a glycosylated cell surface antigen from D. discoideum: a primordial adhesion motif

    International Nuclear Information System (INIS)

    Mabbutt, B.C.; Swarbrick, J.; Cubeddu, L.; Hill, A.

    1999-01-01

    Full text: We have determined the solution structure of pre-spore specific antigen (PsA), a predominant cell surface glycoprotein from the slime mould Dictyostelium discoideum. The structure and function of this protein suggests that it serves as a molecular signal for multicellular organisation, and that it may also be an adhesion motif mediating direct cell-cell contact. PsA consists of a 90-residue N-terminal globular domain tethered to the cell membrane via a heavily O-glycosylated stalk and a GPI anchor. No homologous sequences have been identified for the N-terminal domain. At Macquarie University, the D. discoideum organism has been well developed as a eukaryotic expression host for glycosylated proteins. For NMR, we have engineered a soluble form of PsA (residues 1-122) containing the globular 'head' and the glycopeptide linker. 15 N- and 15 N/ 13 C-labelled PsA was generated in this organism via a protocol that is readily adaptable for the cost-effective production of milligram quantities of other isotopically labelled recombinant proteins. Using 3D heteronuclear NMR, we have solved the three-dimensional structure of the PsA glycoprotein. It defines an eight stranded β-sandwich of five-on-three topology in a unique arrangement. A long loop is constrained by a cis proline residue and a disulphide bond to form an opening across one end of the sandwich, exposing portions of the hydrophobic interior. We postulate that this distortion of the sandwich fold structures a binding site. Structural and dynamics information was also obtained concerning the intact glycopeptide linker of the protein, which comprises a repeating P-T-V-T motif. In our recombinant form, each Thr residue is modified by a single GlcNAc sugar. This simple structure yields interpretable NMR spectra, which show the glycosylated linker to be in extended conformation, and undergoing distinctly different mobility from the globular domain. These same sugar residues provide an ideal attachment

  2. Introns Protect Eukaryotic Genomes from Transcription-Associated Genetic Instability.

    Science.gov (United States)

    Bonnet, Amandine; Grosso, Ana R; Elkaoutari, Abdessamad; Coleno, Emeline; Presle, Adrien; Sridhara, Sreerama C; Janbon, Guilhem; Géli, Vincent; de Almeida, Sérgio F; Palancade, Benoit

    2017-08-17

    Transcription is a source of genetic instability that can notably result from the formation of genotoxic DNA:RNA hybrids, or R-loops, between the nascent mRNA and its template. Here we report an unexpected function for introns in counteracting R-loop accumulation in eukaryotic genomes. Deletion of endogenous introns increases R-loop formation, while insertion of an intron into an intronless gene suppresses R-loop accumulation and its deleterious impact on transcription and recombination in yeast. Recruitment of the spliceosome onto the mRNA, but not splicing per se, is shown to be critical to attenuate R-loop formation and transcription-associated genetic instability. Genome-wide analyses in a number of distant species differing in their intron content, including human, further revealed that intron-containing genes and the intron-richest genomes are best protected against R-loop accumulation and subsequent genetic instability. Our results thereby provide a possible rationale for the conservation of introns throughout the eukaryotic lineage. Copyright © 2017 Elsevier Inc. All rights reserved.

  3. Eukaryotic snoRNAs: a paradigm for gene expression flexibility.

    Science.gov (United States)

    Dieci, Giorgio; Preti, Milena; Montanini, Barbara

    2009-08-01

    Small nucleolar RNAs (snoRNAs) are one of the most ancient and numerous families of non-protein-coding RNAs (ncRNAs). The main function of snoRNAs - to guide site-specific rRNA modification - is the same in Archaea and all eukaryotic lineages. In contrast, as revealed by recent genomic and RNomic studies, their genomic organization and expression strategies are the most varied. Seemingly snoRNA coding units have adopted, in the course of evolution, all the possible ways of being transcribed, thus providing a unique paradigm of gene expression flexibility. By focusing on representative fungal, plant and animal genomes, we review here all the documented types of snoRNA gene organization and expression, and we provide a comprehensive account of snoRNA expressional freedom by precisely estimating the frequency, in each genome, of each type of genomic organization. We finally discuss the relevance of snoRNA genomic studies for our general understanding of ncRNA family evolution and expression in eukaryotes.

  4. Regulated eukaryotic DNA replication origin firing with purified proteins.

    Science.gov (United States)

    Yeeles, Joseph T P; Deegan, Tom D; Janska, Agnieszka; Early, Anne; Diffley, John F X

    2015-03-26

    Eukaryotic cells initiate DNA replication from multiple origins, which must be tightly regulated to promote precise genome duplication in every cell cycle. To accomplish this, initiation is partitioned into two temporally discrete steps: a double hexameric minichromosome maintenance (MCM) complex is first loaded at replication origins during G1 phase, and then converted to the active CMG (Cdc45-MCM-GINS) helicase during S phase. Here we describe the reconstitution of budding yeast DNA replication initiation with 16 purified replication factors, made from 42 polypeptides. Origin-dependent initiation recapitulates regulation seen in vivo. Cyclin-dependent kinase (CDK) inhibits MCM loading by phosphorylating the origin recognition complex (ORC) and promotes CMG formation by phosphorylating Sld2 and Sld3. Dbf4-dependent kinase (DDK) promotes replication by phosphorylating MCM, and can act either before or after CDK. These experiments define the minimum complement of proteins, protein kinase substrates and co-factors required for regulated eukaryotic DNA replication.

  5. The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

    Directory of Open Access Journals (Sweden)

    Roberts Richard J

    2008-05-01

    Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360, cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R. BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.

  6. A Simple Decision Rule for Recognition of Poly(A) Tail Signal Motifs in Human Genome

    KAUST Repository

    AbouEisha, Hassan M.

    2015-05-12

    Background is the numerous attempts were made to predict motifs in genomic sequences that correspond to poly (A) tail signals. Vast portion of this effort has been directed to a plethora of nonlinear classification methods. Even when such approaches yield good discriminant results, identifying dominant features of regulatory mechanisms nevertheless remains a challenge. In this work, we look at decision rules that may help identifying such features. Findings are we present a simple decision rule for classification of candidate poly (A) tail signal motifs in human genomic sequence obtained by evaluating features during the construction of gradient boosted trees. We found that values of a single feature based on the frequency of adenine in the genomic sequence surrounding candidate signal and the number of consecutive adenine molecules in a well-defined region immediately following the motif displays good discriminative potential in classification of poly (A) tail motifs for samples covered by the rule. Conclusions is the resulting simple rule can be used as an efficient filter in construction of more complex poly(A) tail motifs classification algorithms.

  7. An Interactive Exercise To Learn Eukaryotic Cell Structure and Organelle Function.

    Science.gov (United States)

    Klionsky, Daniel J.; Tomashek, John J.

    1999-01-01

    Describes a cooperative, interactive problem-solving exercise for studying eukaryotic cell structure and function. Highlights the dynamic aspects of movement through the cell. Contains 15 references. (WRM)

  8. Metabolism in anoxic permeable sediments is dominated by eukaryotic dark fermentation

    Science.gov (United States)

    Bourke, Michael F.; Marriott, Philip J.; Glud, Ronnie N.; Hasler-Sheetal, Harald; Kamalanathan, Manoj; Beardall, John; Greening, Chris; Cook, Perran L. M.

    2017-01-01

    Permeable sediments are common across continental shelves and are critical contributors to marine biogeochemical cycling. Organic matter in permeable sediments is dominated by microalgae, which as eukaryotes have different anaerobic metabolic pathways to bacteria and archaea. Here we present analyses of flow-through reactor experiments showing that dissolved inorganic carbon is produced predominantly as a result of anaerobic eukaryotic metabolic activity. In our experiments, anaerobic production of dissolved inorganic carbon was consistently accompanied by large dissolved H2 production rates, suggesting the presence of fermentation. The production of both dissolved inorganic carbon and H2 persisted following administration of broad spectrum bactericidal antibiotics, but ceased following treatment with metronidazole. Metronidazole inhibits the ferredoxin/hydrogenase pathway of fermentative eukaryotic H2 production, suggesting that pathway as the source of H2 and dissolved inorganic carbon production. Metabolomic analysis showed large increases in lipid production at the onset of anoxia, consistent with documented pathways of anoxic dark fermentation in microalgae. Cell counts revealed a predominance of microalgae in the sediments. H2 production was observed in dark anoxic cultures of diatoms (Fragilariopsis sp.) and a chlorophyte (Pyramimonas) isolated from the study site, substantiating the hypothesis that microalgae undertake fermentation. We conclude that microalgal dark fermentation could be an important energy-conserving pathway in permeable sediments.

  9. A set of enhanced green fluorescent protein concatemers for quantitative determination of nuclear localization signal strength.

    Science.gov (United States)

    Böhm, Jennifer; Thavaraja, Ramya; Giehler, Susanne; Nalaskowski, Marcus M

    2017-09-15

    Regulated transport of proteins between nucleus and cytoplasm is an important process in the eukaryotic cell. In most cases, active nucleo-cytoplasmic protein transport is mediated by nuclear localization signal (NLS) and/or nuclear export signal (NES) motifs. In this study, we developed a set of vectors expressing enhanced GFP (EGFP) concatemers ranging from 2 to 12 subunits (2xEGFP to 12xEGFP) for analysis of NLS strength. As shown by in gel GFP fluorescence analysis and αGFP Western blotting, EGFP concatemers are expressed as fluorescent full-length proteins in eukaryotic cells. As expected, nuclear localization of concatemeric EGFPs decreases with increasing molecular weight. By oligonucleotide ligation this set of EGFP concatemers can be easily fused to NLS motifs. After determination of intracellular localization of EGFP concatemers alone and fused to different NLS motifs we calculated the size of a hypothetic EGFP concatemer showing a defined distribution of EGFP fluorescence between nucleus and cytoplasm (n/c ratio = 2). Clear differences of the size of the hypothetic EGFP concatemer depending on the fused NLS motif were observed. Therefore, we propose to use the size of this hypothetic concatemer as quantitative indicator for comparing strength of different NLS motifs. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. An Analysis of Multi-type Relational Interactions in FMA Using Graph Motifs with Disjointness Constraints

    Science.gov (United States)

    Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

    2012-01-01

    The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation. PMID:23304382

  11. An analysis of multi-type relational interactions in FMA using graph motifs with disjointness constraints.

    Science.gov (United States)

    Zhang, Guo-Qiang; Luo, Lingyun; Ogbuji, Chime; Joslyn, Cliff; Mejino, Jose; Sahoo, Satya S

    2012-01-01

    The interaction of multiple types of relationships among anatomical classes in the Foundational Model of Anatomy (FMA) can provide inferred information valuable for quality assurance. This paper introduces a method called Motif Checking (MOCH) to study the effects of such multi-relation type interactions for detecting logical inconsistencies as well as other anomalies represented by the motifs. MOCH represents patterns of multi-type interaction as small labeled (with multiple types of edges) sub-graph motifs, whose nodes represent class variables, and labeled edges represent relational types. By representing FMA as an RDF graph and motifs as SPARQL queries, fragments of FMA are automatically obtained as auditing candidates. Leveraging the scalability and reconfigurability of Semantic Web Technology, we performed exhaustive analyses of a variety of labeled sub-graph motifs. The quality assurance feature of MOCH comes from the distinct use of a subset of the edges of the graph motifs as constraints for disjointness, whereby bringing in rule-based flavor to the approach as well. With possible disjointness implied by antonyms, we performed manual inspection of the resulting FMA fragments and tracked down sources of abnormal inferred conclusions (logical inconsistencies), which are amendable for programmatic revision of the FMA. Our results demonstrate that MOCH provides a unique source of valuable information for quality assurance. Since our approach is general, it is applicable to any ontological system with an OWL representation.

  12. Eukaryotic resistance to fluoride toxicity mediated by a widespread family of fluoride export proteins.

    Science.gov (United States)

    Li, Sanshu; Smith, Kathryn D; Davis, Jared H; Gordon, Patricia B; Breaker, Ronald R; Strobel, Scott A

    2013-11-19

    Fluorine is an abundant element and is toxic to organisms from bacteria to humans, but the mechanisms by which eukaryotes resist fluoride toxicity are unknown. The Escherichia coli gene crcB was recently shown to be regulated by a fluoride-responsive riboswitch, implicating it in fluoride response. There are >8,000 crcB homologs across all domains of life, indicating that it has an important role in biology. Here we demonstrate that eukaryotic homologs [renamed FEX (fluoride exporter)] function in fluoride export. FEX KOs in three eukaryotic model organisms, Neurospora crassa, Saccharomyces cerevisiae, and Candida albicans, are highly sensitized to fluoride (>200-fold) but not to other halides. Some of these KO strains are unable to grow in fluoride concentrations found in tap water. Using the radioactive isotope of fluoride, (18)F, we developed an assay to measure the intracellular fluoride concentration and show that the FEX deletion strains accumulate fluoride in excess of the external concentration, providing direct evidence of FEX function in fluoride efflux. In addition, they are more sensitive to lower pH in the presence of fluoride. These results demonstrate that eukaryotic FEX genes encode a previously unrecognized class of fluoride exporter necessary for survival in standard environmental conditions.

  13. EuGI: a novel resource for studying genomic islands to facilitate horizontal gene transfer detection in eukaryotes.

    Science.gov (United States)

    Clasen, Frederick Johannes; Pierneef, Rian Ewald; Slippers, Bernard; Reva, Oleg

    2018-05-03

    Genomic islands (GIs) are inserts of foreign DNA that have potentially arisen through horizontal gene transfer (HGT). There are evidences that GIs can contribute significantly to the evolution of prokaryotes. The acquisition of GIs through HGT in eukaryotes has, however, been largely unexplored. In this study, the previously developed GI prediction tool, SeqWord Gene Island Sniffer (SWGIS), is modified to predict GIs in eukaryotic chromosomes. Artificial simulations are used to estimate ratios of predicting false positive and false negative GIs by inserting GIs into different test chromosomes and performing the SWGIS v2.0 algorithm. Using SWGIS v2.0, GIs are then identified in 36 fungal, 22 protozoan and 8 invertebrate genomes. SWGIS v2.0 predicts GIs in large eukaryotic chromosomes based on the atypical nucleotide composition of these regions. Averages for predicting false negative and false positive GIs were 20.1% and 11.01% respectively. A total of 10,550 GIs were identified in 66 eukaryotic species with 5299 of these GIs coding for at least one functional protein. The EuGI web-resource, freely accessible at http://eugi.bi.up.ac.za , was developed that allows browsing the database created from identified GIs and genes within GIs through an interactive and visual interface. SWGIS v2.0 along with the EuGI database, which houses GIs identified in 66 different eukaryotic species, and the EuGI web-resource, provide the first comprehensive resource for studying HGT in eukaryotes.

  14. SSTRAP: A computational model for genomic motif discovery ...

    African Journals Online (AJOL)

    Computational methods can potentially provide high-quality prediction of biological molecules such as DNA binding sites and Transcription factors and therefore reduce the time needed for experimental verification and challenges associated with experimental methods. These biological molecules or motifs have significant ...

  15. Eu-Detect: An algorithm for detecting eukaryotic sequences in ...

    Indian Academy of Sciences (India)

    Supplementary figure 1. Plots depicting the classification accuracy of Eu-Detect with various combinations of. 'cumulative sequence count' (40K, 50K, 60K, 70K, 80K) and 'coverage threshold' (20%, 30%, 40%, 50%, 60%, 70%,. 80%). While blue bars represent Eu-Detect's average classification accuracy with eukaryotic ...

  16. EuMicroSatdb: A database for microsatellites in the sequenced genomes of eukaryotes

    Directory of Open Access Journals (Sweden)

    Grover Atul

    2007-07-01

    Full Text Available Abstract Background Microsatellites have immense utility as molecular markers in different fields like genome characterization and mapping, phylogeny and evolutionary biology. Existing microsatellite databases are of limited utility for experimental and computational biologists with regard to their content and information output. EuMicroSatdb (Eukaryotic MicroSatellite database http://ipu.ac.in/usbt/EuMicroSatdb.htm is a web based relational database for easy and efficient positional mining of microsatellites from sequenced eukaryotic genomes. Description A user friendly web interface has been developed for microsatellite data retrieval using Active Server Pages (ASP. The backend database codes for data extraction and assembly have been written using Perl based scripts and C++. Precise need based microsatellites data retrieval is possible using different input parameters like microsatellite type (simple perfect or compound perfect, repeat unit length (mono- to hexa-nucleotide, repeat number, microsatellite length and chromosomal location in the genome. Furthermore, information about clustering of different microsatellites in the genome can also be retrieved. Finally, to facilitate primer designing for PCR amplification of any desired microsatellite locus, 200 bp upstream and downstream sequences are provided. Conclusion The database allows easy systematic retrieval of comprehensive information about simple and compound microsatellites, microsatellite clusters and their locus coordinates in 31 sequenced eukaryotic genomes. The information content of the database is useful in different areas of research like gene tagging, genome mapping, population genetics, germplasm characterization and in understanding microsatellite dynamics in eukaryotic genomes.

  17. Structural motifs of pre-nucleation clusters.

    Science.gov (United States)

    Zhang, Y; Türkmen, I R; Wassermann, B; Erko, A; Rühl, E

    2013-10-07

    Structural motifs of pre-nucleation clusters prepared in single, optically levitated supersaturated aqueous aerosol microparticles containing CaBr2 as a model system are reported. Cluster formation is identified by means of X-ray absorption in the Br K-edge regime. The salt concentration beyond the saturation point is varied by controlling the humidity in the ambient atmosphere surrounding the 15-30 μm microdroplets. This leads to the formation of metastable supersaturated liquid particles. Distinct spectral shifts in near-edge spectra as a function of salt concentration are observed, in which the energy position of the Br K-edge is red-shifted by up to 7.1 ± 0.4 eV if the dilute solution is compared to the solid. The K-edge positions of supersaturated solutions are found between these limits. The changes in electronic structure are rationalized in terms of the formation of pre-nucleation clusters. This assumption is verified by spectral simulations using first-principle density functional theory and molecular dynamics calculations, in which structural motifs are considered, explaining the experimental results. These consist of solvated CaBr2 moieties, rather than building blocks forming calcium bromide hexahydrates, the crystal system that is formed by drying aqueous CaBr2 solutions.

  18. Insertion of tetracysteine motifs into dopamine transporter extracellular domains.

    Directory of Open Access Journals (Sweden)

    Deanna M Navaroli

    Full Text Available The neuronal dopamine transporter (DAT is a major determinant of extracellular dopamine (DA levels and is the primary target for a variety of addictive and therapeutic psychoactive drugs. DAT is acutely regulated by protein kinase C (PKC activation and amphetamine exposure, both of which modulate DAT surface expression by endocytic trafficking. In order to use live imaging approaches to study DAT endocytosis, methods are needed to exclusively label the DAT surface pool. The use of membrane impermeant, sulfonated biarsenic dyes holds potential as one such approach, and requires introduction of an extracellular tetracysteine motif (tetraCys; CCPGCC to facilitate dye binding. In the current study, we took advantage of intrinsic proline-glycine (Pro-Gly dipeptides encoded in predicted DAT extracellular domains to introduce tetraCys motifs into DAT extracellular loops 2, 3, and 4. [(3H]DA uptake studies, surface biotinylation and fluorescence microscopy in PC12 cells indicate that tetraCys insertion into the DAT second extracellular loop results in a functional transporter that maintains PKC-mediated downregulation. Introduction of tetraCys into extracellular loops 3 and 4 yielded DATs with severely compromised function that failed to mature and traffic to the cell surface. This is the first demonstration of successful introduction of a tetracysteine motif into a DAT extracellular domain, and may hold promise for use of biarsenic dyes in live DAT imaging studies.

  19. Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

    Science.gov (United States)

    Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

    2013-09-02

    In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome

  20. Spatiotemporal network motif reveals the biological traits of developmental gene regulatory networks in Drosophila melanogaster

    Directory of Open Access Journals (Sweden)

    Kim Man-Sun

    2012-05-01

    Full Text Available Abstract Background Network motifs provided a “conceptual tool” for understanding the functional principles of biological networks, but such motifs have primarily been used to consider static network structures. Static networks, however, cannot be used to reveal time- and region-specific traits of biological systems. To overcome this limitation, we proposed the concept of a “spatiotemporal network motif,” a spatiotemporal sequence of network motifs of sub-networks which are active only at specific time points and body parts. Results On the basis of this concept, we analyzed the developmental gene regulatory network of the Drosophila melanogaster embryo. We identified spatiotemporal network motifs and investigated their distribution pattern in time and space. As a result, we found how key developmental processes are temporally and spatially regulated by the gene network. In particular, we found that nested feedback loops appeared frequently throughout the entire developmental process. From mathematical simulations, we found that mutual inhibition in the nested feedback loops contributes to the formation of spatial expression patterns. Conclusions Taken together, the proposed concept and the simulations can be used to unravel the design principle of developmental gene regulatory networks.

  1. Functional motifs responsible for human metapneumovirus M2-2-mediated innate immune evasion.

    Science.gov (United States)

    Chen, Yu; Deng, Xiaoling; Deng, Junfang; Zhou, Jiehua; Ren, Yuping; Liu, Shengxuan; Prusak, Deborah J; Wood, Thomas G; Bao, Xiaoyong

    2016-12-01

    Human metapneumovirus (hMPV) is a major cause of lower respiratory infection in young children. Repeated infections occur throughout life, but its immune evasion mechanisms are largely unknown. We recently found that hMPV M2-2 protein elicits immune evasion by targeting mitochondrial antiviral-signaling protein (MAVS), an antiviral signaling molecule. However, the molecular mechanisms underlying such inhibition are not known. Our mutagenesis studies revealed that PDZ-binding motifs, 29-DEMI-32 and 39-KEALSDGI-46, located in an immune inhibitory region of M2-2, are responsible for M2-2-mediated immune evasion. We also found both motifs prevent TRAF5 and TRAF6, the MAVS downstream adaptors, to be recruited to MAVS, while the motif 39-KEALSDGI-46 also blocks TRAF3 migrating to MAVS. In parallel, these TRAFs are important in activating transcription factors NF-kB and/or IRF-3 by hMPV. Our findings collectively demonstrate that M2-2 uses its PDZ motifs to launch the hMPV immune evasion through blocking the interaction of MAVS and its downstream TRAFs. Copyright © 2016 Elsevier Inc. All rights reserved.

  2. PDL1 Signals through Conserved Sequence Motifs to Overcome Interferon-Mediated Cytotoxicity

    Directory of Open Access Journals (Sweden)

    Maria Gato-Cañas

    2017-08-01

    Full Text Available PDL1 blockade produces remarkable clinical responses, thought to occur by T cell reactivation through prevention of PDL1-PD1 T cell inhibitory interactions. Here, we find that PDL1 cell-intrinsic signaling protects cancer cells from interferon (IFN cytotoxicity and accelerates tumor progression. PDL1 inhibited IFN signal transduction through a conserved class of sequence motifs that mediate crosstalk with IFN signaling. Abrogation of PDL1 expression or antibody-mediated PDL1 blockade strongly sensitized cancer cells to IFN cytotoxicity through a STAT3/caspase-7-dependent pathway. Moreover, somatic mutations found in human carcinomas within these PDL1 sequence motifs disrupted motif regulation, resulting in PDL1 molecules with enhanced protective activities from type I and type II IFN cytotoxicity. Overall, our results reveal a mode of action of PDL1 in cancer cells as a first line of defense against IFN cytotoxicity.

  3. Discovery and validation of information theory-based transcription factor and cofactor binding site motifs.

    Science.gov (United States)

    Lu, Ruipeng; Mucaki, Eliseos J; Rogan, Peter K

    2017-03-17

    Data from ChIP-seq experiments can derive the genome-wide binding specificities of transcription factors (TFs) and other regulatory proteins. We analyzed 765 ENCODE ChIP-seq peak datasets of 207 human TFs with a novel motif discovery pipeline based on recursive, thresholded entropy minimization. This approach, while obviating the need to compensate for skewed nucleotide composition, distinguishes true binding motifs from noise, quantifies the strengths of individual binding sites based on computed affinity and detects adjacent cofactor binding sites that coordinate with the targets of primary, immunoprecipitated TFs. We obtained contiguous and bipartite information theory-based position weight matrices (iPWMs) for 93 sequence-specific TFs, discovered 23 cofactor motifs for 127 TFs and revealed six high-confidence novel motifs. The reliability and accuracy of these iPWMs were determined via four independent validation methods, including the detection of experimentally proven binding sites, explanation of effects of characterized SNPs, comparison with previously published motifs and statistical analyses. We also predict previously unreported TF coregulatory interactions (e.g. TF complexes). These iPWMs constitute a powerful tool for predicting the effects of sequence variants in known binding sites, performing mutation analysis on regulatory SNPs and predicting previously unrecognized binding sites and target genes. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  4. Do motifs reflect evolved function?--No convergent evolution of genetic regulatory network subgraph topologies.

    Science.gov (United States)

    Knabe, Johannes F; Nehaniv, Chrystopher L; Schilstra, Maria J

    2008-01-01

    Methods that analyse the topological structure of networks have recently become quite popular. Whether motifs (subgraph patterns that occur more often than in randomized networks) have specific functions as elementary computational circuits has been cause for debate. As the question is difficult to resolve with currently available biological data, we approach the issue using networks that abstractly model natural genetic regulatory networks (GRNs) which are evolved to show dynamical behaviors. Specifically one group of networks was evolved to be capable of exhibiting two different behaviors ("differentiation") in contrast to a group with a single target behavior. In both groups we find motif distribution differences within the groups to be larger than differences between them, indicating that evolutionary niches (target functions) do not necessarily mold network structure uniquely. These results show that variability operators can have a stronger influence on network topologies than selection pressures, especially when many topologies can create similar dynamics. Moreover, analysis of motif functional relevance by lesioning did not suggest that motifs were of greater importance to the functioning of the network than arbitrary subgraph patterns. Only when drastically restricting network size, so that one motif corresponds to a whole functionally evolved network, was preference for particular connection patterns found. This suggests that in non-restricted, bigger networks, entanglement with the rest of the network hinders topological subgraph analysis.

  5. Identification of a Baeyer-Villiger monooxygenase sequence motif

    NARCIS (Netherlands)

    Fraaije, MW; Kamerbeek, NM; van Berkel, WJH; Janssen, DB; Kamerbeek, Nanne M.; Berkel, Willem J.H. van

    2002-01-01

    Baeyer-Villiger monooxygenases (BVMOs) form a distinct class of flavoproteins that catalyze the insertion of an oxygen atom in a C-C bond using dioxygen and NAD(P)H. Using newly characterized BVMO sequences, we have uncovered a BVMO-identifying sequence motif: FXGXXXRXXXW(P/D). Studies with

  6. Distribution and Diversity of Microbial Eukaryotes in Bathypelagic Waters of the South China Sea.

    Science.gov (United States)

    Xu, Dapeng; Jiao, Nianzhi; Ren, Rui; Warren, Alan

    2017-05-01

    Little is known about the biodiversity of microbial eukaryotes in the South China Sea, especially in waters at bathyal depths. Here, we employed SSU rDNA gene sequencing to reveal the diversity and community structure across depth and distance gradients in the South China Sea. Vertically, the highest alpha diversity was found at 75-m depth. The communities of microbial eukaryotes were clustered into shallow-, middle-, and deep-water groups according to the depth from which they were collected, indicating a depth-related diversity and distribution pattern. Rhizaria sequences dominated the microeukaryote community and occurred in all samples except those from less than 50-m deep, being most abundant near the sea floor where they contributed ca. 64-97% and 40-74% of the total sequences and OTUs recovered, respectively. A large portion of rhizarian OTUs has neither a nearest named neighbor nor a nearest neighbor in the GenBank database which indicated the presence of new phylotypes in the South China Sea. Given their overwhelming abundance and richness, further phylogenetic analysis of rhizarians were performed and three new genetic clusters were revealed containing sequences retrieved from the deep waters of the South China Sea. Our results shed light on the diversity and community structure of microbial eukaryotes in this not yet fully explored area. © 2016 The Author(s) Journal of Eukaryotic Microbiology © 2016 International Society of Protistologists.

  7. Assessment of algorithms for inferring positional weight matrix motifs of transcription factor binding sites using protein binding microarray data.

    Directory of Open Access Journals (Sweden)

    Yaron Orenstein

    Full Text Available The new technology of protein binding microarrays (PBMs allows simultaneous measurement of the binding intensities of a transcription factor to tens of thousands of synthetic double-stranded DNA probes, covering all possible 10-mers. A key computational challenge is inferring the binding motif from these data. We present a systematic comparison of four methods developed specifically for reconstructing a binding site motif represented as a positional weight matrix from PBM data. The reconstructed motifs were evaluated in terms of three criteria: concordance with reference motifs from the literature and ability to predict in vivo and in vitro bindings. The evaluation encompassed over 200 transcription factors and some 300 assays. The results show a tradeoff between how the methods perform according to the different criteria, and a dichotomy of method types. Algorithms that construct motifs with low information content predict PBM probe ranking more faithfully, while methods that produce highly informative motifs match reference motifs better. Interestingly, in predicting high-affinity binding, all methods give far poorer results for in vivo assays compared to in vitro assays.

  8. Factoring local sequence composition in motif significance analysis.

    Science.gov (United States)

    Ng, Patrick; Keich, Uri

    2008-01-01

    We recently introduced a biologically realistic and reliable significance analysis of the output of a popular class of motif finders. In this paper we further improve our significance analysis by incorporating local base composition information. Relying on realistic biological data simulation, as well as on FDR analysis applied to real data, we show that our method is significantly better than the increasingly popular practice of using the normal approximation to estimate the significance of a finder's output. Finally we turn to leveraging our reliable significance analysis to improve the actual motif finding task. Specifically, endowing a variant of the Gibbs Sampler with our improved significance analysis we demonstrate that de novo finders can perform better than has been perceived. Significantly, our new variant outperforms all the finders reviewed in a recently published comprehensive analysis of the Harbison genome-wide binding location data. Interestingly, many of these finders incorporate additional information such as nucleosome positioning and the significance of binding data.

  9. Characterization of an eukaryotic peptide deformylase from Plasmodium falciparum.

    Science.gov (United States)

    Bracchi-Ricard, V; Nguyen, K T; Zhou, Y; Rajagopalan, P T; Chakrabarti, D; Pei, D

    2001-12-15

    Ribosomal protein synthesis in eubacteria and eukaryotic organelles initiates with an N-formylmethionyl-tRNA(i), resulting in N-terminal formylation of all nascent polypeptides. Peptide deformylase (PDF) catalyzes the subsequent removal of the N-terminal formyl group from the majority of bacterial proteins. Until recently, PDF has been thought as an enzyme unique to the bacterial kingdom. Searches of the genomic DNA databases identified several genes that encode proteins of high sequence homology to bacterial PDF from eukaryotic organisms. The cDNA encoding Plasmodium falciparum PDF (PfPDF) has been cloned and overexpressed in Escherichia coli. The recombinant protein is catalytically active in deformylating N-formylated peptides, shares many of the properties of bacterial PDF, and is inhibited by specific PDF inhibitors. Western blot analysis indicated expression of mature PfPDF in trophozoite, schizont, and segmenter stages of intraerythrocytic development. These results provide strong evidence that a functional PDF is present in P. falciparum. In addition, PDF inhibitors inhibited the growth of P. falciparum in the intraerythrocytic culture. (c)2001 Elsevier Science.

  10. DNA regulatory motif selection based on support vector machine ...

    African Journals Online (AJOL)

    ... machine (SVM) and its application in microarray experiment of Kashin-Beck disease. ... speed and amount of the corresponding mRNA in gene replication process. ... and revealed that some motifs may be related to the immune reactions.

  11. A novel fibronectin binding motif in MSCRAMMs targets F3 modules.

    Directory of Open Access Journals (Sweden)

    Sabitha Prabhakaran

    Full Text Available BBK32 is a surface expressed lipoprotein and fibronectin (Fn-binding microbial surface component recognizing adhesive matrix molecule (MSCRAMM of Borrelia burgdorferi, the causative agent of Lyme disease. Previous studies from our group showed that BBK32 is a virulence factor in experimental Lyme disease and located the Fn-binding region to residues 21-205 of the lipoprotein.Studies aimed at identifying interacting sites between BBK32 and Fn revealed an interaction between the MSCRAMM and the Fn F3 modules. Further analysis of this interaction showed that BBK32 can cause the aggregation of human plasma Fn in a similar concentration-dependent manner to that of anastellin, the superfibronectin (sFn inducing agent. The resulting Fn aggregates are conformationally distinct from plasma Fn as indicated by a change in available thermolysin cleavage sites. Recombinant BBK32 and anastellin affect the structure of Fn matrices formed by cultured fibroblasts and inhibit endothelial cell proliferation similarly. Within BBK32, we have located the sFn-forming activity to a region between residues 160 and 175 which contains two sequence motifs that are also found in anastellin. Synthetic peptides mimicking these motifs induce Fn aggregation, whereas a peptide with a scrambled sequence motif was inactive, suggesting that these motifs represent the sFn-inducing sequence.We conclude that BBK32 induces the formation of Fn aggregates that are indistinguishable from those formed by anastellin. The results of this study provide evidence for how bacteria can target host proteins to manipulate host cell activities.

  12. Molecular dynamics simulations of electrostatics and hydration distributions around RNA and DNA motifs

    Science.gov (United States)

    Marlowe, Ashley E.; Singh, Abhishek; Semichaevsky, Andrey V.; Yingling, Yaroslava G.

    2009-03-01

    Nucleic acid nanoparticles can self-assembly through the formation of complementary loop-loop interactions or stem-stem interactions. Presence and concentration of ions can significantly affect the self-assembly process and the stability of the nanostructure. In this presentation we use explicit molecular dynamics simulations to examine the variations in cationic distributions and hydration environment around DNA and RNA helices and loop-loop interactions. Our simulations show that the potassium and sodium ionic distributions are different around RNA and DNA motifs which could be indicative of ion mediated relative stability of loop-loop complexes. Moreover in RNA loop-loop motifs ions are consistently present and exchanged through a distinct electronegative channel. We will also show how we used the specific RNA loop-loop motif to design a RNA hexagonal nanoparticle.

  13. The N-terminal region of eukaryotic translation initiation factor 5A signals to nuclear localization of the protein

    International Nuclear Information System (INIS)

    Parreiras-e-Silva, Lucas T.; Gomes, Marcelo D.; Oliveira, Eduardo B.; Costa-Neto, Claudio M.

    2007-01-01

    The eukaryotic translation initiation factor 5A (eIF5A) is a ubiquitous protein of eukaryotic and archaeal organisms which undergoes hypusination, a unique post-translational modification. We have generated a polyclonal antibody against murine eIF5A, which in immunocytochemical assays in B16-F10 cells revealed that the endogenous protein is preferentially localized to the nuclear region. We therefore analyzed possible structural features present in eIF5A proteins that could be responsible for that characteristic. Multiple sequence alignment analysis of eIF5A proteins from different eukaryotic and archaeal organisms showed that the former sequences have an extended N-terminal segment. We have then performed in silico prediction analyses and constructed different truncated forms of murine eIF5A to verify any possible role that the N-terminal extension might have in determining the subcellular localization of the eIF5A in eukaryotic organisms. Our results indicate that the N-terminal extension of the eukaryotic eIF5A contributes in signaling this protein to nuclear localization, despite of bearing no structural similarity with classical nuclear localization signals

  14. GNG Motifs Can Replace a GGG Stretch during G-Quadruplex Formation in a Context Dependent Manner.

    Directory of Open Access Journals (Sweden)

    Kohal Das

    Full Text Available G-quadruplexes are one of the most commonly studied non-B DNA structures. Generally, these structures are formed using a minimum of 4, three guanine tracts, with connecting loops ranging from one to seven. Recent studies have reported deviation from this general convention. One such deviation is the involvement of bulges in the guanine tracts. In this study, guanines along with bulges, also referred to as GNG motifs have been extensively studied using recently reported HOX11 breakpoint fragile region I as a model template. By strategic mutagenesis approach we show that the contribution from continuous G-tracts may be dispensible during G-quadruplex formation when such motifs are flanked by GNGs. Importantly, the positioning and number of GNG/GNGNG can also influence the formation of G-quadruplexes. Further, we assessed three genomic regions from HIF1 alpha, VEGF and SHOX gene for G-quadruplex formation using GNG motifs. We show that HIF1 alpha sequence harbouring GNG motifs can fold into intramolecular G-quadruplex. In contrast, GNG motifs in mutant VEGF sequence could not participate in structure formation, suggesting that the usage of GNG is context dependent. Importantly, we show that when two continuous stretches of guanines are flanked by two independent GNG motifs in a naturally occurring sequence (SHOX, it can fold into an intramolecular G-quadruplex. Finally, we show the specific binding of G-quadruplex binding protein, Nucleolin and G-quadruplex antibody, BG4 to SHOX G-quadruplex. Overall, our study provides novel insights into the role of GNG motifs in G-quadruplex structure formation which may have both physiological and pathological implications.

  15. Clustering and Candidate Motif Detection in Exosomal miRNAs by Application of Machine Learning Algorithms.

    Science.gov (United States)

    Gaur, Pallavi; Chaturvedi, Anoop

    2017-07-22

    The clustering pattern and motifs give immense information about any biological data. An application of machine learning algorithms for clustering and candidate motif detection in miRNAs derived from exosomes is depicted in this paper. Recent progress in the field of exosome research and more particularly regarding exosomal miRNAs has led much bioinformatic-based research to come into existence. The information on clustering pattern and candidate motifs in miRNAs of exosomal origin would help in analyzing existing, as well as newly discovered miRNAs within exosomes. Along with obtaining clustering pattern and candidate motifs in exosomal miRNAs, this work also elaborates the usefulness of the machine learning algorithms that can be efficiently used and executed on various programming languages/platforms. Data were clustered and sequence candidate motifs were detected successfully. The results were compared and validated with some available web tools such as 'BLASTN' and 'MEME suite'. The machine learning algorithms for aforementioned objectives were applied successfully. This work elaborated utility of machine learning algorithms and language platforms to achieve the tasks of clustering and candidate motif detection in exosomal miRNAs. With the information on mentioned objectives, deeper insight would be gained for analyses of newly discovered miRNAs in exosomes which are considered to be circulating biomarkers. In addition, the execution of machine learning algorithms on various language platforms gives more flexibility to users to try multiple iterations according to their requirements. This approach can be applied to other biological data-mining tasks as well.

  16. Overexpression of GmERF5, a new member of the soybean EAR motif-containing ERF transcription factor, enhances resistance to Phytophthora sojae in soybean.

    Science.gov (United States)

    Dong, Lidong; Cheng, Yingxin; Wu, Junjiang; Cheng, Qun; Li, Wenbin; Fan, Sujie; Jiang, Liangyu; Xu, Zhaolong; Kong, Fanjiang; Zhang, Dayong; Xu, Pengfei; Zhang, Shuzhen

    2015-05-01

    Phytophthora root and stem rot of soybean [Glycine max (L.) Merr.], caused by Phytophthora sojae Kaufmann and Gerdemann, is a destructive disease throughout the soybean planting regions in the world. Here, we report insights into the function and underlying mechanisms of a novel ethylene response factor (ERF) in soybean, namely GmERF5, in host responses to P. sojae. GmERF5-overexpressing transgenic soybean exhibited significantly enhanced resistance to P. sojae and positively regulated the expression of the PR10, PR1-1, and PR10-1 genes. Sequence analysis suggested that GmERF5 contains an AP2/ERF domain of 58 aa and a conserved ERF-associated amphiphilic repression (EAR) motif in its C-terminal region. Following stress treatments, GmERF5 was significantly induced by P. sojae, ethylene (ET), abscisic acid (ABA), and salicylic acid (SA). The activity of the GmERF5 promoter (GmERF5P) was upregulated in tobacco leaves with ET, ABA, Phytophthora nicotianae, salt, and drought treatments, suggesting that GmERF5 could be involved not only in the induced defence response but also in the ABA-mediated pathway of salt and drought tolerance. GmERF5 could bind to the GCC-box element and act as a repressor of gene transcription. It was targeted to the nucleus when transiently expressed in Arabidopsis protoplasts. GmERF5 interacted with a basic helix-loop-helix transcription factor (GmbHLH) and eukaryotic translation initiation factor (GmEIF) both in yeast cells and in planta. To the best of our knowledge, GmERF5 is the first soybean EAR motif-containing ERF transcription factor demonstrated to be involved in the response to pathogen infection. © The Author 2015. Published by Oxford University Press on behalf of the Society for Experimental Biology. All rights reserved. For permissions, please email: journals.permissions@oup.com.

  17. Genetic structure and evolution of the Vps25 family, a yeast ESCRT-II component

    Directory of Open Access Journals (Sweden)

    Slater Ruth

    2006-08-01

    Full Text Available Abstract Background Vps25p is the product of yeast gene VPS25 and is found in an endosomal sorting complex required for transport (ESCRT-II, along with Vps22p and Vps36p. This complex is essential for sorting of ubiquitinated biosynthetic and endosomal cargoes into endosomes. Results We found that VPS25 is a highly conserved and widely expressed eukaryotic gene, with single orthologs in chromalveolate, excavate, amoebozoan, plant, fungal and metazoan species. Two paralogs were found in Trichomonas vaginalis. An ortholog was strikingly absent from the Encephalitozoon cuniculi genome. Intron positions were analyzed in VPS25 from 36 species. We found evidence for five ancestral VPS25 introns, intron loss, and single instances of intron gain (a Paramecium species and intron slippage (Theileria species. Processed pseudogenes were identified in four mammalian genomes, with a notable absence in the mouse genome. Two retropseudogenes were found in the chimpanzee genome, one more recently inserted, and one evolving from a common primate ancestor. The amino acid sequences of 119 Vps25 orthologs are aligned, compared with the known secondary structure of yeast Vps25p, and used to carry out phylogenetic analysis. Residues in two amino-terminal PPXY motifs (motif I and II, involved in dimerization of Vps25p and interaction with Vps22p and Vps36p, were closely, but not absolutely conserved. Specifically, motif I was absent in Vps25 homologs of chromalveolates, euglenozoa, and diplomonads. A highly conserved carboxy-terminal lysine was identified, which suggests Vps25 is ubiquitinated. Arginine-83 of yeast Vps25p involved in Vps22p interaction was highly, but not absolutely, conserved. Human tissue expression analysis showed universal expression. Conclusion We have identified 119 orthologs of yeast Vps25p. Expression of mammalian VPS25 in a wide range of tissues, and the presence in a broad range of eukaryotic species, indicates a basic role in eukaryotic cell

  18. Microbial eukaryote plankton communities of high-mountain lakes from three continents exhibit strong biogeographic patterns.

    Science.gov (United States)

    Filker, Sabine; Sommaruga, Ruben; Vila, Irma; Stoeck, Thorsten

    2016-05-01

    Microbial eukaryotes hold a key role in aquatic ecosystem functioning. Yet, their diversity in freshwater lakes, particularly in high-mountain lakes, is relatively unknown compared with the marine environment. Low nutrient availability, low water temperature and high ultraviolet radiation make most high-mountain lakes extremely challenging habitats for life and require specific molecular and physiological adaptations. We therefore expected that these ecosystems support a plankton diversity that differs notably from other freshwater lakes. In addition, we hypothesized that the communities under study exhibit geographic structuring. Our rationale was that geographic dispersal of small-sized eukaryotes in high-mountain lakes over continental distances seems difficult. We analysed hypervariable V4 fragments of the SSU rRNA gene to compare the genetic microbial eukaryote diversity in high-mountain lakes located in the European Alps, the Chilean Altiplano and the Ethiopian Bale Mountains. Microbial eukaryotes were not globally distributed corroborating patterns found for bacteria, multicellular animals and plants. Instead, the plankton community composition emerged as a highly specific fingerprint of a geographic region even on higher taxonomic levels. The intraregional heterogeneity of the investigated lakes was mirrored in shifts in microbial eukaryote community structure, which, however, was much less pronounced compared with interregional beta-diversity. Statistical analyses revealed that on a regional scale, environmental factors are strong predictors for plankton community structures in high-mountain lakes. While on long-distance scales (>10 000 km), isolation by distance is the most plausible scenario, on intermediate scales (up to 6000 km), both contemporary environmental factors and historical contingencies interact to shift plankton community structures. © 2016 John Wiley & Sons Ltd.

  19. Human telomeric DNA: G-quadruplex, i-motif and Watson–Crick double helix

    Science.gov (United States)

    Phan, Anh Tuân; Mergny, Jean-Louis

    2002-01-01

    Human telomeric DNA composed of (TTAGGG/CCCTAA)n repeats may form a classical Watson–Crick double helix. Each individual strand is also prone to quadruplex formation: the G-rich strand may adopt a G-quadruplex conformation involving G-quartets whereas the C-rich strand may fold into an i-motif based on intercalated C·C+ base pairs. Using an equimolar mixture of the telomeric oligonucleotides d[AGGG(TTAGGG)3] and d[(CCCTAA)3CCCT], we defined which structures existed and which would be the predominant species under a variety of experimental conditions. Under near-physiological conditions of pH, temperature and salt concentration, telomeric DNA was predominantly in a double-helix form. However, at lower pH values or higher temperatures, the G-quadruplex and/or the i-motif efficiently competed with the duplex. We also present kinetic and thermodynamic data for duplex association and for G-quadruplex/i-motif unfolding. PMID:12409451

  20. Organofluorine chemistry: synthesis and conformation of vicinal fluoromethylene motifs.

    Science.gov (United States)

    O'Hagan, David

    2012-04-20

    The C-F bond is the most polar bond in organic chemistry, and thus the bond has a relatively large dipole moment with a significant -ve charge density on the fluorine atom and correspondingly a +ve charge density on carbon. The electrostatic nature of the bond renders it the strongest one in organic chemistry. However, the fluorine atom itself is nonpolarizable, and thus, despite the charge localization on fluorine, it is a poor hydrogen-bonding acceptor. These properties of the C-F bond make it attractive in the design of nonviscous but polar organic compounds, with a polarity limited to influencing the intramolecular nature of the molecule and less so intermolecular interactions with the immediate environment. In this Perspective, the synthesis of aliphatic chains carrying multivicinal fluoromethylene motifs is described. It emerges that the dipoles of adjacent C-F bonds orientate relative to each other, and thus, individual diastereoisomers display different backbone carbon chain conformations. These conformational preferences recognize the influence of the well-known gauche effect associated with 1,2-difluoroethane but extend to considering 1,3-fluorine-fluorine dipolar repulsions. The synthesis of carbon chains carrying two, three, four, five, and six vicinal fluoromethylene motifs is described, with an emphasis on our own research contributions. These motifs obey almost predictable conformational behavior, and they emerge as candidates for inclusion in the design of performance organic molecules. © 2012 American Chemical Society

  1. Mechanism for activation of the growth factor-activated AGC kinases by turn motif phosphorylation

    DEFF Research Database (Denmark)

    Hauge, Camilla; Antal, Torben L; Hirschberg, Daniel

    2007-01-01

    investigated the role of the third, so-called turn motif phosphate, also located in the tail, in the AGC kinases PKB, S6K, RSK, MSK, PRK and PKC. We report cooperative action of the HM phosphate and the turn motif phosphate, because it binds a phosphoSer/Thr-binding site above the glycine-rich loop within...

  2. The Q Motif Is Involved in DNA Binding but Not ATP Binding in ChlR1 Helicase.

    Directory of Open Access Journals (Sweden)

    Hao Ding

    Full Text Available Helicases are molecular motors that couple the energy of ATP hydrolysis to the unwinding of structured DNA or RNA and chromatin remodeling. The conversion of energy derived from ATP hydrolysis into unwinding and remodeling is coordinated by seven sequence motifs (I, Ia, II, III, IV, V, and VI. The Q motif, consisting of nine amino acids (GFXXPXPIQ with an invariant glutamine (Q residue, has been identified in some, but not all helicases. Compared to the seven well-recognized conserved helicase motifs, the role of the Q motif is less acknowledged. Mutations in the human ChlR1 (DDX11 gene are associated with a unique genetic disorder known as Warsaw Breakage Syndrome, which is characterized by cellular defects in genome maintenance. To examine the roles of the Q motif in ChlR1 helicase, we performed site directed mutagenesis of glutamine to alanine at residue 23 in the Q motif of ChlR1. ChlR1 recombinant protein was overexpressed and purified from HEK293T cells. ChlR1-Q23A mutant abolished the helicase activity of ChlR1 and displayed reduced DNA binding ability. The mutant showed impaired ATPase activity but normal ATP binding. A thermal shift assay revealed that ChlR1-Q23A has a melting point value similar to ChlR1-WT. Partial proteolysis mapping demonstrated that ChlR1-WT and Q23A have a similar globular structure, although some subtle conformational differences in these two proteins are evident. Finally, we found ChlR1 exists and functions as a monomer in solution, which is different from FANCJ, in which the Q motif is involved in protein dimerization. Taken together, our results suggest that the Q motif is involved in DNA binding but not ATP binding in ChlR1 helicase.

  3. Non-coding RNA regulation in pathogenic bacteria located inside eukaryotic cells

    NARCIS (Netherlands)

    Ortega, Alvaro D.; Quereda, Juan J; Pucciarelli, M Graciela; García-del Portillo, Francisco

    2014-01-01

    Intracellular bacterial pathogens have evolved distinct lifestyles inside eukaryotic cells. Some pathogens coexist with the infected cell in an obligate intracellular state, whereas others transit between the extracellular and intracellular environment. Adaptation to these intracellular lifestyles

  4. EUPAN enables pan-genome studies of a large number of eukaryotic genomes.

    Science.gov (United States)

    Hu, Zhiqiang; Sun, Chen; Lu, Kuang-Chen; Chu, Xixia; Zhao, Yue; Lu, Jinyuan; Shi, Jianxin; Wei, Chaochun

    2017-08-01

    Pan-genome analyses are routinely carried out for bacteria to interpret the within-species gene presence/absence variations (PAVs). However, pan-genome analyses are rare for eukaryotes due to the large sizes and higher complexities of their genomes. Here we proposed EUPAN, a eukaryotic pan-genome analysis toolkit, enabling automatic large-scale eukaryotic pan-genome analyses and detection of gene PAVs at a relatively low sequencing depth. In the previous studies, we demonstrated the effectiveness and high accuracy of EUPAN in the pan-genome analysis of 453 rice genomes, in which we also revealed widespread gene PAVs among individual rice genomes. Moreover, EUPAN can be directly applied to the current re-sequencing projects primarily focusing on single nucleotide polymorphisms. EUPAN is implemented in Perl, R and C ++. It is supported under Linux and preferred for a computer cluster with LSF and SLURM job scheduling system. EUPAN together with its standard operating procedure (SOP) is freely available for non-commercial use (CC BY-NC 4.0) at http://cgm.sjtu.edu.cn/eupan/index.html . ccwei@sjtu.edu.cn or jianxin.shi@sjtu.edu.cn. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  5. Diatoms dominate the eukaryotic metatranscriptome during spring in coastal 'dead zone' sediments.

    Science.gov (United States)

    Broman, Elias; Sachpazidou, Varvara; Dopson, Mark; Hylander, Samuel

    2017-10-11

    An important characteristic of marine sediments is the oxygen concentration that affects many central metabolic processes. There has been a widespread increase in hypoxia in coastal systems (referred to as 'dead zones') mainly caused by eutrophication. Hence, it is central to understand the metabolism and ecology of eukaryotic life in sediments during changing oxygen conditions. Therefore, we sampled coastal 'dead zone' Baltic Sea sediment during autumn and spring, and analysed the eukaryotic metatranscriptome from field samples and after incubation in the dark under oxic or anoxic conditions. Bacillariophyta (diatoms) dominated the eukaryotic metatranscriptome in spring and were also abundant during autumn. A large fraction of the diatom RNA reads was associated with the photosystems suggesting a constitutive expression in darkness. Microscope observation showed intact diatom cells and these would, if hatched, represent a significant part of the pelagic phytoplankton biomass. Oxygenation did not significantly change the relative proportion of diatoms nor resulted in any major shifts in metabolic 'signatures'. By contrast, diatoms rapidly responded when exposed to light suggesting that light is limiting diatom development in hypoxic sediments. Hence, it is suggested that diatoms in hypoxic sediments are on 'standby' to exploit the environment if they reach suitable habitats. © 2017 The Author(s).

  6. CRN13 candidate effectors from plant and animal eukaryotic pathogens are DNA-binding proteins which trigger host DNA damage response.

    Science.gov (United States)

    Ramirez-Garcés, Diana; Camborde, Laurent; Pel, Michiel J C; Jauneau, Alain; Martinez, Yves; Néant, Isabelle; Leclerc, Catherine; Moreau, Marc; Dumas, Bernard; Gaulin, Elodie

    2016-04-01

    To successfully colonize their host, pathogens produce effectors that can interfere with host cellular processes. Here we investigated the function of CRN13 candidate effectors produced by plant pathogenic oomycetes and detected in the genome of the amphibian pathogenic chytrid fungus Batrachochytrium dendrobatidis (BdCRN13). When expressed in Nicotiana, AeCRN13, from the legume root pathogen Aphanomyces euteiches, increases the susceptibility of the leaves to the oomycete Phytophthora capsici. When transiently expressed in amphibians or plant cells, AeCRN13 and BdCRN13 localize to the cell nuclei, triggering aberrant cell development and eventually causing cell death. Using Förster resonance energy transfer experiments in plant cells, we showed that both CRN13s interact with nuclear DNA and trigger plant DNA damage response (DDR). Mutating key amino acid residues in a predicted HNH-like endonuclease motif abolished the interaction of AeCRN13 with DNA, the induction of DDR and the enhancement of Nicotiana susceptibility to P. capsici. Finally, H2AX phosphorylation, a marker of DNA damage, and enhanced expression of genes involved in the DDR were observed in A. euteiches-infected Medicago truncatula roots. These results show that CRN13 from plant and animal eukaryotic pathogens promotes host susceptibility by targeting nuclear DNA and inducing DDR. © 2015 The Authors. New Phytologist © 2015 New Phytologist Trust.

  7. Fine-tuning of T-cell development by the CD3γ di-leucine-based TCR-sorting motif

    DEFF Research Database (Denmark)

    Lauritsen, Jens Peter Holst; Boding, Lasse; Buus, Terkild B

    2015-01-01

    The CD3γ di-leucine-based (diL) receptor-sorting motif plays a central role in TCR down-regulation and in clonal expansion of virus-specific T cells. However, the role of the CD3γ diL motif in T-cell development is not known. In this study, we show that protein kinase C-induced TCR down-regulatio......The CD3γ di-leucine-based (diL) receptor-sorting motif plays a central role in TCR down-regulation and in clonal expansion of virus-specific T cells. However, the role of the CD3γ diL motif in T-cell development is not known. In this study, we show that protein kinase C-induced TCR down...

  8. Anionic lipids and the maintenance of membrane electrostatics in eukaryotes.

    Science.gov (United States)

    Platre, Matthieu Pierre; Jaillais, Yvon

    2017-02-01

    A wide range of signaling processes occurs at the cell surface through the reversible association of proteins from the cytosol to the plasma membrane. Some low abundant lipids are enriched at the membrane of specific compartments and thereby contribute to the identity of cell organelles by acting as biochemical landmarks. Lipids also influence membrane biophysical properties, which emerge as an important feature in specifying cellular territories. Such parameters are crucial for signal transduction and include lipid packing, membrane curvature and electrostatics. In particular, membrane electrostatics specifies the identity of the plasma membrane inner leaflet. Membrane surface charges are carried by anionic phospholipids, however the exact nature of the lipid(s) that powers the plasma membrane electrostatic field varies among eukaryotes and has been hotly debated during the last decade. Herein, we discuss the role of anionic lipids in setting up plasma membrane electrostatics and we compare similarities and differences that were found in different eukaryotic cells.

  9. Automated Eukaryotic Gene Structure Annotation Using EVidenceModeler and the Program to Assemble Spliced Alignments

    Energy Technology Data Exchange (ETDEWEB)

    Haas, B J; Salzberg, S L; Zhu, W; Pertea, M; Allen, J E; Orvis, J; White, O; Buell, C R; Wortman, J R

    2007-12-10

    EVidenceModeler (EVM) is presented as an automated eukaryotic gene structure annotation tool that reports eukaryotic gene structures as a weighted consensus of all available evidence. EVM, when combined with the Program to Assemble Spliced Alignments (PASA), yields a comprehensive, configurable annotation system that predicts protein-coding genes and alternatively spliced isoforms. Our experiments on both rice and human genome sequences demonstrate that EVM produces automated gene structure annotation approaching the quality of manual curation.

  10. The position of the Gly-xxx-Gly motif in transmembrane segments modulates dimer affinity.

    Science.gov (United States)

    Johnson, Rachel M; Rath, Arianna; Deber, Charles M

    2006-12-01

    Although the intrinsic low solubility of membrane proteins presents challenges to their high-resolution structure determination, insight into the amino acid sequence features and forces that stabilize their folds has been provided through study of sequence-dependent helix-helix interactions between single transmembrane (TM) helices. While the stability of helix-helix partnerships mediated by the Gly-xxx-Gly (GG4) motif is known to be generally modulated by distal interfacial residues, it has not been established whether the position of this motif, with respect to the ends of a given TM segment, affects dimer affinity. Here we examine the relationship between motif position and affinity in the homodimers of 2 single-spanning membrane protein TM sequences: glycophorin A (GpA) and bacteriophage M13 coat protein (MCP). Using the TOXCAT assay for dimer affinity on a series of GpA and MCP TM segments that have been modified with either 4 Leu residues at each end or with 8 Leu residues at the N-terminal end, we show that in each protein, centrally located GG4 motifs are capable of stronger helix-helix interactions than those proximal to TM helix ends, even when surrounding interfacial residues are maintained. The relative importance of GG4 motifs in stabilizing helix-helix interactions therefore must be considered not only in its specific residue context but also in terms of the location of the interactive surface relative to the N and C termini of alpha-helical TM segments.

  11. Examples of the Motif of the Shrew in European Literature and Film

    OpenAIRE

    Vasvári, Louise O.

    2001-01-01

    In her article "Examples of the Motif of the Shrew in European Literature and Film" Louise O. Vasvári presents the shrew-taming story as a masterplot of both Eastern and Western folklore and literature concerned with establishing the appropriate power dynamic between a married couple. Vasvári firts reviews the comparative groundwork of the story she has documented in her earlier studies of the topic. In addition to tracing the bundle of motifs that make up the shrew story from medieval Arabic...

  12. Evolution of pH buffers and water homeostasis in eukaryotes: homology between humans and Acanthamoeba proteins.

    Science.gov (United States)

    Baig, Abdul M; Zohaib, R; Tariq, S; Ahmad, H R

    2018-02-01

    This study intended to trace the evolution of acid-base buffers and water homeostasis in eukaryotes. Acanthamoeba castellanii  was selected as a model unicellular eukaryote for this purpose. Homologies of proteins involved in pH and water regulatory mechanisms at cellular levels were compared between humans and A. castellanii. Amino acid sequence homology, structural homology, 3D modeling and docking prediction were done to show the extent of similarities between carbonic anhydrase 1 (CA1), aquaporin (AQP), band-3 protein and H + pump. Experimental assays were done with acetazolamide (AZM), brinzolamide and mannitol to observe their effects on the trophozoites of  A. castellanii.  The human CA1, AQP, band-3 protein and H + -transport proteins revealed similar proteins in Acanthamoeba. Docking showed the binding of AZM on amoebal AQP-like proteins.  Acanthamoeba showed transient shape changes and encystation at differential doses of brinzolamide, mannitol and AZM.  Conclusion: Water and pH regulating adapter proteins in Acanthamoeba and humans show significant homology, these mechanisms evolved early in the primitive unicellular eukaryotes and have remained conserved in multicellular eukaryotes.

  13. Counterintuitive effect of fall mixed layer deepening on eukaryotic new production in the Sargasso Sea

    Science.gov (United States)

    Fawcett, S. E.; Lomas, M. W.; Ward, B. B.; Sigman, D. M.

    2012-12-01

    The Sargasso Sea is characterized by a short period of deep vertical mixing in the late winter and early spring, followed by strong thermal stratification during the summer. Stratification persists into the fall, impeding the upward flux of nitrate from depth so that recycled forms of nitrogen (N) such as ammonium are thought to support most primary production. We collected particles from surface waters during March, July, October, and December, used flow cytometry to separate the prokaryotic and eukaryotic phytoplankton, and analyzed their respective 15N/14N. In all months, the 15N/14N of the prokaryotic genera, Prochlorococcus and Synechococcus, was low, indicative of reliance on recycled N throughout the year. In July, the 15N/14N of eukaryotic phytoplankton was variable but consistently higher than that of the prokaryotes, reflecting eukaryotic consumption of subsurface nitrate. Two eukaryotic profiles from October and December were similar to those from July. In three other fall profiles, the eukaryotes had a 15N/14N similar to that of the prokaryotes, suggesting a switch toward greater reliance on recycled N. This change in the dominant N source supporting eukaryotic production appears to be driven by the density structure of the upper water column. The very shallow low-density surface "mixed layer" (≤20 m) that develops in early-to-mid summer does not contribute to stratification at the base of the euphotic zone, and subsurface nitrate can mix up into the lower euphotic zone, facilitating continued production. The deepening of the mixed layer into the fall, typically taken as an indication of weaker overall stratification, actually strengthens the isolation of the euphotic zone as a whole, reducing the upward supply of nitrate to the photosynthetically active layer. The same counterintuitive dynamic explains the latitudinal patterns in a set of three October depth profiles. Two northern stations (32°N and 27°N) were characterized by a thick, low

  14. Bacterial Signaling Nucleotides Inhibit Yeast Cell Growth by Impacting Mitochondrial and Other Specifically Eukaryotic Functions.

    Science.gov (United States)

    Hesketh, Andy; Vergnano, Marta; Wan, Chris; Oliver, Stephen G

    2017-07-25

    We have engineered Saccharomyces cerevisiae to inducibly synthesize the prokaryotic signaling nucleotides cyclic di-GMP (cdiGMP), cdiAMP, and ppGpp in order to characterize the range of effects these nucleotides exert on eukaryotic cell function during bacterial pathogenesis. Synthetic genetic array (SGA) and transcriptome analyses indicated that, while these compounds elicit some common reactions in yeast, there are also complex and distinctive responses to each of the three nucleotides. All three are capable of inhibiting eukaryotic cell growth, with the guanine nucleotides exhibiting stronger effects than cdiAMP. Mutations compromising mitochondrial function and chromatin remodeling show negative epistatic interactions with all three nucleotides. In contrast, certain mutations that cause defects in chromatin modification and ribosomal protein function show positive epistasis, alleviating growth inhibition by at least two of the three nucleotides. Uniquely, cdiGMP is lethal both to cells growing by respiration on acetate and to obligately fermentative petite mutants. cdiGMP is also synthetically lethal with the ribonucleotide reductase (RNR) inhibitor hydroxyurea. Heterologous expression of the human ppGpp hydrolase Mesh1p prevented the accumulation of ppGpp in the engineered yeast and restored cell growth. Extensive in vivo interactions between bacterial signaling molecules and eukaryotic gene function occur, resulting in outcomes ranging from growth inhibition to death. cdiGMP functions through a mechanism that must be compensated by unhindered RNR activity or by functionally competent mitochondria. Mesh1p may be required for abrogating the damaging effects of ppGpp in human cells subjected to bacterial infection. IMPORTANCE During infections, pathogenic bacteria can release nucleotides into the cells of their eukaryotic hosts. These nucleotides are recognized as signals that contribute to the initiation of defensive immune responses that help the infected

  15. Creation of Hybrid Nanorods From Sequences of Natural Trimeric Fibrous Proteins Using the Fibritin Trimerization Motif

    Science.gov (United States)

    Papanikolopoulou, Katerina; van Raaij, Mark J.; Mitraki, Anna

    Stable, artificial fibrous proteins that can be functionalized open new avenues in fields such as bionanomaterials design and fiber engineering. An important source of inspiration for the creation of such proteins are natural fibrous proteins such as collagen, elastin, insect silks, and fibers from phages and viruses. The fibrous parts of this last class of proteins usually adopt trimeric, β-stranded structural folds and are appended to globular, receptor-binding domains. It has been recently shown that the globular domains are essential for correct folding and trimerization and can be successfully substituted by a very small (27-amino acid) trimerization motif from phage T4 fibritin. The hybrid proteins are correctly folded nanorods that can withstand extreme conditions. When the fibrous part derives from the adenovirus fiber shaft, different tissue-targeting specificities can be engineered into the hybrid proteins, which therefore can be used as gene therapy vectors. The integration of such stable nanorods in devices is also a big challenge in the field of biomechanical design. The fibritin foldon domain is a versatile trimerization motif and can be combined with a variety of fibrous motifs, such as coiled-coil, collagenous, and triple β-stranded motifs, provided the appropriate linkers are used. The combination of different motifs within the same fibrous molecule to create stable rods with multiple functions can even be envisioned. We provide a comprehensive overview of the experimental procedures used for designing, creating, and characterizing hybrid fibrous nanorods using the fibritin trimerization motif.

  16. A sialoreceptor binding motif in the Mycoplasma synoviae adhesin VlhA.

    Directory of Open Access Journals (Sweden)

    Meghan May

    Full Text Available Mycoplasma synoviae depends on its adhesin VlhA to mediate cytadherence to sialylated host cell receptors. Allelic variants of VlhA arise through recombination between an assemblage of promoterless vlhA pseudogenes and a single transcription promoter site, creating lineages of M. synoviae that each express a different vlhA allele. The predicted full-length VlhA sequences adjacent to the promoter of nine lineages of M. synoviae varying in avidity of cytadherence were aligned with that of the reference strain MS53 and with a 60-a.a. hemagglutinating VlhA C-terminal fragment from a Tunisian lineage of strain WVU1853(T. Seven different sequence variants of an imperfectly conserved, single-copy, 12-a.a. candidate cytadherence motif were evident amid the flanking variable residues of the 11 total sequences examined. The motif was predicted to adopt a short hairpin structure in a low-complexity region near the C-terminus of VlhA. Biotinylated synthetic oligopeptides representing four selected variants of the 12-a.a. motif, with the whole synthesized 60-a.a. fragment as a positive control, differed (P<0.01 in the extent they bound to chicken erythrocyte membranes. All bound to a greater extent (P<0.01 than scrambled or irrelevant VlhA domain negative control peptides did. Experimentally introduced branched-chain amino acid (BCAA substitutions Val3Ile and Leu7Ile did not significantly alter binding, whereas fold-destabilizing substitutions Thr4Gly and Ala9Gly tended to reduce it (P<0.05. Binding was also reduced to background levels (P<0.01 when the peptides were exposed to desialylated membranes, or were pre-saturated with free sialic acid before exposure to untreated membranes. From this evidence we conclude that the motif P-X-(BCAA-X-F-X-(BCAA-X-A-K-X-G binds sialic acid and likely mediates VlhA-dependent M. synoviae attachment to host cells. This conserved mechanism retains the potential for fine-scale rheostasis in binding avidity, which could be a

  17. Once in a lifetime: strategies for preventing re-replication in prokaryotic and eukaryotic cells

    DEFF Research Database (Denmark)

    Nielsen, Olaf; Løbner-Olesen, Anders

    2008-01-01

    DNA replication is an extremely accurate process and cells have evolved intricate control mechanisms to ensure that each region of their genome is replicated only once during S phase. Here, we compare what is known about the processes that prevent re-replication in prokaryotic and eukaryotic cells...... prokaryotes and eukaryotes are inactivated until the next cell cycle. Furthermore, in both systems the beta-clamp of the replicative polymerase associates with enzymatic activities that contribute to the inactivation of the helicase loaders. Finally, recent studies suggest that the control mechanism...

  18. Footprinting analysis of interactions between the largest eukaryotic RNase P/MRP protein Pop1 and RNase P/MRP RNA components.

    Science.gov (United States)

    Fagerlund, Robert D; Perederina, Anna; Berezin, Igor; Krasilnikov, Andrey S

    2015-09-01

    Ribonuclease (RNase) P and RNase MRP are closely related catalytic ribonucleoproteins involved in the metabolism of a wide range of RNA molecules, including tRNA, rRNA, and some mRNAs. The catalytic RNA component of eukaryotic RNase P retains the core elements of the bacterial RNase P ribozyme; however, the peripheral RNA elements responsible for the stabilization of the global architecture are largely absent in the eukaryotic enzyme. At the same time, the protein makeup of eukaryotic RNase P is considerably more complex than that of the bacterial RNase P. RNase MRP, an essential and ubiquitous eukaryotic enzyme, has a structural organization resembling that of eukaryotic RNase P, and the two enzymes share most of their protein components. Here, we present the results of the analysis of interactions between the largest protein component of yeast RNases P/MRP, Pop1, and the RNA moieties of the enzymes, discuss structural implications of the results, and suggest that Pop1 plays the role of a scaffold for the stabilization of the global architecture of eukaryotic RNase P RNA, substituting for the network of RNA-RNA tertiary interactions that maintain the global RNA structure in bacterial RNase P. © 2015 Fagerlund et al.; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  19. Stanniocalcin 1 binds hemin through a partially conserved heme regulatory motif

    International Nuclear Information System (INIS)

    Westberg, Johan A.; Jiang, Ji; Andersson, Leif C.

    2011-01-01

    Highlights: → Stanniocalcin 1 (STC1) binds heme through novel heme binding motif. → Central iron atom of heme and cysteine-114 of STC1 are essential for binding. → STC1 binds Fe 2+ and Fe 3+ heme. → STC1 peptide prevents oxidative decay of heme. -- Abstract: Hemin (iron protoporphyrin IX) is a necessary component of many proteins, functioning either as a cofactor or an intracellular messenger. Hemoproteins have diverse functions, such as transportation of gases, gas detection, chemical catalysis and electron transfer. Stanniocalcin 1 (STC1) is a protein involved in respiratory responses of the cell but whose mechanism of action is still undetermined. We examined the ability of STC1 to bind hemin in both its reduced and oxidized states and located Cys 114 as the axial ligand of the central iron atom of hemin. The amino acid sequence differs from the established (Cys-Pro) heme regulatory motif (HRM) and therefore presents a novel heme binding motif (Cys-Ser). A STC1 peptide containing the heme binding sequence was able to inhibit both spontaneous and H 2 O 2 induced decay of hemin. Binding of hemin does not affect the mitochondrial localization of STC1.

  20. A study of eukaryotic response mechanisms to atmospheric pressure cold plasma by using Saccharomyces cerevisiae single gene mutants

    International Nuclear Information System (INIS)

    Feng Hongqing; Wang Ruixue; Sun Peng; Wu Haiyan; Liu Qi; Li Fangting; Fang Jing; Zhang Jue; Zhu Weidong

    2010-01-01

    The mechanisms of eukaryotic cell response to cold plasma are studied. A series of single gene mutants of eukaryotic model organism Saccharomyces cerevisiae are used to compare their sensitivity to plasma treatment with the wild type. We examined 12 mutants in the oxidative stress pathway and the cell cycle pathway, in which 8 are found to be hypersensitive to plasma processing. The mutated genes' roles in the two pathways are analyzed to understand the biological response mechanisms of plasma treatment. The results demonstrate that genes from both pathways are needed for the eukaryotic cells to survive the complex plasma treatment.

  1. Identification of the Raptor-binding motif on Arabidopsis S6 kinase and its use as a TOR signaling suppressor

    Energy Technology Data Exchange (ETDEWEB)

    Son, Ora; Kim, Sunghan; Hur, Yoon-Sun; Cheon, Choong-Ill, E-mail: ccheon@sookmyung.ac.kr

    2016-03-25

    TOR (target of rapamycin) kinase signaling plays central role as a regulator of growth and proliferation in all eukaryotic cells and its key signaling components and effectors are also conserved in plants. Unlike the mammalian and yeast counterparts, however, we found through yeast two-hybrid analysis that multiple regions of the Arabidopsis Raptor (regulatory associated protein of TOR) are required for binding to its substrate. We also identified that a 44-amino acid region at the N-terminal end of Arabidopsis ribosomal S6 kinase 1 (AtS6K1) specifically interacted with AtRaptor1, indicating that this region may contain a functional equivalent of the TOS (TOR-Signaling) motif present in the mammalian TOR substrates. Transient over-expression of this 44-amino acid fragment in Arabidopsis protoplasts resulted in significant decrease in rDNA transcription, demonstrating a feasibility of developing a new plant-specific TOR signaling inhibitor based upon perturbation of the Raptor-substrate interaction. - Highlights: • Multiple regions on the Arabidopsis Raptor protein were found to be involved in substrate binding. • N-terminal end of the Arabidopsis ribosomal S6 kinase 1 (AtS6K1) was responsible for interacting with AtRaptor1. • The Raptor-interacting fragment of AtS6K1 could be utilized as an effective inhibitor of plant TOR signaling.

  2. Identification of the Raptor-binding motif on Arabidopsis S6 kinase and its use as a TOR signaling suppressor

    International Nuclear Information System (INIS)

    Son, Ora; Kim, Sunghan; Hur, Yoon-Sun; Cheon, Choong-Ill

    2016-01-01

    TOR (target of rapamycin) kinase signaling plays central role as a regulator of growth and proliferation in all eukaryotic cells and its key signaling components and effectors are also conserved in plants. Unlike the mammalian and yeast counterparts, however, we found through yeast two-hybrid analysis that multiple regions of the Arabidopsis Raptor (regulatory associated protein of TOR) are required for binding to its substrate. We also identified that a 44-amino acid region at the N-terminal end of Arabidopsis ribosomal S6 kinase 1 (AtS6K1) specifically interacted with AtRaptor1, indicating that this region may contain a functional equivalent of the TOS (TOR-Signaling) motif present in the mammalian TOR substrates. Transient over-expression of this 44-amino acid fragment in Arabidopsis protoplasts resulted in significant decrease in rDNA transcription, demonstrating a feasibility of developing a new plant-specific TOR signaling inhibitor based upon perturbation of the Raptor-substrate interaction. - Highlights: • Multiple regions on the Arabidopsis Raptor protein were found to be involved in substrate binding. • N-terminal end of the Arabidopsis ribosomal S6 kinase 1 (AtS6K1) was responsible for interacting with AtRaptor1. • The Raptor-interacting fragment of AtS6K1 could be utilized as an effective inhibitor of plant TOR signaling.

  3. Plant plasma membrane-bound staphylococcal-like DNases as a novel class of eukaryotic nucleases

    Directory of Open Access Journals (Sweden)

    Leśniewicz Krzysztof

    2012-10-01

    Full Text Available Abstract Background The activity of degradative nucleases responsible for genomic DNA digestion has been observed in all kingdoms of life. It is believed that the main function of DNA degradation occurring during plant programmed cell death is redistribution of nucleic acid derived products such as nitrogen, phosphorus and nucleotide bases. Plant degradative nucleases that have been studied so far belong mainly to the S1-type family and were identified in cellular compartments containing nucleic acids or in the organelles where they are stored before final application. However, the explanation of how degraded DNA components are exported from the dying cells for further reutilization remains open. Results Bioinformatic and experimental data presented in this paper indicate that two Arabidopsis staphylococcal-like nucleases, named CAN1 and CAN2, are anchored to the cell membrane via N-terminal myristoylation and palmitoylation modifications. Both proteins possess a unique hybrid structure in their catalytic domain consisting of staphylococcal nuclease-like and tRNA synthetase anticodon binding-like motifs. They are neutral, Ca2+-dependent nucleaces showing a different specificity toward the ssDNA, dsDNA and RNA substrates. A study of microarray experiments and endogenous nuclease activity revealed that expression of CAN1 gene correlates with different forms of programmed cell death, while the CAN2 gene is constitutively expressed. Conclusions In this paper we present evidence showing that two plant staphylococcal-like nucleases belong to a new, as yet unidentified class of eukaryotic nucleases, characterized by unique plasma membrane localization. The identification of this class of nucleases indicates that plant cells possess additional, so far uncharacterized, mechanisms responsible for DNA and RNA degradation. The potential functions of these nucleases in relation to their unique intracellular location are discussed.

  4. Type VI secretion system MIX-effectors carry both antibacterial and anti-eukaryotic activities.

    Science.gov (United States)

    Ray, Ann; Schwartz, Nika; de Souza Santos, Marcela; Zhang, Junmei; Orth, Kim; Salomon, Dor

    2017-11-01

    Most type VI secretion systems (T6SSs) described to date are protein delivery apparatuses that mediate bactericidal activities. Several T6SSs were also reported to mediate virulence activities, although only few anti-eukaryotic effectors have been described. Here, we identify three T6SSs in the marine bacterium Vibrio proteolyticus and show that T6SS1 mediates bactericidal activities under warm marine-like conditions. Using comparative proteomics, we find nine potential T6SS1 effectors, five of which belong to the polymorphic MIX-effector class. Remarkably, in addition to six predicted bactericidal effectors, the T6SS1 secretome includes three putative anti-eukaryotic effectors. One of these is a MIX-effector containing a cytotoxic necrotizing factor 1 domain. We demonstrate that T6SS1 can use this MIX-effector to target phagocytic cells, resulting in morphological changes and actin cytoskeleton rearrangements. In conclusion, the V. proteolyticus T6SS1, a system homologous to one found in pathogenic vibrios, uses a suite of polymorphic effectors that target both bacteria and eukaryotic neighbors. © 2017 The Authors. Published under the terms of the CC BY 4.0 license.

  5. Indonesian Traditional Toys and the Development of Batik Motifs

    Directory of Open Access Journals (Sweden)

    Bagus Indrayana

    2016-06-01

    Full Text Available There is a wide array of traditional toys in Indonesia. In the past, traditional toys played an important role for skill and creativity development of children. Today, the position of traditional toys in the society is displaced by toys from large-scale manufacturers. Given the critical role of traditional toys for children’s motoric and social development, there is a need to develop media that can be used to promote these traditional products and strengthen their position in the public. We propose to use Batik as a way to effectively disseminate and promote traditional toys to the general public. Apart from this, using traditional toys to create new Batik motifs can have an economic value for the producers of Batik, promote Indonesian products and enrich the Indonesian Batik. This study aims to explore the variety of traditional toys, mainly from Klaten and Magelang, in the Central Java province of Indonesia, and use them as the basis for the development of Batik motif creation. This study used Trilogi Keseimbangan (or Harmony Trilogy aesthetic theory analytical approach that explains the creation of craft consists of the following phases: exploration, design, and materialization. The creation method in this study adopts Tiga Tahap Enam Langkah (Three Phases, Six Steps method offered in the theory. The finding in the field found that the traditional toys material used in Klaten and Magelang, mostly made from waste wood, plywood, and zinc. The manufacturing process is done manually by two or three craftsmen using a simple technology. The traditional toys are designed by the artisans mostly, although there may be designs from the clients. In addition, we also found that the traditional toys have never been used as a Batik motif. The traditional toys Batik motif presented in this work is researcher’s design. For the purposes of this study, we first research the variety of traditional toys available in the market today in Indonesia. We look

  6. Sequence-based classification using discriminatory motif feature selection.

    Directory of Open Access Journals (Sweden)

    Hao Xiong

    Full Text Available Most existing methods for sequence-based classification use exhaustive feature generation, employing, for example, all k-mer patterns. The motivation behind such (enumerative approaches is to minimize the potential for overlooking important features. However, there are shortcomings to this strategy. First, practical constraints limit the scope of exhaustive feature generation to patterns of length ≤ k, such that potentially important, longer (> k predictors are not considered. Second, features so generated exhibit strong dependencies, which can complicate understanding of derived classification rules. Third, and most importantly, numerous irrelevant features are created. These concerns can compromise prediction and interpretation. While remedies have been proposed, they tend to be problem-specific and not broadly applicable. Here, we develop a generally applicable methodology, and an attendant software pipeline, that is predicated on discriminatory motif finding. In addition to the traditional training and validation partitions, our framework entails a third level of data partitioning, a discovery partition. A discriminatory motif finder is used on sequences and associated class labels in the discovery partition to yield a (small set of features. These features are then used as inputs to a classifier in the training partition. Finally, performance assessment occurs on the validation partition. Important attributes of our approach are its modularity (any discriminatory motif finder and any classifier can be deployed and its universality (all data, including sequences that are unaligned and/or of unequal length, can be accommodated. We illustrate our approach on two nucleosome occupancy datasets and a protein solubility dataset, previously analyzed using enumerative feature generation. Our method achieves excellent performance results, with and without optimization of classifier tuning parameters. A Python pipeline implementing the approach is

  7. Cave acoustics in prehistory: Exploring the association of Palaeolithic visual motifs and acoustic response.

    Science.gov (United States)

    Fazenda, Bruno; Scarre, Chris; Till, Rupert; Pasalodos, Raquel Jiménez; Guerra, Manuel Rojo; Tejedor, Cristina; Peredo, Roberto Ontañón; Watson, Aaron; Wyatt, Simon; Benito, Carlos García; Drinkall, Helen; Foulds, Frederick

    2017-09-01

    During the 1980 s, acoustic studies of Upper Palaeolithic imagery in French caves-using the technology then available-suggested a relationship between acoustic response and the location of visual motifs. This paper presents an investigation, using modern acoustic measurement techniques, into such relationships within the caves of La Garma, Las Chimeneas, La Pasiega, El Castillo, and Tito Bustillo in Northern Spain. It addresses methodological issues concerning acoustic measurement at enclosed archaeological sites and outlines a general framework for extraction of acoustic features that may be used to support archaeological hypotheses. The analysis explores possible associations between the position of visual motifs (which may be up to 40 000 yrs old) and localized acoustic responses. Results suggest that motifs, in general, and lines and dots, in particular, are statistically more likely to be found in places where reverberation is moderate and where the low frequency acoustic response has evidence of resonant behavior. The work presented suggests that an association of the location of Palaeolithic motifs with acoustic features is a statistically weak but tenable hypothesis, and that an appreciation of sound could have influenced behavior among Palaeolithic societies of this region.

  8. T cell receptor zeta allows stable expression of receptors containing the CD3gamma leucine-based receptor-sorting motif

    DEFF Research Database (Denmark)

    Dietrich, J; Geisler, C

    1998-01-01

    The leucine-based motif in the T cell receptor (TCR) subunit CD3gamma constitutes a strong internalization signal. In fully assembled TCR this motif is inactive unless phosphorylated. In contrast, the motif is constitutively active in CD4/CD3gamma and Tac/CD3gamma chimeras independently of phosph......The leucine-based motif in the T cell receptor (TCR) subunit CD3gamma constitutes a strong internalization signal. In fully assembled TCR this motif is inactive unless phosphorylated. In contrast, the motif is constitutively active in CD4/CD3gamma and Tac/CD3gamma chimeras independently...... of phosphorylation and leads to rapid internalization and sorting of these chimeras to lysosomal degradation. Because the TCRzeta chain rescues incomplete TCR complexes from lysosomal degradation and allows stable surface expression of fully assembled TCR, we addressed the question whether TCRzeta has the potential...... to mask the CD3gamma leucine-based motif. By studying CD4/CD3gamma and CD16/CD3gamma chimeras, we found that CD16/CD3gamma chimeras associated with TCRzeta. The CD16/CD3gamma-TCRzeta complexes were stably expressed at the cell surface and had a low spontaneous internalization rate, indicating...

  9. In silico ionomics segregates parasitic from free-living eukaryotes.

    Science.gov (United States)

    Greganova, Eva; Steinmann, Michael; Mäser, Pascal; Fankhauser, Niklaus

    2013-01-01

    Ion transporters are fundamental to life. Due to their ancient origin and conservation in sequence, ion transporters are also particularly well suited for comparative genomics of distantly related species. Here, we perform genome-wide ion transporter profiling as a basis for comparative genomics of eukaryotes. From a given predicted proteome, we identify all bona fide ion channels, ion porters, and ion pumps. Concentrating on unicellular eukaryotes (n = 37), we demonstrate that clustering of species according to their repertoire of ion transporters segregates obligate endoparasites (n = 23) on the one hand, from free-living species and facultative parasites (n = 14) on the other hand. This surprising finding indicates strong convergent evolution of the parasites regarding the acquisition and homeostasis of inorganic ions. Random forest classification identifies transporters of ammonia, plus transporters of iron and other transition metals, as the most informative for distinguishing the obligate parasites. Thus, in silico ionomics further underscores the importance of iron in infection biology and suggests access to host sources of nitrogen and transition metals to be selective forces in the evolution of parasitism. This finding is in agreement with the phenomenon of iron withholding as a primordial antimicrobial strategy of infected mammals.

  10. topIb, a phylogenetic hallmark gene of Thaumarchaeota encodes a functional eukaryote-like topoisomerase IB.

    Science.gov (United States)

    Dahmane, Narimane; Gadelle, Danièle; Delmas, Stéphane; Criscuolo, Alexis; Eberhard, Stephan; Desnoues, Nicole; Collin, Sylvie; Zhang, Hongliang; Pommier, Yves; Forterre, Patrick; Sezonov, Guennadi

    2016-04-07

    Type IB DNA topoisomerases can eliminate torsional stresses produced during replication and transcription. These enzymes are found in all eukaryotes and a short version is present in some bacteria and viruses. Among prokaryotes, the long eukaryotic version is only observed in archaea of the phylum Thaumarchaeota. However, the activities and the roles of these topoisomerases have remained an open question. Here, we demonstrate that all available thaumarchaeal genomes contain a topoisomerase IB gene that defines a monophyletic group closely related to the eukaryotic enzymes. We show that the topIB gene is expressed in the model thaumarchaeon Nitrososphaera viennensis and we purified the recombinant enzyme from the uncultivated thaumarchaeon Candidatus Caldiarchaeum subterraneum. This enzyme is active in vitro at high temperature, making it the first thermophilic topoisomerase IB characterized so far. We have compared this archaeal type IB enzyme to its human mitochondrial and nuclear counterparts. The archaeal enzyme relaxes both negatively and positively supercoiled DNA like the eukaryotic enzymes. However, its pattern of DNA cleavage specificity is different and it is resistant to camptothecins (CPTs) and non-CPT Top1 inhibitors, LMP744 and lamellarin D. This newly described thermostable topoisomerases IB should be a promising new model for evolutionary, mechanistic and structural studies. © The Author(s) 2016. Published by Oxford University Press on behalf of Nucleic Acids Research.

  11. THE MOTIF OF THE PRODIGAL SON IN IVAN TURGENEV'S NOVELS

    Directory of Open Access Journals (Sweden)

    Valentina Ivanovna Gabdullina

    2013-11-01

    Full Text Available The author questions the perception of Ivan Turgenev as a “non- Christian writer” and studies the problem of the prodigal son motif functioning in a series of his novels. In his novels, Turgenev pictured different phases of the archetypal story, originating from the Gospel parable of the prodigal son. In the novel Rudin he depicted the phase of spiritual wanderings of the hero who had lost touch with his native land — Russia. In his next novels (Home of the Gentry, Fathers and Sons and Smoke, after leading his hero in circles and sending him back to his paternal home, Turgenev reconstructs the model of human behavior, represented in the parable, thereby recognizing the immutability of the idea formalized in the Gospel. The motif of the return to Russian land gets its completion in Turgenev's last novel Virgin Soil, in which the author paradoxically connects the Westernist idea with the Gospel imperative. Solomin, the son of a deacon, sent by his wise father out to Europe “to get education”, studies in England, masters the European knowledge and returns back “to his native land” to establish his own business in inland Russia. Thus, a series of Turgenev's novels, in which he portrayed different phases of social life, are interlinked with the motif of the prodigal son, who is represented by novels' main characters.

  12. Identification and characterization of two linear epitope motifs in hepatitis E virus ORF2 protein.

    Directory of Open Access Journals (Sweden)

    Heng Wang

    Full Text Available Hepatitis E virus (HEV is responsible for hepatitis E, which represents a global public health problem. HEV genotypes 3 and 4 are reported to be zoonotic, and animals are monitored for HEV infection in the interests of public hygiene and food safety. The development of novel diagnostic methods and vaccines for HEV in humans is thus important topics of research. Opening reading frame (ORF 2 of HEV includes both linear and conformational epitopes and is regarded as the primary candidate for vaccines and diagnostic tests. We investigated the precise location of the HEV epitopes in the ORF2 protein. We prepared four monoclonal antibodies (mAbs against genotype 4 ORF2 protein and identified two linear epitopes, G438IVIPHD444 and Y457DNQH461, corresponding to two of these mAbs using phage display biopanning technology. Both these epitopes were speculated to be universal to genotypes 1, 2, 3, 4, and avian HEVs. We also used two 12-mer fragments of ORF2 protein including these two epitopes to develop a peptide-based enzyme-linked immunosorbent assay (ELISA to detect HEV in serum. This assay demonstrated good specificity but low sensitivity compared with the commercial method, indicating that these two epitopes could serve as potential candidate targets for diagnosis. Overall, these results further our understanding of the epitope distribution of HEV ORF2, and provide important information for the development of peptide-based immunodiagnostic tests to detect HEV in serum.

  13. A Woman Voice in an Epic: Tracing Gendered Motifs in Anne Vabarna's Peko

    Directory of Open Access Journals (Sweden)

    Andreas Kalkun

    2008-12-01

    Full Text Available In the article the gendered motifs found in Anne Vabarna’s Seto epic Peko are analysed. Besides the narrative telling of the life of the male hero, the motives regarding eating, refusing to eat or offering food, and the aspect of the female body or its control deserve to be noticed. These scenes do not communicate the main plot, they are often related to minor characters of the epic and slow down the narrative, but at the same time they clearly carry artistic purpose and meaning. I consider these motifs, present in the liminal parts of the epic, to be the dominant symbols of the epic where the author’s feminine world is being exposed. Observing these motifs of Peko in the context of Seto religious worldview, the life of Anne Vabarna and the social position of Seto women, the symbols become eloquent and informative.

  14. Cancer-related marketing centrality motifs acting as pivot units in the human signaling network and mediating cross-talk between biological pathways.

    Science.gov (United States)

    Li, Wan; Chen, Lina; Li, Xia; Jia, Xu; Feng, Chenchen; Zhang, Liangcai; He, Weiming; Lv, Junjie; He, Yuehan; Li, Weiguo; Qu, Xiaoli; Zhou, Yanyan; Shi, Yuchen

    2013-12-01

    Network motifs in central positions are considered to not only have more in-coming and out-going connections but are also localized in an area where more paths reach the networks. These central motifs have been extensively investigated to determine their consistent functions or associations with specific function categories. However, their functional potentials in the maintenance of cross-talk between different functional communities are unclear. In this paper, we constructed an integrated human signaling network from the Pathway Interaction Database. We identified 39 essential cancer-related motifs in central roles, which we called cancer-related marketing centrality motifs, using combined centrality indices on the system level. Our results demonstrated that these cancer-related marketing centrality motifs were pivotal units in the signaling network, and could mediate cross-talk between 61 biological pathways (25 could be mediated by one motif on average), most of which were cancer-related pathways. Further analysis showed that molecules of most marketing centrality motifs were in the same or adjacent subcellular localizations, such as the motif containing PI3K, PDK1 and AKT1 in the plasma membrane, to mediate signal transduction between 32 cancer-related pathways. Finally, we analyzed the pivotal roles of cancer genes in these marketing centrality motifs in the pathogenesis of cancers, and found that non-cancer genes were potential cancer-related genes.

  15. Genetic interaction motif finding by expectation maximization – a novel statistical model for inferring gene modules from synthetic lethality

    Directory of Open Access Journals (Sweden)

    Ye Ping

    2005-12-01

    Full Text Available Abstract Background Synthetic lethality experiments identify pairs of genes with complementary function. More direct functional associations (for example greater probability of membership in a single protein complex may be inferred between genes that share synthetic lethal interaction partners than genes that are directly synthetic lethal. Probabilistic algorithms that identify gene modules based on motif discovery are highly appropriate for the analysis of synthetic lethal genetic interaction data and have great potential in integrative analysis of heterogeneous datasets. Results We have developed Genetic Interaction Motif Finding (GIMF, an algorithm for unsupervised motif discovery from synthetic lethal interaction data. Interaction motifs are characterized by position weight matrices and optimized through expectation maximization. Given a seed gene, GIMF performs a nonlinear transform on the input genetic interaction data and automatically assigns genes to the motif or non-motif category. We demonstrate the capacity to extract known and novel pathways for Saccharomyces cerevisiae (budding yeast. Annotations suggested for several uncharacterized genes are supported by recent experimental evidence. GIMF is efficient in computation, requires no training and automatically down-weights promiscuous genes with high degrees. Conclusion GIMF effectively identifies pathways from synthetic lethality data with several unique features. It is mostly suitable for building gene modules around seed genes. Optimal choice of one single model parameter allows construction of gene networks with different levels of confidence. The impact of hub genes the generic probabilistic framework of GIMF may be used to group other types of biological entities such as proteins based on stochastic motifs. Analysis of the strongest motifs discovered by the algorithm indicates that synthetic lethal interactions are depleted between genes within a motif, suggesting that synthetic

  16. A Conserved Metal Binding Motif in the Bacillus subtilis Competence Protein ComFA Enhances Transformation.

    Science.gov (United States)

    Chilton, Scott S; Falbel, Tanya G; Hromada, Susan; Burton, Briana M

    2017-08-01

    Genetic competence is a process in which cells are able to take up DNA from their environment, resulting in horizontal gene transfer, a major mechanism for generating diversity in bacteria. Many bacteria carry homologs of the central DNA uptake machinery that has been well characterized in Bacillus subtilis It has been postulated that the B. subtilis competence helicase ComFA belongs to the DEAD box family of helicases/translocases. Here, we made a series of mutants to analyze conserved amino acid motifs in several regions of B. subtilis ComFA. First, we confirmed that ComFA activity requires amino acid residues conserved among the DEAD box helicases, and second, we show that a zinc finger-like motif consisting of four cysteines is required for efficient transformation. Each cysteine in the motif is important, and mutation of at least two of the cysteines dramatically reduces transformation efficiency. Further, combining multiple cysteine mutations with the helicase mutations shows an additive phenotype. Our results suggest that the helicase and metal binding functions are two distinct activities important for ComFA function during transformation. IMPORTANCE ComFA is a highly conserved protein that has a role in DNA uptake during natural competence, a mechanism for horizontal gene transfer observed in many bacteria. Investigation of the details of the DNA uptake mechanism is important for understanding the ways in which bacteria gain new traits from their environment, such as drug resistance. To dissect the role of ComFA in the DNA uptake machinery, we introduced point mutations into several motifs in the protein sequence. We demonstrate that several amino acid motifs conserved among ComFA proteins are important for efficient transformation. This report is the first to demonstrate the functional requirement of an amino-terminal cysteine motif in ComFA. Copyright © 2017 American Society for Microbiology.

  17. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Directory of Open Access Journals (Sweden)

    Graziele Pereira Oliveira

    2017-01-01

    Full Text Available For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV, raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’ that could be evolved gradually by nucleotides’ gain and loss and point mutations.

  18. Promoter Motifs in NCLDVs: An Evolutionary Perspective

    Science.gov (United States)

    Oliveira, Graziele Pereira; Andrade, Ana Cláudia dos Santos Pereira; Rodrigues, Rodrigo Araújo Lima; Arantes, Thalita Souza; Boratto, Paulo Victor Miranda; Silva, Ludmila Karen dos Santos; Dornas, Fábio Pio; Trindade, Giliane de Souza; Drumond, Betânia Paiva; La Scola, Bernard; Kroon, Erna Geessien; Abrahão, Jônatas Santos

    2017-01-01

    For many years, gene expression in the three cellular domains has been studied in an attempt to discover sequences associated with the regulation of the transcription process. Some specific transcriptional features were described in viruses, although few studies have been devoted to understanding the evolutionary aspects related to the spread of promoter motifs through related viral families. The discovery of giant viruses and the proposition of the new viral order Megavirales that comprise a monophyletic group, named nucleo-cytoplasmic large DNA viruses (NCLDV), raised new questions in the field. Some putative promoter sequences have already been described for some NCLDV members, bringing new insights into the evolutionary history of these complex microorganisms. In this review, we summarize the main aspects of the transcription regulation process in the three domains of life, followed by a systematic description of what is currently known about promoter regions in several NCLDVs. We also discuss how the analysis of the promoter sequences could bring new ideas about the giant viruses’ evolution. Finally, considering a possible common ancestor for the NCLDV group, we discussed possible promoters’ evolutionary scenarios and propose the term “MEGA-box” to designate an ancestor promoter motif (‘TATATAAAATTGA’) that could be evolved gradually by nucleotides’ gain and loss and point mutations. PMID:28117683

  19. Lateral gene transfer between prokaryotes and multicellular eukaryotes: ongoing and significant?

    NARCIS (Netherlands)

    Ros, V.I.D.; Hurst, G.D.D.

    2009-01-01

    The expansion of genome sequencing projects has produced accumulating evidence for lateral transfer of genes between prokaryotic and eukaryotic genomes. However, it remains controversial whether these genes are of functional importance in their recipient host. Nikoh and Nakabachi, in a recent paper

  20. The special neuraminidase stalk-motif responsible for increased virulence and pathogenesis of H5N1 influenza A virus.

    Directory of Open Access Journals (Sweden)

    Hongbo Zhou

    Full Text Available The variation of highly pathogenic avian influenza H5N1 virus results in gradually increased virulence in poultry, and human cases continue to accumulate. The neuraminidase (NA stalk region of influenza virus varies considerably and may associate with its virulence. The NA stalk region of all N1 subtype influenza A viruses can be divided into six different stalk-motifs, H5N1/2004-like (NA-wt, WSN-like, H5N1/97-like, PR/8-like, H7N1/99-like and H5N1/96-like. The NA-wt is a special NA stalk-motif which was first observed in H5N1 influenza virus in 2000, with a 20-amino acid deletion in the 49(th to 68(th positions of the stalk region. Here we show that there is a gradual increase of the special NA stalk-motif in H5N1 isolates from 2000 to 2007, and notably, the special stalk-motif is observed in all 173 H5N1 human isolates from 2004 to 2007. The recombinant H5N1 virus with the special stalk-motif possesses the highest virulence and pathogenicity in chicken and mice, while the recombinant viruses with the other stalk-motifs display attenuated phenotype. This indicates that the special stalk-motif has contributed to the high virulence and pathogenicity of H5N1 isolates since 2000. The gradually increasing emergence of the special NA stalk-motif in H5N1 isolates, especially in human isolates, deserves attention by all.