WorldWideScience

Sample records for conserved intron motif

  1. Fox-2 Splicing Factor Binds to a Conserved Intron Motif to PromoteInclusion of Protein 4.1R Alternative Exon 16

    Energy Technology Data Exchange (ETDEWEB)

    Ponthier, Julie L.; Schluepen, Christina; Chen, Weiguo; Lersch,Robert A.; Gee, Sherry L.; Hou, Victor C.; Lo, Annie J.; Short, Sarah A.; Chasis, Joel A.; Winkelmann, John C.; Conboy, John G.

    2006-03-01

    Activation of protein 4.1R exon 16 (E16) inclusion during erythropoiesis represents a physiologically important splicing switch that increases 4.1R affinity for spectrin and actin. Previous studies showed that negative regulation of E16 splicing is mediated by the binding of hnRNP A/B proteins to silencer elements in the exon and that downregulation of hnRNP A/B proteins in erythroblasts leads to activation of E16 inclusion. This paper demonstrates that positive regulation of E16 splicing can be mediated by Fox-2 or Fox-1, two closely related splicing factors that possess identical RNA recognition motifs. SELEX experiments with human Fox-1 revealed highly selective binding to the hexamer UGCAUG. Both Fox-1 and Fox-2 were able to bind the conserved UGCAUG elements in the proximal intron downstream of E16, and both could activate E16 splicing in HeLa cell co-transfection assays in a UGCAUG-dependent manner. Conversely, knockdown of Fox-2 expression, achieved with two different siRNA sequences resulted in decreased E16 splicing. Moreover, immunoblot experiments demonstrate mouse erythroblasts express Fox-2, but not Fox-1. These findings suggest that Fox-2 is a physiological activator of E16 splicing in differentiating erythroid cells in vivo. Recent experiments show that UGCAUG is present in the proximal intron sequence of many tissue-specific alternative exons, and we propose that the Fox family of splicing enhancers plays an important role in alternative splicing switches during differentiation in metazoan organisms.

  2. Patterns of intron gain and conservation in eukaryotic genes

    Directory of Open Access Journals (Sweden)

    Wolf Yuri I

    2007-10-01

    Full Text Available Abstract Background: The presence of introns in protein-coding genes is a universal feature of eukaryotic genome organization, and the genes of multicellular eukaryotes, typically, contain multiple introns, a substantial fraction of which share position in distant taxa, such as plants and animals. Depending on the methods and data sets used, researchers have reached opposite conclusions on the causes of the high fraction of shared introns in orthologous genes from distant eukaryotes. Some studies conclude that shared intron positions reflect, almost entirely, a remarkable evolutionary conservation, whereas others attribute it to parallel gain of introns. To resolve these contradictions, it is crucial to analyze the evolution of introns by using a model that minimally relies on arbitrary assumptions. Results: We developed a probabilistic model of evolution that allows for variability of intron gain and loss rates over branches of the phylogenetic tree, individual genes, and individual sites. Applying this model to an extended set of conserved eukaryotic genes, we find that parallel gain, on average, accounts for only ~8% of the shared intron positions. However, the distribution of parallel gains over the phylogenetic tree of eukaryotes is highly non-uniform. There are, practically, no parallel gains in closely related lineages, whereas for distant lineages, such as animals and plants, parallel gains appear to contribute up to 20% of the shared intron positions. In accord with these findings, we estimated that ancestral introns have a high probability to be retained in extant genomes, and conversely, that a substantial fraction of extant introns have retained their positions since the early stages of eukaryotic evolution. In addition, the density of sites that are available for intron insertion is estimated to be, approximately, one in seven basepairs. Conclusion: We obtained robust estimates of the contribution of parallel gain to the observed

  3. Remarkable sequence conservation of the last intron in the PKD1 gene.

    Science.gov (United States)

    Rodova, Marianna; Islam, M Rafiq; Peterson, Kenneth R; Calvet, James P

    2003-10-01

    The last intron of the PKD1 gene (intron 45) was found to have exceptionally high sequence conservation across four mammalian species: human, mouse, rat, and dog. This conservation did not extend to the comparable intron in pufferfish. Pairwise comparisons for intron 45 showed 91% identity (human vs. dog) to 100% identity (mouse vs. rat) for an average for all four species of 94% identity. In contrast, introns 43 and 44 of the PKD1 gene had average pairwise identities of 57% and 54%, and exons 43, 44, and 45 and the coding region of exon 46 had average pairwise identities of 80%, 84%, 82%, and 80%. Intron 45 is 90 to 95 bp in length, with the major region of sequence divergence being in a central 4-bp to 9-bp variable region. RNA secondary structure analysis of intron 45 predicts a branching stem-loop structure in which the central variable region lies in one loop and the putative branch point sequence lies in another loop, suggesting that the intron adopts a specific stem-loop structure that may be important for its removal. Although intron 45 appears to conform to the class of small, G-triplet-containing introns that are spliced by a mechanism utilizing intron definition, its high sequence conservation may be a reflection of constraints imposed by a unique mechanism that coordinates splicing of this last PKD1 intron with polyadenylation.

  4. Systematic discovery of regulatory motifs in Fusarium graminearum by comparing four Fusarium genomes

    Directory of Open Access Journals (Sweden)

    Kistler Corby

    2010-03-01

    Full Text Available Abstract Background Fusarium graminearum (Fg, a major fungal pathogen of cultivated cereals, is responsible for billions of dollars in agriculture losses. There is a growing interest in understanding the transcriptional regulation of this organism, especially the regulation of genes underlying its pathogenicity. The generation of whole genome sequence assemblies for Fg and three closely related Fusarium species provides a unique opportunity for such a study. Results Applying comparative genomics approaches, we developed a computational pipeline to systematically discover evolutionarily conserved regulatory motifs in the promoter, downstream and the intronic regions of Fg genes, based on the multiple alignments of sequenced Fusarium genomes. Using this method, we discovered 73 candidate regulatory motifs in the promoter regions. Nearly 30% of these motifs are highly enriched in promoter regions of Fg genes that are associated with a specific functional category. Through comparison to Saccharomyces cerevisiae (Sc and Schizosaccharomyces pombe (Sp, we observed conservation of transcription factors (TFs, their binding sites and the target genes regulated by these TFs related to pathways known to respond to stress conditions or phosphate metabolism. In addition, this study revealed 69 and 39 conserved motifs in the downstream regions and the intronic regions, respectively, of Fg genes. The top intronic motif is the splice donor site. For the downstream regions, we noticed an intriguing absence of the mammalian and Sc poly-adenylation signals among the list of conserved motifs. Conclusion This study provides the first comprehensive list of candidate regulatory motifs in Fg, and underscores the power of comparative genomics in revealing functional elements among related genomes. The conservation of regulatory pathways among the Fusarium genomes and the two yeast species reveals their functional significance, and provides new insights in their

  5. Remarkable interkingdom conservation of intron positions and massive, lineage-specific intron loss and gain in eukaryotic evolution.

    Science.gov (United States)

    Rogozin, Igor B; Wolf, Yuri I; Sorokin, Alexander V; Mirkin, Boris G; Koonin, Eugene V

    2003-09-02

    Sequencing of eukaryotic genomes allows one to address major evolutionary problems, such as the evolution of gene structure. We compared the intron positions in 684 orthologous gene sets from 8 complete genomes of animals, plants, fungi, and protists and constructed parsimonious scenarios of evolution of the exon-intron structure for the respective genes. Approximately one-third of the introns in the malaria parasite Plasmodium falciparum are shared with at least one crown group eukaryote; this number indicates that these introns have been conserved through >1.5 billion years of evolution that separate Plasmodium from the crown group. Paradoxically, humans share many more introns with the plant Arabidopsis thaliana than with the fly or nematode. The inferred evolutionary scenario holds that the common ancestor of Plasmodium and the crown group and, especially, the common ancestor of animals, plants, and fungi had numerous introns. Most of these ancestral introns, which are retained in the genomes of vertebrates and plants, have been lost in fungi, nematodes, arthropods, and probably Plasmodium. In addition, numerous introns have been inserted into vertebrate and plant genes, whereas, in other lineages, intron gain was much less prominent.

  6. Evidence for intron length conservation in a set of mammalian genes associated with embryonic development

    LENUS (Irish Health Repository)

    2011-10-05

    Abstract Background We carried out an analysis of intron length conservation across a diverse group of nineteen mammalian species. Motivated by recent research suggesting a role for time delays associated with intron transcription in gene expression oscillations required for early embryonic patterning, we searched for examples of genes that showed the most extreme conservation of total intron content in mammals. Results Gene sets annotated as being involved in pattern specification in the early embryo or containing the homeobox DNA-binding domain, were significantly enriched among genes with highly conserved intron content. We used ancestral sequences reconstructed with probabilistic models that account for insertion and deletion mutations to distinguish insertion and deletion events on lineages leading to human and mouse from their last common ancestor. Using a randomization procedure, we show that genes containing the homeobox domain show less change in intron content than expected, given the number of insertion and deletion events within their introns. Conclusions Our results suggest selection for gene expression precision or the existence of additional development-associated genes for which transcriptional delay is functionally significant.

  7. An ancient spliceosomal intron in the ribosomal protein L7a gene (Rpl7a of Giardia lamblia

    Directory of Open Access Journals (Sweden)

    Gray Michael W

    2005-08-01

    Full Text Available Abstract Background Only one spliceosomal-type intron has previously been identified in the unicellular eukaryotic parasite, Giardia lamblia (a diplomonad. This intron is only 35 nucleotides in length and is unusual in possessing a non-canonical 5' intron boundary sequence, CT, instead of GT. Results We have identified a second spliceosomal-type intron in G. lamblia, in the ribosomal protein L7a gene (Rpl7a, that possesses a canonical GT 5' intron boundary sequence. A comparison of the two known Giardia intron sequences revealed extensive nucleotide identity at both the 5' and 3' intron boundaries, similar to the conserved sequence motifs recently identified at the boundaries of spliceosomal-type introns in Trichomonas vaginalis (a parabasalid. Based on these observations, we searched the partial G. lamblia genome sequence for these conserved features and identified a third spliceosomal intron, in an unassigned open reading frame. Our comprehensive analysis of the Rpl7a intron in other eukaryotic taxa demonstrates that it is evolutionarily conserved and is an ancient eukaryotic intron. Conclusion An analysis of the phylogenetic distribution and properties of the Rpl7a intron suggests its utility as a phylogenetic marker to evaluate particular eukaryotic groupings. Additionally, analysis of the G. lamblia introns has provided further insight into some of the conserved and unique features possessed by the recently identified spliceosomal introns in related organisms such as T. vaginalis and Carpediemonas membranifera.

  8. Genome-wide conserved consensus transcription factor binding motifs are hyper-methylated

    Directory of Open Access Journals (Sweden)

    Down Thomas A

    2010-09-01

    Full Text Available Abstract Background DNA methylation can regulate gene expression by modulating the interaction between DNA and proteins or protein complexes. Conserved consensus motifs exist across the human genome ("predicted transcription factor binding sites": "predicted TFBS" but the large majority of these are proven by chromatin immunoprecipitation and high throughput sequencing (ChIP-seq not to be biological transcription factor binding sites ("empirical TFBS". We hypothesize that DNA methylation at conserved consensus motifs prevents promiscuous or disorderly transcription factor binding. Results Using genome-wide methylation maps of the human heart and sperm, we found that all conserved consensus motifs as well as the subset of those that reside outside CpG islands have an aggregate profile of hyper-methylation. In contrast, empirical TFBS with conserved consensus motifs have a profile of hypo-methylation. 40% of empirical TFBS with conserved consensus motifs resided in CpG islands whereas only 7% of all conserved consensus motifs were in CpG islands. Finally we further identified a minority subset of TF whose profiles are either hypo-methylated or neutral at their respective conserved consensus motifs implicating that these TF may be responsible for establishing or maintaining an un-methylated DNA state, or whose binding is not regulated by DNA methylation. Conclusions Our analysis supports the hypothesis that at least for a subset of TF, empirical binding to conserved consensus motifs genome-wide may be controlled by DNA methylation.

  9. Genome Analysis of Conserved Dehydrin Motifs in Vascular Plants

    Directory of Open Access Journals (Sweden)

    Ahmad A. Malik

    2017-05-01

    Full Text Available Dehydrins, a large family of abiotic stress proteins, are defined by the presence of a mostly conserved motif known as the K-segment, and may also contain two other conserved motifs known as the Y-segment and S-segment. Using the dehydrin literature, we developed a sequence motif definition of the K-segment, which we used to create a large dataset of dehydrin sequences by searching the Pfam00257 dehydrin dataset and the Phytozome 10 sequences of vascular plants. A comprehensive analysis of these sequences reveals that lysine residues are highly conserved in the K-segment, while the amino acid type is often conserved at other positions. Despite the Y-segment name, the central tyrosine is somewhat conserved, but can be substituted with two other small aromatic amino acids (phenylalanine or histidine. The S-segment contains a series of serine residues, but in some proteins is also preceded by a conserved LHR sequence. In many dehydrins containing all three of these motifs the S-segment is linked to the K-segment by a GXGGRRKK motif (where X can be any amino acid, suggesting a functional linkage between these two motifs. An analysis of the sequences shows that the dehydrin architecture and several biochemical properties (isoelectric point, molecular mass, and hydrophobicity score are dependent on each other, and that some dehydrin architectures are overexpressed during certain abiotic stress, suggesting that they may be optimized for a specific abiotic stress while others are involved in all forms of dehydration stress (drought, cold, and salinity.

  10. Architecture and Distribution of Introns in Core Genes of Four Fusarium Species

    Directory of Open Access Journals (Sweden)

    Mmatshepho M. Phasha

    2017-11-01

    Full Text Available Removal of introns from transcribed RNA represents a crucial step during the production of mRNA in eukaryotes. Available whole-genome sequences and expressed sequence tags (ESTs have increased our knowledge of this process and revealed various commonalities among eukaryotes. However, certain aspects of intron structure and diversity are taxon-specific, which can complicate the accuracy of in silico gene prediction methods. Using core genes, we evaluated the distribution and architecture of Fusarium circinatum spliceosomal introns, and linked these characteristics to the accuracy of the predicted gene models of the genome of this fungus. We also evaluated intron distribution and architecture in F. verticillioides, F. oxysporum, and F. graminearum, and made comparisons with F. circinatum. Results indicated that F. circinatum and the three other Fusarium species have canonical 5′ and 3′ splice sites, but with subtle differences that are apparently not shared with those of other fungal genera. The polypyrimidine tract of Fusarium introns was also found to be highly divergent among species and genes. Furthermore, the conserved adenosine nucleoside required during the first step of splicing is contained within unique branch site motifs in certain Fusarium introns. Data generated here show that introns of F. circinatum, as well as F. verticillioides, F. oxysporum, and F. graminearum, are characterized by a number of unique features such as the CTHAH and ACCAT motifs of the branch site. Incorporation of such information into genome annotation software will undoubtedly improve the accuracy of gene prediction methods used for Fusarium species and related fungi.

  11. The function of introns

    Directory of Open Access Journals (Sweden)

    Liran eCarmel

    2012-04-01

    Full Text Available The intron-exon architecture of many eukaryotic genes raises the intriguing question of whether this unique organization serves any function, or is it simply a result of the spread of functionless introns in eukaryotic genomes. In this review, we show that introns in contemporary species fulfill a broad spectrum of functions, and are involved in virtually every step of mRNA processing. We propose that this great diversity of intronic functions supports the notion that introns were indeed selfish elements in early eukaryotes, but then independently gained numerous functions in different eukaryotic lineages. We suggest a novel criterion of evolutionary conservation, dubbed intron positional conservation, which can identify functional introns.

  12. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Science.gov (United States)

    Fauteux, François; Strömvik, Martina V

    2009-01-01

    Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP) gene promoters from three plant families, namely Brassicaceae (mustards), Fabaceae (legumes) and Poaceae (grasses) using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L.) Heynh.), soybean (Glycine max (L.) Merr.) and rice (Oryza sativa L.) respectively. We have identified three conserved motifs (two RY-like and one ACGT-like) in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination of conserved motifs

  13. Seed storage protein gene promoters contain conserved DNA motifs in Brassicaceae, Fabaceae and Poaceae

    Directory of Open Access Journals (Sweden)

    Fauteux François

    2009-10-01

    Full Text Available Abstract Background Accurate computational identification of cis-regulatory motifs is difficult, particularly in eukaryotic promoters, which typically contain multiple short and degenerate DNA sequences bound by several interacting factors. Enrichment in combinations of rare motifs in the promoter sequence of functionally or evolutionarily related genes among several species is an indicator of conserved transcriptional regulatory mechanisms. This provides a basis for the computational identification of cis-regulatory motifs. Results We have used a discriminative seeding DNA motif discovery algorithm for an in-depth analysis of 54 seed storage protein (SSP gene promoters from three plant families, namely Brassicaceae (mustards, Fabaceae (legumes and Poaceae (grasses using backgrounds based on complete sets of promoters from a representative species in each family, namely Arabidopsis (Arabidopsis thaliana (L. Heynh., soybean (Glycine max (L. Merr. and rice (Oryza sativa L. respectively. We have identified three conserved motifs (two RY-like and one ACGT-like in Brassicaceae and Fabaceae SSP gene promoters that are similar to experimentally characterized seed-specific cis-regulatory elements. Fabaceae SSP gene promoter sequences are also enriched in a novel, seed-specific E2Fb-like motif. Conserved motifs identified in Poaceae SSP gene promoters include a GCN4-like motif, two prolamin-box-like motifs and an Skn-1-like motif. Evidence of the presence of a variant of the TATA-box is found in the SSP gene promoters from the three plant families. Motifs discovered in SSP gene promoters were used to score whole-genome sets of promoters from Arabidopsis, soybean and rice. The highest-scoring promoters are associated with genes coding for different subunits or precursors of seed storage proteins. Conclusion Seed storage protein gene promoter motifs are conserved in diverse species, and different plant families are characterized by a distinct combination

  14. 50/50 Expressional Odds of Retention Signifies the Distinction between Retained Introns and Constitutively Spliced Introns in Arabidopsis thaliana

    Directory of Open Access Journals (Sweden)

    Rui Mao

    2017-10-01

    Full Text Available Intron retention, one of the most prevalent alternative splicing events in plants, can lead to introns retained in mature mRNAs. However, in comparison with constitutively spliced introns (CSIs, the relevantly distinguishable features for retained introns (RIs are still poorly understood. This work proposes a computational pipeline to discover novel RIs from multiple next-generation RNA sequencing (RNA-Seq datasets of Arabidopsis thaliana. Using this pipeline, we detected 3,472 novel RIs from 18 RNA-Seq datasets and re-confirmed 1,384 RIs which are currently annotated in the TAIR10 database. We also use the expression of intron-containing isoforms as a new feature in addition to the conventional features. Based on these features, RIs are highly distinguishable from CSIs by machine learning methods, especially when the expressional odds of retention (i.e., the expression ratio of the RI-containing isoforms relative to the isoforms without RIs for the same gene reaches to or larger than 50/50. In this case, the RIs and CSIs can be clearly separated by the Random Forest with an outstanding performance of 0.95 on AUC (the area under a receiver operating characteristics curve. The closely related characteristics to the RIs include the low strength of splice sites, high similarity with the flanking exon sequences, low occurrence percentage of YTRAY near the acceptor site, existence of putative intronic splicing silencers (ISSs, i.e., AG/GA-rich motifs and intronic splicing enhancers (ISEs, i.e., TTTT-containing motifs, and enrichment of Serine/Arginine-Rich (SR proteins and heterogeneous nuclear ribonucleoparticle proteins (hnRNPs.

  15. Characteristic differences between the promoters of intron-containing and intronless ribosomal protein genes in yeast

    Directory of Open Access Journals (Sweden)

    Vingron Martin

    2008-10-01

    Full Text Available Abstract Background More than two thirds of the highly expressed ribosomal protein (RP genes in Saccharomyces cerevisiae contain introns, which is in sharp contrast to the genome-wide five percent intron-containing genes. It is well established that introns carry regulatory sequences and that the transcription of RP genes is extensively and coordinately regulated. Here we test the hypotheses that introns are innately associated with heavily transcribed genes and that introns of RP genes contribute regulatory TF binding sequences. Moreover, we investigate whether promoter features are significantly different between intron-containing and intronless RP genes. Results We find that directly measured transcription rates tend to be lower for intron-containing compared to intronless RP genes. We do not observe any specifically enriched sequence motifs in the introns of RP genes other than those of the branch point and the two splice sites. Comparing the promoters of intron-containing and intronless RP genes, we detect differences in number and position of Rap1-binding and IFHL motifs. Moreover, the analysis of the length distribution and the folding free energies suggest that, at least in a sub-population of RP genes, the 5' untranslated sequences are optimized for regulatory function. Conclusion Our results argue against the direct involvement of introns in the regulation of transcription of highly expressed genes. Moreover, systematic differences in motif distributions suggest that RP transcription factors may act differently on intron-containing and intronless gene promoters. Thus, our findings contribute to the decoding of the RP promoter architecture and may fuel the discussion on the evolution of introns.

  16. Identification of novel conserved functional motifs across most Influenza A viral strains

    Directory of Open Access Journals (Sweden)

    El-Azab Iman

    2011-01-01

    Full Text Available Abstract Background Influenza A virus poses a continuous threat to global public health. Design of novel universal drugs and vaccine requires a careful analysis of different strains of Influenza A viral genome from diverse hosts and subtypes. We performed a systematic in silico analysis of Influenza A viral segments of all available Influenza A viral strains and subtypes and grouped them based on host, subtype, and years isolated, and through multiple sequence alignments we extrapolated conserved regions, motifs, and accessible regions for functional mapping and annotation. Results Across all species and strains 87 highly conserved regions (conservation percentage > = 90% and 19 functional motifs (conservation percentage = 100% were found in PB2, PB1, PA, NP, M, and NS segments. The conservation percentage of these segments ranged between 94 - 98% in human strains (the most conserved, 85 - 93% in swine strains (the most variable, and 91 - 94% in avian strains. The most conserved segment was different in each host (PB1 for human strains, NS for avian strains, and M for swine strains. Target accessibility prediction yielded 324 accessible regions, with a single stranded probability > 0.5, of which 78 coincided with conserved regions. Some of the interesting annotations in these regions included sites for protein-protein interactions, the RNA binding groove, and the proton ion channel. Conclusions The influenza virus has evolved to adapt to its host through variations in the GC content and conservation percentage of the conserved regions. Nineteen universal conserved functional motifs were discovered, of which some were accessible regions with interesting biological functions. These regions will serve as a foundation for universal drug targets as well as universal vaccine design.

  17. Comparative analysis of evolutionarily conserved motifs of epidermal growth factor receptor 2 (HER2) predicts novel potential therapeutic epitopes

    DEFF Research Database (Denmark)

    Deng, Xiaohong; Zheng, Xuxu; Yang, Huanming

    2014-01-01

    druggable epitopes/targets. We employed the PROSITE Scan to detect structurally conserved motifs and PRINTS to search for linearly conserved motifs of ECD HER2. We found that the epitopes recognized by trastuzumab and pertuzumab are located in the predicted conserved motifs of ECD HER2, supporting our...

  18. Differentiation and fiber type-specific activity of a muscle creatine kinase intronic enhancer

    Directory of Open Access Journals (Sweden)

    Tai Phillip WL

    2011-07-01

    Full Text Available Abstract Background Hundreds of genes, including muscle creatine kinase (MCK, are differentially expressed in fast- and slow-twitch muscle fibers, but the fiber type-specific regulatory mechanisms are not well understood. Results Modulatory region 1 (MR1 is a 1-kb regulatory region within MCK intron 1 that is highly active in terminally differentiating skeletal myocytes in vitro. A MCK small intronic enhancer (MCK-SIE containing a paired E-box/myocyte enhancer factor 2 (MEF2 regulatory motif resides within MR1. The SIE's transcriptional activity equals that of the extensively characterized 206-bp MCK 5'-enhancer, but the MCK-SIE is flanked by regions that can repress its activity via the individual and combined effects of about 15 different but highly conserved 9- to 24-bp sequences. ChIP and ChIP-Seq analyses indicate that the SIE and the MCK 5'-enhancer are occupied by MyoD, myogenin and MEF2. Many other E-boxes located within or immediately adjacent to intron 1 are not occupied by MyoD or myogenin. Transgenic analysis of a 6.5-kb MCK genomic fragment containing the 5'-enhancer and proximal promoter plus the 3.2-kb intron 1, with and without MR1, indicates that MR1 is critical for MCK expression in slow- and intermediate-twitch muscle fibers (types I and IIa, respectively, but is not required for expression in fast-twitch muscle fibers (types IIb and IId. Conclusions In this study, we discovered that MR1 is critical for MCK expression in slow- and intermediate-twitch muscle fibers and that MR1's positive transcriptional activity depends on a paired E-box MEF2 site motif within a SIE. This is the first study to delineate the DNA controls for MCK expression in different skeletal muscle fiber types.

  19. FTZ-Factor1 and Fushi tarazu interact via conserved nuclear receptor and coactivator motifs

    Science.gov (United States)

    Schwartz, Carol J.E.; Sampson, Heidi M.; Hlousek, Daniela; Percival-Smith, Anthony; Copeland, John W.R.; Simmonds, Andrew J.; Krause, Henry M.

    2001-01-01

    To activate transcription, most nuclear receptor proteins require coactivators that bind to their ligand-binding domains (LBDs). The Drosophila FTZ-Factor1 (FTZ-F1) protein is a conserved member of the nuclear receptor superfamily, but was previously thought to lack an AF2 motif, a motif that is required for ligand and coactivator binding. Here we show that FTZ-F1 does have an AF2 motif and that it is required to bind a coactivator, the homeodomain-containing protein Fushi tarazu (FTZ). We also show that FTZ contains an AF2-interacting nuclear receptor box, the first to be found in a homeodomain protein. Both interaction motifs are shown to be necessary for physical interactions in vitro and for functional interactions in developing embryos. These unexpected findings have important implications for the conserved homologs of the two proteins. PMID:11157757

  20. Development of ent-kaurene Oxidase-Based Conserved Intron Spanning Primers for Species Identification in the Genus Poa (Poaceae; Bluegrass

    Directory of Open Access Journals (Sweden)

    Jonathan M. LaMantia

    2018-04-01

    Full Text Available Interspecific hybridization has been attempted to combine the heat and drought of Poa arachnifera Torr. with the turf quality characteristics of several Poa species. Confirmation of an F1 hybrid through morphological analysis of vegetative and flowering characteristics is often time consuming and ambiguous. Ent-kaurene oxidase (KO has been sequenced in rice, barley, and wheat. In rice, each of the five copies of KO gene has unique lengths for the first intron. Conserved intron spanning primers (CISP can be used as a DNA marker to exploit variations of intron lengths that flank conserved gene sequences. In the present study, we developed CISP to sequence partial genomic fragments of the KO gene from seven Poa species. Through sequence analysis, species-specific primers were also developed to produce co-dominant markers that can be used to identify interspecific hybrids between Texas bluegrass and six other Poa species used in the present study.

  1. The BsaHI restriction-modification system: Cloning, sequencing and analysis of conserved motifs

    Directory of Open Access Journals (Sweden)

    Roberts Richard J

    2008-05-01

    Full Text Available Abstract Background Restriction and modification enzymes typically recognise short DNA sequences of between two and eight bases in length. Understanding the mechanism of this recognition represents a significant challenge that we begin to address for the BsaHI restriction-modification system, which recognises the six base sequence GRCGYC. Results The DNA sequences of the genes for the BsaHI methyltransferase, bsaHIM, and restriction endonuclease, bsaHIR, have been determined (GenBank accession #EU386360, cloned and expressed in E. coli. Both the restriction endonuclease and methyltransferase enzymes share significant similarity with a group of 6 other enzymes comprising the restriction-modification systems HgiDI and HgiGI and the putative HindVP, NlaCORFDP, NpuORFC228P and SplZORFNP restriction-modification systems. A sequence alignment of these homologues shows that their amino acid sequences are largely conserved and highlights several motifs of interest. We target one such conserved motif, reading SPERRFD, at the C-terminal end of the bsaHIR gene. A mutational analysis of these amino acids indicates that the motif is crucial for enzymatic activity. Sequence alignment of the methyltransferase gene reveals a short motif within the target recognition domain that is conserved among enzymes recognising the same sequences. Thus, this motif may be used as a diagnostic tool to define the recognition sequences of the cytosine C5 methyltransferases. Conclusion We have cloned and sequenced the BsaHI restriction and modification enzymes. We have identified a region of the R. BsaHI enzyme that is crucial for its activity. Analysis of the amino acid sequence of the BsaHI methyltransferase enzyme led us to propose two new motifs that can be used in the diagnosis of the recognition sequence of the cytosine C5-methyltransferases.

  2. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium.

    Science.gov (United States)

    Catania, Francesco; Lynch, Michael

    2010-05-04

    In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa) remains a virtually unexplored issue. By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Our observations 1) shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2) are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3) reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  3. Evolution of the Exon-Intron Structure in Ciliate Genomes.

    Directory of Open Access Journals (Sweden)

    Vladyslav S Bondarenko

    Full Text Available A typical eukaryotic gene is comprised of alternating stretches of regions, exons and introns, retained in and spliced out a mature mRNA, respectively. Although the length of introns may vary substantially among organisms, a large fraction of genes contains short introns in many species. Notably, some Ciliates (Paramecium and Nyctotherus possess only ultra-short introns, around 25 bp long. In Paramecium, ultra-short introns with length divisible by three (3n are under strong evolutionary pressure and have a high frequency of in-frame stop codons, which, in the case of intron retention, cause premature termination of mRNA translation and consequent degradation of the mis-spliced mRNA by the nonsense-mediated decay mechanism. Here, we analyzed introns in five genera of Ciliates, Paramecium, Tetrahymena, Ichthyophthirius, Oxytricha, and Stylonychia. Introns can be classified into two length classes in Tetrahymena and Ichthyophthirius (with means 48 bp, 69 bp, and 55 bp, 64 bp, respectively, but, surprisingly, comprise three distinct length classes in Oxytricha and Stylonychia (with means 33-35 bp, 47-51 bp, and 78-80 bp. In most ranges of the intron lengths, 3n introns are underrepresented and have a high frequency of in-frame stop codons in all studied species. Introns of Paramecium, Tetrahymena, and Ichthyophthirius are preferentially located at the 5' and 3' ends of genes, whereas introns of Oxytricha and Stylonychia are strongly skewed towards the 5' end. Analysis of evolutionary conservation shows that, in each studied genome, a significant fraction of intron positions is conserved between the orthologs, but intron lengths are not correlated between the species. In summary, our study provides a detailed characterization of introns in several genera of Ciliates and highlights some of their distinctive properties, which, together, indicate that splicing spellchecking is a universal and evolutionarily conserved process in the biogenesis of short

  4. A Conserved Splicing Silencer Dynamically Regulates O-GlcNAc Transferase Intron Retention and O-GlcNAc Homeostasis

    Directory of Open Access Journals (Sweden)

    Sung-Kyun Park

    2017-08-01

    Full Text Available Modification of nucleocytoplasmic proteins with O-GlcNAc regulates a wide variety of cellular processes and has been linked to human diseases. The enzymes O-GlcNAc transferase (OGT and O-GlcNAcase (OGA add and remove O-GlcNAc, but the mechanisms regulating their expression remain unclear. Here, we demonstrate that retention of the fourth intron of OGT is regulated in response to O-GlcNAc levels. We further define a conserved intronic splicing silencer (ISS that is necessary for OGT intron retention. Deletion of the ISS in colon cancer cells leads to increases in OGT, but O-GlcNAc homeostasis is maintained by concomitant increases in OGA protein. However, the ISS-deleted cells are hypersensitive to OGA inhibition in culture and in soft agar. Moreover, growth of xenograft tumors from ISS-deleted cells is compromised in mice treated with an OGA inhibitor. Thus, ISS-mediated regulation of OGT intron retention is a key component in OGT expression and maintaining O-GlcNAc homeostasis.

  5. Development of ent-kaurene Oxidase-Based Conserved Intron Spanning Primers for Species Identification in the Genus Poa (Poaceae; Bluegrass)

    OpenAIRE

    Jonathan M. LaMantia; Ambika Chandra; David R. Huff

    2018-01-01

    Interspecific hybridization has been attempted to combine the heat and drought of Poa arachnifera Torr. with the turf quality characteristics of several Poa species. Confirmation of an F1 hybrid through morphological analysis of vegetative and flowering characteristics is often time consuming and ambiguous. Ent-kaurene oxidase (KO) has been sequenced in rice, barley, and wheat. In rice, each of the five copies of KO gene has unique lengths for the first intron. Conserved intron spanning prime...

  6. Evolutionary dynamics of a conserved sequence motif in the ribosomal genes of the ciliate Paramecium

    Directory of Open Access Journals (Sweden)

    Lynch Michael

    2010-05-01

    Full Text Available Abstract Background In protozoa, the identification of preserved motifs by comparative genomics is often impeded by difficulties to generate reliable alignments for non-coding sequences. Moreover, the evolutionary dynamics of regulatory elements in 3' untranslated regions (both in protozoa and metazoa remains a virtually unexplored issue. Results By screening Paramecium tetraurelia's 3' untranslated regions for 8-mers that were previously found to be preserved in mammalian 3' UTRs, we detect and characterize a motif that is distinctly conserved in the ribosomal genes of this ciliate. The motif appears to be conserved across Paramecium aurelia species but is absent from the ribosomal genes of four additional non-Paramecium species surveyed, including another ciliate, Tetrahymena thermophila. Motif-free ribosomal genes retain fewer paralogs in the genome and appear to be lost more rapidly relative to motif-containing genes. Features associated with the discovered preserved motif are consistent with this 8-mer playing a role in post-transcriptional regulation. Conclusions Our observations 1 shed light on the evolution of a putative regulatory motif across large phylogenetic distances; 2 are expected to facilitate the understanding of the modulation of ribosomal genes expression in Paramecium; and 3 reveal a largely unexplored--and presumably not restricted to Paramecium--association between the presence/absence of a DNA motif and the evolutionary fate of its host genes.

  7. A Conserved Metal Binding Motif in the Bacillus subtilis Competence Protein ComFA Enhances Transformation.

    Science.gov (United States)

    Chilton, Scott S; Falbel, Tanya G; Hromada, Susan; Burton, Briana M

    2017-08-01

    Genetic competence is a process in which cells are able to take up DNA from their environment, resulting in horizontal gene transfer, a major mechanism for generating diversity in bacteria. Many bacteria carry homologs of the central DNA uptake machinery that has been well characterized in Bacillus subtilis It has been postulated that the B. subtilis competence helicase ComFA belongs to the DEAD box family of helicases/translocases. Here, we made a series of mutants to analyze conserved amino acid motifs in several regions of B. subtilis ComFA. First, we confirmed that ComFA activity requires amino acid residues conserved among the DEAD box helicases, and second, we show that a zinc finger-like motif consisting of four cysteines is required for efficient transformation. Each cysteine in the motif is important, and mutation of at least two of the cysteines dramatically reduces transformation efficiency. Further, combining multiple cysteine mutations with the helicase mutations shows an additive phenotype. Our results suggest that the helicase and metal binding functions are two distinct activities important for ComFA function during transformation. IMPORTANCE ComFA is a highly conserved protein that has a role in DNA uptake during natural competence, a mechanism for horizontal gene transfer observed in many bacteria. Investigation of the details of the DNA uptake mechanism is important for understanding the ways in which bacteria gain new traits from their environment, such as drug resistance. To dissect the role of ComFA in the DNA uptake machinery, we introduced point mutations into several motifs in the protein sequence. We demonstrate that several amino acid motifs conserved among ComFA proteins are important for efficient transformation. This report is the first to demonstrate the functional requirement of an amino-terminal cysteine motif in ComFA. Copyright © 2017 American Society for Microbiology.

  8. BayesMotif: de novo protein sorting motif discovery from impure datasets.

    Science.gov (United States)

    Hu, Jianjun; Zhang, Fan

    2010-01-18

    Protein sorting is the process that newly synthesized proteins are transported to their target locations within or outside of the cell. This process is precisely regulated by protein sorting signals in different forms. A major category of sorting signals are amino acid sub-sequences usually located at the N-terminals or C-terminals of protein sequences. Genome-wide experimental identification of protein sorting signals is extremely time-consuming and costly. Effective computational algorithms for de novo discovery of protein sorting signals is needed to improve the understanding of protein sorting mechanisms. We formulated the protein sorting motif discovery problem as a classification problem and proposed a Bayesian classifier based algorithm (BayesMotif) for de novo identification of a common type of protein sorting motifs in which a highly conserved anchor is present along with a less conserved motif regions. A false positive removal procedure is developed to iteratively remove sequences that are unlikely to contain true motifs so that the algorithm can identify motifs from impure input sequences. Experiments on both implanted motif datasets and real-world datasets showed that the enhanced BayesMotif algorithm can identify anchored sorting motifs from pure or impure protein sequence dataset. It also shows that the false positive removal procedure can help to identify true motifs even when there is only 20% of the input sequences containing true motif instances. We proposed BayesMotif, a novel Bayesian classification based algorithm for de novo discovery of a special category of anchored protein sorting motifs from impure datasets. Compared to conventional motif discovery algorithms such as MEME, our algorithm can find less-conserved motifs with short highly conserved anchors. Our algorithm also has the advantage of easy incorporation of additional meta-sequence features such as hydrophobicity or charge of the motifs which may help to overcome the limitations of

  9. Discovery of candidate KEN-box motifs using cell cycle keyword enrichment combined with native disorder prediction and motif conservation.

    Science.gov (United States)

    Michael, Sushama; Travé, Gilles; Ramu, Chenna; Chica, Claudia; Gibson, Toby J

    2008-02-15

    KEN-box-mediated target selection is one of the mechanisms used in the proteasomal destruction of mitotic cell cycle proteins via the APC/C complex. While annotating the Eukaryotic Linear Motif resource (ELM, http://elm.eu.org/), we found that KEN motifs were significantly enriched in human protein entries with cell cycle keywords in the UniProt/Swiss-Prot database-implying that KEN-boxes might be more common than reported. Matches to short linear motifs in protein database searches are not, per se, significant. KEN-box enrichment with cell cycle Gene Ontology terms suggests that collectively these motifs are functional but does not prove that any given instance is so. Candidates were surveyed for native disorder prediction using GlobPlot and IUPred and for motif conservation in homologues. Among >25 strong new candidates, the most notable are human HIPK2, CHFR, CDC27, Dab2, Upf2, kinesin Eg5, DNA Topoisomerase 1 and yeast Cdc5 and Swi5. A similar number of weaker candidates were present. These proteins have yet to be tested for APC/C targeted destruction, providing potential new avenues of research.

  10. Macronuclear genome structure of the ciliate Nyctotherus ovalis: Single-gene chromosomes and tiny introns

    Directory of Open Access Journals (Sweden)

    Landweber Laura F

    2008-12-01

    Full Text Available Abstract Background Nyctotherus ovalis is a single-celled eukaryote that has hydrogen-producing mitochondria and lives in the hindgut of cockroaches. Like all members of the ciliate taxon, it has two types of nuclei, a micronucleus and a macronucleus. N. ovalis generates its macronuclear chromosomes by forming polytene chromosomes that subsequently develop into macronuclear chromosomes by DNA elimination and rearrangement. Results We examined the structure of these gene-sized macronuclear chromosomes in N. ovalis. We determined the telomeres, subtelomeric regions, UTRs, coding regions and introns by sequencing a large set of macronuclear DNA sequences (4,242 and cDNAs (5,484 and comparing them with each other. The telomeres consist of repeats CCC(AAAACCCCn, similar to those in spirotrichous ciliates such as Euplotes, Sterkiella (Oxytricha and Stylonychia. Per sequenced chromosome we found evidence for either a single protein-coding gene, a single tRNA, or the complete ribosomal RNAs cluster. Hence the chromosomes appear to encode single transcripts. In the short subtelomeric regions we identified a few overrepresented motifs that could be involved in gene regulation, but there is no consensus polyadenylation site. The introns are short (21–29 nucleotides, and a significant fraction (1/3 of the tiny introns is conserved in the distantly related ciliate Paramecium tetraurelia. As has been observed in P. tetraurelia, the N. ovalis introns tend to contain in-frame stop codons or have a length that is not dividable by three. This pattern causes premature termination of mRNA translation in the event of intron retention, and potentially degradation of unspliced mRNAs by the nonsense-mediated mRNA decay pathway. Conclusion The combination of short leaders, tiny introns and single genes leads to very minimal macronuclear chromosomes. The smallest we identified contained only 150 nucleotides.

  11. A Conserved GPG-Motif in the HIV-1 Nef Core Is Required for Principal Nef-Activities.

    Directory of Open Access Journals (Sweden)

    Marta Martínez-Bonet

    Full Text Available To find out new determinants required for Nef activity we performed a functional alanine scanning analysis along a discrete but highly conserved region at the core of HIV-1 Nef. We identified the GPG-motif, located at the 121-137 region of HIV-1 NL4.3 Nef, as a novel protein signature strictly required for the p56Lck dependent Nef-induced CD4-downregulation in T-cells. Since the Nef-GPG motif was dispensable for CD4-downregulation in HeLa-CD4 cells, Nef/AP-1 interaction and Nef-dependent effects on Tf-R trafficking, the observed effects on CD4 downregulation cannot be attributed to structure constraints or to alterations on general protein trafficking. Besides, we found that the GPG-motif was also required for Nef-dependent inhibition of ring actin re-organization upon TCR triggering and MHCI downregulation, suggesting that the GPG-motif could actively cooperate with the Nef PxxP motif for these HIV-1 Nef-related effects. Finally, we observed that the Nef-GPG motif was required for optimal infectivity of those viruses produced in T-cells. According to these findings, we propose the conserved GPG-motif in HIV-1 Nef as functional region required for HIV-1 infectivity and therefore with a potential interest for the interference of Nef activity during HIV-1 infection.

  12. Peptomics, identification of novel cationic Arabidopsis peptides with conserved sequence motifs

    DEFF Research Database (Denmark)

    Olsen, Addie Nina; Mundy, John; Skriver, Karen

    2002-01-01

    Arabidopsis family of 34 genes. The predicted peptides are characterized by a conserved C-terminal sequence motif and additional primary structure conservation in a core region. The majority of these genes had not previously been annotated. A subset of the predicted peptides show high overall sequence...... similarity to Rapid Alkalinization Factor (RALF), a peptide isolated from tobacco. We therefore refer to this peptide family as RALFL for RALF-Like. RT-PCR analysis confirmed that several of the Arabidopsis genes are expressed and that their expression patterns vary. The identification of a large gene family...

  13. Conserved intron positions in FGFR genes reflect the modular structure of FGFR and reveal stepwise addition of domains to an already complex ancestral FGFR.

    Science.gov (United States)

    Rebscher, Nicole; Deichmann, Christina; Sudhop, Stefanie; Fritzenwanker, Jens Holger; Green, Stephen; Hassel, Monika

    2009-10-01

    We have analyzed the evolution of fibroblast growth factor receptor (FGFR) tyrosine kinase genes throughout a wide range of animal phyla. No evidence for an FGFR gene was found in Porifera, but we tentatively identified an FGFR gene in the placozoan Trichoplax adhaerens. The gene encodes a protein with three immunoglobulin-like domains, a single-pass transmembrane, and a split tyrosine kinase domain. By superimposing intron positions of 20 FGFR genes from Placozoa, Cnidaria, Protostomia, and Deuterostomia over the respective protein domain structure, we identified ten ancestral introns and three conserved intron groups. Our analysis shows (1) that the position of ancestral introns correlates to the modular structure of FGFRs, (2) that the acidic domain very likely evolved in the last common ancestor of triploblasts, (3) that splicing of IgIII was enabled by a triploblast-specific insertion, and (4) that IgI is subject to substantial loss or duplication particularly in quickly evolving genomes. Moreover, intron positions in the catalytic domain of FGFRs map to the borders of protein subdomains highly conserved in other serine/threonine kinases. Nevertheless, these introns were introduced in metazoan receptor tyrosine kinases exclusively. Our data support the view that protein evolution dating back to the Cambrian explosion took place in such a short time window that only subtle changes in the domain structure are detectable in extant representatives of animal phyla. We propose that the first multidomain FGFR originated in the last common ancestor of Placozoa, Cnidaria, and Bilateria. Additional domains were introduced mainly in the ancestor of triploblasts and in the Ecdysozoa.

  14. PDL1 Signals through Conserved Sequence Motifs to Overcome Interferon-Mediated Cytotoxicity

    Directory of Open Access Journals (Sweden)

    Maria Gato-Cañas

    2017-08-01

    Full Text Available PDL1 blockade produces remarkable clinical responses, thought to occur by T cell reactivation through prevention of PDL1-PD1 T cell inhibitory interactions. Here, we find that PDL1 cell-intrinsic signaling protects cancer cells from interferon (IFN cytotoxicity and accelerates tumor progression. PDL1 inhibited IFN signal transduction through a conserved class of sequence motifs that mediate crosstalk with IFN signaling. Abrogation of PDL1 expression or antibody-mediated PDL1 blockade strongly sensitized cancer cells to IFN cytotoxicity through a STAT3/caspase-7-dependent pathway. Moreover, somatic mutations found in human carcinomas within these PDL1 sequence motifs disrupted motif regulation, resulting in PDL1 molecules with enhanced protective activities from type I and type II IFN cytotoxicity. Overall, our results reveal a mode of action of PDL1 in cancer cells as a first line of defense against IFN cytotoxicity.

  15. The conservation pattern of short linear motifs is highly correlated with the function of interacting protein domains

    Directory of Open Access Journals (Sweden)

    Wang Yiguo

    2008-10-01

    Full Text Available Abstract Background Many well-represented domains recognize primary sequences usually less than 10 amino acids in length, called Short Linear Motifs (SLiMs. Accurate prediction of SLiMs has been difficult because they are short (often Results Our combined approach revealed that SLiMs are highly conserved in proteins from functional classes that are known to interact with a specific domain, but that they are not conserved in most other protein groups. We found that SLiMs recognized by SH2 domains were highly conserved in receptor kinases/phosphatases, adaptor molecules, and tyrosine kinases/phosphatases, that SLiMs recognized by SH3 domains were highly conserved in cytoskeletal and cytoskeletal-associated proteins, that SLiMs recognized by PDZ domains were highly conserved in membrane proteins such as channels and receptors, and that SLiMs recognized by S/T kinase domains were highly conserved in adaptor molecules, S/T kinases/phosphatases, and proteins involved in transcription or cell cycle control. We studied Tyr-SLiMs recognized by SH2 domains in more detail, and found that SH2-recognized Tyr-SLiMs on the cytoplasmic side of membrane proteins are more highly conserved than those on the extra-cellular side. Also, we found that SH2-recognized Tyr-SLiMs that are associated with SH3 motifs and a tyrosine kinase phosphorylation motif are more highly conserved. Conclusion The interactome of protein domains is reflected by the evolutionary conservation of SLiMs recognized by these domains. Combining scoring matrixes derived from peptide libraries and conservation analysis, we would be able to find those protein groups that are more likely to interact with specific domains.

  16. Computational Analysis of an Evolutionarily Conserved VertebrateMuscle Alternative Splicing Program

    Energy Technology Data Exchange (ETDEWEB)

    Das, Debopriya; Clark, Tyson A.; Schweitzer, Anthony; Marr,Henry; Yamamoto, Miki L.; Parra, Marilyn K.; Arribere, Josh; Minovitsky,Simon; Dubchak, Inna; Blume, John E.; Conboy, John G.

    2006-06-15

    A novel exon microarray format that probes gene expression with single exon resolution was employed to elucidate critical features of a vertebrate muscle alternative splicing program. A dataset of 56 microarray-defined, muscle-enriched exons and their flanking introns were examined computationally in order to investigate coordination of the muscle splicing program. Candidate intron regulatory motifs were required to meet several stringent criteria: significant over-representation near muscle-enriched exons, correlation with muscle expression, and phylogenetic conservation among genomes of several vertebrate orders. Three classes of regulatory motifs were identified in the proximal downstream intron, within 200nt of the target exons: UGCAUG, a specific binding site for Fox-1 related splicing factors; ACUAAC, a novel branchpoint-like element; and UG-/UGC-rich elements characteristic of binding sites for CELF splicing factors. UGCAUG was remarkably enriched, being present in nearly one-half of all cases. These studies suggest that Fox and CELF splicing factors play a major role in enforcing the muscle-specific alternative splicing program, facilitating expression of a set of unique isoforms of cytoskeletal proteins that are critical to muscle cell differentiation. Supplementary materials: There are four supplementary tables and one supplementary figure. The tables provide additional detailed information concerning the muscle-enriched datasets, and about over-represented oligonucleotide sequences in the flanking introns. The supplementary figure shows RT-PCR data confirming the muscle-enriched expression of exons predicted from the microarray analysis.

  17. Accurate, model-based tuning of synthetic gene expression using introns in S. cerevisiae.

    Directory of Open Access Journals (Sweden)

    Ido Yofe

    2014-06-01

    Full Text Available Introns are key regulators of eukaryotic gene expression and present a potentially powerful tool for the design of synthetic eukaryotic gene expression systems. However, intronic control over gene expression is governed by a multitude of complex, incompletely understood, regulatory mechanisms. Despite this lack of detailed mechanistic understanding, here we show how a relatively simple model enables accurate and predictable tuning of synthetic gene expression system in yeast using several predictive intron features such as transcript folding and sequence motifs. Using only natural Saccharomyces cerevisiae introns as regulators, we demonstrate fine and accurate control over gene expression spanning a 100 fold expression range. These results broaden the engineering toolbox of synthetic gene expression systems and provide a framework in which precise and robust tuning of gene expression is accomplished.

  18. An evolutionarily conserved intronic region controls the spatiotemporal expression of the transcription factor Sox10

    Directory of Open Access Journals (Sweden)

    Pavan William J

    2008-10-01

    Full Text Available Abstract Background A major challenge lies in understanding the complexities of gene regulation. Mutation of the transcription factor SOX10 is associated with several human diseases. The disease phenotypes reflect the function of SOX10 in diverse tissues including the neural crest, central nervous system and otic vesicle. As expected, the SOX10 expression pattern is complex and highly dynamic, but little is known of the underlying mechanisms regulating its spatiotemporal pattern. SOX10 expression is highly conserved between all vertebrates characterised. Results We have combined in vivo testing of DNA fragments in zebrafish and computational comparative genomics to identify the first regulatory regions of the zebrafish sox10 gene. Both approaches converged on the 3' end of the conserved 1st intron as being critical for spatial patterning of sox10 in the embryo. Importantly, we have defined a minimal region crucial for this function. We show that this region contains numerous binding sites for transcription factors known to be essential in early neural crest induction, including Tcf/Lef, Sox and FoxD3. We show that the identity and relative position of these binding sites are conserved between zebrafish and mammals. A further region, partially required for oligodendrocyte expression, lies in the 5' region of the same intron and contains a putative CSL binding site, consistent with a role for Notch signalling in sox10 regulation. Furthermore, we show that β-catenin, Notch signalling and Sox9 can induce ectopic sox10 expression in early embryos, consistent with regulatory roles predicted from our transgenic and computational results. Conclusion We have thus identified two major sites of sox10 regulation in vertebrates and provided evidence supporting a role for at least three factors in driving sox10 expression in neural crest, otic epithelium and oligodendrocyte domains.

  19. Space-related pharma-motifs for fast search of protein binding motifs and polypharmacological targets.

    Science.gov (United States)

    Chiu, Yi-Yuan; Lin, Chun-Yu; Lin, Chih-Ta; Hsu, Kai-Cheng; Chang, Li-Zen; Yang, Jinn-Moon

    2012-01-01

    To discover a compound inhibiting multiple proteins (i.e. polypharmacological targets) is a new paradigm for the complex diseases (e.g. cancers and diabetes). In general, the polypharmacological proteins often share similar local binding environments and motifs. As the exponential growth of the number of protein structures, to find the similar structural binding motifs (pharma-motifs) is an emergency task for drug discovery (e.g. side effects and new uses for old drugs) and protein functions. We have developed a Space-Related Pharmamotifs (called SRPmotif) method to recognize the binding motifs by searching against protein structure database. SRPmotif is able to recognize conserved binding environments containing spatially discontinuous pharma-motifs which are often short conserved peptides with specific physico-chemical properties for protein functions. Among 356 pharma-motifs, 56.5% interacting residues are highly conserved. Experimental results indicate that 81.1% and 92.7% polypharmacological targets of each protein-ligand complex are annotated with same biological process (BP) and molecular function (MF) terms, respectively, based on Gene Ontology (GO). Our experimental results show that the identified pharma-motifs often consist of key residues in functional (active) sites and play the key roles for protein functions. The SRPmotif is available at http://gemdock.life.nctu.edu.tw/SRP/. SRPmotif is able to identify similar pharma-interfaces and pharma-motifs sharing similar binding environments for polypharmacological targets by rapidly searching against the protein structure database. Pharma-motifs describe the conservations of binding environments for drug discovery and protein functions. Additionally, these pharma-motifs provide the clues for discovering new sequence-based motifs to predict protein functions from protein sequence databases. We believe that SRPmotif is useful for elucidating protein functions and drug discovery.

  20. Introns Protect Eukaryotic Genomes from Transcription-Associated Genetic Instability.

    Science.gov (United States)

    Bonnet, Amandine; Grosso, Ana R; Elkaoutari, Abdessamad; Coleno, Emeline; Presle, Adrien; Sridhara, Sreerama C; Janbon, Guilhem; Géli, Vincent; de Almeida, Sérgio F; Palancade, Benoit

    2017-08-17

    Transcription is a source of genetic instability that can notably result from the formation of genotoxic DNA:RNA hybrids, or R-loops, between the nascent mRNA and its template. Here we report an unexpected function for introns in counteracting R-loop accumulation in eukaryotic genomes. Deletion of endogenous introns increases R-loop formation, while insertion of an intron into an intronless gene suppresses R-loop accumulation and its deleterious impact on transcription and recombination in yeast. Recruitment of the spliceosome onto the mRNA, but not splicing per se, is shown to be critical to attenuate R-loop formation and transcription-associated genetic instability. Genome-wide analyses in a number of distant species differing in their intron content, including human, further revealed that intron-containing genes and the intron-richest genomes are best protected against R-loop accumulation and subsequent genetic instability. Our results thereby provide a possible rationale for the conservation of introns throughout the eukaryotic lineage. Copyright © 2017 Elsevier Inc. All rights reserved.

  1. Introns: The Functional Benefits of Introns in Genomes

    Directory of Open Access Journals (Sweden)

    Bong-Seok Jo

    2015-12-01

    Full Text Available The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.

  2. Introns: The Functional Benefits of Introns in Genomes.

    Science.gov (United States)

    Jo, Bong-Seok; Choi, Sun Shim

    2015-12-01

    The intron has been a big biological mystery since it was first discovered in several aspects. First, all of the completely sequenced eukaryotes harbor introns in the genomic structure, whereas no prokaryotes identified so far carry introns. Second, the amount of total introns varies in different species. Third, the length and number of introns vary in different genes, even within the same species genome. Fourth, all introns are copied into RNAs by transcription and DNAs by replication processes, but intron sequences do not participate in protein-coding sequences. The existence of introns in the genome should be a burden to some cells, because cells have to consume a great deal of energy to copy and excise them exactly at the correct positions with the help of complicated spliceosomal machineries. The existence throughout the long evolutionary history is explained, only if selective advantages of carrying introns are assumed to be given to cells to overcome the negative effect of introns. In that regard, we summarize previous research about the functional roles or benefits of introns. Additionally, several other studies strongly suggesting that introns should not be junk will be introduced.

  3. Conserved binding of GCAC motifs by MEC-8, couch potato, and the RBPMS protein family

    Science.gov (United States)

    Soufari, Heddy

    2017-01-01

    Precise regulation of mRNA processing, translation, localization, and stability relies on specific interactions with RNA-binding proteins whose biological function and target preference are dictated by their preferred RNA motifs. The RBPMS family of RNA-binding proteins is defined by a conserved RNA recognition motif (RRM) domain found in metazoan RBPMS/Hermes and RBPMS2, Drosophila couch potato, and MEC-8 from Caenorhabditis elegans. In order to determine the parameters of RNA sequence recognition by the RBPMS family, we have first used the N-terminal domain from MEC-8 in binding assays and have demonstrated a preference for two GCAC motifs optimally separated by >6 nucleotides (nt). We have also determined the crystal structure of the dimeric N-terminal RRM domain from MEC-8 in the unbound form, and in complex with an oligonucleotide harboring two copies of the optimal GCAC motif. The atomic details reveal the molecular network that provides specificity to all four bases in the motif, including multiple hydrogen bonds to the initial guanine. Further studies with human RBPMS, as well as Drosophila couch potato, confirm a general preference for this double GCAC motif by other members of the protein family and the presence of this motif in known targets. PMID:28003515

  4. Multi-species comparative analysis of the equine ACE gene identifies a highly conserved potential transcription factor binding site in intron 16.

    Directory of Open Access Journals (Sweden)

    Natasha A Hamilton

    Full Text Available Angiotensin converting enzyme (ACE is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism.

  5. Multi-species comparative analysis of the equine ACE gene identifies a highly conserved potential transcription factor binding site in intron 16.

    Science.gov (United States)

    Hamilton, Natasha A; Tammen, Imke; Raadsma, Herman W

    2013-01-01

    Angiotensin converting enzyme (ACE) is essential for control of blood pressure. The human ACE gene contains an intronic Alu indel (I/D) polymorphism that has been associated with variation in serum enzyme levels, although the functional mechanism has not been identified. The polymorphism has also been associated with cardiovascular disease, type II diabetes, renal disease and elite athleticism. We have characterized the ACE gene in horses of breeds selected for differing physical abilities. The equine gene has a similar structure to that of all known mammalian ACE genes. Nine common single nucleotide polymorphisms (SNPs) discovered in pooled DNA were found to be inherited in nine haplotypes. Three of these SNPs were located in intron 16, homologous to that containing the Alu polymorphism in the human. A highly conserved 18 bp sequence, also within that intron, was identified as being a potential binding site for the transcription factors Oct-1, HFH-1 and HNF-3β, and lies within a larger area of higher than normal homology. This putative regulatory element may contribute to regulation of the documented inter-individual variation in human circulating enzyme levels, for which a functional mechanism is yet to be defined. Two equine SNPs occurred within the conserved area in intron 16, although neither of them disrupted the putative binding site. We propose a possible regulatory mechanism of the ACE gene in mammalian species which was previously unknown. This advance will allow further analysis leading to a better understanding of the mechanisms underpinning the associations seen between the human Alu polymorphism and enzyme levels, cardiovascular disease states and elite athleticism.

  6. A conserved cysteine motif is critical for rice ceramide kinase activity and function.

    Directory of Open Access Journals (Sweden)

    Fang-Cheng Bi

    Full Text Available Ceramide kinase (CERK is a key regulator of cell survival in dicotyledonous plants and animals. Much less is known about the roles of CERK and ceramides in mediating cellular processes in monocot plants. Here, we report the characterization of a ceramide kinase, OsCERK, from rice (Oryza sativa spp. Japonica cv. Nipponbare and investigate the effects of ceramides on rice cell viability.OsCERK can complement the Arabidopsis CERK mutant acd5. Recombinant OsCERK has ceramide kinase activity with Michaelis-Menten kinetics and optimal activity at 7.0 pH and 40°C. Mg2+ activates OsCERK in a concentration-dependent manner. Importantly, a CXXXCXXC motif, conserved in all ceramide kinases and important for the activity of the human enzyme, is critical for OsCERK enzyme activity and in planta function. In a rice protoplast system, inhibition of CERK leads to cell death and the ratio of added ceramide and ceramide-1-phosphate, CERK's substrate and product, respectively, influences cell survival. Ceramide-induced rice cell death has apoptotic features and is an active process that requires both de novo protein synthesis and phosphorylation, respectively. Finally, mitochondria membrane potential loss previously associated with ceramide-induced cell death in Arabidopsis was also found in rice, but it occurred with different timing.OsCERK is a bona fide ceramide kinase with a functionally and evolutionarily conserved Cys-rich motif that plays an important role in modulating cell fate in plants. The vital function of the conserved motif in both human and rice CERKs suggests that the biochemical mechanism of CERKs is similar in animals and plants. Furthermore, ceramides induce cell death with similar features in monocot and dicot plants.

  7. Detecting remote sequence homology in disordered proteins: discovery of conserved motifs in the N-termini of Mononegavirales phosphoproteins.

    Directory of Open Access Journals (Sweden)

    David Karlin

    Full Text Available Paramyxovirinae are a large group of viruses that includes measles virus and parainfluenza viruses. The viral Phosphoprotein (P plays a central role in viral replication. It is composed of a highly variable, disordered N-terminus and a conserved C-terminus. A second viral protein alternatively expressed, the V protein, also contains the N-terminus of P, fused to a zinc finger. We suspected that, despite their high variability, the N-termini of P/V might all be homologous; however, using standard approaches, we could previously identify sequence conservation only in some Paramyxovirinae. We now compared the N-termini using sensitive sequence similarity search programs, able to detect residual similarities unnoticeable by conventional approaches. We discovered that all Paramyxovirinae share a short sequence motif in their first 40 amino acids, which we called soyuz1. Despite its short length (11-16aa, several arguments allow us to conclude that soyuz1 probably evolved by homologous descent, unlike linear motifs. Conservation across such evolutionary distances suggests that soyuz1 plays a crucial role and experimental data suggest that it binds the viral nucleoprotein to prevent its illegitimate self-assembly. In some Paramyxovirinae, the N-terminus of P/V contains a second motif, soyuz2, which might play a role in blocking interferon signaling. Finally, we discovered that the P of related Mononegavirales contain similarly overlooked motifs in their N-termini, and that their C-termini share a previously unnoticed structural similarity suggesting a common origin. Our results suggest several testable hypotheses regarding the replication of Mononegavirales and suggest that disordered regions with little overall sequence similarity, common in viral and eukaryotic proteins, might contain currently overlooked motifs (intermediate in length between linear motifs and disordered domains that could be detected simply by comparing orthologous proteins.

  8. Analysis of a conserved RGE/RGD motif in HCV E2 in mediating entry

    Directory of Open Access Journals (Sweden)

    Rong Lijun

    2009-01-01

    Full Text Available Abstract Background Hepatitis C virus (HCV encodes two transmembrane glycoproteins E1 and E2 which form a heterodimer. E1 is believed to mediate fusion while E2 has been shown to bind cellular receptors. It is clear that HCV uses a multi-receptor complex to gain entry into susceptible cells, however key elements of this complex remain elusive. In this study, the role of a highly conserved RGE/RGD motif of HCV E2 glycoprotein in viral entry was examined. The effect of each substitution mutation in this motif was tested by challenging susceptible cell lines with mutant HCV E1E2 pseudotyped viruses generated using a lentiviral system (HCVpp. In addition to assaying infectivity, producer cell expression and HCVpp incorporation of HCV E2 proteins, CD81 binding profiles, and conformation of mutants were examined. Results Based on these characteristics, mutants either displayed wt characteristics (high infectivity [≥ 90% of wt HCVpp], CD81 binding, E1E2 expression, and incorporation into viral particles and proper conformation or very low infectivity (≤ 20% of wt HCVpp. Only amino acid substitutions of the 3rd position (D or E resulted in wt characteristics as long as the negative charge was maintained or a neutral alanine was introduced. A change in charge to a positive lysine, disrupted HCVpp infectivity at this position. Conclusion Although most amino acid substitutions within this conserved motif displayed greatly reduced HCVpp infectivity, they retained soluble CD81 binding, proper E2 conformation, and incorporation into HCVpp. Our results suggest that although RGE/D is a well-defined integrin binding motif, in this case the role of these three hyperconserved amino acids does not appear to be integrin binding. As the extent of conservation of this region extends well beyond these three amino acids, we speculate that this region may play an important role in the structure of HCV E2 or in mediating the interaction with other factor(s during

  9. Conservation and Sex-Specific Splicing of the transformer Gene in the Calliphorids Cochliomyia hominivorax, Cochliomyia macellaria and Lucilia sericata

    Science.gov (United States)

    Li, Fang; Vensko, Steven P.; Belikoff, Esther J.; Scott, Maxwell J.

    2013-01-01

    Transformer (TRA) promotes female development in several dipteran species including the Australian sheep blowfly Lucilia cuprina, the Mediterranean fruit fly, housefly and Drosophila melanogaster. tra transcripts are sex-specifically spliced such that only the female form encodes full length functional protein. The presence of six predicted TRA/TRA2 binding sites in the sex-specific female intron of the L. cuprina gene suggested that tra splicing is auto-regulated as in medfly and housefly. With the aim of identifying conserved motifs that may play a role in tra sex-specific splicing, here we have isolated and characterized the tra gene from three additional blowfly species, L. sericata, Cochliomyia hominivorax and C. macellaria. The blowfly adult male and female transcripts differ in the choice of splice donor site in the first intron, with males using a site downstream of the site used in females. The tra genes all contain a single TRA/TRA2 site in the male exon and a cluster of four to five sites in the male intron. However, overall the sex-specific intron sequences are poorly conserved in closely related blowflies. The most conserved regions are around the exon/intron junctions, the 3′ end of the intron and near the cluster of TRA/TRA2 sites. We propose a model for sex specific regulation of tra splicing that incorporates the conserved features identified in this study. In L. sericata embryos, the male tra transcript was first detected at around the time of cellular blastoderm formation. RNAi experiments showed that tra is required for female development in L. sericata and C. macellaria. The isolation of the tra gene from the New World screwworm fly C. hominivorax, a major livestock pest, will facilitate the development of a “male-only” strain for genetic control programs. PMID:23409170

  10. Origin of introns by 'intronization' of exonic sequences

    DEFF Research Database (Denmark)

    Irimia, Manuel; Rukov, Jakob Lewin; Penny, David

    2008-01-01

    The mechanisms of spliceosomal intron creation have proved elusive. Here we describe a new mechanism: the recruitment of internal exonic sequences ('intronization') in Caenorhabditis species. The numbers of intronization events and introns gained by other mechanisms are similar, suggesting that i...

  11. Comparative Analysis of Vertebrate Dystrophin Loci Indicate Intron Gigantism as a Common Feature

    Science.gov (United States)

    Pozzoli, Uberto; Elgar, Greg; Cagliani, Rachele; Riva, Laura; Comi, Giacomo P.; Bresolin, Nereo; Bardoni, Alessandra; Sironi, Manuela

    2003-01-01

    The human DMD gene is the largest known to date, spanning > 2000 kb on the X chromosome. The gene size is mainly accounted for by huge intronic regions. We sequenced 190 kb of Fugu rubripes (pufferfish) genomic DNA corresponding to the complete dystrophin gene (FrDMD) and provide the first report of gene structure and sequence comparison among dystrophin genomic sequences from different vertebrate organisms. Almost all intron positions and phases are conserved between FrDMD and its mammalian counterparts, and the predicted protein product of the Fugu gene displays 55% identity and 71% similarity to human dystrophin. In analogy to the human gene, FrDMD presents several-fold longer than average intronic regions. Analysis of intron sequences of the human and murine genes revealed that they are extremely conserved in size and that a similar fraction of total intron length is represented by repetitive elements; moreover, our data indicate that intron expansion through repeat accumulation in the two orthologs is the result of independent insertional events. The hypothesis that intron length might be functionally relevant to the DMD gene regulation is proposed and substantiated by the finding that dystrophin intron gigantism is common to the three vertebrate genes. [Supplemental material is available online at www.genome.org.] PMID:12727896

  12. Evolution of the tRNALeu (UAA) Intron and Congruence of Genetic Markers in Lichen-Symbiotic Nostoc.

    Science.gov (United States)

    Kaasalainen, Ulla; Olsson, Sanna; Rikkinen, Jouko

    2015-01-01

    The group I intron interrupting the tRNALeu UAA gene (trnL) is present in most cyanobacterial genomes as well as in the plastids of many eukaryotic algae and all green plants. In lichen symbiotic Nostoc, the P6b stem-loop of trnL intron always involves one of two different repeat motifs, either Class I or Class II, both with unresolved evolutionary histories. Here we attempt to resolve the complex evolution of the two different trnL P6b region types. Our analysis indicates that the Class II repeat motif most likely appeared first and that independent and unidirectional shifts to the Class I motif have since taken place repeatedly. In addition, we compare our results with those obtained with other genetic markers and find strong evidence of recombination in the 16S rRNA gene, a marker widely used in phylogenetic studies on Bacteria. The congruence of the different genetic markers is successfully evaluated with the recently published software Saguaro, which has not previously been utilized in comparable studies.

  13. Evolution of the tRNALeu (UAA Intron and Congruence of Genetic Markers in Lichen-Symbiotic Nostoc.

    Directory of Open Access Journals (Sweden)

    Ulla Kaasalainen

    Full Text Available The group I intron interrupting the tRNALeu UAA gene (trnL is present in most cyanobacterial genomes as well as in the plastids of many eukaryotic algae and all green plants. In lichen symbiotic Nostoc, the P6b stem-loop of trnL intron always involves one of two different repeat motifs, either Class I or Class II, both with unresolved evolutionary histories. Here we attempt to resolve the complex evolution of the two different trnL P6b region types. Our analysis indicates that the Class II repeat motif most likely appeared first and that independent and unidirectional shifts to the Class I motif have since taken place repeatedly. In addition, we compare our results with those obtained with other genetic markers and find strong evidence of recombination in the 16S rRNA gene, a marker widely used in phylogenetic studies on Bacteria. The congruence of the different genetic markers is successfully evaluated with the recently published software Saguaro, which has not previously been utilized in comparable studies.

  14. Analysis of ribosomal protein gene structures: implications for intron evolution.

    Directory of Open Access Journals (Sweden)

    2006-03-01

    Full Text Available Many spliceosomal introns exist in the eukaryotic nuclear genome. Despite much research, the evolution of spliceosomal introns remains poorly understood. In this paper, we tried to gain insights into intron evolution from a novel perspective by comparing the gene structures of cytoplasmic ribosomal proteins (CRPs and mitochondrial ribosomal proteins (MRPs, which are held to be of archaeal and bacterial origin, respectively. We analyzed 25 homologous pairs of CRP and MRP genes that together had a total of 527 intron positions. We found that all 12 of the intron positions shared by CRP and MRP genes resulted from parallel intron gains and none could be considered to be "conserved," i.e., descendants of the same ancestor. This was supported further by the high frequency of proto-splice sites at these shared positions; proto-splice sites are proposed to be sites for intron insertion. Although we could not definitively disprove that spliceosomal introns were already present in the last universal common ancestor, our results lend more support to the idea that introns were gained late. At least, our results show that MRP genes were intronless at the time of endosymbiosis. The parallel intron gains between CRP and MRP genes accounted for 2.3% of total intron positions, which should provide a reliable estimate for future inferences of intron evolution.

  15. Sequence features responsible for intron retention in human

    Directory of Open Access Journals (Sweden)

    Sakabe Noboru

    2007-02-01

    Full Text Available Abstract Background One of the least common types of alternative splicing is the complete retention of an intron in a mature transcript. Intron retention (IR is believed to be the result of intron, rather than exon, definition associated with failure of the recognition of weak splice sites flanking short introns. Although studies on individual retained introns have been published, few systematic surveys of large amounts of data have been conducted on the mechanisms that lead to IR. Results TTo understand how sequence features are associated with or control IR, and to produce a generalized model that could reveal previously unknown signals that regulate this type of alternative splicing, we partitioned intron retention events observed in human cDNAs into two groups based on the relative abundance of both isoforms and compared relevant features. We found that a higher frequency of IR in human is associated with individual introns that have weaker splice sites, genes with shorter intron lengths, higher expression levels and lower density of both a set of exon splicing silencers (ESSs and the intronic splicing enhancer GGG. Both groups of retained introns presented events conserved in mouse, in which the retained introns were also short and presented weaker splice sites. Conclusion Although our results confirmed that weaker splice sites are associated with IR, they showed that this feature alone cannot explain a non-negligible fraction of events. Our analysis suggests that cis-regulatory elements are likely to play a crucial role in regulating IR and also reveals previously unknown features that seem to influence its occurrence. These results highlight the importance of considering the interplay among these features in the regulation of the relative frequency of IR.

  16. Automated classification of RNA 3D motifs and the RNA 3D Motif Atlas

    Science.gov (United States)

    Petrov, Anton I.; Zirbel, Craig L.; Leontis, Neocles B.

    2013-01-01

    The analysis of atomic-resolution RNA three-dimensional (3D) structures reveals that many internal and hairpin loops are modular, recurrent, and structured by conserved non-Watson–Crick base pairs. Structurally similar loops define RNA 3D motifs that are conserved in homologous RNA molecules, but can also occur at nonhomologous sites in diverse RNAs, and which often vary in sequence. To further our understanding of RNA motif structure and sequence variability and to provide a useful resource for structure modeling and prediction, we present a new method for automated classification of internal and hairpin loop RNA 3D motifs and a new online database called the RNA 3D Motif Atlas. To classify the motif instances, a representative set of internal and hairpin loops is automatically extracted from a nonredundant list of RNA-containing PDB files. Their structures are compared geometrically, all-against-all, using the FR3D program suite. The loops are clustered into motif groups, taking into account geometric similarity and structural annotations and making allowance for a variable number of bulged bases. The automated procedure that we have implemented identifies all hairpin and internal loop motifs previously described in the literature. All motif instances and motif groups are assigned unique and stable identifiers and are made available in the RNA 3D Motif Atlas (http://rna.bgsu.edu/motifs), which is automatically updated every four weeks. The RNA 3D Motif Atlas provides an interactive user interface for exploring motif diversity and tools for programmatic data access. PMID:23970545

  17. Three distinct modes of intron dynamics in the evolution of eukaryotes.

    Science.gov (United States)

    Carmel, Liran; Wolf, Yuri I; Rogozin, Igor B; Koonin, Eugene V

    2007-07-01

    Several contrasting scenarios have been proposed for the origin and evolution of spliceosomal introns, a hallmark of eukaryotic genes. A comprehensive probabilistic model to obtain a definitive reconstruction of intron evolution was developed and applied to 391 sets of conserved genes from 19 eukaryotic species. It is inferred that a relatively high intron density was reached early, i.e., the last common ancestor of eukaryotes contained >2.15 introns/kilobase, and the last common ancestor of multicellular life forms harbored approximately 3.4 introns/kilobase, a greater intron density than in most of the extant fungi and in some animals. The rates of intron gain and intron loss appear to have been dropping during the last approximately 1.3 billion years, with the decline in the gain rate being much steeper. Eukaryotic lineages exhibit three distinct modes of evolution of the intron-exon structure. The primary, balanced mode, apparently, operates in all lineages. In this mode, intron gain and loss are strongly and positively correlated, in contrast to previous reports on inverse correlation between these processes. The second mode involves an elevated rate of intron loss and is prevalent in several lineages, such as fungi and insects. The third mode, characterized by elevated rate of intron gain, is seen only in deep branches of the tree, indicating that bursts of intron invasion occurred at key points in eukaryotic evolution, such as the origin of animals. Intron dynamics could depend on multiple mechanisms, and in the balanced mode, gain and loss of introns might share common mechanistic features.

  18. Changes in exon–intron structure during vertebrate evolution affect the splicing pattern of exons

    Science.gov (United States)

    Gelfman, Sahar; Burstein, David; Penn, Osnat; Savchenko, Anna; Amit, Maayan; Schwartz, Schraga; Pupko, Tal; Ast, Gil

    2012-01-01

    Exon–intron architecture is one of the major features directing the splicing machinery to the short exons that are located within long flanking introns. However, the evolutionary dynamics of exon–intron architecture and its impact on splicing is largely unknown. Using a comparative genomic approach, we analyzed 17 vertebrate genomes and reconstructed the ancestral motifs of both 3′ and 5′ splice sites, as also the ancestral length of exons and introns. Our analyses suggest that vertebrate introns increased in length from the shortest ancestral introns to the longest primate introns. An evolutionary analysis of splice sites revealed that weak splice sites act as a restrictive force keeping introns short. In contrast, strong splice sites allow recognition of exons flanked by long introns. Reconstruction of the ancestral state suggests these phenomena were not prevalent in the vertebrate ancestor, but appeared during vertebrate evolution. By calculating evolutionary rate shifts in exons, we identified cis-acting regulatory sequences that became fixed during the transition from early vertebrates to mammals. Experimental validations performed on a selection of these hexamers confirmed their regulatory function. We additionally revealed many features of exons that can discriminate alternative from constitutive exons. These features were integrated into a machine-learning approach to predict whether an exon is alternative. Our algorithm obtains very high predictive power (AUC of 0.91), and using these predictions we have identified and successfully validated novel alternatively spliced exons. Overall, we provide novel insights regarding the evolutionary constraints acting upon exons and their recognition by the splicing machinery. PMID:21974994

  19. Dipeptide frequency/bias analysis identifies conserved sites of nonrandomness shared by cysteine-rich motifs.

    Science.gov (United States)

    Campion, S R; Ameen, A S; Lai, L; King, J M; Munzenmaier, T N

    2001-08-15

    This report describes the application of a simple computational tool, AAPAIR.TAB, for the systematic analysis of the cysteine-rich EGF, Sushi, and Laminin motif/sequence families at the two-amino acid level. Automated dipeptide frequency/bias analysis detects preferences in the distribution of amino acids in established protein families, by determining which "ordered dipeptides" occur most frequently in comprehensive motif-specific sequence data sets. Graphic display of the dipeptide frequency/bias data revealed family-specific preferences for certain dipeptides, but more importantly detected a shared preference for employment of the ordered dipeptides Gly-Tyr (GY) and Gly-Phe (GF) in all three protein families. The dipeptide Asn-Gly (NG) also exhibited high-frequency and bias in the EGF and Sushi motif families, whereas Asn-Thr (NT) was distinguished in the Laminin family. Evaluation of the distribution of dipeptides identified by frequency/bias analysis subsequently revealed the highly restricted localization of the G(F/Y) and N(G/T) sequence elements at two separate sites of extreme conservation in the consensus sequence of all three sequence families. The similar employment of the high-frequency/bias dipeptides in three distinct protein sequence families was further correlated with the concurrence of these shared molecular determinants at similar positions within the distinctive scaffolds of three structurally divergent, but similarly employed, motif modules.

  20. Conserved amino acid motifs from the novel Piv/MooV family of transposases and site-specific recombinases are required for catalysis of DNA inversion by Piv.

    Science.gov (United States)

    Tobiason, D M; Buchner, J M; Thiel, W H; Gernert, K M; Karls, A C

    2001-02-01

    Piv, a site-specific invertase from Moraxella lacunata, exhibits amino acid homology with the transposases of the IS110/IS492 family of insertion elements. The functions of conserved amino acid motifs that define this novel family of both transposases and site-specific recombinases (Piv/MooV family) were examined by mutagenesis of fully conserved amino acids within each motif in Piv. All Piv mutants altered in conserved residues were defective for in vivo inversion of the M. lacunata invertible DNA segment, but competent for in vivo binding to Piv DNA recognition sequences. Although the primary amino acid sequences of the Piv/MooV recombinases do not contain a conserved DDE motif, which defines the retroviral integrase/transposase (IN/Tnps) family, the predicted secondary structural elements of Piv align well with those of the IN/Tnps for which crystal structures have been determined. Molecular modelling of Piv based on these alignments predicts that E59, conserved as either E or D in the Piv/MooV family, forms a catalytic pocket with the conserved D9 and D101 residues. Analysis of Piv E59G confirms a role for E59 in catalysis of inversion. These results suggest that Piv and the related IS110/IS492 transposases mediate DNA recombination by a common mechanism involving a catalytic DED or DDD motif.

  1. Determinism and randomness in the evolution of introns and sine inserts in mouse and human mitochondrial solute carrier and cytokine receptor genes.

    Science.gov (United States)

    Cianciulli, Antonia; Calvello, Rosa; Panaro, Maria A

    2015-04-01

    In the homologous genes studied, the exons and introns alternated in the same order in mouse and human. We studied, in both species: corresponding short segments of introns, whole corresponding introns and complete homologous genes. We considered the total number of nucleotides and the number and orientation of the SINE inserts. Comparisons of mouse and human data series showed that at the level of individual relatively short segments of intronic sequences the stochastic variability prevails in the local structuring, but at higher levels of organization a deterministic component emerges, conserved in mouse and human during the divergent evolution, despite the ample re-editing of the intronic sequences and the fact that processes such as SINE spread had taken place in an independent way in the two species. Intron conservation is negatively correlated with the SINE occupancy, suggesting that virus inserts interfere with the conservation of the sequences inherited from the common ancestor. Copyright © 2015 Elsevier Ltd. All rights reserved.

  2. Conserved XPB Core Structure and Motifs for DNA Unwinding:Implications for Pathway Selection of Transcription or ExcisionRepair

    Energy Technology Data Exchange (ETDEWEB)

    Fan, Li; Arval, Andrew S.; Cooper, Priscilla K.; Iwai, Shigenori; Hanaoka, Fumio; Tainer, John A.

    2005-04-01

    The human xeroderma pigmentosum group B (XPB) helicase is essential for transcription, nucleotide excision repair, and TFIIH functional assembly. Here, we determined crystal structures of an Archaeoglobus fulgidus XPB homolog (AfXPB) that characterize two RecA-like XPB helicase domains and discover a DNA damage recognition domain (DRD), a unique RED motif, a flexible thumb motif (ThM), and implied conformational changes within a conserved functional core. RED motif mutations dramatically reduce helicase activity, and the DRD and ThM, which flank the RED motif, appear structurally as well as functionally analogous to the MutS mismatch recognition and DNA polymerase thumb domains. Substrate specificity is altered by DNA damage, such that AfXPB unwinds dsDNA with 3' extensions, but not blunt-ended dsDNA, unless it contains a lesion, as shown for CPD or (6-4) photoproducts. Together, these results provide an unexpected mechanism of DNA unwinding with Implications for XPB damage verification in nucleotide excision repair.

  3. Genetic structure and evolution of the Vps25 family, a yeast ESCRT-II component

    Directory of Open Access Journals (Sweden)

    Slater Ruth

    2006-08-01

    Full Text Available Abstract Background Vps25p is the product of yeast gene VPS25 and is found in an endosomal sorting complex required for transport (ESCRT-II, along with Vps22p and Vps36p. This complex is essential for sorting of ubiquitinated biosynthetic and endosomal cargoes into endosomes. Results We found that VPS25 is a highly conserved and widely expressed eukaryotic gene, with single orthologs in chromalveolate, excavate, amoebozoan, plant, fungal and metazoan species. Two paralogs were found in Trichomonas vaginalis. An ortholog was strikingly absent from the Encephalitozoon cuniculi genome. Intron positions were analyzed in VPS25 from 36 species. We found evidence for five ancestral VPS25 introns, intron loss, and single instances of intron gain (a Paramecium species and intron slippage (Theileria species. Processed pseudogenes were identified in four mammalian genomes, with a notable absence in the mouse genome. Two retropseudogenes were found in the chimpanzee genome, one more recently inserted, and one evolving from a common primate ancestor. The amino acid sequences of 119 Vps25 orthologs are aligned, compared with the known secondary structure of yeast Vps25p, and used to carry out phylogenetic analysis. Residues in two amino-terminal PPXY motifs (motif I and II, involved in dimerization of Vps25p and interaction with Vps22p and Vps36p, were closely, but not absolutely conserved. Specifically, motif I was absent in Vps25 homologs of chromalveolates, euglenozoa, and diplomonads. A highly conserved carboxy-terminal lysine was identified, which suggests Vps25 is ubiquitinated. Arginine-83 of yeast Vps25p involved in Vps22p interaction was highly, but not absolutely, conserved. Human tissue expression analysis showed universal expression. Conclusion We have identified 119 orthologs of yeast Vps25p. Expression of mammalian VPS25 in a wide range of tissues, and the presence in a broad range of eukaryotic species, indicates a basic role in eukaryotic cell

  4. The strength of intron donor splice sites in human genes displays a bell-shaped pattern

    DEFF Research Database (Denmark)

    Wang, Kai; Wernersson, Rasmus; Brunak, Søren

    2011-01-01

    introns. Interestingly, when analysing the intron containing gene pool from mouse consisting of >15 000 genes, we found the convex pattern to be conserved despite >75 million years of evolutionary divergence between the two organisms. We also analysed an interesting, novel class of chimeric genes which...

  5. AML1/ETO trans-activates c-KIT expression through the long range interaction between promoter and intronic enhancer.

    Science.gov (United States)

    Tian, Ying; Wang, Genjie; Hu, Qingzhu; Xiao, Xichun; Chen, Shuxia

    2018-04-01

    The AML1/ETO onco-fusion protein is crucial for the genesis of t(8;21) acute myeloid leukemia (AML) and is well documented as a transcriptional repressor through dominant-negative effect. However, little is known about the transactivation mechanism of AML1/ETO. Through large cohort of patient's expression level data analysis and a series of experimental validation, we report here that AML1/ETO transactivates c-KIT expression through directly binding to and mediating the long-range interaction between the promoter and intronic enhancer regions of c-KIT. Gene expression analyses verify that c-KIT expression is significantly high in t(8;21) AML. Further ChIP-seq analysis and motif scanning identify two regulatory regions located in the promoter and intronic enhancer region of c-KIT, respectively. Both regions are enriched by co-factors of AML1/ETO, such as AML1, CEBPe, c-Jun, and c-Fos. Further luciferase reporter assays show that AML1/ETO trans-activates c-KIT promoter activity through directly recognizing the AML1 motif and the co-existence of co-factors. The induction of c-KIT promoter activity is reinforced with the existence of intronic enhancer region. Furthermore, ChIP-3C-qPCR assays verify that AML1/ETO mediates the formation of DNA-looping between the c-KIT promoter and intronic enhancer region through the long-range interaction. Collectively, our data uncover a novel transcriptional activity mechanism of AML1/ETO and enrich our knowledge of the onco-fusion protein mediated transcription regulation. © 2017 Wiley Periodicals, Inc.

  6. BlockLogo: Visualization of peptide and sequence motif conservation

    DEFF Research Database (Denmark)

    Olsen, Lars Rønn; Kudahl, Ulrich Johan; Simon, Christian

    2013-01-01

    BlockLogo is a web-server application for the visualization of protein and nucleotide fragments, continuous protein sequence motifs, and discontinuous sequence motifs using calculation of block entropy from multiple sequence alignments. The user input consists of a multiple sequence alignment, se...

  7. An effective approach for annotation of protein families with low sequence similarity and conserved motifs: identifying GDSL hydrolases across the plant kingdom.

    Science.gov (United States)

    Vujaklija, Ivan; Bielen, Ana; Paradžik, Tina; Biđin, Siniša; Goldstein, Pavle; Vujaklija, Dušica

    2016-02-18

    The massive accumulation of protein sequences arising from the rapid development of high-throughput sequencing, coupled with automatic annotation, results in high levels of incorrect annotations. In this study, we describe an approach to decrease annotation errors of protein families characterized by low overall sequence similarity. The GDSL lipolytic family comprises proteins with multifunctional properties and high potential for pharmaceutical and industrial applications. The number of proteins assigned to this family has increased rapidly over the last few years. In particular, the natural abundance of GDSL enzymes reported recently in plants indicates that they could be a good source of novel GDSL enzymes. We noticed that a significant proportion of annotated sequences lack specific GDSL motif(s) or catalytic residue(s). Here, we applied motif-based sequence analyses to identify enzymes possessing conserved GDSL motifs in selected proteomes across the plant kingdom. Motif-based HMM scanning (Viterbi decoding-VD and posterior decoding-PD) and the here described PD/VD protocol were successfully applied on 12 selected plant proteomes to identify sequences with GDSL motifs. A significant number of identified GDSL sequences were novel. Moreover, our scanning approach successfully detected protein sequences lacking at least one of the essential motifs (171/820) annotated by Pfam profile search (PfamA) as GDSL. Based on these analyses we provide a curated list of GDSL enzymes from the selected plants. CLANS clustering and phylogenetic analysis helped us to gain a better insight into the evolutionary relationship of all identified GDSL sequences. Three novel GDSL subfamilies as well as unreported variations in GDSL motifs were discovered in this study. In addition, analyses of selected proteomes showed a remarkable expansion of GDSL enzymes in the lycophyte, Selaginella moellendorffii. Finally, we provide a general motif-HMM scanner which is easily accessible through

  8. Comparisons of Copy Number, Genomic Structure, and Conserved Motifs for α-Amylase Genes from Barley, Rice, and Wheat

    Directory of Open Access Journals (Sweden)

    Qisen Zhang

    2017-10-01

    Full Text Available Barley is an important crop for the production of malt and beer. However, crops such as rice and wheat are rarely used for malting. α-amylase is the key enzyme that degrades starch during malting. In this study, we compared the genomic properties, gene copies, and conserved promoter motifs of α-amylase genes in barley, rice, and wheat. In all three crops, α-amylase consists of four subfamilies designated amy1, amy2, amy3, and amy4. In wheat and barley, members of amy1 and amy2 genes are localized on chromosomes 6 and 7, respectively. In rice, members of amy1 genes are found on chromosomes 1 and 2, and amy2 genes on chromosome 6. The barley genome has six amy1 members and three amy2 members. The wheat B genome contains four amy1 members and three amy2 members, while the rice genome has three amy1 members and one amy2 member. The B genome has mostly amy1 and amy2 members among the three wheat genomes. Amy1 promoters from all three crop genomes contain a GA-responsive complex consisting of a GA-responsive element (CAATAAA, pyrimidine box (CCTTTT and TATCCAT/C box. This study has shown that amy1 and amy2 from both wheat and barley have similar genomic properties, including exon/intron structures and GA-responsive elements on promoters, but these differ in rice. Like barley, wheat should have sufficient amy activity to degrade starch completely during malting. Other factors, such as high protein with haze issues and the lack of husk causing Lauting difficulty, may limit the use of wheat for brewing.

  9. Identification of multiple distinct Snf2 subfamilies with conserved structural motifs.

    Science.gov (United States)

    Flaus, Andrew; Martin, David M A; Barton, Geoffrey J; Owen-Hughes, Tom

    2006-01-01

    The Snf2 family of helicase-related proteins includes the catalytic subunits of ATP-dependent chromatin remodelling complexes found in all eukaryotes. These act to regulate the structure and dynamic properties of chromatin and so influence a broad range of nuclear processes. We have exploited progress in genome sequencing to assemble a comprehensive catalogue of over 1300 Snf2 family members. Multiple sequence alignment of the helicase-related regions enables 24 distinct subfamilies to be identified, a considerable expansion over earlier surveys. Where information is known, there is a good correlation between biological or biochemical function and these assignments, suggesting Snf2 family motor domains are tuned for specific tasks. Scanning of complete genomes reveals all eukaryotes contain members of multiple subfamilies, whereas they are less common and not ubiquitous in eubacteria or archaea. The large sample of Snf2 proteins enables additional distinguishing conserved sequence blocks within the helicase-like motor to be identified. The establishment of a phylogeny for Snf2 proteins provides an opportunity to make informed assignments of function, and the identification of conserved motifs provides a framework for understanding the mechanisms by which these proteins function.

  10. A Conserved EAR Motif Is Required for Avirulence and Stability of the Ralstonia solanacearum Effector PopP2 In Planta

    Directory of Open Access Journals (Sweden)

    Cécile Segonzac

    2017-08-01

    Full Text Available Ralstonia solanacearum is the causal agent of the devastating bacterial wilt disease in many high value Solanaceae crops. R. solanacearum secretes around 70 effectors into host cells in order to promote infection. Plants have, however, evolved specialized immune receptors that recognize corresponding effectors and confer qualitative disease resistance. In the model species Arabidopsis thaliana, the paired immune receptors RRS1 (resistance to Ralstonia solanacearum 1 and RPS4 (resistance to Pseudomonas syringae 4 cooperatively recognize the R. solanacearum effector PopP2 in the nuclei of infected cells. PopP2 is an acetyltransferase that binds to and acetylates the RRS1 WRKY DNA-binding domain resulting in reduced RRS1-DNA association thereby activating plant immunity. Here, we surveyed the naturally occurring variation in PopP2 sequence among the R. solanacearum strains isolated from diseased tomato and pepper fields across the Republic of Korea. Our analysis revealed high conservation of popP2 sequence with only three polymorphic alleles present amongst 17 strains. Only one variation (a premature stop codon caused the loss of RPS4/RRS1-dependent recognition in Arabidopsis. We also found that PopP2 harbors a putative eukaryotic transcriptional repressor motif (ethylene-responsive element binding factor-associated amphiphilic repression or EAR, which is known to be involved in the recruitment of transcriptional co-repressors. Remarkably, mutation of the EAR motif disabled PopP2 avirulence function as measured by the development of hypersensitive response, electrolyte leakage, defense marker gene expression and bacterial growth in Arabidopsis. This lack of recognition was partially but significantly reverted by the C-terminal addition of a synthetic EAR motif. We show that the EAR motif-dependent gain of avirulence correlated with the stability of the PopP2 protein. Furthermore, we demonstrated the requirement of the PopP2 EAR motif for PTI

  11. Interaction of MYC with host cell factor-1 is mediated by the evolutionarily conserved Myc box IV motif.

    Science.gov (United States)

    Thomas, L R; Foshage, A M; Weissmiller, A M; Popay, T M; Grieb, B C; Qualls, S J; Ng, V; Carboneau, B; Lorey, S; Eischen, C M; Tansey, W P

    2016-07-07

    The MYC family of oncogenes encodes a set of three related transcription factors that are overexpressed in many human tumors and contribute to the cancer-related deaths of more than 70,000 Americans every year. MYC proteins drive tumorigenesis by interacting with co-factors that enable them to regulate the expression of thousands of genes linked to cell growth, proliferation, metabolism and genome stability. One effective way to identify critical co-factors required for MYC function has been to focus on sequence motifs within MYC that are conserved throughout evolution, on the assumption that their conservation is driven by protein-protein interactions that are vital for MYC activity. In addition to their DNA-binding domains, MYC proteins carry five regions of high sequence conservation known as Myc boxes (Mb). To date, four of the Mb motifs (MbI, MbII, MbIIIa and MbIIIb) have had a molecular function assigned to them, but the precise role of the remaining Mb, MbIV, and the reason for its preservation in vertebrate Myc proteins, is unknown. Here, we show that MbIV is required for the association of MYC with the abundant transcriptional coregulator host cell factor-1 (HCF-1). We show that the invariant core of MbIV resembles the tetrapeptide HCF-binding motif (HBM) found in many HCF-interaction partners, and demonstrate that MYC interacts with HCF-1 in a manner indistinguishable from the prototypical HBM-containing protein VP16. Finally, we show that rationalized point mutations in MYC that disrupt interaction with HCF-1 attenuate the ability of MYC to drive tumorigenesis in mice. Together, these data expose a molecular function for MbIV and indicate that HCF-1 is an important co-factor for MYC.

  12. Characterization of the molecular basis of group II intron RNA recognition by CRS1-CRM domains.

    Science.gov (United States)

    Keren, Ido; Klipcan, Liron; Bezawork-Geleta, Ayenachew; Kolton, Max; Shaya, Felix; Ostersetzer-Biran, Oren

    2008-08-22

    CRM (chloroplast RNA splicing and ribosome maturation) is a recently recognized RNA-binding domain of ancient origin that has been retained in eukaryotic genomes only within the plant lineage. Whereas in bacteria CRM domains exist as single domain proteins involved in ribosome maturation, in plants they are found in a family of proteins that contain between one and four repeats. Several members of this family with multiple CRM domains have been shown to be required for the splicing of specific plastidic group II introns. Detailed biochemical analysis of one of these factors in maize, CRS1, demonstrated its high affinity and specific binding to the single group II intron whose splicing it facilitates, the plastid-encoded atpF intron RNA. Through its association with two intronic regions, CRS1 guides the folding of atpF intron RNA into its predicted "catalytically active" form. To understand how multiple CRM domains cooperate to achieve high affinity sequence-specific binding to RNA, we analyzed the RNA binding affinity and specificity associated with each individual CRM domain in CRS1; whereas CRM3 bound tightly to the RNA, CRM1 associated specifically with a unique region found within atpF intron domain I. CRM2, which demonstrated only low binding affinity, also seems to form specific interactions with regions localized to domains I, III, and IV. We further show that CRM domains share structural similarities and RNA binding characteristics with the well known RNA recognition motif domain.

  13. The distribution, diversity, and importance of 16S rRNA gene introns in the order Thermoproteales.

    Science.gov (United States)

    Jay, Zackary J; Inskeep, William P

    2015-07-09

    Intron sequences are common in 16S rRNA genes of specific thermophilic lineages of Archaea, specifically the Thermoproteales (phylum Crenarchaeota). Environmental sequencing (16S rRNA gene and metagenome) from geothermal habitats in Yellowstone National Park (YNP) has expanded the available datasets for investigating 16S rRNA gene introns. The objectives of this study were to characterize and curate archaeal 16S rRNA gene introns from high-temperature habitats, evaluate the conservation and distribution of archaeal 16S rRNA introns in geothermal systems, and determine which "universal" archaeal 16S rRNA gene primers are impacted by the presence of intron sequences. Several new introns were identified and their insertion loci were constrained to thirteen locations across the 16S rRNA gene. Many of these introns encode homing endonucleases, although some introns were short or partial sequences. Pyrobaculum, Thermoproteus, and Caldivirga 16S rRNA genes contained the most abundant and diverse intron sequences. Phylogenetic analysis of introns revealed that sequences within the same locus are distributed biogeographically. The most diverse set of introns were observed in a high-temperature, circumneutral (pH 6) sulfur sediment environment, which also contained the greatest diversity of different Thermoproteales phylotypes. The widespread presence of introns in the Thermoproteales indicates a high probability of misalignments using different "universal" 16S rRNA primers employed in environmental microbial community analysis.

  14. Parallel loss of plastid introns and their maturase in the genus Cuscuta.

    Science.gov (United States)

    McNeal, Joel R; Kuehl, Jennifer V; Boore, Jeffrey L; Leebens-Mack, Jim; dePamphilis, Claude W

    2009-06-19

    Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.

  15. Parallel loss of plastid introns and their maturase in the genus Cuscuta.

    Directory of Open Access Journals (Sweden)

    Joel R McNeal

    2009-06-01

    Full Text Available Plastid genome content and arrangement are highly conserved across most land plants and their closest relatives, streptophyte algae, with nearly all plastid introns having invaded the genome in their common ancestor at least 450 million years ago. One such intron, within the transfer RNA trnK-UUU, contains a large open reading frame that encodes a presumed intron maturase, matK. This gene is missing from the plastid genomes of two species in the parasitic plant genus Cuscuta but is found in all other published land plant and streptophyte algal plastid genomes, including that of the nonphotosynthetic angiosperm Epifagus virginiana and two other species of Cuscuta. By examining matK and plastid intron distribution in Cuscuta, we add support to the hypothesis that its normal role is in splicing seven of the eight group IIA introns in the genome. We also analyze matK nucleotide sequences from Cuscuta species and relatives that retain matK to test whether changes in selective pressure in the maturase are associated with intron deletion. Stepwise loss of most group IIA introns from the plastid genome results in substantial change in selective pressure within the hypothetical RNA-binding domain of matK in both Cuscuta and Epifagus, either through evolution from a generalist to a specialist intron splicer or due to loss of a particular intron responsible for most of the constraint on the binding region. The possibility of intron-specific specialization in the X-domain is implicated by evidence of positive selection on the lineage leading to C. nitida in association with the loss of six of seven introns putatively spliced by matK. Moreover, transfer RNA gene deletion facilitated by parasitism combined with an unusually high rate of intron loss from remaining functional plastid genes created a unique circumstance on the lineage leading to Cuscuta subgenus Grammica that allowed elimination of matK in the most species-rich lineage of Cuscuta.

  16. Estimating the Nucleotide Diversity in Ceratodon purpureus (Ditrichaceae from 218 Conserved Exon-Primed, Intron-Spanning Nuclear Loci

    Directory of Open Access Journals (Sweden)

    Stuart F. McDaniel

    2013-04-01

    Full Text Available Premise of the study: We developed and tested primers for 218 nuclear loci for studying population genetics, phylogeography, and genome evolution in bryophytes. Methods and Results: We aligned expressed sequence tags (ESTs from Ceratodon purpureus to the Physcomitrella patens genome sequence, and designed primers that are homologous to conserved exons but span introns in the P. patens genome. We tested these primers on four isolates from New York, USA; Otavalo, Ecuador; and two laboratory isolates from Austria (WT4 and GG1. The median genome-wide nucleotide diversity was 0.008 substitutions/site, but the range was large (0–0.14, illustrating the among-locus heterogeneity in the species. Conclusions: These loci provide a valuable resource for finely resolved, genome-wide population genetic and species-level phylogenetic analyses of C. purpureus and its relatives.

  17. Armadillo motifs involved in vesicular transport.

    Directory of Open Access Journals (Sweden)

    Harald Striegl

    Full Text Available Armadillo (ARM repeat proteins function in various cellular processes including vesicular transport and membrane tethering. They contain an imperfect repeating sequence motif that forms a conserved three-dimensional structure. Recently, structural and functional insight into tethering mediated by the ARM-repeat protein p115 has been provided. Here we describe the p115 ARM-motifs for reasons of clarity and nomenclature and show that both sequence and structure are highly conserved among ARM-repeat proteins. We argue that there is no need to invoke repeat types other than ARM repeats for a proper description of the structure of the p115 globular head region. Additionally, we propose to define a new subfamily of ARM-like proteins and show lack of evidence that the ARM motifs found in p115 are present in other long coiled-coil tethering factors of the golgin family.

  18. An evolutionarily conserved glycine-tyrosine motif forms a folding core in outer membrane proteins.

    Directory of Open Access Journals (Sweden)

    Marcin Michalik

    Full Text Available An intimate interaction between a pair of amino acids, a tyrosine and glycine on neighboring β-strands, has been previously reported to be important for the structural stability of autotransporters. Here, we show that the conservation of this interacting pair extends to nearly all major families of outer membrane β-barrel proteins, which are thought to have originated through duplication events involving an ancestral ββ hairpin. We analyzed the function of this motif using the prototypical outer membrane protein OmpX. Stopped-flow fluorescence shows that two folding processes occur in the millisecond time regime, the rates of which are reduced in the tyrosine mutant. Folding assays further demonstrate a reduction in the yield of folded protein for the mutant compared to the wild-type, as well as a reduction in thermal stability. Taken together, our data support the idea of an evolutionarily conserved 'folding core' that affects the folding, membrane insertion, and thermal stability of outer membrane protein β-barrels.

  19. Discriminative motif discovery via simulated evolution and random under-sampling.

    Directory of Open Access Journals (Sweden)

    Tao Song

    Full Text Available Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  20. Discriminative motif discovery via simulated evolution and random under-sampling.

    Science.gov (United States)

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the stage of Hidden Markov Models (HMMs) training, a random under-sampling method is introduced for the imbalance between the positive and negative datasets. It is shown that, in the task of discovering targeting motifs of nine subcellular compartments, the motifs found by our method are more conserved than the methods without considering data imbalance problem and recover the most known targeting motifs from Minimotif Miner and InterPro. Meanwhile, we use the found motifs to predict protein subcellular localization and achieve higher prediction precision and recall for the minority classes.

  1. Molecular studies of Callithrix pygmaea (Primates, Platyrrhini based on transferrin intronic and ND1 regions: implications for taxonomy and conservation

    Directory of Open Access Journals (Sweden)

    Tagliaro Claudia Helena

    2000-01-01

    Full Text Available Traditional classifications of Platyrrhini monkeys, based mainly on morphological features, are being contested by recent molecular data. The subfamily Callitrichinae (Platyrrhini, Primates consists of a diverse group of species, many of them considered endangered. Our analysis of two DNA regions, a mtDNA gene (ND1 and a nuclear gene (intronic regions of the transferrin gene, suggests that Callithrix pygmaea may have sufficient variability to justify the existence of subspecies or even separate species. Phylogenetic dendrograms based on the ND1 region show that this species is more closely related to Amazonian than to Atlantic forest marmosets. These results reopen the discussion about diversity and conservation programs based exclusively on traditional classifications.

  2. A conserved WW domain-like motif regulates invariant chain-dependent cell-surface transport of the NKG2D ligand ULBP2

    DEFF Research Database (Denmark)

    Uhlenbrock, Franziska Katharina; van Andel, Esther; Andresen, Lars

    2015-01-01

    that the NKG2D ligand ULBP2 traffics over an invariant chain (Ii)-dependent pathway to the cell surface. This study set out to elucidate how Ii regulates ULBP2 cell-surface transport: We discovered conserved tryptophan (Trp) residues in the primary protein sequence of ULBP1-6 but not in the related MICA....../B. Substitution of Trp to alanine resulted in cell-surface inhibition of ULBP2 in different cancer cell lines. Moreover, the mutated ULBP2 constructs were retained and not degraded inside the cell, indicating a crucial role of this conserved Trp-motif in trafficking. Finally, overexpression of Ii increased...... surface expression of wt ULBP2 while Trp-mutants could not be expressed, proposing that this Trp-motif is required for an Ii-dependent cell-surface transport of ULBP2. Aberrant soluble ULBP2 is immunosuppressive. Thus, targeting a distinct protein module on the ULBP2 sequence could counteract...

  3. Identification of GATA2 and AP-1 activator elements within the enhancer VNTR occurring in intron 5 of the human SIRT3 gene

    Science.gov (United States)

    Human SIRT3 gene contains an intronic VNTR enhancer. A T > C transition occurring in the second repeat of each VNTR allele implies the presence/absence of a putative GATA binding motif. A partially overlapping AP-1 site, not affected by the transition, was also identified. Aims of the present study ...

  4. The Biology of Intron Gain and Loss

    DEFF Research Database (Denmark)

    Jeffares, Daniel C; Mourier, Tobias; Penny, David

    2006-01-01

    Intron density in eukaryote genomes varies by more than three orders of magnitude, so there must have been extensive intron gain and/or intron loss during evolution. A favored and partial explanation for this range of intron densities has been that introns have accumulated stochastically in large...... on introns depending on the biology of the organism and the gene involved....

  5. A speedup technique for (l, d-motif finding algorithms

    Directory of Open Access Journals (Sweden)

    Dinh Hieu

    2011-03-01

    Full Text Available Abstract Background The discovery of patterns in DNA, RNA, and protein sequences has led to the solution of many vital biological problems. For instance, the identification of patterns in nucleic acid sequences has resulted in the determination of open reading frames, identification of promoter elements of genes, identification of intron/exon splicing sites, identification of SH RNAs, location of RNA degradation signals, identification of alternative splicing sites, etc. In protein sequences, patterns have proven to be extremely helpful in domain identification, location of protease cleavage sites, identification of signal peptides, protein interactions, determination of protein degradation elements, identification of protein trafficking elements, etc. Motifs are important patterns that are helpful in finding transcriptional regulatory elements, transcription factor binding sites, functional genomics, drug design, etc. As a result, numerous papers have been written to solve the motif search problem. Results Three versions of the motif search problem have been proposed in the literature: Simple Motif Search (SMS, (l, d-motif search (or Planted Motif Search (PMS, and Edit-distance-based Motif Search (EMS. In this paper we focus on PMS. Two kinds of algorithms can be found in the literature for solving the PMS problem: exact and approximate. An exact algorithm identifies the motifs always and an approximate algorithm may fail to identify some or all of the motifs. The exact version of PMS problem has been shown to be NP-hard. Exact algorithms proposed in the literature for PMS take time that is exponential in some of the underlying parameters. In this paper we propose a generic technique that can be used to speedup PMS algorithms. Conclusions We present a speedup technique that can be used on any PMS algorithm. We have tested our speedup technique on a number of algorithms. These experimental results show that our speedup technique is indeed very

  6. Evolutionarily conserved bias of amino-acid usage refines the definition of PDZ-binding motif

    Directory of Open Access Journals (Sweden)

    Launey Thomas

    2011-06-01

    Full Text Available Abstract Background The interactions between PDZ (PSD-95, Dlg, ZO-1 domains and PDZ-binding motifs play central roles in signal transductions within cells. Proteins with PDZ domains bind to PDZ-binding motifs almost exclusively when the motifs are located at the carboxyl (C- terminal ends of their binding partners. However, it remains little explored whether PDZ-binding motifs show any preferential location at the C-terminal ends of proteins, at genome-level. Results Here, we examined the distribution of the type-I (x-x-S/T-x-I/L/V or type-II (x-x-V-x-I/V PDZ-binding motifs in proteins encoded in the genomes of five different species (human, mouse, zebrafish, fruit fly and nematode. We first established that these PDZ-binding motifs are indeed preferentially present at their C-terminal ends. Moreover, we found specific amino acid (AA bias for the 'x' positions in the motifs at the C-terminal ends. In general, hydrophilic AAs were favored. Our genomics-based findings confirm and largely extend the results of previous interaction-based studies, allowing us to propose refined consensus sequences for all of the examined PDZ-binding motifs. An ontological analysis revealed that the refined motifs are functionally relevant since a large fraction of the proteins bearing the motif appear to be involved in signal transduction. Furthermore, co-precipitation experiments confirmed two new protein interactions predicted by our genomics-based approach. Finally, we show that influenza virus pathogenicity can be correlated with PDZ-binding motif, with high-virulence viral proteins bearing a refined PDZ-binding motif. Conclusions Our refined definition of PDZ-binding motifs should provide important clues for identifying functional PDZ-binding motifs and proteins involved in signal transduction.

  7. Functional characterisation of an intron retaining K+ transporter of barley reveals intron-mediated alternate splicing

    KAUST Repository

    Shahzad, K.

    2015-01-01

    Intron retention in transcripts and the presence of 5 and 3 splice sites within these introns mediate alternate splicing, which is widely observed in animals and plants. Here, functional characterisation of the K+ transporter, HvHKT2;1, with stably retained introns from barley (Hordeum vulgare) in yeast (Saccharomyces cerevisiae), and transcript profiling in yeast and transgenic tobacco (Nicotiana tabacum) is presented. Expression of intron-retaining HvHKT2;1 cDNA (HvHKT2;1-i) in trk1, trk2 yeast strain defective in K+ uptake restored growth in medium containing hygromycin in the presence of different concentrations of K+ and mediated hypersensitivity to Na+. HvHKT2;1-i produces multiple transcripts via alternate splicing of two regular introns and three exons in different compositions. HKT isoforms with retained introns and exon skipping variants were detected in relative expression analysis of (i) HvHKT2;1-i in barley under native conditions, (ii) in transgenic tobacco plants constitutively expressing HvHKT2;1-i, and (iii) in trk1, trk2 yeast expressing HvHKT2;1-i under control of an inducible promoter. Mixed proportions of three HKT transcripts: HvHKT2;1-e (first exon region), HvHKT2;1-i1 (first intron) and HvHKT2;1-i2 (second intron) were observed. The variation in transcript accumulation in response to changing K+ and Na+ concentrations was observed in both heterologous and plant systems. These findings suggest a link between intron-retaining transcripts and different splice variants to ion homeostasis, and their possible role in salt stress.

  8. Properties of non-coding DNA and identification of putative cis-regulatory elements in Theileria parva

    Directory of Open Access Journals (Sweden)

    Guo Xiang

    2008-12-01

    Full Text Available Abstract Background Parasites in the genus Theileria cause lymphoproliferative diseases in cattle, resulting in enormous socio-economic losses. The availability of the genome sequences and annotation for T. parva and T. annulata has facilitated the study of parasite biology and their relationship with host cell transformation and tropism. However, the mechanism of transcriptional regulation in this genus, which may be key to understanding fundamental aspects of its parasitology, remains poorly understood. In this study, we analyze the evolution of non-coding sequences in the Theileria genome and identify conserved sequence elements that may be involved in gene regulation of these parasitic species. Results Intergenic regions and introns in Theileria are short, and their length distributions are considerably right-skewed. Intergenic regions flanked by genes in 5'-5' orientation tend to be longer and slightly more AT-rich than those flanked by two stop codons; intergenic regions flanked by genes in 3'-5' orientation have intermediate values of length and AT composition. Intron position is negatively correlated with intron length, and positively correlated with GC content. Using stringent criteria, we identified a set of high-quality orthologous non-coding sequences between T. parva and T. annulata, and determined the distribution of selective constraints across regions, which are shown to be higher close to translation start sites. A positive correlation between constraint and length in both intergenic regions and introns suggests a tight control over length expansion of non-coding regions. Genome-wide searches for functional elements revealed several conserved motifs in intergenic regions of Theileria genomes. Two such motifs are preferentially located within the first 60 base pairs upstream of transcription start sites in T. parva, are preferentially associated with specific protein functional categories, and have significant similarity to know

  9. Cloning, expression and characterisation of a novel gene encoding ...

    African Journals Online (AJOL)

    微软用户

    2012-01-12

    Jan 12, 2012 ... ... characterisation of a novel gene encoding a chemosensory protein from Bemisia ... The genomic DNA sequence comparisons revealed a 1490 bp intron ... have several conserved sequence motifs, including the. N-terminal ...

  10. Recurrent loss of specific introns during angiosperm evolution.

    Directory of Open Access Journals (Sweden)

    Hao Wang

    2014-12-01

    Full Text Available Numerous instances of presence/absence variations for introns have been documented in eukaryotes, and some cases of recurrent loss of the same intron have been suggested. However, there has been no comprehensive or phylogenetically deep analysis of recurrent intron loss. Of 883 cases of intron presence/absence variation that we detected in five sequenced grass genomes, 93 were confirmed as recurrent losses and the rest could be explained by single losses (652 or single gains (118. No case of recurrent intron gain was observed. Deep phylogenetic analysis often indicated that apparent intron gains were actually numerous independent losses of the same intron. Recurrent loss exhibited extreme non-randomness, in that some introns were removed independently in many lineages. The two larger genomes, maize and sorghum, were found to have a higher rate of both recurrent loss and overall loss and/or gain than foxtail millet, rice or Brachypodium. Adjacent introns and small introns were found to be preferentially lost. Intron loss genes exhibited a high frequency of germ line or early embryogenesis expression. In addition, flanking exon A+T-richness and intron TG/CG ratios were higher in retained introns. This last result suggests that epigenetic status, as evidenced by a loss of methylated CG dinucleotides, may play a role in the process of intron loss. This study provides the first comprehensive analysis of recurrent intron loss, makes a series of novel findings on the patterns of recurrent intron loss during the evolution of the grass family, and provides insight into the molecular mechanism(s underlying intron loss.

  11. A CACGTG motif of the Antirrhinum majus chalcone synthase promoter is recognized by an evolutionarily conserved nuclear protein

    International Nuclear Information System (INIS)

    Staiger, D.; Kaulen, H.; Schell, J.

    1989-01-01

    In the chalcone synthase gene of Antirrhinum majus (snapdragon), 150 base pairs of the 5' flanking region contain cis-acting signals for UV light-induced expression. A nuclear factor, designated CG-1, specifically recognizes a hexameric motif with internal dyad symmetry, CACGTG, located within this light-responsive sequence. Binding of CG-1 is influenced by C-methylation of the CpG dinucleotide in the recognition sequence. CG-1 is a factor found in a variety of dicotyledonous plant species including Nicotiana tabacum, A. majus, Petunia hybrida, Arabidopsis thaliana, and Glycine max. CACGTG motifs contained within trans-acting factor recognition sites in various other plant promoters can interact with CG-1. In addition, the binding site of the human adenovirus major late transcription factor USF can compete for CG-1 binding to the chalcone synthase promoter. This suggests an evolutionary conservation of trans-acting factor recognition sites involved in divergent mechanisms of gene control. (author)

  12. TOPDOM: database of conservatively located domains and motifs in proteins.

    Science.gov (United States)

    Varga, Julia; Dobson, László; Tusnády, Gábor E

    2016-09-01

    The TOPDOM database-originally created as a collection of domains and motifs located consistently on the same side of the membranes in α-helical transmembrane proteins-has been updated and extended by taking into consideration consistently localized domains and motifs in globular proteins, too. By taking advantage of the recently developed CCTOP algorithm to determine the type of a protein and predict topology in case of transmembrane proteins, and by applying a thorough search for domains and motifs as well as utilizing the most up-to-date version of all source databases, we managed to reach a 6-fold increase in the size of the whole database and a 2-fold increase in the number of transmembrane proteins. TOPDOM database is available at http://topdom.enzim.hu The webpage utilizes the common Apache, PHP5 and MySQL software to provide the user interface for accessing and searching the database. The database itself is generated on a high performance computer. tusnady.gabor@ttk.mta.hu Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  13. Development of EST Intron-Targeting SNP Markers for Panax ginseng and Their Application to Cultivar Authentication.

    Science.gov (United States)

    Wang, Hongtao; Li, Guisheng; Kwon, Woo-Saeng; Yang, Deok-Chun

    2016-06-04

    Panax ginseng is one of the most valuable medicinal plants in the Orient. The low level of genetic variation has limited the application of molecular markers for cultivar authentication and marker-assisted selection in cultivated ginseng. To exploit DNA polymorphism within ginseng cultivars, ginseng expressed sequence tags (ESTs) were searched against the potential intron polymorphism (PIP) database to predict the positions of introns. Intron-flanking primers were then designed in conserved exon regions and used to amplify across the more variable introns. Sequencing results showed that single nucleotide polymorphisms (SNPs), as well as indels, were detected in four EST-derived introns, and SNP markers specific to "Gopoong" and "K-1" were first reported in this study. Based on cultivar-specific SNP sites, allele-specific polymerase chain reaction (PCR) was conducted and proved to be effective for the authentication of ginseng cultivars. Additionally, the combination of a simple NaOH-Tris DNA isolation method and real-time allele-specific PCR assay enabled the high throughput selection of cultivars from ginseng fields. The established real-time allele-specific PCR assay should be applied to molecular authentication and marker assisted selection of P. ginseng cultivars, and the EST intron-targeting strategy will provide a potential approach for marker development in species without whole genomic DNA sequence information.

  14. Mutations in the Lactococcus lactis Ll.LtrB group II intron that retain mobility in vivo

    Directory of Open Access Journals (Sweden)

    D'Souza Lisa M

    2002-12-01

    Full Text Available Abstract Background Group II introns are mobile genetic elements that form conserved secondary and tertiary structures. In order to determine which of the conserved structural elements are required for mobility, a series of domain and sub-domain deletions were made in the Lactococcus lactis group II intron (Ll.LtrB and tested for mobility in a genetic assay. Point mutations in domains V and VI were also tested. Results The largest deletion that could be made without severely compromising mobility was 158 nucleotides in DIVb(1–2. This mutant had a mobility frequency comparable to the wild-type Ll.LtrB intron (ΔORF construct. Hence, all subsequent mutations were done in this mutant background. Deletion of DIIb reduced mobility to approximately 18% of wild-type, while another deletion in domain II (nts 404–459 was mobile to a minor extent. Only two deletions in DI and none in DIII were tolerated. Some mobility was also observed for a DIVa deletion mutant. Of the three point mutants at position G3 in DV, only G3A retained mobility. In DVI, deletion of the branch-point nucleotide abolished mobility, but the presence of any nucleotide at the branch-point position restored mobility to some extent. Conclusions The smallest intron capable of efficient retrohoming was 725 nucleotides, comprising the DIVb(1–2 and DII(iia,b deletions. The tertiary elements found to be nonessential for mobility were alpha, kappa and eta. In DV, only the G3A mutant was mobile. A branch-point residue is required for intron mobility.

  15. A single amino-acid change in a highly conserved motif of gp41 elicits HIV-1 neutralization and protects against CD4 depletion.

    Science.gov (United States)

    Petitdemange, Caroline; Achour, Abla; Dispinseri, Stefania; Malet, Isabelle; Sennepin, Alexis; Ho Tsong Fang, Raphaël; Crouzet, Joël; Marcelin, Anne-Geneviève; Calvez, Vincent; Scarlatti, Gabriella; Debré, Patrice; Vieillard, Vincent

    2013-09-01

    The induction of neutralizing antibodies against conserved regions of the human immunodeficiency virus type 1 (HIV-1) envelope protein is a major goal of vaccine strategies. We previously identified 3S, a critical conserved motif of gp41 that induces the NKp44L ligand of an activating NK receptor. In vivo, anti-3S antibodies protect against the natural killer (NK) cell-mediated CD4 depletion that occurs without efficient viral neutralization. Specific substitutions within the 3S peptide motif were prepared by directed mutagenesis. Virus production was monitored by measuring the p24 production. Neutralization assays were performed with immune-purified antibodies from immunized mice and a cohort of HIV-infected patients. Expression of NKp44L on CD4(+) T cells and degranulation assay on activating NK cells were both performed by flow cytometry. Here, we show that specific substitutions in the 3S motif reduce viral infection without affecting gp41 production, while decreasing both its capacity to induce NKp44L expression on CD4(+) T cells and its sensitivity to autologous NK cells. Generation of antibodies in mice against the W614 specific position in the 3S motif elicited a capacity to neutralize cross-clade viruses, notable in its magnitude, breadth, and durability. Antibodies against this 3S variant were also detected in sera from some HIV-1-infected patients, demonstrating both neutralization activity and protection against CD4 depletion. These findings suggest that a specific substitution in a 3S-based immunogen might allow the generation of specific antibodies, providing a foundation for a rational vaccine that combine a capacity to neutralize HIV-1 and to protect CD4(+) T cells.

  16. Identity and functions of CxxC-derived motifs.

    Science.gov (United States)

    Fomenko, Dmitri E; Gladyshev, Vadim N

    2003-09-30

    Two cysteines separated by two other residues (the CxxC motif) are employed by many redox proteins for formation, isomerization, and reduction of disulfide bonds and for other redox functions. The place of the C-terminal cysteine in this motif may be occupied by serine (the CxxS motif), modifying the functional repertoire of redox proteins. Here we found that the CxxC motif may also give rise to a motif, in which the C-terminal cysteine is replaced with threonine (the CxxT motif). Moreover, in contrast to a view that the N-terminal cysteine in the CxxC motif always serves as a nucleophilic attacking group, this residue could also be replaced with threonine (the TxxC motif), serine (the SxxC motif), or other residues. In each of these CxxC-derived motifs, the presence of a downstream alpha-helix was strongly favored. A search for conserved CxxC-derived motif/helix patterns in four complete genomes representing bacteria, archaea, and eukaryotes identified known redox proteins and suggested possible redox functions for several additional proteins. Catalytic sites in peroxiredoxins were major representatives of the TxxC motif, whereas those in glutathione peroxidases represented the CxxT motif. Structural assessments indicated that threonines in these enzymes could stabilize catalytic thiolates, suggesting revisions to previously proposed catalytic triads. Each of the CxxC-derived motifs was also observed in natural selenium-containing proteins, in which selenocysteine was present in place of a catalytic cysteine.

  17. CELF1 preferentially binds to exon-intron boundary and regulates alternative splicing in HeLa cells.

    Science.gov (United States)

    Xia, Heng; Chen, Dong; Wu, Qijia; Wu, Gang; Zhou, Yanhong; Zhang, Yi; Zhang, Libin

    2017-09-01

    The current RIP-seq approach has been developed for the identification of genome-wide interaction between RNA binding protein (RBP) and the bound RNA transcripts, but still rarely for identifying its binding sites. In this study, we performed RIP-seq experiments in HeLa cells using a monoclonal antibody against CELF1. Mapping of the RIP-seq reads showed a biased distribution at the 3'UTR and intronic regions. A total of 15,285 and 1384 CELF1-specific sense and antisense peaks were identified using the ABLIRC software tool. Our bioinformatics analyses revealed that 5' and 3' splice site motifs and GU-rich motifs were highly enriched in the CELF1-bound peaks. Furthermore, transcriptome analyses revealed that alternative splicing was globally regulated by CELF1 in HeLa cells. For example, the inclusion of exon 16 of LMO7 gene, a marker gene of breast cancer, is positively regulated by CELF1. Taken together, we have shown that RIP-seq data can be used to decipher RBP binding sites and reveal an unexpected landscape of the genome-wide CELF1-RNA interactions in HeLa cells. In addition, we found that CELF1 globally regulates the alternative splicing by binding the exon-intron boundary in HeLa cells, which will deepen our understanding of the regulatory roles of CELF1 in the pre-mRNA splicing process. Copyright © 2017 Elsevier B.V. All rights reserved.

  18. Frequency of intron loss correlates with processed pseudogene abundance: a novel strategy to test the reverse transcriptase model of intron loss.

    Science.gov (United States)

    Zhu, Tao; Niu, Deng-Ke

    2013-03-05

    Although intron loss in evolution has been described, the mechanism involved is still unclear. Three models have been proposed, the reverse transcriptase (RT) model, genomic deletion model and double-strand-break repair model. The RT model, also termed mRNA-mediated intron loss, suggests that cDNA molecules reverse transcribed from spliced mRNA recombine with genomic DNA causing intron loss. Many studies have attempted to test this model based on its predictions, such as simultaneous loss of adjacent introns, 3'-side bias of intron loss, and germline expression of intron-lost genes. Evidence either supporting or opposing the model has been reported. The mechanism of intron loss proposed in the RT model shares the process of reverse transcription with the formation of processed pseudogenes. If the RT model is correct, genes that have produced more processed pseudogenes are more likely to undergo intron loss. In the present study, we observed that the frequency of intron loss is correlated with processed pseudogene abundance by analyzing a new dataset of intron loss obtained in mice and rats. Furthermore, we found that mRNA molecules of intron-lost genes are mostly translated on free cytoplasmic ribosomes, a feature shared by mRNA molecules of the parental genes of processed pseudogenes and long interspersed elements. This feature is likely convenient for intron-lost gene mRNA molecules to be reverse transcribed. Analyses of adjacent intron loss, 3'-side bias of intron loss, and germline expression of intron-lost genes also support the RT model. Compared with previous evidence, the correlation between the abundance of processed pseudogenes and intron loss frequency more directly supports the RT model of intron loss. Exploring such a correlation is a new strategy to test the RT model in organisms with abundant processed pseudogenes.

  19. A proposed vestigial translation initiation motif in VP1 of hepatitis A virus.

    Science.gov (United States)

    Kang, Jeong-Ah; Funkhouser, Ann W

    2002-07-01

    The internal ribosome entry site (IRES) of picornaviruses has a 3' polypyrimidine tract (PPT) 16-24 bases upstream of an AUG triplet (PPT/AUG motif). This motif is critical in determining the efficiency of cap-independent translation. HAV has a conserved PPT/AUG motif consisting of a nine base sequence (AGGUUUUUC) 23 bases upstream of the preferred AUG start codon. This HAV-specific PPT/AUG motif is repeated and conserved in VP1 of HAV, but not of other picornaviruses. We proposed that the PPT/AUG motif in the open reading frame initiated translation and/or had an impact on the life cycle of the virus. In vitro translation of mutant bicistronic mRNAs and growth in cell culture of mutant viruses provided no evidence that the VP1 PPT/AUG motif had any impact on either translation or growth. HAV differs from other picornaviruses in its inefficient growth in cell culture. Since the HAV-specific PPT/AUG motif is found in only 1 in 300,000 reported viral sequences outside the hepatovirus genus, this motif may be a vestigial translation initiation element and may have played a role in determining the unusual phenotype of HAV.

  20. Imprecise intron losses are less frequent than precise intron losses but are not rare in plants.

    Science.gov (United States)

    Ma, Ming-Yue; Zhu, Tao; Li, Xue-Nan; Lan, Xin-Ran; Liu, Heng-Yuan; Yang, Yu-Fei; Niu, Deng-Ke

    2015-05-27

    In this study, we identified 19 intron losses, including 11 precise intron losses (PILs), six imprecise intron losses (IILs), one de-exonization, and one exon deletion in tomato and potato, and 17 IILs in Arabidopsis thaliana. Comparative analysis of related genomes confirmed that all of the IILs have been fixed during evolution. Consistent with previous studies, our results indicate that PILs are a major type of intron loss. However, at least in plants, IILs are unlikely to be as rare as previously reported. This article was reviewed by Jun Yu and Zhang Zhang. For complete reviews, see the Reviewers' Reports section.

  1. Translational Control of Host Gene Expression by a Cys-Motif Protein Encoded in a Bracovirus.

    Directory of Open Access Journals (Sweden)

    Eunseong Kim

    Full Text Available Translational control is a strategy that various viruses use to manipulate their hosts to suppress acute antiviral response. Polydnaviruses, a group of insect double-stranded DNA viruses symbiotic to some endoparasitoid wasps, are divided into two genera: ichnovirus (IV and bracovirus (BV. In IV, some Cys-motif genes are known as host translation-inhibitory factors (HTIF. The genome of endoparasitoid wasp Cotesia plutellae contains a Cys-motif gene (Cp-TSP13 homologous to an HTIF known as teratocyte-secretory protein 14 (TSP14 of Microplitis croceipes. Cp-TSP13 consists of 129 amino acid residues with a predicted molecular weight of 13.987 kDa and pI value of 7.928. Genomic DNA region encoding its open reading frame has three introns. Cp-TSP13 possesses six conserved cysteine residues as other Cys-motif genes functioning as HTIF. Cp-TSP13 was expressed in Plutella xylostella larvae parasitized by C. plutellae. C. plutellae bracovirus (CpBV was purified and injected into non-parasitized P. xylostella that expressed Cp-TSP13. Cp-TSP13 was cloned into a eukaryotic expression vector and used to infect Sf9 cells to transiently express Cp-TSP13. The synthesized Cp-TSP13 protein was detected in culture broth. An overlaying experiment showed that the purified Cp-TSP13 entered hemocytes. It was localized in the cytosol. Recombinant Cp-TSP13 significantly inhibited protein synthesis of secretory proteins when it was added to in vitro cultured fat body. In addition, the recombinant Cp-TSP13 directly inhibited the translation of fat body mRNAs in in vitro translation assay using rabbit reticulocyte lysate. Moreover, the recombinant Cp-TSP13 significantly suppressed cellular immune responses by inhibiting hemocyte-spreading behavior. It also exhibited significant insecticidal activities by both injection and feeding routes. These results indicate that Cp-TSP13 is a viral HTIF.

  2. The Runt domain of AML1 (RUNX1) binds a sequence-conserved RNA motif that mimics a DNA element.

    Science.gov (United States)

    Fukunaga, Junichi; Nomura, Yusuke; Tanaka, Yoichiro; Amano, Ryo; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Sakamoto, Taiichi; Kozu, Tomoko

    2013-07-01

    AML1 (RUNX1) is a key transcription factor for hematopoiesis that binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. Aberrations in the AML1 gene are frequently found in human leukemia. To better understand AML1 and its potential utility for diagnosis and therapy, we obtained RNA aptamers that bind specifically to the AML1 Runt domain. Enzymatic probing and NMR analyses revealed that Apt1-S, which is a truncated variant of one of the aptamers, has a CACG tetraloop and two stem regions separated by an internal loop. All the isolated aptamers were found to contain the conserved sequence motif 5'-NNCCAC-3' and 5'-GCGMGN'N'-3' (M:A or C; N and N' form Watson-Crick base pairs). The motif contains one AC mismatch and one base bulged out. Mutational analysis of Apt1-S showed that three guanines of the motif are important for Runt binding as are the three guanines of RDE, which are directly recognized by three arginine residues of the Runt domain. Mutational analyses of the Runt domain revealed that the amino acid residues used for Apt1-S binding were similar to those used for RDE binding. Furthermore, the aptamer competed with RDE for binding to the Runt domain in vitro. These results demonstrated that the Runt domain of the AML1 protein binds to the motif of the aptamer that mimics DNA. Our findings should provide new insights into RNA function and utility in both basic and applied sciences.

  3. MSDmotif: exploring protein sites and motifs

    Directory of Open Access Journals (Sweden)

    Henrick Kim

    2008-07-01

    Full Text Available Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB is the source of this information however present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the distributed Annotation System (DAS protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.

  4. Speciation of a group I intron into a lariat capping ribozyme

    DEFF Research Database (Denmark)

    Meyer, Mélanie; Nielsen, Henrik; Oliéric, Vincent

    2014-01-01

    The lariat-capping (LC) ribozyme is a natural ribozyme isolated from eukaryotic microorganisms. Despite apparent structural similarity to group I introns, the LC ribozyme catalyzes cleavage by a 2',5' branching reaction, leaving the 3' product with a 3-nt lariat cap that functionally substitutes ....... The structures also show how conserved interactions twist residues, forming the lariat to join chemical groups involved in branching....

  5. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites

    KAUST Repository

    Wong, Aloysius Tze

    2015-06-09

    Moonlighting functional centers within proteins can provide them with hitherto unrecognized functions. Here, we review how hidden moonlighting functional centers, which we define as binding sites that have catalytic activity or regulate protein function in a novel manner, can be identified using targeted bioinformatic searches. Functional motifs used in such searches include amino acid residues that are conserved across species and many of which have been assigned functional roles based on experimental evidence. Molecules that were identified in this manner seeking cyclic mononucleotide cyclases in plants are used as examples. The strength of this computational approach is enhanced when good homology models can be developed to test the functionality of the predicted centers in silico, which, in turn, increases confidence in the ability of the identified candidates to perform the predicted functions. Computational characterization of moonlighting functional centers is not diagnostic for catalysis but serves as a rapid screening method, and highlights testable targets from a potentially large pool of candidates for subsequent in vitro and in vivo experiments required to confirm the functionality of the predicted moonlighting centers.

  6. Conserved Functional Motifs and Homology Modeling to Predict Hidden Moonlighting Functional Sites

    KAUST Repository

    Wong, Aloysius Tze; Gehring, Christoph A; Irving, Helen R.

    2015-01-01

    Moonlighting functional centers within proteins can provide them with hitherto unrecognized functions. Here, we review how hidden moonlighting functional centers, which we define as binding sites that have catalytic activity or regulate protein function in a novel manner, can be identified using targeted bioinformatic searches. Functional motifs used in such searches include amino acid residues that are conserved across species and many of which have been assigned functional roles based on experimental evidence. Molecules that were identified in this manner seeking cyclic mononucleotide cyclases in plants are used as examples. The strength of this computational approach is enhanced when good homology models can be developed to test the functionality of the predicted centers in silico, which, in turn, increases confidence in the ability of the identified candidates to perform the predicted functions. Computational characterization of moonlighting functional centers is not diagnostic for catalysis but serves as a rapid screening method, and highlights testable targets from a potentially large pool of candidates for subsequent in vitro and in vivo experiments required to confirm the functionality of the predicted moonlighting centers.

  7. Extensive intron gain in the ancestor of placental mammals

    Science.gov (United States)

    2011-01-01

    Background Genome-wide studies of intron dynamics in mammalian orthologous genes have found convincing evidence for loss of introns but very little for intron turnover. Similarly, large-scale analysis of intron dynamics in a few vertebrate genomes has identified only intron losses and no gains, indicating that intron gain is an extremely rare event in vertebrate evolution. These studies suggest that the intron-rich genomes of vertebrates do not allow intron gain. The aim of this study was to search for evidence of de novo intron gain in domesticated genes from an analysis of their exon/intron structures. Results A phylogenomic approach has been used to analyse all domesticated genes in mammals and chordates that originated from the coding parts of transposable elements. Gain of introns in domesticated genes has been reconstructed on well established mammalian, vertebrate and chordate phylogenies, and examined as to where and when the gain events occurred. The locations, sizes and amounts of de novo introns gained in the domesticated genes during the evolution of mammals and chordates has been analyzed. A significant amount of intron gain was found only in domesticated genes of placental mammals, where more than 70 cases were identified. De novo gained introns show clear positional bias, since they are distributed mainly in 5' UTR and coding regions, while 3' UTR introns are very rare. In the coding regions of some domesticated genes up to 8 de novo gained introns have been found. Intron densities in Eutheria-specific domesticated genes and in older domesticated genes that originated early in vertebrates are lower than those for normal mammalian and vertebrate genes. Surprisingly, the majority of intron gains have occurred in the ancestor of placentals. Conclusions This study provides the first evidence for numerous intron gains in the ancestor of placental mammals and demonstrates that adequate taxon sampling is crucial for reconstructing intron evolution. The

  8. Reenacting the birth of an intron

    Energy Technology Data Exchange (ETDEWEB)

    Hellsten, Uffe; Aspden, Julie L.; Rio, Donald C.; Rokhsar, Daniel S.

    2011-07-01

    An intron is an extended genomic feature whose function requires multiple constrained positions - donor and acceptor splice sites, a branch point, a polypyrimidine tract and suitable splicing enhancers - that may be distributed over hundreds or thousands of nucleotides. New introns are therefore unlikely to emerge by incremental accumulation of functional sub-elements. Here we demonstrate that a functional intron can be created de novo in a single step by a segmental genomic duplication. This experiment recapitulates in vivo the birth of an intron that arose in the ancestral jawed vertebrate lineage nearly half a billion years ago.

  9. The peculiarities of large intron splicing in animals.

    Directory of Open Access Journals (Sweden)

    Samuel Shepard

    Full Text Available In mammals a considerable 92% of genes contain introns, with hundreds and hundreds of these introns reaching the incredible size of over 50,000 nucleotides. These "large introns" must be spliced out of the pre-mRNA in a timely fashion, which involves bringing together distant 5' and 3' acceptor and donor splice sites. In invertebrates, especially Drosophila, it has been shown that larger introns can be spliced efficiently through a process known as recursive splicing-a consecutive splicing from the 5'-end at a series of combined donor-acceptor splice sites called RP-sites. Using a computational analysis of the genomic sequences, we show that vertebrates lack the proper enrichment of RP-sites in their large introns, and, therefore, require some other method to aid splicing. We analyzed over 15,000 non-redundant, large introns from six mammals, 1,600 from chicken and zebrafish, and 560 non-redundant large introns from five invertebrates. Our bioinformatic investigation demonstrates that, unlike the studied invertebrates, the studied vertebrate genomes contain consistently abundant amounts of direct and complementary strand interspersed repetitive elements (mainly SINEs and LINEs that may form stems with each other in large introns. This examination showed that predicted stems are indeed abundant and stable in the large introns of mammals. We hypothesize that such stems with long loops within large introns allow intron splice sites to find each other more quickly by folding the intronic RNA upon itself at smaller intervals and, thus, reducing the distance between donor and acceptor sites.

  10. Functional characterisation of an intron retaining K+ transporter of barley reveals intron-mediated alternate splicing

    KAUST Repository

    Shahzad, K.; Rauf, M.; Ahmed, M.; Malik, Z. A.; Habib, I.; Ahmed, Z.; Mahmood, K.; Ali, R.; Masmoudi, K.; Lemtiri-Chlieh, Fouad; Gehring, Christoph A; Berkowitz, G. A.; Saeed, N. A.

    2015-01-01

    Intron retention in transcripts and the presence of 5 and 3 splice sites within these introns mediate alternate splicing, which is widely observed in animals and plants. Here, functional characterisation of the K+ transporter, HvHKT2;1, with stably

  11. Gene Isolation Using Degenerate Primers Targeting Protein Motif: A Laboratory Exercise

    Science.gov (United States)

    Yeo, Brandon Pei Hui; Foong, Lian Chee; Tam, Sheh May; Lee, Vivian; Hwang, Siaw San

    2018-01-01

    Structures and functions of protein motifs are widely included in many biology-based course syllabi. However, little emphasis is placed to link this knowledge to applications in biotechnology to enhance the learning experience. Here, the conserved motifs of nucleotide binding site-leucine rich repeats (NBS-LRR) proteins, successfully used for the…

  12. Identification of putative regulatory motifs in the upstream regions of co-expressed functional groups of genes in Plasmodium falciparum

    Directory of Open Access Journals (Sweden)

    Joshi NV

    2009-01-01

    Full Text Available Abstract Background Regulation of gene expression in Plasmodium falciparum (Pf remains poorly understood. While over half the genes are estimated to be regulated at the transcriptional level, few regulatory motifs and transcription regulators have been found. Results The study seeks to identify putative regulatory motifs in the upstream regions of 13 functional groups of genes expressed in the intraerythrocytic developmental cycle of Pf. Three motif-discovery programs were used for the purpose, and motifs were searched for only on the gene coding strand. Four motifs – the 'G-rich', the 'C-rich', the 'TGTG' and the 'CACA' motifs – were identified, and zero to all four of these occur in the 13 sets of upstream regions. The 'CACA motif' was absent in functional groups expressed during the ring to early trophozoite transition. For functional groups expressed in each transition, the motifs tended to be similar. Upstream motifs in some functional groups showed 'positional conservation' by occurring at similar positions relative to the translational start site (TLS; this increases their significance as regulatory motifs. In the ribonucleotide synthesis, mitochondrial, proteasome and organellar translation machinery genes, G-rich, C-rich, CACA and TGTG motifs, respectively, occur with striking positional conservation. In the organellar translation machinery group, G-rich motifs occur close to the TLS. The same motifs were sometimes identified for multiple functional groups; differences in location and abundance of the motifs appear to ensure different modes of action. Conclusion The identification of positionally conserved over-represented upstream motifs throws light on putative regulatory elements for transcription in Pf.

  13. Recruitment of Staufen2 Enhances Dendritic Localization of an Intron-Containing CaMKIIα mRNA

    Directory of Open Access Journals (Sweden)

    Raúl Ortiz

    2017-07-01

    Full Text Available Regulation of mRNA localization is a conserved cellular process observed in many types of cells and organisms. Asymmetrical mRNA distribution plays a particularly important role in the nervous system, where local translation of localized mRNA represents a key mechanism in synaptic plasticity. CaMKIIα is a very abundant mRNA detected in neurites, consistent with its crucial role at glutamatergic synapses. Here, we report the presence of CaMKIIα mRNA isoforms that contain intron i16 in dendrites, RNA granules, and synaptoneurosomes from primary neurons and brain. This subpopulation of unspliced mRNA preferentially localizes to distal dendrites in a synaptic-activity-dependent manner. Staufen2, a well-established marker of RNA transport in dendrites, interacts with intron i16 sequences and enhances its distal dendritic localization, pointing to the existence of intron-mediated mechanisms in the molecular pathways that modulate dendritic transport and localization of synaptic mRNAs.

  14. Motif finding in DNA sequences based on skipping nonconserved positions in background Markov chains.

    Science.gov (United States)

    Zhao, Xiaoyan; Sze, Sing-Hoi

    2011-05-01

    One strategy to identify transcription factor binding sites is through motif finding in upstream DNA sequences of potentially co-regulated genes. Despite extensive efforts, none of the existing algorithms perform very well. We consider a string representation that allows arbitrary ignored positions within the nonconserved portion of single motifs, and use O(2(l)) Markov chains to model the background distributions of motifs of length l while skipping these positions within each Markov chain. By focusing initially on positions that have fixed nucleotides to define core occurrences, we develop an algorithm to identify motifs of moderate lengths. We compare the performance of our algorithm to other motif finding algorithms on a few benchmark data sets, and show that significant improvement in accuracy can be obtained when the sites are sufficiently conserved within a given sample, while comparable performance is obtained when the site conservation rate is low. A software program (PosMotif ) and detailed results are available online at http://faculty.cse.tamu.edu/shsze/posmotif.

  15. Large-scale discovery of promoter motifs in Drosophila melanogaster.

    Directory of Open Access Journals (Sweden)

    Thomas A Down

    2007-01-01

    Full Text Available A key step in understanding gene regulation is to identify the repertoire of transcription factor binding motifs (TFBMs that form the building blocks of promoters and other regulatory elements. Identifying these experimentally is very laborious, and the number of TFBMs discovered remains relatively small, especially when compared with the hundreds of transcription factor genes predicted in metazoan genomes. We have used a recently developed statistical motif discovery approach, NestedMICA, to detect candidate TFBMs from a large set of Drosophila melanogaster promoter regions. Of the 120 motifs inferred in our initial analysis, 25 were statistically significant matches to previously reported motifs, while 87 appeared to be novel. Analysis of sequence conservation and motif positioning suggested that the great majority of these discovered motifs are predictive of functional elements in the genome. Many motifs showed associations with specific patterns of gene expression in the D. melanogaster embryo, and we were able to obtain confident annotation of expression patterns for 25 of our motifs, including eight of the novel motifs. The motifs are available through Tiffin, a new database of DNA sequence motifs. We have discovered many new motifs that are overrepresented in D. melanogaster promoter regions, and offer several independent lines of evidence that these are novel TFBMs. Our motif dictionary provides a solid foundation for further investigation of regulatory elements in Drosophila, and demonstrates techniques that should be applicable in other species. We suggest that further improvements in computational motif discovery should narrow the gap between the set of known motifs and the total number of transcription factors in metazoan genomes.

  16. Efficient motif finding algorithms for large-alphabet inputs

    Directory of Open Access Journals (Sweden)

    Pavlovic Vladimir

    2010-10-01

    Full Text Available Abstract Background We consider the problem of identifying motifs, recurring or conserved patterns, in the biological sequence data sets. To solve this task, we present a new deterministic algorithm for finding patterns that are embedded as exact or inexact instances in all or most of the input strings. Results The proposed algorithm (1 improves search efficiency compared to existing algorithms, and (2 scales well with the size of alphabet. On a synthetic planted DNA motif finding problem our algorithm is over 10× more efficient than MITRA, PMSPrune, and RISOTTO for long motifs. Improvements are orders of magnitude higher in the same setting with large alphabets. On benchmark TF-binding site problems (FNP, CRP, LexA we observed reduction in running time of over 12×, with high detection accuracy. The algorithm was also successful in rapidly identifying protein motifs in Lipocalin, Zinc metallopeptidase, and supersecondary structure motifs for Cadherin and Immunoglobin families. Conclusions Our algorithm reduces computational complexity of the current motif finding algorithms and demonstrate strong running time improvements over existing exact algorithms, especially in important and difficult cases of large-alphabet sequences.

  17. Stanniocalcin 1 binds hemin through a partially conserved heme regulatory motif

    International Nuclear Information System (INIS)

    Westberg, Johan A.; Jiang, Ji; Andersson, Leif C.

    2011-01-01

    Highlights: → Stanniocalcin 1 (STC1) binds heme through novel heme binding motif. → Central iron atom of heme and cysteine-114 of STC1 are essential for binding. → STC1 binds Fe 2+ and Fe 3+ heme. → STC1 peptide prevents oxidative decay of heme. -- Abstract: Hemin (iron protoporphyrin IX) is a necessary component of many proteins, functioning either as a cofactor or an intracellular messenger. Hemoproteins have diverse functions, such as transportation of gases, gas detection, chemical catalysis and electron transfer. Stanniocalcin 1 (STC1) is a protein involved in respiratory responses of the cell but whose mechanism of action is still undetermined. We examined the ability of STC1 to bind hemin in both its reduced and oxidized states and located Cys 114 as the axial ligand of the central iron atom of hemin. The amino acid sequence differs from the established (Cys-Pro) heme regulatory motif (HRM) and therefore presents a novel heme binding motif (Cys-Ser). A STC1 peptide containing the heme binding sequence was able to inhibit both spontaneous and H 2 O 2 induced decay of hemin. Binding of hemin does not affect the mitochondrial localization of STC1.

  18. CompariMotif: quick and easy comparisons of sequence motifs.

    Science.gov (United States)

    Edwards, Richard J; Davey, Norman E; Shields, Denis C

    2008-05-15

    CompariMotif is a novel tool for making motif-motif comparisons, identifying and describing similarities between regular expression motifs. CompariMotif can identify a number of different relationships between motifs, including exact matches, variants of degenerate motifs and complex overlapping motifs. Motif relationships are scored using shared information content, allowing the best matches to be easily identified in large comparisons. Many input and search options are available, enabling a list of motifs to be compared to itself (to identify recurring motifs) or to datasets of known motifs. CompariMotif can be run online at http://bioware.ucd.ie/ and is freely available for academic use as a set of open source Python modules under a GNU General Public License from http://bioinformatics.ucd.ie/shields/software/comparimotif/

  19. A conserved WW domain-like motif regulates invariant chain-dependent cell-surface transport of the NKG2D ligand ULBP2.

    Science.gov (United States)

    Uhlenbrock, Franziska; van Andel, Esther; Andresen, Lars; Skov, Søren

    2015-08-01

    Malignant cells expressing NKG2D ligands on their cell surface can be directly sensed and killed by NKG2D-bearing lymphocytes. To ensure this immune recognition, accumulating evidence suggests that NKG2D ligands are trafficed via alternative pathways to the cell surface. We have previously shown that the NKG2D ligand ULBP2 traffics over an invariant chain (Ii)-dependent pathway to the cell surface. This study set out to elucidate how Ii regulates ULBP2 cell-surface transport: We discovered conserved tryptophan (Trp) residues in the primary protein sequence of ULBP1-6 but not in the related MICA/B. Substitution of Trp to alanine resulted in cell-surface inhibition of ULBP2 in different cancer cell lines. Moreover, the mutated ULBP2 constructs were retained and not degraded inside the cell, indicating a crucial role of this conserved Trp-motif in trafficking. Finally, overexpression of Ii increased surface expression of wt ULBP2 while Trp-mutants could not be expressed, proposing that this Trp-motif is required for an Ii-dependent cell-surface transport of ULBP2. Aberrant soluble ULBP2 is immunosuppressive. Thus, targeting a distinct protein module on the ULBP2 sequence could counteract this abnormal expression of ULBP2. Copyright © 2015 Elsevier Ltd. All rights reserved.

  20. Identify alternative splicing events based on position-specific evolutionary conservation.

    Directory of Open Access Journals (Sweden)

    Liang Chen

    Full Text Available The evolution of eukaryotes is accompanied by the increased complexity of alternative splicing which greatly expands genome information. One of the greatest challenges in the post-genome era is a complete revelation of human transcriptome with consideration of alternative splicing. Here, we introduce a comparative genomics approach to systemically identify alternative splicing events based on the differential evolutionary conservation between exons and introns and the high-quality annotation of the ENCODE regions. Specifically, we focus on exons that are included in some transcripts but are completely spliced out for others and we call them conditional exons. First, we characterize distinguishing features among conditional exons, constitutive exons and introns. One of the most important features is the position-specific conservation score. There are dramatic differences in conservation scores between conditional exons and constitutive exons. More importantly, the differences are position-specific. For flanking intronic regions, the differences between conditional exons and constitutive exons are also position-specific. Using the Random Forests algorithm, we can classify conditional exons with high specificities (97% for the identification of conditional exons from intron regions and 95% for the classification of known exons and fair sensitivities (64% and 32% respectively. We applied the method to the human genome and identified 39,640 introns that actually contain conditional exons and classified 8,813 conditional exons from the current RefSeq exon list. Among those, 31,673 introns containing conditional exons and 5,294 conditional exons classified from known exons cannot be inferred from RefSeq, UCSC or Ensembl annotations. Some of these de novo predictions were experimentally verified.

  1. MicroRNA genes preferentially expressed in dendritic cells contain sites for conserved transcription factor binding motifs in their promoters

    Directory of Open Access Journals (Sweden)

    Huynen Martijn A

    2011-06-01

    Full Text Available Abstract Background MicroRNAs (miRNAs play a fundamental role in the regulation of gene expression by translational repression or target mRNA degradation. Regulatory elements in miRNA promoters are less well studied, but may reveal a link between their expression and a specific cell type. Results To explore this link in myeloid cells, miRNA expression profiles were generated from monocytes and dendritic cells (DCs. Differences in miRNA expression among monocytes, DCs and their stimulated progeny were observed. Furthermore, putative promoter regions of miRNAs that are significantly up-regulated in DCs were screened for Transcription Factor Binding Sites (TFBSs based on TFBS motif matching score, the degree to which those TFBSs are over-represented in the promoters of the up-regulated miRNAs, and the extent of conservation of the TFBSs in mammals. Conclusions Analysis of evolutionarily conserved TFBSs in DC promoters revealed preferential clustering of sites within 500 bp upstream of the precursor miRNAs and that many mRNAs of cognate TFs of the conserved TFBSs were indeed expressed in the DCs. Taken together, our data provide evidence that selected miRNAs expressed in DCs have evolutionarily conserved TFBSs relevant to DC biology in their promoters.

  2. Sequence and structural analysis of the chitinase insertion domain reveals two conserved motifs involved in chitin-binding.

    Directory of Open Access Journals (Sweden)

    Hai Li

    2010-01-01

    Full Text Available Chitinases are prevalent in life and are found in species including archaea, bacteria, fungi, plants, and animals. They break down chitin, which is the second most abundant carbohydrate in nature after cellulose. Hence, they are important for maintaining a balance between carbon and nitrogen trapped as insoluble chitin in biomass. Chitinases are classified into two families, 18 and 19 glycoside hydrolases. In addition to a catalytic domain, which is a triosephosphate isomerase barrel, many family 18 chitinases contain another module, i.e., chitinase insertion domain. While numerous studies focus on the biological role of the catalytic domain in chitinase activity, the function of the chitinase insertion domain is not completely understood. Bioinformatics offers an important avenue in which to facilitate understanding the role of residues within the chitinase insertion domain in chitinase function.Twenty-seven chitinase insertion domain sequences, which include four experimentally determined structures and span five kingdoms, were aligned and analyzed using a modified sequence entropy parameter. Thirty-two positions with conserved residues were identified. The role of these conserved residues was explored by conducting a structural analysis of a number of holo-enzymes. Hydrogen bonding and van der Waals calculations revealed a distinct subset of four conserved residues constituting two sequence motifs that interact with oligosaccharides. The other conserved residues may be key to the structure, folding, and stability of this domain.Sequence and structural studies of the chitinase insertion domains conducted within the framework of evolution identified four conserved residues which clearly interact with the substrates. Furthermore, evolutionary studies propose a link between the appearance of the chitinase insertion domain and the function of family 18 chitinases in the subfamily A.

  3. Host Factors Influencing the Retrohoming Pathway of Group II Intron RmInt1, Which Has an Intron-Encoded Protein Naturally Devoid of Endonuclease Activity.

    Directory of Open Access Journals (Sweden)

    Rafael Nisa-Martínez

    Full Text Available Bacterial group II introns are self-splicing catalytic RNAs and mobile retroelements that have an open reading frame encoding an intron-encoded protein (IEP with reverse transcriptase (RT and RNA splicing or maturase activity. Some IEPs carry a DNA endonuclease (En domain, which is required to cleave the bottom strand downstream from the intron-insertion site for target DNA-primed reverse transcription (TPRT of the inserted intron RNA. Host factors complete the insertion of the intron. By contrast, the major retrohoming pathway of introns with IEPs naturally lacking endonuclease activity, like the Sinorhizobium meliloti intron RmInt1, is thought to involve insertion of the intron RNA into the template for lagging strand DNA synthesis ahead of the replication fork, with possible use of the nascent strand to prime reverse transcription of the intron RNA. The host factors influencing the retrohoming pathway of such introns have not yet been described. Here, we identify key candidates likely to be involved in early and late steps of RmInt1 retrohoming. Some of these host factors are common to En+ group II intron retrohoming, but some have different functions. Our results also suggest that the retrohoming process of RmInt1 may be less dependent on the intracellular free Mg2+ concentration than those of other group II introns.

  4. Exon-primed intron-crossing (EPIC markers for non-model teleost fishes

    Directory of Open Access Journals (Sweden)

    Riethoven Jean-Jack M

    2010-03-01

    Full Text Available Abstract Background Exon-primed intron-crossing (EPIC markers have three advantages over anonymous genomic sequences in studying evolution of natural populations. First, the universal primers designed in exon regions can be applied across a broad taxonomic range. Second, the homology of EPIC-amplified sequences can be easily determined by comparing either their exon or intron portion depending on the genetic distance between the taxa. Third, having both the exon and intron fragments could help in examining genetic variation at the intraspecific and interspecific level simultaneously, particularly helpful when studying species complex. However, the paucity of EPIC markers has hindered multilocus studies using nuclear gene sequences, particularly in teleost fishes. Results We introduce a bioinformatics pipeline for developing EPIC markers by comparing the whole genome sequences between two or more species. By applying this approach on five teleost fishes whose genomes were available in the Ensembl database http://www.ensembl.org, we identified 210 EPIC markers that have single-copy and conserved exon regions with identity greater than 85% among the five teleost fishes. We tested 12 randomly chosen EPIC markers in nine teleost species having a wide phylogenetic range. The success rate of amplifying and sequencing those markers varied from 44% to 100% in different species. We analyzed the exon sequences of the 12 EPIC markers from 13 teleosts. The resulting phylogeny contains many traditionally well-supported clades, indicating the usefulness of the exon portion of EPIC markers in reconstructing species phylogeny, in addition to the value of the intron portion of EPIC markers in interrogating the population history. Conclusions This study illustrated an effective approach to develop EPIC markers in a taxonomic group, where two or more genome sequences are available. The markers identified could be amplified across a broad taxonomic range of teleost

  5. Stanniocalcin 1 binds hemin through a partially conserved heme regulatory motif

    Energy Technology Data Exchange (ETDEWEB)

    Westberg, Johan A., E-mail: johan.westberg@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland); Jiang, Ji, E-mail: ji.jiang@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland); Andersson, Leif C., E-mail: leif.andersson@helsinki.fi [Department of Pathology, Haartman Institute, University of Helsinki and HUSLAB, P.O. Box 21, Haartmaninkatu 3, FI-00014 Helsinki (Finland)

    2011-06-03

    Highlights: {yields} Stanniocalcin 1 (STC1) binds heme through novel heme binding motif. {yields} Central iron atom of heme and cysteine-114 of STC1 are essential for binding. {yields} STC1 binds Fe{sup 2+} and Fe{sup 3+} heme. {yields} STC1 peptide prevents oxidative decay of heme. -- Abstract: Hemin (iron protoporphyrin IX) is a necessary component of many proteins, functioning either as a cofactor or an intracellular messenger. Hemoproteins have diverse functions, such as transportation of gases, gas detection, chemical catalysis and electron transfer. Stanniocalcin 1 (STC1) is a protein involved in respiratory responses of the cell but whose mechanism of action is still undetermined. We examined the ability of STC1 to bind hemin in both its reduced and oxidized states and located Cys{sup 114} as the axial ligand of the central iron atom of hemin. The amino acid sequence differs from the established (Cys-Pro) heme regulatory motif (HRM) and therefore presents a novel heme binding motif (Cys-Ser). A STC1 peptide containing the heme binding sequence was able to inhibit both spontaneous and H{sub 2}O{sub 2} induced decay of hemin. Binding of hemin does not affect the mitochondrial localization of STC1.

  6. Insights into the molecular evolution of the PDZ/LIM family and identification of a novel conserved protein motif.

    Directory of Open Access Journals (Sweden)

    Aartjan J W Te Velthuis

    Full Text Available The PDZ and LIM domain-containing protein family is encoded by a diverse group of genes whose phylogeny has currently not been analyzed. In mammals, ten genes are found that encode both a PDZ- and one or several LIM-domains. These genes are: ALP, RIL, Elfin (CLP36, Mystique, Enigma (LMP-1, Enigma homologue (ENH, ZASP (Cypher, Oracle, LMO7 and the two LIM domain kinases (LIMK1 and LIMK2. As conventional alignment and phylogenetic procedures of full-length sequences fell short of elucidating the evolutionary history of these genes, we started to analyze the PDZ and LIM domain sequences themselves. Using information from most sequenced eukaryotic lineages, our phylogenetic analysis is based on full-length cDNA-, EST-derived- and genomic- PDZ and LIM domain sequences of over 25 species, ranging from yeast to humans. Plant and protozoan homologs were not found. Our phylogenetic analysis identifies a number of domain duplication and rearrangement events, and shows a single convergent event during evolution of the PDZ/LIM family. Further, we describe the separation of the ALP and Enigma subfamilies in lower vertebrates and identify a novel consensus motif, which we call 'ALP-like motif' (AM. This motif is highly-conserved between ALP subfamily proteins of diverse organisms. We used here a combinatorial approach to define the relation of the PDZ and LIM domain encoding genes and to reconstruct their phylogeny. This analysis allowed us to classify the PDZ/LIM family and to suggest a meaningful model for the molecular evolution of the diverse gene architectures found in this multi-domain family.

  7. Purification and functional motifs of the recombinant ATPase of orf virus.

    Science.gov (United States)

    Lin, Fong-Yuan; Chan, Kun-Wei; Wang, Chi-Young; Wong, Min-Liang; Hsu, Wei-Li

    2011-10-01

    Our previous study showed that the recombinant ATPase encoded by the A32L gene of orf virus displayed ATP hydrolysis activity as predicted from its amino acids sequence. This viral ATPase contains four known functional motifs (motifs I-IV) and a novel AYDG motif; they are essential for ATP hydrolysis reaction by binding ATP and magnesium ions. The motifs I and II correspond with the Walker A and B motifs of the typical ATPase, respectively. To examine the biochemical roles of these five conserved motifs, recombinant ATPases of five deletion mutants derived from the Taiping strain were expressed and purified. Their ATPase functions were assayed and compared with those of two wild type strains, Taiping and Nantou isolated in Taiwan. Our results showed that deletions at motifs I-III or IV exhibited lower activity than that of the wild type. Interestingly, deletion of AYDG motif decreased the ATPase activity more significantly than those of motifs I-IV deletions. Divalent ions such as magnesium and calcium were essential for ATPase activity. Moreover, our recombinant proteins of orf virus also demonstrated GTPase activity, though weaker than the original ATPase activity. Copyright © 2011 Elsevier Inc. All rights reserved.

  8. Intronic L1 retrotransposons and nested genes cause transcriptional interference by inducing intron retention, exonization and cryptic polyadenylation.

    Directory of Open Access Journals (Sweden)

    Kristel Kaer

    Full Text Available Transcriptional interference has been recently recognized as an unexpectedly complex and mostly negative regulation of genes. Despite a relatively few studies that emerged in recent years, it has been demonstrated that a readthrough transcription derived from one gene can influence the transcription of another overlapping or nested gene. However, the molecular effects resulting from this interaction are largely unknown.Using in silico chromosome walking, we searched for prematurely terminated transcripts bearing signatures of intron retention or exonization of intronic sequence at their 3' ends upstream to human L1 retrotransposons, protein-coding and noncoding nested genes. We demonstrate that transcriptional interference induced by intronic L1s (or other repeated DNAs and nested genes could be characterized by intron retention, forced exonization and cryptic polyadenylation. These molecular effects were revealed from the analysis of endogenous transcripts derived from different cell lines and tissues and confirmed by the expression of three minigenes in cell culture. While intron retention and exonization were comparably observed in introns upstream to L1s, forced exonization was preferentially detected in nested genes. Transcriptional interference induced by L1 or nested genes was dependent on the presence or absence of cryptic splice sites, affected the inclusion or exclusion of the upstream exon and the use of cryptic polyadenylation signals.Our results suggest that transcriptional interference induced by intronic L1s and nested genes could influence the transcription of the large number of genes in normal as well as in tumor tissues. Therefore, this type of interference could have a major impact on the regulation of the host gene expression.

  9. Inheritance of the group I rDNA intron in Tetrahymena pigmentosa

    DEFF Research Database (Denmark)

    Nielsen, Henrik; Simon, E M; Engberg, J

    1992-01-01

    - strains looking for a strong polarity in the inheritance of the intron (intron homing). Based on the genetic analysis we find that the intron in T. pigmentosa is inherited as a neutral character and that intron+ and intron- alleles segregate in a Mendelian fashion with no sign of intron homing...

  10. The ability to form full-length intron RNA circles is a general property of nuclear group I introns

    DEFF Research Database (Denmark)

    Nielsen, Henrik; Fiskaa, Tonje; Birgisdottir, Asa Birna

    2003-01-01

    at the expense of the host. The circularization pathway has distinct structural requirements that differ from those of splicing and appears to be specifically suppressed in vivo. The ability to form full-length circles is found in all types of nuclear group I introns, including those from the Tetrahymena...... ribosomal DNA. The biological function of the full-length circles is not known, but the fact that the circles contain the entire genetic information of the intron suggests a role in intron mobility....

  11. Discriminative Motif Discovery via Simulated Evolution and Random Under-Sampling

    OpenAIRE

    Song, Tao; Gu, Hong

    2014-01-01

    Conserved motifs in biological sequences are closely related to their structure and functions. Recently, discriminative motif discovery methods have attracted more and more attention. However, little attention has been devoted to the data imbalance problem, which is one of the main reasons affecting the performance of the discriminative models. In this article, a simulated evolution method is applied to solve the multi-class imbalance problem at the stage of data preprocessing, and at the sta...

  12. The conserved dileucine- and tyrosine-based motifs in MLV and MPMV envelope glycoproteins are both important to regulate a common Env intracellular trafficking

    Directory of Open Access Journals (Sweden)

    Lopez-Vergès Sandra

    2006-09-01

    Full Text Available Abstract Background Retrovirus particles emerge from the assembly of two structural protein components, Gag that is translated as a soluble protein in the cytoplasm of the host cells, and Env, a type I transmembrane protein. Because both components are translated in different intracellular compartments, elucidating the mechanisms of retrovirus assembly thus requires the study of their intracellular trafficking. Results We used a CD25 (Tac chimera-based approach to study the trafficking of Moloney murine leukemia virus and Mason-Pfizer monkey virus Env proteins. We found that the cytoplasmic tails (CTs of both Env conserved two major signals that control a complex intracellular trafficking. A dileucine-based motif controls the sorting of the chimeras from the trans-Golgi network (TGN toward endosomal compartments. Env proteins then follow a retrograde transport to the TGN due to the action of a tyrosine-based motif. Mutation of either motif induces the mis-localization of the chimeric proteins and both motifs are found to mediate interactions of the viral CTs with clathrin adaptors. Conclusion This data reveals the unexpected complexity of the intracellular trafficking of retrovirus Env proteins that cycle between the TGN and endosomes. Given that Gag proteins hijack endosomal host proteins, our work suggests that the endosomal pathway may be used by retroviruses to ensure proper encountering of viral structural Gag and Env proteins in cells, an essential step of virus assembly.

  13. G-quadruplex DNA sequences are evolutionarily conserved and associated with distinct genomic features in Saccharomyces cerevisiae.

    Directory of Open Access Journals (Sweden)

    John A Capra

    2010-07-01

    Full Text Available G-quadruplex DNA is a four-stranded DNA structure formed by non-Watson-Crick base pairing between stacked sets of four guanines. Many possible functions have been proposed for this structure, but its in vivo role in the cell is still largely unresolved. We carried out a genome-wide survey of the evolutionary conservation of regions with the potential to form G-quadruplex DNA structures (G4 DNA motifs across seven yeast species. We found that G4 DNA motifs were significantly more conserved than expected by chance, and the nucleotide-level conservation patterns suggested that the motif conservation was the result of the formation of G4 DNA structures. We characterized the association of conserved and non-conserved G4 DNA motifs in Saccharomyces cerevisiae with more than 40 known genome features and gene classes. Our comprehensive, integrated evolutionary and functional analysis confirmed the previously observed associations of G4 DNA motifs with promoter regions and the rDNA, and it identified several previously unrecognized associations of G4 DNA motifs with genomic features, such as mitotic and meiotic double-strand break sites (DSBs. Conserved G4 DNA motifs maintained strong associations with promoters and the rDNA, but not with DSBs. We also performed the first analysis of G4 DNA motifs in the mitochondria, and surprisingly found a tenfold higher concentration of the motifs in the AT-rich yeast mitochondrial DNA than in nuclear DNA. The evolutionary conservation of the G4 DNA motif and its association with specific genome features supports the hypothesis that G4 DNA has in vivo functions that are under evolutionary constraint.

  14. Frequent gain and loss of introns in fungal cytochrome b genes.

    Directory of Open Access Journals (Sweden)

    Liang-Fen Yin

    Full Text Available In this study, all available cytochrome b (Cyt b genes from the GOBASE database were compiled and the evolutionary dynamics of the Cyt b gene introns was assessed. Cyt b gene introns were frequently present in the fungal kingdom and some lower plants, but generally absent or rare in Chromista, Protozoa, and Animalia. Fungal Cyt b introns were found at 35 positions in Cyt b genes and the number of introns varied at individual positions from a single representative to 32 different introns at position 131, showing a wide and patchy distribution. Many homologous introns were present at the same position in distantly related species but absent in closely related species, suggesting that introns of the Cyt b genes were frequently lost. On the other hand, highly similar intron sequences were observed in some distantly related species rather than in closely related species, suggesting that these introns were gained independently, likely through lateral transfers. The intron loss-and-gain events could be mediated by transpositions that might have occurred between nuclear and mitochondria. Southern hybridization analysis confirmed that some introns contained repetitive sequences and might be transposable elements. An intron gain in Botryotinia fuckeliana prevented the development of QoI fungicide resistance, suggesting that intron loss-and-gain events were not necessarily beneficial to their host organisms.

  15. Intronic microRNAs

    International Nuclear Information System (INIS)

    Ying, S.-Y.; Lin, S.-L.

    2005-01-01

    MicroRNAs (miRNAs), small single-stranded regulatory RNAs capable of interfering with intracellular mRNAs that contain partial complementarity, are useful for the design of new therapies against cancer polymorphism and viral mutation. MiRNA was originally discovered in the intergenic regions of the Caenorhabditis elegans genome as native RNA fragments that modulate a wide range of genetic regulatory pathways during animal development. However, neither RNA promoter nor polymerase responsible for miRNA biogenesis was determined. Recent findings of intron-derived miRNA in C. elegans, mouse, and human have inevitably led to an alternative pathway for miRNA biogenesis, which relies on the coupled interaction of Pol-II-mediated pre-mRNA transcription and intron excision, occurring in certain nuclear regions proximal to genomic perichromatin fibrils

  16. Genome-wide identification, phylogenetic classification, and exon-intron structure characterisation of the tubulin and actin genes in flax (Linum usitatissimum).

    Science.gov (United States)

    Pydiura, Nikolay; Pirko, Yaroslav; Galinousky, Dmitry; Postovoitova, Anastasiia; Yemets, Alla; Kilchevsky, Aleksandr; Blume, Yaroslav

    2018-06-08

    Flax (Linum usitatissimum L.) is a valuable food and fiber crop cultivated for its quality fiber and seed oil. α-, β-, γ-tubulins and actins are the main structural proteins of the cytoskeleton. α- and γ-tubulin and actin genes have not been characterized yet in the flax genome. In this study, we have identified 6 α-tubulin genes, 13 β-tubulin genes, 2 γ-tubulin genes, and 15 actin genes in the flax genome and analysed the phylogenetic relationships between flax and A. thaliana tubulin and actin genes. Six α-tubulin genes are represented by 3 paralogous pairs, among 13 β-tubulin genes 7 different isotypes can be distinguished, 6 of which are encoded by two paralogous genes each. γ-tubulin is represented by a paralogous pair of genes one of which may be not functional. Fifteen actin genes represent 7 paralogous pairs - 7 actin isotypes and a sequentially duplicated copy of one of the genes of one of the isotypes. Exon-intron structure analysis has shown intron length polymorphism within the β-tubulin genes and intron number variation among the α-tubulin gene: 3 or 4 introns are found in two or four genes, respectively. Intron positioning occurs at conservative sites, as observed in numerous other plant species. Flax actin genes show both intron length polymorphisms and variation in the number of intron that may be 2 or 3. These data will be useful to support further studies on the specificity, functioning, regulation and evolution of the flax cytoskeleton proteins. This article is protected by copyright. All rights reserved.

  17. Conservation of batik: Conseptual framework of design and process development

    Science.gov (United States)

    Syamwil, Rodia

    2018-03-01

    Development of Conservation Batik concept becomes critical due to the recessive of traditional batik as the intangible cultural heritage of humanity. The existence of printed batik, polluting process, and new stream design becomes the consequences of batik industry transformation to creative industry. Conservation Batik was proposed to answer all the threats to traditional batik, in the aspect of technique, process, and motif. However, creativities are also critical to meet consumer satisfaction. Research and development was conducted, start with the initial research in formulating the concept, and exploration of ideas to develop the designs of conservation motifs. In development steps, cyclical process to complete motif with high preferences, in the aspect of aesthetics, productivity, and efficiency. Data were collected through bibliography, documentation, observation, and interview, and analyzed in qualitative methods. The concept of Conservation Batik adopted from the principles of Universitas Negeri Semarang (UNNES) vision, as well as theoretical analyses, and expert judgment. Conservation Batik are assessed from three aspect, design, process, and consumer preferences. Conservation means the effort of safeguarding, promoting, maintaining, and preserving. Concervation Batik concept could be interpreted as batik with: (1) traditional values and authenticity; (2) the values of philosophycal meanings; (3) eco-friendly process with minimum waste; (4) conservation as idea resources of design; and (5) raising up of classic motifs.

  18. Discovering Motifs in Biological Sequences Using the Micron Automata Processor.

    Science.gov (United States)

    Roy, Indranil; Aluru, Srinivas

    2016-01-01

    Finding approximately conserved sequences, called motifs, across multiple DNA or protein sequences is an important problem in computational biology. In this paper, we consider the (l, d) motif search problem of identifying one or more motifs of length l present in at least q of the n given sequences, with each occurrence differing from the motif in at most d substitutions. The problem is known to be NP-complete, and the largest solved instance reported to date is (26,11). We propose a novel algorithm for the (l,d) motif search problem using streaming execution over a large set of non-deterministic finite automata (NFA). This solution is designed to take advantage of the micron automata processor, a new technology close to deployment that can simultaneously execute multiple NFA in parallel. We demonstrate the capability for solving much larger instances of the (l, d) motif search problem using the resources available within a single automata processor board, by estimating run-times for problem instances (39,18) and (40,17). The paper serves as a useful guide to solving problems using this new accelerator technology.

  19. MotifNet: a web-server for network motif analysis.

    Science.gov (United States)

    Smoly, Ilan Y; Lerman, Eugene; Ziv-Ukelson, Michal; Yeger-Lotem, Esti

    2017-06-15

    Network motifs are small topological patterns that recur in a network significantly more often than expected by chance. Their identification emerged as a powerful approach for uncovering the design principles underlying complex networks. However, available tools for network motif analysis typically require download and execution of computationally intensive software on a local computer. We present MotifNet, the first open-access web-server for network motif analysis. MotifNet allows researchers to analyze integrated networks, where nodes and edges may be labeled, and to search for motifs of up to eight nodes. The output motifs are presented graphically and the user can interactively filter them by their significance, number of instances, node and edge labels, and node identities, and view their instances. MotifNet also allows the user to distinguish between motifs that are centered on specific nodes and motifs that recur in distinct parts of the network. MotifNet is freely available at http://netbio.bgu.ac.il/motifnet . The website was implemented using ReactJs and supports all major browsers. The server interface was implemented in Python with data stored on a MySQL database. estiyl@bgu.ac.il or michaluz@cs.bgu.ac.il. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  20. An intron capture strategy used to identify and map a lysyl oxidase-like gene on chromosome 9 in the mouse

    Energy Technology Data Exchange (ETDEWEB)

    Wydner, K.S.; Passmore, H.C. [Rutgers Univ., Piscataway, NJ (United States); Kim, Houngho; Csiszar, K.; Boyd, C.D. [UMDNJ, New Brunswick, NJ (United States)

    1997-03-01

    An intron capture strategy involving use of polymerase chain reaction was used to identify and map the mouse homologue of a human lysyl oxidase-like gene (LOXL). Oligonucleotides complementary to conserved domains within exons 4 and 5 of the human lysyl oxidase-like gene were used to amplify the corresponding segment from mouse genomic DNA. Sequencing of the resulting mouse DNA fragment of approximately 1 kb revealed that the exon sequences at the ends of the amplified fragment are highly homologous (90% nucleotide identity) to exons 4 and 5 of the human lysyl oxidase-like gene. An AluI restriction site polymorphism within intron 4 was used to map the mouse lysyl oxidase-like gene (Loxl) to mouse Chromosome 9 in a region that shares linkage conservation with human chromosome 15q24, to which the LOXL was recently mapped. 22 refs., 3 figs.

  1. Glycine in the conserved motif III modulates the thermostability and oxidative stress resistance of peptide deformylase in Mycobacterium tuberculosis.

    Science.gov (United States)

    Narayanan, Sai Shyam; Sokkar, Pandian; Ramachandran, Murugesan; Nampoothiri, Kesavan Madhavan

    2011-07-01

    Peptide deformylase (PDF) catalyses the removal of the N-formyl group from the nascent polypeptide during protein maturation. The PDF of Mycobacterium tuberculosis H37Rv (MtbPDF), overexpressed and purified from Escherichia coli, was characterized as an iron-containing enzyme with stability towards H(2) O(2) and moderate thermostability. Substitution of two conserved residues (G49 and L107) from MtbPDF with the corresponding residues found in human PDF affected its deformylase activity. Among characterized PDFs, glycine (G151) in motif III instead of conserved aspartate is characteristic of M. tuberculosis. Although the G151D mutation in MtbPDF increased its deformylase activity and thermostability, it also affected enzyme stability towards H(2) O(2) . Molecular dynamics and docking results confirmed improved substrate binding and catalysis for the G151D mutant and the study provides another possible molecular basis for the stability of MtbPDF against oxidizing agents. © 2011 Federation of European Microbiological Societies. Published by Blackwell Publishing Ltd. All rights reserved.

  2. The Half-Life of the HSV-1 1.5 kb LAT Intron is similar to the half-Life of the 2.0 kb LAT Intron

    Science.gov (United States)

    Brinkman, Kerry K.; Mishra, Prakhar; Fraser, Nigel W.

    2013-01-01

    Herpes Simplex Virus type 1 (HSV-1) establishes a latent infection in the sensory neurons of the peripheral nervous system of humans. Although about 80 genes are expressed during the lytic cycle of the virus infection, essentially only one gene is expressed during the latent cycle. This gene is known as the latency associated transcript (LAT) and it appears to play a role in the latency cycle through an anti-apoptotic function in the 5’ end of the gene and miRNA encoded along the length of the transcript which down regulate some of the viral immediate early (IE) gene products. The LAT gene is about 8.3 kb long and consists of two exons separated by an unusual intron. The intron between the exons consists of two nested introns. This arrangement of introns has been called a twintron. Furthermore, the larger (2 kb) intron has been shown to be very stable. In this study we measure the stability of the shorter 1.5 kb nested intron and find its half-life is similar to the longer intron. This was achieved by deleting the 0.5 kb overlapping intron from a plasmid construct designed to express the LAT transcript from a tet-inducible promoter, and measuring the half-life of the 1.5 kb intron in tissue culture cells. This finding supports the hypothesis that it is the common branch-point region of these nested introns that is responsible for their stability. PMID:23335177

  3. Newly evolved introns in human retrogenes provide novel insights into their evolutionary roles

    Directory of Open Access Journals (Sweden)

    Kang Li-Fang

    2012-07-01

    Full Text Available Abstract Background Retrogenes generally do not contain introns. However, in some instances, retrogenes may recruit internal exonic sequences as introns, which is known as intronization. A retrogene that undergoes intronization is a good model with which to investigate the origin of introns. Nevertheless, previously, only two cases in vertebrates have been reported. Results In this study, we systematically screened the human (Homo sapiens genome for retrogenes that evolved introns and analyzed their patterns in structure, expression and origin. In total, we identified nine intron-containing retrogenes. Alignment of pairs of retrogenes and their parents indicated that, in addition to intronization (five cases, retrogenes also may have gained introns by insertion of external sequences into the genes (one case or reversal of the orientation of transcription (three cases. Interestingly, many intronizations were promoted not by base substitutions but by cryptic splice sites, which were silent in the parental genes but active in the retrogenes. We also observed that the majority of introns generated by intronization did not involve frameshifts. Conclusions Intron gains in retrogenes are not as rare as previously thought. Furthermore, diverse mechanisms may lead to intron creation in retrogenes. The activation of cryptic splice sites in the intronization of retrogenes may be triggered by the change of gene structure after retroposition. A high percentage of non-frameshift introns in retrogenes may be because non-frameshift introns do not dramatically affect host proteins. Introns generated by intronization in human retrogenes are generally young, which is consistent with previous findings for Caenorhabditis elegans. Our results provide novel insights into the evolutionary role of introns.

  4. A 3.0-kb deletion including an erythroid cell-specific regulatory element in intron 1 of the ABO blood group gene in an individual with the Bm phenotype.

    Science.gov (United States)

    Sano, R; Kuboya, E; Nakajima, T; Takahashi, Y; Takahashi, K; Kubo, R; Kominato, Y; Takeshita, H; Yamao, H; Kishida, T; Isa, K; Ogasawara, K; Uchikawa, M

    2015-04-01

    We developed a sequence-specific primer PCR (SSP-PCR) for detection of a 5.8-kb deletion (B(m) 5.8) involving an erythroid cell-specific regulatory element in intron 1 of the ABO blood group gene. Using this SSP-PCR, we performed genetic analysis of 382 individuals with Bm or ABm. The 5.8-kb deletion was found in 380 individuals, and disruption of the GATA motif in the regulatory element was found in one individual. Furthermore, a novel 3.0-kb deletion involving the element (B(m) 3.0) was demonstrated in the remaining individual. Comparisons of single-nucleotide polymorphisms and microsatellites in intron 1 between B(m) 5.8 and B(m) 3.0 suggested that these deletions occurred independently. © 2014 International Society of Blood Transfusion.

  5. Introns in the genome of bacteriophage T4

    International Nuclear Information System (INIS)

    Gott, J.M.

    1987-01-01

    RNA from T4-infected cells yields multiple end-labeled species when incubated with [α- 32 P]GTP under self-splicing conditions. One of these corresponds to the previously characterized intron from the T4 td gene and, as shown in this work, the others represent additional group I introns in T4. Two loci distinct from the td gene were found to hybridize to the mixed GTP-labeled T4 RNA probe. These were mapped to the unlinked genes nrdB and sunY. Cloned DNA from the nrdB region that contained the intron was shown to generate characteristic group I splice products with RNA synthesized in vivo or in vitro. The splice junction of the nrdB gene was determined and the nature of the RNA reaction products characterized. In vivo expression of the nrdB gene and the open reading frame within the intron was studied using in-frame lacZ fusions and primer extension analyses. The data suggest that expression of the intron open reading frame is highly regulated during T4 infection. Possible regulatory mechanisms are discussed

  6. Relative Stabilities of Conserved and Non-Conserved Structures in the OB-Fold Superfamily

    Directory of Open Access Journals (Sweden)

    Andrei T. Alexandrescu

    2009-05-01

    Full Text Available The OB-fold is a diverse structure superfamily based on a β-barrel motif that is often supplemented with additional non-conserved secondary structures. Previous deletion mutagenesis and NMR hydrogen exchange studies of three OB-fold proteins showed that the structural stabilities of sites within the conserved β-barrels were larger than sites in non-conserved segments. In this work we examined a database of 80 representative domain structures currently classified as OB-folds, to establish the basis of this effect. Residue-specific values were obtained for the number of Cα-Cα distance contacts, sequence hydrophobicities, crystallographic B-factors, and theoretical B-factors calculated from a Gaussian Network Model. All four parameters point to a larger average flexibility for the non-conserved structures compared to the conserved β-barrels. The theoretical B-factors and contact densities show the highest sensitivity.Our results suggest a model of protein structure evolution in which novel structural features develop at the periphery of conserved motifs. Core residues are more resistant to structural changes during evolution since their substitution would disrupt a larger number of interactions. Similar factors are likely to account for the differences in stability to unfolding between conserved and non-conserved structures.

  7. Automatic annotation of protein motif function with Gene Ontology terms

    Directory of Open Access Journals (Sweden)

    Gopalakrishnan Vanathi

    2004-09-01

    Full Text Available Abstract Background Conserved protein sequence motifs are short stretches of amino acid sequence patterns that potentially encode the function of proteins. Several sequence pattern searching algorithms and programs exist foridentifying candidate protein motifs at the whole genome level. However, amuch needed and importanttask is to determine the functions of the newly identified protein motifs. The Gene Ontology (GO project is an endeavor to annotate the function of genes or protein sequences with terms from a dynamic, controlled vocabulary and these annotations serve well as a knowledge base. Results This paperpresents methods to mine the GO knowledge base and use the association between the GO terms assigned to a sequence and the motifs matched by the same sequence as evidence for predicting the functions of novel protein motifs automatically. The task of assigning GO terms to protein motifsis viewed as both a binary classification and information retrieval problem, where PROSITE motifs are used as samples for mode training and functional prediction. The mutual information of a motif and aGO term association isfound to be a very useful feature. We take advantageof the known motifs to train a logistic regression classifier, which allows us to combine mutual information with other frequency-based features and obtain a probability of correctassociation. The trained logistic regression model has intuitively meaningful and logically plausible parameter values, and performs very well empirically according to our evaluation criteria. Conclusions In this research, different methods for automatic annotation of protein motifs have been investigated. Empirical result demonstrated that the methods have a great potential for detecting and augmenting information about thefunctions of newly discovered candidate protein motifs.

  8. The ARTT motif and a unified structural understanding of substraterecognition in ADP ribosylating bacterial toxins and eukaryotic ADPribosyltransferases

    Energy Technology Data Exchange (ETDEWEB)

    Han, S.; Tainer, J.A.

    2001-08-01

    ADP-ribosylation is a widely occurring and biologically critical covalent chemical modification process in pathogenic mechanisms, intracellular signaling systems, DNA repair, and cell division. The reaction is catalyzed by ADP-ribosyltransferases, which transfer the ADP-ribose moiety of NAD to a target protein with nicotinamide release. A family of bacterial toxins and eukaryotic enzymes has been termed the mono-ADP-ribosyltransferases, in distinction to the poly-ADP-ribosyltransferases, which catalyze the addition of multiple ADP-ribose groups to the carboxyl terminus of eukaryotic nucleoproteins. Despite the limited primary sequence homology among the different ADP-ribosyltransferases, a central cleft bearing NAD-binding pocket formed by the two perpendicular b-sheet core has been remarkably conserved between bacterial toxins and eukaryotic mono- and poly-ADP-ribosyltransferases. The majority of bacterial toxins and eukaryotic mono-ADP-ribosyltransferases are characterized by conserved His and catalytic Glu residues. In contrast, Diphtheria toxin, Pseudomonas exotoxin A, and eukaryotic poly-ADP-ribosyltransferases are characterized by conserved Arg and catalytic Glu residues. The NAD-binding core of a binary toxin and a C3-like toxin family identified an ARTT motif (ADP-ribosylating turn-turn motif) that is implicated in substrate specificity and recognition by structural and mutagenic studies. Here we apply structure-based sequence alignment and comparative structural analyses of all known structures of ADP-ribosyltransfeases to suggest that this ARTT motif is functionally important in many ADP-ribosylating enzymes that bear a NAD binding cleft as characterized by conserved Arg and catalytic Glu residues. Overall, structure-based sequence analysis reveals common core structures and conserved active sites of ADP-ribosyltransferases to support similar NAD binding mechanisms but differing mechanisms of target protein binding via sequence variations within the ARTT

  9. Novel and deviant Walker A ATP-binding motifs in bacteriophage large terminase-DNA packaging proteins

    International Nuclear Information System (INIS)

    Mitchell, Michael S.; Rao, Venigalla B.

    2004-01-01

    Bacteriophage terminases constitute a very interesting class of viral-coded multifunctional ATPase 'motors' that apparently drive directional translocation of DNA into an empty viral capsid. A common Walker A motif and other conserved signatures of a critical ATPase catalytic center are identified in the N-terminal half of numerous large terminase proteins. However, several terminases, including the well-characterized λ and SPP1 terminases, seem to lack the classic Walker A in the N-terminus. Using sequence alignment approaches, we discovered the presence of deviant Walker A motifs in these and many other phage terminases. One deviation, the presence of a lysine at the beginning of P-loop, may represent a 3D equivalent of the universally conserved lysine in the Walker A GKT/S signature. This and other novel putative Walker A motifs that first came to light through this study help define the ATPase centers of phage and viral terminases as well as elicit important insights into the molecular functioning of this fundamental motif in biological systems

  10. Drosophila polytene chromosome bands formed by gene introns.

    Science.gov (United States)

    Zhimulev, I F; Boldyreva, L V; Demakova, O V; Poholkova, G V; Khoroshko, V A; Zykova, T Yu; Lavrov, S A; Belyaeva, E S

    2016-01-01

    Genetic organization of bands and interbands in polytene chromosomes has long remained a puzzle for geneticists. It has been recently demonstrated that interbands typically correspond to the 5'-ends of house-keeping genes, whereas adjacent loose bands tend to be composed of coding sequences of the genes. In the present work, we made one important step further and mapped two large introns of ubiquitously active genes on the polytene chromosome map. We show that alternative promoter regions of these genes map to interbands, whereas introns and coding sequences found between those promoters correspond to loose grey bands. Thus, a gene having its long intron "sandwiched" between to alternative promoters and a common coding sequence may occupy two interbands and one band in the context of polytene chromosomes. Loose, partially decompacted bands appear to host large introns.

  11. Proliferation of group II introns in the chloroplast genome of the green alga Oedocladium carolinianum (Chlorophyceae

    Directory of Open Access Journals (Sweden)

    Jean-Simon Brouard

    2016-10-01

    Full Text Available Background The chloroplast genome sustained extensive changes in architecture during the evolution of the Chlorophyceae, a morphologically and ecologically diverse class of green algae belonging to the Chlorophyta; however, the forces driving these changes are poorly understood. The five orders recognized in the Chlorophyceae form two major clades: the CS clade consisting of the Chlamydomonadales and Sphaeropleales, and the OCC clade consisting of the Oedogoniales, Chaetophorales, and Chaetopeltidales. In the OCC clade, considerable variations in chloroplast DNA (cpDNA structure, size, gene order, and intron content have been observed. The large inverted repeat (IR, an ancestral feature characteristic of most green plants, is present in Oedogonium cardiacum (Oedogoniales but is lacking in the examined members of the Chaetophorales and Chaetopeltidales. Remarkably, the Oedogonium 35.5-kb IR houses genes that were putatively acquired through horizontal DNA transfer. To better understand the dynamics of chloroplast genome evolution in the Oedogoniales, we analyzed the cpDNA of a second representative of this order, Oedocladium carolinianum. Methods The Oedocladium cpDNA was sequenced and annotated. The evolutionary distances separating Oedocladium and Oedogonium cpDNAs and two other pairs of chlorophycean cpDNAs were estimated using a 61-gene data set. Phylogenetic analysis of an alignment of group IIA introns from members of the OCC clade was performed. Secondary structures and insertion sites of oedogonialean group IIA introns were analyzed. Results The 204,438-bp Oedocladium genome is 7.9 kb larger than the Oedogonium genome, but its repertoire of conserved genes is remarkably similar and gene order differs by only one reversal. Although the 23.7-kb IR is missing the putative foreign genes found in Oedogonium, it contains sequences coding for a putative phage or bacterial DNA primase and a hypothetical protein. Intergenic sequences are 1.5-fold

  12. The mitochondrial LSU rRNA group II intron of Ustilago maydis encodes an active homing endonuclease likely involved in intron mobility.

    Directory of Open Access Journals (Sweden)

    Anja Pfeifer

    Full Text Available BACKGROUND: The a2 mating type locus gene lga2 is critical for uniparental mitochondrial DNA inheritance during sexual development of Ustilago maydis. Specifically, the absence of lga2 results in biparental inheritance, along with efficient transfer of intronic regions in the large subunit rRNA gene between parental molecules. However, the underlying role of the predicted LAGLIDADG homing endonuclease gene I-UmaI located within the group II intron LRII1 has remained unresolved. METHODOLOGY/PRINCIPAL FINDINGS: We have investigated the enzymatic activity of I-UmaI in vitro based on expression of a tagged full-length and a naturally occurring mutant derivative, which harbors only the N-terminal LAGLIDADG domain. This confirmed Mg²⁺-dependent endonuclease activity and cleavage at the LRII1 insertion site to generate four base pair extensions with 3' overhangs. Specifically, I-UmaI recognizes an asymmetric DNA sequence with a minimum length of 14 base pairs (5'-GACGGGAAGACCCT-3' and tolerates subtle base pair substitutions within the homing site. Enzymatic analysis of the mutant variant indicated a correlation between the activity in vitro and intron homing. Bioinformatic analyses revealed that putatively functional or former functional I-UmaI homologs are confined to a few members within the Ustilaginales and Agaricales, including the phylogenetically distant species Lentinula edodes, and are linked to group II introns inserted into homologous positions in the LSU rDNA. CONCLUSIONS/SIGNIFICANCE: The present data provide strong evidence that intron homing efficiently operates under conditions of biparental inheritance in U. maydis. Conversely, uniparental inheritance may be critical to restrict the transmission of mobile introns. Bioinformatic analyses suggest that I-UmaI-associated introns have been acquired independently in distant taxa and are more widespread than anticipated from available genomic data.

  13. Determination of 5 '-leader sequences from radically disparate strains of porcine reproductive and respiratory syndrome virus reveals the presence of highly conserved sequence motifs

    DEFF Research Database (Denmark)

    Oleksiewicz, M.B.; Bøtner, Anette; Nielsen, Jens

    1999-01-01

    We determined the untranslated 5'-leader sequence for three different isolates of porcine reproductive and respiratory syndrome virus (PRRSV): pathogenic European- and American-types, as well as an American-type vaccine strain. 5'-leader from European- and American-type PRRSV differed in length...... (220 and 190 nt, respectively), and exhibited only approximately 50% nucleotide homology. Nevertheless, highly conserved areas were identified in the leader of all 3 PRRSV isolates, which constitute candidate motifs for binding of protein(s) involved in viral replication. These comparative data provide...

  14. A site-specific endonuclease encoded by a typical archaeal intron

    DEFF Research Database (Denmark)

    Dalgaard, Jacob; Garrett, Roger Antony; Belfort, Malene

    1993-01-01

    The protein encoded by the archaeal intron in the 23S rRNA gene of the hyperthermophile Desulfurococcus mobilis is a double-strand DNase that, like group I intron homing endonucleases, is capable of cleaving an intronless allele of the gene. This enzyme, I-Dmo I, is unusual among the intron...

  15. Mouse transgenesis identifies conserved functional enhancers and cis-regulatory motif in the vertebrate LIM homeobox gene Lhx2 locus.

    Directory of Open Access Journals (Sweden)

    Alison P Lee

    Full Text Available The vertebrate Lhx2 is a member of the LIM homeobox family of transcription factors. It is essential for the normal development of the forebrain, eye, olfactory system and liver as well for the differentiation of lymphoid cells. However, despite the highly restricted spatio-temporal expression pattern of Lhx2, nothing is known about its transcriptional regulation. In mammals and chicken, Crb2, Dennd1a and Lhx2 constitute a conserved linkage block, while the intervening Dennd1a is lost in the fugu Lhx2 locus. To identify functional enhancers of Lhx2, we predicted conserved noncoding elements (CNEs in the human, mouse and fugu Crb2-Lhx2 loci and assayed their function in transgenic mouse at E11.5. Four of the eight CNE constructs tested functioned as tissue-specific enhancers in specific regions of the central nervous system and the dorsal root ganglia (DRG, recapitulating partial and overlapping expression patterns of Lhx2 and Crb2 genes. There was considerable overlap in the expression domains of the CNEs, which suggests that the CNEs are either redundant enhancers or regulating different genes in the locus. Using a large set of CNEs (810 CNEs associated with transcription factor-encoding genes that express predominantly in the central nervous system, we predicted four over-represented 8-mer motifs that are likely to be associated with expression in the central nervous system. Mutation of one of them in a CNE that drove reporter expression in the neural tube and DRG abolished expression in both domains indicating that this motif is essential for expression in these domains. The failure of the four functional enhancers to recapitulate the complete expression pattern of Lhx2 at E11.5 indicates that there must be other Lhx2 enhancers that are either located outside the region investigated or divergent in mammals and fishes. Other approaches such as sequence comparison between multiple mammals are required to identify and characterize such enhancers.

  16. Proteome-level assessment of origin, prevalence and function of Leucine-Aspartic Acid (LD) motifs

    KAUST Repository

    Alam, Tanvir

    2018-03-11

    Short Linear Motifs (SLiMs) contribute to almost every cellular function by connecting appropriate protein partners. Accurate prediction of SLiMs is difficult due to their shortness and sequence degeneracy. Leucine-aspartic acid (LD) motifs are SLiMs that link paxillin family proteins to factors controlling (cancer) cell adhesion, motility and survival. The existence and importance of LD motifs beyond the paxillin family is poorly understood. To enable a proteome-wide assessment of these motifs, we developed an active-learning based framework that iteratively integrates computational predictions with experimental validation. Our analysis of the human proteome identified a dozen proteins that contain LD motifs, all being involved in cell adhesion and migration, and revealed a new type of inverse LD motif consensus. Our evolutionary analysis suggested that LD motif signalling originated in the common unicellular ancestor of opisthokonts and amoebozoa by co-opting nuclear export sequences. Inter-species comparison revealed a conserved LD signalling core, and reveals the emergence of species-specific adaptive connections, while maintaining a strong functional focus of the LD motif interactome. Collectively, our data elucidate the mechanisms underlying the origin and adaptation of an ancestral SLiM.

  17. WildSpan: mining structured motifs from protein sequences

    Directory of Open Access Journals (Sweden)

    Chen Chien-Yu

    2011-03-01

    Full Text Available Abstract Background Automatic extraction of motifs from biological sequences is an important research problem in study of molecular biology. For proteins, it is desired to discover sequence motifs containing a large number of wildcard symbols, as the residues associated with functional sites are usually largely separated in sequences. Discovering such patterns is time-consuming because abundant combinations exist when long gaps (a gap consists of one or more successive wildcards are considered. Mining algorithms often employ constraints to narrow down the search space in order to increase efficiency. However, improper constraint models might degrade the sensitivity and specificity of the motifs discovered by computational methods. We previously proposed a new constraint model to handle large wildcard regions for discovering functional motifs of proteins. The patterns that satisfy the proposed constraint model are called W-patterns. A W-pattern is a structured motif that groups motif symbols into pattern blocks interleaved with large irregular gaps. Considering large gaps reflects the fact that functional residues are not always from a single region of protein sequences, and restricting motif symbols into clusters corresponds to the observation that short motifs are frequently present within protein families. To efficiently discover W-patterns for large-scale sequence annotation and function prediction, this paper first formally introduces the problem to solve and proposes an algorithm named WildSpan (sequential pattern mining across large wildcard regions that incorporates several pruning strategies to largely reduce the mining cost. Results WildSpan is shown to efficiently find W-patterns containing conserved residues that are far separated in sequences. We conducted experiments with two mining strategies, protein-based and family-based mining, to evaluate the usefulness of W-patterns and performance of WildSpan. The protein-based mining mode

  18. MotifMark: Finding regulatory motifs in DNA sequences.

    Science.gov (United States)

    Hassanzadeh, Hamid Reza; Kolhe, Pushkar; Isbell, Charles L; Wang, May D

    2017-07-01

    The interaction between proteins and DNA is a key driving force in a significant number of biological processes such as transcriptional regulation, repair, recombination, splicing, and DNA modification. The identification of DNA-binding sites and the specificity of target proteins in binding to these regions are two important steps in understanding the mechanisms of these biological activities. A number of high-throughput technologies have recently emerged that try to quantify the affinity between proteins and DNA motifs. Despite their success, these technologies have their own limitations and fall short in precise characterization of motifs, and as a result, require further downstream analysis to extract useful and interpretable information from a haystack of noisy and inaccurate data. Here we propose MotifMark, a new algorithm based on graph theory and machine learning, that can find binding sites on candidate probes and rank their specificity in regard to the underlying transcription factor. We developed a pipeline to analyze experimental data derived from compact universal protein binding microarrays and benchmarked it against two of the most accurate motif search methods. Our results indicate that MotifMark can be a viable alternative technique for prediction of motif from protein binding microarrays and possibly other related high-throughput techniques.

  19. Crystallization and preliminary X-ray diffraction analysis of motif N from Saccharomyces cerevisiae Dbf4

    International Nuclear Information System (INIS)

    Matthews, Lindsay A.; Duong, Andrew; Prasad, Ajai A.; Duncker, Bernard P.; Guarné, Alba

    2009-01-01

    To understand the role of the Cdc7–Dbf4 complex in checkpoint responses, a fragment of Saccharomyces cerevisiae Dbf4 encompassing motif N was isolated, overproduced and crystallized. The Cdc7–Dbf4 complex plays an instrumental role in the initiation of DNA replication and is a target of replication-checkpoint responses in Saccharomyces cerevisiae. Cdc7 is a conserved serine/threonine kinase whose activity depends on association with its regulatory subunit, Dbf4. A conserved sequence near the N-terminus of Dbf4 (motif N) is necessary for the interaction of Cdc7–Dbf4 with the checkpoint kinase Rad53. To understand the role of the Cdc7–Dbf4 complex in checkpoint responses, a fragment of Saccharomyces cerevisiae Dbf4 encompassing motif N was isolated, overproduced and crystallized. A complete native data set was collected at 100 K from crystals that diffracted X-rays to 2.75 Å resolution and structure determination is currently under way

  20. Family of autocatalytic group I introns in bacteriophage T4

    International Nuclear Information System (INIS)

    Shub, D.A.; Xu, M.Q.; Gott, J.M.; Zeeh, A.; Wilson, L.D.

    1987-01-01

    The discovery of an intron in phage T4 encouraged the authors to look for additional group I introns in the T4 genome. Further examples would permit sequence and structural comparisons that might lend insight into their evolutionary origin. Additionally, they hoped that their locations within the T4 genome would infer a possible regulatory function in prokaryotic gene expression. They took advantage of the fact that, since G is added to the 5' end of the intron, autocatalytic group I introns could be specifically labeled in vitro for use as probes for DNA blotting experiments. If Group I introns were in more than just the td gene, multiple RNA species should be labeled when total RNA is extracted from T4-infected cells and incubated with [α- 32 P]GTP in vitro. When used as a probe for a Southern blot of T4 DNA, this RNA should hybridize to several DNA bands

  1. SA-Mot: a web server for the identification of motifs of interest extracted from protein loops.

    Science.gov (United States)

    Regad, Leslie; Saladin, Adrien; Maupetit, Julien; Geneix, Colette; Camproux, Anne-Claude

    2011-07-01

    The detection of functional motifs is an important step for the determination of protein functions. We present here a new web server SA-Mot (Structural Alphabet Motif) for the extraction and location of structural motifs of interest from protein loops. Contrary to other methods, SA-Mot does not focus only on functional motifs, but it extracts recurrent and conserved structural motifs involved in structural redundancy of loops. SA-Mot uses the structural word notion to extract all structural motifs from uni-dimensional sequences corresponding to loop structures. Then, SA-Mot provides a description of these structural motifs using statistics computed in the loop data set and in SCOP superfamily, sequence and structural parameters. SA-Mot results correspond to an interactive table listing all structural motifs extracted from a target structure and their associated descriptors. Using this information, the users can easily locate loop regions that are important for the protein folding and function. The SA-Mot web server is available at http://sa-mot.mti.univ-paris-diderot.fr.

  2. The splicing of tiny introns of Paramecium is controlled by MAGO.

    Science.gov (United States)

    Contreras, Julia; Begley, Victoria; Marsella, Laura; Villalobo, Eduardo

    2018-07-15

    The exon junction complex (EJC) is a key element of the splicing machinery. The EJC core is composed of eIF4A3, MAGO, Y14 and MLN51. Few accessory proteins, such as CWC22 or UPF3, bind transiently to the EJC. The EJC has been implicated in the control of the splicing of long introns. To ascertain whether the EJC controls the splicing of short introns, we used Paramecium tetraurelia as a model organism, since it has thousands of very tiny introns. To elucidate whether EJC affects intron splicing in P. tetraurelia, we searched for EJC protein-coding genes, and silenced those genes coding for eIF4A3, MAGO and CWC22. We found that P. tetraurelia likely assembles an active EJC with only three of the core proteins, since MLN51 is lacking. Silencing of eIF4A3 or CWC22 genes, but not that of MAGO, caused lethality. Silencing of the MAGO gene caused either an increase, decrease, or no change in intron retention levels of some intron-containing mRNAs used as reporters. We suggest that a fine-tuning expression of EJC genes is required for steady intron removal in P. tetraurelia. Taking into consideration our results and those published by others, we conclude that the EJC controls splicing independently of the intron size. Copyright © 2018 Elsevier B.V. All rights reserved.

  3. U12 type introns were lost at multiple occasions during evolution

    Directory of Open Access Journals (Sweden)

    Bartschat Sebastian

    2010-02-01

    Full Text Available Abstract Background Two categories of introns are known, a common U2 type and a rare U12 type. These two types of introns are removed by distinct spliceosomes. The phylogenetic distribution of spliceosomal RNAs that are characteristic of the U12 spliceosome, i.e. the U11, U12, U4atac and U6atac RNAs, suggest that U12 spliceosomes were lost in many phylogenetic groups. We have now examined the distribution of U2 and U12 introns in many of these groups. Results U2 and U12 introns were predicted by making use of available EST and genomic sequences. The results show that in species or branches where U12 spliceosomal components are missing, also U12 type of introns are lacking. Examples are the choanoflagellate Monosiga brevicollis, Entamoeba histolytica, green algae, diatoms, and the fungal lineage Basidiomycota. Furthermore, whereas U12 splicing does not occur in Caenorhabditis elegans, U12 introns as well as U12 snRNAs are present in Trichinella spiralis, which is deeply branching in the nematode tree. A comparison of homologous genes in T. spiralis and C. elegans revealed different mechanisms whereby U12 introns were lost. Conclusions The phylogenetic distribution of U12 introns and spliceosomal RNAs give further support to an early origin of U12 dependent splicing. In addition, this distribution identifies a large number of instances during eukaryotic evolution where such splicing was lost.

  4. A Central Nervous System-Dependent Intron-Embedded Gene Encodes a Novel Murine Fyn Binding Protein.

    Science.gov (United States)

    Ben Khalaf, Noureddine; Taha, Safa; Bakhiet, Moiz; Fathallah, M Dahmani

    2016-01-01

    The interplay between the nervous and immune systems is gradually being unraveled. We previously reported in the mouse the novel soluble immune system factor ISRAA, whose activation in the spleen is central nervous system-dependent. We also showed that ISRAA plays a role in modulating anti-infection immunity. Herein, we report the genomic description of the israa locus, along with some insights into the structure-function relationship of the protein. Our findings revealed that israa is nested within intron 6 of the mouse zmiz1 gene. Protein sequence analysis revealed a typical SH2 binding motif (Y102TEV), with Fyn being the most likely binding partner. Docking simulation showed a favorable conformation for the ISRAA-Fyn complex, with a specific binding mode for the binding of the YTEV motif to the SH2 domain. Experimental studies showed that in vitro, recombinant ISRAA is phosphorylated by Fyn at tyrosine 102. Cell transfection and pull-down experiments revealed Fyn as a binding partner of ISRAA in the EL4 mouse T-cell line. Indeed, we demonstrated that ISRAA downregulates T-cell activation and the phosphorylation of an activation tyrosine (Y416) of Src-family kinases in mouse splenocytes. Our observations highlight ISRAA as a novel Fyn binding protein that is likely to be involved in a signaling pathway driven by the nervous system.

  5. A Central Nervous System-Dependent Intron-Embedded Gene Encodes a Novel Murine Fyn Binding Protein.

    Directory of Open Access Journals (Sweden)

    Noureddine Ben Khalaf

    Full Text Available The interplay between the nervous and immune systems is gradually being unraveled. We previously reported in the mouse the novel soluble immune system factor ISRAA, whose activation in the spleen is central nervous system-dependent. We also showed that ISRAA plays a role in modulating anti-infection immunity. Herein, we report the genomic description of the israa locus, along with some insights into the structure-function relationship of the protein. Our findings revealed that israa is nested within intron 6 of the mouse zmiz1 gene. Protein sequence analysis revealed a typical SH2 binding motif (Y102TEV, with Fyn being the most likely binding partner. Docking simulation showed a favorable conformation for the ISRAA-Fyn complex, with a specific binding mode for the binding of the YTEV motif to the SH2 domain. Experimental studies showed that in vitro, recombinant ISRAA is phosphorylated by Fyn at tyrosine 102. Cell transfection and pull-down experiments revealed Fyn as a binding partner of ISRAA in the EL4 mouse T-cell line. Indeed, we demonstrated that ISRAA downregulates T-cell activation and the phosphorylation of an activation tyrosine (Y416 of Src-family kinases in mouse splenocytes. Our observations highlight ISRAA as a novel Fyn binding protein that is likely to be involved in a signaling pathway driven by the nervous system.

  6. Group II intron inhibits conjugative relaxase expression in bacteria by mRNA targeting

    Science.gov (United States)

    Piazza, Carol Lyn; Smith, Dorie

    2018-01-01

    Group II introns are mobile ribozymes that are rare in bacterial genomes, often cohabiting with various mobile elements, and seldom interrupting housekeeping genes. What accounts for this distribution has not been well understood. Here, we demonstrate that Ll.LtrB, the group II intron residing in a relaxase gene on a conjugative plasmid from Lactococcus lactis, inhibits its host gene expression and restrains the naturally cohabiting mobile element from conjugative horizontal transfer. We show that reduction in gene expression is mainly at the mRNA level, and results from the interaction between exon-binding sequences (EBSs) in the intron and intron-binding sequences (IBSs) in the mRNA. The spliced intron targets the relaxase mRNA and reopens ligated exons, causing major mRNA loss. Taken together, this study provides an explanation for the distribution and paucity of group II introns in bacteria, and suggests a potential force for those introns to evolve into spliceosomal introns. PMID:29905149

  7. Localized Retroprocessing as a Model of Intron Loss in the Plant Mitochondrial Genome.

    Science.gov (United States)

    Cuenca, Argelia; Ross, T Gregory; Graham, Sean W; Barrett, Craig F; Davis, Jerrold I; Seberg, Ole; Petersen, Gitte

    2016-08-03

    Loss of introns in plant mitochondrial genes is commonly explained by retroprocessing. Under this model, an mRNA is reverse transcribed and integrated back into the genome, simultaneously affecting the contents of introns and edited sites. To evaluate the extent to which retroprocessing explains intron loss, we analyzed patterns of intron content and predicted RNA editing for whole mitochondrial genomes of 30 species in the monocot order Alismatales. In this group, we found an unusually high degree of variation in the intron content, even expanding the hitherto known variation among angiosperms. Some species have lost some two-third of the cis-spliced introns. We found a strong correlation between intron content and editing frequency, and detected 27 events in which intron loss is consistent with the presence of nucleotides in an edited state, supporting retroprocessing. However, we also detected seven cases of intron loss not readily being explained by retroprocession. Our analyses are also not consistent with the entire length of a fully processed cDNA copy being integrated into the genome, but instead indicate that retroprocessing usually occurs for only part of the gene. In some cases, several rounds of retroprocessing may explain intron loss in genes completely devoid of introns. A number of taxa retroprocessing seem to be very common and a possibly ongoing process. It affects the entire mitochondrial genome. © The Author 2016. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

  8. Interleukin-11 binds specific EF-hand proteins via their conserved structural motifs.

    Science.gov (United States)

    Kazakov, Alexei S; Sokolov, Andrei S; Vologzhannikova, Alisa A; Permyakova, Maria E; Khorn, Polina A; Ismailov, Ramis G; Denessiouk, Konstantin A; Denesyuk, Alexander I; Rastrygina, Victoria A; Baksheeva, Viktoriia E; Zernii, Evgeni Yu; Zinchenko, Dmitry V; Glazatov, Vladimir V; Uversky, Vladimir N; Mirzabekov, Tajib A; Permyakov, Eugene A; Permyakov, Sergei E

    2017-01-01

    Interleukin-11 (IL-11) is a hematopoietic cytokine engaged in numerous biological processes and validated as a target for treatment of various cancers. IL-11 contains intrinsically disordered regions that might recognize multiple targets. Recently we found that aside from IL-11RA and gp130 receptors, IL-11 interacts with calcium sensor protein S100P. Strict calcium dependence of this interaction suggests a possibility of IL-11 interaction with other calcium sensor proteins. Here we probed specificity of IL-11 to calcium-binding proteins of various types: calcium sensors of the EF-hand family (calmodulin, S100B and neuronal calcium sensors: recoverin, NCS-1, GCAP-1, GCAP-2), calcium buffers of the EF-hand family (S100G, oncomodulin), and a non-EF-hand calcium buffer (α-lactalbumin). A specific subset of the calcium sensor proteins (calmodulin, S100B, NCS-1, GCAP-1/2) exhibits metal-dependent binding of IL-11 with dissociation constants of 1-19 μM. These proteins share several amino acid residues belonging to conservative structural motifs of the EF-hand proteins, 'black' and 'gray' clusters. Replacements of the respective S100P residues by alanine drastically decrease its affinity to IL-11, suggesting their involvement into the association process. Secondary structure and accessibility of the hinge region of the EF-hand proteins studied are predicted to control specificity and selectivity of their binding to IL-11. The IL-11 interaction with the EF-hand proteins is expected to occur under numerous pathological conditions, accompanied by disintegration of plasma membrane and efflux of cellular components into the extracellular milieu.

  9. Differential GC Content between Exons and Introns Establishes Distinct Strategies of Splice-Site Recognition

    Directory of Open Access Journals (Sweden)

    Maayan Amit

    2012-05-01

    Full Text Available During evolution segments of homeothermic genomes underwent a GC content increase. Our analyses reveal that two exon-intron architectures have evolved from an ancestral state of low GC content exons flanked by short introns with a lower GC content. One group underwent a GC content elevation that abolished the differential exon-intron GC content, with introns remaining short. The other group retained the overall low GC content as well as the differential exon-intron GC content, and is associated with longer introns. We show that differential exon-intron GC content regulates exon inclusion level in this group, in which disease-associated mutations often lead to exon skipping. This group's exons also display higher nucleosome occupancy compared to flanking introns and exons of the other group, thus “marking” them for spliceosomal recognition. Collectively, our results reveal that differential exon-intron GC content is a previously unidentified determinant of exon selection and argue that the two GC content architectures reflect the two mechanisms by which splicing signals are recognized: exon definition and intron definition.

  10. Relaxed selection against accidental binding of transcription factors with conserved chromatin contexts.

    Science.gov (United States)

    Babbitt, G A

    2010-10-15

    The spurious (or nonfunctional) binding of transcription factors (TF) to the wrong locations on DNA presents a formidable challenge to genomes given the relatively low ceiling for sequence complexity within the short lengths of most binding motifs. The high potential for the occurrence of random motifs and subsequent nonfunctional binding of many transcription factors should theoretically lead to natural selection against the occurrence of spurious motif throughout the genome. However, because of the active role that chromatin can influence over eukaryotic gene regulation, it may also be expected that many supposed spurious binding sites could escape purifying selection if (A) they simply occur in regions of high nucleosome occupancy or (B) their surrounding chromatin was dynamically involved in their identity and function. We compared nucleosome occupancy and the presence/absence of functionally conserved chromatin context to the strength of selection against spurious binding of various TF binding motifs in Saccharomyces yeast. While we find no direct relationship with nucleosome occupancy, we find strong evidence that transcription factors spatially associated with evolutionarily conserved chromatin states are under relaxed selection against accidental binding. Transcription factors (with/without) a conserved chromatin context were found to occur on average, (87.7%/49.3%) of their expected frequencies. Functional binding motifs with conserved chromatin contexts were also significantly shorter in length and more often clustered. These results indicate a role of chromatin context dependency in relaxing selection against spurious binding in nearly half of all TF binding motifs throughout the yeast genome. 2010 Elsevier B.V. All rights reserved.

  11. Ni2+-binding RNA motifs with an asymmetric purine-rich internal loop and a G-A base pair.

    Science.gov (United States)

    Hofmann, H P; Limmer, S; Hornung, V; Sprinzl, M

    1997-01-01

    RNA molecules with high affinity for immobilized Ni2+ were isolated from an RNA pool with 50 randomized positions by in vitro selection-amplification. The selected RNAs preferentially bind Ni2+ and Co2+ over other cations from first series transition metals. Conserved structure motifs, comprising about 15 nt, were identified that are likely to represent the Ni2+ binding sites. Two conserved motifs contain an asymmetric purine-rich internal loop and probably a mismatch G-A base pair. The structure of one of these motifs was studied with proton NMR spectroscopy and formation of the G-A pair at the junction of helix and internal loop was demonstrated. Using Ni2+ as a paramagnetic probe, a divalent metal ion binding site near this G-A base pair was identified. Ni2+ ions bound to this motif exert a specific stabilization effect. We propose that small asymmetric purine-rich loops that contain a G-A interaction may represent a divalent metal ion binding site in RNA. PMID:9409620

  12. One motif to bind them: A small-XXX-small motif affects transmembrane domain 1 oligomerization, function, localization, and cross-talk between two yeast GPCRs.

    Science.gov (United States)

    Lock, Antonia; Forfar, Rachel; Weston, Cathryn; Bowsher, Leo; Upton, Graham J G; Reynolds, Christopher A; Ladds, Graham; Dixon, Ann M

    2014-12-01

    G protein-coupled receptors (GPCRs) are the largest family of cell-surface receptors in mammals and facilitate a range of physiological responses triggered by a variety of ligands. GPCRs were thought to function as monomers, however it is now accepted that GPCR homo- and hetero-oligomers also exist and influence receptor properties. The Schizosaccharomyces pombe GPCR Mam2 is a pheromone-sensing receptor involved in mating and has previously been shown to form oligomers in vivo. The first transmembrane domain (TMD) of Mam2 contains a small-XXX-small motif, overrepresented in membrane proteins and well-known for promoting helix-helix interactions. An ortholog of Mam2 in Saccharomyces cerevisiae, Ste2, contains an analogous small-XXX-small motif which has been shown to contribute to receptor homo-oligomerization, localization and function. Here we have used experimental and computational techniques to characterize the role of the small-XXX-small motif in function and assembly of Mam2 for the first time. We find that disruption of the motif via mutagenesis leads to reduction of Mam2 TMD1 homo-oligomerization and pheromone-responsive cellular signaling of the full-length protein. It also impairs correct targeting to the plasma membrane. Mutation of the analogous motif in Ste2 yielded similar results, suggesting a conserved mechanism for assembly. Using co-expression of the two fungal receptors in conjunction with computational models, we demonstrate a functional change in G protein specificity and propose that this is brought about through hetero-dimeric interactions of Mam2 with Ste2 via the complementary small-XXX-small motifs. This highlights the potential of these motifs to affect a range of properties that can be investigated in other GPCRs. Copyright © 2014. Published by Elsevier B.V.

  13. Identification of group specific motifs in Beta-lactamase family of proteins

    Directory of Open Access Journals (Sweden)

    Saxena Akansha

    2009-12-01

    Full Text Available Abstract Background Beta-lactamases are one of the most serious threats to public health. In order to combat this threat we need to study the molecular and functional diversity of these enzymes and identify signatures specific to these enzymes. These signatures will enable us to develop inhibitors and diagnostic probes specific to lactamases. The existing classification of beta-lactamases was developed nearly 30 years ago when few lactamases were available. DLact database contain more than 2000 beta-lactamase, which can be used to study the molecular diversity and to identify signatures specific to this family. Methods A set of 2020 beta-lactamase proteins available in the DLact database http://59.160.102.202/DLact were classified using graph-based clustering of Best Bi-Directional Hits. Non-redundant (> 90 percent identical protein sequences from each group were aligned using T-Coffee and annotated using information available in literature. Motifs specific to each group were predicted using PRATT program. Results The graph-based classification of beta-lactamase proteins resulted in the formation of six groups (Four major groups containing 191, 726, 774 and 73 proteins while two minor groups containing 50 and 8 proteins. Based on the information available in literature, we found that each of the four major groups correspond to the four classes proposed by Ambler. The two minor groups were novel and do not contain molecular signatures of beta-lactamase proteins reported in literature. The group-specific motifs showed high sensitivity (> 70% and very high specificity (> 90%. The motifs from three groups (corresponding to class A, C and D had a high level of conservation at DNA as well as protein level whereas the motifs from the fourth group (corresponding to class B showed conservation at only protein level. Conclusion The graph-based classification of beta-lactamase proteins corresponds with the classification proposed by Ambler, thus there is

  14. The fission yeast RNA binding protein Mmi1 regulates meiotic genes by controlling intron specific splicing and polyadenylation coupled RNA turnover.

    Directory of Open Access Journals (Sweden)

    Huei-Mei Chen

    Full Text Available The polyA tails of mRNAs are monitored by the exosome as a quality control mechanism. We find that fission yeast, Schizosaccharomyces pombe, adopts this RNA quality control mechanism to regulate a group of 30 or more meiotic genes at the level of both splicing and RNA turnover. In vegetative cells the RNA binding protein Mmi1 binds to the primary transcripts of these genes. We find the novel motif U(U/C/GAAAC highly over-represented in targets of Mmi1. Mmi1 can specifically regulate the splicing of particular introns in a transcript: it inhibits the splicing of introns that are in the vicinity of putative Mmi1 binding sites, while allowing the splicing of other introns that are far from such sites. In addition, binding of Mmi1, particularly near the 3' end, alters 3' processing to promote extremely long polyA tails of up to a kilobase. The hyperadenylated transcripts are then targeted for degradation by the nuclear exonuclease Rrp6. The nuclear polyA binding protein Pab2 assists this hyperadenylation-mediated RNA decay. Rrp6 also targets other hyperadenylated transcripts, which become hyperadenylated in an unknown, but Mmi1-independent way. Thus, hyperadenylation may be a general signal for RNA degradation. In addition, binding of Mmi1 can affect the efficiency of 3' cleavage. Inactivation of Mmi1 in meiosis allows meiotic expression, through splicing and RNA stabilization, of at least 29 target genes, which are apparently constitutively transcribed.

  15. Triple basepair changes within and adjacent to the conserved YY1 motif upstream of the U3 enhancer repeats of SL3-3 murine leukemia virus cause a small but significant shortening of latency of T-lymphoma induction

    International Nuclear Information System (INIS)

    Ma Shiliang; Lovmand, Jette; Soerensen, Annette Balle; Luz, Arne; Schmidt, Joerg; Pedersen, Finn Skou

    2003-01-01

    A highly conserved sequence upstream of the transcriptional enhancer in the U3 of murine leukemia viruses (MLVs) was reported to mediate negative regulation of their expression. In transient expression studies, negative regulation was reported to be conferred by coexpression of the transcription factor YY1, which binds to a motif in the upstream conserved region (UCR). To address the function of the UCR and its YY1-motif in an in vivo model of MLV-host interactions we introduced six consecutive triple basepair mutations into this region of the potent T-lymphomagenic SL3-3 MLV. We report that all mutants have retained their replication competence and that they all, like the SL3-3 wild type (wt), induce T-cell lymphomas when injected into newborn mice of the SWR strain. However, all mutants induced disease with slightly shorter latency periods than the wt SL3-3, suggesting that the YY1 motif as well as its immediate context in the UCR have a negative effect on the pathogenicity of the virus. This result may have implications for the design of retroviral vectors

  16. Nuclear introns outperform mitochondrial DNA in inter-specific phylogenetic reconstruction: Lessons from horseshoe bats (Rhinolophidae: Chiroptera).

    Science.gov (United States)

    Dool, Serena E; Puechmaille, Sebastien J; Foley, Nicole M; Allegrini, Benjamin; Bastian, Anna; Mutumi, Gregory L; Maluleke, Tinyiko G; Odendaal, Lizelle J; Teeling, Emma C; Jacobs, David S

    2016-04-01

    Despite many studies illustrating the perils of utilising mitochondrial DNA in phylogenetic studies, it remains one of the most widely used genetic markers for this purpose. Over the last decade, nuclear introns have been proposed as alternative markers for phylogenetic reconstruction. However, the resolution capabilities of mtDNA and nuclear introns have rarely been quantified and compared. In the current study we generated a novel ∼5kb dataset comprising six nuclear introns and a mtDNA fragment. We assessed the relative resolution capabilities of the six intronic fragments with respect to each other, when used in various combinations together, and when compared to the traditionally used mtDNA. We focused on a major clade in the horseshoe bat family (Afro-Palaearctic clade; Rhinolophidae) as our case study. This old, widely distributed and speciose group contains a high level of conserved morphology. This morphological stasis renders the reconstruction of the phylogeny of this group with traditional morphological characters complex. We sampled multiple individuals per species to represent their geographic distributions as best as possible (122 individuals, 24 species, 68 localities). We reconstructed the species phylogeny using several complementary methods (partitioned Maximum Likelihood and Bayesian and Bayesian multispecies-coalescent) and made inferences based on consensus across these methods. We computed pairwise comparisons based on Robinson-Foulds tree distance metric between all Bayesian topologies generated (27,000) for every gene(s) and visualised the tree space using multidimensional scaling (MDS) plots. Using our supported species phylogeny we estimated the ancestral state of key traits of interest within this group, e.g. echolocation peak frequency which has been implicated in speciation. Our results revealed many potential cryptic species within this group, even in taxa where this was not suspected a priori and also found evidence for mt

  17. Memetic algorithms for de novo motif-finding in biomedical sequences.

    Science.gov (United States)

    Bi, Chengpeng

    2012-09-01

    The objectives of this study are to design and implement a new memetic algorithm for de novo motif discovery, which is then applied to detect important signals hidden in various biomedical molecular sequences. In this paper, memetic algorithms are developed and tested in de novo motif-finding problems. Several strategies in the algorithm design are employed that are to not only efficiently explore the multiple sequence local alignment space, but also effectively uncover the molecular signals. As a result, there are a number of key features in the implementation of the memetic motif-finding algorithm (MaMotif), including a chromosome replacement operator, a chromosome alteration-aware local search operator, a truncated local search strategy, and a stochastic operation of local search imposed on individual learning. To test the new algorithm, we compare MaMotif with a few of other similar algorithms using simulated and experimental data including genomic DNA, primary microRNA sequences (let-7 family), and transmembrane protein sequences. The new memetic motif-finding algorithm is successfully implemented in C++, and exhaustively tested with various simulated and real biological sequences. In the simulation, it shows that MaMotif is the most time-efficient algorithm compared with others, that is, it runs 2 times faster than the expectation maximization (EM) method and 16 times faster than the genetic algorithm-based EM hybrid. In both simulated and experimental testing, results show that the new algorithm is compared favorably or superior to other algorithms. Notably, MaMotif is able to successfully discover the transcription factors' binding sites in the chromatin immunoprecipitation followed by massively parallel sequencing (ChIP-Seq) data, correctly uncover the RNA splicing signals in gene expression, and precisely find the highly conserved helix motif in the transmembrane protein sequences, as well as rightly detect the palindromic segments in the primary micro

  18. In silico identification and analysis of phytoene synthase genes in plants.

    Science.gov (United States)

    Han, Y; Zheng, Q S; Wei, Y P; Chen, J; Liu, R; Wan, H J

    2015-08-14

    In this study, we examined phytoene synthetase (PSY), the first key limiting enzyme in the synthesis of carotenoids and catalyzing the formation of geranylgeranyl pyrophosphate in terpenoid biosynthesis. We used known amino acid sequences of the PSY gene in tomato plants to conduct a genome-wide search and identify putative candidates in 34 sequenced plants. A total of 101 homologous genes were identified. Phylogenetic analysis revealed that PSY evolved independently in algae as well as monocotyledonous and dicotyledonous plants. Our results showed that the amino acid structures exhibited 5 motifs (motifs 1 to 5) in algae and those in higher plants were highly conserved. The PSY gene structures showed that the number of intron in algae varied widely, while the number of introns in higher plants was 4 to 5. Identification of PSY genes in plants and the analysis of the gene structure may provide a theoretical basis for studying evolutionary relationships in future analyses.

  19. Novel intron markers to study the phylogeny of closely related mammalian species

    Directory of Open Access Journals (Sweden)

    Castresana Jose

    2010-11-01

    Full Text Available Abstract Background Multilocus phylogenies can be used to infer the species tree of a group of closely related species. In species trees, the nodes represent the actual separation between species, thus providing essential information about their evolutionary history. In addition, multilocus phylogenies can help in analyses of species delimitation, gene flow and genetic differentiation within species. However, few adequate markers are available for such studies. Results In order to develop nuclear markers that can be useful in multilocus studies of mammals, we analyzed the mammalian genomes of human, chimpanzee, macaque, dog and cow. Rodents were excluded due to their unusual genomic features. Introns were extracted from the mammalian genomes because of their greater genetic variability and ease of amplification from the flanking exons. To an initial set of more than 10,000 one-to-one orthologous introns we applied several filters to select introns that belong to single-copy genes, show neutral evolutionary rates and have an adequate length for their amplification. This analysis led to a final list of 224 intron markers randomly distributed along the genome. To experimentally test their validity, we amplified twelve of these introns in a panel of six mammalian species. The result was that seven of these introns gave rise to a PCR band of the expected size in all species. In addition, we sequenced these bands and analyzed the accumulation of substitutions in these introns in five pairs of closely related species. The results showed that the estimated genetic distances in the five species pairs was quite variable among introns and that this divergence cannot be directly predicted from the overall intron divergence in mammals. Conclusions We have designed a new set of 224 nuclear introns with optimal features for the phylogeny of closely related mammalian species. A large proportion of the introns tested experimentally showed a perfect amplification

  20. Perception Enhancement using Visual Attributes in Sequence Motif Visualization

    OpenAIRE

    Oon, Yin; Lee, Nung; Kok, Wei

    2016-01-01

    Sequence logo is a well-accepted scientific method to visualize the conservation characteristics of biological sequence motifs. Previous studies found that using sequence logo graphical representation for scientific evidence reports or arguments could seriously cause biases and misinterpretation by users. This study investigates on the visual attributes performance of a sequence logo in helping users to perceive and interpret the information based on preattentive theories and Gestalt principl...

  1. Conserved retinoblastoma protein-binding motif in human cytomegalovirus UL97 kinase minimally impacts viral replication but affects susceptibility to maribavir

    Directory of Open Access Journals (Sweden)

    Chou Sunwen

    2009-01-01

    Full Text Available Abstract The UL97 kinase has been shown to phosphorylate and inactivate the retinoblastoma protein (Rb and has three consensus Rb-binding motifs that might contribute to this activity. Recombinant viruses containing mutations in the Rb-binding motifs generally replicated well in human foreskin fibroblasts with only a slight delay in replication kinetics. Their susceptibility to the specific UL97 kinase inhibitor, maribavir, was also examined. Mutation of the amino terminal motif, which is involved in the inactivation of Rb, also renders the virus hypersensitive to the drug and suggests that the motif may play a role in its mechanism of action.

  2. SiteBinder: an improved approach for comparing multiple protein structural motifs.

    Science.gov (United States)

    Sehnal, David; Vařeková, Radka Svobodová; Huber, Heinrich J; Geidl, Stanislav; Ionescu, Crina-Maria; Wimmerová, Michaela; Koča, Jaroslav

    2012-02-27

    There is a paramount need to develop new techniques and tools that will extract as much information as possible from the ever growing repository of protein 3D structures. We report here on the development of a software tool for the multiple superimposition of large sets of protein structural motifs. Our superimposition methodology performs a systematic search for the atom pairing that provides the best fit. During this search, the RMSD values for all chemically relevant pairings are calculated by quaternion algebra. The number of evaluated pairings is markedly decreased by using PDB annotations for atoms. This approach guarantees that the best fit will be found and can be applied even when sequence similarity is low or does not exist at all. We have implemented this methodology in the Web application SiteBinder, which is able to process up to thousands of protein structural motifs in a very short time, and which provides an intuitive and user-friendly interface. Our benchmarking analysis has shown the robustness, efficiency, and versatility of our methodology and its implementation by the successful superimposition of 1000 experimentally determined structures for each of 32 eukaryotic linear motifs. We also demonstrate the applicability of SiteBinder using three case studies. We first compared the structures of 61 PA-IIL sugar binding sites containing nine different sugars, and we found that the sugar binding sites of PA-IIL and its mutants have a conserved structure despite their binding different sugars. We then superimposed over 300 zinc finger central motifs and revealed that the molecular structure in the vicinity of the Zn atom is highly conserved. Finally, we superimposed 12 BH3 domains from pro-apoptotic proteins. Our findings come to support the hypothesis that there is a structural basis for the functional segregation of BH3-only proteins into activators and enablers.

  3. Multi-layered control of Galectin-8 mediated autophagy during adenovirus cell entry through a conserved PPxY motif in the viral capsid.

    Directory of Open Access Journals (Sweden)

    Charlotte Montespan

    2017-02-01

    Full Text Available Cells employ active measures to restrict infection by pathogens, even prior to responses from the innate and humoral immune defenses. In this context selective autophagy is activated upon pathogen induced membrane rupture to sequester and deliver membrane fragments and their pathogen contents for lysosomal degradation. Adenoviruses, which breach the endosome upon entry, escape this fate by penetrating into the cytosol prior to autophagosome sequestration of the ruptured endosome. We show that virus induced membrane damage is recognized through Galectin-8 and sequesters the autophagy receptors NDP52 and p62. We further show that a conserved PPxY motif in the viral membrane lytic protein VI is critical for efficient viral evasion of autophagic sequestration after endosomal lysis. Comparing the wildtype with a PPxY-mutant virus we show that depletion of Galectin-8 or suppression of autophagy in ATG5-/- MEFs rescues infectivity of the PPxY-mutant virus while depletion of the autophagy receptors NDP52, p62 has only minor effects. Furthermore we show that wildtype viruses exploit the autophagic machinery for efficient nuclear genome delivery and control autophagosome formation via the cellular ubiquitin ligase Nedd4.2 resulting in reduced antigenic presentation. Our data thus demonstrate that a short PPxY-peptide motif in the adenoviral capsid permits multi-layered viral control of autophagic processes during entry.

  4. Conservation and diversification of Msx protein in metazoan evolution.

    Science.gov (United States)

    Takahashi, Hirokazu; Kamiya, Akiko; Ishiguro, Akira; Suzuki, Atsushi C; Saitou, Naruya; Toyoda, Atsushi; Aruga, Jun

    2008-01-01

    Msx (/msh) family genes encode homeodomain (HD) proteins that control ontogeny in many animal species. We compared the structures of Msx genes from a wide range of Metazoa (Porifera, Cnidaria, Nematoda, Arthropoda, Tardigrada, Platyhelminthes, Mollusca, Brachiopoda, Annelida, Echiura, Echinodermata, Hemichordata, and Chordata) to gain an understanding of the role of these genes in phylogeny. Exon-intron boundary analysis suggested that the position of the intron located N-terminally to the HDs was widely conserved in all the genes examined, including those of cnidarians. Amino acid (aa) sequence comparison revealed 3 new evolutionarily conserved domains, as well as very strong conservation of the HDs. Two of the three domains were associated with Groucho-like protein binding in both a vertebrate and a cnidarian Msx homolog, suggesting that the interaction between Groucho-like proteins and Msx proteins was established in eumetazoan ancestors. Pairwise comparison among the collected HDs and their C-flanking aa sequences revealed that the degree of sequence conservation varied depending on the animal taxa from which the sequences were derived. Highly conserved Msx genes were identified in the Vertebrata, Cephalochordata, Hemichordata, Echinodermata, Mollusca, Brachiopoda, and Anthozoa. The wide distribution of the conserved sequences in the animal phylogenetic tree suggested that metazoan ancestors had already acquired a set of conserved domains of the current Msx family genes. Interestingly, although strongly conserved sequences were recovered from the Vertebrata, Cephalochordata, and Anthozoa, the sequences from the Urochordata and Hydrozoa showed weak conservation. Because the Vertebrata-Cephalochordata-Urochordata and Anthozoa-Hydrozoa represent sister groups in the Chordata and Cnidaria, respectively, Msx sequence diversification may have occurred differentially in the course of evolution. We speculate that selective loss of the conserved domains in Msx family

  5. MPN+, a putative catalytic motif found in a subset of MPN domain proteins from eukaryotes and prokaryotes, is critical for Rpn11 function

    Directory of Open Access Journals (Sweden)

    Hofmann Kay

    2002-09-01

    Full Text Available Abstract Background Three macromolecular assemblages, the lid complex of the proteasome, the COP9-Signalosome (CSN and the eIF3 complex, all consist of multiple proteins harboring MPN and PCI domains. Up to now, no specific function for any of these proteins has been defined, nor has the importance of these motifs been elucidated. In particular Rpn11, a lid subunit, serves as the paradigm for MPN-containing proteins as it is highly conserved and important for proteasome function. Results We have identified a sequence motif, termed the MPN+ motif, which is highly conserved in a subset of MPN domain proteins such as Rpn11 and Csn5/Jab1, but is not present outside of this subfamily. The MPN+ motif consists of five polar residues that resemble the active site residues of hydrolytic enzyme classes, particularly that of metalloproteases. By using site-directed mutagenesis, we show that the MPN+ residues are important for the function of Rpn11, while a highly conserved Cys residue outside of the MPN+ motif is not essential. Single amino acid substitutions in MPN+ residues all show similar phenotypes, including slow growth, sensitivity to temperature and amino acid analogs, and general proteasome-dependent proteolysis defects. Conclusions The MPN+ motif is abundant in certain MPN-domain proteins, including newly identified proteins of eukaryotes, bacteria and archaea thought to act outside of the traditional large PCI/MPN complexes. The putative catalytic nature of the MPN+ motif makes it a good candidate for a pivotal enzymatic function, possibly a proteasome-associated deubiquitinating activity and a CSN-associated Nedd8/Rub1-removing activity.

  6. A survey of motif finding Web tools for detecting binding site motifs in ChIP-Seq data.

    Science.gov (United States)

    Tran, Ngoc Tam L; Huang, Chun-Hsi

    2014-02-20

    ChIP-Seq (chromatin immunoprecipitation sequencing) has provided the advantage for finding motifs as ChIP-Seq experiments narrow down the motif finding to binding site locations. Recent motif finding tools facilitate the motif detection by providing user-friendly Web interface. In this work, we reviewed nine motif finding Web tools that are capable for detecting binding site motifs in ChIP-Seq data. We showed each motif finding Web tool has its own advantages for detecting motifs that other tools may not discover. We recommended the users to use multiple motif finding Web tools that implement different algorithms for obtaining significant motifs, overlapping resemble motifs, and non-overlapping motifs. Finally, we provided our suggestions for future development of motif finding Web tool that better assists researchers for finding motifs in ChIP-Seq data.

  7. Towards barcode markers in Fungi: an intron map of Ascomycota mitochondria.

    Science.gov (United States)

    Santamaria, Monica; Vicario, Saverio; Pappadà, Graziano; Scioscia, Gaetano; Scazzocchio, Claudio; Saccone, Cecilia

    2009-06-16

    A standardized and cost-effective molecular identification system is now an urgent need for Fungi owing to their wide involvement in human life quality. In particular the potential use of mitochondrial DNA species markers has been taken in account. Unfortunately, a serious difficulty in the PCR and bioinformatic surveys is due to the presence of mobile introns in almost all the fungal mitochondrial genes. The aim of this work is to verify the incidence of this phenomenon in Ascomycota, testing, at the same time, a new bioinformatic tool for extracting and managing sequence databases annotations, in order to identify the mitochondrial gene regions where introns are missing so as to propose them as species markers. The general trend towards a large occurrence of introns in the mitochondrial genome of Fungi has been confirmed in Ascomycota by an extensive bioinformatic analysis, performed on all the entries concerning 11 mitochondrial protein coding genes and 2 mitochondrial rRNA (ribosomal RNA) specifying genes, belonging to this phylum, available in public nucleotide sequence databases. A new query approach has been developed to retrieve effectively introns information included in these entries. After comparing the new query-based approach with a blast-based procedure, with the aim of designing a faithful Ascomycota mitochondrial intron map, the first method appeared clearly the most accurate. Within this map, despite the large pervasiveness of introns, it is possible to distinguish specific regions comprised in several genes, including the full NADH dehydrogenase subunit 6 (ND6) gene, which could be considered as barcode candidates for Ascomycota due to their paucity of introns and to their length, above 400 bp, comparable to the lower end size of the length range of barcodes successfully used in animals. The development of the new query system described here would answer the pressing requirement to improve drastically the bioinformatics support to the DNA Barcode

  8. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    Science.gov (United States)

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with tandem CYR motifs, has endo- and exonuclease activities against abasic site and other types of base damage. PALF accumulates rapidly at single-strand breaks in a poly(ADP-ribose) polymerase 1 (PARP1)-dependent manner in human cells. Indeed, PALF interacts directly with PARP1 and is required for its activation and for cellular resistance to methyl-methane sulfonate. PALF also interacts directly with KU86, LIGASEIV and phosphorylated XRCC4 proteins and possesses endo/exonuclease activity at protruding DNA ends. Various treatments that produce double-strand breaks induce formation of PALF foci, which fully coincide with γH2AX foci. Thus, PALF and the CYR motif may play important roles in DNA repair of higher eukaryotes. PMID:17396150

  9. Structural Diversity in Conserved Regions Like the DRY-Motif among Viral 7TM Receptors-A Consequence of Evolutionary Pressure?

    DEFF Research Database (Denmark)

    Mølleskov-Jensen, Ann-Sofie; Sparre-Ulrich, Alexander Hovard; Davis-Poynter, Nicholas

    2012-01-01

    Several herpes- and poxviruses have captured chemokine receptors from their hosts and modified these to their own benefit. The human and viral chemokine receptors belong to class A 7 transmembrane (TM) receptors which are characterized by several structural motifs like the DRY-motif in TM3...... and the C-terminal tail. In the DRY-motif, the arginine residue serves important purposes by being directly involved in G protein coupling. Interestingly, among the viral receptors there is a greater diversity in the DRY-motif compared to their endogenous receptor homologous. The C-terminal receptor tail...... constitutes another regulatory region that through a number of phosphorylation sites is involved in signaling, desensitization, and internalization. Also this region is more variable among virus-encoded 7TM receptors compared to human class A receptors. In this review we will focus on these two structural...

  10. Phylogenetic distribution of intron positions in alpha-amylase genes of bilateria suggests numerous gains and losses.

    Directory of Open Access Journals (Sweden)

    Jean-Luc Da Lage

    Full Text Available Most eukaryotes have at least some genes interrupted by introns. While it is well accepted that introns were already present at moderate density in the last eukaryote common ancestor, the conspicuous diversity of intron density among genomes suggests a complex evolutionary history, with marked differences between phyla. The question of the rates of intron gains and loss in the course of evolution and factors influencing them remains controversial. We have investigated a single gene family, alpha-amylase, in 55 species covering a variety of animal phyla. Comparison of intron positions across phyla suggests a complex history, with a likely ancestral intronless gene undergoing frequent intron loss and gain, leading to extant intron/exon structures that are highly variable, even among species from the same phylum. Because introns are known to play no regulatory role in this gene and there is no alternative splicing, the structural differences may be interpreted more easily: intron positions, sizes, losses or gains may be more likely related to factors linked to splicing mechanisms and requirements, and to recognition of introns and exons, or to more extrinsic factors, such as life cycle and population size. We have shown that intron losses outnumbered gains in recent periods, but that "resets" of intron positions occurred at the origin of several phyla, including vertebrates. Rates of gain and loss appear to be positively correlated. No phase preference was found. We also found evidence for parallel gains and for intron sliding. Presence of introns at given positions was correlated to a strong protosplice consensus sequence AG/G, which was much weaker in the absence of intron. In contrast, recent intron insertions were not associated with a specific sequence. In animal Amy genes, population size and generation time seem to have played only minor roles in shaping gene structures.

  11. Rbfox-regulated alternative splicing is critical for zebrafish cardiac and skeletal muscle function

    Science.gov (United States)

    Gallagher, Thomas L.; Arribere, Joshua A.; Geurts, Paul A.; Exner, Cameron R. T.; McDonald, Kent L.; Dill, Kariena K.; Marr, Henry L.; Adkar, Shaunak S.; Garnett, Aaron T.; Amacher, Sharon L.; Conboy, John G.

    2012-01-01

    Rbfox RNA binding proteins are implicated as regulators of phylogenetically-conserved alternative splicing events important for muscle function. To investigate the function of rbfox genes, we used morpholino-mediated knockdown of muscle-expressed rbfox1l and rbfox2 in zebrafish embryos. Single and double morphant embryos exhibited changes in splicing of overlapping sets of bioinformatically-predicted rbfox target exons, many of which exhibit a muscle-enriched splicing pattern that is conserved in vertebrates. Thus, conservation of intronic Rbfox binding motifs is a good predictor of Rbfox-regulated alternative splicing. Morphology and development of single morphant embryos was strikingly normal; however, muscle development in double morphants was severely disrupted. Defects in cardiac muscle were marked by reduced heart rate and in skeletal muscle by complete paralysis. The predominance of wavy myofibers and abnormal thick and thin filaments in skeletal muscle revealed that myofibril assembly is defective and disorganized in double morphants. Ultra-structural analysis revealed that although sarcomeres with electron dense M- and Z-bands are present in muscle fibers of rbfox1l/rbox2 morphants, they are substantially reduced in number and alignment. Importantly, splicing changes and morphological defects were rescued by expression of morpholino-resistant rbfox cDNA. Additionally, a target-blocking MO complementary to a single UGCAUG motif adjacent to an rbfox target exon of fxr1 inhibited inclusion in a similar manner to rbfox knockdown, providing evidence that Rbfox regulates the splicing of target exons via direct binding to intronic regulatory motifs. We conclude that Rbfox proteins regulate an alternative splicing program essential for vertebrate heart and skeletal muscle function. PMID:21925157

  12. Rbfox-regulated alternative splicing is critical for zebrafish cardiac and skeletal muscle functions.

    Science.gov (United States)

    Gallagher, Thomas L; Arribere, Joshua A; Geurts, Paul A; Exner, Cameron R T; McDonald, Kent L; Dill, Kariena K; Marr, Henry L; Adkar, Shaunak S; Garnett, Aaron T; Amacher, Sharon L; Conboy, John G

    2011-11-15

    Rbfox RNA binding proteins are implicated as regulators of phylogenetically-conserved alternative splicing events important for muscle function. To investigate the function of rbfox genes, we used morpholino-mediated knockdown of muscle-expressed rbfox1l and rbfox2 in zebrafish embryos. Single and double morphant embryos exhibited changes in splicing of overlapping sets of bioinformatically-predicted rbfox target exons, many of which exhibit a muscle-enriched splicing pattern that is conserved in vertebrates. Thus, conservation of intronic Rbfox binding motifs is a good predictor of Rbfox-regulated alternative splicing. Morphology and development of single morphant embryos were strikingly normal; however, muscle development in double morphants was severely disrupted. Defects in cardiac muscle were marked by reduced heart rate and in skeletal muscle by complete paralysis. The predominance of wavy myofibers and abnormal thick and thin filaments in skeletal muscle revealed that myofibril assembly is defective and disorganized in double morphants. Ultra-structural analysis revealed that although sarcomeres with electron dense M- and Z-bands are present in muscle fibers of rbfox1l/rbox2 morphants, they are substantially reduced in number and alignment. Importantly, splicing changes and morphological defects were rescued by expression of morpholino-resistant rbfox cDNA. Additionally, a target-blocking MO complementary to a single UGCAUG motif adjacent to an rbfox target exon of fxr1 inhibited inclusion in a similar manner to rbfox knockdown, providing evidence that Rbfox regulates the splicing of target exons via direct binding to intronic regulatory motifs. We conclude that Rbfox proteins regulate an alternative splicing program essential for vertebrate heart and skeletal muscle functions. Published by Elsevier Inc.

  13. Class I self-splicing introns are found in the T-even bacteriophage family

    International Nuclear Information System (INIS)

    Chu, F.K.; Maley, F.; Maley, G.F.

    1987-01-01

    The thymidylate synthase gene (td) and ribonucleotide reductase B2 subunit gene (nrdB) EMBO both of bacteriophage T4 in origin, are procaryotic intron-containing protein-encoding genes. To screen for other procaryotic introns, southern hybridization analysis of several procaryotic genomes was carried out, using T4 phage td DNA restriction fragments and synthetic oligodeoxynucleotides defining strategic td exon and intron regions. Furthermore, the labeling pattern of total RNA with [α- 32 P]GTP, a typical reaction of self-splicing RNAs (class I), was examined. Experimental data implicate multiple self-splicing introns only in the T-even phages: five (1, 0.9, 0.83, 0.75 and 0.6 kb) in T4 and three (1, 0.9 and 0.75 kb) each in T2 and T6 phages. Northern hybridization analysis of total RNA extracted from T-even phage-infected cells confirms that the 1 kb RNA from each phage is in fact the excised intron segment from the precursor RNA transcribed from an intron-containing td gene in each case. This RNA cyclizes to form a contiguous circular molecule. The 0.6 kb RNA is most likely the T4 phage nrdB intron which seems to be absent from the corresponding gene in T2 and T6. The remaining RNA species are candidates for other self-splicing introns in these phages

  14. Identification of a truncated splice variant of IL-18 receptor alpha in the human and rat, with evidence of wider evolutionary conservation

    Directory of Open Access Journals (Sweden)

    Chris S. Booker

    2014-09-01

    Full Text Available Interleukin-18 (IL-18 is a pro-inflammatory cytokine which stimulates activation of the nuclear factor kappa beta (NF-κB pathway via interaction with the IL-18 receptor. The receptor itself is formed from a dimer of two subunits, with the ligand-binding IL-18Rα subunit being encoded by the IL18R1 gene. A splice variant of murine IL18r1, which has been previously described, is formed by transcription of an unspliced intron (forming a ‘type II’ IL18r1 transcript and is predicted to encode a receptor with a truncated intracellular domain lacking the capacity to generate downstream signalling. In order to examine the relevance of this finding to human IL-18 function, we assessed the presence of a homologous transcript by reverse transcription-polymerase chain reaction (RT-PCR in the human and rat as another common laboratory animal. We present evidence for type II IL18R1 transcripts in both species. While the mouse and rat transcripts are predicted to encode a truncated receptor with a novel 5 amino acid C-terminal domain, the human sequence is predicted to encode a truncated protein with a novel 22 amino acid sequence bearing resemblance to the ‘Box 1’ motif of the Toll/interleukin-1 receptor (TIR domain, in a similar fashion to the inhibitory interleukin-1 receptor 2. Given that transcripts from these three species are all formed by inclusion of homologous unspliced intronic regions, an analysis of homologous introns across a wider array of 33 species with available IL18R1 gene records was performed, which suggests similar transcripts may encode truncated type II IL-18Rα subunits in other species. This splice variant may represent a conserved evolutionary mechanism for regulating IL-18 activity.

  15. A ChIP-Seq benchmark shows that sequence conservation mainly improves detection of strong transcription factor binding sites.

    Directory of Open Access Journals (Sweden)

    Tony Håndstad

    Full Text Available BACKGROUND: Transcription factors are important controllers of gene expression and mapping transcription factor binding sites (TFBS is key to inferring transcription factor regulatory networks. Several methods for predicting TFBS exist, but there are no standard genome-wide datasets on which to assess the performance of these prediction methods. Also, it is believed that information about sequence conservation across different genomes can generally improve accuracy of motif-based predictors, but it is not clear under what circumstances use of conservation is most beneficial. RESULTS: Here we use published ChIP-seq data and an improved peak detection method to create comprehensive benchmark datasets for prediction methods which use known descriptors or binding motifs to detect TFBS in genomic sequences. We use this benchmark to assess the performance of five different prediction methods and find that the methods that use information about sequence conservation generally perform better than simpler motif-scanning methods. The difference is greater on high-affinity peaks and when using short and information-poor motifs. However, if the motifs are specific and information-rich, we find that simple motif-scanning methods can perform better than conservation-based methods. CONCLUSIONS: Our benchmark provides a comprehensive test that can be used to rank the relative performance of transcription factor binding site prediction methods. Moreover, our results show that, contrary to previous reports, sequence conservation is better suited for predicting strong than weak transcription factor binding sites.

  16. Molecular characterization of a new member of the lariat capping twin-ribozyme introns

    DEFF Research Database (Denmark)

    Tang, Yunjia; Nielsen, Henrik; Masquida, Benoît

    2014-01-01

    BACKGROUND: Twin-ribozyme introns represent a complex class of mobile group I introns that harbour a lariat capping (LC) ribozyme and a homing endonuclease gene embedded in a conventional self-splicing group I ribozyme (GIR2). Twin-ribozyme introns have so far been confined to nucleolar DNA in Na...

  17. Characteristics of binding sites of intergenic, intronic and exonic ...

    African Journals Online (AJOL)

    user

    2013-03-06

    Mar 6, 2013 ... miR-1587). Such part of mRNA is very important for its regulation via several miRNA. Interaction of intronic miRNAs with mRNAs genes coding in-miRNA. Oncogenes (51) are host genes and target genes for in-. miRNAs. Majority of these in-miRNAs are encoded in intron. Five of the studied genes (ATF2, ...

  18. Sequencing of mitochondrial genomes of nine Aspergillus and Penicillium species identifies mobile introns and accessory genes as main sources of genome size variability.

    Science.gov (United States)

    Joardar, Vinita; Abrams, Natalie F; Hostetler, Jessica; Paukstelis, Paul J; Pakala, Suchitra; Pakala, Suman B; Zafar, Nikhat; Abolude, Olukemi O; Payne, Gary; Andrianopoulos, Alex; Denning, David W; Nierman, William C

    2012-12-12

    studies. Despite the conservation of the core genes, the mitochondrial genomes of Aspergillus and Penicillium species examined here exhibit significant amount of interspecies variation. Most of this variation can be attributed to accessory genes and mobile introns, presumably acquired by horizontal gene transfer of mitochondrial plasmids and intron homing.

  19. The MARVEL transmembrane motif of occludin mediates oligomerization and targeting to the basolateral surface in epithelia.

    Science.gov (United States)

    Yaffe, Yakey; Shepshelovitch, Jeanne; Nevo-Yassaf, Inbar; Yeheskel, Adva; Shmerling, Hedva; Kwiatek, Joanna M; Gaus, Katharina; Pasmanik-Chor, Metsada; Hirschberg, Koret

    2012-08-01

    Occludin (Ocln), a MARVEL-motif-containing protein, is found in all tight junctions. MARVEL motifs are comprised of four transmembrane helices associated with the localization to or formation of diverse membrane subdomains by interacting with the proximal lipid environment. The functions of the Ocln MARVEL motif are unknown. Bioinformatics sequence- and structure-based analyses demonstrated that the MARVEL domain of Ocln family proteins has distinct evolutionarily conserved sequence features that are consistent with its basolateral membrane localization. Live-cell microscopy, fluorescence resonance energy transfer (FRET) and bimolecular fluorescence complementation (BiFC) were used to analyze the intracellular distribution and self-association of fluorescent-protein-tagged full-length human Ocln or the Ocln MARVEL motif excluding the cytosolic C- and N-termini (amino acids 60-269, FP-MARVEL-Ocln). FP-MARVEL-Ocln efficiently arrived at the plasma membrane (PM) and was sorted to the basolateral PM in filter-grown polarized MDCK cells. A series of conserved aromatic amino acids within the MARVEL domain were found to be associated with Ocln dimerization using BiFC. FP-MARVEL-Ocln inhibited membrane pore growth during Triton-X-100-induced solubilization and was shown to increase the membrane-ordered state using Laurdan, a lipid dye. These data demonstrate that the Ocln MARVEL domain mediates self-association and correct sorting to the basolateral membrane.

  20. A novel human AP endonuclease with conserved zinc-finger-like motifs involved in DNA strand break responses

    OpenAIRE

    Kanno, Shin-ichiro; Kuzuoka, Hiroyuki; Sasao, Shigeru; Hong, Zehui; Lan, Li; Nakajima, Satoshi; Yasui, Akira

    2007-01-01

    DNA damage causes genome instability and cell death, but many of the cellular responses to DNA damage still remain elusive. We here report a human protein, PALF (PNK and APTX-like FHA protein), with an FHA (forkhead-associated) domain and novel zinc-finger-like CYR (cysteine–tyrosine–arginine) motifs that are involved in responses to DNA damage. We found that the CYR motif is widely distributed among DNA repair proteins of higher eukaryotes, and that PALF, as well as a Drosophila protein with...

  1. Short sequence motifs, overrepresented in mammalian conservednon-coding sequences

    Energy Technology Data Exchange (ETDEWEB)

    Minovitsky, Simon; Stegmaier, Philip; Kel, Alexander; Kondrashov,Alexey S.; Dubchak, Inna

    2007-02-21

    Background: A substantial fraction of non-coding DNAsequences of multicellular eukaryotes is under selective constraint. Inparticular, ~;5 percent of the human genome consists of conservednon-coding sequences (CNSs). CNSs differ from other genomic sequences intheir nucleotide composition and must play important functional roles,which mostly remain obscure.Results: We investigated relative abundancesof short sequence motifs in all human CNSs present in the human/mousewhole-genome alignments vs. three background sets of sequences: (i)weakly conserved or unconserved non-coding sequences (non-CNSs); (ii)near-promoter sequences (located between nucleotides -500 and -1500,relative to a start of transcription); and (iii) random sequences withthe same nucleotide composition as that of CNSs. When compared tonon-CNSs and near-promoter sequences, CNSs possess an excess of AT-richmotifs, often containing runs of identical nucleotides. In contrast, whencompared to random sequences, CNSs contain an excess of GC-rich motifswhich, however, lack CpG dinucleotides. Thus, abundance of short sequencemotifs in human CNSs, taken as a whole, is mostly determined by theiroverall compositional properties and not by overrepresentation of anyspecific short motifs. These properties are: (i) high AT-content of CNSs,(ii) a tendency, probably due to context-dependent mutation, of A's andT's to clump, (iii) presence of short GC-rich regions, and (iv) avoidanceof CpG contexts, due to their hypermutability. Only a small number ofshort motifs, overrepresented in all human CNSs are similar to bindingsites of transcription factors from the FOX family.Conclusion: Human CNSsas a whole appear to be too broad a class of sequences to possess strongfootprints of any short sequence-specific functions. Such footprintsshould be studied at the level of functional subclasses of CNSs, such asthose which flank genes with a particular pattern of expression. Overallproperties of CNSs are affected by

  2. A sialoreceptor binding motif in the Mycoplasma synoviae adhesin VlhA.

    Directory of Open Access Journals (Sweden)

    Meghan May

    Full Text Available Mycoplasma synoviae depends on its adhesin VlhA to mediate cytadherence to sialylated host cell receptors. Allelic variants of VlhA arise through recombination between an assemblage of promoterless vlhA pseudogenes and a single transcription promoter site, creating lineages of M. synoviae that each express a different vlhA allele. The predicted full-length VlhA sequences adjacent to the promoter of nine lineages of M. synoviae varying in avidity of cytadherence were aligned with that of the reference strain MS53 and with a 60-a.a. hemagglutinating VlhA C-terminal fragment from a Tunisian lineage of strain WVU1853(T. Seven different sequence variants of an imperfectly conserved, single-copy, 12-a.a. candidate cytadherence motif were evident amid the flanking variable residues of the 11 total sequences examined. The motif was predicted to adopt a short hairpin structure in a low-complexity region near the C-terminus of VlhA. Biotinylated synthetic oligopeptides representing four selected variants of the 12-a.a. motif, with the whole synthesized 60-a.a. fragment as a positive control, differed (P<0.01 in the extent they bound to chicken erythrocyte membranes. All bound to a greater extent (P<0.01 than scrambled or irrelevant VlhA domain negative control peptides did. Experimentally introduced branched-chain amino acid (BCAA substitutions Val3Ile and Leu7Ile did not significantly alter binding, whereas fold-destabilizing substitutions Thr4Gly and Ala9Gly tended to reduce it (P<0.05. Binding was also reduced to background levels (P<0.01 when the peptides were exposed to desialylated membranes, or were pre-saturated with free sialic acid before exposure to untreated membranes. From this evidence we conclude that the motif P-X-(BCAA-X-F-X-(BCAA-X-A-K-X-G binds sialic acid and likely mediates VlhA-dependent M. synoviae attachment to host cells. This conserved mechanism retains the potential for fine-scale rheostasis in binding avidity, which could be a

  3. Overlapping ETS and CRE Motifs (G/CCGGAAGTGACGTCA) Preferentially Bound by GABPα and CREB Proteins

    Science.gov (United States)

    Chatterjee, Raghunath; Zhao, Jianfei; He, Ximiao; Shlyakhtenko, Andrey; Mann, Ishminder; Waterfall, Joshua J.; Meltzer, Paul; Sathyanarayana, B. K.; FitzGerald, Peter C.; Vinson, Charles

    2012-01-01

    Previously, we identified 8-bps long DNA sequences (8-mers) that localize in human proximal promoters and grouped them into known transcription factor binding sites (TFBS). We now examine split 8-mers consisting of two 4-mers separated by 1-bp to 30-bps (X4-N1-30-X4) to identify pairs of TFBS that localize in proximal promoters at a precise distance. These include two overlapping TFBS: the ETS⇔ETS motif (C/GCCGGAAGCGGAA) and the ETS⇔CRE motif (C/GCGGAAGTGACGTCAC). The nucleotides in bold are part of both TFBS. Molecular modeling shows that the ETS⇔CRE motif can be bound simultaneously by both the ETS and the B-ZIP domains without protein-protein clashes. The electrophoretic mobility shift assay (EMSA) shows that the ETS protein GABPα and the B-ZIP protein CREB preferentially bind to the ETS⇔CRE motif only when the two TFBS overlap precisely. In contrast, the ETS domain of ETV5 and CREB interfere with each other for binding the ETS⇔CRE. The 11-mer (CGGAAGTGACG), the conserved part of the ETS⇔CRE motif, occurs 226 times in the human genome and 83% are in known regulatory regions. In vivo GABPα and CREB ChIP-seq peaks identified the ETS⇔CRE as the most enriched motif occurring in promoters of genes involved in mRNA processing, cellular catabolic processes, and stress response, suggesting that a specific class of genes is regulated by this composite motif. PMID:23050235

  4. Role of NH2-terminal hydrophobic motif in the subcellular localization of ATP-binding cassette protein subfamily D: Common features in eukaryotic organisms

    International Nuclear Information System (INIS)

    Lee, Asaka; Asahina, Kota; Okamoto, Takumi; Kawaguchi, Kosuke; Kostsin, Dzmitry G.; Kashiwayama, Yoshinori; Takanashi, Kojiro; Yazaki, Kazufumi; Imanaka, Tsuneo; Morita, Masashi

    2014-01-01

    Highlights: • ABCD proteins classifies based on with or without NH 2 -terminal hydrophobic segment. • The ABCD proteins with the segment are targeted peroxisomes. • The ABCD proteins without the segment are targeted to the endoplasmic reticulum. • The role of the segment in organelle targeting is conserved in eukaryotic organisms. - Abstract: In mammals, four ATP-binding cassette (ABC) proteins belonging to subfamily D have been identified. ABCD1–3 possesses the NH 2 -terminal hydrophobic region and are targeted to peroxisomes, while ABCD4 lacking the region is targeted to the endoplasmic reticulum (ER). Based on hydropathy plot analysis, we found that several eukaryotes have ABCD protein homologs lacking the NH 2 -terminal hydrophobic segment (H0 motif). To investigate whether the role of the NH 2 -terminal H0 motif in subcellular localization is conserved across species, we expressed ABCD proteins from several species (metazoan, plant and fungi) in fusion with GFP in CHO cells and examined their subcellular localization. ABCD proteins possessing the NH 2 -terminal H0 motif were localized to peroxisomes, while ABCD proteins lacking this region lost this capacity. In addition, the deletion of the NH 2 -terminal H0 motif of ABCD protein resulted in their localization to the ER. These results suggest that the role of the NH 2 -terminal H0 motif in organelle targeting is widely conserved in living organisms

  5. Discovery of cis-elements between sorghum and rice using co-expression and evolutionary conservation

    Directory of Open Access Journals (Sweden)

    Haberer Georg

    2009-06-01

    Full Text Available Abstract Background The spatiotemporal regulation of gene expression largely depends on the presence and absence of cis-regulatory sites in the promoter. In the economically highly important grass family, our knowledge of transcription factor binding sites and transcriptional networks is still very limited. With the completion of the sorghum genome and the available rice genome sequence, comparative promoter analyses now allow genome-scale detection of conserved cis-elements. Results In this study, we identified thousands of phylogenetic footprints conserved between orthologous rice and sorghum upstream regions that are supported by co-expression information derived from three different rice expression data sets. In a complementary approach, cis-motifs were discovered by their highly conserved co-occurrence in syntenic promoter pairs. Sequence conservation and matches to known plant motifs support our findings. Expression similarities of gene pairs positively correlate with the number of motifs that are shared by gene pairs and corroborate the importance of similar promoter architectures for concerted regulation. This strongly suggests that these motifs function in the regulation of transcript levels in rice and, presumably also in sorghum. Conclusion Our work provides the first large-scale collection of cis-elements for rice and sorghum and can serve as a paradigm for cis-element analysis through comparative genomics in grasses in general.

  6. Disparate requirements for the Walker A and B ATPase motifs ofhuman RAD51D in homologous recombination

    Energy Technology Data Exchange (ETDEWEB)

    Wiese, Claudia; Hinz, John M.; Tebbs, Robert S.; Nham, Peter B.; Urbin, Salustra S.; Collins, David W.; Thompson, Larry H.; Schild, David

    2006-04-21

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C, and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks. Ectopic expression of wild type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  7. Two CRM protein subfamilies cooperate in the splicing of group IIB introns in chloroplasts.

    Science.gov (United States)

    Asakura, Yukari; Bayraktar, Omer Ali; Barkan, Alice

    2008-11-01

    Chloroplast genomes in angiosperms encode approximately 20 group II introns, approximately half of which are classified as subgroup IIB. The splicing of all but one of the subgroup IIB introns requires a heterodimer containing the peptidyl-tRNA hydrolase homolog CRS2 and one of two closely related proteins, CAF1 or CAF2, that harbor a recently recognized RNA binding domain called the CRM domain. Two CRS2/CAF-dependent introns require, in addition, a CRM domain protein called CFM2 that is only distantly related to CAF1 and CAF2. Here, we show that CFM3, a close relative of CFM2, associates in vivo with those CRS2/CAF-dependent introns that are not CFM2 ligands. Mutant phenotypes in rice and Arabidopsis support a role for CFM3 in the splicing of most of the introns with which it associates. These results show that either CAF1 or CAF2 and either CFM2 or CFM3 simultaneously bind most chloroplast subgroup IIB introns in vivo, and that the CAF and CFM subunits play nonredundant roles in splicing. These results suggest that the expansion of the CRM protein family in plants resulted in two subfamilies that play different roles in group II intron splicing, with further diversification within a subfamily to accommodate multiple intron ligands.

  8. Exon sequence requirements for excision in vivo of the bacterial group II intron RmInt1

    Directory of Open Access Journals (Sweden)

    Toro Nicolás

    2011-05-01

    Full Text Available Abstract Background Group II intron splicing proceeds through two sequential transesterification reactions in which the 5' and 3'-exons are joined together and the lariat intron is released. The intron-encoded protein (IEP assists the splicing of the intron in vivo and remains bound to the excised intron lariat RNA in a ribonucleoprotein particle (RNP that promotes intron mobility. Exon recognition occurs through base-pairing interactions between two guide sequences on the ribozyme domain dI known as EBS1 and EBS2 and two stretches of sequence known as IBS1 and IBS2 on the 5' exon, whereas the 3' exon is recognized through interaction with the sequence immediately upstream from EBS1 [(δ-δ' interaction (subgroup IIA] or with a nucleotide [(EBS3-IBS3 interaction (subgroup IIB and IIC] located in the coordination-loop of dI. The δ nucleotide is involved in base pairing with another intron residue (δ' in subgroup IIB introns and this interaction facilitates base pairing between the 5' exon and the intron. Results In this study, we investigated nucleotide requirements in the distal 5'- and 3' exon regions, EBS-IBS interactions and δ-δ' pairing for excision of the group IIB intron RmInt1 in vivo. We found that the EBS1-IBS1 interaction was required and sufficient for RmInt1 excision. In addition, we provide evidence for the occurrence of canonical δ-δ' pairing and its importance for the intron excision in vivo. Conclusions The excision in vivo of the RmInt1 intron is a favored process, with very few constraints for sequence recognition in both the 5' and 3'-exons. Our results contribute to understand how group II introns spread in nature, and might facilitate the use of RmInt1 in gene targeting.

  9. Motif-role-fingerprints: the building-blocks of motifs, clustering-coefficients and transitivities in directed networks.

    Directory of Open Access Journals (Sweden)

    Mark D McDonnell

    Full Text Available Complex networks are frequently characterized by metrics for which particular subgraphs are counted. One statistic from this category, which we refer to as motif-role fingerprints, differs from global subgraph counts in that the number of subgraphs in which each node participates is counted. As with global subgraph counts, it can be important to distinguish between motif-role fingerprints that are 'structural' (induced subgraphs and 'functional' (partial subgraphs. Here we show mathematically that a vector of all functional motif-role fingerprints can readily be obtained from an arbitrary directed adjacency matrix, and then converted to structural motif-role fingerprints by multiplying that vector by a specific invertible conversion matrix. This result demonstrates that a unique structural motif-role fingerprint exists for any given functional motif-role fingerprint. We demonstrate a similar result for the cases of functional and structural motif-fingerprints without node roles, and global subgraph counts that form the basis of standard motif analysis. We also explicitly highlight that motif-role fingerprints are elemental to several popular metrics for quantifying the subgraph structure of directed complex networks, including motif distributions, directed clustering coefficient, and transitivity. The relationships between each of these metrics and motif-role fingerprints also suggest new subtypes of directed clustering coefficients and transitivities. Our results have potential utility in analyzing directed synaptic networks constructed from neuronal connectome data, such as in terms of centrality. Other potential applications include anomaly detection in networks, identification of similar networks and identification of similar nodes within networks. Matlab code for calculating all stated metrics following calculation of functional motif-role fingerprints is provided as S1 Matlab File.

  10. Codon based co-occurrence network motifs in human mitochondria

    Directory of Open Access Journals (Sweden)

    Pramod Shinde

    2017-10-01

    Full Text Available The nucleotide polymorphism in human mitochondrial genome (mtDNA tolled by codon position bias plays an indispensable role in human population dispersion and expansion. Herein, we constructed genome-wide nucleotide co-occurrence networks using a massive data consisting of five different geographical regions and around 3000 samples for each region. We developed a powerful network model to describe complex mitochondrial evolutionary patterns between codon and non-codon positions. It was interesting to report a different evolution of Asian genomes than those of the rest which is divulged by network motifs. We found evidence that mtDNA undergoes substantial amounts of adaptive evolution, a finding which was supported by a number of previous studies. The dominance of higher order motifs indicated the importance of long-range nucleotide co-occurrence in genomic diversity. Most notably, codon motifs apparently underpinned the preferences among codon positions for co-evolution which is probably highly biased during the origin of the genetic code. Our analyses manifested that codon position co-evolution is very well conserved across human sub-populations and independently maintained within human sub-populations implying the selective role of evolutionary processes on codon position co-evolution. Ergo, this study provided a framework to investigate cooperative genomic interactions which are critical in underlying complex mitochondrial evolution.

  11. Functional comparison of three transformer gene introns regulating conditional female lethality

    Science.gov (United States)

    The trasformer gene plays a critical role in the sex determination pathways of many insects. We cloned two transformer gene introns from Anastrepha suspensa, the Caribbean fruit fly. These introns have sequences that putatively have a role in sex-specific splicing patterns that affect sex determinat...

  12. Mobile group II intron based gene targeting in Lactobacillus plantarum WCFS1.

    Science.gov (United States)

    Sasikumar, Ponnusamy; Paul, Eldho; Gomathi, Sivasamy; Abhishek, Albert; Sasikumar, Sundaresan; Selvam, Govindan Sadasivam

    2016-10-01

    The usage of recombinant lactic acid bacteria for delivery of therapeutic proteins to the mucosa has been emerging. In the present study, an attempt was made to engineer a thyA mutant of Lactobacillus plantarum (L. plantarum) using lactococcal group II intron Ll.LtrB for the development of biologically contained recombinant L. plantarum for prevention of calcium oxalate stone disease. The 3 kb Ll.LtrB intron donor cassettes from the source vector pACD4C was PCR amplified, ligated into pSIP series of lactobacillus vector pLp_3050sAmyA, yielding a novel vector pLpACD4C (8.6 kb). The quantitative real-time PCR experiment shows 94-fold increased expression of Ll.LtrB intron and 14-fold increased expression of ltrA gene in recombinant L. plantarum containing pLpACD4C. In order to target the thyA gene, the potential intron RNA binding sites in the thyA gene of L. plantarum was predicted with help of computer algorithm. The insertion location 188|189s of thyA gene (lowest E-0.134) was chosen and the wild type intron Ll.LtrB was PCR modified, yielding a retargeted intron of pLpACDthyA. The retargeted intron was expressed by using induction peptide (sppIP), subsequently the integration of intron in thyA gene was identified by PCR screening and finally ThyA - mutant of L. plantarum (ThyA18) was detected. In vitro growth curve result showed that in the absence of thymidine, colony forming units of mutant ThyA18 was decreased, whereas high thymidine concentration (10 μM) supported the growth of the culture until saturation. In conclusion, ThyA - mutant of L. plantarum (ThyA18) constructed in this study will be used as a biologically contained recombinant probiotic to deliver oxalate decarboxylase into the lumen for treatment of hyperoxaluria and calcium oxalate stone deposition. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  13. Strong Signature of Natural Selection within an FHIT Intron Implicated in Prostate Cancer Risk

    Science.gov (United States)

    Ding, Yan; Larson, Garrett; Rivas, Guillermo; Lundberg, Cathryn; Geller, Louis; Ouyang, Ching; Weitzel, Jeffrey; Archambeau, John; Slater, Jerry; Daly, Mary B.; Benson, Al B.; Kirkwood, John M.; O'Dwyer, Peter J.; Sutphen, Rebecca; Stewart, James A.; Johnson, David; Nordborg, Magnus; Krontiris, Theodore G.

    2008-01-01

    Previously, a candidate gene linkage approach on brother pairs affected with prostate cancer identified a locus of prostate cancer susceptibility at D3S1234 within the fragile histidine triad gene (FHIT), a tumor suppressor that induces apoptosis. Subsequent association tests on 16 SNPs spanning approximately 381 kb surrounding D3S1234 in Americans of European descent revealed significant evidence of association for a single SNP within intron 5 of FHIT. In the current study, re-sequencing and genotyping within a 28.5 kb region surrounding this SNP further delineated the association with prostate cancer risk to a 15 kb region. Multiple SNPs in sequences under evolutionary constraint within intron 5 of FHIT defined several related haplotypes with an increased risk of prostate cancer in European-Americans. Strong associations were detected for a risk haplotype defined by SNPs 138543, 142413, and 152494 in all cases (Pearson's χ2 = 12.34, df 1, P = 0.00045) and for the homozygous risk haplotype defined by SNPs 144716, 142413, and 148444 in cases that shared 2 alleles identical by descent with their affected brothers (Pearson's χ2 = 11.50, df 1, P = 0.00070). In addition to highly conserved sequences encompassing SNPs 148444 and 152413, population studies revealed strong signatures of natural selection for a 1 kb window covering the SNP 144716 in two human populations, the European American (π = 0.0072, Tajima's D = 3.31, 14 SNPs) and the Japanese (π = 0.0049, Fay & Wu's H = 8.05, 14 SNPs), as well as in chimpanzees (Fay & Wu's H = 8.62, 12 SNPs). These results strongly support the involvement of the FHIT intronic region in an increased risk of prostate cancer. PMID:18953408

  14. Cytochrome oxidase subunit II gene in mitochondria of Oenothera has no intron

    Science.gov (United States)

    Hiesel, Rudolf; Brennicke, Axel

    1983-01-01

    The cytochrome oxidase subunit II gene has been localized in the mitochondrial genome of Oenothera berteriana and the nucleotide sequence has been determined. The coding sequence contains 777 bp and, unlike the corresponding gene in Zea mays, is not interrupted by an intron. No TGA codon is found within the open reading frame. The codon CGG, as in the maize gene, is used in place of tryptophan codons of corresponding genes in other organisms. At position 742 in the Oenothera sequence the TGG of maize is changed into a CGG codon, where Trp is conserved as the amino acid in other organisms. Homologous sequences occur more than once in the mitochondrial genome as several mitochondrial DNA species hybridize with DNA probes of the cytochrome oxidase subunit II gene. ImagesFig. 5. PMID:16453484

  15. The group II intron maturase: a reverse transcriptase and splicing factor go hand in hand.

    Science.gov (United States)

    Zhao, Chen; Pyle, Anna Marie

    2017-12-01

    The splicing of group II introns in vivo requires the assistance of a multifunctional intron encoded protein (IEP, or maturase). Each IEP is also a reverse-transcriptase enzyme that enables group II introns to behave as mobile genetic elements. During splicing or retro-transposition, each group II intron forms a tight, specific complex with its own encoded IEP, resulting in a highly reactive holoenzyme. This review focuses on the structural basis for IEP function, as revealed by recent crystal structures of an IEP reverse transcriptase domain and cryo-EM structures of an IEP-intron complex. These structures explain how the same IEP scaffold is utilized for intron recognition, splicing and reverse transcription, while providing a physical basis for understanding the evolutionary transformation of the IEP into the eukaryotic splicing factor Prp8. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. Disparate requirements for the Walker A and B ATPase motifs of human RAD51D in homologous recombination.

    Science.gov (United States)

    Wiese, Claudia; Hinz, John M; Tebbs, Robert S; Nham, Peter B; Urbin, Salustra S; Collins, David W; Thompson, Larry H; Schild, David

    2006-01-01

    In vertebrates, homologous recombinational repair (HRR) requires RAD51 and five RAD51 paralogs (XRCC2, XRCC3, RAD51B, RAD51C and RAD51D) that all contain conserved Walker A and B ATPase motifs. In human RAD51D we examined the requirement for these motifs in interactions with XRCC2 and RAD51C, and for survival of cells in response to DNA interstrand crosslinks (ICLs). Ectopic expression of wild-type human RAD51D or mutants having a non-functional A or B motif was used to test for complementation of a rad51d knockout hamster CHO cell line. Although A-motif mutants complement very efficiently, B-motif mutants do not. Consistent with these results, experiments using the yeast two- and three-hybrid systems show that the interactions between RAD51D and its XRCC2 and RAD51C partners also require a functional RAD51D B motif, but not motif A. Similarly, hamster Xrcc2 is unable to bind to the non-complementing human RAD51D B-motif mutants in co-immunoprecipitation assays. We conclude that a functional Walker B motif, but not A motif, is necessary for RAD51D's interactions with other paralogs and for efficient HRR. We present a model in which ATPase sites are formed in a bipartite manner between RAD51D and other RAD51 paralogs.

  17. Widespread evolutionary conservation of alternatively spliced exons in caenorhabditis

    DEFF Research Database (Denmark)

    Irimia, Manuel; Rukov, Jakob L; Penny, David

    2007-01-01

    Alternative splicing (AS) contributes to increased transcriptome and proteome diversity in various eukaryotic lineages. Previous studies showed low levels of conservation of alternatively spliced (cassette) exons within mammals and within dipterans. We report a strikingly different pattern...... in Caenorhabditis nematodes-more than 92% of cassette exons from Caenorhabditis elegans are conserved in Caenorhabditis briggsae and/or Caenorhabditis remanei. High levels of conservation extend to minor-form exons (present in a minority of transcripts) and are particularly pronounced for exons showing complex...... patterns of splicing. The functionality of the vast majority of cassette exons is underscored by various other features. We suggest that differences in conservation between lineages reflect differences in levels of functionality and further suggest that these differences are due to differences in intron...

  18. The use of the hypervariable P8 region of trnL(UAA intron for identification of orchid species: Evidence from restriction site polymorphism analysis.

    Directory of Open Access Journals (Sweden)

    Rajkumar Kishor

    Full Text Available The P8 stem-loop region of the trnL intron, which is known to be hypervariable in size with multiple repeat motifs and created difficulties in alignment, is always excluded in phylogenetic as well as barcode analyses. This region was investigated for species discrimination in 98 taxa of orchids belonging to the tribe Vandeae using in silico mapping of restriction site polymorphism. The length of the P8 regions varied from 200 nucleotides in Aerides rosea to 669 nucleotides in Dendrophylax sallei. Forty two taxa had unique lengths, while as many as eight shared a common length of 521 nucleotides. Of the 35 restriction endonucleases producing digestions in the P8 regions, three, viz., AgsI, ApoI and TspDTI turned out to have recognition sites across all the 98 taxa being studied. When their restriction data were combined, 92 taxa could be discriminated leaving three taxon pairs. However, Acampe papillosa and Aeranthes arachnites despite having similar restriction sites differed in their P8 lengths. This is the first report on thorough investigation of the P8 region of trnL intron for search of species specific restriction sites and hence its use as a potential plant DNA barcode.

  19. The calmodulin-binding, short linear motif, NSCaTE is conserved in L-type channel ancestors of vertebrate Cav1.2 and Cav1.3 channels.

    Directory of Open Access Journals (Sweden)

    Valentina Taiakina

    Full Text Available NSCaTE is a short linear motif of (xWxxx(I or Lxxxx, composed of residues with a high helix-forming propensity within a mostly disordered N-terminus that is conserved in L-type calcium channels from protostome invertebrates to humans. NSCaTE is an optional, lower affinity and calcium-sensitive binding site for calmodulin (CaM which competes for CaM binding with a more ancient, C-terminal IQ domain on L-type channels. CaM bound to N- and C- terminal tails serve as dual detectors to changing intracellular Ca(2+ concentrations, promoting calcium-dependent inactivation of L-type calcium channels. NSCaTE is absent in some arthropod species, and is also lacking in vertebrate L-type isoforms, Cav1.1 and Cav1.4 channels. The pervasiveness of a methionine just downstream from NSCaTE suggests that L-type channels could generate alternative N-termini lacking NSCaTE through the choice of translational start sites. Long N-terminus with an NSCaTE motif in L-type calcium channel homolog LCav1 from pond snail Lymnaea stagnalis has a faster calcium-dependent inactivation than a shortened N-termini lacking NSCaTE. NSCaTE effects are present in low concentrations of internal buffer (0.5 mM EGTA, but disappears in high buffer conditions (10 mM EGTA. Snail and mammalian NSCaTE have an alpha-helical propensity upon binding Ca(2+-CaM and can saturate both CaM N-terminal and C-terminal domains in the absence of a competing IQ motif. NSCaTE evolved in ancestors of the first animals with internal organs for promoting a more rapid, calcium-sensitive inactivation of L-type channels.

  20. The WSXWS motif in cytokine receptors is a molecular switch involved in receptor activation

    DEFF Research Database (Denmark)

    Dagil, Robert; Knudsen, Maiken J.; Olsen, Johan Gotthardt

    2012-01-01

    The prolactin receptor (PRLR) is activated by binding of prolactin in a 2:1 complex, but the activation mechanism is poorly understood. PRLR has a conserved WSXWS motif generic to cytokine class I receptors. We have determined the nuclear magnetic resonance solution structure of the membrane...

  1. The brown algae Pl.LSU/2 group II intron-encoded protein has functional reverse transcriptase and maturase activities.

    Directory of Open Access Journals (Sweden)

    Madeleine Zerbato

    Full Text Available Group II introns are self-splicing mobile elements found in prokaryotes and eukaryotic organelles. These introns propagate by homing into precise genomic locations, following assembly of a ribonucleoprotein complex containing the intron-encoded protein (IEP and the spliced intron RNA. Engineered group II introns are now commonly used tools for targeted genomic modifications in prokaryotes but not in eukaryotes. We speculate that the catalytic activation of currently known group II introns is limited in eukaryotic cells. The brown algae Pylaiella littoralis Pl.LSU/2 group II intron is uniquely capable of in vitro ribozyme activity at physiological level of magnesium but this intron remains poorly characterized. We purified and characterized recombinant Pl.LSU/2 IEP. Unlike most IEPs, Pl.LSU/2 IEP displayed a reverse transcriptase activity without intronic RNA. The Pl.LSU/2 intron could be engineered to splice accurately in Saccharomyces cerevisiae and splicing efficiency was increased by the maturase activity of the IEP. However, spliced transcripts were not expressed. Furthermore, intron splicing was not detected in human cells. While further tool development is needed, these data provide the first functional characterization of the PI.LSU/2 IEP and the first evidence that the Pl.LSU/2 group II intron splicing occurs in vivo in eukaryotes in an IEP-dependent manner.

  2. Conservation and divergence of ADAM family proteins in the Xenopus genome

    Directory of Open Access Journals (Sweden)

    Shah Anoop

    2010-07-01

    Full Text Available Abstract Background Members of the disintegrin metalloproteinase (ADAM family play important roles in cellular and developmental processes through their functions as proteases and/or binding partners for other proteins. The amphibian Xenopus has long been used as a model for early vertebrate development, but genome-wide analyses for large gene families were not possible until the recent completion of the X. tropicalis genome sequence and the availability of large scale expression sequence tag (EST databases. In this study we carried out a systematic analysis of the X. tropicalis genome and uncovered several interesting features of ADAM genes in this species. Results Based on the X. tropicalis genome sequence and EST databases, we identified Xenopus orthologues of mammalian ADAMs and obtained full-length cDNA clones for these genes. The deduced protein sequences, synteny and exon-intron boundaries are conserved between most human and X. tropicalis orthologues. The alternative splicing patterns of certain Xenopus ADAM genes, such as adams 22 and 28, are similar to those of their mammalian orthologues. However, we were unable to identify an orthologue for ADAM7 or 8. The Xenopus orthologue of ADAM15, an active metalloproteinase in mammals, does not contain the conserved zinc-binding motif and is hence considered proteolytically inactive. We also found evidence for gain of ADAM genes in Xenopus as compared to other species. There is a homologue of ADAM10 in Xenopus that is missing in most mammals. Furthermore, a single scaffold of X. tropicalis genome contains four genes encoding ADAM28 homologues, suggesting genome duplication in this region. Conclusions Our genome-wide analysis of ADAM genes in X. tropicalis revealed both conservation and evolutionary divergence of these genes in this amphibian species. On the one hand, all ADAMs implicated in normal development and health in other species are conserved in X. tropicalis. On the other hand, some

  3. Transcription Factor KLF5 Binds a Cyclin E1 Polymorphic Intronic Enhancer to Confer Increased Bladder Cancer Risk

    Science.gov (United States)

    Pattison, Jillian M.; Posternak, Valeriya; Cole, Michael D.

    2016-01-01

    It is well established that environmental toxins, such as exposure to arsenic, are risk factors in the development of urinary bladder cancer, yet recent genome-wide association studies (GWAS) provide compelling evidence that there is a strong genetic component associated with disease predisposition. A single nucleotide polymorphism (SNP), rs8102137, was identified on chromosome 19q12, residing 6 kb upstream of the important cell cycle regulator and proto-oncogene, Cyclin E1 (CCNE1). However, the functional role of this variant in bladder cancer predisposition has been unclear since it lies within a non-coding region of the genome. Here, it is demonstrated that bladder cancer cells heterozygous for this SNP exhibit biased allelic expression of CCNE1 with 1.5-fold more transcription occurring from the risk allele. Furthermore, using chromatin immunoprecipitation assays, a novel enhancer element was identified within the first intron of CCNE1 that binds Kruppel-like Factor 5 (KLF5), a known transcriptional activator in bladder cancer. Moreover, the data reveal that the presence of rs200996365, a SNP in high linkage disequilibrium with rs8102137 residing in the center of a KLF5 motif, alters KLF5 binding to this genomic region. Through luciferase assays and CRISPR-Cas9 genome editing, a novel polymorphic intronic regulatory element controlling CCNE1 transcription is characterized. These studies uncover how a cancer-associated polymorphism mechanistically contributes to an increased predisposition for bladder cancer development. Implications A polymorphic KLF5 binding site near the CCNE1 gene explains genetic risk identified through genome wide association studies. PMID:27514407

  4. Diversity in mRNA expression of the serine-type carboxypeptidase ocpG in Aspergillus oryzae through intron retention.

    Science.gov (United States)

    Ishida, Ken; Kuboshima, Megumi; Morita, Hiroto; Maeda, Hiroshi; Okamoto, Ayako; Takeuchi, Michio; Yamagata, Youhei

    2014-01-01

    Alternative splicing is thought to be a means for diversification of products by mRNA modification. Although some intron retentions are predicted by transcriptome analysis in Aspergillus oryzae, its physiological significance remains unknown. We found that intron retention occurred occasionally in the serine-type carboxypeptidase gene, ocpG. Analysis under various culture conditions revealed that extracellular nitrogen conditions influence splicing patterns; this suggested that there might be a correlation between splicing efficiency and the necessity of OcpG activity for obtaining a nitrogen source. Since further analysis showed that splicing occurred independently in each intron, we constructed ocpG intron-exchanging strain by interchanging the positions of intron-1 and intron-2. The splicing pattern indicated the probability that ocpG intron retention was affected by the secondary structures of intronic mRNA.

  5. Selection-driven extinction dynamics for group II introns in Enterobacteriales.

    Directory of Open Access Journals (Sweden)

    Sébastien Leclercq

    Full Text Available Transposable elements (TEs are one of the major driving forces of genome evolution, raising the question of the long-term dynamics underlying their evolutionary success. Some TEs were proposed to evolve under a pattern of periodic extinctions-recolonizations, in which elements recurrently invade and quickly proliferate within their host genomes, then start to disappear until total extinction. Depending on the model, TE extinction is assumed to be driven by purifying selection against colonized host genomes (Sel-DE model or by saturation of host genomes (Sat-DE model. Bacterial group II introns are suspected to follow an extinction-recolonization model of evolution, but whether they follow Sel-DE or Sat-DE dynamics is not known. Our analysis of almost 200 group II intron copies from 90 sequenced Enterobacteriales genomes confirms their extinction-recolonization dynamics: patchy element distributions among genera and even among strains within genera, acquisition of new group II introns through plasmids or other mobile genetic elements, and evidence for recent proliferations in some genomes. Distributions of recent and past proliferations and of their respective homing sites further provide strong support for the Sel-DE model, suggesting that group II introns are deleterious to their hosts. Overall, our observations emphasize the critical impact of host properties on TE dynamics.

  6. Motif analysis unveils the possible co-regulation of chloroplast genes and nuclear genes encoding chloroplast proteins.

    Science.gov (United States)

    Wang, Ying; Ding, Jun; Daniell, Henry; Hu, Haiyan; Li, Xiaoman

    2012-09-01

    Chloroplasts play critical roles in land plant cells. Despite their importance and the availability of at least 200 sequenced chloroplast genomes, the number of known DNA regulatory sequences in chloroplast genomes are limited. In this paper, we designed computational methods to systematically study putative DNA regulatory sequences in intergenic regions near chloroplast genes in seven plant species and in promoter sequences of nuclear genes in Arabidopsis and rice. We found that -35/-10 elements alone cannot explain the transcriptional regulation of chloroplast genes. We also concluded that there are unlikely motifs shared by intergenic sequences of most of chloroplast genes, indicating that these genes are regulated differently. Finally and surprisingly, we found five conserved motifs, each of which occurs in no more than six chloroplast intergenic sequences, are significantly shared by promoters of nuclear-genes encoding chloroplast proteins. By integrating information from gene function annotation, protein subcellular localization analyses, protein-protein interaction data, and gene expression data, we further showed support of the functionality of these conserved motifs. Our study implies the existence of unknown nuclear-encoded transcription factors that regulate both chloroplast genes and nuclear genes encoding chloroplast protein, which sheds light on the understanding of the transcriptional regulation of chloroplast genes.

  7. Extensive Mutagenesis of the Conserved Box E Motif in Duck Hepatitis B Virus P Protein Reveals Multiple Functions in Replication and a Common Structure with the Primer Grip in HIV-1 Reverse Transcriptase

    OpenAIRE

    Wang, Yong-Xiang; Luo, Cheng; Zhao, Dan; Beck, Jürgen; Nassal, Michael

    2012-01-01

    Hepadnaviruses, including the pathogenic hepatitis B virus (HBV), replicate their small DNA genomes through protein-primed reverse transcription, mediated by the terminal protein (TP) domain in their P proteins and an RNA stem-loop, ϵ, on the pregenomic RNA (pgRNA). No direct structural data are available for P proteins, but their reverse transcriptase (RT) domains contain motifs that are conserved in all RTs (box A to box G), implying a similar architecture; however, experimental support for...

  8. Deep intronic GPR143 mutation in a Japanese family with ocular albinism.

    Science.gov (United States)

    Naruto, Takuya; Okamoto, Nobuhiko; Masuda, Kiyoshi; Endo, Takao; Hatsukawa, Yoshikazu; Kohmoto, Tomohiro; Imoto, Issei

    2015-06-10

    Deep intronic mutations are often ignored as possible causes of human disease. Using whole-exome sequencing, we analysed genomic DNAs of a Japanese family with two male siblings affected by ocular albinism and congenital nystagmus. Although mutations or copy number alterations of coding regions were not identified in candidate genes, the novel intronic mutation c.659-131 T > G within GPR143 intron 5 was identified as hemizygous in affected siblings and as heterozygous in the unaffected mother. This mutation was predicted to create a cryptic splice donor site within intron 5 and activate a cryptic acceptor site at 41nt upstream, causing the insertion into the coding sequence of an out-of-frame 41-bp pseudoexon with a premature stop codon in the aberrant transcript, which was confirmed by minigene experiments. This result expands the mutational spectrum of GPR143 and suggests the utility of next-generation sequencing integrated with in silico and experimental analyses for improving the molecular diagnosis of this disease.

  9. Exploring the conserved water site and hydration of a coiled-coil trimerisation motif: a MD simulation study.

    Science.gov (United States)

    Dolenc, Jozica; Baron, Riccardo; Missimer, John H; Steinmetz, Michel O; van Gunsteren, Wilfred F

    2008-07-21

    The solvent structure and dynamics around ccbeta-p, a 17-residue peptide that forms a parallel three-stranded alpha-helical coiled coil in solution, was analysed through 10 ns explicit solvent molecular dynamics (MD) simulations at 278 and 330 K. Comparison with two corresponding simulations of the monomeric form of ccbeta-p was used to investigate the changes of hydration upon coiled-coil formation. Pronounced peaks in the solvent density distribution between residues Arg8 and Glu13 of neighbouring helices show the presence of water bridges between the helices of the ccbeta-p trimer; this is in agreement with the water sites observed in X-ray crystallography experiments. Interestingly, this water site is structurally conserved in many three-stranded coiled coils and, together with the Arg and Glu residues, forms part of a motif that determines three-stranded coiled-coil formation. Our findings show that little direct correlation exists between the solvent density distribution and the temporal ordering of water around the trimeric coiled coil. The MD-calculated effective residence times of up to 40 ps show rapid exchange of surface water molecules with the bulk phase, and indicate that the solvent distribution around biomolecules requires interpretation in terms of continuous density distributions rather than in terms of discrete molecules of water. Together, our study contributes to understanding the principles of three-stranded coiled-coil formation.

  10. The MHC motif viewer: a visualization tool for MHC binding motifs

    DEFF Research Database (Denmark)

    Rapin, Nicolas; Hoof, Ilka; Lund, Ole

    2010-01-01

    is hampered by the lack of tools for browsing and comparing specificity of these molecules. We have developed a Web server, MHC Motif Viewer, which allows the display of the binding motif for MHC class I proteins for human, chimpanzee, rhesus monkey, mouse, and swine, as well as HLA-DR protein sequences...

  11. Analysis of Horse Myostatin Gene and Identification of Single Nucleotide Polymorphisms in Breeds of Different Morphological Types

    Directory of Open Access Journals (Sweden)

    Stefania Dall'Olio

    2010-01-01

    Full Text Available Myostatin (MSTN is a negative modulator of muscle mass. We characterized the horse (Equus caballus MSTN gene and identified and analysed single nucleotide polymorphisms (SNPs in breeds of different morphological types. Sequencing of coding, untranslated, intronic, and regulatory regions of MSTN gene in 12 horses from 10 breeds revealed seven SNPs: two in the promoter, four in intron 1, and one in intron 2. The SNPs of the promoter (GQ183900:g.26T>C and GQ183900:g.156T>C, the latter located within a conserved TATA-box like motif were screened in 396 horses from 16 breeds. The g.26C and the g.156C alleles presented higher frequency in heavy (brachymorphic type than in light breeds (dolichomorphic type such as Italian Trotter breed. The significant difference of allele frequencies for the SNPs at the promoter and analysis of molecular variance (AMOVA on haplotypes indicates that these polymorphisms could be associated with variability of morphology traits in horse breeds.

  12. Analysis of Horse Myostatin Gene and Identification of Single Nucleotide Polymorphisms in Breeds of Different Morphological Types

    Science.gov (United States)

    Dall'Olio, Stefania; Fontanesi, Luca; Nanni Costa, Leonardo; Tassinari, Marco; Minieri, Laura; Falaschini, Adalberto

    2010-01-01

    Myostatin (MSTN) is a negative modulator of muscle mass. We characterized the horse (Equus caballus) MSTN gene and identified and analysed single nucleotide polymorphisms (SNPs) in breeds of different morphological types. Sequencing of coding, untranslated, intronic, and regulatory regions of MSTN gene in 12 horses from 10 breeds revealed seven SNPs: two in the promoter, four in intron 1, and one in intron 2. The SNPs of the promoter (GQ183900:g.26T>C and GQ183900:g.156T>C, the latter located within a conserved TATA-box like motif) were screened in 396 horses from 16 breeds. The g.26C and the g.156C alleles presented higher frequency in heavy (brachymorphic type) than in light breeds (dolichomorphic type such as Italian Trotter breed). The significant difference of allele frequencies for the SNPs at the promoter and analysis of molecular variance (AMOVA) on haplotypes indicates that these polymorphisms could be associated with variability of morphology traits in horse breeds. PMID:20706663

  13. A Conserved Acidic Motif in the N-Terminal Domain of Nitrate Reductase Is Necessary for the Inactivation of the Enzyme in the Dark by Phosphorylation and 14-3-3 Binding1

    Science.gov (United States)

    Pigaglio, Emmanuelle; Durand, Nathalie; Meyer, Christian

    1999-01-01

    It has previously been shown that the N-terminal domain of tobacco (Nicotiana tabacum) nitrate reductase (NR) is involved in the inactivation of the enzyme by phosphorylation, which occurs in the dark (L. Nussaume, M. Vincentz, C. Meyer, J.P. Boutin, and M. Caboche [1995] Plant Cell 7: 611–621). The activity of a mutant NR protein lacking this N-terminal domain was no longer regulated by light-dark transitions. In this study smaller deletions were performed in the N-terminal domain of tobacco NR that removed protein motifs conserved among higher plant NRs. The resulting truncated NR-coding sequences were then fused to the cauliflower mosaic virus 35S RNA promoter and introduced in NR-deficient mutants of the closely related species Nicotiana plumbaginifolia. We found that the deletion of a conserved stretch of acidic residues led to an active NR protein that was more thermosensitive than the wild-type enzyme, but it was relatively insensitive to the inactivation by phosphorylation in the dark. Therefore, the removal of this acidic stretch seems to have the same effects on NR activation state as the deletion of the N-terminal domain. A hypothetical explanation for these observations is that a specific factor that impedes inactivation remains bound to the truncated enzyme. A synthetic peptide derived from this acidic protein motif was also found to be a good substrate for casein kinase II. PMID:9880364

  14. Kopi dan Kakao dalam Kreasi Motif Batik Khas Jember

    Directory of Open Access Journals (Sweden)

    Irfa'ina Rohana Salma

    2015-06-01

    Full Text Available ABSTRAK Batik Jember selama ini identik dengan motif daun tembakau. Visualisasi daun tembakau dalam motif Batik Jember cukup lemah, yaitu kurang berkarakter karena motif yang muncul adalah seperti gambar daun pada umumnya. Oleh karena itu perlu diciptakan desain motif batik khas Jember yang sumber inspirasinya digali dari kekayaan alam lainnya dari Jember yang mempunyai bentuk spesifik dan karakteristik sehingga identitas motif bisa didapatkan dengan lebih kuat. Hasil alam khas Jember tersebut adalah kopi dan kakao. Tujuan penciptaan seni ini adalah untuk menghasilkan motif batik  baru yang mempunyai ciri khas Jember. Metode yang digunakan yaitu pengumpulan data, pengamatan mendalam terhadap objek penciptaan, pengkajian sumber inspirasi, pembuatan desain motif, dan perwujudan menjadi batik. Dari penciptaan seni ini berhasil dikreasikan 6 (enam motif batik yaitu: (1 Motif Uwoh Kopi; (2 Motif Godong Kopi;  (3 Motif Ceplok Kakao; (4 Motif Kakao Raja; (5 Motif Kakao Biru; dan (6 Motif Wiji Mukti. Berdasarkan hasil penilaian “Selera Estetika” diketahui bahwa motif yang paling banyak disukai adalah Motif Uwoh Kopi dan Motif Kakao Raja. Kata kunci: Motif Woh Kopi, Motif Godong Kopi, Motif Ceplok Kakao, Motif Kakao Raja, Motif Kakao Biru, Motif Wiji Mukti ABSTRACTBatik Jember is synonymous with tobacco leaf motif. Tobacco leaf shape is quite weak in the visual appearance characterized as that motif emerges like a picture of leaves in general. Therefore, it is necessary to create a distinctive design motif extracted from other natural resources of Jember that have specific shapes and characteristics that can be obtained as the stronger motif identity. The typical natural resources from Jember are coffee and cocoa. The purpose of the creation of this art is to produce the unique, creative and innovative batik and have specific characteristics of Jember. The method used are data collection, observation of the object, reviewing inspiration sources

  15. The KYxxL motif in Rad17 protein is essential for the interaction with the 9–1–1 complex

    Energy Technology Data Exchange (ETDEWEB)

    Fukumoto, Yasunori, E-mail: fukumoto@faculty.chiba-u.jp [Laboratory of Molecular Cell Biology, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba 260-8675 (Japan); Ikeuchi, Masayoshi; Nakayama, Yuji [Department of Biochemistry & Molecular Biology, Kyoto Pharmaceutical University, Kyoto 607-8414 (Japan); Yamaguchi, Naoto, E-mail: nyama@faculty.chiba-u.jp [Laboratory of Molecular Cell Biology, Graduate School of Pharmaceutical Sciences, Chiba University, Chiba 260-8675 (Japan)

    2016-09-02

    ATR-dependent DNA damage checkpoint is the major DNA damage checkpoint against UV irradiation and DNA replication stress. The Rad17–RFC and Rad9–Rad1–Hus1 (9–1–1) complexes interact with each other to contribute to ATR signaling, however, the precise regulatory mechanism of the interaction has not been established. Here, we identified a conserved sequence motif, KYxxL, in the AAA+ domain of Rad17 protein, and demonstrated that this motif is essential for the interaction with the 9–1–1 complex. We also show that UV-induced Rad17 phosphorylation is increased in the Rad17 KYxxL mutants. These data indicate that the interaction with the 9–1–1 complex is not required for Rad17 protein to be an efficient substrate for the UV-induced phosphorylation. Our data also raise the possibility that the 9–1–1 complex plays a negative regulatory role in the Rad17 phosphorylation. We also show that the nucleotide-binding activity of Rad17 is required for its nuclear localization. - Highlights: • We have identified a conserved KYxxL motif in Rad17 protein. • The KYxxL motif is crucial for the interaction with the 9–1–1 complex. • The KYxxL motif is dispensable or inhibitory for UV-induced Rad17 phosphorylation. • Nucleotide binding of Rad17 is required for its nuclear localization.

  16. Loop 7 of E2 enzymes: an ancestral conserved functional motif involved in the E2-mediated steps of the ubiquitination cascade.

    Directory of Open Access Journals (Sweden)

    Elena Papaleo

    Full Text Available The ubiquitin (Ub system controls almost every aspect of eukaryotic cell biology. Protein ubiquitination depends on the sequential action of three classes of enzymes (E1, E2 and E3. E2 Ub-conjugating enzymes have a central role in the ubiquitination pathway, interacting with both E1 and E3, and influencing the ultimate fate of the substrates. Several E2s are characterized by an extended acidic insertion in loop 7 (L7, which if mutated is known to impair the proper E2-related functions. In the present contribution, we show that acidic loop is a conserved ancestral motif in E2s, relying on the presence of alternate hydrophobic and acidic residues. Moreover, the dynamic properties of a subset of family 3 E2s, as well as their binary and ternary complexes with Ub and the cognate E3, have been investigated. Here we provide a model of L7 role in the different steps of the ubiquitination cascade of family 3 E2s. The L7 hydrophobic residues turned out to be the main determinant for the stabilization of the E2 inactive conformations by a tight network of interactions in the catalytic cleft. Moreover, phosphorylation is known from previous studies to promote E2 competent conformations for Ub charging, inducing electrostatic repulsion and acting on the L7 acidic residues. Here we show that these active conformations are stabilized by a network of hydrophobic interactions between L7 and L4, the latter being a conserved interface for E3-recruitment in several E2s. In the successive steps, L7 conserved acidic residues also provide an interaction interface for both Ub and the Rbx1 RING subdomain of the cognate E3. Our data therefore suggest a crucial role for L7 of family 3 E2s in all the E2-mediated steps of the ubiquitination cascade. Its different functions are exploited thank to its conserved hydrophobic and acidic residues in a finely orchestrate mechanism.

  17. BLSSpeller: exhaustive comparative discovery of conserved cis-regulatory elements.

    Science.gov (United States)

    De Witte, Dieter; Van de Velde, Jan; Decap, Dries; Van Bel, Michiel; Audenaert, Pieter; Demeester, Piet; Dhoedt, Bart; Vandepoele, Klaas; Fostier, Jan

    2015-12-01

    The accurate discovery and annotation of regulatory elements remains a challenging problem. The growing number of sequenced genomes creates new opportunities for comparative approaches to motif discovery. Putative binding sites are then considered to be functional if they are conserved in orthologous promoter sequences of multiple related species. Existing methods for comparative motif discovery usually rely on pregenerated multiple sequence alignments, which are difficult to obtain for more diverged species such as plants. As a consequence, misaligned regulatory elements often remain undetected. We present a novel algorithm that supports both alignment-free and alignment-based motif discovery in the promoter sequences of related species. Putative motifs are exhaustively enumerated as words over the IUPAC alphabet and screened for conservation using the branch length score. Additionally, a confidence score is established in a genome-wide fashion. In order to take advantage of a cloud computing infrastructure, the MapReduce programming model is adopted. The method is applied to four monocotyledon plant species and it is shown that high-scoring motifs are significantly enriched for open chromatin regions in Oryza sativa and for transcription factor binding sites inferred through protein-binding microarrays in O.sativa and Zea mays. Furthermore, the method is shown to recover experimentally profiled ga2ox1-like KN1 binding sites in Z.mays. BLSSpeller was written in Java. Source code and manual are available at http://bioinformatics.intec.ugent.be/blsspeller Klaas.Vandepoele@psb.vib-ugent.be or jan.fostier@intec.ugent.be. Supplementary data are available at Bioinformatics online. © The Author 2015. Published by Oxford University Press.

  18. A distant cis acting intronic element induces site-selective RNA editing

    DEFF Research Database (Denmark)

    Daniel, Chammiran; Venø, Morten Trillingsgaard; Ekdahl, Ylva

    2012-01-01

    Transcripts have been found to be site selectively edited from adenosine-to-inosine (A-to-I) in the mammalian brain, mostly in genes involved in neurotransmission. While A-to-I editing occurs at double-stranded structures, other structural requirements are largely unknown. We have investigated...... shown to be important for A-to-I editing. We demonstrate that the element also can induce editing in related but normally not edited RNA sequences. In human, thousands of genes are edited in duplexes formed by inverted repeats in non-coding regions. It is likely that numerous such duplexes can induce...... the requirements for editing at the I/M site in the Gabra-3 transcript of the GABA(A) receptor. We identify an evolutionarily conserved intronic duplex, 150 nt downstream of the exonic hairpin where the I/M site resides, which is required for its editing. This is the first time a distant RNA structure has been...

  19. CombiMotif: A new algorithm for network motifs discovery in protein-protein interaction networks

    Science.gov (United States)

    Luo, Jiawei; Li, Guanghui; Song, Dan; Liang, Cheng

    2014-12-01

    Discovering motifs in protein-protein interaction networks is becoming a current major challenge in computational biology, since the distribution of the number of network motifs can reveal significant systemic differences among species. However, this task can be computationally expensive because of the involvement of graph isomorphic detection. In this paper, we present a new algorithm (CombiMotif) that incorporates combinatorial techniques to count non-induced occurrences of subgraph topologies in the form of trees. The efficiency of our algorithm is demonstrated by comparing the obtained results with the current state-of-the art subgraph counting algorithms. We also show major differences between unicellular and multicellular organisms. The datasets and source code of CombiMotif are freely available upon request.

  20. Solution NMR characterization of Sgf73(1-104) indicates that Zn ion is required to stabilize zinc finger motif

    International Nuclear Information System (INIS)

    Lai, Chaohua; Wu, Minhao; Li, Pan; Shi, Chaowei; Tian, Changlin; Zang, Jianye

    2010-01-01

    Zinc finger motif contains a zinc ion coordinated by several conserved amino acid residues. Yeast Sgf73 protein was identified as a component of SAGA (Spt/Ada/Gcn5 acetyltransferase) multi-subunit complex and Sgf73 protein was known to contain two zinc finger motifs. Sgf73(1-104), containing the first zinc finger motif, was necessary to modulate the deubiquitinase activity of SAGA complex. Here, Sgf73(1-104) was over-expressed using bacterial expression system and purified for solution NMR (nuclear magnetic resonance) structural studies. Secondary structure and site-specific relaxation analysis of Sgf73(1-104) were achieved after solution NMR backbone assignment. Solution NMR and circular dichroism analysis of Sgf73(1-104) after zinc ion removal using chelation reagent EDTA (ethylene-diamine-tetraacetic acid) demonstrated that zinc ion was required to maintain stable conformation of the zinc finger motif.

  1. Functional understanding of the diverse exon-intron structures of human GPCR genes.

    Science.gov (United States)

    Hammond, Dorothy A; Olman, Victor; Xu, Ying

    2014-02-01

    The GPCR genes have a variety of exon-intron structures even though their proteins are all structurally homologous. We have examined all human GPCR genes with at least two functional protein isoforms, totaling 199, aiming to gain an understanding of what may have contributed to the large diversity of the exon-intron structures of the GPCR genes. The 199 genes have a total of 808 known protein splicing isoforms with experimentally verified functions. Our analysis reveals that 1301 (80.6%) adjacent exon-exon pairs out of the total of 1,613 in the 199 genes have either exactly one exon skipped or the intron in-between retained in at least one of the 808 protein splicing isoforms. This observation has a statistical significance p-value of 2.051762 * e(-09), assuming that the observed splicing isoforms are independent of the exon-intron structures. Our interpretation of this observation is that the exon boundaries of the GPCR genes are not randomly determined; instead they may be selected to facilitate specific alternative splicing for functional purposes.

  2. Exploiting publicly available biological and biochemical information for the discovery of novel short linear motifs.

    KAUST Repository

    Sayadi, Ahmed

    2011-07-20

    The function of proteins is often mediated by short linear segments of their amino acid sequence, called Short Linear Motifs or SLiMs, the identification of which can provide important information about a protein function. However, the short length of the motifs and their variable degree of conservation makes their identification hard since it is difficult to correctly estimate the statistical significance of their occurrence. Consequently, only a small fraction of them have been discovered so far. We describe here an approach for the discovery of SLiMs based on their occurrence in evolutionarily unrelated proteins belonging to the same biological, signalling or metabolic pathway and give specific examples of its effectiveness in both rediscovering known motifs and in discovering novel ones. An automatic implementation of the procedure, available for download, allows significant motifs to be identified, automatically annotated with functional, evolutionary and structural information and organized in a database that can be inspected and queried. An instance of the database populated with pre-computed data on seven organisms is accessible through a publicly available server and we believe it constitutes by itself a useful resource for the life sciences (http://www.biocomputing.it/modipath).

  3. Nuclear import of influenza B virus nucleoprotein: Involvement of an N-terminal nuclear localization signal and a cleavage-protection motif

    International Nuclear Information System (INIS)

    Wanitchang, Asawin; Narkpuk, Jaraspim; Jongkaewwattana, Anan

    2013-01-01

    The nucleoprotein of influenza B virus (BNP) shares several characteristics with its influenza A virus counterpart (ANP), including localization in the host's nucleus. However, while the nuclear localization signal(s) (NLS) of ANP are well characterized, little is known about those of BNP. In this study, we showed that the fusion protein bearing the BNP N-terminus fused with GFP (N70–GFP) is exclusively nuclear, and identified a highly conserved KRXR motif spanning residues 44–47 as a putative NLS. In addition, we demonstrated that residues 3–15 of BNP, though not an NLS, are also crucial for nuclear import. Results from mutational analyses of N70–GFP and the full-length BNP suggest that this region may be required for protection of the N-terminus from proteolytic cleavage. Altogether, we propose that the N-terminal region of BNP contains the NLS and cleavage-protection motif, which together drive its nuclear localization. - Highlights: • The N-terminal region of BNP is required for nuclear accumulation. • The conserved motif at position 44–47 is a putative nuclear localization signal. • The first 15 amino acids of BNP may function as a cleavage-protection motif. • BNP may get access to the nucleus via a mechanism distinct from ANP

  4. Nuclear import of influenza B virus nucleoprotein: Involvement of an N-terminal nuclear localization signal and a cleavage-protection motif

    Energy Technology Data Exchange (ETDEWEB)

    Wanitchang, Asawin; Narkpuk, Jaraspim; Jongkaewwattana, Anan, E-mail: anan.jon@biotec.or.th

    2013-08-15

    The nucleoprotein of influenza B virus (BNP) shares several characteristics with its influenza A virus counterpart (ANP), including localization in the host's nucleus. However, while the nuclear localization signal(s) (NLS) of ANP are well characterized, little is known about those of BNP. In this study, we showed that the fusion protein bearing the BNP N-terminus fused with GFP (N70–GFP) is exclusively nuclear, and identified a highly conserved KRXR motif spanning residues 44–47 as a putative NLS. In addition, we demonstrated that residues 3–15 of BNP, though not an NLS, are also crucial for nuclear import. Results from mutational analyses of N70–GFP and the full-length BNP suggest that this region may be required for protection of the N-terminus from proteolytic cleavage. Altogether, we propose that the N-terminal region of BNP contains the NLS and cleavage-protection motif, which together drive its nuclear localization. - Highlights: • The N-terminal region of BNP is required for nuclear accumulation. • The conserved motif at position 44–47 is a putative nuclear localization signal. • The first 15 amino acids of BNP may function as a cleavage-protection motif. • BNP may get access to the nucleus via a mechanism distinct from ANP.

  5. Conservation of the glycoprotein B homologs of the Kaposi’s sarcoma-associated herpesvirus (KSHV/HHV8) and Old World primate rhadinoviruses of chimpanzees and macaques

    Science.gov (United States)

    Bruce, A. Gregory; Horst, Jeremy A.; Rose, Timothy M.

    2016-01-01

    The envelope-associated glycoprotein B (gB) is highly conserved within the Herpesviridae and plays a critical role in viral entry. We analyzed the evolutionary conservation of sequence and structural motifs within the Kaposi’s sarcoma-associated herpesvirus (KSHV) gB and homologs of Old World primate rhadinoviruses belonging to the distinct RV1 and RV2 rhadinovirus lineages. In addition to gB homologs of rhadinoviruses infecting the pig-tailed and rhesus macaques, we cloned and sequenced gB homologs of RV1 and RV2 rhadinoviruses infecting chimpanzees. A structural model of the KSHV gB was determined, and functional motifs and sequence variants were mapped to the model structure. Conserved domains and motifs were identified, including an “RGD” motif that plays a critical role in KSHV binding and entry through the cellular integrin αVβ3. The RGD motif was only detected in RV1 rhadinoviruses suggesting an important difference in cell tropism between the two rhadinovirus lineages. PMID:27070755

  6. The NTP-binding motif in cowpea mosaic virus B polyprotein is essential for viral replication

    NARCIS (Netherlands)

    Peters, S A; Verver, J; Nollen, E A; van Lent, J W; Wellink, J; van Kammen, A

    1994-01-01

    We have assessed the functional importance of the NTP-binding motif (NTBM) in the cowpea mosaic virus (CPMV) B-RNA-encoded 58K domain by changing two conserved amino acids within the consensus A and B sites (GKSRTGK500S and MDD545, respectively). Both Lys-500 to Thr and Asp-545 to Pro substitutions

  7. Mutational analysis of the RecJ exonuclease of Escherichia coli: identification of phosphoesterase motifs.

    Science.gov (United States)

    Sutera, V A; Han, E S; Rajman, L A; Lovett, S T

    1999-10-01

    The recJ gene, identified in Escherichia coli, encodes a Mg(+2)-dependent 5'-to-3' exonuclease with high specificity for single-strand DNA. Genetic and biochemical experiments implicate RecJ exonuclease in homologous recombination, base excision, and methyl-directed mismatch repair. Genes encoding proteins with strong similarities to RecJ have been found in every eubacterial genome sequenced to date, with the exception of Mycoplasma and Mycobacterium tuberculosis. Multiple genes encoding proteins similar to RecJ are found in some eubacteria, including Bacillus and Helicobacter, and in the archaea. Among this divergent set of sequences, seven conserved motifs emerge. We demonstrate here that amino acids within six of these motifs are essential for both the biochemical and genetic functions of E. coli RecJ. These motifs may define interactions with Mg(2+) ions or substrate DNA. A large family of proteins more distantly related to RecJ is present in archaea, eubacteria, and eukaryotes, including a hypothetical protein in the MgPa adhesin operon of Mycoplasma, a domain of putative polyA polymerases in Synechocystis and Aquifex, PRUNE of Drosophila, and an exopolyphosphatase (PPX1) of Saccharomyces cereviseae. Because these six RecJ motifs are shared between exonucleases and exopolyphosphatases, they may constitute an ancient phosphoesterase domain now found in all kingdoms of life.

  8. Powdery mildew fungal effector candidates share N-terminal Y/F/WxC-motif

    Directory of Open Access Journals (Sweden)

    Emmersen Jeppe

    2010-05-01

    Full Text Available Abstract Background Powdery mildew and rust fungi are widespread, serious pathogens that depend on developing haustoria in the living plant cells. Haustoria are separated from the host cytoplasm by a plant cell-derived extrahaustorial membrane. They secrete effector proteins, some of which are subsequently transferred across this membrane to the plant cell to suppress defense. Results In a cDNA library from barley epidermis containing powdery mildew haustoria, two-thirds of the sequenced ESTs were fungal and represented ~3,000 genes. Many of the most highly expressed genes encoded small proteins with N-terminal signal peptides. While these proteins are novel and poorly related, they do share a three-amino acid motif, which we named "Y/F/WxC", in the N-terminal of the mature proteins. The first amino acid of this motif is aromatic: tyrosine, phenylalanine or tryptophan, and the last is always cysteine. In total, we identified 107 such proteins, for which the ESTs represent 19% of the fungal clones in our library, suggesting fundamental roles in haustoria function. While overall sequence similarity between the powdery mildew Y/F/WxC-proteins is low, they do have a highly similar exon-intron structure, suggesting they have a common origin. Interestingly, searches of public fungal genome and EST databases revealed that haustoria-producing rust fungi also encode large numbers of novel, short proteins with signal peptides and the Y/F/WxC-motif. No significant numbers of such proteins were identified from genome and EST sequences from either fungi which do not produce haustoria or from haustoria-producing Oomycetes. Conclusion In total, we identified 107, 178 and 57 such Y/F/WxC-proteins from the barley powdery mildew, the wheat stem rust and the wheat leaf rust fungi, respectively. All together, our findings suggest the Y/F/WxC-proteins to be a new class of effectors from haustoria-producing pathogenic fungi.

  9. Role of conserved cysteine residues in Herbaspirillum seropedicae NifA activity.

    Science.gov (United States)

    Oliveira, Marco A S; Baura, Valter A; Aquino, Bruno; Huergo, Luciano F; Kadowaki, Marco A S; Chubatsu, Leda S; Souza, Emanuel M; Dixon, Ray; Pedrosa, Fábio O; Wassem, Roseli; Monteiro, Rose A

    2009-01-01

    Herbaspirillum seropedicae is an endophytic diazotrophic bacterium that associates with economically important crops. NifA protein, the transcriptional activator of nif genes in H. seropedicae, binds to nif promoters and, together with RNA polymerase-sigma(54) holoenzyme, catalyzes the formation of open complexes to allow transcription initiation. The activity of H. seropedicae NifA is controlled by ammonium and oxygen levels, but the mechanisms of such control are unknown. Oxygen sensitivity is attributed to a conserved motif of cysteine residues in NifA that spans the central AAA+ domain and the interdomain linker that connects the AAA+ domain to the C-terminal DNA binding domain. Here we mutagenized this conserved motif of cysteines and assayed the activity of mutant proteins in vivo. We also purified the mutant variants of NifA and tested their capacity to bind to the nifB promoter region. Chimeric proteins between H. seropedicae NifA, an oxygen-sensitive protein, and Azotobacter vinelandii NifA, an oxygen-tolerant protein, were constructed and showed that the oxygen response is conferred by the central AAA+ and C-terminal DNA binding domains of H. seropedicae NifA. We conclude that the conserved cysteine motif is essential for NifA activity, although single cysteine-to-serine mutants are still competent at binding DNA.

  10. Statistical tests to compare motif count exceptionalities

    Directory of Open Access Journals (Sweden)

    Vandewalle Vincent

    2007-03-01

    Full Text Available Abstract Background Finding over- or under-represented motifs in biological sequences is now a common task in genomics. Thanks to p-value calculation for motif counts, exceptional motifs are identified and represent candidate functional motifs. The present work addresses the related question of comparing the exceptionality of one motif in two different sequences. Just comparing the motif count p-values in each sequence is indeed not sufficient to decide if this motif is significantly more exceptional in one sequence compared to the other one. A statistical test is required. Results We develop and analyze two statistical tests, an exact binomial one and an asymptotic likelihood ratio test, to decide whether the exceptionality of a given motif is equivalent or significantly different in two sequences of interest. For that purpose, motif occurrences are modeled by Poisson processes, with a special care for overlapping motifs. Both tests can take the sequence compositions into account. As an illustration, we compare the octamer exceptionalities in the Escherichia coli K-12 backbone versus variable strain-specific loops. Conclusion The exact binomial test is particularly adapted for small counts. For large counts, we advise to use the likelihood ratio test which is asymptotic but strongly correlated with the exact binomial test and very simple to use.

  11. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d

    Directory of Open Access Journals (Sweden)

    Moffatt Barbara A

    2010-08-01

    Full Text Available Abstract Background Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB for coplanar aromatic motifs similar to those found in known glycan-binding proteins. Results The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192 in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Conclusions Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.

  12. Structural motif screening reveals a novel, conserved carbohydrate-binding surface in the pathogenesis-related protein PR-5d.

    Science.gov (United States)

    Doxey, Andrew C; Cheng, Zhenyu; Moffatt, Barbara A; McConkey, Brendan J

    2010-08-03

    Aromatic amino acids play a critical role in protein-glycan interactions. Clusters of surface aromatic residues and their features may therefore be useful in distinguishing glycan-binding sites as well as predicting novel glycan-binding proteins. In this work, a structural bioinformatics approach was used to screen the Protein Data Bank (PDB) for coplanar aromatic motifs similar to those found in known glycan-binding proteins. The proteins identified in the screen were significantly associated with carbohydrate-related functions according to gene ontology (GO) enrichment analysis, and predicted motifs were found frequently within novel folds and glycan-binding sites not included in the training set. In addition to numerous binding sites predicted in structural genomics proteins of unknown function, one novel prediction was a surface motif (W34/W36/W192) in the tobacco pathogenesis-related protein, PR-5d. Phylogenetic analysis revealed that the surface motif is exclusive to a subfamily of PR-5 proteins from the Solanaceae family of plants, and is absent completely in more distant homologs. To confirm PR-5d's insoluble-polysaccharide binding activity, a cellulose-pulldown assay of tobacco proteins was performed and PR-5d was identified in the cellulose-binding fraction by mass spectrometry. Based on the combined results, we propose that the putative binding site in PR-5d may be an evolutionary adaptation of Solanaceae plants including potato, tomato, and tobacco, towards defense against cellulose-containing pathogens such as species of the deadly oomycete genus, Phytophthora. More generally, the results demonstrate that coplanar aromatic clusters on protein surfaces are a structural signature of glycan-binding proteins, and can be used to computationally predict novel glycan-binding proteins from 3 D structure.

  13. [Genome-wide identification and bioinformatic analysis of PPR gene family in tomato].

    Science.gov (United States)

    Ding, Anming; Li, Ling; Qu, Xu; Sun, Tingting; Chen, Yaqiong; Zong, Peng; Li, Zunqiang; Gong, Daping; Sun, Yuhe

    2014-01-01

    Pentatricopeptide repeats (PPRs) genes constitute one of the largest gene families in plants, which play a broad and essential role in plant growth and development. In this study, the protein sequences annotated by the tomato (S. lycopersicum L.) genome project were screened with the Pfam PPR sequences. A total of 471 putative PPR-encoding genes were identified. Based on the motifs defined in A. thaliana L., protein structure and conserved sequences for each tomato motif were analyzed. We also analyzed phylogenetic relationship, subcellular localization, expression and GO analysis of the identified gene sequences. Our results demonstrate that tomato PPR gene family contains two subfamilies, P and PLS, each accounting for half of the family. PLS subfamily can be divided into four subclasses i.e., PLS, E, E+ and DYW. Each subclass of sequences forms a clade in the phylogenetic tree. The PPR motifs were found highly conserved among plants. The tomato PPR genes were distributed over 12 chromosomes and most of them lack introns. The majority of PPR proteins harbor mitochondrial or chloroplast localization sequences, whereas GO analysis showed that most PPR proteins participate in RNA-related biological processes.

  14. Molecular and functional characterization of a novel gonadotropin-releasing-hormone receptor isolated from the common octopus (Octopus vulgaris)

    OpenAIRE

    Kanda, Atsuhiro; Takahashi, Toshio; Satake, Honoo; Minakata, Hiroyuki

    2006-01-01

    GnRH (gonadotropin-releasing hormone) plays a pivotal role in the regulation of reproduction in vertebrates through interaction with a specific receptor. Previously, we isolated a GnRH homo-logue, oct-GnRH, from the common octopus (Octopus vulgaris). In the present study, we have identified a GnRH receptor (oct-GnRHR) specific for oct-GnRH from Octopus brain. Oct-GnRHR includes domains and motifs typical of vertebrate GnRH receptors. The intron-inserted positions are conserved between oct-GnR...

  15. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence

    Science.gov (United States)

    Gordon, Kacy L.; Arthur, Robert K.; Ruvinsky, Ilya

    2015-01-01

    Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2) from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements. PMID:26020930

  16. Phylum-Level Conservation of Regulatory Information in Nematodes despite Extensive Non-coding Sequence Divergence.

    Directory of Open Access Journals (Sweden)

    Kacy L Gordon

    2015-05-01

    Full Text Available Gene regulatory information guides development and shapes the course of evolution. To test conservation of gene regulation within the phylum Nematoda, we compared the functions of putative cis-regulatory sequences of four sets of orthologs (unc-47, unc-25, mec-3 and elt-2 from distantly-related nematode species. These species, Caenorhabditis elegans, its congeneric C. briggsae, and three parasitic species Meloidogyne hapla, Brugia malayi, and Trichinella spiralis, represent four of the five major clades in the phylum Nematoda. Despite the great phylogenetic distances sampled and the extensive sequence divergence of nematode genomes, all but one of the regulatory elements we tested are able to drive at least a subset of the expected gene expression patterns. We show that functionally conserved cis-regulatory elements have no more extended sequence similarity to their C. elegans orthologs than would be expected by chance, but they do harbor motifs that are important for proper expression of the C. elegans genes. These motifs are too short to be distinguished from the background level of sequence similarity, and while identical in sequence they are not conserved in orientation or position. Functional tests reveal that some of these motifs contribute to proper expression. Our results suggest that conserved regulatory circuitry can persist despite considerable turnover within cis elements.

  17. Functional identification of conserved residues involved in Lactobacillus rhamnosus strain GG sortase specificity and pilus biogenesis.

    Science.gov (United States)

    Douillard, François P; Rasinkangas, Pia; von Ossowski, Ingemar; Reunanen, Justus; Palva, Airi; de Vos, Willem M

    2014-05-30

    In Gram-positive bacteria, sortase-dependent pili mediate the adhesion of bacteria to host epithelial cells and play a pivotal role in colonization, host signaling, and biofilm formation. Lactobacillus rhamnosus strain GG, a well known probiotic bacterium, also displays on its cell surface mucus-binding pilus structures, along with other LPXTG surface proteins, which are processed by sortases upon specific recognition of a highly conserved LPXTG motif. Bioinformatic analysis of all predicted LPXTG proteins encoded by the L. rhamnosus GG genome revealed a remarkable conservation of glycine residues juxtaposed to the canonical LPXTG motif. Here, we investigated and defined the role of this so-called triple glycine (TG) motif in determining sortase specificity during the pilus assembly and anchoring. Mutagenesis of the TG motif resulted in a lack or an alteration of the L. rhamnosus GG pilus structures, indicating that the TG motif is critical in pilus assembly and that they govern the pilin-specific and housekeeping sortase specificity. This allowed us to propose a regulatory model of the L. rhamnosus GG pilus biogenesis. Remarkably, the TG motif was identified in multiple pilus gene clusters of other Gram-positive bacteria, suggesting that similar signaling mechanisms occur in other, mainly pathogenic, species. © 2014 by The American Society for Biochemistry and Molecular Biology, Inc.

  18. The valine and lysine residues in the conserved FxVTxK motif are important for the function of phylogenetically distant plant cellulose synthases

    Energy Technology Data Exchange (ETDEWEB)

    Slabaugh, Erin; Scavuzzo-Duggan, Tess; Chaves, Arielle; Wilson, Liza; Wilson, Carmen; Davis, Jonathan K.; Cosgrove, Daniel J.; Anderson, Charles T.; Roberts, Alison W.; Haigler, Candace H.

    2015-12-08

    Cellulose synthases (CESAs) synthesize the β-1,4-glucan chains that coalesce to form cellulose microfibrils in plant cell walls. In addition to a large cytosolic (catalytic) domain, CESAs have eight predicted transmembrane helices (TMHs). However, analogous to the structure of BcsA, a bacterial CESA, predicted TMH5 in CESA may instead be an interfacial helix. This would place the conserved FxVTxK motif in the plant cell cytosol where it could function as a substrate-gating loop as occurs in BcsA. To define the functional importance of the CESA region containing FxVTxK, we tested five parallel mutations in Arabidopsis thaliana CESA1 and Physcomitrella patens CESA5 in complementation assays of the relevant cesa mutants. In both organisms, the substitution of the valine or lysine residues in FxVTxK severely affected CESA function. In Arabidopsis roots, both changes were correlated with lower cellulose anisotropy, as revealed by Pontamine Fast Scarlet. Analysis of hypocotyl inner cell wall layers by atomic force microscopy showed that two altered versions of Atcesa1 could rescue cell wall phenotypes observed in the mutant background line. Overall, the data show that the FxVTxK motif is functionally important in two phylogenetically distant plant CESAs. The results show that Physcomitrella provides an efficient model for assessing the effects of engineered CESA mutations affecting primary cell wall synthesis and that diverse testing systems can lead to nuanced insights into CESA structure–function relationships. Although CESA membrane topology needs to be experimentally determined, the results support the possibility that the FxVTxK region functions similarly in CESA and BcsA.

  19. Bayesian phylogeny analysis of vertebrate serpins illustrates evolutionary conservation of the intron and indels based six groups classification system from lampreys for ∼500 MY

    Directory of Open Access Journals (Sweden)

    Abhishek Kumar

    2015-06-01

    Full Text Available The serpin superfamily is characterized by proteins that fold into a conserved tertiary structure and exploits a sophisticated and irreversible suicide-mechanism of inhibition. Vertebrate serpins are classified into six groups (V1–V6, based on three independent biological features—genomic organization, diagnostic amino acid sites and rare indels. However, this classification system was based on the limited number of mammalian genomes available. In this study, several non-mammalian genomes are used to validate this classification system using the powerful Bayesian phylogenetic method. This method supports the intron and indel based vertebrate classification and proves that serpins have been maintained from lampreys to humans for about 500 MY. Lampreys have fewer than 10 serpins, which expand into 36 serpins in humans. The two expanding groups V1 and V2 have SERPINB1/SERPINB6 and SERPINA8/SERPIND1 as the ancestral serpins, respectively. Large clusters of serpins are formed by local duplications of these serpins in tetrapod genomes. Interestingly, the ancestral HCII/SERPIND1 locus (nested within PIK4CA possesses group V4 serpin (A2APL1, homolog of α2-AP/SERPINF2 of lampreys; hence, pointing to the fact that group V4 might have originated from group V2. Additionally in this study, details of the phylogenetic history and genomic characteristics of vertebrate serpins are revisited.

  20. Conservation of the human integrin-type beta-propeller domain in bacteria.

    Directory of Open Access Journals (Sweden)

    Bhanupratap Chouhan

    Full Text Available Integrins are heterodimeric cell-surface receptors with key functions in cell-cell and cell-matrix adhesion. Integrin α and β subunits are present throughout the metazoans, but it is unclear whether the subunits predate the origin of multicellular organisms. Several component domains have been detected in bacteria, one of which, a specific 7-bladed β-propeller domain, is a unique feature of the integrin α subunits. Here, we describe a structure-derived motif, which incorporates key features of each blade from the X-ray structures of human αIIbβ3 and αVβ3, includes elements of the FG-GAP/Cage and Ca(2+-binding motifs, and is specific only for the metazoan integrin domains. Separately, we searched for the metazoan integrin type β-propeller domains among all available sequences from bacteria and unicellular eukaryotic organisms, which must incorporate seven repeats, corresponding to the seven blades of the β-propeller domain, and so that the newly found structure-derived motif would exist in every repeat. As the result, among 47 available genomes of unicellular eukaryotes we could not find a single instance of seven repeats with the motif. Several sequences contained three repeats, a predicted transmembrane segment, and a short cytoplasmic motif associated with some integrins, but otherwise differ from the metazoan integrin α subunits. Among the available bacterial sequences, we found five examples containing seven sequential metazoan integrin-specific motifs within the seven repeats. The motifs differ in having one Ca(2+-binding site per repeat, whereas metazoan integrins have three or four sites. The bacterial sequences are more conserved in terms of motif conservation and loop length, suggesting that the structure is more regular and compact than those example structures from human integrins. Although the bacterial examples are not full-length integrins, the full-length metazoan-type 7-bladed β-propeller domains are present, and

  1. Comparative genomics of metabolic capacities of regulons controlled by cis-regulatory RNA motifs in bacteria.

    Science.gov (United States)

    Sun, Eric I; Leyn, Semen A; Kazanov, Marat D; Saier, Milton H; Novichkov, Pavel S; Rodionov, Dmitry A

    2013-09-02

    In silico comparative genomics approaches have been efficiently used for functional prediction and reconstruction of metabolic and regulatory networks. Riboswitches are metabolite-sensing structures often found in bacterial mRNA leaders controlling gene expression on transcriptional or translational levels.An increasing number of riboswitches and other cis-regulatory RNAs have been recently classified into numerous RNA families in the Rfam database. High conservation of these RNA motifs provides a unique advantage for their genomic identification and comparative analysis. A comparative genomics approach implemented in the RegPredict tool was used for reconstruction and functional annotation of regulons controlled by RNAs from 43 Rfam families in diverse taxonomic groups of Bacteria. The inferred regulons include ~5200 cis-regulatory RNAs and more than 12000 target genes in 255 microbial genomes. All predicted RNA-regulated genes were classified into specific and overall functional categories. Analysis of taxonomic distribution of these categories allowed us to establish major functional preferences for each analyzed cis-regulatory RNA motif family. Overall, most RNA motif regulons showed predictable functional content in accordance with their experimentally established effector ligands. Our results suggest that some RNA motifs (including thiamin pyrophosphate and cobalamin riboswitches that control the cofactor metabolism) are widespread and likely originated from the last common ancestor of all bacteria. However, many more analyzed RNA motifs are restricted to a narrow taxonomic group of bacteria and likely represent more recent evolutionary innovations. The reconstructed regulatory networks for major known RNA motifs substantially expand the existing knowledge of transcriptional regulation in bacteria. The inferred regulons can be used for genetic experiments, functional annotations of genes, metabolic reconstruction and evolutionary analysis. The obtained genome

  2. Motif decomposition of the phosphotyrosine proteome reveals a new N-terminal binding motif for SHIP2

    DEFF Research Database (Denmark)

    Miller, Martin Lee; Hanke, S.; Hinsby, A. M.

    2008-01-01

    set of 481 unique phosphotyrosine (Tyr(P)) peptides by sequence similarity to known ligands of the Src homology 2 (SH2) and the phosphotyrosine binding (PTB) domains. From 20 clusters we extracted 16 known and four new interaction motifs. Using quantitative mass spectrometry we pulled down Tyr......(P)-specific binding partners for peptides corresponding to the extracted motifs. We confirmed numerous previously known interaction motifs and found 15 new interactions mediated by phosphosites not previously known to bind SH2 or PTB. Remarkably, a novel hydrophobic N-terminal motif ((L/V/I)(L/V/I)pY) was identified...

  3. A CRM domain protein functions dually in group I and group II intron splicing in land plant chloroplasts.

    Science.gov (United States)

    Asakura, Yukari; Barkan, Alice

    2007-12-01

    The CRM domain is a recently recognized RNA binding domain found in three group II intron splicing factors in chloroplasts, in a bacterial protein that associates with ribosome precursors, and in a family of uncharacterized proteins in plants. To elucidate the functional repertoire of proteins with CRM domains, we studied CFM2 (for CRM Family Member 2), which harbors four CRM domains. RNA coimmunoprecipitation assays showed that CFM2 in maize (Zea mays) chloroplasts is associated with the group I intron in pre-trnL-UAA and group II introns in the ndhA and ycf3 pre-mRNAs. T-DNA insertions in the Arabidopsis thaliana ortholog condition a defective-seed phenotype (strong allele) or chlorophyll-deficient seedlings with impaired splicing of the trnL group I intron and the ndhA, ycf3-int1, and clpP-int2 group II introns (weak alleles). CFM2 and two previously described CRM proteins are bound simultaneously to the ndhA and ycf3-int1 introns and act in a nonredundant fashion to promote their splicing. With these findings, CRM domain proteins are implicated in the activities of three classes of catalytic RNA: group I introns, group II introns, and 23S rRNA.

  4. Structure of the central RNA recognition motif of human TIA-1 at 1.95 A resolution

    International Nuclear Information System (INIS)

    Kumar, Amit O.; Swenson, Matthew C.; Benning, Matthew M.; Kielkopf, Clara L.

    2008-01-01

    T-cell-restricted intracellular antigen-1 (TIA-1) regulates alternative pre-mRNA splicing in the nucleus, and mRNA translation in the cytoplasm, by recognizing uridine-rich sequences of RNAs. As a step towards understanding RNA recognition by this regulatory factor, the X-ray structure of the central RNA recognition motif (RRM2) of human TIA-1 is presented at 1.95 A resolution. Comparison with structurally homologous RRM-RNA complexes identifies residues at the RNA interfaces that are conserved in TIA-1-RRM2. The versatile capability of RNP motifs to interact with either proteins or RNA is reinforced by symmetry-related protein-protein interactions mediated by the RNP motifs of TIA-1-RRM2. Importantly, the TIA-1-RRM2 structure reveals the locations of mutations responsible for inhibiting nuclear import. In contrast with previous assumptions, the mutated residues are buried within the hydrophobic interior of the domain, where they would be likely to destabilize the RRM fold rather than directly inhibit RNA binding

  5. [Personal motif in art].

    Science.gov (United States)

    Gerevich, József

    2015-01-01

    One of the basic questions of the art psychology is whether a personal motif is to be found behind works of art and if so, how openly or indirectly it appears in the work itself. Analysis of examples and documents from the fine arts and literature allow us to conclude that the personal motif that can be identified by the viewer through symbols, at times easily at others with more difficulty, gives an emotional plus to the artistic product. The personal motif may be found in traumatic experiences, in communication to the model or with other emotionally important persons (mourning, disappointment, revenge, hatred, rivalry, revolt etc.), in self-searching, or self-analysis. The emotions are expressed in artistic activity either directly or indirectly. The intention nourished by the artist's identity (Kunstwollen) may stand in the way of spontaneous self-expression, channelling it into hidden paths. Under the influence of certain circumstances, the artist may arouse in the viewer, consciously or unconsciously, an illusionary, misleading image of himself. An examination of the personal motif is one of the important research areas of art therapy.

  6. Genome-wide classification and expression analysis of MYB transcription factor families in rice and Arabidopsis

    Science.gov (United States)

    2012-01-01

    Background The MYB gene family comprises one of the richest groups of transcription factors in plants. Plant MYB proteins are characterized by a highly conserved MYB DNA-binding domain. MYB proteins are classified into four major groups namely, 1R-MYB, 2R-MYB, 3R-MYB and 4R-MYB based on the number and position of MYB repeats. MYB transcription factors are involved in plant development, secondary metabolism, hormone signal transduction, disease resistance and abiotic stress tolerance. A comparative analysis of MYB family genes in rice and Arabidopsis will help reveal the evolution and function of MYB genes in plants. Results A genome-wide analysis identified at least 155 and 197 MYB genes in rice and Arabidopsis, respectively. Gene structure analysis revealed that MYB family genes possess relatively more number of introns in the middle as compared with C- and N-terminal regions of the predicted genes. Intronless MYB-genes are highly conserved both in rice and Arabidopsis. MYB genes encoding R2R3 repeat MYB proteins retained conserved gene structure with three exons and two introns, whereas genes encoding R1R2R3 repeat containing proteins consist of six exons and five introns. The splicing pattern is similar among R1R2R3 MYB genes in Arabidopsis. In contrast, variation in splicing pattern was observed among R1R2R3 MYB members of rice. Consensus motif analysis of 1kb upstream region (5′ to translation initiation codon) of MYB gene ORFs led to the identification of conserved and over-represented cis-motifs in both rice and Arabidopsis. Real-time quantitative RT-PCR analysis showed that several members of MYBs are up-regulated by various abiotic stresses both in rice and Arabidopsis. Conclusion A comprehensive genome-wide analysis of chromosomal distribution, tandem repeats and phylogenetic relationship of MYB family genes in rice and Arabidopsis suggested their evolution via duplication. Genome-wide comparative analysis of MYB genes and their expression analysis

  7. Temporal motifs in time-dependent networks

    International Nuclear Information System (INIS)

    Kovanen, Lauri; Karsai, Márton; Kaski, Kimmo; Kertész, János; Saramäki, Jari

    2011-01-01

    Temporal networks are commonly used to represent systems where connections between elements are active only for restricted periods of time, such as telecommunication, neural signal processing, biochemical reaction and human social interaction networks. We introduce the framework of temporal motifs to study the mesoscale topological–temporal structure of temporal networks in which the events of nodes do not overlap in time. Temporal motifs are classes of similar event sequences, where the similarity refers not only to topology but also to the temporal order of the events. We provide a mapping from event sequences to coloured directed graphs that enables an efficient algorithm for identifying temporal motifs. We discuss some aspects of temporal motifs, including causality and null models, and present basic statistics of temporal motifs in a large mobile call network

  8. Motif enrichment tool.

    Science.gov (United States)

    Blatti, Charles; Sinha, Saurabh

    2014-07-01

    The Motif Enrichment Tool (MET) provides an online interface that enables users to find major transcriptional regulators of their gene sets of interest. MET searches the appropriate regulatory region around each gene and identifies which transcription factor DNA-binding specificities (motifs) are statistically overrepresented. Motif enrichment analysis is currently available for many metazoan species including human, mouse, fruit fly, planaria and flowering plants. MET also leverages high-throughput experimental data such as ChIP-seq and DNase-seq from ENCODE and ModENCODE to identify the regulatory targets of a transcription factor with greater precision. The results from MET are produced in real time and are linked to a genome browser for easy follow-up analysis. Use of the web tool is free and open to all, and there is no login requirement. ADDRESS: http://veda.cs.uiuc.edu/MET/. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  9. Using Group II Introns for Attenuating the In Vitro and In Vivo Expression of a Homing Endonuclease.

    Directory of Open Access Journals (Sweden)

    Tuhin Kumar Guha

    Full Text Available In Chaetomium thermophilum (DSM 1495 within the mitochondrial DNA (mtDNA small ribosomal subunit (rns gene a group IIA1 intron interrupts an open reading frame (ORF encoded within a group I intron (mS1247. This arrangement offers the opportunity to examine if the nested group II intron could be utilized as a regulatory element for the expression of the homing endonuclease (HEase. Constructs were generated where the codon-optimized ORF was interrupted with either the native group IIA1 intron or a group IIB type intron. This study showed that the expression of the HEase (in vivo in Escherichia coli can be regulated by manipulating the splicing efficiency of the HEase ORF-embedded group II introns. Exogenous magnesium chloride (MgCl2 stimulated the expression of a functional HEase but the addition of cobalt chloride (CoCl2 to growth media antagonized the expression of HEase activity. Ultimately the ability to attenuate HEase activity might be useful in precision genome engineering, minimizing off target activities, or where pathways have to be altered during a specific growth phase.

  10. Role of an Absolutely Conserved Tryptophan Pair in the Extracellular Domain of Cys-Loop Receptors

    DEFF Research Database (Denmark)

    Braun, Nina; Lynagh, Timothy; Yu, Rilei

    2016-01-01

    Cys-loop receptors mediate fast synaptic transmission in the nervous system, and their dysfunction is associated with a number of diseases. While some sequence variability is essential to ensure specific recognition of a chemically diverse set of ligands, other parts of the underlying amino acid...... sequences show a high degree of conservation, possibly to preserve the overall structural fold across the protein family. In this study, we focus on the only two absolutely conserved residues across the Cys-loop receptor family, two Trp side chains in the WXD motif of Loop D and in the WXPD motif of Loop A...

  11. The Q Motif Is Involved in DNA Binding but Not ATP Binding in ChlR1 Helicase.

    Directory of Open Access Journals (Sweden)

    Hao Ding

    Full Text Available Helicases are molecular motors that couple the energy of ATP hydrolysis to the unwinding of structured DNA or RNA and chromatin remodeling. The conversion of energy derived from ATP hydrolysis into unwinding and remodeling is coordinated by seven sequence motifs (I, Ia, II, III, IV, V, and VI. The Q motif, consisting of nine amino acids (GFXXPXPIQ with an invariant glutamine (Q residue, has been identified in some, but not all helicases. Compared to the seven well-recognized conserved helicase motifs, the role of the Q motif is less acknowledged. Mutations in the human ChlR1 (DDX11 gene are associated with a unique genetic disorder known as Warsaw Breakage Syndrome, which is characterized by cellular defects in genome maintenance. To examine the roles of the Q motif in ChlR1 helicase, we performed site directed mutagenesis of glutamine to alanine at residue 23 in the Q motif of ChlR1. ChlR1 recombinant protein was overexpressed and purified from HEK293T cells. ChlR1-Q23A mutant abolished the helicase activity of ChlR1 and displayed reduced DNA binding ability. The mutant showed impaired ATPase activity but normal ATP binding. A thermal shift assay revealed that ChlR1-Q23A has a melting point value similar to ChlR1-WT. Partial proteolysis mapping demonstrated that ChlR1-WT and Q23A have a similar globular structure, although some subtle conformational differences in these two proteins are evident. Finally, we found ChlR1 exists and functions as a monomer in solution, which is different from FANCJ, in which the Q motif is involved in protein dimerization. Taken together, our results suggest that the Q motif is involved in DNA binding but not ATP binding in ChlR1 helicase.

  12. Identification, duplication, evolution and expression analyses of caleosins in Brassica plants and Arabidopsis subspecies.

    Science.gov (United States)

    Shen, Yue; Liu, Mingzhe; Wang, Lili; Li, Zhuowei; Taylor, David C; Li, Zhixi; Zhang, Meng

    2016-04-01

    Caleosins are a class of Ca(2+) binding proteins that appear to be ubiquitous in plants. Some of the main proteins embedded in the lipid monolayer of lipid droplets, caleosins, play critical roles in the degradation of storage lipids during germination and in lipid trafficking. Some of them have been shown to have histidine-dependent peroxygenase activity, which is believed to participate in stress responses in Arabidopsis. In the model plant Arabidopsis thaliana, caleosins have been examined extensively. However, little is known on a genome-wide scale about these proteins in other members of the Brassicaceae. In this study, 51 caleosins in Brassica plants and Arabidopsis lyrata were investigated and analyzed in silico. Among them, 31 caleosins, including 7 in A. lyrata, 11 in Brassica oleracea and 13 in Brassica napus, are herein identified for the first time. Segmental duplication was the main form of gene expansion. Alignment, motif and phylogenetic analyses showed that Brassica caleosins belong to either the H-family or the L-family with different motif structures and physicochemical properties. Our findings strongly suggest that L-caleosins are evolved from H-caleosins. Predicted phosphorylation sites were differentially conserved in H-caleosin and L-caleosins, respectively. 'RY-repeat' elements and phytohormone-related cis-elements were identified in different caleosins, which suggest diverse physiological functions. Gene structure analysis indicated that most caleosins (38 out of 44) contained six exons and five introns and their intron phases were highly conserved. Structurally integrated caleosins, such as BrCLO3-3 and BrCLO4-2, showed high expression levels and may have important roles. Some caleosins, such as BrCLO2 and BoCLO8-2, lost motifs of the calcium binding domain, proline knot, potential phosphorylation sites and haem-binding sites. Combined with their low expression, it is suggested that these caleosins may have lost function.

  13. Engineering an efficient and tight D-amino acid-inducible gene expression system in Rhodosporidium/Rhodotorula species.

    Science.gov (United States)

    Liu, Yanbin; Koh, Chong Mei John; Ngoh, Si Te; Ji, Lianghui

    2015-10-26

    Rhodosporidium and Rhodotorula are two genera of oleaginous red yeast with great potential for industrial biotechnology. To date, there is no effective method for inducible expression of proteins and RNAs in these hosts. We have developed a luciferase gene reporter assay based on a new codon-optimized LUC2 reporter gene (RtLUC2), which is flanked with CAR2 homology arms and can be integrated into the CAR2 locus in the nuclear genome at >90 % efficiency. We characterized the upstream DNA sequence of a D-amino acid oxidase gene (DAO1) from R. toruloides ATCC 10657 by nested deletions. By comparing the upstream DNA sequences of several putative DAO1 homologs of Basidiomycetous fungi, we identified a conserved DNA motif with a consensus sequence of AGGXXGXAGX11GAXGAXGG within a 0.2 kb region from the mRNA translation initiation site. Deletion of this motif led to strong mRNA transcription under non-inducing conditions. Interestingly, DAO1 promoter activity was enhanced about fivefold when the 108 bp intron 1 was included in the reporter construct. We identified a conserved CT-rich motif in the intron with a consensus sequence of TYTCCCYCTCCYCCCCACWYCCGA, deletion or point mutations of which drastically reduced promoter strength under both inducing and non-inducing conditions. Additionally, we created a selection marker-free DAO1-null mutant (∆dao1e) which displayed greatly improved inducible gene expression, particularly when both glucose and nitrogen were present in high levels. To avoid adding unwanted peptide to proteins to be expressed, we converted the original translation initiation codon to ATC and re-created a translation initiation codon at the start of exon 2. This promoter, named P DAO1-in1m1 , showed very similar luciferase activity to the wild-type promoter upon induction with D-alanine. The inducible system was tunable by adjusting the levels of inducers, carbon source and nitrogen source. The intron 1-containing DAO1 promoters coupled with a DAO1 null

  14. UKIRAN KERAWANG ACEH GAYO SEBAGAI INSPIRASI PENCIPTAAN MOTIF BATIK KHAS GAYO

    Directory of Open Access Journals (Sweden)

    Irfa ina Rohana Salma

    2016-12-01

    Full Text Available ABSTRAK Industri batik mulai berkembang di Gayo, tetapi belum memiliki motif batik khas daerah. Oleh karena itu perlu diciptakan motif batik khas Gayo, dengan mengambil inspirasi dari ukiran yang terdapat pada rumah tradisional yang biasa disebut ukiran kerawang Gayo. Tujuan penciptaan seni ini adalah untuk menciptakan motif batik yang memiliki ciri khas Gayo. Metode yang digunakan yaitu eksplorasi ide, perancangan, dan perwujudan menjadi motif batik. Dalam kegiatan ini telah diciptakan enam motif batik khas Gayo yaitu: (1 Motif Ceplok Gayo; (2 Motif Gayo Tegak; (3 Motif Gayo Lurus; (4 Motif Parang Gayo; (5 Motif Gayo Lembut; dan (6 Motif Geometris Gayo. Hasil uji kesukaan terhadap motif kepada lima puluh responden menunjukkan bahwa Motif Ceplok Gayo paling banyak dipilih oleh responden yaitu sebesar 19%, sedangkan Motif Parang Gayo 18%, Motif Gayo Lembut 17%, Motif Geometris Gayo 17%, Motif Gayo Lurus 15% dan Motif Gayo Tegak 14%. Rata-rata motif yang dihasilkan mendapatkan apresiasi yang baik dari responden, sehingga semua motif layak diproduksi sebagai batik khas Gayo.Kata kunci: batik Gayo, Motif Ceplok Gayo, Motif Parang Gayo.ABSTRACTBatik industry began to develop in Gayo, but have not had a typical batik motif itself. Therefore, it is necessary to create batik motifs of Gayo, by taking inspiration from the carvings found in traditional houses commonly called kerawang Gayo. The purpose of this art is to create motifs those have a Gayo characteristic. The method used are the idea exploration, design, and motifs embodiment. In this activity has created six Gayo batik motifs, namely: (1 Motif Ceplok Gayo; (2 Motif Gayo Tegak; (3 Motif GayoLurus; (4 Motif Parang Gayo; (5 Motif Gayo Lembut; dan (6 Motif Geometris Gayo. The test results fondness of the motives to fifty respondents indicated that the Motif Ceplok Gayo most preferred by respondents ie 19%, while Motif Parang Gayo 18%, Motif Gayo Lembut 17%, Motif Geometris Gayo 17%, Motif Gayo

  15. Evidence against the energetic cost hypothesis for the short introns in highly expressed genes

    Directory of Open Access Journals (Sweden)

    Niu Deng-Ke

    2008-05-01

    Full Text Available Abstract Background In animals, the moss Physcomitrella patens and the pollen of Arabidopsis thaliana, highly expressed genes have shorter introns than weakly expressed genes. A popular explanation for this is selection for transcription efficiency, which includes two sub-hypotheses: to minimize the energetic cost or to minimize the time cost. Results In an individual human, different organs may differ up to hundreds of times in cell number (for example, a liver versus a hypothalamus. Considered at the individual level, a gene specifically expressed in a large organ is actually transcribed tens or hundreds of times more than a gene with a similar expression level (a measure of mRNA abundance per cell specifically expressed in a small organ. According to the energetic cost hypothesis, the former should have shorter introns than the latter. However, in humans and mice we have not found significant differences in intron length between large-tissue/organ-specific genes and small-tissue/organ-specific genes with similar expression levels. Qualitative estimation shows that the deleterious effect (that is, the energetic burden of long introns in highly expressed genes is too negligible to be efficiently selected against in mammals. Conclusion The short introns in highly expressed genes should not be attributed to energy constraint. We evaluated evidence for the time cost hypothesis and other alternatives.

  16. Identification of early zygotic genes in the yellow fever mosquito Aedes aegypti and discovery of a motif involved in early zygotic genome activation.

    Science.gov (United States)

    Biedler, James K; Hu, Wanqi; Tae, Hongseok; Tu, Zhijian

    2012-01-01

    During early embryogenesis the zygotic genome is transcriptionally silent and all mRNAs present are of maternal origin. The maternal-zygotic transition marks the time over which embryogenesis changes its dependence from maternal RNAs to zygotically transcribed RNAs. Here we present the first systematic investigation of early zygotic genes (EZGs) in a mosquito species and focus on genes involved in the onset of transcription during 2-4 hr. We used transcriptome sequencing to identify the "pure" (without maternal expression) EZGs by analyzing transcripts from four embryonic time ranges of 0-2, 2-4, 4-8, and 8-12 hr, which includes the time of cellular blastoderm formation and up to the start of gastrulation. Blast of 16,789 annotated transcripts vs. the transcriptome reads revealed evidence for 63 (P<0.001) and 143 (P<0.05) nonmaternally derived transcripts having a significant increase in expression at 2-4 hr. One third of the 63 EZG transcripts do not have predicted introns compared to 10% of all Ae. aegypti genes. We have confirmed by RT-PCR that zygotic transcription starts as early as 2-3 hours. A degenerate motif VBRGGTA was found to be overrepresented in the upstream sequences of the identified EZGs using a motif identification software called SCOPE. We find evidence for homology between this motif and the TAGteam motif found in Drosophila that has been implicated in EZG activation. A 38 bp sequence in the proximal upstream sequence of a kinesin light chain EZG (KLC2.1) contains two copies of the mosquito motif. This sequence was shown to support EZG transcription by luciferase reporter assays performed on injected early embryos, and confers early zygotic activity to a heterologous promoter from a divergent mosquito species. The results of these studies are consistent with the model of early zygotic genome activation via transcriptional activators, similar to what has been found recently in Drosophila.

  17. Identification of early zygotic genes in the yellow fever mosquito Aedes aegypti and discovery of a motif involved in early zygotic genome activation.

    Directory of Open Access Journals (Sweden)

    James K Biedler

    Full Text Available During early embryogenesis the zygotic genome is transcriptionally silent and all mRNAs present are of maternal origin. The maternal-zygotic transition marks the time over which embryogenesis changes its dependence from maternal RNAs to zygotically transcribed RNAs. Here we present the first systematic investigation of early zygotic genes (EZGs in a mosquito species and focus on genes involved in the onset of transcription during 2-4 hr. We used transcriptome sequencing to identify the "pure" (without maternal expression EZGs by analyzing transcripts from four embryonic time ranges of 0-2, 2-4, 4-8, and 8-12 hr, which includes the time of cellular blastoderm formation and up to the start of gastrulation. Blast of 16,789 annotated transcripts vs. the transcriptome reads revealed evidence for 63 (P<0.001 and 143 (P<0.05 nonmaternally derived transcripts having a significant increase in expression at 2-4 hr. One third of the 63 EZG transcripts do not have predicted introns compared to 10% of all Ae. aegypti genes. We have confirmed by RT-PCR that zygotic transcription starts as early as 2-3 hours. A degenerate motif VBRGGTA was found to be overrepresented in the upstream sequences of the identified EZGs using a motif identification software called SCOPE. We find evidence for homology between this motif and the TAGteam motif found in Drosophila that has been implicated in EZG activation. A 38 bp sequence in the proximal upstream sequence of a kinesin light chain EZG (KLC2.1 contains two copies of the mosquito motif. This sequence was shown to support EZG transcription by luciferase reporter assays performed on injected early embryos, and confers early zygotic activity to a heterologous promoter from a divergent mosquito species. The results of these studies are consistent with the model of early zygotic genome activation via transcriptional activators, similar to what has been found recently in Drosophila.

  18. AMP-acetyl CoA synthetase from Leishmania donovani: identification and functional analysis of 'PX4GK' motif.

    Science.gov (United States)

    Soumya, Neelagiri; Kumar, I Sravan; Shivaprasad, S; Gorakh, Landage Nitin; Dinesh, Neeradi; Swamy, Kayala Kambagiri; Singh, Sushma

    2015-04-01

    An adenosine monophosphate forming acetyl CoA synthetase (AceCS) which is the key enzyme involved in the conversion of acetate to acetyl CoA has been identified from Leishmania donovani for the first time. Sequence analysis of L. donovani AceCS (LdAceCS) revealed the presence of a 'PX4GK' motif which is highly conserved throughout organisms with higher sequence identity (96%) to lower sequence identity (38%). A ∼ 77 kDa heterologous protein with C-terminal 6X His-tag was expressed in Escherichia coli. Expression of LdAceCS in promastigotes was confirmed by western blot and RT-PCR analysis. Immunolocalization studies revealed that it is a cytosolic protein. We also report the kinetic characterization of recombinant LdAceCS with acetate, adenosine 5'-triphosphate, coenzyme A and propionate as substrates. Site directed mutagenesis of residues in conserved PX4GK motif of LdAceCS was performed to gain insight into its potential role in substrate binding, catalysis and its role in maintaining structural integrity of the protein. P646A, G651A and K652R exhibited more than 90% loss in activity signifying its indispensible role in the enzyme activity. Substitution of other residues in this motif resulted in altered substrate specificity and catalysis. However, none of them had any role in modulation of the secondary structure of the protein except G651A mutant. Copyright © 2015 Elsevier B.V. All rights reserved.

  19. Insights into the Activity and Substrate Binding of Xylella fastidiosa Polygalacturonase by Modification of a Unique QMK Amino Acid Motif Using Protein Chimeras.

    Science.gov (United States)

    Warren, Jeremy G; Lincoln, James E; Kirkpatrick, Bruce C

    2015-01-01

    Polygalacturonases (EC 3.2.1.15) catalyze the random hydrolysis of 1, 4-alpha-D-galactosiduronic linkages in pectate and other galacturonans. Xylella fastidiosa possesses a single polygalacturonase gene, pglA (PD1485), and X. fastidiosa mutants deficient in the production of polygalacturonase are non-pathogenic and show a compromised ability to systemically infect grapevines. These results suggested that grapevines expressing sufficient amounts of an inhibitor of X. fastidiosa polygalacturonase might be protected from disease. Previous work in our laboratory and others have tried without success to produce soluble active X. fastidiosa polygalacturonase for use in inhibition assays. In this study, we created two enzymatically active X. fastidiosa / A. vitis polygalacturonase chimeras, AX1A and AX2A to explore the functionality of X. fastidiosa polygalacturonase in vitro. The AX1A chimera was constructed to specifically test if recombinant chimeric protein, produced in Escherichia coli, is soluble and if the X. fastidiosa polygalacturonase catalytic amino acids are able to hydrolyze polygalacturonic acid. The AX2A chimera was constructed to evaluate the ability of a unique QMK motif of X. fastidiosa polygalacturonase, most polygalacturonases have a R(I/L)K motif, to bind to and allow the hydrolysis of polygalacturonic acid. Furthermore, the AX2A chimera was also used to explore what effect modification of the QMK motif of X. fastidiosa polygalacturonase to a conserved RIK motif has on enzymatic activity. These experiments showed that both the AX1A and AX2A polygalacturonase chimeras were soluble and able to hydrolyze the polygalacturonic acid substrate. Additionally, the modification of the QMK motif to the conserved RIK motif eliminated hydrolytic activity, suggesting that the QMK motif is important for the activity of X. fastidiosa polygalacturonase. This result suggests X. fastidiosa polygalacturonase may preferentially hydrolyze a different pectic substrate or

  20. The RXL motif of the African cassava mosaic virus Rep protein is necessary for rereplication of yeast DNA and viral infection in plants

    Energy Technology Data Exchange (ETDEWEB)

    Hipp, Katharina; Rau, Peter; Schäfer, Benjamin [Institut für Biomaterialien und biomolekulare Systeme, Abteilung für Molekularbiologie und Virologie der Pflanzen, Universität Stuttgart, Pfaffenwaldring 57, D-70550 Stuttgart (Germany); Gronenborn, Bruno [Institut des Sciences du Végétal, CNRS, 91198 Gif-sur-Yvette (France); Jeske, Holger, E-mail: holger.jeske@bio.uni-stuttgart.de [Institut für Biomaterialien und biomolekulare Systeme, Abteilung für Molekularbiologie und Virologie der Pflanzen, Universität Stuttgart, Pfaffenwaldring 57, D-70550 Stuttgart (Germany)

    2014-08-15

    Geminiviruses, single-stranded DNA plant viruses, encode a replication-initiator protein (Rep) that is indispensable for virus replication. A potential cyclin interaction motif (RXL) in the sequence of African cassava mosaic virus Rep may be an alternative link to cell cycle controls to the known interaction with plant homologs of retinoblastoma protein (pRBR). Mutation of this motif abrogated rereplication in fission yeast induced by expression of wildtype Rep suggesting that Rep interacts via its RXL motif with one or several yeast proteins. The RXL motif is essential for viral infection of Nicotiana benthamiana plants, since mutation of this motif in infectious clones prevented any symptomatic infection. The cell-cycle link (Clink) protein of a nanovirus (faba bean necrotic yellows virus) was investigated that activates the cell cycle by binding via its LXCXE motif to pRBR. Expression of wildtype Clink and a Clink mutant deficient in pRBR-binding did not trigger rereplication in fission yeast. - Highlights: • A potential cyclin interaction motif is conserved in geminivirus Rep proteins. • In ACMV Rep, this motif (RXL) is essential for rereplication of fission yeast DNA. • Mutating RXL abrogated viral infection completely in Nicotiana benthamiana. • Expression of a nanovirus Clink protein in yeast did not induce rereplication. • Plant viruses may have evolved multiple routes to exploit host DNA synthesis.

  1. The RXL motif of the African cassava mosaic virus Rep protein is necessary for rereplication of yeast DNA and viral infection in plants

    International Nuclear Information System (INIS)

    Hipp, Katharina; Rau, Peter; Schäfer, Benjamin; Gronenborn, Bruno; Jeske, Holger

    2014-01-01

    Geminiviruses, single-stranded DNA plant viruses, encode a replication-initiator protein (Rep) that is indispensable for virus replication. A potential cyclin interaction motif (RXL) in the sequence of African cassava mosaic virus Rep may be an alternative link to cell cycle controls to the known interaction with plant homologs of retinoblastoma protein (pRBR). Mutation of this motif abrogated rereplication in fission yeast induced by expression of wildtype Rep suggesting that Rep interacts via its RXL motif with one or several yeast proteins. The RXL motif is essential for viral infection of Nicotiana benthamiana plants, since mutation of this motif in infectious clones prevented any symptomatic infection. The cell-cycle link (Clink) protein of a nanovirus (faba bean necrotic yellows virus) was investigated that activates the cell cycle by binding via its LXCXE motif to pRBR. Expression of wildtype Clink and a Clink mutant deficient in pRBR-binding did not trigger rereplication in fission yeast. - Highlights: • A potential cyclin interaction motif is conserved in geminivirus Rep proteins. • In ACMV Rep, this motif (RXL) is essential for rereplication of fission yeast DNA. • Mutating RXL abrogated viral infection completely in Nicotiana benthamiana. • Expression of a nanovirus Clink protein in yeast did not induce rereplication. • Plant viruses may have evolved multiple routes to exploit host DNA synthesis

  2. The Mitochondrial Genome of the Prasinophyte Prasinoderma coloniale Reveals Two Trans-Spliced Group I Introns in the Large Subunit rRNA Gene

    Science.gov (United States)

    Pombert, Jean-François; Otis, Christian; Turmel, Monique; Lemieux, Claude

    2013-01-01

    Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI), we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V). This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl) at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI). Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF) occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the possible implications

  3. The mitochondrial genome of the prasinophyte Prasinoderma coloniale reveals two trans-spliced group I introns in the large subunit rRNA gene.

    Directory of Open Access Journals (Sweden)

    Jean-François Pombert

    Full Text Available Organelle genes are often interrupted by group I and or group II introns. Splicing of these mobile genetic occurs at the RNA level via serial transesterification steps catalyzed by the introns'own tertiary structures and, sometimes, with the help of external factors. These catalytic ribozymes can be found in cis or trans configuration, and although trans-arrayed group II introns have been known for decades, trans-spliced group I introns have been reported only recently. In the course of sequencing the complete mitochondrial genome of the prasinophyte picoplanktonic green alga Prasinoderma coloniale CCMP 1220 (Prasinococcales, clade VI, we uncovered two additional cases of trans-spliced group I introns. Here, we describe these introns and compare the 54,546 bp-long mitochondrial genome of Prasinoderma with those of four other prasinophytes (clades II, III and V. This comparison underscores the highly variable mitochondrial genome architecture in these ancient chlorophyte lineages. Both Prasinoderma trans-spliced introns reside within the large subunit rRNA gene (rnl at positions where cis-spliced relatives, often containing homing endonuclease genes, have been found in other organelles. In contrast, all previously reported trans-spliced group I introns occur in different mitochondrial genes (rns or coxI. Each Prasinoderma intron is fragmented into two pieces, forming at the RNA level a secondary structure that resembles those of its cis-spliced counterparts. As observed for other trans-spliced group I introns, the breakpoint of the first intron maps to the variable loop L8, whereas that of the second is uniquely located downstream of P9.1. The breakpoint In each Prasinoderma intron corresponds to the same region where the open reading frame (ORF occurs when present in cis-spliced orthologs. This correlation between the intron breakpoint and the ORF location in cis-spliced orthologs also holds for other trans-spliced introns; we discuss the

  4. MHC motif viewer

    DEFF Research Database (Denmark)

    Rapin, Nicolas Philippe Jean-Pierre; Hoof, Ilka; Lund, Ole

    2008-01-01

    . Algorithms that predict which peptides MHC molecules bind have recently been developed and cover many different alleles, but the utility of these algorithms is hampered by the lack of tools for browsing and comparing the specificity of these molecules. We have, therefore, developed a web server, MHC motif....... A special viewing feature, MHC fight, allows for display of the specificity of two different MHC molecules side by side. We show how the web server can be used to discover and display surprising similarities as well as differences between MHC molecules within and between different species. The MHC motif...

  5. Polymorphism in Mitochondrial Group I Introns among Cryptococcus neoformans and Cryptococcus gattii Genotypes and Its Association with Drug Susceptibility

    Directory of Open Access Journals (Sweden)

    Felipe E. E. S. Gomes

    2018-02-01

    Full Text Available Cryptococcosis, one of the most important systemic mycosis in the world, is caused by different genotypes of Cryptococcus neoformans and Cryptococcus gattii, which differ in their ecology, epidemiology, and antifungal susceptibility. Therefore, the search for new molecular markers for genotyping, pathogenicity and drug susceptibility is necessary. Group I introns fulfill the requisites for such task because (i they are polymorphic sequences; (ii their self-splicing is inhibited by some drugs; and (iii their correct splicing under parasitic conditions is indispensable for pathogen survival. Here, we investigated the presence of group I introns in the mitochondrial LSU rRNA gene in 77 Cryptococcus isolates and its possible relation to drug susceptibility. Sequencing revealed two new introns in the LSU rRNA gene. All the introns showed high sequence similarity to other mitochondrial introns from distinct fungi, supporting the hypothesis of an ancient non-allelic invasion. Intron presence was statistically associated with those genotypes reported to be less pathogenic (p < 0.001. Further virulence assays are needed to confirm this finding. In addition, in vitro antifungal tests indicated that the presence of LSU rRNA introns may influence the minimum inhibitory concentration (MIC of amphotericin B and 5-fluorocytosine. These findings point to group I introns in the mitochondrial genome of Cryptococcus as potential molecular markers for antifungal resistance, as well as therapeutic targets.

  6. Genome-wide identification and analysis of basic helix-loop-helix domains in dog, Canis lupus familiaris.

    Science.gov (United States)

    Wang, Xu-Hua; Wang, Yong; Liu, A-Ke; Liu, Xiao-Ting; Zhou, Yang; Yao, Qin; Chen, Ke-Ping

    2015-04-01

    The basic helix-loop-helix (bHLH) domain is a highly conserved amino acid motif that defines a group of DNA-binding transcription factors. bHLH proteins play essential regulatory roles in a variety of biological processes in animal, plant, and fungus. The domestic dog, Canis lupus familiaris, is a good model organism for genetic, physiological, and behavioral studies. In this study, we identified 115 putative bHLH genes in the dog genome. Based on a phylogenetic analysis, 51, 26, 14, 4, 12, and 4 dog bHLH genes were assigned to six separate groups (A-F); four bHLH genes were categorized as ''orphans''. Within-group evolutionary relationships inferred from the phylogenetic analysis were consistent with positional conservation, other conserved domains flanking the bHLH motif, and highly conserved intron/exon patterns in other vertebrates. Our analytical results confirmed the GenBank annotations of 89 dog bHLH proteins and provided information that could be used to update the annotations of the remaining 26 dog bHLH proteins. These data will provide good references for further studies on the structures and regulatory functions of bHLH proteins in the growth and development of dogs, which may help in understanding the mechanisms that underlie the physical and behavioral differences between dogs and wolves.

  7. Motif discovery in ranked lists of sequences

    DEFF Research Database (Denmark)

    Nielsen, Morten Muhlig; Tataru, Paula; Madsen, Tobias

    2016-01-01

    Motif analysis has long been an important method to characterize biological functionality and the current growth of sequencing-based genomics experiments further extends its potential. These diverse experiments often generate sequence lists ranked by some functional property. There is therefore...... advantage of the regular expression feature, including enrichments for combinations of different microRNA seed sites. The method is implemented and made publicly available as an R package and supports high parallelization on multi-core machinery....... a growing need for motif analysis methods that can exploit this coupled data structure and be tailored for specific biological questions. Here, we present an exploratory motif analysis tool, Regmex (REGular expression Motif EXplorer), which offers several methods to evaluate the correlation of motifs...

  8. Solution structure of a DNA mimicking motif of an RNA aptamer against transcription factor AML1 Runt domain.

    Science.gov (United States)

    Nomura, Yusuke; Tanaka, Yoichiro; Fukunaga, Jun-ichi; Fujiwara, Kazuya; Chiba, Manabu; Iibuchi, Hiroaki; Tanaka, Taku; Nakamura, Yoshikazu; Kawai, Gota; Kozu, Tomoko; Sakamoto, Taiichi

    2013-12-01

    AML1/RUNX1 is an essential transcription factor involved in the differentiation of hematopoietic cells. AML1 binds to the Runt-binding double-stranded DNA element (RDE) of target genes through its N-terminal Runt domain. In a previous study, we obtained RNA aptamers against the AML1 Runt domain by systematic evolution of ligands by exponential enrichment and revealed that RNA aptamers exhibit higher affinity for the Runt domain than that for RDE and possess the 5'-GCGMGNN-3' and 5'-N'N'CCAC-3' conserved motif (M: A or C; N and N' form Watson-Crick base pairs) that is important for Runt domain binding. In this study, to understand the structural basis of recognition of the Runt domain by the aptamer motif, the solution structure of a 22-mer RNA was determined using nuclear magnetic resonance. The motif contains the AH(+)-C mismatch and base triple and adopts an unusual backbone structure. Structural analysis of the aptamer motif indicated that the aptamer binds to the Runt domain by mimicking the RDE sequence and structure. Our data should enhance the understanding of the structural basis of DNA mimicry by RNA molecules.

  9. Comparison of loline alkaloid gene clusters across fungal endophytes: predicting the co-regulatory sequence motifs and the evolutionary history.

    Science.gov (United States)

    Kutil, Brandi L; Greenwald, Charles; Liu, Gang; Spiering, Martin J; Schardl, Christopher L; Wilkinson, Heather H

    2007-10-01

    LOL, a fungal secondary metabolite gene cluster found in Epichloë and Neotyphodium species, is responsible for production of insecticidal loline alkaloids. To analyze the genetic architecture and to predict the evolutionary history of LOL, we compared five clusters from four fungal species (single clusters from Epichloë festucae, Neotyphodium sp. PauTG-1, Neotyphodium coenophialum, and two clusters we previously characterized in Neotyphodium uncinatum). Using PhyloCon to compare putative lol gene promoter regions, we have identified four motifs conserved across the lol genes in all five clusters. Each motif has significant similarity to known fungal transcription factor binding sites in the TRANSFAC database. Conservation of these motifs is further support for the hypothesis that the lol genes are co-regulated. Interestingly, the history of asexual Neotyphodium spp. includes multiple interspecific hybridization events. Comparing clusters from three Neotyphodium species and E. festucae allowed us to determine which Epichloë ancestors are the most likely contributors of LOL in these asexual species. For example, while no present day Epichloë typhina isolates are known to produce lolines, our data support the hypothesis that the E. typhina ancestor(s) of three asexual endophyte species contained a LOL gene cluster. Thus, these data support a model of evolution in which the polymorphism in loline alkaloid production phenotypes among endophyte species is likely due to the loss of the trait over time.

  10. Deciphering functional glycosaminoglycan motifs in development.

    Science.gov (United States)

    Townley, Robert A; Bülow, Hannes E

    2018-03-23

    Glycosaminoglycans (GAGs) such as heparan sulfate, chondroitin/dermatan sulfate, and keratan sulfate are linear glycans, which when attached to protein backbones form proteoglycans. GAGs are essential components of the extracellular space in metazoans. Extensive modifications of the glycans such as sulfation, deacetylation and epimerization create structural GAG motifs. These motifs regulate protein-protein interactions and are thereby repsonsible for many of the essential functions of GAGs. This review focusses on recent genetic approaches to characterize GAG motifs and their function in defined signaling pathways during development. We discuss a coding approach for GAGs that would enable computational analyses of GAG sequences such as alignments and the computation of position weight matrices to describe GAG motifs. Copyright © 2018 Elsevier Ltd. All rights reserved.

  11. Fitness for synchronization of network motifs

    DEFF Research Database (Denmark)

    Vega, Y.M.; Vázquez-Prada, M.; Pacheco, A.F.

    2004-01-01

    We study the synchronization of Kuramoto's oscillators in small parts of networks known as motifs. We first report on the system dynamics for the case of a scale-free network and show the existence of a non-trivial critical point. We compute the probability that network motifs synchronize, and fi...... that the fitness for synchronization correlates well with motifs interconnectedness and structural complexity. Possible implications for present debates about network evolution in biological and other systems are discussed....

  12. Comparative genome analysis of PHB gene family reveals deep evolutionary origins and diverse gene function.

    Science.gov (United States)

    Di, Chao; Xu, Wenying; Su, Zhen; Yuan, Joshua S

    2010-10-07

    PHB (Prohibitin) gene family is involved in a variety of functions important for different biological processes. PHB genes are ubiquitously present in divergent species from prokaryotes to eukaryotes. Human PHB genes have been found to be associated with various diseases. Recent studies by our group and others have shown diverse function of PHB genes in plants for development, senescence, defence, and others. Despite the importance of the PHB gene family, no comprehensive gene family analysis has been carried to evaluate the relatedness of PHB genes across different species. In order to better guide the gene function analysis and understand the evolution of the PHB gene family, we therefore carried out the comparative genome analysis of the PHB genes across different kingdoms. The relatedness, motif distribution, and intron/exon distribution all indicated that PHB genes is a relatively conserved gene family. The PHB genes can be classified into 5 classes and each class have a very deep evolutionary origin. The PHB genes within the class maintained the same motif patterns during the evolution. With Arabidopsis as the model species, we found that PHB gene intron/exon structure and domains are also conserved during the evolution. Despite being a conserved gene family, various gene duplication events led to the expansion of the PHB genes. Both segmental and tandem gene duplication were involved in Arabidopsis PHB gene family expansion. However, segmental duplication is predominant in Arabidopsis. Moreover, most of the duplicated genes experienced neofunctionalization. The results highlighted that PHB genes might be involved in important functions so that the duplicated genes are under the evolutionary pressure to derive new function. PHB gene family is a conserved gene family and accounts for diverse but important biological functions based on the similar molecular mechanisms. The highly diverse biological function indicated that more research needs to be carried out

  13. Identification of a highly conserved valine-glycine-phenylalanine amino acid triplet required for HIV-1 Nef function

    Directory of Open Access Journals (Sweden)

    Meuwissen Pieter J

    2012-04-01

    Full Text Available Abstract Background The Nef protein of HIV facilitates virus replication and disease progression in infected patients. This role as pathogenesis factor depends on several genetically separable Nef functions that are mediated by interactions of highly conserved protein-protein interaction motifs with different host cell proteins. By studying the functionality of a series of nef alleles from clinical isolates, we identified a dysfunctional HIV group O Nef in which a highly conserved valine-glycine-phenylalanine (VGF region, which links a preceding acidic cluster with the following proline-rich motif into an amphipathic surface was deleted. In this study, we aimed to study the functional importance of this VGF region. Results The dysfunctional HIV group O8 nef allele was restored to the consensus sequence, and mutants of canonical (NL4.3, NA-7, SF2 and non-canonical (B2 and C1422 HIV-1 group M nef alleles were generated in which the amino acids of the VGF region were changed into alanines (VGF→AAA and tested for their capacity to interfere with surface receptor trafficking, signal transduction and enhancement of viral replication and infectivity. We found the VGF motif, and each individual amino acid of this motif, to be critical for downregulation of MHC-I and CXCR4. Moreover, Nef’s association with the cellular p21-activated kinase 2 (PAK2, the resulting deregulation of cofilin and inhibition of host cell actin remodeling, and targeting of Lck kinase to the trans-golgi-network (TGN were affected as well. Of particular interest, VGF integrity was essential for Nef-mediated enhancement of HIV virion infectivity and HIV replication in peripheral blood lymphocytes. For targeting of Lck kinase to the TGN and viral infectivity, especially the phenylalanine of the triplet was essential. At the molecular level, the VGF motif was required for the physical interaction of the adjacent proline-rich motif with Hck. Conclusion Based on these findings, we

  14. Multiple group I introns in the small-subunit rDNA of Botryosphaeria dothidea: implication for intraspecific genetic diversity.

    Directory of Open Access Journals (Sweden)

    Chao Xu

    Full Text Available Botryosphaeria dothidea is a widespread and economically important pathogen on various fruit trees, and it often causes die-back and canker on limbs and fruit rot. In characterizing intraspecies genetic variation within this fungus, group I introns, rich in rDNA of fungi, may provide a productive region for exploration. In this research, we analysed complete small subunit (SSU ribosomal DNA (rDNA sequences of 37 B. dothidea strains, and found four insertions, designated Bdo.S943, Bdo.S1199-A, Bdo.S1199-B and Bdo.S1506, at three positions. Sequence analysis and structure prediction revealed that both Bdo.S943 and Bdo.S1506 belonged to subgroup IC1 of group I introns, whereas Bdo.S1199-A and Bdo.S1199-B corresponded to group IE introns. Moreover, Bdo.S1199-A was found to host an open reading frame (ORF for encoding the homing endonuclease (HE, whereas Bdo.S1199-B, an evolutionary descendant of Bdo.S1199-A, included a degenerate HE. The above four introns were novel, and were the first group I introns observed and characterized in this species. Differential distribution of these introns revealed that all strains could be separated into four genotypes. Genotype III (no intron and genotype IV (Bdo.S1199-B were each found in only one strain, whereas genotype I (Bdo.S1199-A and genotype II (Bdo.S943 and Bdo.S1506 occurred in 95% of the strains. There is a correlation between B. dothidea genotypes and hosts or geographic locations. Thus, these newly discovered group I introns can help to advance understanding of genetic differentiation within B. dothidea.

  15. An intronic microRNA silences genes that are functionally antagonistic to its host gene.

    Science.gov (United States)

    Barik, Sailen

    2008-09-01

    MicroRNAs (miRNAs) are short noncoding RNAs that down-regulate gene expression by silencing specific target mRNAs. While many miRNAs are transcribed from their own genes, nearly half map within introns of 'host' genes, the significance of which remains unclear. We report that transcriptional activation of apoptosis-associated tyrosine kinase (AATK), essential for neuronal differentiation, also generates miR-338 from an AATK gene intron that silences a family of mRNAs whose protein products are negative regulators of neuronal differentiation. We conclude that an intronic miRNA, transcribed together with the host gene mRNA, may serve the interest of its host gene by silencing a cohort of genes that are functionally antagonistic to the host gene itself.

  16. Aplikasi Ornamen Khas Maluku untuk Pengembangan Desain Motif Batik

    Directory of Open Access Journals (Sweden)

    Masiswo Masiswo

    2016-04-01

    Full Text Available ABSTRAKMaluku memiliki banyak ragam hias budaya warisan nilai leluhur berupa ornamen etnis yang merupakan kesenian dan keterampilan kerajinan. Hasil warisan tersebut sampai saat ini masih lestari hidup serta dapat dinikmati sebagai konsumsi rohani yang memuaskan manusia. Berkaitan dengan keberlangsungan nilai-nilai tradisi etnis yang berwujud pada ornamen-ornamen daerah Maluku, maka dikembangkan untuk kebutuhan manusia berupa motif batik pada kain. Pengembangan ornamen ini lebih menekankan pada representasi akan bentuk-bentuk ornamen yang diterapkan pada kerajinan batik berupa motif khas Maluku. Pengembangan alternatif desain motif batik dibuat tiga variasi yang bersumber dari ornamen khas Maluku dibuat prototipe produknya dan diuji ketahanan luntur warnanya. Hasil uji ketahanan luntur warna terhadap gosokan basah dari tiga prototipe produk berpredikat baik sekali terdapat pada “Motif Siwa” dan predikat baik pada motif “Siwa Talang” dan motif “Matahari Siwa Talang”.Kata kunci: desain, Maluku, motif batik, ornamenABSTRACTMaluku has much decorative ancestral cultural heritage value in the form of ornament ethnic arts and crafts skills. The result of the legacy is still sustainable living can be enjoyed as well as satisfying spiritual human consumption.Related to the sustainability of traditional values in the form of ethnic ornaments Maluku, it was developed for human needs in the form of batik cloth . The development of these ornaments will be more emphasis on the representation forms of ornamentation that is applied to a batik motif Maluku. Development of alternative design motif made three variations. The development of three alternative design motifs derived from the Maluku ornaments made and tested a prototype product color fastness. The test results of color fastness to wet rubbing of the three prototypes are excellent products predicated on the "Motif Siwa" and a good rating on the motif "Siwa Talang" and motif "Matahari Siwa

  17. Requirement for asparagine in the aquaporin NPA sequence signature motifs for cation exclusion

    DEFF Research Database (Denmark)

    Wree, Dorothea; Wu, Binghua; Zeuthen, Thomas

    2011-01-01

    Two highly conserved NPA motifs are a hallmark of the aquaporin (AQP) family. The NPA triplets form N-terminal helix capping structures with the Asn side chains located in the centre of the water or solute-conducting channel, and are considered to play an important role in AQP selectivity. Although...... interchangeable at both NPA sites without affecting protein expression or water, glycerol and methylamine permeability. However, other mutations in the NPA region led to reduced permeability (S186C and S186D), to nonfunctional channels (N64D), or even to lack of protein expression (S186A and S186T). Using...... electrophysiology, we found that an analogous mammalian AQP1 N76S mutant excluded protons and potassium ions, but leaked sodium ions, providing an argument for the overwhelming prevalence of Asn over other amino acids. We conclude that, at the first position in the NPA motifs, only Asn provides efficient helix cap...

  18. The limits of de novo DNA motif discovery.

    Directory of Open Access Journals (Sweden)

    David Simcha

    Full Text Available A major challenge in molecular biology is reverse-engineering the cis-regulatory logic that plays a major role in the control of gene expression. This program includes searching through DNA sequences to identify "motifs" that serve as the binding sites for transcription factors or, more generally, are predictive of gene expression across cellular conditions. Several approaches have been proposed for de novo motif discovery-searching sequences without prior knowledge of binding sites or nucleotide patterns. However, unbiased validation is not straightforward. We consider two approaches to unbiased validation of discovered motifs: testing the statistical significance of a motif using a DNA "background" sequence model to represent the null hypothesis and measuring performance in predicting membership in gene clusters. We demonstrate that the background models typically used are "too null," resulting in overly optimistic assessments of significance, and argue that performance in predicting TF binding or expression patterns from DNA motifs should be assessed by held-out data, as in predictive learning. Applying this criterion to common motif discovery methods resulted in universally poor performance, although there is a marked improvement when motifs are statistically significant against real background sequences. Moreover, on synthetic data where "ground truth" is known, discriminative performance of all algorithms is far below the theoretical upper bound, with pronounced "over-fitting" in training. A key conclusion from this work is that the failure of de novo discovery approaches to accurately identify motifs is basically due to statistical intractability resulting from the fixed size of co-regulated gene clusters, and thus such failures do not necessarily provide evidence that unfound motifs are not active biologically. Consequently, the use of prior knowledge to enhance motif discovery is not just advantageous but necessary. An implementation of

  19. Presence of a consensus DNA motif at nearby DNA sequence of the mutation susceptible CG nucleotides.

    Science.gov (United States)

    Chowdhury, Kaushik; Kumar, Suresh; Sharma, Tanu; Sharma, Ankit; Bhagat, Meenakshi; Kamai, Asangla; Ford, Bridget M; Asthana, Shailendra; Mandal, Chandi C

    2018-01-10

    Complexity in tissues affected by cancer arises from somatic mutations and epigenetic modifications in the genome. The mutation susceptible hotspots present within the genome indicate a non-random nature and/or a position specific selection of mutation. An association exists between the occurrence of mutations and epigenetic DNA methylation. This study is primarily aimed at determining mutation status, and identifying a signature for predicting mutation prone zones of tumor suppressor (TS) genes. Nearby sequences from the top five positions having a higher mutation frequency in each gene of 42 TS genes were selected from a cosmic database and were considered as mutation prone zones. The conserved motifs present in the mutation prone DNA fragments were identified. Molecular docking studies were done to determine putative interactions between the identified conserved motifs and enzyme methyltransferase DNMT1. Collective analysis of 42 TS genes found GC as the most commonly replaced and AT as the most commonly formed residues after mutation. Analysis of the top 5 mutated positions of each gene (210 DNA segments for 42 TS genes) identified that CG nucleotides of the amino acid codons (e.g., Arginine) are most susceptible to mutation, and found a consensus DNA "T/AGC/GAGGA/TG" sequence present in these mutation prone DNA segments. Similar to TS genes, analysis of 54 oncogenes not only found CG nucleotides of the amino acid Arg as the most susceptible to mutation, but also identified the presence of similar consensus DNA motifs in the mutation prone DNA fragments (270 DNA segments for 54 oncogenes) of oncogenes. Docking studies depicted that, upon binding of DNMT1 methylates to this consensus DNA motif (C residues of CpG islands), mutation was likely to occur. Thus, this study proposes that DNMT1 mediated methylation in chromosomal DNA may decrease if a foreign DNA segment containing this consensus sequence along with CG nucleotides is exogenously introduced to dividing

  20. Parole, Sintagmatik, dan Paradigmatik Motif Batik Mega Mendung

    Directory of Open Access Journals (Sweden)

    Rudi - Nababan

    2012-04-01

    Full Text Available ABSTRACT   Discussing traditional batik is related a lot to the organization system of fine arts element ac- companying it, either the pattern of the motif or the technique of the making. In this case, the motif of Mega Mendung Cirebon certainly has patterns and rules which are traditionally different from the other motifs in other areas. Through  semiotics analysis especially with Saussure and Pierce concept, it can be traced that batik with Cirebon motif, in this case Mega Mendung motif, has parole and langue system, as unique fine arts language in batik, and structure of visual syntagmatic and paradigmatic. In the context of batik motif as fine arts language, it is surely related to sign system as symbol and icon.       Keywords: visual semiotic, Cirebon’s batik.

  1. Regulation of mRNA Levels by Decay-Promoting Introns that Recruit the Exosome Specificity Factor Mmi1

    Directory of Open Access Journals (Sweden)

    Cornelia Kilchert

    2015-12-01

    Full Text Available In eukaryotic cells, inefficient splicing is surprisingly common and leads to the degradation of transcripts with retained introns. How pre-mRNAs are committed to nuclear decay is unknown. Here, we uncover a mechanism by which specific intron-containing transcripts are targeted for nuclear degradation in fission yeast. Sequence elements within these “decay-promoting” introns co-transcriptionally recruit the exosome specificity factor Mmi1, which induces degradation of the unspliced precursor and leads to a reduction in the levels of the spliced mRNA. This mechanism negatively regulates levels of the RNA helicase DDX5/Dbp2 to promote cell survival in response to stress. In contrast, fast removal of decay-promoting introns by co-transcriptional splicing precludes Mmi1 recruitment and relieves negative expression regulation. We propose that decay-promoting introns facilitate the regulation of gene expression. Based on the identification of multiple additional Mmi1 targets, including mRNAs, long non-coding RNAs, and sn/snoRNAs, we suggest a general role in RNA regulation for Mmi1 through transcript degradation.

  2. In silico genome-wide identification and characterization of the glutathione S-transferase gene family in Vigna radiata.

    Science.gov (United States)

    Vaish, Swati; Awasthi, Praveen; Tiwari, Siddharth; Tiwari, Shailesh Kumar; Gupta, Divya; Basantani, Mahesh Kumar

    2018-05-01

    Plant glutathione S-transferases (GSTs) are integral to normal plant metabolism and biotic and abiotic stress tolerance. The GST gene family has been characterized in diverse plant species using molecular biology and bioinformatics approaches. In the current study, in silico analysis identified 44 GSTs in Vigna radiata. Of the total 44 GSTs identified, chromosomal locations of 31 GSTs were confirmed. The pI value of GST proteins ranged from 5.10 to 9.40. The predicted molecular weights ranged from 13.12 to 50 kDa. Subcellular localization analysis revealed that all GSTs were predominantly localized in the cytoplasm. The active site amino acids were confirmed to be serine in tau, phi, theta, zeta, and TCHQD; cysteine in lambda, DHAR, and omega; and tyrosine in EF1G. The gene architecture conformed to the two-exon/one-intron and three-exon/two-intron organization in the case of tau and phi classes, respectively. MEME analysis identified 10 significantly conserved motifs with the width of 8-50 amino acids. The motifs identified were either specific to a specific GST class or were shared by multiple GST classes. The results of the current study will be of potential importance in the characterization of the GST gene family in V. radiata, an economically important leguminous crop.

  3. Loss of a highly conserved sterile alpha motif domain gene (WEEP) results in pendulous branch growth in peach trees.

    Science.gov (United States)

    Hollender, Courtney A; Pascal, Thierry; Tabb, Amy; Hadiarto, Toto; Srinivasan, Chinnathambi; Wang, Wanpeng; Liu, Zhongchi; Scorza, Ralph; Dardick, Chris

    2018-05-15

    Plant shoots typically grow upward in opposition to the pull of gravity. However, exceptions exist throughout the plant kingdom. Most conspicuous are trees with weeping or pendulous branches. While such trees have long been cultivated and appreciated for their ornamental value, the molecular basis behind the weeping habit is not known. Here, we characterized a weeping tree phenotype in Prunus persica (peach) and identified the underlying genetic mutation using a genomic sequencing approach. Weeping peach tree shoots exhibited a downward elliptical growth pattern and did not exhibit an upward bending in response to 90° reorientation. The causative allele was found to be an uncharacterized gene, Ppa013325 , having a 1.8-Kb deletion spanning the 5' end. This gene, dubbed WEEP , was predominantly expressed in phloem tissues and encodes a highly conserved 129-amino acid protein containing a sterile alpha motif (SAM) domain. Silencing WEEP in the related tree species Prunus domestica (plum) resulted in more outward, downward, and wandering shoot orientations compared to standard trees, supporting a role for WEEP in directing lateral shoot growth in trees. This previously unknown regulator of branch orientation, which may also be a regulator of gravity perception or response, provides insights into our understanding of how tree branches grow in opposition to gravity and could serve as a critical target for manipulating tree architecture for improved tree shape in agricultural and horticulture applications. Copyright © 2018 the Author(s). Published by PNAS.

  4. [Genome-wide identification and expression analysis of the WRKY gene family in peach].

    Science.gov (United States)

    Gu, Yan-bing; Ji, Zhi-rui; Chi, Fu-mei; Qiao, Zhuang; Xu, Cheng-nan; Zhang, Jun-xiang; Zhou, Zong-shan; Dong, Qing-long

    2016-03-01

    The WRKY transcription factors are one of the largest families of transcriptional regulators and play diverse regulatory roles in biotic and abiotic stresses, plant growth and development processes. In this study, the WRKY DNA-binding domain (Pfam Database number: PF03106) downloaded from Pfam protein families database was exploited to identify WRKY genes from the peach (Prunus persica 'Lovell') genome using HMMER 3.0. The obtained amino acid sequences were analyzed with DNAMAN 5.0, WebLogo 3, MEGA 5.1, MapInspect and MEME bioinformatics softwares. Totally 61 peach WRKY genes were found in the peach genome. Our phylogenetic analysis revealed that peach WRKY genes were classified into three Groups: Ⅰ, Ⅱ and Ⅲ. The WRKY N-terminal and C-terminal domains of Group Ⅰ (group I-N and group I-C) were monophyletic. The Group Ⅱ was sub-divided into five distinct clades (groupⅡ-a, Ⅱ-b, Ⅱ-c, Ⅱ-d and Ⅱ-e). Our domain analysis indicated that the WRKY regions contained a highly conserved heptapeptide stretch WRKYGQK at its N-terminus followed by a zinc-finger motif. The chromosome mapping analysis showed that peach WRKY genes were distributed with different densities over 8 chromosomes. The intron-exon structure analysis revealed that structures of the WRKY gene were highly conserved in the peach. The conserved motif analysis showed that the conserved motifs 1, 2 and 3, which specify the WRKY domain, were observed in all peach WRKY proteins, motif 5 as the unknown domain was observed in group Ⅱ-d, two WRKY domains were assigned to GroupⅠ. SqRT-PCR and qRT-PCR results indicated that 16 PpWRKY genes were expressed in roots, stems, leaves, flowers and fruits at various expression levels. Our analysis thus identified the PpWRKY gene families, and future functional studies are needed to reveal its specific roles.

  5. Mutational definition of binding requirements of an hnRNP-like protein in Arabidopsis using fluorescence correlation spectroscopy

    Energy Technology Data Exchange (ETDEWEB)

    Leder, Verena [Molecular Cell Physiology, Faculty of Biology, Bielefeld University (Germany); Biomolecular Photonics, Faculty of Physics, Bielefeld University (Germany); Lummer, Martina [Molecular Cell Physiology, Faculty of Biology, Bielefeld University (Germany); Tegeler, Kathrin [Molecular Cell Physiology, Faculty of Biology, Bielefeld University (Germany); Biomolecular Photonics, Faculty of Physics, Bielefeld University (Germany); Humpert, Fabian [Biomolecular Photonics, Faculty of Physics, Bielefeld University (Germany); Lewinski, Martin [Molecular Cell Physiology, Faculty of Biology, Bielefeld University (Germany); Schüttpelz, Mark [Biomolecular Photonics, Faculty of Physics, Bielefeld University (Germany); Staiger, Dorothee, E-mail: dorothee.staiger@uni-bielefeld.de [Molecular Cell Physiology, Faculty of Biology, Bielefeld University (Germany)

    2014-10-10

    Highlights: • We use FCS to investigate binding site requirements for the hnRNP-like protein AtGRP7. • We identify three nucleotides critical for AtGRP7 binding to its own intron. • Mutation of the conserved R{sup 49} abolishes binding altogether. • The paralogue AtGRP8 binds to an overlapping motif with different sequence requirement. • The glycine-rich stretch of a plant hnRNP-like protein contributes to binding. - Abstract: Arabidopsis thaliana glycine-rich RNA binding protein 7 (AtGRP7) is part of a negative feedback loop through which it regulates alternative splicing and steady-state abundance of its pre-mRNA. Here we use fluorescence correlation spectroscopy to investigate the requirements for AtGRP7 binding to its intron using fluorescently-labelled synthetic oligonucleotides. By systematically introducing point mutations we identify three nucleotides that lead to an increased K{sub d} value when mutated and thus are critical for AtGRP7 binding. Simultaneous mutation of all three residues abrogates binding. The paralogue AtGRP8 binds to an overlapping motif but with a different sequence preference, in line with overlapping but not identical functions of this protein pair. Truncation of the glycine-rich domain reduces the binding affinity of AtGRP7, showing for the first time that the glycine-rich stretch of a plant hnRNP-like protein contributes to binding. Mutation of the conserved R{sup 49} that is crucial for AtGRP7 function in pathogen defence and splicing abolishes binding.

  6. Mutational definition of binding requirements of an hnRNP-like protein in Arabidopsis using fluorescence correlation spectroscopy

    International Nuclear Information System (INIS)

    Leder, Verena; Lummer, Martina; Tegeler, Kathrin; Humpert, Fabian; Lewinski, Martin; Schüttpelz, Mark; Staiger, Dorothee

    2014-01-01

    Highlights: • We use FCS to investigate binding site requirements for the hnRNP-like protein AtGRP7. • We identify three nucleotides critical for AtGRP7 binding to its own intron. • Mutation of the conserved R 49 abolishes binding altogether. • The paralogue AtGRP8 binds to an overlapping motif with different sequence requirement. • The glycine-rich stretch of a plant hnRNP-like protein contributes to binding. - Abstract: Arabidopsis thaliana glycine-rich RNA binding protein 7 (AtGRP7) is part of a negative feedback loop through which it regulates alternative splicing and steady-state abundance of its pre-mRNA. Here we use fluorescence correlation spectroscopy to investigate the requirements for AtGRP7 binding to its intron using fluorescently-labelled synthetic oligonucleotides. By systematically introducing point mutations we identify three nucleotides that lead to an increased K d value when mutated and thus are critical for AtGRP7 binding. Simultaneous mutation of all three residues abrogates binding. The paralogue AtGRP8 binds to an overlapping motif but with a different sequence preference, in line with overlapping but not identical functions of this protein pair. Truncation of the glycine-rich domain reduces the binding affinity of AtGRP7, showing for the first time that the glycine-rich stretch of a plant hnRNP-like protein contributes to binding. Mutation of the conserved R 49 that is crucial for AtGRP7 function in pathogen defence and splicing abolishes binding

  7. Motif statistics and spike correlations in neuronal networks

    International Nuclear Information System (INIS)

    Hu, Yu; Shea-Brown, Eric; Trousdale, James; Josić, Krešimir

    2013-01-01

    Motifs are patterns of subgraphs of complex networks. We studied the impact of such patterns of connectivity on the level of correlated, or synchronized, spiking activity among pairs of cells in a recurrent network of integrate and fire neurons. For a range of network architectures, we find that the pairwise correlation coefficients, averaged across the network, can be closely approximated using only three statistics of network connectivity. These are the overall network connection probability and the frequencies of two second order motifs: diverging motifs, in which one cell provides input to two others, and chain motifs, in which two cells are connected via a third intermediary cell. Specifically, the prevalence of diverging and chain motifs tends to increase correlation. Our method is based on linear response theory, which enables us to express spiking statistics using linear algebra, and a resumming technique, which extrapolates from second order motifs to predict the overall effect of coupling on network correlation. Our motif-based results seek to isolate the effect of network architecture perturbatively from a known network state. (paper)

  8. Bayesian centroid estimation for motif discovery.

    Science.gov (United States)

    Carvalho, Luis

    2013-01-01

    Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  9. Bayesian centroid estimation for motif discovery.

    Directory of Open Access Journals (Sweden)

    Luis Carvalho

    Full Text Available Biological sequences may contain patterns that signal important biomolecular functions; a classical example is regulation of gene expression by transcription factors that bind to specific patterns in genomic promoter regions. In motif discovery we are given a set of sequences that share a common motif and aim to identify not only the motif composition, but also the binding sites in each sequence of the set. We propose a new centroid estimator that arises from a refined and meaningful loss function for binding site inference. We discuss the main advantages of centroid estimation for motif discovery, including computational convenience, and how its principled derivation offers further insights about the posterior distribution of binding site configurations. We also illustrate, using simulated and real datasets, that the centroid estimator can differ from the traditional maximum a posteriori or maximum likelihood estimators.

  10. The LINKS motif zippers trans-acyltransferase polyketide synthase assembly lines into a biosynthetic megacomplex.

    Science.gov (United States)

    Gay, Darren C; Wagner, Drew T; Meinke, Jessica L; Zogzas, Charles E; Gay, Glen R; Keatinge-Clay, Adrian T

    2016-03-01

    Polyketides such as the clinically-valuable antibacterial agent mupirocin are constructed by architecturally-sophisticated assembly lines known as trans-acyltransferase polyketide synthases. Organelle-sized megacomplexes composed of several copies of trans-acyltransferase polyketide synthase assembly lines have been observed by others through transmission electron microscopy to be located at the Bacillus subtilis plasma membrane, where the synthesis and export of the antibacterial polyketide bacillaene takes place. In this work we analyze ten crystal structures of trans-acyltransferase polyketide synthases ketosynthase domains, seven of which are reported here for the first time, to characterize a motif capable of zippering assembly lines into a megacomplex. While each of the three-helix LINKS (Laterally-INteracting Ketosynthase Sequence) motifs is observed to similarly dock with a spatially-reversed copy of itself through hydrophobic and ionic interactions, the amino acid sequences of this motif are not conserved. Such a code is appropriate for mediating homotypic contacts between assembly lines to ensure the ordered self-assembly of a noncovalent, yet tightly-knit, enzymatic network. LINKS-mediated lateral interactions would also have the effect of bolstering the vertical association of the polypeptides that comprise a polyketide synthase assembly line. Copyright © 2015 Elsevier Inc. All rights reserved.

  11. Vaccinia protein F12 has structural similarity to kinesin light chain and contains a motor binding motif required for virion export.

    Directory of Open Access Journals (Sweden)

    Gareth W Morgan

    2010-02-01

    Full Text Available Vaccinia virus (VACV uses microtubules for export of virions to the cell surface and this process requires the viral protein F12. Here we show that F12 has structural similarity to kinesin light chain (KLC, a subunit of the kinesin-1 motor that binds cargo. F12 and KLC share similar size, pI, hydropathy and cargo-binding tetratricopeptide repeats (TPRs. Moreover, molecular modeling of F12 TPRs upon the crystal structure of KLC2 TPRs showed a striking conservation of structure. We also identified multiple TPRs in VACV proteins E2 and A36. Data presented demonstrate that F12 is critical for recruitment of kinesin-1 to virions and that a conserved tryptophan and aspartic acid (WD motif, which is conserved in the kinesin-1-binding sequence (KBS of the neuronal protein calsyntenin/alcadein and several other cellular kinesin-1 binding proteins, is essential for kinesin-1 recruitment and virion transport. In contrast, mutation of WD motifs in protein A36 revealed they were not required for kinesin-1 recruitment or IEV transport. This report of a viral KLC-like protein containing a KBS that is conserved in several cellular proteins advances our understanding of how VACV recruits the kinesin motor to virions, and exemplifies how viruses use molecular mimicry of cellular components to their advantage.

  12. Characterization of the GXXXG motif in the first transmembrane segment of Japanese encephalitis virus precursor membrane (prM protein

    Directory of Open Access Journals (Sweden)

    Wu Suh-Chin

    2010-05-01

    Full Text Available Abstract The interaction between prM and E proteins in flavivirus-infected cells is a major driving force for the assembly of flavivirus particles. We used site-directed mutagenesis to study the potential role of the transmembrane domains of the prM proteins of Japanese encephalitis virus (JEV in prM-E heterodimerization as well as subviral particle formation. Alanine insertion scanning mutagenesis within the GXXXG motif in the first transmembrane segment of JEV prM protein affected the prM-E heterodimerization; its specificity was confirmed by replacing the two glycines of the GXXXG motif with alanine, leucine and valine. The GXXXG motif was found to be conserved in the JEV serocomplex viruses but not other flavivirus groups. These mutants with alanine inserted in the two prM transmembrane segments all impaired subviral particle formation in cell cultures. The prM transmembrane domains of JEV may play importation roles in prM-E heterodimerization and viral particle assembly.

  13. WeederH: an algorithm for finding conserved regulatory motifs and regions in homologous sequences

    Directory of Open Access Journals (Sweden)

    Pesole Graziano

    2007-02-01

    Full Text Available Abstract Background This work addresses the problem of detecting conserved transcription factor binding sites and in general regulatory regions through the analysis of sequences from homologous genes, an approach that is becoming more and more widely used given the ever increasing amount of genomic data available. Results We present an algorithm that identifies conserved transcription factor binding sites in a given sequence by comparing it to one or more homologs, adapting a framework we previously introduced for the discovery of sites in sequences from co-regulated genes. Differently from the most commonly used methods, the approach we present does not need or compute an alignment of the sequences investigated, nor resorts to descriptors of the binding specificity of known transcription factors. The main novel idea we introduce is a relative measure of conservation, assuming that true functional elements should present a higher level of conservation with respect to the rest of the sequence surrounding them. We present tests where we applied the algorithm to the identification of conserved annotated sites in homologous promoters, as well as in distal regions like enhancers. Conclusion Results of the tests show how the algorithm can provide fast and reliable predictions of conserved transcription factor binding sites regulating the transcription of a gene, with better performances than other available methods for the same task. We also show examples on how the algorithm can be successfully employed when promoter annotations of the genes investigated are missing, or when regulatory sites and regions are located far away from the genes.

  14. The conserved 12-amino acid stretch in the inter-bromodomain region of BET family proteins functions as a nuclear localization signal.

    Science.gov (United States)

    Fukazawa, Hidesuke; Masumi, Atsuko

    2012-01-01

    The bromodomain and extraterminal (BET) family is a group of chromatin-binding proteins characterized by two bromodomains, an extraterminal (ET) domain, and several other conserved regions of unknown function. In humans, the BET family consists of four members, BRD2, BRD3, BRD4 and BRDT, that all normally localize to the nucleus. We identified a 12-amino acid stretch in the inter-bromodomain region that is perfectly conserved among the BET family members. We deleted these residues and expressed the mutant proteins in HEK293T cells to investigate the function of this motif. We found that the deletion of this motif alters the localization of BET proteins. Mutated BRD3 and BRD4 were excluded from the nucleus, and BRDT was found to be diffused throughout the nucleus and cytoplasm. Although the mutant BRD2 remained predominantly in the nucleus, a punctate distribution was also observed in the cytosol. It has been reported that a conserved motif between the second bromodomain and the ET domain serves as a nuclear localization signal for BRD2. Nevertheless, BET mutants lacking the reported nuclear localization signal motif but retaining the 12-amino acid stretch resided in the nucleus. Furthermore, these mutants were diffused throughout the cytoplasm when the 12 residues were removed. These results indicate that the conserved amino acid stretch in the inter-bromodomain region of the BET family functions as a nuclear localization signal.

  15. Euglena gracilis chloroplast DNA: analysis of a 1.6 kb intron of the psb C gene containing an open reading frame of 458 codons.

    Science.gov (United States)

    Montandon, P E; Vasserot, A; Stutz, E

    1986-01-01

    We retrieved a 1.6 kbp intron separating two exons of the psb C gene which codes for the 44 kDa reaction center protein of photosystem II. This intron is 3 to 4 times the size of all previously sequenced Euglena gracilis chloroplast introns. It contains an open reading frame of 458 codons potentially coding for a basic protein of 54 kDa of yet unknown function. The intron boundaries follow consensus sequences established for chloroplast introns related to class II and nuclear pre-mRNA introns. Its 3'-terminal segment has structural features similar to class II mitochondrial introns with an invariant base A as possible branch point for lariat formation.

  16. CONTEMPORARY USAGE OF TRADITIONAL TURKISH MOTIFS IN PRODUCT DESIGNS

    Directory of Open Access Journals (Sweden)

    Tulay Gumuser

    2012-12-01

    Full Text Available The aim of this study is to identify the traditional Turkish motifs and its relations among present industrial designs. Traditional Turkish motifs played a very important role in 16th century onwards. The arts of the Ottoman Empire were used because of their symbolic meanings and unique styles. When we examine these motifs we encounter; Tiger Stripe, Three Spot (Çintemani, Rumi, Hatayi, Penç, Cloud, Crescent, Star, Crown, Hyacinth, Tulip and Carnation motifs. Nowadays, Turkish designers have begun to use these traditional Turkish motifs in their designs so as to create differences and awareness in the world design. The examples of these industrial designs, using the Turkish motifs, have survived and have Ottoman heritage and historical value. In this study, the Turkish motifs will be examined along with their focus on contemporary Turkish industrial designs used today.

  17. RNA motif search with data-driven element ordering.

    Science.gov (United States)

    Rampášek, Ladislav; Jimenez, Randi M; Lupták, Andrej; Vinař, Tomáš; Brejová, Broňa

    2016-05-18

    In this paper, we study the problem of RNA motif search in long genomic sequences. This approach uses a combination of sequence and structure constraints to uncover new distant homologs of known functional RNAs. The problem is NP-hard and is traditionally solved by backtracking algorithms. We have designed a new algorithm for RNA motif search and implemented a new motif search tool RNArobo. The tool enhances the RNAbob descriptor language, allowing insertions in helices, which enables better characterization of ribozymes and aptamers. A typical RNA motif consists of multiple elements and the running time of the algorithm is highly dependent on their ordering. By approaching the element ordering problem in a principled way, we demonstrate more than 100-fold speedup of the search for complex motifs compared to previously published tools. We have developed a new method for RNA motif search that allows for a significant speedup of the search of complex motifs that include pseudoknots. Such speed improvements are crucial at a time when the rate of DNA sequencing outpaces growth in computing. RNArobo is available at http://compbio.fmph.uniba.sk/rnarobo .

  18. Genome-wide data-mining of candidate human splice translational efficiency polymorphisms (STEPs and an online database.

    Directory of Open Access Journals (Sweden)

    Christopher A Raistrick

    2010-10-01

    Full Text Available Variation in pre-mRNA splicing is common and in some cases caused by genetic variants in intronic splicing motifs. Recent studies into the insulin gene (INS discovered a polymorphism in a 5' non-coding intron that influences the likelihood of intron retention in the final mRNA, extending the 5' untranslated region and maintaining protein quality. Retention was also associated with increased insulin levels, suggesting that such variants--splice translational efficiency polymorphisms (STEPs--may relate to disease phenotypes through differential protein expression. We set out to explore the prevalence of STEPs in the human genome and validate this new category of protein quantitative trait loci (pQTL using publicly available data.Gene transcript and variant data were collected and mined for candidate STEPs in motif regions. Sequences from transcripts containing potential STEPs were analysed for evidence of splice site recognition and an effect in expressed sequence tags (ESTs. 16 publicly released genome-wide association data sets of common diseases were searched for association to candidate polymorphisms with HapMap frequency data. Our study found 3324 candidate STEPs lying in motif sequences of 5' non-coding introns and further mining revealed 170 with transcript evidence of intron retention. 21 potential STEPs had EST evidence of intron retention or exon extension, as well as population frequency data for comparison.Results suggest that the insulin STEP was not a unique example and that many STEPs may occur genome-wide with potentially causal effects in complex disease. An online database of STEPs is freely accessible at http://dbstep.genes.org.uk/.

  19. Identification of novel intronic BRCA1 variants of uncertain ...

    Indian Academy of Sciences (India)

    in a Thai hereditary breast cancer family. Adisorn Ratanaphan, Pornpen Panomwan, Bhutorn Canyuk and Tanaphon Maipang. J. Genet. 90, 327–331. Table 1. Oligodeoxyribonucleotide primers used for PCR amplification of BRCA1 exon–intron 7 boundary sequences. Primers. Nucleotide position. Primer sequence (5 –3 ).

  20. Highly Conserved Arg Residue of ERFNIN Motif of Pro-Domain is Important for pH-Induced Zymogen Activation Process in Cysteine Cathepsins K and L.

    Science.gov (United States)

    Aich, Pulakesh; Biswas, Sampa

    2018-06-01

    Pro-domain of a cysteine cathepsin contains a highly conserved Ex 2 Rx 2 Fx 2 Nx 3 Ix 3 N (ERFNIN) motif. The zymogen structure of cathepsins revealed that the Arg(R) residue of the motif is a central residue of a salt-bridge/H-bond network, stabilizing the scaffold of the pro-domain. Importance of the arginine is also demonstrated in studies where a single mutation (Arg → Trp) in human lysosomal cathepsin K (hCTSK) is linked to a bone-related genetic disorder "Pycnodysostosis". In the present study, we have characterized in vitro Arg → Trp mutant of hCTSK and the same mutant of hCTSL. The R → W mutant of hCTSK revealed that this mutation leads to an unstable zymogen that is spontaneously activated and auto-proteolytically degraded rapidly. In contrast, the same mutant of hCTSL is sufficiently stable and has proteolytic activity almost like its wild-type counterpart; however it shows an altered zymogen activation condition in terms of pH, temperature and time. Far and near UV circular dichroism and intrinsic tryptophan fluorescence experiments have revealed that the mutation has minimal effect on structure of the protease hCTSL. Molecular modeling studies shows that the mutated Trp31 in hCTSL forms an aromatic cluster with Tyr23 and Trp30 leading to a local stabilization of pro-domain and supplements the loss of salt-bridge interaction mediated by Arg31 in wild-type. In hCTSK-R31W mutant, due to presence of a non-aromatic Ser30 residue such interaction is not possible and may be responsible for local instability. These differences may cause detrimental effects of R31W mutation on the regulation of hCTSK auto-activation process compared to altered activation process in hCTSL.

  1. Protein clustering and RNA phylogenetic reconstruction of the influenza A [corrected] virus NS1 protein allow an update in classification and identification of motif conservation.

    Science.gov (United States)

    Sevilla-Reyes, Edgar E; Chavaro-Pérez, David A; Piten-Isidro, Elvira; Gutiérrez-González, Luis H; Santos-Mendoza, Teresa

    2013-01-01

    The non-structural protein 1 (NS1) of influenza A virus (IAV), coded by its third most diverse gene, interacts with multiple molecules within infected cells. NS1 is involved in host immune response regulation and is a potential contributor to the virus host range. Early phylogenetic analyses using 50 sequences led to the classification of NS1 gene variants into groups (alleles) A and B. We reanalyzed NS1 diversity using 14,716 complete NS IAV sequences, downloaded from public databases, without host bias. Removal of sequence redundancy and further structured clustering at 96.8% amino acid similarity produced 415 clusters that enhanced our capability to detect distinct subgroups and lineages, which were assigned a numerical nomenclature. Maximum likelihood phylogenetic reconstruction using RNA sequences indicated the previously identified deep branching separating group A from group B, with five distinct subgroups within A as well as two and five lineages within the A4 and A5 subgroups, respectively. Our classification model proposes that sequence patterns in thirteen amino acid positions are sufficient to fit >99.9% of all currently available NS1 sequences into the A subgroups/lineages or the B group. This classification reduces host and virus bias through the prioritization of NS1 RNA phylogenetics over host or virus phenetics. We found significant sequence conservation within the subgroups and lineages with characteristic patterns of functional motifs, such as the differential binding of CPSF30 and crk/crkL or the availability of a C-terminal PDZ-binding motif. To understand selection pressures and evolution acting on NS1, it is necessary to organize the available data. This updated classification may help to clarify and organize the study of NS1 interactions and pathogenic differences and allow the drawing of further functional inferences on sequences in each group, subgroup and lineage rather than on a strain-by-strain basis.

  2. The heptanucleotide motif GAGACGC is a key component of a cis-acting promoter element that is critical for SnSAG1 expression in Sarcocystis neurona.

    Science.gov (United States)

    Gaji, Rajshekhar Y; Howe, Daniel K

    2009-07-01

    The apicomplexan parasite Sarcocystis neurona undergoes a complex process of intracellular development, during which many genes are temporally regulated. The described study was undertaken to begin identifying the basic promoter elements that control gene expression in S. neurona. Sequence analysis of the 5'-flanking region of five S. neurona genes revealed a conserved heptanucleotide motif GAGACGC that is similar to the WGAGACG motif described upstream of multiple genes in Toxoplasma gondii. The promoter region for the major surface antigen gene SnSAG1, which contains three heptanucleotide motifs within 135 bases of the transcription start site, was dissected by functional analysis using a dual luciferase reporter assay. These analyses revealed that a minimal promoter fragment containing all three motifs was sufficient to drive reporter molecule expression, with the presence and orientation of the 5'-most heptanucleotide motif being absolutely critical for promoter function. Further studies should help to identify additional sequence elements important for promoter function and for controlling gene expression during intracellular development by this apicomplexan pathogen.

  3. Identification of sequence motifs significantly associated with antisense activity

    Directory of Open Access Journals (Sweden)

    Peek Andrew S

    2007-06-01

    Full Text Available Abstract Background Predicting the suppression activity of antisense oligonucleotide sequences is the main goal of the rational design of nucleic acids. To create an effective predictive model, it is important to know what properties of an oligonucleotide sequence associate significantly with antisense activity. Also, for the model to be efficient we must know what properties do not associate significantly and can be omitted from the model. This paper will discuss the results of a randomization procedure to find motifs that associate significantly with either high or low antisense suppression activity, analysis of their properties, as well as the results of support vector machine modelling using these significant motifs as features. Results We discovered 155 motifs that associate significantly with high antisense suppression activity and 202 motifs that associate significantly with low suppression activity. The motifs range in length from 2 to 5 bases, contain several motifs that have been previously discovered as associating highly with antisense activity, and have thermodynamic properties consistent with previous work associating thermodynamic properties of sequences with their antisense activity. Statistical analysis revealed no correlation between a motif's position within an antisense sequence and that sequences antisense activity. Also, many significant motifs existed as subwords of other significant motifs. Support vector regression experiments indicated that the feature set of significant motifs increased correlation compared to all possible motifs as well as several subsets of the significant motifs. Conclusion The thermodynamic properties of the significantly associated motifs support existing data correlating the thermodynamic properties of the antisense oligonucleotide with antisense efficiency, reinforcing our hypothesis that antisense suppression is strongly associated with probe/target thermodynamics, as there are no enzymatic

  4. Plant and yeast cornichon possess a conserved acidic motif required for correct targeting of plasma membrane cargos

    Czech Academy of Sciences Publication Activity Database

    Rosas-Santiago, P.; Lagunas-Goméz, D.; Yánez-Domínguez, C.; Vera-Estrella, R.; Zimmermannová, Olga; Sychrová, Hana; Pantoja, O.

    2017-01-01

    Roč. 1864, č. 10 (2017), s. 1809-1818 ISSN 0167-4889 R&D Projects: GA MŠk(CZ) LQ1604; GA MŠk(CZ) ED1.1.00/02.0109; GA ČR(CZ) GA17-01953S Institutional support: RVO:67985823 Keywords : cornichon * ScErv14 * acidic motif * cargo selection Subject RIV: EB - Genetics ; Molecular Biology OBOR OECD: Biochemistry and molecular biology Impact factor: 4.521, year: 2016

  5. Genic regions of a large salamander genome contain long introns and novel genes

    Directory of Open Access Journals (Sweden)

    Bryant Susan V

    2009-01-01

    Full Text Available Abstract Background The basis of genome size variation remains an outstanding question because DNA sequence data are lacking for organisms with large genomes. Sixteen BAC clones from the Mexican axolotl (Ambystoma mexicanum: c-value = 32 × 109 bp were isolated and sequenced to characterize the structure of genic regions. Results Annotation of genes within BACs showed that axolotl introns are on average 10× longer than orthologous vertebrate introns and they are predicted to contain more functional elements, including miRNAs and snoRNAs. Loci were discovered within BACs for two novel EST transcripts that are differentially expressed during spinal cord regeneration and skin metamorphosis. Unexpectedly, a third novel gene was also discovered while manually annotating BACs. Analysis of human-axolotl protein-coding sequences suggests there are 2% more lineage specific genes in the axolotl genome than the human genome, but the great majority (86% of genes between axolotl and human are predicted to be 1:1 orthologs. Considering that axolotl genes are on average 5× larger than human genes, the genic component of the salamander genome is estimated to be incredibly large, approximately 2.8 gigabases! Conclusion This study shows that a large salamander genome has a correspondingly large genic component, primarily because genes have incredibly long introns. These intronic sequences may harbor novel coding and non-coding sequences that regulate biological processes that are unique to salamanders.

  6. Analisis Unsur Matematika pada Motif Sulam Usus

    Directory of Open Access Journals (Sweden)

    Fredi Ganda Putra

    2017-12-01

    Full Text Available Based on interviews with researchers sources said that the beginning of the intestine embroidery is an art of genuine crafts. Called the intestine embroidery because this technique is a technique of combining a strand of cloth resembling the intestine formed according to the pattern by means of embroidered using a thread. Intestinal embroidery techniques were originally used to create a cover of the women's customary wardrobe of Lampung or often referred to as bebe. But not many people in Lampung, especially people who live in Lampung are still many who do not know and recognize the intestine embroidery because most only know tapis only characteristic of Lampung, besides that there are other cultural results that is embroidered intestine. There are still many who do not know that the intestine motif there is a knowledge of mathematics. The researcher's problem formulation is whether there are mathematical elements contained in the intestine embroidery motif based on the concept of geometry. The purpose of this study is to determine whether there are elements of mathematics contained in the intestine motif based on the concept of geometry. Subjects in this study consisted of 4 people obtained by purposive sampling technique. From the results of data analysis conducted by using descriptive analysis and discussion as follows: (1 Intestinal embroidery motif contains the meaning of mathematics and culture or often called Etnomatematika. On the meaning of culture there is a link between the embroidery intestine with a culture that has been there before as the existence of cultural linkage between Hindu belief Buddhism and there are similarities of motifs and decorative patterns contained in the motif embroidery intestine with ornamental variety in Indonesia. (2 The relationship between the intestine with mathematical motifs there are elements of mathematics such as geometry elements in the form of geometry of dimension one and dimension two, and the

  7. Conservation of the LexA repressor binding site in Deinococcus radiodurans

    Directory of Open Access Journals (Sweden)

    Khan Feroz

    2008-03-01

    Full Text Available The LexA protein is a transcriptional repressor of the bacterial SOS DNA repair system, which comprises a set of DNA repair and cellular survival genes that are induced in response to DNA damage. Its varied DNA binding motifs have been characterized and reported in the Escherichia coli, Bacillus subtilis, rhizobia family members, marine magnetotactic bacterium, Salmonella typhimurium and recently in Mycobacterium tuberculosis and this motifs information has been used in our theoretical analysis to detect its novel regulated genes in radio-resistant Deinococcus radiodurans genome. This bacterium showed presence of SOS-box like consensus sequence in the upstream sequences of 3166 genes with >60% motif score similarity percentage (MSSP on both strands. Attempts to identify LexA-binding sites and the composition of the putative SOS regulon in D. radiodurans have been unsuccessful so far. To resolve the problem we performed theoretical analysis with modifications on reported data set of genes related to DNA repair (61 genes, stress response (145 genes and some unusual predicted operons (21 clusters. Expression of some of the predicted SOS-box regulated operon members then was examined through the previously reported microarray data which confirm the expression of only single predicted operon i.e. DRB0143 (AAA superfamily NTPase related to 5-methylcytosine specific restriction enzyme subunit McrB and DRB0144 (homolog of the McrC subunit of the McrBC restriction modification system. The methodology involved weight matrix construction through CONSENSUS algorithm using information of conserved upstream sequences of eight known genes including dinB, tagC, lexA, recA, uvrB, yneA of B. subtilis while lexA and recA of D. radiodurans through phylogenetic footprinting method and later detection of similar conserved SOS-box like LexA binding motifs through both RSAT & PoSSuMsearch programs. The resultant DNA consensus sequence had highly conserved 14 bp SOS

  8. Motif signatures of transcribed enhancers

    KAUST Repository

    Kleftogiannis, Dimitrios

    2017-09-14

    In mammalian cells, transcribed enhancers (TrEn) play important roles in the initiation of gene expression and maintenance of gene expression levels in spatiotemporal manner. One of the most challenging questions in biology today is how the genomic characteristics of enhancers relate to enhancer activities. This is particularly critical, as several recent studies have linked enhancer sequence motifs to specific functional roles. To date, only a limited number of enhancer sequence characteristics have been investigated, leaving space for exploring the enhancers genomic code in a more systematic way. To address this problem, we developed a novel computational method, TELS, aimed at identifying predictive cell type/tissue specific motif signatures. We used TELS to compile a comprehensive catalog of motif signatures for all known TrEn identified by the FANTOM5 consortium across 112 human primary cells and tissues. Our results confirm that distinct cell type/tissue specific motif signatures characterize TrEn. These signatures allow discriminating successfully a) TrEn from random controls, proxy of non-enhancer activity, and b) cell type/tissue specific TrEn from enhancers expressed and transcribed in different cell types/tissues. TELS codes and datasets are publicly available at http://www.cbrc.kaust.edu.sa/TELS.

  9. Sequence analysis of the L protein of the Ebola 2014 outbreak: Insight into conserved regions and mutations.

    Science.gov (United States)

    Ayub, Gohar; Waheed, Yasir

    2016-06-01

    The 2014 Ebola outbreak was one of the largest that have occurred; it started in Guinea and spread to Nigeria, Liberia and Sierra Leone. Phylogenetic analysis of the current virus species indicated that this outbreak is the result of a divergent lineage of the Zaire ebolavirus. The L protein of Ebola virus (EBOV) is the catalytic subunit of the RNA‑dependent RNA polymerase complex, which, with VP35, is key for the replication and transcription of viral RNA. Earlier sequence analysis demonstrated that the L protein of all non‑segmented negative‑sense (NNS) RNA viruses consists of six domains containing conserved functional motifs. The aim of the present study was to analyze the presence of these motifs in 2014 EBOV isolates, highlight their function and how they may contribute to the overall pathogenicity of the isolates. For this purpose, 81 2014 EBOV L protein sequences were aligned with 475 other NNS RNA viruses, including Paramyxoviridae and Rhabdoviridae viruses. Phylogenetic analysis of all EBOV outbreak L protein sequences was also performed. Analysis of the amino acid substitutions in the 2014 EBOV outbreak was conducted using sequence analysis. The alignment demonstrated the presence of previously conserved motifs in the 2014 EBOV isolates and novel residues. Notably, all the mutations identified in the 2014 EBOV isolates were tolerant, they were pathogenic with certain examples occurring within previously determined functional conserved motifs, possibly altering viral pathogenicity, replication and virulence. The phylogenetic analysis demonstrated that all sequences with the exception of the 2014 EBOV sequences were clustered together. The 2014 EBOV outbreak has acquired a great number of mutations, which may explain the reasons behind this unprecedented outbreak. Certain residues critical to the function of the polymerase remain conserved and may be targets for the development of antiviral therapeutic agents.

  10. Asthma and COPD in cystic fibrosis intron-8 5T carriers. A population-based study

    DEFF Research Database (Denmark)

    Dahl, Morten; Tybjaerg-Hansen, Anne; Lange, Peter

    2005-01-01

    Carriers of cystic fibrosis intron-8 5T alleles with high exon-9 skipping could have increased annual lung function decline and increased risk for asthma or chronic obstructive pulmonary disease (COPD).......Carriers of cystic fibrosis intron-8 5T alleles with high exon-9 skipping could have increased annual lung function decline and increased risk for asthma or chronic obstructive pulmonary disease (COPD)....

  11. An intronic deletion in the PROM1 gene leads to autosomal recessive cone-rod dystrophy.

    Science.gov (United States)

    Eidinger, Osnat; Leibu, Rina; Newman, Hadas; Rizel, Leah; Perlman, Ido; Ben-Yosef, Tamar

    2015-01-01

    To investigate the genetic basis for autosomal recessive cone-rod dystrophy (CRD) in a consanguineous Israeli Jewish family. Patients underwent a detailed ophthalmic evaluation, including eye examination, visual field testing, optical coherence tomography (OCT), and electrophysiological tests, electroretinography (ERG) and visual evoked potential (VEP). Genome-wide homozygosity mapping using a single nucleotide polymorphism (SNP) array was performed to identify homozygous regions shared among two of the affected individuals. Mutation screening of the underlying gene was performed with direct sequencing. In silico and in vitro analyses were used to predict the effect of the identified mutation on splicing. The affected family members are three siblings who have various degrees of progressive visual deterioration, glare, color vision abnormalities, and night vision difficulties. Visual field tests revealed central scotomas of different extension. Cone and rod ERG responses were reduced, with cones more severely affected. Homozygosity mapping revealed several homozygous intervals shared among two of the affected individuals. One included the PROM1 gene. Sequence analysis of the 26 coding exons of PROM1 in one affected individual revealed no mutations in the coding sequence or in intronic splice sites. However, in intron 21, proximate to the intron-exon junction, we observed a homozygous 10 bp deletion between positions -26 and -17 (c.2281-26_-17del). The deletion was linked to a known SNP, c.2281-6C>G. The deletion cosegregated with the disease in the family, and was not detected in public databases or in 101 ethnically-matched control individuals. In silico analysis predicted that this deletion would lead to altered intron 21 splicing. Bioinformatic analysis predicted that a recognition site for the SRSF2 splicing factor is located within the deleted sequence. The in vitro splicing assay demonstrated that c.2281-26_-17del leads to complete exon 22 skipping. A novel

  12. Complex exon-intron marking by histone modifications is not determined solely by nucleosome distribution.

    Directory of Open Access Journals (Sweden)

    Pawandeep Dhami

    2010-08-01

    Full Text Available It has recently been shown that nucleosome distribution, histone modifications and RNA polymerase II (Pol II occupancy show preferential association with exons ("exon-intron marking", linking chromatin structure and function to co-transcriptional splicing in a variety of eukaryotes. Previous ChIP-sequencing studies suggested that these marking patterns reflect the nucleosomal landscape. By analyzing ChIP-chip datasets across the human genome in three cell types, we have found that this marking system is far more complex than previously observed. We show here that a range of histone modifications and Pol II are preferentially associated with exons. However, there is noticeable cell-type specificity in the degree of exon marking by histone modifications and, surprisingly, this is also reflected in some histone modifications patterns showing biases towards introns. Exon-intron marking is laid down in the absence of transcription on silent genes, with some marking biases changing or becoming reversed for genes expressed at different levels. Furthermore, the relationship of this marking system with splicing is not simple, with only some histone modifications reflecting exon usage/inclusion, while others mirror patterns of exon exclusion. By examining nucleosomal distributions in all three cell types, we demonstrate that these histone modification patterns cannot solely be accounted for by differences in nucleosome levels between exons and introns. In addition, because of inherent differences between ChIP-chip array and ChIP-sequencing approaches, these platforms report different nucleosome distribution patterns across the human genome. Our findings confound existing views and point to active cellular mechanisms which dynamically regulate histone modification levels and account for exon-intron marking. We believe that these histone modification patterns provide links between chromatin accessibility, Pol II movement and co-transcriptional splicing.

  13. Conserved Nonexonic Elements: A Novel Class of Marker for Phylogenomics.

    Science.gov (United States)

    Edwards, Scott V; Cloutier, Alison; Baker, Allan J

    2017-11-01

    Noncoding markers have a particular appeal as tools for phylogenomic analysis because, at least in vertebrates, they appear less subject to strong variation in GC content among lineages. Thus far, ultraconserved elements (UCEs) and introns have been the most widely used noncoding markers. Here we analyze and study the evolutionary properties of a new type of noncoding marker, conserved nonexonic elements (CNEEs), which consists of noncoding elements that are estimated to evolve slower than the neutral rate across a set of species. Although they often include UCEs, CNEEs are distinct from UCEs because they are not ultraconserved, and, most importantly, the core region alone is analyzed, rather than both the core and its flanking regions. Using a data set of 16 birds plus an alligator outgroup, and ∼3600-∼3800 loci per marker type, we found that although CNEEs were less variable than bioinformatically derived UCEs or introns and in some cases exhibited a slower approach to branch resolution as determined by phylogenomic subsampling, the quality of CNEE alignments was superior to those of the other markers, with fewer gaps and missing species. Phylogenetic resolution using coalescent approaches was comparable among the three marker types, with most nodes being fully and congruently resolved. Comparison of phylogenetic results across the three marker types indicated that one branch, the sister group to the passerine + falcon clade, was resolved differently and with moderate (>70%) bootstrap support between CNEEs and UCEs or introns. Overall, CNEEs appear to be promising as phylogenomic markers, yielding phylogenetic resolution as high as for UCEs and introns but with fewer gaps, less ambiguity in alignments and with patterns of nucleotide substitution more consistent with the assumptions of commonly used methods of phylogenetic analysis. © The Author(s) 2017. Published by Oxford University Press on behalf of the Systematic Biologists.

  14. Modulation of mdm2 pre-mRNA splicing by 9-aminoacridine-PNA (peptide nucleic acid conjugates targeting intron-exon junctions

    Directory of Open Access Journals (Sweden)

    Nielsen Peter E

    2010-06-01

    Full Text Available Abstract Background Modulation of pre-mRNA splicing by antisense molecules is a promising mechanism of action for gene therapeutic drugs. In this study, we have examined the potential of peptide nucleic acid (PNA 9-aminoacridine conjugates to modulate the pre-mRNA splicing of the mdm2 human cancer gene in JAR cells. Methods We screened 10 different 15 mer PNAs targeting intron2 at both the 5' - and the 3'-splice site for their effects on the splicing of mdm2 using RT-PCR analysis. We also tested a PNA (2512 targeting the 3'-splice site of intron3 with a complementarity of 4 bases to intron3 and 11 bases to exon4 for its splicing modulation effect. This PNA2512 was further tested for the effects on the mdm2 protein level as well as for inhibition of cell growth in combination with the DNA damaging agent camptothecin (CPT. Results We show that several of these PNAs effectively inhibit the splicing thereby producing a larger mRNA still containing intron2, while skipping of exon3 was not observed by any of these PNAs. The most effective PNA (PNA2406 targeting the 3'-splice site of intron2 had a complementarity of 4 bases to intron2 and 11 bases to exon3. PNA (2512 targeting the 3'-splice site of intron3 induced both splicing inhibition (intron3 skipping and skipping of exon4. Furthermore, treatment of JAR cells with this PNA resulted in a reduction in the level of MDM2 protein and a concomitant increase in the level of tumor suppressor p53. In addition, a combination of this PNA with CPT inhibited cell growth more than CPT alone. Conclusion We have identified several PNAs targeting the 5'- or 3'-splice sites in intron2 or the 3'-splice site of intron3 of mdm2 pre-mRNA which can inhibit splicing. Antisense targeting of splice junctions of mdm2 pre-mRNA may be a powerful method to evaluate the cellular function of MDM2 splice variants as well as a promising approach for discovery of mdm2 targeted anticancer drugs.

  15. Analysis of tomato plasma membrane H(+)-ATPase gene family suggests a mycorrhiza-mediated regulatory mechanism conserved in diverse plant species.

    Science.gov (United States)

    Liu, Junli; Liu, Jianjian; Chen, Aiqun; Ji, Minjie; Chen, Jiadong; Yang, Xiaofeng; Gu, Mian; Qu, Hongye; Xu, Guohua

    2016-10-01

    In plants, the plasma membrane H(+)-ATPase (HA) is considered to play a crucial role in regulating plant growth and respoding to environment stresses. Multiple paralogous genes encoding different isozymes of HA have been identified and characterized in several model plants, while limited information of the HA gene family is available to date for tomato. Here, we describe the molecular and expression features of eight HA-encoding genes (SlHA1-8) from tomato. All these genes are interrupted by multiple introns with conserved positions. SlHA1, 2, and 4 were widely expressed in all tissues, while SlHA5, 6, and 7 were almost only expressed in flowers. SlHA8, the transcripts of which were barely detectable under normal or nutrient-/salt-stress growth conditions, was strongly activated in arbuscular mycorrhizal (AM) fungal-colonized roots. Extreme lack of SlHA8 expression in M161, a mutant defective to AM fungal colonization, provided genetic evidence towards the dependence of its expression on AM symbiosis. A 1521-bp SlHA8 promoter could direct the GUS reporter expression specifically in colonized cells of transgenic tobacco, soybean, and rice mycorrhizal roots. Promoter deletion assay revealed a 223-bp promoter fragment of SlHA8 containing a variant of AM-specific cis-element MYCS (vMYCS) sufficient to confer the AM-induced activity. Targeted deletion of this motif in the corresponding promoter region causes complete abolishment of GUS staining in mycorrhizal roots. Together, these results lend cogent evidence towards the evolutionary conservation of a potential regulatory mechanism mediating the activation of AM-responsive HA genes in diverse mycorrhizal plant species.

  16. Positive evolutionary selection of an HD motif on Alzheimer precursor protein orthologues suggests a functional role.

    Science.gov (United States)

    Miklós, István; Zádori, Zoltán

    2012-02-01

    HD amino acid duplex has been found in the active center of many different enzymes. The dyad plays remarkably different roles in their catalytic processes that usually involve metal coordination. An HD motif is positioned directly on the amyloid beta fragment (Aβ) and on the carboxy-terminal region of the extracellular domain (CAED) of the human amyloid precursor protein (APP) and a taxonomically well defined group of APP orthologues (APPOs). In human Aβ HD is part of a presumed, RGD-like integrin-binding motif RHD; however, neither RHD nor RXD demonstrates reasonable conservation in APPOs. The sequences of CAEDs and the position of the HD are not particularly conserved either, yet we show with a novel statistical method using evolutionary modeling that the presence of HD on CAEDs cannot be the result of neutral evolutionary forces (pHD motif is underrepresented in the proteomes of all species of the animal kingdom. Position migration can be explained by high probability occurrence of multiple copies of HD on intermediate sequences, from which only one is kept by selective evolutionary forces, in a similar way as in the case of the "transcription binding site turnover." CAED of all APP orthologues and homologues are predicted to bind metal ions including Amyloid-like protein 1 (APLP1) and Amyloid-like protein 2 (APLP2). Our results suggest that HDs on the CAEDs are most probably key components of metal-binding domains, which facilitate and/or regulate inter- or intra-molecular interactions in a metal ion-dependent or metal ion concentration-dependent manner. The involvement of naturally occurring mutations of HD (Tottori (D7N) and English (H6R) mutations) in early onset Alzheimer's disease gives additional support to our finding that HD has an evolutionary preserved function on APPOs.

  17. Un gene con intrones en vez de exones / Envejecimiento Prematuro de la Piel

    Directory of Open Access Journals (Sweden)

    Tobías Mojica

    1996-04-01

    Full Text Available Un gene con intrones en vez de exones. La noción de que los genes son discontinuos (compuestos de exones e intrones en forma alterna y en cuya organización los exones representan regiones presentes, por medio del código genético en las proteínas, y los intrones nadie sabe todavía que representan produjo una cierta cantidad de desasosiego entre los genetistas mayores de edad, pero hoy día es ampliamente aceptada, con poco o ningún dolor, y se ha convertido en parte del cánon científico. / Envejecimiento Prematuro de la Piel. La exposición a largo plazo de la piel a la luz ultravioleta proveniente del sol resulta en daño al colágeno de la piel y a la elastina de la matriz extracelular; se cree que este daño es responsable de la apariencia típicamente arrugadita de la piel expuesta al sol por mucho tiempo (como en los vaqueros de los comerciales de la televisión.

  18. Developing a set of strong intronic promoters for robust metabolic engineering in oleaginous Rhodotorula (Rhodosporidium) yeast species.

    Science.gov (United States)

    Liu, Yanbin; Yap, Sihui Amy; Koh, Chong Mei John; Ji, Lianghui

    2016-11-25

    Red yeast species in the Rhodotorula/Rhodosporidium genus are outstanding producers of triacylglyceride and cell biomass. Metabolic engineering is expected to further enhance the productivity and versatility of these hosts for the production of biobased chemicals and fuels. Promoters with strong activity during oil-accumulation stage are critical tools for metabolic engineering of these oleaginous yeasts. The upstream DNA sequences of 6 genes involved in lipid biosynthesis or accumulation in Rhodotorula toruloides were studied by luciferase reporter assay. The promoter of perilipin/lipid droplet protein 1 gene (LDP1) displayed much stronger activity (4-11 folds) than that of glyceraldehyde-3-phosphate dehydrogenase gene (GPD1), one of the strongest promoters known in yeasts. Depending on the stage of cultivation, promoter of acetyl-CoA carboxylase gene (ACC1) and fatty acid synthase β subunit gene (FAS1) exhibited intermediate strength, displaying 50-160 and 20-90% levels of GPD1 promoter, respectively. Interestingly, introns significantly modulated promoter strength at high frequency. The incorporation of intron 1 and 2 of LDP1 (LDP1in promoter) enhanced its promoter activity by 1.6-3.0 folds. Similarly, the strength of ACC1 promoter was enhanced by 1.5-3.2 folds if containing intron 1. The intron 1 sequences of ACL1 and FAS1 also played significant regulatory roles. When driven by the intronic promoters of ACC1 and LDP1 (ACC1in and LDP1in promoter, respectively), the reporter gene expression were up-regulated by nitrogen starvation, independent of de novo oil biosynthesis and accumulation. As a proof of principle, overexpression of the endogenous acyl-CoA-dependent diacylglycerol acyltransferase 1 gene (DGA1) by LDP1in promoter was significantly more efficient than GPD1 promoter in enhancing lipid accumulation. Intronic sequences play an important role in regulating gene expression in R. toruloides. Three intronic promoters, LDP1in, ACC1in and FAS1in, are

  19. Triadic motifs in the dependence networks of virtual societies

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-06-01

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.

  20. Triadic motifs in the dependence networks of virtual societies.

    Science.gov (United States)

    Xie, Wen-Jie; Li, Ming-Xia; Jiang, Zhi-Qiang; Zhou, Wei-Xing

    2014-06-10

    In friendship networks, individuals have different numbers of friends, and the closeness or intimacy between an individual and her friends is heterogeneous. Using a statistical filtering method to identify relationships about who depends on whom, we construct dependence networks (which are directed) from weighted friendship networks of avatars in more than two hundred virtual societies of a massively multiplayer online role-playing game (MMORPG). We investigate the evolution of triadic motifs in dependence networks. Several metrics show that the virtual societies evolved through a transient stage in the first two to three weeks and reached a relatively stable stage. We find that the unidirectional loop motif (M9) is underrepresented and does not appear, open motifs are also underrepresented, while other close motifs are overrepresented. We also find that, for most motifs, the overall level difference of the three avatars in the same motif is significantly lower than average, whereas the sum of ranks is only slightly larger than average. Our findings show that avatars' social status plays an important role in the formation of triadic motifs.

  1. The PDZ-binding motif of Yes-associated protein is required for its co-activation of TEAD-mediated CTGF transcription and oncogenic cell transforming activity

    International Nuclear Information System (INIS)

    Shimomura, Tadanori; Miyamura, Norio; Hata, Shoji; Miura, Ryota; Hirayama, Jun; Nishina, Hiroshi

    2014-01-01

    Highlights: •Loss of the PDZ-binding motif inhibits constitutively active YAP (5SA)-induced oncogenic cell transformation. •The PDZ-binding motif of YAP promotes its nuclear localization in cultured cells and mouse liver. •Loss of the PDZ-binding motif inhibits YAP (5SA)-induced CTGF transcription in cultured cells and mouse liver. -- Abstract: YAP is a transcriptional co-activator that acts downstream of the Hippo signaling pathway and regulates multiple cellular processes, including proliferation. Hippo pathway-dependent phosphorylation of YAP negatively regulates its function. Conversely, attenuation of Hippo-mediated phosphorylation of YAP increases its ability to stimulate proliferation and eventually induces oncogenic transformation. The C-terminus of YAP contains a highly conserved PDZ-binding motif that regulates YAP’s functions in multiple ways. However, to date, the importance of the PDZ-binding motif to the oncogenic cell transforming activity of YAP has not been determined. In this study, we disrupted the PDZ-binding motif in the YAP (5SA) protein, in which the sites normally targeted by Hippo pathway-dependent phosphorylation are mutated. We found that loss of the PDZ-binding motif significantly inhibited the oncogenic transformation of cultured cells induced by YAP (5SA). In addition, the increased nuclear localization of YAP (5SA) and its enhanced activation of TEAD-dependent transcription of the cell proliferation gene CTGF were strongly reduced when the PDZ-binding motif was deleted. Similarly, in mouse liver, deletion of the PDZ-binding motif suppressed nuclear localization of YAP (5SA) and YAP (5SA)-induced CTGF expression. Taken together, our results indicate that the PDZ-binding motif of YAP is critical for YAP-mediated oncogenesis, and that this effect is mediated by YAP’s co-activation of TEAD-mediated CTGF transcription

  2. The PDZ-binding motif of Yes-associated protein is required for its co-activation of TEAD-mediated CTGF transcription and oncogenic cell transforming activity

    Energy Technology Data Exchange (ETDEWEB)

    Shimomura, Tadanori; Miyamura, Norio; Hata, Shoji; Miura, Ryota; Hirayama, Jun, E-mail: hirayama.dbio@mri.tmd.ac.jp; Nishina, Hiroshi, E-mail: nishina.dbio@mri.tmd.ac.jp

    2014-01-17

    Highlights: •Loss of the PDZ-binding motif inhibits constitutively active YAP (5SA)-induced oncogenic cell transformation. •The PDZ-binding motif of YAP promotes its nuclear localization in cultured cells and mouse liver. •Loss of the PDZ-binding motif inhibits YAP (5SA)-induced CTGF transcription in cultured cells and mouse liver. -- Abstract: YAP is a transcriptional co-activator that acts downstream of the Hippo signaling pathway and regulates multiple cellular processes, including proliferation. Hippo pathway-dependent phosphorylation of YAP negatively regulates its function. Conversely, attenuation of Hippo-mediated phosphorylation of YAP increases its ability to stimulate proliferation and eventually induces oncogenic transformation. The C-terminus of YAP contains a highly conserved PDZ-binding motif that regulates YAP’s functions in multiple ways. However, to date, the importance of the PDZ-binding motif to the oncogenic cell transforming activity of YAP has not been determined. In this study, we disrupted the PDZ-binding motif in the YAP (5SA) protein, in which the sites normally targeted by Hippo pathway-dependent phosphorylation are mutated. We found that loss of the PDZ-binding motif significantly inhibited the oncogenic transformation of cultured cells induced by YAP (5SA). In addition, the increased nuclear localization of YAP (5SA) and its enhanced activation of TEAD-dependent transcription of the cell proliferation gene CTGF were strongly reduced when the PDZ-binding motif was deleted. Similarly, in mouse liver, deletion of the PDZ-binding motif suppressed nuclear localization of YAP (5SA) and YAP (5SA)-induced CTGF expression. Taken together, our results indicate that the PDZ-binding motif of YAP is critical for YAP-mediated oncogenesis, and that this effect is mediated by YAP’s co-activation of TEAD-mediated CTGF transcription.

  3. Direct AUC optimization of regulatory motifs.

    Science.gov (United States)

    Zhu, Lin; Zhang, Hong-Bo; Huang, De-Shuang

    2017-07-15

    The discovery of transcription factor binding site (TFBS) motifs is essential for untangling the complex mechanism of genetic variation under different developmental and environmental conditions. Among the huge amount of computational approaches for de novo identification of TFBS motifs, discriminative motif learning (DML) methods have been proven to be promising for harnessing the discovery power of accumulated huge amount of high-throughput binding data. However, they have to sacrifice accuracy for speed and could fail to fully utilize the information of the input sequences. We propose a novel algorithm called CDAUC for optimizing DML-learned motifs based on the area under the receiver-operating characteristic curve (AUC) criterion, which has been widely used in the literature to evaluate the significance of extracted motifs. We show that when the considered AUC loss function is optimized in a coordinate-wise manner, the cost function of each resultant sub-problem is a piece-wise constant function, whose optimal value can be found exactly and efficiently. Further, a key step of each iteration of CDAUC can be efficiently solved as a computational geometry problem. Experimental results on real world high-throughput datasets illustrate that CDAUC outperforms competing methods for refining DML motifs, while being one order of magnitude faster. Meanwhile, preliminary results also show that CDAUC may also be useful for improving the interpretability of convolutional kernels generated by the emerging deep learning approaches for predicting TF sequences specificities. CDAUC is available at: https://drive.google.com/drive/folders/0BxOW5MtIZbJjNFpCeHlBVWJHeW8 . dshuang@tongji.edu.cn. Supplementary data are available at Bioinformatics online. © The Author 2017. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com

  4. Genomewide analysis of intronic microRNAs in rice and Arabidopsis

    Indian Academy of Sciences (India)

    2012-12-13

    Dec 13, 2012 ... Seventy-five miRNA stem– loop sequences for rice came from ... The A. thaliana genotype used in this study was Columbia. (Col-0) wildtype. ... sense strand of intronic regions of protein-coding gene, 40 were located in the ...

  5. DMINDA: an integrated web server for DNA motif identification and analyses.

    Science.gov (United States)

    Ma, Qin; Zhang, Hanyuan; Mao, Xizeng; Zhou, Chuan; Liu, Bingqiang; Chen, Xin; Xu, Ying

    2014-07-01

    DMINDA (DNA motif identification and analyses) is an integrated web server for DNA motif identification and analyses, which is accessible at http://csbl.bmb.uga.edu/DMINDA/. This web site is freely available to all users and there is no login requirement. This server provides a suite of cis-regulatory motif analysis functions on DNA sequences, which are important to elucidation of the mechanisms of transcriptional regulation: (i) de novo motif finding for a given set of promoter sequences along with statistical scores for the predicted motifs derived based on information extracted from a control set, (ii) scanning motif instances of a query motif in provided genomic sequences, (iii) motif comparison and clustering of identified motifs, and (iv) co-occurrence analyses of query motifs in given promoter sequences. The server is powered by a backend computer cluster with over 150 computing nodes, and is particularly useful for motif prediction and analyses in prokaryotic genomes. We believe that DMINDA, as a new and comprehensive web server for cis-regulatory motif finding and analyses, will benefit the genomic research community in general and prokaryotic genome researchers in particular. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  6. RMOD: a tool for regulatory motif detection in signaling network.

    Directory of Open Access Journals (Sweden)

    Jinki Kim

    Full Text Available Regulatory motifs are patterns of activation and inhibition that appear repeatedly in various signaling networks and that show specific regulatory properties. However, the network structures of regulatory motifs are highly diverse and complex, rendering their identification difficult. Here, we present a RMOD, a web-based system for the identification of regulatory motifs and their properties in signaling networks. RMOD finds various network structures of regulatory motifs by compressing the signaling network and detecting the compressed forms of regulatory motifs. To apply it into a large-scale signaling network, it adopts a new subgraph search algorithm using a novel data structure called path-tree, which is a tree structure composed of isomorphic graphs of query regulatory motifs. This algorithm was evaluated using various sizes of signaling networks generated from the integration of various human signaling pathways and it showed that the speed and scalability of this algorithm outperforms those of other algorithms. RMOD includes interactive analysis and auxiliary tools that make it possible to manipulate the whole processes from building signaling network and query regulatory motifs to analyzing regulatory motifs with graphical illustration and summarized descriptions. As a result, RMOD provides an integrated view of the regulatory motifs and mechanism underlying their regulatory motif activities within the signaling network. RMOD is freely accessible online at the following URL: http://pks.kaist.ac.kr/rmod.

  7. Learning to live together: mutualism between self-splicing introns and their hosts

    Directory of Open Access Journals (Sweden)

    Chalamcharla Venkata R

    2011-04-01

    Full Text Available Abstract Group I and II introns can be considered as molecular parasites that interrupt protein-coding and structural RNA genes in all domains of life. They function as self-splicing ribozymes and thereby limit the phenotypic costs associated with disruption of a host gene while they act as mobile DNA elements to promote their spread within and between genomes. Once considered purely selfish DNA elements, they now seem, in the light of recent work on the molecular mechanisms regulating bacterial and phage group I and II intron dynamics, to show evidence of co-evolution with their hosts. These previously underappreciated relationships serve the co-evolving entities particularly well in times of environmental stress.

  8. Novel Structural and Functional Motifs in cellulose synthase (CesA Genes of Bread Wheat (Triticum aestivum, L..

    Directory of Open Access Journals (Sweden)

    Simerjeet Kaur

    Full Text Available Cellulose is the primary determinant of mechanical strength in plant tissues. Late-season lodging is inversely related to the amount of cellulose in a unit length of the stem. Wheat is the most widely grown of all the crops globally, yet information on its CesA gene family is limited. We have identified 22 CesA genes from bread wheat, which include homoeologs from each of the three genomes, and named them as TaCesAXA, TaCesAXB or TaCesAXD, where X denotes the gene number and the last suffix stands for the respective genome. Sequence analyses of the CESA proteins from wheat and their orthologs from barley, maize, rice, and several dicot species (Arabidopsis, beet, cotton, poplar, potato, rose gum and soybean revealed motifs unique to monocots (Poales or dicots. Novel structural motifs CQIC and SVICEXWFA were identified, which distinguished the CESAs involved in the formation of primary and secondary cell wall (PCW and SCW in all the species. We also identified several new motifs specific to monocots or dicots. The conserved motifs identified in this study possibly play functional roles specific to PCW or SCW formation. The new insights from this study advance our knowledge about the structure, function and evolution of the CesA family in plants in general and wheat in particular. This information will be useful in improving culm strength to reduce lodging or alter wall composition to improve biofuel production.

  9. Phospholipid composition and a polybasic motif determine D6 PROTEIN KINASE polar association with the plasma membrane and tropic responses.

    Science.gov (United States)

    Barbosa, Inês C R; Shikata, Hiromasa; Zourelidou, Melina; Heilmann, Mareike; Heilmann, Ingo; Schwechheimer, Claus

    2016-12-15

    Polar transport of the phytohormone auxin through PIN-FORMED (PIN) auxin efflux carriers is essential for the spatiotemporal control of plant development. The Arabidopsis thaliana serine/threonine kinase D6 PROTEIN KINASE (D6PK) is polarly localized at the plasma membrane of many cells where it colocalizes with PINs and activates PIN-mediated auxin efflux. Here, we show that the association of D6PK with the basal plasma membrane and PINs is dependent on the phospholipid composition of the plasma membrane as well as on the phosphatidylinositol phosphate 5-kinases PIP5K1 and PIP5K2 in epidermis cells of the primary root. We further show that D6PK directly binds polyacidic phospholipids through a polybasic lysine-rich motif in the middle domain of the kinase. The lysine-rich motif is required for proper PIN3 phosphorylation and for auxin transport-dependent tropic growth. Polybasic motifs are also present at a conserved position in other D6PK-related kinases and required for membrane and phospholipid binding. Thus, phospholipid-dependent recruitment to membranes through polybasic motifs might not only be required for D6PK-mediated auxin transport but also other processes regulated by these, as yet, functionally uncharacterized kinases. © 2016. Published by The Company of Biologists Ltd.

  10. DNA motif alignment by evolving a population of Markov chains.

    Science.gov (United States)

    Bi, Chengpeng

    2009-01-30

    Deciphering cis-regulatory elements or de novo motif-finding in genomes still remains elusive although much algorithmic effort has been expended. The Markov chain Monte Carlo (MCMC) method such as Gibbs motif samplers has been widely employed to solve the de novo motif-finding problem through sequence local alignment. Nonetheless, the MCMC-based motif samplers still suffer from local maxima like EM. Therefore, as a prerequisite for finding good local alignments, these motif algorithms are often independently run a multitude of times, but without information exchange between different chains. Hence it would be worth a new algorithm design enabling such information exchange. This paper presents a novel motif-finding algorithm by evolving a population of Markov chains with information exchange (PMC), each of which is initialized as a random alignment and run by the Metropolis-Hastings sampler (MHS). It is progressively updated through a series of local alignments stochastically sampled. Explicitly, the PMC motif algorithm performs stochastic sampling as specified by a population-based proposal distribution rather than individual ones, and adaptively evolves the population as a whole towards a global maximum. The alignment information exchange is accomplished by taking advantage of the pooled motif site distributions. A distinct method for running multiple independent Markov chains (IMC) without information exchange, or dubbed as the IMC motif algorithm, is also devised to compare with its PMC counterpart. Experimental studies demonstrate that the performance could be improved if pooled information were used to run a population of motif samplers. The new PMC algorithm was able to improve the convergence and outperformed other popular algorithms tested using simulated and biological motif sequences.

  11. Length and GC content variability of introns among teleostean genomes in the light of the metabolic rate hypothesis.

    Science.gov (United States)

    Chaurasia, Ankita; Tarallo, Andrea; Bernà, Luisa; Yagi, Mitsuharu; Agnisola, Claudio; D'Onofrio, Giuseppe

    2014-01-01

    A comparative analysis of five teleostean genomes, namely zebrafish, medaka, three-spine stickleback, fugu and pufferfish was performed with the aim to highlight the nature of the forces driving both length and base composition of introns (i.e., bpi and GCi). An inter-genome approach using orthologous intronic sequences was carried out, analyzing independently both variables in pairwise comparisons. An average length shortening of introns was observed at increasing average GCi values. The result was not affected by masking transposable and repetitive elements harbored in the intronic sequences. The routine metabolic rate (mass specific temperature-corrected using the Boltzmann's factor) was measured for each species. A significant correlation held between average differences of metabolic rate, length and GC content, while environmental temperature of fish habitat was not correlated with bpi and GCi. Analyzing the concomitant effect of both variables, i.e., bpi and GCi, at increasing genomic GC content, a decrease of bpi and an increase of GCi was observed for the significant majority of the intronic sequences (from ∼ 40% to ∼ 90%, in each pairwise comparison). The opposite event, concomitant increase of bpi and decrease of GCi, was counter selected (from hypothesis that the metabolic rate plays a key role in shaping genome architecture and evolution of vertebrate genomes.

  12. DNA motif elucidation using belief propagation.

    Science.gov (United States)

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-09-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k=8∼10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/∼wkc/kmerHMM.

  13. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun; Chan, Tak-Ming; Peng, Chengbin; Li, Yue; Zhang, Zhaolei

    2013-01-01

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  14. DNA motif elucidation using belief propagation

    KAUST Repository

    Wong, Ka-Chun

    2013-06-29

    Protein-binding microarray (PBM) is a high-throughout platform that can measure the DNA-binding preference of a protein in a comprehensive and unbiased manner. A typical PBM experiment can measure binding signal intensities of a protein to all the possible DNA k-mers (k = 8 ?10); such comprehensive binding affinity data usually need to be reduced and represented as motif models before they can be further analyzed and applied. Since proteins can often bind to DNA in multiple modes, one of the major challenges is to decompose the comprehensive affinity data into multimodal motif representations. Here, we describe a new algorithm that uses Hidden Markov Models (HMMs) and can derive precise and multimodal motifs using belief propagations. We describe an HMM-based approach using belief propagations (kmerHMM), which accepts and preprocesses PBM probe raw data into median-binding intensities of individual k-mers. The k-mers are ranked and aligned for training an HMM as the underlying motif representation. Multiple motifs are then extracted from the HMM using belief propagations. Comparisons of kmerHMM with other leading methods on several data sets demonstrated its effectiveness and uniqueness. Especially, it achieved the best performance on more than half of the data sets. In addition, the multiple binding modes derived by kmerHMM are biologically meaningful and will be useful in interpreting other genome-wide data such as those generated from ChIP-seq. The executables and source codes are available at the authors\\' websites: e.g. http://www.cs.toronto.edu/?wkc/kmerHMM. 2013 The Author(s).

  15. Canonical Bcl-2 motifs of the Na+/K+ pump revealed by the BH3 mimetic chelerythrine: early signal transducers of apoptosis?

    Science.gov (United States)

    Lauf, Peter K; Heiny, Judith; Meller, Jarek; Lepera, Michael A; Koikov, Leonid; Alter, Gerald M; Brown, Thomas L; Adragna, Norma C

    2013-01-01

    Chelerythrine [CET], a protein kinase C [PKC] inhibitor, is a prop-apoptotic BH3-mimetic binding to BH1-like motifs of Bcl-2 proteins. CET action was examined on PKC phosphorylation-dependent membrane transporters (Na+/K+ pump/ATPase [NKP, NKA], Na+-K+-2Cl+ [NKCC] and K+-Cl- [KCC] cotransporters, and channel-supported K+ loss) in human lens epithelial cells [LECs]. K+ loss and K+ uptake, using Rb+ as congener, were measured by atomic absorption/emission spectrophotometry with NKP and NKCC inhibitors, and Cl- replacement by NO3ˉ to determine KCC. 3H-Ouabain binding was performed on a pig renal NKA in the presence and absence of CET. Bcl-2 protein and NKA sequences were aligned and motifs identified and mapped using PROSITE in conjunction with BLAST alignments and analysis of conservation and structural similarity based on prediction of secondary and crystal structures. CET inhibited NKP and NKCC by >90% (IC50 values ~35 and ~15 μM, respectively) without significant KCC activity change, and stimulated K+ loss by ~35% at 10-30 μM. Neither ATP levels nor phosphorylation of the NKA α1 subunit changed. 3H-ouabain was displaced from pig renal NKA only at 100 fold higher CET concentrations than the ligand. Sequence alignments of NKA with BH1- and BH3-like motifs containing pro-survival Bcl-2 and BclXl proteins showed more than one BH1-like motif within NKA for interaction with CET or with BH3 motifs. One NKA BH1-like motif (ARAAEILARDGPN) was also found in all P-type ATPases. Also, NKA possessed a second motif similar to that near the BH3 region of Bcl-2. Findings support the hypothesis that CET inhibits NKP by binding to BH1-like motifs and disrupting the α1 subunit catalytic activity through conformational changes. By interacting with Bcl-2 proteins through their complementary BH1- or BH3-like-motifs, NKP proteins may be sensors of normal and pathological cell functions, becoming important yet unrecognized signal transducers in the initial phases of apoptosis. CET

  16. The identification of functional motifs in temporal gene expression analysis

    Directory of Open Access Journals (Sweden)

    Michael G. Surette

    2005-01-01

    Full Text Available The identification of transcription factor binding sites is essential to the understanding of the regulation of gene expression and the reconstruction of genetic regulatory networks. The in silico identification of cis-regulatory motifs is challenging due to sequence variability and lack of sufficient data to generate consensus motifs that are of quantitative or even qualitative predictive value. To determine functional motifs in gene expression, we propose a strategy to adopt false discovery rate (FDR and estimate motif effects to evaluate combinatorial analysis of motif candidates and temporal gene expression data. The method decreases the number of predicted motifs, which can then be confirmed by genetic analysis. To assess the method we used simulated motif/expression data to evaluate parameters. We applied this approach to experimental data for a group of iron responsive genes in Salmonella typhimurium 14028S. The method identified known and potentially new ferric-uptake regulator (Fur binding sites. In addition, we identified uncharacterized functional motif candidates that correlated with specific patterns of expression. A SAS code for the simulation and analysis gene expression data is available from the first author upon request.

  17. KEJADIAN INDEL SIMULTAN PADA INTRON 7 GEN BRANCHED-CHAIN Α-KETOACID DEHYDROGENASE E1A (BCKDHA PADA SAPI MADURA

    Directory of Open Access Journals (Sweden)

    Asri Febriana

    2015-08-01

    Full Text Available Madura cattle is one of the Indonesian local cattle breeds derived from crossing between Zebu cattle (Bos indicus and banteng (Bos javanicus. Branched-chain α-ketoacid dehydrogenase (BCKDH is one of the main enzyme complexes in the inner mitochondrial membrane that metabolizes branched chain amino acid (BCAA, ie valine, leucine, and isoleucine. The diversity of the nucleotide sequences of the genes largely determine the efficiency of enzyme encoded. This paper aimed to determine the nucleotide variation contained in section intron 7, exon 8, and intron 8 genes BCKDHA on Madura cattle. This study was conducted on three Madura cattle that used as bull race (karapan, beauty contest (sonok, and beef cattle. The analysis showed that the variation in intron higher than occurred in the exon. Simultaneous indel found at base position 34 and 68 in sonok cattle. In addition, the C266T variant found in beef cattle. These variants do not cause significant changes in amino acids. There was no specific mutation in intron 7, exon 8, and intron 8 were found in Madura cattle designation. This indicated the absence of differentiation Madura cattle designation of selection pressure of BCKDHA gene.

  18. Biotechnological applications of mobile group II introns and their reverse transcriptases: gene targeting, RNA-seq, and non-coding RNA analysis.

    Science.gov (United States)

    Enyeart, Peter J; Mohr, Georg; Ellington, Andrew D; Lambowitz, Alan M

    2014-01-13

    Mobile group II introns are bacterial retrotransposons that combine the activities of an autocatalytic intron RNA (a ribozyme) and an intron-encoded reverse transcriptase to insert site-specifically into DNA. They recognize DNA target sites largely by base pairing of sequences within the intron RNA and achieve high DNA target specificity by using the ribozyme active site to couple correct base pairing to RNA-catalyzed intron integration. Algorithms have been developed to program the DNA target site specificity of several mobile group II introns, allowing them to be made into 'targetrons.' Targetrons function for gene targeting in a wide variety of bacteria and typically integrate at efficiencies high enough to be screened easily by colony PCR, without the need for selectable markers. Targetrons have found wide application in microbiological research, enabling gene targeting and genetic engineering of bacteria that had been intractable to other methods. Recently, a thermostable targetron has been developed for use in bacterial thermophiles, and new methods have been developed for using targetrons to position recombinase recognition sites, enabling large-scale genome-editing operations, such as deletions, inversions, insertions, and 'cut-and-pastes' (that is, translocation of large DNA segments), in a wide range of bacteria at high efficiency. Using targetrons in eukaryotes presents challenges due to the difficulties of nuclear localization and sub-optimal magnesium concentrations, although supplementation with magnesium can increase integration efficiency, and directed evolution is being employed to overcome these barriers. Finally, spurred by new methods for expressing group II intron reverse transcriptases that yield large amounts of highly active protein, thermostable group II intron reverse transcriptases from bacterial thermophiles are being used as research tools for a variety of applications, including qRT-PCR and next-generation RNA sequencing (RNA-seq). The

  19. Mutation analysis in Duchenne and Becker muscular dystrophy patients from Bulgaria shows a peculiar distribution of breakpoints by intron

    Energy Technology Data Exchange (ETDEWEB)

    Todorova, A.; Bronzova, J.; Kremensky, I. [Univ. Hospital of Obstetrics and Gynecology, Sofia (Bulgaria)] [and others

    1996-10-02

    For the first time in Bulgaria, a deletion/duplication screening was performed on a group of 84 unrelated Duchenne/Becker muscular dystrophy patients, and the breakpoint distribution in the dystrophin gene was analyzed. Intragenic deletions were detected in 67.8% of patients, and intragenic duplications in 2.4%. A peculiar distribution of deletion breakpoints was found. Only 13.2% of the deletion breakpoints fell in the {open_quotes}classical{close_quotes} hot spot in intron 44, whereas the majority (> 54%) were located within the segment encompassing introns 45-51, which includes intron 50, the richest in breakpoints (16%) in the Bulgarian sample. Comparison with data from Greece and Turkey points at the probable existence of a deletion hot spot within intron 50, which might be a characteristic of populations of the Balkan region. 17 refs., 2 figs.

  20. Use of a Fluorescent Aptamer RNA as an Exonic Sequence to Analyze Self-Splicing Ability of a Group I Intron from Structured RNAs

    Directory of Open Access Journals (Sweden)

    Airi Furukawa

    2016-11-01

    Full Text Available Group I self-splicing intron constitutes an important class of functional RNA molecules that can promote chemical transformation. Although the fundamental mechanism of the auto-excision from its precursor RNA has been established, convenient assay systems for its splicing activity are still useful for a further understanding of its detailed mechanism and of its application. Because some host RNA sequences, to which group I introns inserted form stable three-dimensional (3D structures, the effects of the 3D structures of exonic elements on the splicing efficiency of group I introns are important but not a fully investigated issue. We developed an assay system for group I intron self-splicing by employing a fluorescent aptamer RNA (spinach RNA as a model exonic sequence inserted by the Tetrahymena group I intron. We investigated self-splicing of the intron from spinach RNA, serving as a model exonic sequence with a 3D structure.

  1. Hybrid DNA i-motif: Aminoethylprolyl-PNA (pC5) enhance the stability of DNA (dC5) i-motif structure.

    Science.gov (United States)

    Gade, Chandrasekhar Reddy; Sharma, Nagendra K

    2017-12-15

    This report describes the synthesis of C-rich sequence, cytosine pentamer, of aep-PNA and its biophysical studies for the formation of hybrid DNA:aep-PNAi-motif structure with DNA cytosine pentamer (dC 5 ) under acidic pH conditions. Herein, the CD/UV/NMR/ESI-Mass studies strongly support the formation of stable hybrid DNA i-motif structure with aep-PNA even near acidic conditions. Hence aep-PNA C-rich sequence cytosine could be considered as potential DNA i-motif stabilizing agents in vivo conditions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  2. Verification of the MOTIF code version 3.0

    International Nuclear Information System (INIS)

    Chan, T.; Guvanasen, V.; Nakka, B.W.; Reid, J.A.K.; Scheier, N.W.; Stanchell, F.W.

    1996-12-01

    As part of the Canadian Nuclear Fuel Waste Management Program (CNFWMP), AECL has developed a three-dimensional finite-element code, MOTIF (Model Of Transport In Fractured/ porous media), for detailed modelling of groundwater flow, heat transport and solute transport in a fractured rock mass. The code solves the transient and steady-state equations of groundwater flow, solute (including one-species radionuclide) transport, and heat transport in variably saturated fractured/porous media. The initial development was completed in 1985 (Guvanasen 1985) and version 3.0 was completed in 1986. This version is documented in detail in Guvanasen and Chan (in preparation). This report describes a series of fourteen verification cases which has been used to test the numerical solution techniques and coding of MOTIF, as well as demonstrate some of the MOTIF analysis capabilities. For each case the MOTIF solution has been compared with a corresponding analytical or independently developed alternate numerical solution. Several of the verification cases were included in Level 1 of the International Hydrologic Code Intercomparison Project (HYDROCOIN). The MOTIF results for these cases were also described in the HYDROCOIN Secretariat's compilation and comparison of results submitted by the various project teams (Swedish Nuclear Power Inspectorate 1988). It is evident from the graphical comparisons presented that the MOTIF solutions for the fourteen verification cases are generally in excellent agreement with known analytical or numerical solutions obtained from independent sources. This series of verification studies has established the ability of the MOTIF finite-element code to accurately model the groundwater flow and solute and heat transport phenomena for which it is intended. (author). 20 refs., 14 tabs., 32 figs

  3. Length and GC content variability of introns among teleostean genomes in the light of the metabolic rate hypothesis.

    Directory of Open Access Journals (Sweden)

    Ankita Chaurasia

    Full Text Available A comparative analysis of five teleostean genomes, namely zebrafish, medaka, three-spine stickleback, fugu and pufferfish was performed with the aim to highlight the nature of the forces driving both length and base composition of introns (i.e., bpi and GCi. An inter-genome approach using orthologous intronic sequences was carried out, analyzing independently both variables in pairwise comparisons. An average length shortening of introns was observed at increasing average GCi values. The result was not affected by masking transposable and repetitive elements harbored in the intronic sequences. The routine metabolic rate (mass specific temperature-corrected using the Boltzmann's factor was measured for each species. A significant correlation held between average differences of metabolic rate, length and GC content, while environmental temperature of fish habitat was not correlated with bpi and GCi. Analyzing the concomitant effect of both variables, i.e., bpi and GCi, at increasing genomic GC content, a decrease of bpi and an increase of GCi was observed for the significant majority of the intronic sequences (from ∼ 40% to ∼ 90%, in each pairwise comparison. The opposite event, concomitant increase of bpi and decrease of GCi, was counter selected (from <1% to ∼ 10%, in each pairwise comparison. The results further support the hypothesis that the metabolic rate plays a key role in shaping genome architecture and evolution of vertebrate genomes.

  4. Intron retention regulates the expression of pectin methyl esterase inhibitor (Pmei) genes during wheat growth and development.

    Science.gov (United States)

    Rocchi, V; Janni, M; Bellincampi, D; Giardina, T; D'Ovidio, R

    2012-03-01

    Pectin is an important component of the plant cell wall and its remodelling occurs during normal plant growth or following stress responses. Pectin is secreted into the cell wall in a highly methyl-esterified form and subsequently de-methyl-esterified by pectin methyl esterase (PME), whose activity is controlled by the pectin methyl esterase inhibitor protein (PMEI). Cereal cell wall contains a low amount of pectin; nonetheless the level and pattern of pectin methyl esterification play a primary role during development or pathogen infection. Since few data are available on the role of PMEI in plant development and defence of cereal species, we isolated and characterised three Pmei genes (Tdpmei2.1, Tdpmei2.2 and Tdpmei3) and their encoded products in wheat. Sequence comparisons showed a low level of intra- and inter-specific sequence conservation of PMEIs. Tdpmei2.1 and Tdpmei2.2 share 94% identity at protein level, but only 20% identity with the product of Tdpmei3. All three Tdpmei genes code for functional inhibitors of plant PMEs and do not inhibit microbial PMEs or a plant invertase. RT-PCR analyses demonstrated, for the first time to our knowledge, that Pmei genes are regulated by intron retention. Processed and unprocessed transcripts of Tdpmei2.1 and Tdpmei2.2 accumulated in several organs, but anthers contained only mature transcripts. Tdpmei3 lacks introns and its transcript accumulated mainly in stem internodes. These findings suggest that products encoded by these Tdpmei genes control organ- or tissue-specific activity of specific PME isoforms in wheat. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.

  5. Naturally occuring nucleosome positioning signals in human exons and introns

    DEFF Research Database (Denmark)

    Baldi, Pierre; Brunak, Søren; Chauvin, Yves

    1996-01-01

    We describe the structural implications of a periodic pattern found in human exons and introns by hidden Markov models. We show that exons (besides the reading frame) have a specific sequential structure in the form of a pattern with triplet consensus non-T(A/T)G, and a minimal periodicity of rou...

  6. Regulatory elements of the floral homeotic gene AGAMOUS identified by phylogenetic footprinting and shadowing.

    Energy Technology Data Exchange (ETDEWEB)

    Hong, R. L., Hamaguchi, L., Busch, M. A., and Weigel, D.

    2003-06-01

    OAK-B135 In Arabidopsis thaliana, cis-regulatory sequences of the floral homeotic gene AGAMOUS (AG) are located in the second intron. This 3 kb intron contains binding sites for two direct activators of AG, LEAFY (LFY) and WUSCHEL (WUS), along with other putative regulatory elements. We have used phylogenetic footprinting and the related technique of phylogenetic shadowing to identify putative cis-regulatory elements in this intron. Among 29 Brassicaceae, several other motifs, but not the LFY and WUS binding sites previously identified, are largely invariant. Using reporter gene analyses, we tested six of these motifs and found that they are all functionally important for activity of AG regulatory sequences in A. thaliana. Although there is little obvious sequence similarity outside the Brassicaceae, the intron from cucumber AG has at least partial activity in A. thaliana. Our studies underscore the value of the comparative approach as a tool that complements gene-by-gene promoter dissection, but also highlight that sequence-based studies alone are insufficient for a complete identification of cis-regulatory sites.

  7. Permuting the PGF Signature Motif Blocks both Archaeosortase-Dependent C-Terminal Cleavage and Prenyl Lipid Attachment for the Haloferax volcanii S-Layer Glycoprotein.

    Science.gov (United States)

    Abdul Halim, Mohd Farid; Karch, Kelly R; Zhou, Yitian; Haft, Daniel H; Garcia, Benjamin A; Pohlschroder, Mechthild

    2015-12-28

    For years, the S-layer glycoprotein (SLG), the sole component of many archaeal cell walls, was thought to be anchored to the cell surface by a C-terminal transmembrane segment. Recently, however, we demonstrated that the Haloferax volcanii SLG C terminus is removed by an archaeosortase (ArtA), a novel peptidase. SLG, which was previously shown to be lipid modified, contains a C-terminal tripartite structure, including a highly conserved proline-glycine-phenylalanine (PGF) motif. Here, we demonstrate that ArtA does not process an SLG variant where the PGF motif is replaced with a PFG motif (slg(G796F,F797G)). Furthermore, using radiolabeling, we show that SLG lipid modification requires the PGF motif and is ArtA dependent, lending confirmation to the use of a novel C-terminal lipid-mediated protein-anchoring mechanism by prokaryotes. Similar to the case for the ΔartA strain, the growth, cellular morphology, and cell wall of the slg(G796F,F797G) strain, in which modifications of additional H. volcanii ArtA substrates should not be altered, are adversely affected, demonstrating the importance of these posttranslational SLG modifications. Our data suggest that ArtA is either directly or indirectly involved in a novel proteolysis-coupled, covalent lipid-mediated anchoring mechanism. Given that archaeosortase homologs are encoded by a broad range of prokaryotes, it is likely that this anchoring mechanism is widely conserved. Prokaryotic proteins bound to cell surfaces through intercalation, covalent attachment, or protein-protein interactions play critical roles in essential cellular processes. Unfortunately, the molecular mechanisms that anchor proteins to archaeal cell surfaces remain poorly characterized. Here, using the archaeon H. volcanii as a model system, we report the first in vivo studies of a novel protein-anchoring pathway involving lipid modification of a peptidase-processed C terminus. Our findings not only yield important insights into poorly understood

  8. An experimental test of a fundamental food web motif.

    Science.gov (United States)

    Rip, Jason M K; McCann, Kevin S; Lynn, Denis H; Fawcett, Sonia

    2010-06-07

    Large-scale changes to the world's ecosystem are resulting in the deterioration of biostructure-the complex web of species interactions that make up ecological communities. A difficult, yet crucial task is to identify food web structures, or food web motifs, that are the building blocks of this baroque network of interactions. Once identified, these food web motifs can then be examined through experiments and theory to provide mechanistic explanations for how structure governs ecosystem stability. Here, we synthesize recent ecological research to show that generalist consumers coupling resources with different interaction strengths, is one such motif. This motif amazingly occurs across an enormous range of spatial scales, and so acts to distribute coupled weak and strong interactions throughout food webs. We then perform an experiment that illustrates the importance of this motif to ecological stability. We find that weak interactions coupled to strong interactions by generalist consumers dampen strong interaction strengths and increase community stability. This study takes a critical step by isolating a common food web motif and through clear, experimental manipulation, identifies the fundamental stabilizing consequences of this structure for ecological communities.

  9. Highly scalable Ab initio genomic motif identification

    KAUST Repository

    Marchand, Benoit; Bajic, Vladimir B.; Kaushik, Dinesh

    2011-01-01

    We present results of scaling an ab initio motif family identification system, Dragon Motif Finder (DMF), to 65,536 processor cores of IBM Blue Gene/P. DMF seeks groups of mutually similar polynucleotide patterns within a set of genomic sequences and builds various motif families from them. Such information is of relevance to many problems in life sciences. Prior attempts to scale such ab initio motif-finding algorithms achieved limited success. We solve the scalability issues using a combination of mixed-mode MPI-OpenMP parallel programming, master-slave work assignment, multi-level workload distribution, multi-level MPI collectives, and serial optimizations. While the scalability of our algorithm was excellent (94% parallel efficiency on 65,536 cores relative to 256 cores on a modest-size problem), the final speedup with respect to the original serial code exceeded 250,000 when serial optimizations are included. This enabled us to carry out many large-scale ab initio motiffinding simulations in a few hours while the original serial code would have needed decades of execution time. Copyright 2011 ACM.

  10. Mechanisms of zero-lag synchronization in cortical motifs.

    Directory of Open Access Journals (Sweden)

    Leonardo L Gollo

    2014-04-01

    Full Text Available Zero-lag synchronization between distant cortical areas has been observed in a diversity of experimental data sets and between many different regions of the brain. Several computational mechanisms have been proposed to account for such isochronous synchronization in the presence of long conduction delays: Of these, the phenomenon of "dynamical relaying"--a mechanism that relies on a specific network motif--has proven to be the most robust with respect to parameter mismatch and system noise. Surprisingly, despite a contrary belief in the community, the common driving motif is an unreliable means of establishing zero-lag synchrony. Although dynamical relaying has been validated in empirical and computational studies, the deeper dynamical mechanisms and comparison to dynamics on other motifs is lacking. By systematically comparing synchronization on a variety of small motifs, we establish that the presence of a single reciprocally connected pair--a "resonance pair"--plays a crucial role in disambiguating those motifs that foster zero-lag synchrony in the presence of conduction delays (such as dynamical relaying from those that do not (such as the common driving triad. Remarkably, minor structural changes to the common driving motif that incorporate a reciprocal pair recover robust zero-lag synchrony. The findings are observed in computational models of spiking neurons, populations of spiking neurons and neural mass models, and arise whether the oscillatory systems are periodic, chaotic, noise-free or driven by stochastic inputs. The influence of the resonance pair is also robust to parameter mismatch and asymmetrical time delays amongst the elements of the motif. We call this manner of facilitating zero-lag synchrony resonance-induced synchronization, outline the conditions for its occurrence, and propose that it may be a general mechanism to promote zero-lag synchrony in the brain.

  11. Site-specific, insertional inactivation of incA in Chlamydia trachomatis using a group II intron.

    Science.gov (United States)

    Johnson, Cayla M; Fisher, Derek J

    2013-01-01

    Chlamydia trachomatis is an obligate, intracellular bacterial pathogen that has until more recently remained recalcitrant to genetic manipulation. However, the field still remains hindered by the absence of tools to create selectable, targeted chromosomal mutations. Previous work with mobile group II introns demonstrated that they can be retargeted by altering DNA sequences within the intron's substrate recognition region to create site-specific gene insertions. This platform (marketed as TargeTron™, Sigma) has been successfully employed in a variety of bacteria. We subsequently modified TargeTron™ for use in C. trachomatis and as proof of principle used our system to insertionally inactivate incA, a chromosomal gene encoding a protein required for homotypic fusion of chlamydial inclusions. C. trachomatis incA::GII(bla) mutants were selected with ampicillin and plaque purified clones were then isolated for genotypic and phenotypic analysis. PCR, Southern blotting, and DNA sequencing verified proper GII(bla) insertion, while continuous passaging in the absence of selection demonstrated that the insertion was stable. As seen with naturally occurring IncA(-) mutants, light and immunofluorescence microscopy confirmed the presence of non-fusogenic inclusions in cells infected with the incA::GII(bla) mutants at a multiplicity of infection greater than one. Lack of IncA production by mutant clones was further confirmed by Western blotting. Ultimately, the ease of retargeting the intron, ability to select for mutants, and intron stability in the absence of selection makes this method a powerful addition to the growing chlamydial molecular toolbox.

  12. Insights into the evolution and diversification of the AT-hook Motif Nuclear Localized gene family in land plants.

    Science.gov (United States)

    Zhao, Jianfei; Favero, David S; Qiu, Jiwen; Roalson, Eric H; Neff, Michael M

    2014-10-14

    Members of the ancient land-plant-specific transcription factor AT-Hook Motif Nuclear Localized (AHL) gene family regulate various biological processes. However, the relationships among the AHL genes, as well as their evolutionary history, still remain unexplored. We analyzed over 500 AHL genes from 19 land plant species, ranging from the early diverging Physcomitrella patens and Selaginella to a variety of monocot and dicot flowering plants. We classified the AHL proteins into three types (Type-I/-II/-III) based on the number and composition of their functional domains, the AT-hook motif(s) and PPC domain. We further inferred their phylogenies via Bayesian inference analysis and predicted gene gain/loss events throughout their diversification. Our analyses suggested that the AHL gene family emerged in embryophytes and further evolved into two distinct clades, with Type-I AHLs forming one clade (Clade-A), and the other two types together diversifying in another (Clade-B). The two AHL clades likely diverged before the separation of Physcomitrella patens from the vascular plant lineage. In angiosperms, Clade-A AHLs expanded into 5 subfamilies; while, the ones in Clade-B expanded into 4 subfamilies. Examination of their expression patterns suggests that the AHLs within each clade share similar expression patterns with each other; however, AHLs in one monophyletic clade exhibit distinct expression patterns from the ones in the other clade. Over-expression of a Glycine max AHL PPC domain in Arabidopsis thaliana recapitulates the phenotype observed when over-expressing its Arabidopsis thaliana counterpart. This result suggests that the AHL genes from different land plant species may share conserved functions in regulating plant growth and development. Our study further suggests that such functional conservation may be due to conserved physical interactions among the PPC domains of AHL proteins. Our analyses reveal a possible evolutionary scenario for the AHL gene family

  13. Transduction motif analysis of gastric cancer based on a human signaling network

    Energy Technology Data Exchange (ETDEWEB)

    Liu, G.; Li, D.Z.; Jiang, C.S.; Wang, W. [Fuzhou General Hospital of Nanjing Command, Department of Gastroenterology, Fuzhou, China, Department of Gastroenterology, Fuzhou General Hospital of Nanjing Command, Fuzhou (China)

    2014-04-04

    To investigate signal regulation models of gastric cancer, databases and literature were used to construct the signaling network in humans. Topological characteristics of the network were analyzed by CytoScape. After marking gastric cancer-related genes extracted from the CancerResource, GeneRIF, and COSMIC databases, the FANMOD software was used for the mining of gastric cancer-related motifs in a network with three vertices. The significant motif difference method was adopted to identify significantly different motifs in the normal and cancer states. Finally, we conducted a series of analyses of the significantly different motifs, including gene ontology, function annotation of genes, and model classification. A human signaling network was constructed, with 1643 nodes and 5089 regulating interactions. The network was configured to have the characteristics of other biological networks. There were 57,942 motifs marked with gastric cancer-related genes out of a total of 69,492 motifs, and 264 motifs were selected as significantly different motifs by calculating the significant motif difference (SMD) scores. Genes in significantly different motifs were mainly enriched in functions associated with cancer genesis, such as regulation of cell death, amino acid phosphorylation of proteins, and intracellular signaling cascades. The top five significantly different motifs were mainly cascade and positive feedback types. Almost all genes in the five motifs were cancer related, including EPOR, MAPK14, BCL2L1, KRT18, PTPN6, CASP3, TGFBR2, AR, and CASP7. The development of cancer might be curbed by inhibiting signal transductions upstream and downstream of the selected motifs.

  14. Acidic and uncharged polar residues in the consensus motifs of the yeast Ca2+ transporter Gdt1p are required for calcium transport.

    Science.gov (United States)

    Colinet, Anne-Sophie; Thines, Louise; Deschamps, Antoine; Flémal, Gaëlle; Demaegd, Didier; Morsomme, Pierre

    2017-07-01

    The UPF0016 family is a recently identified group of poorly characterized membrane proteins whose function is conserved through evolution and that are defined by the presence of 1 or 2 copies of the E-φ-G-D-[KR]-[TS] consensus motif in their transmembrane domain. We showed that 2 members of this family, the human TMEM165 and the budding yeast Gdt1p, are functionally related and are likely to form a new group of Ca 2+ transporters. Mutations in TMEM165 have been demonstrated to cause a new type of rare human genetic diseases denominated as Congenital Disorders of Glycosylation. Using site-directed mutagenesis, we generated 17 mutations in the yeast Golgi-localized Ca 2+ transporter Gdt1p. Single alanine substitutions were targeted to the highly conserved consensus motifs, 4 acidic residues localized in the central cytosolic loop, and the arginine at position 71. The mutants were screened in a yeast strain devoid of both the endogenous Gdt1p exchanger and Pmr1p, the Ca 2+ -ATPase of the Golgi apparatus. We show here that acidic and polar uncharged residues of the consensus motifs play a crucial role in calcium tolerance and calcium transport activity and are therefore likely to be architectural components of the cation binding site of Gdt1p. Importantly, we confirm the essential role of the E53 residue whose mutation in humans triggers congenital disorders of glycosylation. © 2017 John Wiley & Sons Ltd.

  15. A Novel Heterozygous Intronic Mutation in the FBN1 Gene Contributes to FBN1 RNA Missplicing Events in the Marfan Syndrome

    Directory of Open Access Journals (Sweden)

    Mario Torrado

    2018-01-01

    Full Text Available Marfan syndrome (MFS is an autosomal dominantly inherited connective tissue disorder, mostly caused by mutations in the fibrillin-1 (FBN1 gene. We, by using targeted next-generation sequence analysis, identified a novel intronic FBN1 mutation (the c.2678-15C>A variant in a MFS patient with aortic dilatation. The computational predictions showed that the heterozygous c.2678-15C>A intronic variant might influence the splicing process by differentially affecting canonical versus cryptic splice site utilization within intron 22 of the FBN1 gene. RT-PCR and Western blot analyses, using FBN1 minigenes transfected into HeLa and COS-7 cells, revealed that the c.2678-15C>A variant disrupts normal splicing of intron 22 leading to aberrant 13-nt intron 22 inclusion, frameshift, and premature termination codon. Collectively, the results strongly suggest that the c.2678-15C>A variant could lead to haploinsufficiency of the FBN1 functional protein and structural connective tissue fragility in MFS complicated by aorta dilation, a finding that further expands on the genetic basis of aortic pathology.

  16. Characterizing Motif Dynamics of Electric Brain Activity Using Symbolic Analysis

    Directory of Open Access Journals (Sweden)

    Massimiliano Zanin

    2014-10-01

    Full Text Available Motifs are small recurring circuits of interactions which constitute the backbone of networked systems. Characterizing motif dynamics is therefore key to understanding the functioning of such systems. Here we propose a method to define and quantify the temporal variability and time scales of electroencephalogram (EEG motifs of resting brain activity. Given a triplet of EEG sensors, links between them are calculated by means of linear correlation; each pattern of links (i.e., each motif is then associated to a symbol, and its appearance frequency is analyzed by means of Shannon entropy. Our results show that each motif becomes observable with different coupling thresholds and evolves at its own time scale, with fronto-temporal sensors emerging at high thresholds and changing at fast time scales, and parietal ones at low thresholds and changing at slower rates. Finally, while motif dynamics differed across individuals, for each subject, it showed robustness across experimental conditions, indicating that it could represent an individual dynamical signature.

  17. Improved i-motif thermal stability by insertion of anthraquinone monomers

    DEFF Research Database (Denmark)

    Gouda, Alaa S; Amine, Mahasen S.; Pedersen, Erik Bjerregaard

    2017-01-01

    In order to gain insight into how to improve thermal stability of i-motifs when used in the context of biomedical and nanotechnological applications, novel anthraquinone-modified i-motifs were synthesized by insertion of 1,8-, 1,4-, 1,5- and 2,6-disubstituted anthraquinone monomers into the TAA...... loops of a 22mer cytosine-rich human telomeric DNA sequence. The influence of the four anthraquinone linkers on the i-motif thermal stability was investigated at 295 nm and pH 5.5. Anthraquinone monomers modulate the i-motif stability in a position-depending manner and the modulation also depends...... unlocked nucleic acid monomers or twisted intercalating nucleic acid. The 2,6-disubstituted anthraquinone linker replacing T10 enabled a significant increase of i-motif thermal melting by 8.2 °C. A substantial increase of 5.0 °C in i-motif thermal melting was recorded when both A6 and T16 were modified...

  18. Selection of functional 2A sequences within foot-and-mouth disease virus; requirements for the NPGP motif with a distinct codon bias.

    Science.gov (United States)

    Kjær, Jonas; Belsham, Graham J

    2018-01-01

    Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long), which induces a nonproteolytic, cotranslational "cleavage" at its own C terminus. A conserved feature among variants of 2A is the C-terminal motif N 16 P 17 G 18 /P 19 , where P 19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E 14 , S 15 , and N 16 within the 2A sequence of infectious FMDVs, but no variants at residues P 17 , G 18 , or P 19 have been identified. In this study, using highly degenerate primers, we analyzed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after two, three, or four passages. However, surprisingly, a clear codon preference for the wt nucleotide sequence encoding the NPGP motif within these viruses was observed. Indeed, the codons selected to code for P 17 and P 19 within this motif were distinct; thus the synonymous codons are not equivalent. © 2018 Kjær and Belsham; Published by Cold Spring Harbor Laboratory Press for the RNA Society.

  19. Interrupted thymidylate synthase gene of bacteriophages T2 and T6 and other potential self-splicing introns in the T-even bacteriophages

    International Nuclear Information System (INIS)

    Chu, F.K.; Maley, F.; Martinez, J.; Maley, G.F.

    1987-01-01

    Southern hybridization analyses of procaryotic DNA from Escherchia coli, λ bacteriophage, and T1 to T7 phages were carried out. The hybridization probes used consisted of DNA restriction fragments derived from the T4 phage intron-containing thymidylate synthase gene (td) and short synthetic oligodeoxynucleotides defining specific exon and intron regions of the gene. It was shown that intact as well as restricted DNA from the T-even phages hybridized not only to both T4 phage td intron- and exon-specific probes but also to probes defining the td 5' (exon I-intron) and 3' (intron-exon II) presplice junctions. These data strongly suggest that, analogous to the T4 phage, only the T2 and T6 phages among the procaryotes tested contain interrupted td genes. The td intervening sequence in each phage is roughly 1 kilobase pair (kb) in size and interrupts the td gene at a site analogous to that in the T4 phage. This was confirmed by data from Northern (RNA) hybridization analysis of td-specific in vitro transcripts of these phage DNAs. [α- 32 P]GTP in vitro labeling of total RNA from T4 phage-infected cells produced five species of labeled RNAs that were 1, 0.9, 0.83, 0.75, and 0.6 kb in size. Only the 1-, 0.9-, and 0.75-kb species were labeled in RNA from T2- or T6-infected cells. The commonly present 1-kb RNA is the excised td intron, which exists in both linear and circular forms in the respective T-even-phage-infected cells, while the 0.6-kb RNA unique to T4 may be the excised intron derived from the ribonucleotide reductase small subunit gene (nrdB) of the phage. The remaining labeled RNA species are likely candidates for other self-splicing introns

  20. A Contracted DNA Repeat in LHX3 Intron 5 Is Associated with Aberrant Splicing and Pituitary Dwarfism in German Shepherd Dogs

    Science.gov (United States)

    Voorbij, Annemarie M. W. Y.; van Steenbeek, Frank G.; Vos-Loohuis, Manon; Martens, Ellen E. C. P.; Hanson-Nilsson, Jeanette M.; van Oost, Bernard A.; Kooistra, Hans S.; Leegwater, Peter A.

    2011-01-01

    Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism. PMID:22132174

  1. Bioinformatic Analyses of Subgroup-A Members of the Wheat bZIP Transcription Factor Family and Functional Identification of TabZIP174 Involved in Drought Stress Response

    Directory of Open Access Journals (Sweden)

    Xueyin Li

    2016-11-01

    Full Text Available Extensive studies in Arabidopsis and rice have demonstrated that Subgroup-A members of the bZIP transcription factor family play important roles in plant responses to multiple abiotic stresses. Although common wheat (Triticum aestivum is one of the most widely cultivated and consumed food crops in the world, there are limited investigations into Subgroup A of the bZIP family in wheat. In this study, we performed bioinformatic analyses of the 41 Subgroup-A members of the wheat bZIP family. Phylogenetic and conserved motif analyses showed that most of the Subgroup-A bZIP proteins involved in abiotic stress responses of wheat, Arabidopsis and rice clustered in Clade A1 of the phylogenetic tree, and shared a majority of conserved motifs, suggesting the potential importance of Clade-A1 members in abiotic stress responses. Gene structure analysis showed that TabZIP genes with close phylogenetic relationships tended to possess similar exon-intron compositions, and the positions of introns in the hinge regions of the bZIP domains were highly conserved, whereas introns in the leucine zipper regions were at variable positions. Additionally, eleven groups of homologs and two groups of tandem paralogs were also identified in Subgroup A of the wheat bZIP family. Expression profiling analysis indicated that most Subgroup-A TabZIP genes were responsive to abscisic acid and various abiotic stress treatments. TabZIP27, TabZIP74, TabZIP138 and TabZIP174 proteins were localized in the nucleus of wheat protoplasts, whereas TabZIP9-GFP fusion protein was simultaneously present in the nucleus, cytoplasm and cell membrane. Transgenic Arabidopsis overexpressing TabZIP174 displayed increased seed germination rates and primary root lengths under drought treatments. Overexpression of TabZIP174 in transgenic Arabidopsis conferred enhanced drought tolerance, and transgenic plants exhibited lower water loss rates, higher survival rates, higher proline, soluble sugar and leaf

  2. Bioinformatic Analyses of Subgroup-A Members of the Wheat bZIP Transcription Factor Family and Functional Identification of TabZIP174 Involved in Drought Stress Response

    Science.gov (United States)

    Li, Xueyin; Feng, Biane; Zhang, Fengjie; Tang, Yimiao; Zhang, Liping; Ma, Lingjian; Zhao, Changping; Gao, Shiqing

    2016-01-01

    Extensive studies in Arabidopsis and rice have demonstrated that Subgroup-A members of the bZIP transcription factor family play important roles in plant responses to multiple abiotic stresses. Although common wheat (Triticum aestivum) is one of the most widely cultivated and consumed food crops in the world, there are limited investigations into Subgroup A of the bZIP family in wheat. In this study, we performed bioinformatic analyses of the 41 Subgroup-A members of the wheat bZIP family. Phylogenetic and conserved motif analyses showed that most of the Subgroup-A bZIP proteins involved in abiotic stress responses of wheat, Arabidopsis, and rice clustered in Clade A1 of the phylogenetic tree, and shared a majority of conserved motifs, suggesting the potential importance of Clade-A1 members in abiotic stress responses. Gene structure analysis showed that TabZIP genes with close phylogenetic relationships tended to possess similar exon–intron compositions, and the positions of introns in the hinge regions of the bZIP domains were highly conserved, whereas introns in the leucine zipper regions were at variable positions. Additionally, eleven groups of homologs and two groups of tandem paralogs were also identified in Subgroup A of the wheat bZIP family. Expression profiling analysis indicated that most Subgroup-A TabZIP genes were responsive to abscisic acid and various abiotic stress treatments. TabZIP27, TabZIP74, TabZIP138, and TabZIP174 proteins were localized in the nucleus of wheat protoplasts, whereas TabZIP9-GFP fusion protein was simultaneously present in the nucleus, cytoplasm, and cell membrane. Transgenic Arabidopsis overexpressing TabZIP174 displayed increased seed germination rates and primary root lengths under drought treatments. Overexpression of TabZIP174 in transgenic Arabidopsis conferred enhanced drought tolerance, and transgenic plants exhibited lower water loss rates, higher survival rates, higher proline, soluble sugar, and leaf chlorophyll

  3. Modulation of splicing of the preceding intron by antisense oligonucleotide complementary to intra-exon sequence deleted in dystrophin Kobe

    Energy Technology Data Exchange (ETDEWEB)

    Takeshima, Y.; Matuso, M.; Sakamoto, H.; Nishio, H. [Kobe Univ. School of Medicine and Science (Japan)

    1994-09-01

    Molecular analysis of dystrophin Kobe showed that exon 19 of the dystrophin gene bearing a 52 bp deletion was skipped during splicing, although the known consensus sequences at the 5{prime} and 3{prime} splice site of exon 19 were maintained. These data suggest that the deleted sequence of exon 19 may function as a cis-acting factor for exact splicing for the upstream intron. To investigate this potential role, an in vitro splicing system using dystrophin precursors was established. A two-exon precursor containing exon 18, truncated intron 18, and exon 19 was accurately spliced. However, splicing of intron 18 was dramatically inhibited when wild exon 19 was replaced with mutated exon 19. Even though the length of exon 19 was restored to normal by replacing the deleted sequence with other sequence, splicing of intron 18 was not fully reactivated. Characteristically, splicing of intron 18 was inactivated more markedly when the replaced sequence contained less polypurine stretches. These data suggested that modification of the exon sequence would result in a splicing abnormality. Antisense 31 mer 2`-O-methyl ribonucleotide was targeted against 5{prime} end of deleted region of exon 19 to modulate splicing of the mRNA precursor. Splicing of intron 18 was inhibited in a dose- and time-dependent manner. This is the first in vitro evidence to show splicing of dystrophin pre-mRNA can be managed by antisense oligonucleotides. These experiments represent an approach in which antisense oligonucleotides are used to restore the function of a defective dystrophin gene in Duchenne muscular dystrophy by inducing skipping of certain exons during splicing.

  4. Canonical Bcl-2 Motifs of the Na+/K+ Pump Revealed by the BH3 Mimetic Chelerythrine: Early Signal Transducers of Apoptosis?

    Directory of Open Access Journals (Sweden)

    Peter K. Lauf

    2013-02-01

    Full Text Available Background/Aims: Chelerythrine [CET], a protein kinase C [PKC] inhibitor, is a prop-apoptotic BH3-mimetic binding to BH1-like motifs of Bcl-2 proteins. CET action was examined on PKC phosphorylation-dependent membrane transporters (Na+/K+ pump/ATPase [NKP, NKA], Na+-K+-2Cl+ [NKCC] and K+-Cl- [KCC] cotransporters, and channel-supported K+ loss in human lens epithelial cells [LECs]. Methods: K+ loss and K+ uptake, using Rb+ as congener, were measured by atomic absorption/emission spectrophotometry with NKP and NKCC inhibitors, and Cl- replacement by NO3ˉ to determine KCC. 3H-Ouabain binding was performed on a pig renal NKA in the presence and absence of CET. Bcl-2 protein and NKA sequences were aligned and motifs identified and mapped using PROSITE in conjunction with BLAST alignments and analysis of conservation and structural similarity based on prediction of secondary and crystal structures. Results: CET inhibited NKP and NKCC by >90% (IC50 values ∼35 and ∼15 µM, respectively without significant KCC activity change, and stimulated K+ loss by ∼35% at 10-30 µM. Neither ATP levels nor phosphorylation of the NKA α1 subunit changed. 3H-ouabain was displaced from pig renal NKA only at 100 fold higher CET concentrations than the ligand. Sequence alignments of NKA with BH1- and BH3-like motifs containing pro-survival Bcl-2 and BclXl proteins showed more than one BH1-like motif within NKA for interaction with CET or with BH3 motifs. One NKA BH1-like motif (ARAAEILARDGPN was also found in all P-type ATPases. Also, NKA possessed a second motif similar to that near the BH3 region of Bcl-2. Conclusion: Findings support the hypothesis that CET inhibits NKP by binding to BH1-like motifs and disrupting the α1 subunit catalytic activity through conformational changes. By interacting with Bcl-2 proteins through their complementary BH1- or BH3-like-motifs, NKP proteins may be sensors of normal and pathological cell functions, becoming important yet

  5. Crystal structure and novel recognition motif of rho ADP-ribosylating C3 exoenzyme from Clostridium botulinum: structural insights for recognition specificity and catalysis.

    Science.gov (United States)

    Han, S; Arvai, A S; Clancy, S B; Tainer, J A

    2001-01-05

    Clostridium botulinum C3 exoenzyme inactivates the small GTP-binding protein family Rho by ADP-ribosylating asparagine 41, which depolymerizes the actin cytoskeleton. C3 thus represents a major family of the bacterial toxins that transfer the ADP-ribose moiety of NAD to specific amino acids in acceptor proteins to modify key biological activities in eukaryotic cells, including protein synthesis, differentiation, transformation, and intracellular signaling. The 1.7 A resolution C3 exoenzyme structure establishes the conserved features of the core NAD-binding beta-sandwich fold with other ADP-ribosylating toxins despite little sequence conservation. Importantly, the central core of the C3 exoenzyme structure is distinguished by the absence of an active site loop observed in many other ADP-ribosylating toxins. Unlike the ADP-ribosylating toxins that possess the active site loop near the central core, the C3 exoenzyme replaces the active site loop with an alpha-helix, alpha3. Moreover, structural and sequence similarities with the catalytic domain of vegetative insecticidal protein 2 (VIP2), an actin ADP-ribosyltransferase, unexpectedly implicates two adjacent, protruding turns, which join beta5 and beta6 of the toxin core fold, as a novel recognition specificity motif for this newly defined toxin family. Turn 1 evidently positions the solvent-exposed, aromatic side-chain of Phe209 to interact with the hydrophobic region of Rho adjacent to its GTP-binding site. Turn 2 evidently both places the Gln212 side-chain for hydrogen bonding to recognize Rho Asn41 for nucleophilic attack on the anomeric carbon of NAD ribose and holds the key Glu214 catalytic side-chain in the adjacent catalytic pocket. This proposed bipartite ADP-ribosylating toxin turn-turn (ARTT) motif places the VIP2 and C3 toxin classes into a single ARTT family characterized by analogous target protein recognition via turn 1 aromatic and turn 2 hydrogen-bonding side-chain moieties. Turn 2 centrally anchors

  6. Identification of a conserved B-cell epitope on the GapC protein of Streptococcus dysgalactiae.

    Science.gov (United States)

    Zhang, Limeng; Zhou, Xue; Fan, Ziyao; Tang, Wei; Chen, Liang; Dai, Jian; Wei, Yuhua; Zhang, Jianxin; Yang, Xuan; Yang, Xijing; Liu, Daolong; Yu, Liquan; Zhang, Hua; Wu, Zhijun; Yu, Yongzhong; Sun, Hunan; Cui, Yudong

    2015-01-01

    Streptococcus dysgalactiae (S. dysgalactia) GapC is a highly conserved surface dehydrogenase among the streptococcus spp., which is responsible for inducing protective antibody immune responses in animals. However, the B-cell epitope of S. dysgalactia GapC have not been well characterized. In this study, a monoclonal antibody 1F2 (mAb1F2) against S. dysgalactiae GapC was generated by the hybridoma technique and used to screen a phage-displayed 12-mer random peptide library (Ph.D.-12) for mapping the linear B-cell epitope. The mAb1F2 recognized phages displaying peptides with the consensus motif TRINDLT. Amino acid sequence of the motif exactly matched (30)TRINDLT(36) of the S. dysgalactia GapC. Subsequently, site-directed mutagenic analysis further demonstrated that residues R31, I32, N33, D34 and L35 formed the core of (30)TRINDLT(36), and this core motif was the minimal determinant of the B-cell epitope recognized by the mAb1F2. The epitope (30)TRINDLT(36) showed high homology among different streptococcus species. Overall, our findings characterized a conserved B-cell epitope, which will be useful for the further study of epitope-based vaccines. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Assessment of allelic diversity in intron-containing Mal d 1 genes and their association to apple allergenicity

    Directory of Open Access Journals (Sweden)

    Bolhaar Suzanne THP

    2008-11-01

    Full Text Available Abstract Background Mal d 1 is a major apple allergen causing food allergic symptoms of the oral allergy syndrome (OAS in birch-pollen sensitised patients. The Mal d 1 gene family is known to have at least 7 intron-containing and 11 intronless members that have been mapped in clusters on three linkage groups. In this study, the allelic diversity of the seven intron-containing Mal d 1 genes was assessed among a set of apple cultivars by sequencing or indirectly through pedigree genotyping. Protein variant constitutions were subsequently compared with Skin Prick Test (SPT responses to study the association of deduced protein variants with allergenicity in a set of 14 cultivars. Results From the seven intron-containing Mal d 1 genes investigated, Mal d 1.01 and Mal d 1.02 were highly conserved, as nine out of ten cultivars coded for the same protein variant, while only one cultivar coded for a second variant. Mal d 1.04, Mal d 1.05 and Mal d 1.06 A, B and C were more variable, coding for three to six different protein variants. Comparison of Mal d 1 allelic composition between the high-allergenic cultivar Golden Delicious and the low-allergenic cultivars Santana and Priscilla, which are linked in pedigree, showed an association between the protein variants coded by the Mal d 1.04 and -1.06A genes (both located on linkage group 16 with allergenicity. This association was confirmed in 10 other cultivars. In addition, Mal d 1.06A allele dosage effects associated with the degree of allergenicity based on prick to prick testing. Conversely, no associations were observed for the protein variants coded by the Mal d 1.01 (on linkage group 13, -1.02, -1.06B, -1.06C genes (all on linkage group 16, nor by the Mal d 1.05 gene (on linkage group 6. Conclusion Protein variant compositions of Mal d 1.04 and -1.06A and, in case of Mal d 1.06A, allele doses are associated with the differences in allergenicity among fourteen apple cultivars. This information

  8. Positional bias of general and tissue-specific regulatory motifs in mouse gene promoters

    Directory of Open Access Journals (Sweden)

    Farré Domènec

    2007-12-01

    Full Text Available Abstract Background The arrangement of regulatory motifs in gene promoters, or promoter architecture, is the result of mutation and selection processes that have operated over many millions of years. In mammals, tissue-specific transcriptional regulation is related to the presence of specific protein-interacting DNA motifs in gene promoters. However, little is known about the relative location and spacing of these motifs. To fill this gap, we have performed a systematic search for motifs that show significant bias at specific promoter locations in a large collection of housekeeping and tissue-specific genes. Results We observe that promoters driving housekeeping gene expression are enriched in particular motifs with strong positional bias, such as YY1, which are of little relevance in promoters driving tissue-specific expression. We also identify a large number of motifs that show positional bias in genes expressed in a highly tissue-specific manner. They include well-known tissue-specific motifs, such as HNF1 and HNF4 motifs in liver, kidney and small intestine, or RFX motifs in testis, as well as many potentially novel regulatory motifs. Based on this analysis, we provide predictions for 559 tissue-specific motifs in mouse gene promoters. Conclusion The study shows that motif positional bias is an important feature of mammalian proximal promoters and that it affects both general and tissue-specific motifs. Motif positional constraints define very distinct promoter architectures depending on breadth of expression and type of tissue.

  9. First intron of nestin gene regulates its expression during C2C12 myoblast ifferentiation

    Institute of Scientific and Technical Information of China (English)

    Hua Zhong; Zhigang Jin; Yongfeng Chen; Ting Zhang; Wei Bian; Xing Cui; Naihe Jing

    2008-01-01

    Institutes for Biological Sciences, Chinese Academy of Sciences, Shanghai 200031, China Nestin is an intermediate filament protein expressed in neural progenitor cells and in developing skeletal muscle. Nestin has been widely used as a neural progenitor cell marker. It is well established that the specific expression of the nestin gene in neural progenitor cells is conferred by the neural-specific enhancer located in the second intron of the nestin gene. However, the transcriptional mechanism of nestin expression in developing muscle is still unclear. In this study, we identified a muscle cell-specific enhancer in the first intron of mouse nestin gene in mouse myoblast C2C12 cells.We localized the core enhancer activity to the 291-661 region of the first intron, and showed that the two E-boxes in the core enhancer region were important for enhancer activity in differentiating C2C12 cells. We also showed that MyoD protein was involved in the regulation of nestin expression in the myogenic differentiation of C2C12 cells.

  10. Computational analyses of synergism in small molecular network motifs.

    Directory of Open Access Journals (Sweden)

    Yili Zhang

    2014-03-01

    Full Text Available Cellular functions and responses to stimuli are controlled by complex regulatory networks that comprise a large diversity of molecular components and their interactions. However, achieving an intuitive understanding of the dynamical properties and responses to stimuli of these networks is hampered by their large scale and complexity. To address this issue, analyses of regulatory networks often focus on reduced models that depict distinct, reoccurring connectivity patterns referred to as motifs. Previous modeling studies have begun to characterize the dynamics of small motifs, and to describe ways in which variations in parameters affect their responses to stimuli. The present study investigates how variations in pairs of parameters affect responses in a series of ten common network motifs, identifying concurrent variations that act synergistically (or antagonistically to alter the responses of the motifs to stimuli. Synergism (or antagonism was quantified using degrees of nonlinear blending and additive synergism. Simulations identified concurrent variations that maximized synergism, and examined the ways in which it was affected by stimulus protocols and the architecture of a motif. Only a subset of architectures exhibited synergism following paired changes in parameters. The approach was then applied to a model describing interlocked feedback loops governing the synthesis of the CREB1 and CREB2 transcription factors. The effects of motifs on synergism for this biologically realistic model were consistent with those for the abstract models of single motifs. These results have implications for the rational design of combination drug therapies with the potential for synergistic interactions.

  11. Phyloproteomic Analysis of 11780 Six-Residue-Long Motifs Occurrences

    Directory of Open Access Journals (Sweden)

    O. V. Galzitskaya

    2015-01-01

    Full Text Available How is it possible to find good traits for phylogenetic reconstructions? Here, we present a new phyloproteomic criterion that is an occurrence of simple motifs which can be imprints of evolution history. We studied the occurrences of 11780 six-residue-long motifs consisting of two randomly located amino acids in 97 eukaryotic and 25 bacterial proteomes. For all eukaryotic proteomes, with the exception of the Amoebozoa, Stramenopiles, and Diplomonadida kingdoms, the number of proteins containing the motifs from the first group (one of the two amino acids occurs once at the terminal position made about 20%; in the case of motifs from the second (one of two amino acids occurs one time within the pattern and third (the two amino acids occur randomly groups, 30% and 50%, respectively. For bacterial proteomes, this relationship was 10%, 27%, and 63%, respectively. The matrices of correlation coefficients between numbers of proteins where a motif from the set of 11780 motifs appears at least once in 9 kingdoms and 5 phyla of bacteria were calculated. Among the correlation coefficients for eukaryotic proteomes, the correlation between the animal and fungi kingdoms (0.62 is higher than between fungi and plants (0.54. Our study provides support that animals and fungi are sibling kingdoms. Comparison of the frequencies of six-residue-long motifs in different proteomes allows obtaining phylogenetic relationships based on similarities between these frequencies: the Diplomonadida kingdoms are more close to Bacteria than to Eukaryota; Stramenopiles and Amoebozoa are more close to each other than to other kingdoms of Eukaryota.

  12. [Gene cloning and bioinformatics analysis of SABATH methyltransferase in Lonicera japonica var. chinensis].

    Science.gov (United States)

    Yu, Xiao-Dan; Jiang, Chao; Huang, Lu-Qi; Qin, Shuang-Shuang; Zeng, Xiang-Mei; Chen, Ping; Yuan, Yuan

    2013-08-01

    To clone SABATH methyltransferase (rLjSABATHMT) gene in Lonicera japonica var. chinensis, and compare the gene expression and intron sequence of SABATH methyltransferase orthologous in L. japonica with L. japonica var. chinensis. It provide a basis for gene regulate the formation of L. japonica floral scents. The cDNA and genome sequences of LjSABATHMT from L. japonica var. chinensis were cloned according to the gene fragments in cDNA library. The LjSABATHMT protein was characterized by bioinformatics analysis. SABATH family phylogenetic tree were built by MEGA 5.0. The transcripted level of SABATHMT orthologous were analyzed in different organs and different flower periods of L. japonica and L. japonica var. chinensis using RT-PCR analysis. Intron sequences of SABATHMT orthologous were also analyzied. The cDNA of LjSABATHMT was 1 251 bp, had a complete coding frame with 365 amino acids. The protein had the conservative SABATHMT domain, and phylogenetic tree showed that it may be a salicylic acid/benzoic acid methyltransferase. Higher expression of SABATH methyltransferase orthologous was found in flower. The intron sequence of L. japonica and L. japonica var. chinensis had rich polymorphism, and two SNP are unique genotype of L. japonica var. chinensis. The motif elements in two orthologous genes were significant differences. The intron difference of SABATH methyltransferase orthologous could be inducing to difference of gene expression between L. japonica and L. japonica var. chinensis. These results will provide important base on regulating active compounds of L. japonica.

  13. Alteration of introns in a hyaluronan synthase 1 (HAS1 minigene convert Pre-mRNA [corrected] splicing to the aberrant pattern in multiple myeloma (MM: MM patients harbor similar changes.

    Directory of Open Access Journals (Sweden)

    Jitra Kriangkum

    Full Text Available Aberrant pre-mRNA splice variants of hyaluronan synthase 1 (HAS1 have been identified in malignant cells from cancer patients. Bioinformatic analysis suggests that intronic sequence changes can underlie aberrant splicing. Deletions and mutations were introduced into HAS1 minigene constructs to identify regions that can influence aberrant intronic splicing, comparing the splicing pattern in transfectants with that in multiple myeloma (MM patients. Introduced genetic variations in introns 3 and 4 of HAS1 as shown here can promote aberrant splicing of the type detected in malignant cells from MM patients. HAS1Vd is a novel intronic splice variant first identified here. HAS1Vb, an intronic splice variant previously identified in patients, skips exon 4 and utilizes the same intron 4 alternative 3'splice site as HAS1Vd. For transfected constructs with unaltered introns 3 and 4, HAS1Vd transcripts are readily detectable, frequently to the exclusion of HAS1Vb. In contrast, in MM patients, HAS1Vb is more frequent than HAS1Vd. In the HAS1 minigene, combining deletion in intron 4 with mutations in intron 3 leads to a shift from HAS1Vd expression to HAS1Vb expression. The upregulation of aberrant splicing, exemplified here by the expression of HAS1Vb, is shown here to be influenced by multiple genetic changes in intronic sequences. For HAS1Vb, this includes enhanced exon 4 skipping and increased usage of alternative 3' splice sites. Thus, the combination of introduced mutations in HAS1 intron3 with introduced deletions in HAS1 intron 4 promoted a shift to an aberrant splicing pattern previously shown to be clinically significant. Most MM patients harbor genetic variations in intron 4, and as shown here, nearly half harbor recurrent mutations in HAS1 intron 3. Our work suggests that aberrant intronic HAS1 splicing in MM patients may rely on intronic HAS1 deletions and mutations that are frequent in MM patients but absent from healthy donors.

  14. Cloning, characterisation and comparative analysis of a starch synthase IV gene in wheat: functional and evolutionary implications

    Directory of Open Access Journals (Sweden)

    Broglie Karen E

    2008-09-01

    Full Text Available Abstract Background Starch is of great importance to humans as a food and biomaterial, and the amount and structure of starch made in plants is determined in part by starch synthase (SS activity. Five SS isoforms, SSI, II, III, IV and Granule Bound SSI, have been identified, each with a unique catalytic role in starch synthesis. The basic mode of action of SSs is known; however our knowledge of several aspects of SS enzymology at the structural and mechanistic level is incomplete. To gain a better understanding of the differences in SS sequences that underscore their specificity, the previously uncharacterised SSIVb from wheat was cloned and extensive bioinformatics analyses of this and other SSs sequences were done. Results The wheat SSIV cDNA is most similar to rice SSIVb with which it shows synteny and shares a similar exon-intron arrangement. The wheat SSIVb gene was preferentially expressed in leaf and was not regulated by a circadian clock. Phylogenetic analysis showed that in plants, SSIV is closely related to SSIII, while SSI, SSII and Granule Bound SSI clustered together and distinctions between the two groups can be made at the genetic level and included chromosomal location and intron conservation. Further, identified differences at the amino acid level in their glycosyltransferase domains, predicted secondary structures, global conformations and conserved residues might be indicative of intragroup functional associations. Conclusion Based on bioinformatics analysis of the catalytic region of 36 SSs and 3 glycogen synthases (GSs, it is suggested that the valine residue in the highly conserved K-X-G-G-L motif in SSIII and SSIV may be a determining feature of primer specificity of these SSs as compared to GBSSI, SSI and SSII. In GBSSI, the Ile485 residue may partially explain that enzyme's unique catalytic features. The flexible 380s Loop in the starch catalytic domain may be important in defining the specificity of action for each

  15. Convergent evolution and mimicry of protein linear motifs in host-pathogen interactions.

    Science.gov (United States)

    Chemes, Lucía Beatriz; de Prat-Gay, Gonzalo; Sánchez, Ignacio Enrique

    2015-06-01

    Pathogen linear motif mimics are highly evolvable elements that facilitate rewiring of host protein interaction networks. Host linear motifs and pathogen mimics differ in sequence, leading to thermodynamic and structural differences in the resulting protein-protein interactions. Moreover, the functional output of a mimic depends on the motif and domain repertoire of the pathogen protein. Regulatory evolution mediated by linear motifs can be understood by measuring evolutionary rates, quantifying positive and negative selection and performing phylogenetic reconstructions of linear motif natural history. Convergent evolution of linear motif mimics is widespread among unrelated proteins from viral, prokaryotic and eukaryotic pathogens and can also take place within individual protein phylogenies. Statistics, biochemistry and laboratory models of infection link pathogen linear motifs to phenotypic traits such as tropism, virulence and oncogenicity. In vitro evolution experiments and analysis of natural sequences suggest that changes in linear motif composition underlie pathogen adaptation to a changing environment. Copyright © 2015 Elsevier Ltd. All rights reserved.

  16. Selection of functional 2A sequences within foot-and-mouth disease virus; requirements for the NPGP motif with a distinct codon bias

    DEFF Research Database (Denmark)

    Kjær, Jonas; Belsham, Graham J.

    2018-01-01

    Foot-and-mouth disease virus (FMDV) has a positive-sense ssRNA genome including a single, large, open reading frame. Splitting of the encoded polyprotein at the 2A/2B junction is mediated by the 2A peptide (18 residues long) which induces a non-proteolytic, co-translational, "cleavage" at its own C......-terminus. A conserved feature among variants of 2A is the C-terminal motif N16P17G18/P19 where P19 is the first residue of 2B. It has been shown previously that certain amino acid substitutions can be tolerated at residues E14, S15 and N16 within the 2A sequence of infectious FMDVs but no variants at residues P17, G18...... or P19 have been identified. In this study, using highly degenerate primers, we analysed if any other residues can be present at each position of the NPG/P motif within infectious FMDV. No alternative forms of this motif were found to be encoded by rescued FMDVs after 2, 3 or 4 passages. However...

  17. Functional and comparative genomics analyses of pmp22 in medaka fish

    Directory of Open Access Journals (Sweden)

    Kawarabayasi Yutaka

    2009-06-01

    Full Text Available Abstract Background Pmp22, a member of the junction protein family Claudin/EMP/PMP22, plays an important role in myelin formation. Increase of pmp22 transcription causes peripheral neuropathy, Charcot-Marie-Tooth disease type1A (CMT1A. The pathophysiological phenotype of CMT1A is aberrant axonal myelination which induces a reduction in nerve conduction velocity (NCV. Several CMT1A model rodents have been established by overexpressing pmp22. Thus, it is thought that pmp22 expression must be tightly regulated for correct myelin formation in mammals. Interestingly, the myelin sheath is also present in other jawed vertebrates. The purpose of this study is to analyze the evolutionary conservation of the association between pmp22 transcription level and vertebrate myelin formation, and to find the conserved non-coding sequences for pmp22 regulation by comparative genomics analyses between jawed fishes and mammals. Results A transgenic pmp22 over-expression medaka fish line was established. The transgenic fish had approximately one fifth the peripheral NCV values of controls, and aberrant myelination of transgenic fish in the peripheral nerve system (PNS was observed. We successfully confirmed that medaka fish pmp22 has the same exon-intron structure as mammals, and identified some known conserved regulatory motifs. Furthermore, we found novel conserved sequences in the first intron and 3'UTR. Conclusion Medaka fish undergo abnormalities in the PNS when pmp22 transcription increases. This result indicates that an adequate pmp22 transcription level is necessary for correct myelination of jawed vertebrates. Comparison of pmp22 orthologs between distantly related species identifies evolutionary conserved sequences that contribute to precise regulation of pmp22 expression.

  18. Methods and statistics for combining motif match scores.

    Science.gov (United States)

    Bailey, T L; Gribskov, M

    1998-01-01

    Position-specific scoring matrices are useful for representing and searching for protein sequence motifs. A sequence family can often be described by a group of one or more motifs, and an effective search must combine the scores for matching a sequence to each of the motifs in the group. We describe three methods for combining match scores and estimating the statistical significance of the combined scores and evaluate the search quality (classification accuracy) and the accuracy of the estimate of statistical significance of each. The three methods are: 1) sum of scores, 2) sum of reduced variates, 3) product of score p-values. We show that method 3) is superior to the other two methods in both regards, and that combining motif scores indeed gives better search accuracy. The MAST sequence homology search algorithm utilizing the product of p-values scoring method is available for interactive use and downloading at URL http:/(/)www.sdsc.edu/MEME.

  19. MOCCS: Clarifying DNA-binding motif ambiguity using ChIP-Seq data.

    Science.gov (United States)

    Ozaki, Haruka; Iwasaki, Wataru

    2016-08-01

    As a key mechanism of gene regulation, transcription factors (TFs) bind to DNA by recognizing specific short sequence patterns that are called DNA-binding motifs. A single TF can accept ambiguity within its DNA-binding motifs, which comprise both canonical (typical) and non-canonical motifs. Clarification of such DNA-binding motif ambiguity is crucial for revealing gene regulatory networks and evaluating mutations in cis-regulatory elements. Although chromatin immunoprecipitation sequencing (ChIP-seq) now provides abundant data on the genomic sequences to which a given TF binds, existing motif discovery methods are unable to directly answer whether a given TF can bind to a specific DNA-binding motif. Here, we report a method for clarifying the DNA-binding motif ambiguity, MOCCS. Given ChIP-Seq data of any TF, MOCCS comprehensively analyzes and describes every k-mer to which that TF binds. Analysis of simulated datasets revealed that MOCCS is applicable to various ChIP-Seq datasets, requiring only a few minutes per dataset. Application to the ENCODE ChIP-Seq datasets proved that MOCCS directly evaluates whether a given TF binds to each DNA-binding motif, even if known position weight matrix models do not provide sufficient information on DNA-binding motif ambiguity. Furthermore, users are not required to provide numerous parameters or background genomic sequence models that are typically unavailable. MOCCS is implemented in Perl and R and is freely available via https://github.com/yuifu/moccs. By complementing existing motif-discovery software, MOCCS will contribute to the basic understanding of how the genome controls diverse cellular processes via DNA-protein interactions. Copyright © 2016 Elsevier Ltd. All rights reserved.

  20. A contracted DNA repeat in LHX3 intron 5 is associated with aberrant splicing and pituitary dwarfism in German shepherd dogs.

    Directory of Open Access Journals (Sweden)

    Annemarie M W Y Voorbij

    Full Text Available Dwarfism in German shepherd dogs is due to combined pituitary hormone deficiency of unknown genetic cause. We localized the recessively inherited defect by a genome wide approach to a region on chromosome 9 with a lod score of 9.8. The region contains LHX3, which codes for a transcription factor essential for pituitary development. Dwarfs have a deletion of one of six 7 bp repeats in intron 5 of LHX3, reducing the intron size to 68 bp. One dwarf was compound heterozygous for the deletion and an insertion of an asparagine residue in the DNA-binding homeodomain of LHX3, suggesting involvement of the gene in the disorder. An exon trapping assay indicated that the shortened intron is not spliced efficiently, probably because it is too small. We applied bisulfite conversion of cytosine to uracil in RNA followed by RT-PCR to analyze the splicing products. The aberrantly spliced RNA molecules resulted from either skipping of exon 5 or retention of intron 5. The same splicing defects were observed in cDNA derived from the pituitary of dwarfs. A survey of similarly mutated introns suggests that there is a minimal distance requirement between the splice donor and branch site of 50 nucleotides. In conclusion, a contraction of a DNA repeat in intron 5 of canine LHX3 leads to deficient splicing and is associated with pituitary dwarfism.

  1. The association between Interleukin (IL)-4 gene intron 3 VNTR polymorphism and alopecia areata (AA) in Turkish population.

    Science.gov (United States)

    Kalkan, Göknur; Karakus, Nevin; Baş, Yalçın; Takçı, Zennure; Ozuğuz, Pınar; Ateş, Omer; Yigit, Serbulent

    2013-09-25

    Alopecia areata (AA) is hypothesized to be an organ-specific autoimmune disease of hair follicles mediated by T cells. As immunological and genetic factors have been implicated in the pathogenesis of AA, the purpose of the present study was to investigate possible associations between the functional Interleukin (IL)-4 gene intron 3 VNTR polymorphism and AA susceptibility and disease progression in Turkish population. The study group consisted of 116 unrelated patients with AA and 125 unrelated healthy controls. Genomic DNA was isolated and IL-4 gene 70 bp VNTR polymorphism determined by using polymerase chain reaction (PCR) with specific primers. No association was observed between AA patients and controls according to genotype distribution (p=0.051). The allele distribution of IL-4 gene intron 3 VNTR polymorphism was statistically different between AA patients and control group (p=0.026). The frequency of P1 allele in patients was significantly higher than that in the control group. When the P2P2 genotype was compared with P1P2+P1P1 genotypes, a statistically significant difference was observed between patients and controls (p=0.036). Intron 3 VNTR polymorphism in the IL-4 gene was found to be associated with AA susceptibility in Turkish population. The results suggest that IL-4 VNTR polymorphism in the intron 3 region may be a risk factor for the development of AA among Turkish population. This is the first to report that intron 3 VNTR polymorphism in the IL-4 gene is associated with AA susceptibility. © 2013.

  2. EBNA-2 of herpesvirus papio diverges significantly from the type A and type B EBNA-2 proteins of Epstein-Barr virus but retains an efficient transactivation domain with a conserved hydrophobic motif.

    Science.gov (United States)

    Ling, P D; Ryon, J J; Hayward, S D

    1993-01-01

    EBNA-2 contributes to the establishment of Epstein-Barr virus (EBV) latency in B cells and to the resultant alterations in B-cell growth pattern by up-regulating expression from specific viral and cellular promoters. We have taken a comparative approach toward characterizing functional domains within EBNA-2. To this end, we have cloned and sequenced the EBNA-2 gene from the closely related baboon virus herpesvirus papio (HVP). All human EBV isolates have either a type A or type B EBNA-2 gene. However, the HVP EBNA-2 gene falls into neither the type A category nor the type B category, suggesting that the separation into these two subtypes may have been a recent evolutionary event. Comparison of the predicted amino acid sequences indicates 37% amino acid identity with EBV type A EBNA-2 and 35% amino acid identity with type B EBNA-2. To define the domains of EBNA-2 required for transcriptional activation, the DNA binding domain of the GAL4 protein was fused to overlapping segments of EBV EBNA-2. This approach identified a 40-amino-acid (40-aa) EBNA-2 activation domain located between aa 437 and 477. Transactivation ability was completely lost when the amino-terminal boundary of this domain was moved to aa 441, indicating that the motif at aa 437 to 440, Pro-Ile-Leu-Phe, contains residues critical for function. The aa 437 boundary identified in these experiments coincides precisely with a block of conserved sequences in HVP EBNA-2, and the comparable carboxy-terminal region of HVP EBNA-2 also functioned as a strong transcriptional activation domain when fused to the Gal4(1-147) protein. The EBV and HVP EBNA-2 activation domains share a mixed proline-rich, negatively charged character with a striking conservation of positionally equivalent hydrophobic residues. The importance of the individual amino acids making up the Pro-Ile-Leu-Phe motif was examined by mutagenesis. Any alteration of these residues was found to reduce transactivation efficiency, with changes at the

  3. Production of mouse monoclonal antibody against Streptococcus dysgalactiae GapC protein and mapping its conserved B-cell epitope.

    Science.gov (United States)

    Zhang, Limeng; Zhang, Hua; Fan, Ziyao; Zhou, Xue; Yu, Liquan; Sun, Hunan; Wu, Zhijun; Yu, Yongzhong; Song, Baifen; Ma, Jinzhu; Tong, Chunyu; Zhu, Zhanbo; Cui, Yudong

    2015-02-01

    Streptococcus dysgalactiae (S. dysgalactiae) GapC protein is a protective antigen that induces partial immunity against S. dysgalactiae infection in animals. To identify the conserved B-cell epitope of S. dysgalactiae GapC, a mouse monoclonal antibody 1E11 (mAb1E11) against GapC was generated and used to screen a phage-displayed 12-mer random peptide library (Ph.D.-12). Eleven positive clones recognized by mAb1E11 were identified, most of which matched the consensus motif TGFFAKK. Sequence of the motif exactly matched amino acids 97-103 of the S. dysgalactiae GapC. In addition, the epitope (97)TGFFAKK(103) showed high homology among different streptococcus species. Site-directed mutagenic analysis further confirmed that residues G98, F99, F100 and K103 formed the core of (97)TGFFAKK(103), and this core motif was the minimal determinant of the B-cell epitope recognized by the mAb1E11. Collectively, the identification of conserved B-cell epitope within S. dysgalactiae GapC highlights the possibility of developing the epitope-based vaccine. Copyright © 2014 Elsevier Ltd. All rights reserved.

  4. Genome-wide generation and use of informative intron-spanning and intron-length polymorphism markers for high-throughput genetic analysis in rice

    Science.gov (United States)

    Badoni, Saurabh; Das, Sweta; Sayal, Yogesh K.; Gopalakrishnan, S.; Singh, Ashok K.; Rao, Atmakuri R.; Agarwal, Pinky; Parida, Swarup K.; Tyagi, Akhilesh K.

    2016-01-01

    We developed genome-wide 84634 ISM (intron-spanning marker) and 16510 InDel-fragment length polymorphism-based ILP (intron-length polymorphism) markers from genes physically mapped on 12 rice chromosomes. These genic markers revealed much higher amplification-efficiency (80%) and polymorphic-potential (66%) among rice accessions even by a cost-effective agarose gel-based assay. A wider level of functional molecular diversity (17–79%) and well-defined precise admixed genetic structure was assayed by 3052 genome-wide markers in a structured population of indica, japonica, aromatic and wild rice. Six major grain weight QTLs (11.9–21.6% phenotypic variation explained) were mapped on five rice chromosomes of a high-density (inter-marker distance: 0.98 cM) genetic linkage map (IR 64 x Sonasal) anchored with 2785 known/candidate gene-derived ISM and ILP markers. The designing of multiple ISM and ILP markers (2 to 4 markers/gene) in an individual gene will broaden the user-preference to select suitable primer combination for efficient assaying of functional allelic variation/diversity and realistic estimation of differential gene expression profiles among rice accessions. The genomic information generated in our study is made publicly accessible through a user-friendly web-resource, “Oryza ISM-ILP marker” database. The known/candidate gene-derived ISM and ILP markers can be enormously deployed to identify functionally relevant trait-associated molecular tags by optimal-resource expenses, leading towards genomics-assisted crop improvement in rice. PMID:27032371

  5. Low-dimensional morphospace of topological motifs in human fMRI brain networks

    Directory of Open Access Journals (Sweden)

    Sarah E. Morgan

    2018-06-01

    Full Text Available We present a low-dimensional morphospace of fMRI brain networks, where axes are defined in a data-driven manner based on the network motifs. The morphospace allows us to identify the key variations in healthy fMRI networks in terms of their underlying motifs, and we observe that two principal components (PCs can account for 97% of the motif variability. The first PC of the motif distribution is correlated with efficiency and inversely correlated with transitivity. Hence this axis approximately conforms to the well-known economical small-world trade-off between integration and segregation in brain networks. Finally, we show that the economical clustering generative model proposed by Vértes et al. (2012 can approximately reproduce the motif morphospace of the real fMRI brain networks, in contrast to other generative models. Overall, the motif morphospace provides a powerful way to visualize the relationships between network properties and to investigate generative or constraining factors in the formation of complex human brain functional networks. Motifs have been described as the building blocks of complex networks. Meanwhile, a morphospace allows networks to be placed in a common space and can reveal the relationships between different network properties and elucidate the driving forces behind network topology. We combine the concepts of motifs and morphospaces to create the first motif morphospace of fMRI brain networks. Crucially, the morphospace axes are defined by the motifs, in a data-driven manner. We observe strong correlations between the networks’ positions in morphospace and their global topological properties, suggesting that motif morphospaces are a powerful way to capture the topology of networks in a low-dimensional space and to compare generative models of brain networks. Motif morphospaces could also be used to study other complex networks’ topologies.

  6. PISMA: A Visual Representation of Motif Distribution in DNA Sequences

    Directory of Open Access Journals (Sweden)

    Rogelio Alcántara-Silva

    2017-03-01

    Full Text Available Background: Because the graphical presentation and analysis of motif distribution can provide insights for experimental hypothesis, PISMA aims at identifying motifs on DNA sequences, counting and showing them graphically. The motif length ranges from 2 to 10 bases, and the DNA sequences range up to 10 kb. The motif distribution is shown as a bar-code–like, as a gene-map–like, and as a transcript scheme. Results: We obtained graphical schemes of the CpG site distribution from 91 human papillomavirus genomes. Also, we present 2 analyses: one of DNA motifs associated with either methylation-resistant or methylation-sensitive CpG islands and another analysis of motifs associated with exosome RNA secretion. Availability and Implementation: PISMA is developed in Java; it is executable in any type of hardware and in diverse operating systems. PISMA is freely available to noncommercial users. The English version and the User Manual are provided in Supplementary Files 1 and 2, and a Spanish version is available at www.biomedicas.unam.mx/wp-content/software/pisma.zip and www.biomedicas.unam.mx/wp-content/pdf/manual/pisma.pdf .

  7. Aggregation of topological motifs in the Escherichia coli transcriptional regulatory network

    Directory of Open Access Journals (Sweden)

    Barabási Albert-László

    2004-01-01

    Full Text Available Abstract Background Transcriptional regulation of cellular functions is carried out through a complex network of interactions among transcription factors and the promoter regions of genes and operons regulated by them.To better understand the system-level function of such networks simplification of their architecture was previously achieved by identifying the motifs present in the network, which are small, overrepresented, topologically distinct regulatory interaction patterns (subgraphs. However, the interaction of such motifs with each other, and their form of integration into the full network has not been previously examined. Results By studying the transcriptional regulatory network of the bacterium, Escherichia coli, we demonstrate that the two previously identified motif types in the network (i.e., feed-forward loops and bi-fan motifs do not exist in isolation, but rather aggregate into homologous motif clusters that largely overlap with known biological functions. Moreover, these clusters further coalesce into a supercluster, thus establishing distinct topological hierarchies that show global statistical properties similar to the whole network. Targeted removal of motif links disintegrates the network into small, isolated clusters, while random disruptions of equal number of links do not cause such an effect. Conclusion Individual motifs aggregate into homologous motif clusters and a supercluster forming the backbone of the E. coli transcriptional regulatory network and play a central role in defining its global topological organization.

  8. Parallel motif extraction from very long sequences

    KAUST Repository

    Sahli, Majed

    2013-01-01

    Motifs are frequent patterns used to identify biological functionality in genomic sequences, periodicity in time series, or user trends in web logs. In contrast to a lot of existing work that focuses on collections of many short sequences, modern applications require mining of motifs in one very long sequence (i.e., in the order of several gigabytes). For this case, there exist statistical approaches that are fast but inaccurate; or combinatorial methods that are sound and complete. Unfortunately, existing combinatorial methods are serial and very slow. Consequently, they are limited to very short sequences (i.e., a few megabytes), small alphabets (typically 4 symbols for DNA sequences), and restricted types of motifs. This paper presents ACME, a combinatorial method for extracting motifs from a single very long sequence. ACME arranges the search space in contiguous blocks that take advantage of the cache hierarchy in modern architectures, and achieves almost an order of magnitude performance gain in serial execution. It also decomposes the search space in a smart way that allows scalability to thousands of processors with more than 90% speedup. ACME is the only method that: (i) scales to gigabyte-long sequences; (ii) handles large alphabets; (iii) supports interesting types of motifs with minimal additional cost; and (iv) is optimized for a variety of architectures such as multi-core systems, clusters in the cloud, and supercomputers. ACME reduces the extraction time for an exact-length query from 4 hours to 7 minutes on a typical workstation; handles 3 orders of magnitude longer sequences; and scales up to 16, 384 cores on a supercomputer. Copyright is held by the owner/author(s).

  9. Structural modelling and phylogenetic analyses of PgeIF4A2 (Eukaryotic translation initiation factor) from Pennisetum glaucum reveal signature motifs with a role in stress tolerance and development.

    Science.gov (United States)

    Agarwal, Aakrati; Mudgil, Yashwanti; Pandey, Saurabh; Fartyal, Dhirendra; Reddy, Malireddy K

    2016-01-01

    Eukaryotic translation initiation factor 4A (eIF4A) is an indispensable component of the translation machinery and also play a role in developmental processes and stress alleviation in plants and animals. Different eIF4A isoforms are present in the cytosol of the cell, namely, eIF4A1, eIF4A2, and eIF4A3 and their expression is tightly regulated in cap-dependent translation. We revealed the structural model of PgeIF4A2 protein using the crystal structure of Homo sapiens eIF4A3 (PDB ID: 2J0S) as template by Modeller 9.12. The resultant PgeIF4A2 model structure was refined by PROCHECK, ProSA, Verify3D and RMSD that showed the model structure is reliable with 77 % amino acid sequence identity with template. Investigation revealed two conserved signatures for ATP-dependent RNA Helicase DEAD-box conserved site (VLDEADEML) and RNA helicase DEAD-box type, Q-motif in sheet-turn-helix and α-helical region respectively. All these conserved motifs are responsible for response during developmental stages and stress tolerance in plants.

  10. MODA: an efficient algorithm for network motif discovery in biological networks.

    Science.gov (United States)

    Omidi, Saeed; Schreiber, Falk; Masoudi-Nejad, Ali

    2009-10-01

    In recent years, interest has been growing in the study of complex networks. Since Erdös and Rényi (1960) proposed their random graph model about 50 years ago, many researchers have investigated and shaped this field. Many indicators have been proposed to assess the global features of networks. Recently, an active research area has developed in studying local features named motifs as the building blocks of networks. Unfortunately, network motif discovery is a computationally hard problem and finding rather large motifs (larger than 8 nodes) by means of current algorithms is impractical as it demands too much computational effort. In this paper, we present a new algorithm (MODA) that incorporates techniques such as a pattern growth approach for extracting larger motifs efficiently. We have tested our algorithm and found it able to identify larger motifs with more than 8 nodes more efficiently than most of the current state-of-the-art motif discovery algorithms. While most of the algorithms rely on induced subgraphs as motifs of the networks, MODA is able to extract both induced and non-induced subgraphs simultaneously. The MODA source code is freely available at: http://LBB.ut.ac.ir/Download/LBBsoft/MODA/

  11. Dynamic motifs in socio-economic networks

    Science.gov (United States)

    Zhang, Xin; Shao, Shuai; Stanley, H. Eugene; Havlin, Shlomo

    2014-12-01

    Socio-economic networks are of central importance in economic life. We develop a method of identifying and studying motifs in socio-economic networks by focusing on “dynamic motifs,” i.e., evolutionary connection patterns that, because of “node acquaintances” in the network, occur much more frequently than random patterns. We examine two evolving bi-partite networks: i) the world-wide commercial ship chartering market and ii) the ship build-to-order market. We find similar dynamic motifs in both bipartite networks, even though they describe different economic activities. We also find that “influence” and “persistence” are strong factors in the interaction behavior of organizations. When two companies are doing business with the same customer, it is highly probable that another customer who currently only has business relationship with one of these two companies, will become customer of the second in the future. This is the effect of influence. Persistence means that companies with close business ties to customers tend to maintain their relationships over a long period of time.

  12. Intronic deletions in the SLC34A3 gene: A cautionary tale for mutation analysis of hereditary hypophosphatemic rickets with hypercalciuria

    Science.gov (United States)

    Ichikawa, Shoji; Tuchman, Shamir; Padgett, Leah R.; Gray, Amie K.; Baluarte, H. Jorge; Econs, Michael J.

    2013-01-01

    Hereditary hypophosphatemic rickets with hypercalciuria (HHRH) is a rare metabolic disorder, characterized by hypophosphatemia, variable degrees of rickets/osteomalacia, and hypercalciuria secondary to increased serum 1,25-dihydroxyvitamin D [1,25(OH)2D] levels. HHRH is caused by mutations in the SLC34A3 gene, which encodes sodium-phosphate co-transporter type IIc. A 6 ½-year-old female presented with a history of nephrolithiasis. Her metabolic evaluation revealed increased 24- hour urine calcium excretion with high serum calcium, low intact parathyroid hormone (PTH) levels, and elevated 1,25(OH)2D level. In addition, the patient had low to low-normal serum phosphorus with high urine phosphorus. The patient had normal stature; without rachitic or boney deformities or a history of fractures. Genetic analysis of SLC34A3 revealed the patient to be a compound heterozygote for a novel single base pair deletion in exon 12 (c.1304delG) and 30-base pair deletion in intron 6 (g.1440–1469del). The single-base pair mutation causes a frameshift, which results in premature stop codon. The intronic deletion is likely caused by misalignment of the 4-basepair homologous repeats and results in the truncation of an already small intron to 63 bp, which would impair proper RNA splicing of the intron. This is the fourth unique intronic deletion identified in patients with HHRH, suggesting the frequent occurrence of sequence misalignments in SLC34A3 and the importance of screening introns in patients with HHRH. PMID:24176905

  13. Correlation of PTPN11 polymorphism at intron 3 with gastric cancer

    Directory of Open Access Journals (Sweden)

    Li ZHANG

    2011-05-01

    Full Text Available Objective To explore the correlation of protein-tyrosine-phosphatase nonreceptor-type 11(PTPN11 polymorphism at intron 3 with gastric cancer in Chinese population,and the feasibility and accuracy of employing mastrix-assisted laser desorption ionization time-of-flight mass spectrogram(MALDI-TOF-MS in genotyping of this SNP.Methods Two hundred and forty-seven patients with gastric cancer,212 cancer-free patients and 160 cord blood samples were enrolled in present study.Genotypes of PTPN11 G/A polymorphism at intron 3 were determined by MALDI-TOF-MS analysis,and direct sequencing of PCR products with 20 samples of the gene locus was done for checking the accuracy of MALDI-TOF-MS.Histological examination,Helicobacter pylori culture,rapid urease test,serum anti-H.pylori antibodies(ELISA and urease colloidal gold test were performed to evaluate H.pylori infection.Results Direct sequencing of 20 random selected samples were well consistent with the MALDI-TOF-MS results.The rates of H.pylori infection were 73.68% in gastric cancer patients and 75.47% in cancer-free patients,implying no significant difference between the two groups.The distributions of genotypes were in Hardy Weinberg equilibrium in both gastric cancer patients and controls.There were no significant differences in the genotype frequencies between the 2 groups(P>0.05.Compared with the GG genotype,GA+AA genotype could not influence the risk of gastric cancer.When stratified for status,PTPN11 polymorphism was not associated with age,gender and H.pylori infection states in both cancer patients and controls.Conclusion It seems that PTPN11 G/A polymorphism at intron 3 has no affection on the risk of gastric cancer in Chinese population.

  14. Assessing local structure motifs using order parameters for motif recognition, interstitial identification, and diffusion path characterization

    Science.gov (United States)

    Zimmermann, Nils E. R.; Horton, Matthew K.; Jain, Anubhav; Haranczyk, Maciej

    2017-11-01

    Structure-property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells) of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal closed packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP) database (61,422 compounds) for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT) facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  15. Evolutionary conservation of TORC1 components, TOR, Raptor, and LST8, between rice and yeast.

    Science.gov (United States)

    Maegawa, Kentaro; Takii, Rumi; Ushimaru, Takashi; Kozaki, Akiko

    2015-10-01

    Target of rapamycin (TOR) is a conserved eukaryotic serine/threonine kinase that functions as a central controller of cell growth. TOR protein is structurally defined by the presence several conserved domains such as the HEAT repeat, focal adhesion target (FAT), FKBP12/rapamycin binding (FRB), kinase, and FATC domains starting from the N-terminus. In most eukaryotes, TOR forms two distinct physical and functional complexes, which are termed as TOR complex 1 (TORC1) and TORC2. However, plants contain only TORC1 components, i.e., TOR, Raptor, and LST8. In this study, we analyzed the gene structure and functions of TORC components in rice to understand the properties of the TOR complex in plants. Comparison of the locations of introns in these genes among rice and other eukaryotes showed that they were well conserved among plants except for Chlamydomonas. Moreover, the intron positions in the coding sequence of human Raptor and LST8 were closer to those of plants than of fly or nematode. Complementation tests of rice TOR (OsTOR) components in yeast showed that although OsTOR did not complement yeast tor mutants, chimeric TOR, which consisted of the HEAT repeat and FAT domain from yeast and other regions from rice, rescued the tor mutants, indicating that the HEAT repeat and FAT domains are important for species-specific signaling. OsRaptor perfectly complemented a kog1 (yeast Raptor homolog) mutant, and OsLST8 partially complemented an lst8 mutant. Together, these data suggest the importance of the N-terminal region of the TOR, HEAT, and FAT domains for functional diversification of the TOR complex.

  16. Noroviruses Co-opt the Function of Host Proteins VAPA and VAPB for Replication via a Phenylalanine-Phenylalanine-Acidic-Tract-Motif Mimic in Nonstructural Viral Protein NS1/2.

    Science.gov (United States)

    McCune, Broc T; Tang, Wei; Lu, Jia; Eaglesham, James B; Thorne, Lucy; Mayer, Anne E; Condiff, Emily; Nice, Timothy J; Goodfellow, Ian; Krezel, Andrzej M; Virgin, Herbert W

    2017-07-11

    The Norovirus genus contains important human pathogens, but the role of host pathways in norovirus replication is largely unknown. Murine noroviruses provide the opportunity to study norovirus replication in cell culture and in small animals. The human norovirus nonstructural protein NS1/2 interacts with the host protein VAMP-associated protein A (VAPA), but the significance of the NS1/2-VAPA interaction is unexplored. Here we report decreased murine norovirus replication in VAPA- and VAPB-deficient cells. We characterized the role of VAPA in detail. VAPA was required for the efficiency of a step(s) in the viral replication cycle after entry of viral RNA into the cytoplasm but before the synthesis of viral minus-sense RNA. The interaction of VAPA with viral NS1/2 proteins is conserved between murine and human noroviruses. Murine norovirus NS1/2 directly bound the major sperm protein (MSP) domain of VAPA through its NS1 domain. Mutations within NS1 that disrupted interaction with VAPA inhibited viral replication. Structural analysis revealed that the viral NS1 domain contains a mimic of the phenylalanine-phenylalanine-acidic-tract (FFAT) motif that enables host proteins to bind to the VAPA MSP domain. The NS1/2-FFAT mimic region interacted with the VAPA-MSP domain in a manner similar to that seen with bona fide host FFAT motifs. Amino acids in the FFAT mimic region of the NS1 domain that are important for viral replication are highly conserved across murine norovirus strains. Thus, VAPA interaction with a norovirus protein that functionally mimics host FFAT motifs is important for murine norovirus replication. IMPORTANCE Human noroviruses are a leading cause of gastroenteritis worldwide, but host factors involved in norovirus replication are incompletely understood. Murine noroviruses have been studied to define mechanisms of norovirus replication. Here we defined the importance of the interaction between the hitherto poorly studied NS1/2 norovirus protein and the

  17. Composite Structural Motifs of Binding Sites for Delineating Biological Functions of Proteins

    Science.gov (United States)

    Kinjo, Akira R.; Nakamura, Haruki

    2012-01-01

    Most biological processes are described as a series of interactions between proteins and other molecules, and interactions are in turn described in terms of atomic structures. To annotate protein functions as sets of interaction states at atomic resolution, and thereby to better understand the relation between protein interactions and biological functions, we conducted exhaustive all-against-all atomic structure comparisons of all known binding sites for ligands including small molecules, proteins and nucleic acids, and identified recurring elementary motifs. By integrating the elementary motifs associated with each subunit, we defined composite motifs that represent context-dependent combinations of elementary motifs. It is demonstrated that function similarity can be better inferred from composite motif similarity compared to the similarity of protein sequences or of individual binding sites. By integrating the composite motifs associated with each protein function, we define meta-composite motifs each of which is regarded as a time-independent diagrammatic representation of a biological process. It is shown that meta-composite motifs provide richer annotations of biological processes than sequence clusters. The present results serve as a basis for bridging atomic structures to higher-order biological phenomena by classification and integration of binding site structures. PMID:22347478

  18. Characterization of intronic uridine-rich sequence elements acting as possible targets for nuclear proteins during pre-mRNA splicing in Nicotiana plumbaginifolia.

    Science.gov (United States)

    Gniadkowski, M; Hemmings-Mieszczak, M; Klahre, U; Liu, H X; Filipowicz, W

    1996-02-15

    Introns of nuclear pre-mRNAs in dicotyledonous plants, unlike introns in vertebrates or yeast, are distinctly rich in A+U nucleotides and this feature is essential for their processing. In order to define more precisely sequence elements important for intron recognition in plants, we investigated the effects of short insertions, either U-rich or A-rich, on splicing of synthetic introns in transfected protoplast of Nicotiana plumbaginifolia. It was found that insertions of U-rich (sequence UUUUUAU) but not A-rich (AUAAAAA) segments can activate splicing of a GC-rich synthetic infron, and that U-rich segments, or multimers thereof, can function irrespective of the site of insertion within the intron. Insertions of multiple U-rich segments, either at the same or different locations, generally had an additive, stimulatory effect on splicing. Mutational analysis showed that replacement of one or two U residues in the UUUUUAU sequence with A or C residues had only a small effect on splicing, but replacement with G residues was strongly inhibitory. Proteins that interact with fragments of natural and synthetic pre-mRNAs in vitro were identified in nuclear extracts of N.plumbaginifolia by UV cross- linking. The profile of cross-linked plant proteins was considerably less complex than that obtained with a HeLa cell nuclear extract. Two major cross-linkable plant proteins had apparent molecular mass of 50 and 54 kDa and showed affinity for oligouridilates present in synGC introns or for poly(U).

  19. Probing structural changes of self assembled i-motif DNA

    KAUST Repository

    Lee, Iljoon; Patil, Sachin; Fhayli, Karim; Alsaiari, Shahad K.; Khashab, Niveen M.

    2015-01-01

    We report an i-motif structural probing system based on Thioflavin T (ThT) as a fluorescent sensor. This probe can discriminate the structural changes of RET and Rb i-motif sequences according to pH change. This journal is

  20. Conservation patterns in different functional sequence categoriesof divergent Drosophila species

    Energy Technology Data Exchange (ETDEWEB)

    Papatsenko, Dmitri; Kislyuk, Andrey; Levine, Michael; Dubchak, Inna

    2005-10-01

    We have explored the distributions of fully conservedungapped blocks in genome-wide pairwise alignments of recently completedspecies of Drosophila: D.yakuba, D.ananassae, D.pseudoobscura, D.virilisand D.mojavensis. Based on these distributions we have found that nearlyevery functional sequence category possesses its own distinctiveconservation pattern, sometimes independent of the overall sequenceconservation level. In the coding and regulatory regions, the ungappedblocks were longer than in introns, UTRs and non-functional sequences. Atthe same time, the blocks in the coding regions carried 3N+2 signaturecharacteristic to synonymic substitutions in the 3rd codon positions.Larger block sizes in transcription regulatory regions can be explainedby the presence of conserved arrays of binding sites for transcriptionfactors. We also have shown that the longest ungapped blocks, or'ultraconserved' sequences, are associated with specific gene groups,including those encoding ion channels and components of the cytoskeleton.We discussed how restrained conservation patterns may help in mappingfunctional sequence categories and improving genomeannotation.

  1. RegRNA: an integrated web server for identifying regulatory RNA motifs and elements

    OpenAIRE

    Huang, Hsi-Yuan; Chien, Chia-Hung; Jen, Kuan-Hua; Huang, Hsien-Da

    2006-01-01

    Numerous regulatory structural motifs have been identified as playing essential roles in transcriptional and post-transcriptional regulation of gene expression. RegRNA is an integrated web server for identifying the homologs of regulatory RNA motifs and elements against an input mRNA sequence. Both sequence homologs and structural homologs of regulatory RNA motifs can be recognized. The regulatory RNA motifs supported in RegRNA are categorized into several classes: (i) motifs in mRNA 5′-untra...

  2. A 6-Nucleotide Regulatory Motif within the AbcR Small RNAs of Brucella abortus Mediates Host-Pathogen Interactions.

    Science.gov (United States)

    Sheehan, Lauren M; Caswell, Clayton C

    2017-06-06

    In Brucella abortus , two small RNAs (sRNAs), AbcR1 and AbcR2, are responsible for regulating transcripts encoding ABC-type transport systems. AbcR1 and AbcR2 are required for Brucella virulence, as a double chromosomal deletion of both sRNAs results in attenuation in mice. Although these sRNAs are responsible for targeting transcripts for degradation, the mechanism utilized by the AbcR sRNAs to regulate mRNA in Brucella has not been described. Here, two motifs (M1 and M2) were identified in AbcR1 and AbcR2, and complementary motif sequences were defined in AbcR-regulated transcripts. Site-directed mutagenesis of M1 or M2 or of both M1 and M2 in the sRNAs revealed transcripts to be targeted by one or both motifs. Electrophoretic mobility shift assays revealed direct, concentration-dependent binding of both AbcR sRNAs to a target mRNA sequence. These experiments genetically and biochemically characterized two indispensable motifs within the AbcR sRNAs that bind to and regulate transcripts. Additionally, cellular and animal models of infection demonstrated that only M2 in the AbcR sRNAs is required for Brucella virulence. Furthermore, one of the M2-regulated targets, BAB2_0612, was found to be critical for the virulence of B. abortus in a mouse model of infection. Although these sRNAs are highly conserved among Alphaproteobacteria , the present report displays how gene regulation mediated by the AbcR sRNAs has diverged to meet the intricate regulatory requirements of each particular organism and its unique biological niche. IMPORTANCE Small RNAs (sRNAs) are important components of bacterial regulation, allowing organisms to quickly adapt to changes in their environments. The AbcR sRNAs are highly conserved throughout the Alphaproteobacteria and negatively regulate myriad transcripts, many encoding ABC-type transport systems. In Brucella abortus , AbcR1 and AbcR2 are functionally redundant, as only a double abcR1 abcR2 ( abcR1 / 2 ) deletion results in attenuation in

  3. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae

    Directory of Open Access Journals (Sweden)

    Christian J. Michel

    2017-12-01

    Full Text Available A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C 3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X , using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X , in the complete genome of the yeast Saccharomyces cerevisiae. Several properties of X motifs are identified by basic statistics (at the frequency level, and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R . We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae. We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae, but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions. This property is true for all cardinalities of X motifs (from 4 to 20 and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non- X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together

  4. Enrichment of Circular Code Motifs in the Genes of the Yeast Saccharomyces cerevisiae.

    Science.gov (United States)

    Michel, Christian J; Ngoune, Viviane Nguefack; Poch, Olivier; Ripp, Raymond; Thompson, Julie D

    2017-12-03

    A set X of 20 trinucleotides has been found to have the highest average occurrence in the reading frame, compared to the two shifted frames, of genes of bacteria, archaea, eukaryotes, plasmids and viruses. This set X has an interesting mathematical property, since X is a maximal C3 self-complementary trinucleotide circular code. Furthermore, any motif obtained from this circular code X has the capacity to retrieve, maintain and synchronize the original (reading) frame. Since 1996, the theory of circular codes in genes has mainly been developed by analysing the properties of the 20 trinucleotides of X, using combinatorics and statistical approaches. For the first time, we test this theory by analysing the X motifs, i.e., motifs from the circular code X, in the complete genome of the yeast Saccharomyces cerevisiae . Several properties of X motifs are identified by basic statistics (at the frequency level), and evaluated by comparison to R motifs, i.e., random motifs generated from 30 different random codes R. We first show that the frequency of X motifs is significantly greater than that of R motifs in the genome of S. cerevisiae . We then verify that no significant difference is observed between the frequencies of X and R motifs in the non-coding regions of S. cerevisiae , but that the occurrence number of X motifs is significantly higher than R motifs in the genes (protein-coding regions). This property is true for all cardinalities of X motifs (from 4 to 20) and for all 16 chromosomes. We further investigate the distribution of X motifs in the three frames of S. cerevisiae genes and show that they occur more frequently in the reading frame, regardless of their cardinality or their length. Finally, the ratio of X genes, i.e., genes with at least one X motif, to non-X genes, in the set of verified genes is significantly different to that observed in the set of putative or dubious genes with no experimental evidence. These results, taken together, represent the first

  5. Assessing Local Structure Motifs Using Order Parameters for Motif Recognition, Interstitial Identification, and Diffusion Path Characterization

    Directory of Open Access Journals (Sweden)

    Nils E. R. Zimmermann

    2017-11-01

    Full Text Available Structure–property relationships form the basis of many design rules in materials science, including synthesizability and long-term stability of catalysts, control of electrical and optoelectronic behavior in semiconductors, as well as the capacity of and transport properties in cathode materials for rechargeable batteries. The immediate atomic environments (i.e., the first coordination shells of a few atomic sites are often a key factor in achieving a desired property. Some of the most frequently encountered coordination patterns are tetrahedra, octahedra, body and face-centered cubic as well as hexagonal close packed-like environments. Here, we showcase the usefulness of local order parameters to identify these basic structural motifs in inorganic solid materials by developing classification criteria. We introduce a systematic testing framework, the Einstein crystal test rig, that probes the response of order parameters to distortions in perfect motifs to validate our approach. Subsequently, we highlight three important application cases. First, we map basic crystal structure information of a large materials database in an intuitive manner by screening the Materials Project (MP database (61,422 compounds for element-specific motif distributions. Second, we use the structure-motif recognition capabilities to automatically find interstitials in metals, semiconductor, and insulator materials. Our Interstitialcy Finding Tool (InFiT facilitates high-throughput screenings of defect properties. Third, the order parameters are reliable and compact quantitative structure descriptors for characterizing diffusion hops of intercalants as our example of magnesium in MnO2-spinel indicates. Finally, the tools developed in our work are readily and freely available as software implementations in the pymatgen library, and we expect them to be further applied to machine-learning approaches for emerging applications in materials science.

  6. [Applylication of new type combined fragments: nrDNA ITS+ nad 1-intron 2 for identification of Dendrobium species of Fengdous].

    Science.gov (United States)

    Geng, Li-xia; Zheng, Rui; Ren, Jie; Niu, Zhi-tao; Sun, Yu-long; Xue, Qing-yun; Liu, Wei; Ding, Xiao-yu

    2015-08-01

    In this study, 17 kinds of Dendrobium species of Fengdous including 39 individuals were collected from 4 provinces. Mitochondrial gene sequences co I, nad 5, nad 1-intron 2 and chloroplast gene sequences rbcL, matK amd psbA-trnH were amplified from these materials, as well as nrDNA ITS. Furthermore, suitable sequences for identification of Dendrobium species of Fengdous were screened by K-2-P and P-distance. The results showed that during the mentioned 7 sequences, nrDNA ITS, nad 1-intron 2 and psbA-trnH which had a high degree of variability could be used to identify Dendrobium species of Fengdous. However, single fragment could not be used to distinguish D. moniliforme and D. huoshanense. Moreover, compared to other combined fragments, new type combined fragments nrDNA ITS+nad 1-intron 2 was more effective in identifying the original plants of Dendrobium species and could be used to identify D. huoshanense and D. moniliforme. Besides, according to the UPGMA tree constructed with nrDNA ITS+nad 1-intron 2, 3 inspected Dendrobium plants were identified as D. huoshanense, D. moniliforme and D. officinale, respectively. This study identified Dendrobium species of Fengdous by combined fragments nrDNA ITS+nad 1-intron 2 for the first time, which provided a more effective basis for identification of Dendrobium species. And this study will be helpful for regulating the market of Fengdous.

  7. RNA recognition motif (RRM)-containing proteins in Bombyx mori

    African Journals Online (AJOL)

    STORAGESEVER

    2009-03-20

    Mar 20, 2009 ... Recognition Motif (RRM), sometimes referred to as. RNP1, is one of the first identified domains for RNA interaction. RRM is very common ..... Apart from the RRM motif, eIF3-S9 has a Trp-Asp. (WD) repeat domain, Poly (A) ...

  8. Comparative anatomy of the human APRT gene and enzyme: nucleotide sequence divergence and conservation of a nonrandom CpG dinucleotide arrangement

    International Nuclear Information System (INIS)

    Broderick, T.P.; Schaff, D.A.; Bertino, A.M.; Dush, M.K.; Tischfield, J.A.; Stambrook, P.J.

    1987-01-01

    The functional human adenine phosphoribosyltransferase (APRT) gene is <2.6 kilobases in length and contains five exons. The amino acid sequences of APRTs have been highly conserved throughout evolution. The human enzyme is 82%, 90%, and 40% identical to the mouse, hamster, and Escherichia coli enzymes, respectively. The promoter region of the human APRT gene, like that of several other housekeeping genes, lacks TATA and CCAAT boxes but contains five GC boxes that are potential binding sites for the Sp1 transcription factor. The distal three, however, are dispensable for gene expression. Comparison between human and mouse APRT gene nucleotide sequences reveals a high degree of homology within protein coding regions but an absence of significant homology in 5' flanking, 3' untranslated, and intron sequences, except for similarly positioned GC boxes in the promoter region and a 26-base-pair region in intron 3. This 26-base-pair sequence is 92% identical with a similarly positioned sequence in the mouse gene and is also found in intron 3 of the hamster gene, suggesting that its retention may be a consequence of stringent selection. The positions of all introns have been precisely retained in the human and both rodent genes. Retention of an elevated CpG dinucleotide content, despite loss of sequence homology, suggests that there may be selection for CpG dinucleotides in these regions and that their maintenance may be important for APRT gene function

  9. The Dunaliella salina organelle genomes: large sequences, inflated with intronic and intergenic DNA

    Energy Technology Data Exchange (ETDEWEB)

    Smith, David R.; Lee, Robert W.; Cushman, John C.; Magnuson, Jon K.; Tran, Duc; Polle, Juergen E.

    2010-05-07

    Abstract Background: Dunaliella salina Teodoresco, a unicellular, halophilic green alga belonging to the Chlorophyceae, is among the most industrially important microalgae. This is because D. salina can produce massive amounts of β-carotene, which can be collected for commercial purposes, and because of its potential as a feedstock for biofuels production. Although the biochemistry and physiology of D. salina have been studied in great detail, virtually nothing is known about the genomes it carries, especially those within its mitochondrion and plastid. This study presents the complete mitochondrial and plastid genome sequences of D. salina and compares them with those of the model green algae Chlamydomonas reinhardtii and Volvox carteri. Results: The D. salina organelle genomes are large, circular-mapping molecules with ~60% noncoding DNA, placing them among the most inflated organelle DNAs sampled from the Chlorophyta. In fact, the D. salina plastid genome, at 269 kb, is the largest complete plastid DNA (ptDNA) sequence currently deposited in GenBank, and both the mitochondrial and plastid genomes have unprecedentedly high intron densities for organelle DNA: ~1.5 and ~0.4 introns per gene, respectively. Moreover, what appear to be the relics of genes, introns, and intronic open reading frames are found scattered throughout the intergenic ptDNA regions -- a trait without parallel in other characterized organelle genomes and one that gives insight into the mechanisms and modes of expansion of the D. salina ptDNA. Conclusions: These findings confirm the notion that chlamydomonadalean algae have some of the most extreme organelle genomes of all eukaryotes. They also suggest that the events giving rise to the expanded ptDNA architecture of D. salina and other Chlamydomonadales may have occurred early in the evolution of this lineage. Although interesting from a genome evolution standpoint, the D. salina organelle DNA sequences will aid in the development of a viable

  10. The Dunaliella salina organelle genomes: large sequences, inflated with intronic and intergenic DNA

    Directory of Open Access Journals (Sweden)

    Tran Duc

    2010-05-01

    Full Text Available Abstract Background Dunaliella salina Teodoresco, a unicellular, halophilic green alga belonging to the Chlorophyceae, is among the most industrially important microalgae. This is because D. salina can produce massive amounts of β-carotene, which can be collected for commercial purposes, and because of its potential as a feedstock for biofuels production. Although the biochemistry and physiology of D. salina have been studied in great detail, virtually nothing is known about the genomes it carries, especially those within its mitochondrion and plastid. This study presents the complete mitochondrial and plastid genome sequences of D. salina and compares them with those of the model green algae Chlamydomonas reinhardtii and Volvox carteri. Results The D. salina organelle genomes are large, circular-mapping molecules with ~60% noncoding DNA, placing them among the most inflated organelle DNAs sampled from the Chlorophyta. In fact, the D. salina plastid genome, at 269 kb, is the largest complete plastid DNA (ptDNA sequence currently deposited in GenBank, and both the mitochondrial and plastid genomes have unprecedentedly high intron densities for organelle DNA: ~1.5 and ~0.4 introns per gene, respectively. Moreover, what appear to be the relics of genes, introns, and intronic open reading frames are found scattered throughout the intergenic ptDNA regions -- a trait without parallel in other characterized organelle genomes and one that gives insight into the mechanisms and modes of expansion of the D. salina ptDNA. Conclusions These findings confirm the notion that chlamydomonadalean algae have some of the most extreme organelle genomes of all eukaryotes. They also suggest that the events giving rise to the expanded ptDNA architecture of D. salina and other Chlamydomonadales may have occurred early in the evolution of this lineage. Although interesting from a genome evolution standpoint, the D. salina organelle DNA sequences will aid in the

  11. Intron loss from the NADH dehydrogenase subunit 4 gene of lettuce mitochondrial DNA: evidence for homologous recombination of a cDNA intermediate.

    Science.gov (United States)

    Geiss, K T; Abbas, G M; Makaroff, C A

    1994-04-01

    The mitochondrial gene coding for subunit 4 of the NADH dehydrogenase complex I (nad4) has been isolated and characterized from lettuce, Lactuca sativa. Analysis of nad4 genes in a number of plants by Southern hybridization had previously suggested that the intron content varied between species. Characterization of the lettuce gene confirms this observation. Lettuce nad4 contains two exons and one group IIA intron, whereas previously sequenced nad4 genes from turnip and wheat contain three group IIA introns. Northern analysis identified a transcript of 1600 nucleotides, which represents the mature nad4 mRNA and a primary transcript of 3200 nucleotides. Sequence analysis of lettuce and turnip nad4 cDNAs was used to confirm the intron/exon border sequences and to examine RNA editing patterns. Editing is observed at the 5' and 3' ends of the lettuce transcript, but is absent from sequences that correspond to exons two, three and the 5' end of exon four in turnip and wheat. In contrast, turnip transcripts are highly edited in this region, suggesting that homologous recombination of an edited and spliced cDNA intermediate was involved in the loss of introns two and three from an ancestral lettuce nad4 gene.

  12. Distinct configurations of protein complexes and biochemical pathways revealed by epistatic interaction network motifs

    LENUS (Irish Health Repository)

    Casey, Fergal

    2011-08-22

    Abstract Background Gene and protein interactions are commonly represented as networks, with the genes or proteins comprising the nodes and the relationship between them as edges. Motifs, or small local configurations of edges and nodes that arise repeatedly, can be used to simplify the interpretation of networks. Results We examined triplet motifs in a network of quantitative epistatic genetic relationships, and found a non-random distribution of particular motif classes. Individual motif classes were found to be associated with different functional properties, suggestive of an underlying biological significance. These associations were apparent not only for motif classes, but for individual positions within the motifs. As expected, NNN (all negative) motifs were strongly associated with previously reported genetic (i.e. synthetic lethal) interactions, while PPP (all positive) motifs were associated with protein complexes. The two other motif classes (NNP: a positive interaction spanned by two negative interactions, and NPP: a negative spanned by two positives) showed very distinct functional associations, with physical interactions dominating for the former but alternative enrichments, typical of biochemical pathways, dominating for the latter. Conclusion We present a model showing how NNP motifs can be used to recognize supportive relationships between protein complexes, while NPP motifs often identify opposing or regulatory behaviour between a gene and an associated pathway. The ability to use motifs to point toward underlying biological organizational themes is likely to be increasingly important as more extensive epistasis mapping projects in higher organisms begin.

  13. Fingerprint motifs of phytases | Fan | African Journal of Biotechnology

    African Journals Online (AJOL)

    Among the total of potential 173 phytases gained in 11 plant genomes through MAST, PAPhys are the major phytases, and HAPhys are the minor, and other phytase groups are not found in planta. Keywords: Phytase, fingerprint motif, multiple EM for motif elicitation (MEME), MAST African Journal of Biotechnology Vol.

  14. BIALLELIC POLYMORPHISM IN THE INTRON REGION OF B-TUBULIN GENE OF CRYPTOSPORIDIUM PARASITES

    Science.gov (United States)

    Nucleotide sequencing of polymerase chain reaction-amplified intron region of the Cryptosporidium parvum B-tubulin gene in 26 human and 15 animal isolates revealed distinct genetic polymorphism between the human and bovine genotypes. The separation of 2 genotypes of C. parvum is...

  15. The Long Intron 1 of Growth Hormone Gene from Reeves’ Turtle (Chinemys reevesii Correlates with Negatively Regulated GH Expression in Four Cell Lines

    Directory of Open Access Journals (Sweden)

    Wen-Sheng Liu

    2016-04-01

    Full Text Available Turtles grow slowly and have a long lifespan. Ultrastructural studies of the pituitary gland in Reeves’ turtle (Chinemys reevesii have revealed that the species possesses a higher nucleoplasmic ratio and fewer secretory granules in growth hormone (GH cells than other animal species in summer and winter. C. reevesii GH gene was cloned and species-specific similarities and differences were investigated. The full GH gene sequence in C. reevesii contains 8517 base pairs (bp, comprising five exons and four introns. Intron 1 was found to be much longer in C. reevesii than in other species. The coding sequence (CDS of the turtle’s GH gene, with and without the inclusion of intron 1, was transfected into four cell lines, including DF-1 chicken embryo fibroblasts, Chinese hamster ovary (CHO cells, human embryonic kidney 293FT cells, and GH4C1 rat pituitary cells; the turtle growth hormone (tGH gene mRNA and protein expression levels decreased significantly in the intron-containing CDS in these cell lines, compared with that of the corresponding intronless CDS. Thus, the long intron 1 of GH gene in Reeves’ turtle might correlate with downregulated gene expression.

  16. Complex group-I introns in nuclear SSU rDNA of red and green algae: evidence of homing-endonuclease pseudogenes in the Bangiophyceae

    DEFF Research Database (Denmark)

    Haugen, P; Huss, V A; Nielsen, Henrik

    1999-01-01

    on the complementary strand. A comparison between related group-I introns in the Bangiophyceae revealed homing-endonuclease-like pseudogenes due to frame-shifts and deletions in Porphyra and Bangia. The Scenedesmus and Porphyra introns provide new insights into the evolution and possible novel functions of nuclear...

  17. Short Arginine Motifs Drive Protein Stickiness in the Escherichia coli Cytoplasm.

    Science.gov (United States)

    Kyne, Ciara; Crowley, Peter B

    2017-09-19

    Although essential to numerous biotech applications, knowledge of molecular recognition by arginine-rich motifs in live cells remains limited. 1 H, 15 N HSQC and 19 F NMR spectroscopies were used to investigate the effects of C-terminal -GR n (n = 1-5) motifs on GB1 interactions in Escherichia coli cells and cell extracts. While the "biologically inert" GB1 yields high-quality in-cell spectra, the -GR n fusions with n = 4 or 5 were undetectable. This result suggests that a tetra-arginine motif is sufficient to drive interactions between a test protein and macromolecules in the E. coli cytoplasm. The inclusion of a 12 residue flexible linker between GB1 and the -GR 5 motif did not improve detection of the "inert" domain. In contrast, all of the constructs were detectable in cell lysates and extracts, suggesting that the arginine-mediated complexes were weak. Together these data reveal the significance of weak interactions between short arginine-rich motifs and the E. coli cytoplasm and demonstrate the potential of such motifs to modify protein interactions in living cells. These interactions must be considered in the design of (in vivo) nanoscale assemblies that rely on arginine-rich sequences.

  18. Group I introns and associated homing endonuclease genes reveals a clinal structure for Porphyra spiralis var. amplifolia (Bangiales, Rhodophyta) along the Eastern coast of South America

    Science.gov (United States)

    2008-01-01

    Background Group I introns are found in the nuclear small subunit ribosomal RNA gene (SSU rDNA) of some species of the genus Porphyra (Bangiales, Rhodophyta). Size polymorphisms in group I introns has been interpreted as the result of the degeneration of homing endonuclease genes (HEG) inserted in peripheral loops of intron paired elements. In this study, intron size polymorphisms were characterized for different Porphyra spiralis var. amplifolia (PSA) populations on the Southern Brazilian coast, and were used to infer genetic relationships and genetic structure of these PSA populations, in addition to cox2-3 and rbcL-S regions. Introns of different sizes were tested qualitatively for in vitro self-splicing. Results Five intron size polymorphisms within 17 haplotypes were obtained from 80 individuals representing eight localities along the distribution of PSA in the Eastern coast of South America. In order to infer genetic structure and genetic relationships of PSA, these polymorphisms and haplotypes were used as markers for pairwise Fst analyses, Mantel's test and median joining network. The five cox2-3 haplotypes and the unique rbcL-S haplotype were used as markers for summary statistics, neutrality tests Tajima's D and Fu's Fs and for median joining network analyses. An event of demographic expansion from a population with low effective number, followed by a pattern of isolation by distance was obtained for PSA populations with the three analyses. In vitro experiments have shown that introns of different lengths were able to self-splice from pre-RNA transcripts. Conclusion The findings indicated that degenerated HEGs are reminiscent of the presence of a full-length and functional HEG, once fixed for PSA populations. The cline of HEG degeneration determined the pattern of isolation by distance. Analyses with the other markers indicated an event of demographic expansion from a population with low effective number. The different degrees of degeneration of the HEG

  19. Group I introns and associated homing endonuclease genes reveals a clinal structure for Porphyra spiralis var. amplifolia (Bangiales, Rhodophyta along the Eastern coast of South America

    Directory of Open Access Journals (Sweden)

    Matioli Sergio R

    2008-11-01

    Full Text Available Abstract Background Group I introns are found in the nuclear small subunit ribosomal RNA gene (SSU rDNA of some species of the genus Porphyra (Bangiales, Rhodophyta. Size polymorphisms in group I introns has been interpreted as the result of the degeneration of homing endonuclease genes (HEG inserted in peripheral loops of intron paired elements. In this study, intron size polymorphisms were characterized for different Porphyra spiralis var. amplifolia (PSA populations on the Southern Brazilian coast, and were used to infer genetic relationships and genetic structure of these PSA populations, in addition to cox2-3 and rbcL-S regions. Introns of different sizes were tested qualitatively for in vitro self-splicing. Results Five intron size polymorphisms within 17 haplotypes were obtained from 80 individuals representing eight localities along the distribution of PSA in the Eastern coast of South America. In order to infer genetic structure and genetic relationships of PSA, these polymorphisms and haplotypes were used as markers for pairwise Fst analyses, Mantel's test and median joining network. The five cox2-3 haplotypes and the unique rbcL-S haplotype were used as markers for summary statistics, neutrality tests Tajima's D and Fu's Fs and for median joining network analyses. An event of demographic expansion from a population with low effective number, followed by a pattern of isolation by distance was obtained for PSA populations with the three analyses. In vitro experiments have shown that introns of different lengths were able to self-splice from pre-RNA transcripts. Conclusion The findings indicated that degenerated HEGs are reminiscent of the presence of a full-length and functional HEG, once fixed for PSA populations. The cline of HEG degeneration determined the pattern of isolation by distance. Analyses with the other markers indicated an event of demographic expansion from a population with low effective number. The different degrees of

  20. Crystal Structures of the Scaffolding Protein LGN Reveal the General Mechanism by Which GoLoco Binding Motifs Inhibit the Release of GDP from Gαi *

    Science.gov (United States)

    Jia, Min; Li, Jianchao; Zhu, Jinwei; Wen, Wenyu; Zhang, Mingjie; Wang, Wenning

    2012-01-01

    GoLoco (GL) motif-containing proteins regulate G protein signaling by binding to Gα subunit and acting as guanine nucleotide dissociation inhibitors. GLs of LGN are also known to bind the GDP form of Gαi/o during asymmetric cell division. Here, we show that the C-terminal GL domain of LGN binds four molecules of Gαi·GDP. The crystal structures of Gαi·GDP in complex with LGN GL3 and GL4, respectively, reveal distinct GL/Gαi interaction features when compared with the only high resolution structure known with GL/Gαi interaction between RGS14 and Gαi1. Only a few residues C-terminal to the conserved GL sequence are required for LGN GLs to bind to Gαi·GDP. A highly conserved “double Arg finger” sequence (RΨ(D/E)(D/E)QR) is responsible for LGN GL to bind to GDP bound to Gαi. Together with the sequence alignment, we suggest that the LGN GL/Gαi interaction represents a general binding mode between GL motifs and Gαi. We also show that LGN GLs are potent guanine nucleotide dissociation inhibitors. PMID:22952234

  1. Anion induced conformational preference of Cα NN motif residues in functional proteins.

    Science.gov (United States)

    Patra, Piya; Ghosh, Mahua; Banerjee, Raja; Chakrabarti, Jaydeb

    2017-12-01

    Among different ligand binding motifs, anion binding C α NN motif consisting of peptide backbone atoms of three consecutive residues are observed to be important for recognition of free anions, like sulphate or biphosphate and participate in different key functions. Here we study the interaction of sulphate and biphosphate with C α NN motif present in different proteins. Instead of total protein, a peptide fragment has been studied keeping C α NN motif flanked in between other residues. We use classical force field based molecular dynamics simulations to understand the stability of this motif. Our data indicate fluctuations in conformational preferences of the motif residues in absence of the anion. The anion gives stability to one of these conformations. However, the anion induced conformational preferences are highly sequence dependent and specific to the type of anion. In particular, the polar residues are more favourable compared to the other residues for recognising the anion. © 2017 Wiley Periodicals, Inc.

  2. Gene regulatory and signaling networks exhibit distinct topological distributions of motifs

    Science.gov (United States)

    Ferreira, Gustavo Rodrigues; Nakaya, Helder Imoto; Costa, Luciano da Fontoura

    2018-04-01

    The biological processes of cellular decision making and differentiation involve a plethora of signaling pathways and gene regulatory circuits. These networks in turn exhibit a multitude of motifs playing crucial parts in regulating network activity. Here we compare the topological placement of motifs in gene regulatory and signaling networks and observe that it suggests different evolutionary strategies in motif distribution for distinct cellular subnetworks.

  3. Finding a Leucine in a Haystack: Searching the Proteome for ambigous Leucine-Aspartic Acid motifs

    KAUST Repository

    Arold, Stefan T.

    2016-01-25

    Leucine-aspartic acid (LD) motifs are short helical protein-protein interaction motifs involved in cell motility, survival and communication. LD motif interactions are also implicated in cancer metastasis and are targeted by several viruses. LD motifs are notoriously difficult to detect because sequence pattern searches lead to an excessively high number of false positives. Hence, despite 20 years of research, only six LD motif–containing proteins are known in humans, three of which are close homologues of the paxillin family. To enable the proteome-wide discovery of LD motifs, we developed LD Motif Finder (LDMF), a web tool based on machine learning that combines sequence information with structural predictions to detect LD motifs with high accuracy. LDMF predicted 13 new LD motifs in humans. Using biophysical assays, we experimentally confirmed in vitro interactions for four novel LD motif proteins. Thus, LDMF allows proteome-wide discovery of LD motifs, despite a highly ambiguous sequence pattern. Functional implications will be discussed.

  4. Phylogenetic inferences of Nepenthes species in Peninsular Malaysia revealed by chloroplast (trnL intron) and nuclear (ITS) DNA sequences

    OpenAIRE

    Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd

    2017-01-01

    Background The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Results Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consi...

  5. Sequence motifs in MADS transcription factors responsible for specificity and diversification of protein-protein interaction.

    Directory of Open Access Journals (Sweden)

    Aalt D J van Dijk

    Full Text Available Protein sequences encompass tertiary structures and contain information about specific molecular interactions, which in turn determine biological functions of proteins. Knowledge about how protein sequences define interaction specificity is largely missing, in particular for paralogous protein families with high sequence similarity, such as the plant MADS domain transcription factor family. In comparison to the situation in mammalian species, this important family of transcription regulators has expanded enormously in plant species and contains over 100 members in the model plant species Arabidopsis thaliana. Here, we provide insight into the mechanisms that determine protein-protein interaction specificity for the Arabidopsis MADS domain transcription factor family, using an integrated computational and experimental approach. Plant MADS proteins have highly similar amino acid sequences, but their dimerization patterns vary substantially. Our computational analysis uncovered small sequence regions that explain observed differences in dimerization patterns with reasonable accuracy. Furthermore, we show the usefulness of the method for prediction of MADS domain transcription factor interaction networks in other plant species. Introduction of mutations in the predicted interaction motifs demonstrated that single amino acid mutations can have a large effect and lead to loss or gain of specific interactions. In addition, various performed bioinformatics analyses shed light on the way evolution has shaped MADS domain transcription factor interaction specificity. Identified protein-protein interaction motifs appeared to be strongly conserved among orthologs, indicating their evolutionary importance. We also provide evidence that mutations in these motifs can be a source for sub- or neo-functionalization. The analyses presented here take us a step forward in understanding protein-protein interactions and the interplay between protein sequences and

  6. CMD: A Database to Store the Bonding States of Cysteine Motifs with Secondary Structures

    Directory of Open Access Journals (Sweden)

    Hamed Bostan

    2012-01-01

    Full Text Available Computational approaches to the disulphide bonding state and its connectivity pattern prediction are based on various descriptors. One descriptor is the amino acid sequence motifs flanking the cysteine residue motifs. Despite the existence of disulphide bonding information in many databases and applications, there is no complete reference and motif query available at the moment. Cysteine motif database (CMD is the first online resource that stores all cysteine residues, their flanking motifs with their secondary structure, and propensity values assignment derived from the laboratory data. We extracted more than 3 million cysteine motifs from PDB and UniProt data, annotated with secondary structure assignment, propensity value assignment, and frequency of occurrence and coefficiency of their bonding status. Removal of redundancies generated 15875 unique flanking motifs that are always bonded and 41577 unique patterns that are always nonbonded. Queries are based on the protein ID, FASTA sequence, sequence motif, and secondary structure individually or in batch format using the provided APIs that allow remote users to query our database via third party software and/or high throughput screening/querying. The CMD offers extensive information about the bonded, free cysteine residues, and their motifs that allows in-depth characterization of the sequence motif composition.

  7. BEAM web server: a tool for structural RNA motif discovery.

    Science.gov (United States)

    Pietrosanto, Marco; Adinolfi, Marta; Casula, Riccardo; Ausiello, Gabriele; Ferrè, Fabrizio; Helmer-Citterich, Manuela

    2018-03-15

    RNA structural motif finding is a relevant problem that becomes computationally hard when working on high-throughput data (e.g. eCLIP, PAR-CLIP), often represented by thousands of RNA molecules. Currently, the BEAM server is the only web tool capable to handle tens of thousands of RNA in input with a motif discovery procedure that is only limited by the current secondary structure prediction accuracies. The recently developed method BEAM (BEAr Motifs finder) can analyze tens of thousands of RNA molecules and identify RNA secondary structure motifs associated to a measure of their statistical significance. BEAM is extremely fast thanks to the BEAR encoding that transforms each RNA secondary structure in a string of characters. BEAM also exploits the evolutionary knowledge contained in a substitution matrix of secondary structure elements, extracted from the RFAM database of families of homologous RNAs. The BEAM web server has been designed to streamline data pre-processing by automatically handling folding and encoding of RNA sequences, giving users a choice for the preferred folding program. The server provides an intuitive and informative results page with the list of secondary structure motifs identified, the logo of each motif, its significance, graphic representation and information about its position in the RNA molecules sharing it. The web server is freely available at http://beam.uniroma2.it/ and it is implemented in NodeJS and Python with all major browsers supported. marco.pietrosanto@uniroma2.it. Supplementary data are available at Bioinformatics online.

  8. Modeling of the Ebola Virus Delta Peptide Reveals a Potential Lytic Sequence Motif

    Directory of Open Access Journals (Sweden)

    William R. Gallaher

    2015-01-01

    Full Text Available Filoviruses, such as Ebola and Marburg viruses, cause severe outbreaks of human infection, including the extensive epidemic of Ebola virus disease (EVD in West Africa in 2014. In the course of examining mutations in the glycoprotein gene associated with 2014 Ebola virus (EBOV sequences, a differential level of conservation was noted between the soluble form of glycoprotein (sGP and the full length glycoprotein (GP, which are both encoded by the GP gene via RNA editing. In the region of the proteins encoded after the RNA editing site sGP was more conserved than the overlapping region of GP when compared to a distant outlier species, Tai Forest ebolavirus. Half of the amino acids comprising the “delta peptide”, a 40 amino acid carboxy-terminal fragment of sGP, were identical between otherwise widely divergent species. A lysine-rich amphipathic peptide motif was noted at the carboxyl terminus of delta peptide with high structural relatedness to the cytolytic peptide of the non-structural protein 4 (NSP4 of rotavirus. EBOV delta peptide is a candidate viroporin, a cationic pore-forming peptide, and may contribute to EBOV pathogenesis.

  9. Review article: The mountain motif in the plot of Matthew

    Directory of Open Access Journals (Sweden)

    Gert J. Volschenk

    2010-09-01

    Full Text Available This article reviewed T.L. Donaldson’s book, Jesus on the mountain: A study in Matthean theology, published in 1985 by JSOT Press, Sheffield, and focused on the mountain motif in the structure and plot of the Gospel of Matthew, in addition to the work of Donaldson on the mountain motif as a literary motif and as theological symbol. The mountain is a primary theological setting for Jesus’ ministry and thus is an important setting, serving as one of the literary devices by which Matthew structured and progressed his narrative. The Zion theological and eschatological significance and Second Temple Judaism serve as the historical and theological background for the mountain motif. The last mountain setting (Mt 28:16–20 is the culmination of the three theological themes in the plot of Matthew, namely Christology, ecclesiology and salvation history.

  10. On the origin of distribution patterns of motifs in biological networks

    Directory of Open Access Journals (Sweden)

    Lesk Arthur M

    2008-08-01

    Full Text Available Abstract Background Inventories of small subgraphs in biological networks have identified commonly-recurring patterns, called motifs. The inference that these motifs have been selected for function rests on the idea that their occurrences are significantly more frequent than random. Results Our analysis of several large biological networks suggests, in contrast, that the frequencies of appearance of common subgraphs are similar in natural and corresponding random networks. Conclusion Indeed, certain topological features of biological networks give rise naturally to the common appearance of the motifs. We therefore question whether frequencies of occurrences are reasonable evidence that the structures of motifs have been selected for their functional contribution to the operation of networks.

  11. Distance-dependent duplex DNA destabilization proximal to G-quadruplex/i-motif sequences

    Science.gov (United States)

    König, Sebastian L. B.; Huppert, Julian L.; Sigel, Roland K. O.; Evans, Amanda C.

    2013-01-01

    G-quadruplexes and i-motifs are complementary examples of non-canonical nucleic acid substructure conformations. G-quadruplex thermodynamic stability has been extensively studied for a variety of base sequences, but the degree of duplex destabilization that adjacent quadruplex structure formation can cause has yet to be fully addressed. Stable in vivo formation of these alternative nucleic acid structures is likely to be highly dependent on whether sufficient spacing exists between neighbouring duplex- and quadruplex-/i-motif-forming regions to accommodate quadruplexes or i-motifs without disrupting duplex stability. Prediction of putative G-quadruplex-forming regions is likely to be assisted by further understanding of what distance (number of base pairs) is required for duplexes to remain stable as quadruplexes or i-motifs form. Using oligonucleotide constructs derived from precedented G-quadruplexes and i-motif-forming bcl-2 P1 promoter region, initial biophysical stability studies indicate that the formation of G-quadruplex and i-motif conformations do destabilize proximal duplex regions. The undermining effect that quadruplex formation can have on duplex stability is mitigated with increased distance from the duplex region: a spacing of five base pairs or more is sufficient to maintain duplex stability proximal to predicted quadruplex/i-motif-forming regions. PMID:23771141

  12. Systematic comparison of the response properties of protein and RNA mediated gene regulatory motifs.

    Science.gov (United States)

    Iyengar, Bharat Ravi; Pillai, Beena; Venkatesh, K V; Gadgil, Chetan J

    2017-05-30

    We present a framework enabling the dissection of the effects of motif structure (feedback or feedforward), the nature of the controller (RNA or protein), and the regulation mode (transcriptional, post-transcriptional or translational) on the response to a step change in the input. We have used a common model framework for gene expression where both motif structures have an activating input and repressing regulator, with the same set of parameters, to enable a comparison of the responses. We studied the global sensitivity of the system properties, such as steady-state gain, overshoot, peak time, and peak duration, to parameters. We find that, in all motifs, overshoot correlated negatively whereas peak duration varied concavely with peak time. Differences in the other system properties were found to be mainly dependent on the nature of the controller rather than the motif structure. Protein mediated motifs showed a higher degree of adaptation i.e. a tendency to return to baseline levels; in particular, feedforward motifs exhibited perfect adaptation. RNA mediated motifs had a mild regulatory effect; they also exhibited a lower peaking tendency and mean overshoot. Protein mediated feedforward motifs showed higher overshoot and lower peak time compared to the corresponding feedback motifs.

  13. Plastid and mitochondrion genomic sequences from Arctic Chlorella sp. ArM0029B

    Science.gov (United States)

    2014-01-01

    Background Chorella is the representative taxon of Chlorellales in Trebouxiophyceae, and its chloroplast (cp) genomic information has been thought to depend only on studies concerning Chlorella vulgaris and GenBank information of C. variablis. Mitochondrial (mt) genomic information regarding Chlorella is currently unavailable. To elucidate the evolution of organelle genomes and genetic information of Chlorella, we have sequenced and characterized the cp and mt genomes of Arctic Chlorella sp. ArM0029B. Results The 119,989-bp cp genome lacking inverted repeats and 65,049-bp mt genome were sequenced. The ArM0029B cp genome contains 114 conserved genes, including 32 tRNA genes, 3 rRNA genes, and 79 genes encoding proteins. Chlorella cp genomes are highly rearranged except for a Chlorella-specific six-gene cluster, and the ArM0029B plastid resembles that of Chlorella variabilis except for a 15-kb gene cluster inversion. In the mt genome, 62 conserved genes, including 27 tRNA genes, 3 rRNA genes, and 32 genes encoding proteins were determined. The mt genome of ArM0029B is similar to that of the non-photosynthetic species Prototheca and Heicosporidium. The ArM0029B mt genome contains a group I intron, with an ORF containing two LAGLIDADG motifs, in cox1. The intronic ORF is shared by C. vulgaris and Prototheca. The phylogeny of the plastid genome reveals that ArM0029B showed a close relationship of Chlorella to Parachlorella and Oocystis within Chlorellales. The distribution of the cox1 intron at 721 support membership in the order Chlorellales. Mitochondrial phylogenomic analyses, however, indicated that ArM0029B shows a greater affinity to MX-AZ01 and Coccomyxa than to the Helicosporidium-Prototheca clade, although the detailed phylogenetic relationships among the three taxa remain to be resolved. Conclusions The plastid genome of ArM0029B is similar to that of C. variabilis. The mt sequence of ArM0029B is the first genome to be reported for Chlorella. Chloroplast

  14. Annotating RNA motifs in sequences and alignments.

    Science.gov (United States)

    Gardner, Paul P; Eldai, Hisham

    2015-01-01

    RNA performs a diverse array of important functions across all cellular life. These functions include important roles in translation, building translational machinery and maturing messenger RNA. More recent discoveries include the miRNAs and bacterial sRNAs that regulate gene expression, the thermosensors, riboswitches and other cis-regulatory elements that help prokaryotes sense their environment and eukaryotic piRNAs that suppress transposition. However, there can be a long period between the initial discovery of a RNA and determining its function. We present a bioinformatic approach to characterize RNA motifs, which are critical components of many RNA structure-function relationships. These motifs can, in some instances, provide researchers with functional hypotheses for uncharacterized RNAs. Moreover, we introduce a new profile-based database of RNA motifs--RMfam--and illustrate some applications for investigating the evolution and functional characterization of RNA. All the data and scripts associated with this work are available from: https://github.com/ppgardne/RMfam. © The Author(s) 2014. Published by Oxford University Press on behalf of Nucleic Acids Research.

  15. Fast social-like learning of complex behaviors based on motor motifs

    Science.gov (United States)

    Calvo Tapia, Carlos; Tyukin, Ivan Y.; Makarov, Valeri A.

    2018-05-01

    Social learning is widely observed in many species. Less experienced agents copy successful behaviors exhibited by more experienced individuals. Nevertheless, the dynamical mechanisms behind this process remain largely unknown. Here we assume that a complex behavior can be decomposed into a sequence of n motor motifs. Then a neural network capable of activating motor motifs in a given sequence can drive an agent. To account for (n -1 )! possible sequences of motifs in a neural network, we employ the winnerless competition approach. We then consider a teacher-learner situation: one agent exhibits a complex movement, while another one aims at mimicking the teacher's behavior. Despite the huge variety of possible motif sequences we show that the learner, equipped with the provided learning model, can rewire "on the fly" its synaptic couplings in no more than (n -1 ) learning cycles and converge exponentially to the durations of the teacher's motifs. We validate the learning model on mobile robots. Experimental results show that the learner is indeed capable of copying the teacher's behavior composed of six motor motifs in a few learning cycles. The reported mechanism of learning is general and can be used for replicating different functions, including, for example, sound patterns or speech.

  16. Phylogenetic inferences of Nepenthes species in Peninsular Malaysia revealed by chloroplast (trnL intron) and nuclear (ITS) DNA sequences.

    Science.gov (United States)

    Bunawan, Hamidun; Yen, Choong Chee; Yaakop, Salmah; Noor, Normah Mohd

    2017-01-26

    The chloroplastic trnL intron and the nuclear internal transcribed spacer (ITS) region were sequenced for 11 Nepenthes species recorded in Peninsular Malaysia to examine their phylogenetic relationship and to evaluate the usage of trnL intron and ITS sequences for phylogenetic reconstruction of this genus. Phylogeny reconstruction was carried out using neighbor-joining, maximum parsimony and Bayesian analyses. All the trees revealed two major clusters, a lowland group consisting of N. ampullaria, N. mirabilis, N. gracilis and N. rafflesiana, and another containing both intermediately distributed species (N. albomarginata and N. benstonei) and four highland species (N. sanguinea, N. macfarlanei, N. ramispina and N. alba). The trnL intron and ITS sequences proved to provide phylogenetic informative characters for deriving a phylogeny of Nepenthes species in Peninsular Malaysia. To our knowledge, this is the first molecular phylogenetic study of Nepenthes species occurring along an altitudinal gradient in Peninsular Malaysia.

  17. Crammed signaling motifs in the T-cell receptor.

    Science.gov (United States)

    Borroto, Aldo; Abia, David; Alarcón, Balbino

    2014-09-01

    Although the T cell antigen receptor (TCR) is long known to contain multiple signaling subunits (CD3γ, CD3δ, CD3ɛ and CD3ζ), their role in signal transduction is still not well understood. The presence of at least one immunoreceptor tyrosine-based activation motif (ITAM) in each CD3 subunit has led to the idea that the multiplication of such elements essentially serves to amplify signals. However, the evolutionary conservation of non-ITAM sequences suggests that each CD3 subunit is likely to have specific non-redundant roles at some stage of development or in mature T cell function. The CD3ɛ subunit is paradigmatic because in a relatively short cytoplasmic sequence (∼55 amino acids) it contains several docking sites for proteins involved in intracellular trafficking and signaling, proteins whose relevance in T cell activation is slowly starting to be revealed. In this review we will summarize our current knowledge on the signaling effectors that bind directly to the TCR and we will propose a hierarchy in their response to TCR triggering. Copyright © 2014 Elsevier B.V. All rights reserved.

  18. Thermal Stability of Modified i-Motif Oligonucleotides with Naphthalimide Intercalating Nucleic Acids

    DEFF Research Database (Denmark)

    El-Sayed, Ahmed Ali; Pedersen, Erik B.; Khaireldin, Nahid Y.

    2016-01-01

    In continuation of our investigation of characteristics and thermodynamic properties of the i-motif 5′-d[(CCCTAA)3CCCT)] upon insertion of intercalating nucleotides into the cytosine-rich oligonucleotide, this article evaluates the stabilities of i-motif oligonucleotides upon insertion of naphtha......In continuation of our investigation of characteristics and thermodynamic properties of the i-motif 5′-d[(CCCTAA)3CCCT)] upon insertion of intercalating nucleotides into the cytosine-rich oligonucleotide, this article evaluates the stabilities of i-motif oligonucleotides upon insertion...... of naphthalimide (1H-benzo[de]isoquinoline-1,3(2H)-dione) as the intercalating nucleic acid. The stabilities of i-motif structures with inserted naphthalimide intercalating nucleotides were studied using UV melting temperatures (Tm) and circular dichroism spectra at different pH values and conditions (crowding...

  19. Role of the Box C/D Motif in Localization of Small Nucleolar RNAs to Coiled Bodies and Nucleoli

    Science.gov (United States)

    Narayanan, Aarthi; Speckmann, Wayne; Terns, Rebecca; Terns, Michael P.

    1999-01-01

    Small nucleolar RNAs (snoRNAs) are a large family of eukaryotic RNAs that function within the nucleolus in the biogenesis of ribosomes. One major class of snoRNAs is the box C/D snoRNAs named for their conserved box C and box D sequence elements. We have investigated the involvement of cis-acting sequences and intranuclear structures in the localization of box C/D snoRNAs to the nucleolus by assaying the intranuclear distribution of fluorescently labeled U3, U8, and U14 snoRNAs injected into Xenopus oocyte nuclei. Analysis of an extensive panel of U3 RNA variants showed that the box C/D motif, comprised of box C′, box D, and the 3′ terminal stem of U3, is necessary and sufficient for the nucleolar localization of U3 snoRNA. Disruption of the elements of the box C/D motif of U8 and U14 snoRNAs also prevented nucleolar localization, indicating that all box C/D snoRNAs use a common nucleolar-targeting mechanism. Finally, we found that wild-type box C/D snoRNAs transiently associate with coiled bodies before they localize to nucleoli and that variant RNAs that lack an intact box C/D motif are detained within coiled bodies. These results suggest that coiled bodies play a role in the biogenesis and/or intranuclear transport of box C/D snoRNAs. PMID:10397754

  20. When Heterotrimeric G Proteins Are Not Activated by G Protein-Coupled Receptors: Structural Insights and Evolutionary Conservation.

    Science.gov (United States)

    DiGiacomo, Vincent; Marivin, Arthur; Garcia-Marcos, Mikel

    2018-01-23

    Heterotrimeric G proteins are signal-transducing switches conserved across eukaryotes. In humans, they work as critical mediators of intercellular communication in the context of virtually any physiological process. While G protein regulation by G protein-coupled receptors (GPCRs) is well-established and has received much attention, it has become recently evident that heterotrimeric G proteins can also be activated by cytoplasmic proteins. However, this alternative mechanism of G protein regulation remains far less studied than GPCR-mediated signaling. This Viewpoint focuses on recent advances in the characterization of a group of nonreceptor proteins that contain a sequence dubbed the "Gα-binding and -activating (GBA) motif". So far, four proteins present in mammals [GIV (also known as Girdin), DAPLE, CALNUC, and NUCB2] and one protein in Caenorhabditis elegans (GBAS-1) have been described as possessing a functional GBA motif. The GBA motif confers guanine nucleotide exchange factor activity on Gαi subunits in vitro and activates G protein signaling in cells. The importance of this mechanism of signal transduction is highlighted by the fact that its dysregulation underlies human diseases, such as cancer, which has made the proteins attractive new candidates for therapeutic intervention. Here we discuss recent discoveries on the structural basis of GBA-mediated activation of G proteins and its evolutionary conservation and compare them with the better-studied mechanism mediated by GPCRs.

  1. Motif formation and industry specific topologies in the Japanese business firm network

    Science.gov (United States)

    Maluck, Julian; Donner, Reik V.; Takayasu, Hideki; Takayasu, Misako

    2017-05-01

    Motifs and roles are basic quantities for the characterization of interactions among 3-node subsets in complex networks. In this work, we investigate how the distribution of 3-node motifs can be influenced by modifying the rules of an evolving network model while keeping the statistics of simpler network characteristics, such as the link density and the degree distribution, invariant. We exemplify this problem for the special case of the Japanese Business Firm Network, where a well-studied and relatively simple yet realistic evolving network model is available, and compare the resulting motif distribution in the real-world and simulated networks. To better approximate the motif distribution of the real-world network in the model, we introduce both subgraph dependent and global additional rules. We find that a specific rule that allows only for the merging process between nodes with similar link directionality patterns reduces the observed excess of densely connected motifs with bidirectional links. Our study improves the mechanistic understanding of motif formation in evolving network models to better describe the characteristic features of real-world networks with a scale-free topology.

  2. Binding properties of SUMO-interacting motifs (SIMs) in yeast.

    Science.gov (United States)

    Jardin, Christophe; Horn, Anselm H C; Sticht, Heinrich

    2015-03-01

    Small ubiquitin-like modifier (SUMO) conjugation and interaction play an essential role in many cellular processes. A large number of yeast proteins is known to interact non-covalently with SUMO via short SUMO-interacting motifs (SIMs), but the structural details of this interaction are yet poorly characterized. In the present work, sequence analysis of a large dataset of 148 yeast SIMs revealed the existence of a hydrophobic core binding motif and a preference for acidic residues either within or adjacent to the core motif. Thus the sequence properties of yeast SIMs are highly similar to those described for human. Molecular dynamics simulations were performed to investigate the binding preferences for four representative SIM peptides differing in the number and distribution of acidic residues. Furthermore, the relative stability of two previously observed alternative binding orientations (parallel, antiparallel) was assessed. For all SIMs investigated, the antiparallel binding mode remained stable in the simulations and the SIMs were tightly bound via their hydrophobic core residues supplemented by polar interactions of the acidic residues. In contrary, the stability of the parallel binding mode is more dependent on the sequence features of the SIM motif like the number and position of acidic residues or the presence of additional adjacent interaction motifs. This information should be helpful to enhance the prediction of SIMs and their binding properties in different organisms to facilitate the reconstruction of the SUMO interactome.

  3. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun; Li, Yue; Peng, Chengbin

    2015-01-01

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  4. Identification of coupling DNA motif pairs on long-range chromatin interactions in human K562 cells

    KAUST Repository

    Wong, Ka-Chun

    2015-09-27

    Motivation: The protein-DNA interactions between transcription factors (TFs) and transcription factor binding sites (TFBSs, also known as DNA motifs) are critical activities in gene transcription. The identification of the DNA motifs is a vital task for downstream analysis. Unfortunately, the long-range coupling information between different DNA motifs is still lacking. To fill the void, as the first-of-its-kind study, we have identified the coupling DNA motif pairs on long-range chromatin interactions in human. Results: The coupling DNA motif pairs exhibit substantially higher DNase accessibility than the background sequences. Half of the DNA motifs involved are matched to the existing motif databases, although nearly all of them are enriched with at least one gene ontology term. Their motif instances are also found statistically enriched on the promoter and enhancer regions. Especially, we introduce a novel measurement called motif pairing multiplicity which is defined as the number of motifs that are paired with a given motif on chromatin interactions. Interestingly, we observe that motif pairing multiplicity is linked to several characteristics such as regulatory region type, motif sequence degeneracy, DNase accessibility and pairing genomic distance. Taken into account together, we believe the coupling DNA motif pairs identified in this study can shed lights on the gene transcription mechanism under long-range chromatin interactions. © The Author 2015. Published by Oxford University Press.

  5. Significant association of interleukin-4 gene intron 3 VNTR polymorphism with susceptibility to knee osteoarthritis.

    Science.gov (United States)

    Yigit, Serbulent; Inanir, Ahmet; Tekcan, Akın; Tural, Ercan; Ozturk, Gokhan Tuna; Kismali, Gorkem; Karakus, Nevin

    2014-03-01

    Interleukin-4 (IL-4) is a strong chondroprotective cytokine and polymorphisms within this gene may be a risk factor for osteoarthritis (OA). We aimed to investigate genotype and allele frequencies of IL-4 gene intron 3 variable number of tandem repeats (VNTR) polymorphism in patients with knee OA in a Turkish population. The study included 202 patients with knee OA and 180 healthy controls. Genomic DNA was isolated and IL-4 gene 70 bp VNTR polymorphism determined by using polymerase chain reaction (PCR) with specific primers followed by restriction fragment length polymorphism (RFLP) analysis. Our result show that there was statistically significant difference between knee OA patients and control group with respect to IL-4 genotype distribution and allele frequencies (p=0.000, OR: 0.20, 95% CI: 0.10-0.41, OR: 0.22, 95% CI: 0.12-0.42, respectively). Our findings suggest that there is an association of IL-4 gene intron 3 VNTR polymorphism with susceptibility of a person for development of knee OA. As a result, IL-4 gene intron 3 VNTR polymorphism could be a genetic marker in OA in a Turkish study population. This is the first association study that evaluates the associations between IL-4 gene VNTR polymorphism and knee OA. Crown Copyright © 2013. Published by Elsevier B.V. All rights reserved.

  6. Extreme sequence divergence but conserved ligand-binding specificity in Streptococcus pyogenes M protein.

    Directory of Open Access Journals (Sweden)

    2006-05-01

    Full Text Available Many pathogenic microorganisms evade host immunity through extensive sequence variability in a protein region targeted by protective antibodies. In spite of the sequence variability, a variable region commonly retains an important ligand-binding function, reflected in the presence of a highly conserved sequence motif. Here, we analyze the limits of sequence divergence in a ligand-binding region by characterizing the hypervariable region (HVR of Streptococcus pyogenes M protein. Our studies were focused on HVRs that bind the human complement regulator C4b-binding protein (C4BP, a ligand that confers phagocytosis resistance. A previous comparison of C4BP-binding HVRs identified residue identities that could be part of a binding motif, but the extended analysis reported here shows that no residue identities remain when additional C4BP-binding HVRs are included. Characterization of the HVR in the M22 protein indicated that two relatively conserved Leu residues are essential for C4BP binding, but these residues are probably core residues in a coiled-coil, implying that they do not directly contribute to binding. In contrast, substitution of either of two relatively conserved Glu residues, predicted to be solvent-exposed, had no effect on C4BP binding, although each of these changes had a major effect on the antigenic properties of the HVR. Together, these findings show that HVRs of M proteins have an extraordinary capacity for sequence divergence and antigenic variability while retaining a specific ligand-binding function.

  7. Efficient sequential and parallel algorithms for finding edit distance based motifs.

    Science.gov (United States)

    Pal, Soumitra; Xiao, Peng; Rajasekaran, Sanguthevar

    2016-08-18

    Motif search is an important step in extracting meaningful patterns from biological data. The general problem of motif search is intractable and there is a pressing need to develop efficient, exact and approximation algorithms to solve this problem. In this paper, we present several novel, exact, sequential and parallel algorithms for solving the (l,d) Edit-distance-based Motif Search (EMS) problem: given two integers l,d and n biological strings, find all strings of length l that appear in each input string with atmost d errors of types substitution, insertion and deletion. One popular technique to solve the problem is to explore for each input string the set of all possible l-mers that belong to the d-neighborhood of any substring of the input string and output those which are common for all input strings. We introduce a novel and provably efficient neighborhood exploration technique. We show that it is enough to consider the candidates in neighborhood which are at a distance exactly d. We compactly represent these candidate motifs using wildcard characters and efficiently explore them with very few repetitions. Our sequential algorithm uses a trie based data structure to efficiently store and sort the candidate motifs. Our parallel algorithm in a multi-core shared memory setting uses arrays for storing and a novel modification of radix-sort for sorting the candidate motifs. The algorithms for EMS are customarily evaluated on several challenging instances such as (8,1), (12,2), (16,3), (20,4), and so on. The best previously known algorithm, EMS1, is sequential and in estimated 3 days solves up to instance (16,3). Our sequential algorithms are more than 20 times faster on (16,3). On other hard instances such as (9,2), (11,3), (13,4), our algorithms are much faster. Our parallel algorithm has more than 600 % scaling performance while using 16 threads. Our algorithms have pushed up the state-of-the-art of EMS solvers and we believe that the techniques introduced in

  8. Physical-chemical property based sequence motifs and methods regarding same

    Science.gov (United States)

    Braun, Werner [Friendswood, TX; Mathura, Venkatarajan S [Sarasota, FL; Schein, Catherine H [Friendswood, TX

    2008-09-09

    A data analysis system, program, and/or method, e.g., a data mining/data exploration method, using physical-chemical property motifs. For example, a sequence database may be searched for identifying segments thereof having physical-chemical properties similar to the physical-chemical property motifs.

  9. Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs.

    Science.gov (United States)

    Regad, Leslie; Martin, Juliette; Camproux, Anne-Claude

    2011-06-20

    One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet), which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i) ubiquitous motifs, shared by several superfamilies and (ii) superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P) and SAH/SAM. Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.

  10. Dissecting protein loops with a statistical scalpel suggests a functional implication of some structural motifs

    Directory of Open Access Journals (Sweden)

    Martin Juliette

    2011-06-01

    Full Text Available Abstract Background One of the strategies for protein function annotation is to search particular structural motifs that are known to be shared by proteins with a given function. Results Here, we present a systematic extraction of structural motifs of seven residues from protein loops and we explore their correspondence with functional sites. Our approach is based on the structural alphabet HMM-SA (Hidden Markov Model - Structural Alphabet, which allows simplification of protein structures into uni-dimensional sequences, and advanced pattern statistics adapted to short sequences. Structural motifs of interest are selected by looking for structural motifs significantly over-represented in SCOP superfamilies in protein loops. We discovered two types of structural motifs significantly over-represented in SCOP superfamilies: (i ubiquitous motifs, shared by several superfamilies and (ii superfamily-specific motifs, over-represented in few superfamilies. A comparison of ubiquitous words with known small structural motifs shows that they contain well-described motifs as turn, niche or nest motifs. A comparison between superfamily-specific motifs and biological annotations of Swiss-Prot reveals that some of them actually correspond to functional sites involved in the binding sites of small ligands, such as ATP/GTP, NAD(P and SAH/SAM. Conclusions Our findings show that statistical over-representation in SCOP superfamilies is linked to functional features. The detection of over-represented motifs within structures simplified by HMM-SA is therefore a promising approach for prediction of functional sites and annotation of uncharacterized proteins.

  11. Organization of feed-forward loop motifs reveals architectural principles in natural and engineered networks.

    Science.gov (United States)

    Gorochowski, Thomas E; Grierson, Claire S; di Bernardo, Mario

    2018-03-01

    Network motifs are significantly overrepresented subgraphs that have been proposed as building blocks for natural and engineered networks. Detailed functional analysis has been performed for many types of motif in isolation, but less is known about how motifs work together to perform complex tasks. To address this issue, we measure the aggregation of network motifs via methods that extract precisely how these structures are connected. Applying this approach to a broad spectrum of networked systems and focusing on the widespread feed-forward loop motif, we uncover striking differences in motif organization. The types of connection are often highly constrained, differ between domains, and clearly capture architectural principles. We show how this information can be used to effectively predict functionally important nodes in the metabolic network of Escherichia coli . Our findings have implications for understanding how networked systems are constructed from motif parts and elucidate constraints that guide their evolution.

  12. RSAT matrix-clustering: dynamic exploration and redundancy reduction of transcription factor binding motif collections.

    Science.gov (United States)

    Castro-Mondragon, Jaime Abraham; Jaeger, Sébastien; Thieffry, Denis; Thomas-Chollier, Morgane; van Helden, Jacques

    2017-07-27

    Transcription factor (TF) databases contain multitudes of binding motifs (TFBMs) from various sources, from which non-redundant collections are derived by manual curation. The advent of high-throughput methods stimulated the production of novel collections with increasing numbers of motifs. Meta-databases, built by merging these collections, contain redundant versions, because available tools are not suited to automatically identify and explore biologically relevant clusters among thousands of motifs. Motif discovery from genome-scale data sets (e.g. ChIP-seq) also produces redundant motifs, hampering the interpretation of results. We present matrix-clustering, a versatile tool that clusters similar TFBMs into multiple trees, and automatically creates non-redundant TFBM collections. A feature unique to matrix-clustering is its dynamic visualisation of aligned TFBMs, and its capability to simultaneously treat multiple collections from various sources. We demonstrate that matrix-clustering considerably simplifies the interpretation of combined results from multiple motif discovery tools, and highlights biologically relevant variations of similar motifs. We also ran a large-scale application to cluster ∼11 000 motifs from 24 entire databases, showing that matrix-clustering correctly groups motifs belonging to the same TF families, and drastically reduced motif redundancy. matrix-clustering is integrated within the RSAT suite (http://rsat.eu/), accessible through a user-friendly web interface or command-line for its integration in pipelines. © The Author(s) 2017. Published by Oxford University Press on behalf of Nucleic Acids Research.

  13. Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

    KAUST Repository

    Guturu, H.

    2013-11-11

    Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and \\'through-DNA\\' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.

  14. Structure-aided prediction of mammalian transcription factor complexes in conserved non-coding elements

    KAUST Repository

    Guturu, H.; Doxey, A. C.; Wenger, A. M.; Bejerano, G.

    2013-01-01

    Mapping the DNA-binding preferences of transcription factor (TF) complexes is critical for deciphering the functions of cis-regulatory elements. Here, we developed a computational method that compares co-occurring motif spacings in conserved versus unconserved regions of the human genome to detect evolutionarily constrained binding sites of rigid TF complexes. Structural data were used to estimate TF complex physical plausibility, explore overlapping motif arrangements seldom tackled by non-structure-aware methods, and generate and analyse three-dimensional models of the predicted complexes bound to DNA. Using this approach, we predicted 422 physically realistic TF complex motifs at 18% false discovery rate, the majority of which (326, 77%) contain some sequence overlap between binding sites. The set of mostly novel complexes is enriched in known composite motifs, predictive of binding site configurations in TF-TF-DNA crystal structures, and supported by ChIP-seq datasets. Structural modelling revealed three cooperativity mechanisms: direct protein-protein interactions, potentially indirect interactions and 'through-DNA' interactions. Indeed, 38% of the predicted complexes were found to contain four or more bases in which TF pairs appear to synergize through overlapping binding to the same DNA base pairs in opposite grooves or strands. Our TF complex and associated binding site predictions are available as a web resource at http://bejerano.stanford.edu/complex.

  15. Structural fragment clustering reveals novel structural and functional motifs in α-helical transmembrane proteins

    Directory of Open Access Journals (Sweden)

    Vassilev Boris

    2010-04-01

    Full Text Available Abstract Background A large proportion of an organism's genome encodes for membrane proteins. Membrane proteins are important for many cellular processes, and several diseases can be linked to mutations in them. With the tremendous growth of sequence data, there is an increasing need to reliably identify membrane proteins from sequence, to functionally annotate them, and to correctly predict their topology. Results We introduce a technique called structural fragment clustering, which learns sequential motifs from 3D structural fragments. From over 500,000 fragments, we obtain 213 statistically significant, non-redundant, and novel motifs that are highly specific to α-helical transmembrane proteins. From these 213 motifs, 58 of them were assigned to function and checked in the scientific literature for a biological assessment. Seventy percent of the motifs are found in co-factor, ligand, and ion binding sites, 30% at protein interaction interfaces, and 12% bind specific lipids such as glycerol or cardiolipins. The vast majority of motifs (94% appear across evolutionarily unrelated families, highlighting the modularity of functional design in membrane proteins. We describe three novel motifs in detail: (1 a dimer interface motif found in voltage-gated chloride channels, (2 a proton transfer motif found in heme-copper oxidases, and (3 a convergently evolved interface helix motif found in an aspartate symporter, a serine protease, and cytochrome b. Conclusions Our findings suggest that functional modules exist in membrane proteins, and that they occur in completely different evolutionary contexts and cover different binding sites. Structural fragment clustering allows us to link sequence motifs to function through clusters of structural fragments. The sequence motifs can be applied to identify and characterize membrane proteins in novel genomes.

  16. H2B ubiquitylation is part of chromatin architecture that marks exon-intron structure in budding yeast

    LENUS (Irish Health Repository)

    Shieh, Grace S.

    2011-12-22

    Abstract Background The packaging of DNA into chromatin regulates transcription from initiation through 3\\' end processing. One aspect of transcription in which chromatin plays a poorly understood role is the co-transcriptional splicing of pre-mRNA. Results Here we provide evidence that H2B monoubiquitylation (H2BK123ub1) marks introns in Saccharomyces cerevisiae. A genome-wide map of H2BK123ub1 in this organism reveals that this modification is enriched in coding regions and that its levels peak at the transcribed regions of two characteristic subgroups of genes. First, long genes are more likely to have higher levels of H2BK123ub1, correlating with the postulated role of this modification in preventing cryptic transcription initiation in ORFs. Second, genes that are highly transcribed also have high levels of H2BK123ub1, including the ribosomal protein genes, which comprise the majority of intron-containing genes in yeast. H2BK123ub1 is also a feature of introns in the yeast genome, and the disruption of this modification alters the intragenic distribution of H3 trimethylation on lysine 36 (H3K36me3), which functionally correlates with alternative RNA splicing in humans. In addition, the deletion of genes encoding the U2 snRNP subunits, Lea1 or Msl1, in combination with an htb-K123R mutation, leads to synthetic lethality. Conclusion These data suggest that H2BK123ub1 facilitates cross talk between chromatin and pre-mRNA splicing by modulating the distribution of intronic and exonic histone modifications.

  17. H2B ubiquitylation is part of chromatin architecture that marks exon-intron structure in budding yeast

    Directory of Open Access Journals (Sweden)

    Shieh Grace S

    2011-12-01

    Full Text Available Abstract Background The packaging of DNA into chromatin regulates transcription from initiation through 3' end processing. One aspect of transcription in which chromatin plays a poorly understood role is the co-transcriptional splicing of pre-mRNA. Results Here we provide evidence that H2B monoubiquitylation (H2BK123ub1 marks introns in Saccharomyces cerevisiae. A genome-wide map of H2BK123ub1 in this organism reveals that this modification is enriched in coding regions and that its levels peak at the transcribed regions of two characteristic subgroups of genes. First, long genes are more likely to have higher levels of H2BK123ub1, correlating with the postulated role of this modification in preventing cryptic transcription initiation in ORFs. Second, genes that are highly transcribed also have high levels of H2BK123ub1, including the ribosomal protein genes, which comprise the majority of intron-containing genes in yeast. H2BK123ub1 is also a feature of introns in the yeast genome, and the disruption of this modification alters the intragenic distribution of H3 trimethylation on lysine 36 (H3K36me3, which functionally correlates with alternative RNA splicing in humans. In addition, the deletion of genes encoding the U2 snRNP subunits, Lea1 or Msl1, in combination with an htb-K123R mutation, leads to synthetic lethality. Conclusion These data suggest that H2BK123ub1 facilitates cross talk between chromatin and pre-mRNA splicing by modulating the distribution of intronic and exonic histone modifications.

  18. Myosin-1A Targets to Microvilli Using Multiple Membrane Binding Motifs in the Tail Homology 1 (TH1) Domain*

    Science.gov (United States)

    Mazerik, Jessica N.; Tyska, Matthew J.

    2012-01-01

    One of the most abundant components of the enterocyte brush border is the actin-based monomeric motor, myosin-1a (Myo1a). Within brush border microvilli, Myo1a carries out a number of critical functions at the interface between membrane and actin cytoskeleton. Proper physiological function of Myo1a depends on its ability to bind to microvillar membrane, an interaction mediated by a C-terminal tail homology 1 (TH1) domain. However, little is known about the mechanistic details of the Myo1a-TH1/membrane interaction. Structure-function analysis of Myo1a-TH1 targeting in epithelial cells revealed that an N-terminal motif conserved among class I myosins and a C-terminal motif unique to Myo1a-TH1 are both required for steady state microvillar enrichment. Purified Myo1a bound to liposomes composed of phosphatidylserine and phosphoinositol 4,5-bisphosphate, with moderate affinity in a charge-dependent manner. Additionally, peptides of the N- and C-terminal regions required for targeting were able to compete with Myo1a for binding to highly charged liposomes in vitro. Single molecule total internal reflection fluorescence microscopy showed that these motifs are also necessary for slowing the membrane detachment rate in cells. Finally, Myo1a-TH1 co-localized with both lactadherin-C2 (a phosphatidylserine-binding protein) and PLCδ1-PH (a phosphoinositol 4,5-bisphosphate-binding protein) in microvilli, but only lactaderin-C2 expression reduced brush border targeting of Myo1a-TH1. Together, our results suggest that Myo1a targeting to microvilli is driven by membrane binding potential that is distributed throughout TH1 rather than localized to a single motif. These data highlight the diversity of mechanisms that enable different class I myosins to target membranes in distinct biological contexts. PMID:22367206

  19. [Cover motifs of the Tidsskrift. A 14-year cavalcade].

    Science.gov (United States)

    Nylenna, M

    1998-12-10

    In 1985 the Journal of the Norwegian Medical Association changed its cover policy, moving the table of contents inside the Journal and introducing cover illustrations. This article provides an analysis of all cover illustrations published over this 14-year period, 420 covers in all. There is a great variation in cover motifs and designs and a development towards more general motifs. The initial emphasis on historical and medical aspects is now less pronounced, while the use of works of art and nature motifs has increased, and the cover now more often has a direct bearing on the specific contents of the issue. Professor of medical history Oivind Larsen has photographed two thirds of the covers and contributed 95% of the inside essay-style reflections on the cover motif. Over the years, he has expanded the role of the historian of medicine disseminating knowledge to include that of the raconteur with a personal tone of voice. The Journal's covers are now one of its most characteristic features, emblematic of the Journal's ambition of standing for quality and timelessness vis-à-vis the news media, and of its aim of bridging the gap between medicine and the humanities.

  20. Insights into the motif preference of APOBEC3 enzymes.

    Directory of Open Access Journals (Sweden)

    Diako Ebrahimi

    Full Text Available We used a multivariate data analysis approach to identify motifs associated with HIV hypermutation by different APOBEC3 enzymes. The analysis showed that APOBEC3G targets G mainly within GG, TG, TGG, GGG, TGGG and also GGGT. The G nucleotides flanked by a C at the 3' end (in +1 and +2 positions were indicated as disfavoured targets by APOBEC3G. The G nucleotides within GGGG were found to be targeted at a frequency much less than what is expected. We found that the infrequent G-to-A mutation within GGGG is not limited to the inaccessibility, to APOBEC3, of poly Gs in the central and 3'polypurine tracts (PPTs which remain double stranded during the HIV reverse transcription. GGGG motifs outside the PPTs were also disfavoured. The motifs GGAG and GAGG were also found to be disfavoured targets for APOBEC3. The motif-dependent mutation of G within the HIV genome by members of the APOBEC3 family other than APOBEC3G was limited to GA→AA changes. The results did not show evidence of other types of context dependent G-to-A changes in the HIV genome.

  1. I-motif DNA structures are formed in the nuclei of human cells

    Science.gov (United States)

    Zeraati, Mahdi; Langley, David B.; Schofield, Peter; Moye, Aaron L.; Rouet, Romain; Hughes, William E.; Bryan, Tracy M.; Dinger, Marcel E.; Christ, Daniel

    2018-06-01

    Human genome function is underpinned by the primary storage of genetic information in canonical B-form DNA, with a second layer of DNA structure providing regulatory control. I-motif structures are thought to form in cytosine-rich regions of the genome and to have regulatory functions; however, in vivo evidence for the existence of such structures has so far remained elusive. Here we report the generation and characterization of an antibody fragment (iMab) that recognizes i-motif structures with high selectivity and affinity, enabling the detection of i-motifs in the nuclei of human cells. We demonstrate that the in vivo formation of such structures is cell-cycle and pH dependent. Furthermore, we provide evidence that i-motif structures are formed in regulatory regions of the human genome, including promoters and telomeric regions. Our results support the notion that i-motif structures provide key regulatory roles in the genome.

  2. Nonsynonymous substitution in abalone sperm fertilization genes exceeds substitution in introns and mitochondrial DNA

    Science.gov (United States)

    Metz, Edward C.; Robles-Sikisaka, Refugio; Vacquier, Victor D.

    1998-01-01

    Strong positive Darwinian selection acts on two sperm fertilization proteins, lysin and 18-kDa protein, from abalone (Haliotis). To understand the phylogenetic context for this dramatic molecular evolution, we obtained sequences of mitochondrial cytochrome c oxidase subunit I (mtCOI), and genomic sequences of lysin, 18-kDa, and a G protein subunit. Based on mtDNA differentiation, four north Pacific abalone species diverged within the past 2 million years (Myr), and remaining north Pacific species diverged over a period of 4–20 Myr. Between-species nonsynonymous differences in lysin and 18-kDa exons exceed nucleotide differences in introns by 3.5- to 24-fold. Remarkably, in some comparisons nonsynonymous substitutions in lysin and 18-kDa genes exceed synonymous substitutions in mtCOI. Lysin and 18-kDa intron/exon segments were sequenced from multiple red abalone individuals collected over a 1,200-km range. Only two nucleotide changes and two sites of slippage variation were detected in a total of >29,000 nucleotides surveyed. However, polymorphism in mtCOI and a G protein intron was found in this species. This finding suggests that positive selection swept one lysin allele and one 18-kDa allele to fixation. Similarities between mtCOI and lysin gene trees indicate that rapid adaptive evolution of lysin has occurred consistently through the history of the group. Comparisons with mtCOI molecular clock calibrations suggest that nonsynonymous substitutions accumulate 2–50 times faster in lysin and 18-kDa genes than in rapidly evolving mammalian genes. PMID:9724763

  3. Identification and analysis of Eimeria nieschulzi gametocyte genes reveal splicing events of gam genes and conserved motifs in the wall-forming proteins within the genus Eimeria (Coccidia, Apicomplexa

    Directory of Open Access Journals (Sweden)

    Wiedmer Stefanie

    2017-01-01

    Full Text Available The genus Eimeria (Apicomplexa, Coccidia provides a wide range of different species with different hosts to study common and variable features within the genus and its species. A common characteristic of all known Eimeria species is the oocyst, the infectious stage where its life cycle starts and ends. In our study, we utilized Eimeria nieschulzi as a model organism. This rat-specific parasite has complex oocyst morphology and can be transfected and even cultivated in vitro up to the oocyst stage. We wanted to elucidate how the known oocyst wall-forming proteins are preserved in this rodent Eimeria species compared to other Eimeria. In newly obtained genomics data, we were able to identify different gametocyte genes that are orthologous to already known gam genes involved in the oocyst wall formation of avian Eimeria species. These genes appeared putatively as single exon genes, but cDNA analysis showed alternative splicing events in the transcripts. The analysis of the translated sequence revealed different conserved motifs but also dissimilar regions in GAM proteins, as well as polymorphic regions. The occurrence of an underrepresented gam56 gene version suggests the existence of a second distinct E. nieschulzi genotype within the E. nieschulzi Landers isolate that we maintain.

  4. Identification and analysis of Eimeria nieschulzi gametocyte genes reveal splicing events of gam genes and conserved motifs in the wall-forming proteins within the genus Eimeria (Coccidia, Apicomplexa)

    Science.gov (United States)

    Wiedmer, Stefanie; Erdbeer, Alexander; Volke, Beate; Randel, Stephanie; Kapplusch, Franz; Hanig, Sacha; Kurth, Michael

    2017-01-01

    The genus Eimeria (Apicomplexa, Coccidia) provides a wide range of different species with different hosts to study common and variable features within the genus and its species. A common characteristic of all known Eimeria species is the oocyst, the infectious stage where its life cycle starts and ends. In our study, we utilized Eimeria nieschulzi as a model organism. This rat-specific parasite has complex oocyst morphology and can be transfected and even cultivated in vitro up to the oocyst stage. We wanted to elucidate how the known oocyst wall-forming proteins are preserved in this rodent Eimeria species compared to other Eimeria. In newly obtained genomics data, we were able to identify different gametocyte genes that are orthologous to already known gam genes involved in the oocyst wall formation of avian Eimeria species. These genes appeared putatively as single exon genes, but cDNA analysis showed alternative splicing events in the transcripts. The analysis of the translated sequence revealed different conserved motifs but also dissimilar regions in GAM proteins, as well as polymorphic regions. The occurrence of an underrepresented gam56 gene version suggests the existence of a second distinct E. nieschulzi genotype within the E. nieschulzi Landers isolate that we maintain. PMID:29210668

  5. Identification of a putative nuclear export signal motif in human NANOG homeobox domain

    International Nuclear Information System (INIS)

    Park, Sung-Won; Do, Hyun-Jin; Huh, Sun-Hyung; Sung, Boreum; Uhm, Sang-Jun; Song, Hyuk; Kim, Nam-Hyung; Kim, Jae-Hwan

    2012-01-01

    Highlights: ► We found the putative nuclear export signal motif within human NANOG homeodomain. ► Leucine-rich residues are important for human NANOG homeodomain nuclear export. ► CRM1-specific inhibitor LMB blocked the potent human NANOG NES-mediated nuclear export. -- Abstract: NANOG is a homeobox-containing transcription factor that plays an important role in pluripotent stem cells and tumorigenic cells. To understand how nuclear localization of human NANOG is regulated, the NANOG sequence was examined and a leucine-rich nuclear export signal (NES) motif ( 125 MQELSNILNL 134 ) was found in the homeodomain (HD). To functionally validate the putative NES motif, deletion and site-directed mutants were fused to an EGFP expression vector and transfected into COS-7 cells, and the localization of the proteins was examined. While hNANOG HD exclusively localized to the nucleus, a mutant with both NLSs deleted and only the putative NES motif contained (hNANOG HD-ΔNLSs) was predominantly cytoplasmic, as observed by nucleo/cytoplasmic fractionation and Western blot analysis as well as confocal microscopy. Furthermore, site-directed mutagenesis of the putative NES motif in a partial hNANOG HD only containing either one of the two NLS motifs led to localization in the nucleus, suggesting that the NES motif may play a functional role in nuclear export. Furthermore, CRM1-specific nuclear export inhibitor LMB blocked the hNANOG potent NES-mediated export, suggesting that the leucine-rich motif may function in CRM1-mediated nuclear export of hNANOG. Collectively, a NES motif is present in the hNANOG HD and may be functionally involved in CRM1-mediated nuclear export pathway.

  6. Leucine-based receptor sorting motifs are dependent on the spacing relative to the plasma membrane

    DEFF Research Database (Denmark)

    Geisler, C; Dietrich, J; Nielsen, B L

    1998-01-01

    Many integral membrane proteins contain leucine-based motifs within their cytoplasmic domains that mediate internalization and intracellular sorting. Two types of leucine-based motifs have been identified. One type is dependent on phosphorylation, whereas the other type, which includes an acidic...... amino acid, is constitutively active. In this study, we have investigated how the spacing relative to the plasma membrane affects the function of both types of leucine-based motifs. For phosphorylation-dependent leucine-based motifs, a minimal spacing of 7 residues between the plasma membrane...... and the phospho-acceptor was required for phosphorylation and thereby activation of the motifs. For constitutively active leucine-based motifs, a minimal spacing of 6 residues between the plasma membrane and the acidic residue was required for optimal activity of the motifs. In addition, we found that the acidic...

  7. Autoinhibition and signaling by the switch II motif in the G-protein chaperone of a radical B12 enzyme.

    Science.gov (United States)

    Lofgren, Michael; Koutmos, Markos; Banerjee, Ruma

    2013-10-25

    MeaB is an accessory GTPase protein involved in the assembly, protection, and reactivation of 5'-deoxyadenosyl cobalamin-dependent methylmalonyl-CoA mutase (MCM). Mutations in the human ortholog of MeaB result in methylmalonic aciduria, an inborn error of metabolism. G-proteins typically utilize conserved switch I and II motifs for signaling to effector proteins via conformational changes elicited by nucleotide binding and hydrolysis. Our recent discovery that MeaB utilizes an unusual switch III region for bidirectional signaling with MCM raised questions about the roles of the switch I and II motifs in MeaB. In this study, we addressed the functions of conserved switch II residues by performing alanine-scanning mutagenesis. Our results demonstrate that the GTPase activity of MeaB is autoinhibited by switch II and that this loop is important for coupling nucleotide-sensitive conformational changes in switch III to elicit the multiple chaperone functions of MeaB. Furthermore, we report the structure of MeaB·GDP crystallized in the presence of AlFx(-) to form the putative transition state analog, GDP·AlF4(-). The resulting crystal structure and its comparison with related G-proteins support the conclusion that the catalytic site of MeaB is incomplete in the absence of the GTPase-activating protein MCM and therefore unable to stabilize the transition state analog. Favoring an inactive conformation in the absence of the client MCM protein might represent a strategy for suppressing the intrinsic GTPase activity of MeaB in which the switch II loop plays an important role.

  8. quatre-quart1 is an indispensable U12 intron-containing gene that plays a crucial role in Arabidopsis development.

    Science.gov (United States)

    Kwak, Kyung Jin; Kim, Bo Mi; Lee, Kwanuk; Kang, Hunseung

    2017-05-17

    Despite increasing understanding of the importance of the splicing of U12-type introns in plant development, the key question of which U12 intron-containing genes are essential for plant development has not yet been explored. Here, we assessed the functional role of the quatre-quart1 (QQT1) gene, one of the ~230 U12 intron-containing genes in Arabidopsis thaliana. Expression of QQT1 in the U11/U12-31K small nuclear ribonucleoprotein mutant (31k) rescued the developmental-defect phenotypes of the 31k mutant, whereas the miRNA-mediated qqt1 knockdown mutants displayed severe defects in growth and development, including severely arrested stem growth, small size, and the formation of serrated leaves. The structures of the shoot apical meristems in the qqt1 mutants were abnormal and disordered. Identification of QQT1-interacting proteins via a yeast two-hybrid screening and a firefly luciferase complementation-imaging assay revealed that a variety of proteins, including many chloroplast-targeted proteins, interacted with QQT1. Importantly, the levels of chloroplast-targeted proteins in the chloroplast were reduced, and the chloroplast structure was abnormal in the qqt1 mutant. Collectively, these results provide clear evidence that QQT1 is an indispensable U12 intron-containing gene whose correct splicing is crucial for the normal development of Arabidopsis. © The Author 2017. Published by Oxford University Press on behalf of the Society for Experimental Biology.

  9. Characterization of novel and uncharacterized p53 SNPs in the Chinese population--intron 2 SNP co-segregates with the common codon 72 polymorphism.

    Directory of Open Access Journals (Sweden)

    Beng Hooi Phang

    Full Text Available Multiple single nucleotide polymorphisms (SNPs have been identified in the tumor suppressor gene p53, though the relevance of many of them is unclear. Some of them are also differentially distributed in various ethnic populations, suggesting selective functionality. We have therefore sequenced all exons and flanking regions of p53 from the Singaporean Chinese population and report here the characterization of some novel and uncharacterized SNPs - four in intron 1 (nucleotide positions 8759/10361/10506/11130, three in intron 3 (11968/11969/11974 and two in the 3'UTR (19168/19514. Allelic frequencies were determined for all these and some known SNPs, and were compared in a limited scale to leukemia and lung cancer patient samples. Intron 2 (11827 and 7 (14181/14201 SNPs were found to have a high minor allele frequency of between 26-47%, in contrast to the lower frequencies found in the US population, but similar in trend to the codon 72 polymorphism (SNP12139 that shows a distribution pattern correlative with latitude. Several of the SNPs were linked, such as those in introns 1, 3 and 7. Most interestingly, we noticed the co-segregation of the intron 2 and the codon 72 SNPs, the latter which has been shown to be expressed in an allele-specific manner, suggesting possible regulatory cross-talk. Association analysis indicated that the T/G alleles in both the co-segregating intron 7 SNPs and a 4tagSNP haplotype was strongly associated increased susceptibility to lung cancer in non-smoker females [OR: 1.97 (1.32, 3.394]. These data together demonstrate high SNP diversity in p53 gene between different populations, highlighting ethnicity-based differences, and their association with cancer risk.

  10. Sequence comparison of the rDNA introns from six different species of Tetrahymena

    DEFF Research Database (Denmark)

    Nielsen, Henrik; Engberg, J

    1985-01-01

    model for the intron RNA of Cech et al. (Proc. Natl. Acad. Sci. U.S.A. 80, 3903 (83)). Most of the sequence variation in the four new sequences reported here is found in single stranded loops in the model. However, in four cases we found nucleotide substitutions in duplex stem regions, two of them...

  11. The organization structure and regulatory elements of Chlamydomonas histone genes reveal features linking plant and animal genes.

    Science.gov (United States)

    Fabry, S; Müller, K; Lindauer, A; Park, P B; Cornelius, T; Schmitt, R

    1995-09-01

    The genome of the green alga Chlamydomonas reinhardtii contains approximately 15 gene clusters of the nucleosomal (or core) histone H2A, H2B, H3 and H4 genes and at least one histone H1 gene. Seven non-allelic histone gene loci were isolated from a genomic library, physically mapped, and the nucleotide sequences of three isotypes of each core histone gene species and one linked H1 gene determined. The core histone genes are organized in clusters of H2A-H2B and H3-H4 pairs, in which each gene pair shows outwardly divergent transcription from a short (< 300 bp) intercistronic region. These intercistronic regions contain typically conserved promoter elements, namely a TATA-box and the three motifs TGGCCAG-G(G/C)-CGAG, CGTTGACC and CGGTTG. Different from the genes of higher plants, but like those of animals and the related alga Volvox, the 3' untranslated regions contain no poly A signal, but a palindromic sequence (3' palindrome) essential for mRNA processing is present. One single H1 gene was found in close linkage to a H2A-H2B pair. The H1 upstream region contains the octameric promoter element GGTTGACC (also found upstream of the core histone genes) and two specific sequence motifs that are shared only with the Volvox H1 promoters. This suggests differential transcription of the H1 and the core histone genes. The H1 gene is interrupted by two introns. Unlike Volvox H3 genes, the three sequenced H3 isoforms are intron-free. Primer-directed PCR of genomic DNA demonstrated, however, that at least 8 of the about 15 H3 genes do contain one intron at a conserved position. In synchronized C. reinhardtii cells, H4 mRNA levels (representative of all core histone mRNAs) peak during cell division, suggesting strict replication-dependent gene control. The derived peptide sequences place C. reinhardtii core histones closer to plants than to animals, except that the H2A histones are more animal-like. The peptide sequence of histone H1 is closely related to the V. carteri VH1-II

  12. A cyclic nucleotide-gated channel mutation associated with canine daylight blindness provides insight into a role for the S2 segment tri-Asp motif in channel biogenesis.

    Directory of Open Access Journals (Sweden)

    Naoto Tanaka

    Full Text Available Cone cyclic nucleotide-gated channels are tetramers formed by CNGA3 and CNGB3 subunits; CNGA3 subunits function as homotetrameric channels but CNGB3 exhibits channel function only when co-expressed with CNGA3. An aspartatic acid (Asp to asparagine (Asn missense mutation at position 262 in the canine CNGB3 (D262N subunit results in loss of cone function (daylight blindness, suggesting an important role for this aspartic acid residue in channel biogenesis and/or function. Asp 262 is located in a conserved region of the second transmembrane segment containing three Asp residues designated the Tri-Asp motif. This motif is conserved in all CNG channels. Here we examine mutations in canine CNGA3 homomeric channels using a combination of experimental and computational approaches. Mutations of these conserved Asp residues result in the absence of nucleotide-activated currents in heterologous expression. A fluorescent tag on CNGA3 shows mislocalization of mutant channels. Co-expressing CNGB3 Tri-Asp mutants with wild type CNGA3 results in some functional channels, however, their electrophysiological characterization matches the properties of homomeric CNGA3 channels. This failure to record heteromeric currents suggests that Asp/Asn mutations affect heteromeric subunit assembly. A homology model of S1-S6 of the CNGA3 channel was generated and relaxed in a membrane using molecular dynamics simulations. The model predicts that the Tri-Asp motif is involved in non-specific salt bridge pairings with positive residues of S3/S4. We propose that the D262N mutation in dogs with CNGB3-day blindness results in the loss of these inter-helical interactions altering the electrostatic equilibrium within in the S1-S4 bundle. Because residues analogous to Tri-Asp in the voltage-gated Shaker potassium channel family were implicated in monomer folding, we hypothesize that destabilizing these electrostatic interactions impairs the monomer folding state in D262N mutant CNG

  13. Conserved CDC20 cell cycle functions are carried out by two of the five isoforms in Arabidopsis thaliana.

    Directory of Open Access Journals (Sweden)

    Zoltán Kevei

    Full Text Available The CDC20 and Cdh1/CCS52 proteins are substrate determinants and activators of the Anaphase Promoting Complex/Cyclosome (APC/C E3 ubiquitin ligase and as such they control the mitotic cell cycle by targeting the degradation of various cell cycle regulators. In yeasts and animals the main CDC20 function is the destruction of securin and mitotic cyclins. Plants have multiple CDC20 gene copies whose functions have not been explored yet. In Arabidopsis thaliana there are five CDC20 isoforms and here we aimed at defining their contribution to cell cycle regulation, substrate selectivity and plant development.Studying the gene structure and phylogeny of plant CDC20s, the expression of the five AtCDC20 gene copies and their interactions with the APC/C subunit APC10, the CCS52 proteins, components of the mitotic checkpoint complex (MCC and mitotic cyclin substrates, conserved CDC20 functions could be assigned for AtCDC20.1 and AtCDC20.2. The other three intron-less genes were silent and specific for Arabidopsis. We show that AtCDC20.1 and AtCDC20.2 are components of the MCC and interact with mitotic cyclins with unexpected specificity. AtCDC20.1 and AtCDC20.2 are expressed in meristems, organ primordia and AtCDC20.1 also in pollen grains and developing seeds. Knocking down both genes simultaneously by RNAi resulted in severe delay in plant development and male sterility. In these lines, the meristem size was reduced while the cell size and ploidy levels were unaffected indicating that the lower cell number and likely slowdown of the cell cycle are the cause of reduced plant growth.The intron-containing CDC20 gene copies provide conserved and redundant functions for cell cycle progression in plants and are required for meristem maintenance, plant growth and male gametophyte formation. The Arabidopsis-specific intron-less genes are possibly "retrogenes" and have hitherto undefined functions or are pseudogenes.

  14. DistAMo: A web-based tool to characterize DNA-motif distribution on bacterial chromosomes

    Directory of Open Access Journals (Sweden)

    Patrick eSobetzko

    2016-03-01

    Full Text Available Short DNA motifs are involved in a multitude of functions such as for example chromosome segregation, DNA replication or mismatch repair. Distribution of such motifs is often not random and the specific chromosomal pattern relates to the respective motif function. Computational approaches which quantitatively assess such chromosomal motif patterns are necessary. Here we present a new computer tool DistAMo (Distribution Analysis of DNA Motifs. The algorithm uses codon redundancy to calculate the relative abundance of short DNA motifs from single genes to entire chromosomes. Comparative genomics analyses of the GATC-motif distribution in γ-proteobacterial genomes using DistAMo revealed that (i genes beside the replication origin are enriched in GATCs, (ii genome-wide GATC distribution follows a distinct pattern and (iii genes involved in DNA replication and repair are enriched in GATCs. These features are specific for bacterial chromosomes encoding a Dam methyltransferase. The new software is available as a stand-alone or as an easy-to-use web-based server version at http://www.computational.bio.uni-giessen.de/distamo.

  15. Structure-function analysis of the human sialyltransferase ST3Gal I - Role of N-glycosylation and a novel conserved sialylmotif

    DEFF Research Database (Denmark)

    Jeanneau, C.; Chazalet, V.; Auge, C.

    2004-01-01

    of these residues and of the conserved residues of motif VS (HX4E) was assessed using as a template the human ST3Gal I. Mutational analysis showed that residues His(299) and Tyr(300) of the new motif, and His(316) of the VS motif, are essential for activity since their substitution by alanine yielded inactive...... showed that none of the mutants tested had any significant effect in nucleotide donor binding. Instead the mutant proteins were affected in their binding to the acceptor and/or demonstrated lower catalytic efficiency. Although the human ST3Gal I has four N-glycan attachment sites in its catalytic domain...... that are potentially glycosylated, none of them was shown to be necessary for enzyme activity. However, N-glycosylation appears to contribute to the proper folding and trafficking of the enzyme....

  16. Selection against spurious promoter motifs correlates withtranslational efficiency across bacteria

    Energy Technology Data Exchange (ETDEWEB)

    Froula, Jeffrey L.; Francino, M. Pilar

    2007-05-01

    Because binding of RNAP to misplaced sites could compromise the efficiency of transcription, natural selection for the optimization of gene expression should regulate the distribution of DNA motifs capable of RNAP-binding across the genome. Here we analyze the distribution of the -10 promoter motifs that bind the {sigma}{sup 70} subunit of RNAP in 42 bacterial genomes. We show that selection on these motifs operates across the genome, maintaining an over-representation of -10 motifs in regulatory sequences while eliminating them from the nonfunctional and, in most cases, from the protein coding regions. In some genomes, however, -10 sites are over-represented in the coding sequences; these sites could induce pauses effecting regulatory roles throughout the length of a transcriptional unit. For nonfunctional sequences, the extent of motif under-representation varies across genomes in a manner that broadly correlates with the number of tRNA genes, a good indicator of translational speed and growth rate. This suggests that minimizing the time invested in gene transcription is an important selective pressure against spurious binding. However, selection against spurious binding is detectable in the reduced genomes of host-restricted bacteria that grow at slow rates, indicating that components of efficiency other than speed may also be important. Minimizing the number of RNAP molecules per cell required for transcription, and the corresponding energetic expense, may be most relevant in slow growers. These results indicate that genome-level properties affecting the efficiency of transcription and translation can respond in an integrated manner to optimize gene expression. The detection of selection against promoter motifs in nonfunctional regions also implies that no sequence may evolve free of selective constraints, at least in the relatively small and unstructured genomes of bacteria.

  17. An integrative and applicable phylogenetic footprinting framework for cis-regulatory motifs identification in prokaryotic genomes.

    Science.gov (United States)

    Liu, Bingqiang; Zhang, Hanyuan; Zhou, Chuan; Li, Guojun; Fennell, Anne; Wang, Guanghui; Kang, Yu; Liu, Qi; Ma, Qin

    2016-08-09

    Phylogenetic footprinting is an important computational technique for identifying cis-regulatory motifs in orthologous regulatory regions from multiple genomes, as motifs tend to evolve slower than their surrounding non-functional sequences. Its application, however, has several difficulties for optimizing the selection of orthologous data and reducing the false positives in motif prediction. Here we present an integrative phylogenetic footprinting framework for accurate motif predictions in prokaryotic genomes (MP(3)). The framework includes a new orthologous data preparation procedure, an additional promoter scoring and pruning method and an integration of six existing motif finding algorithms as basic motif search engines. Specifically, we collected orthologous genes from available prokaryotic genomes and built the orthologous regulatory regions based on sequence similarity of promoter regions. This procedure made full use of the large-scale genomic data and taxonomy information and filtered out the promoters with limited contribution to produce a high quality orthologous promoter set. The promoter scoring and pruning is implemented through motif voting by a set of complementary predicting tools that mine as many motif candidates as possible and simultaneously eliminate the effect of random noise. We have applied the framework to Escherichia coli k12 genome and evaluated the prediction performance through comparison with seven existing programs. This evaluation was systematically carried out at the nucleotide and binding site level, and the results showed that MP(3) consistently outperformed other popular motif finding tools. We have integrated MP(3) into our motif identification and analysis server DMINDA, allowing users to efficiently identify and analyze motifs in 2,072 completely sequenced prokaryotic genomes. The performance evaluation indicated that MP(3) is effective for predicting regulatory motifs in prokaryotic genomes. Its application may enhance

  18. Amino acid sequence motifs essential for P0-mediated suppression of RNA silencing in an isolate of potato leafroll virus from Inner Mongolia.

    Science.gov (United States)

    Zhuo, Tao; Li, Yuan-Yuan; Xiang, Hai-Ying; Wu, Zhan-Yu; Wang, Xian-Bin; Wang, Ying; Zhang, Yong-Liang; Li, Da-Wei; Yu, Jia-Lin; Han, Cheng-Gui

    2014-06-01

    Polerovirus P0 suppressors of host gene silencing contain a consensus F-box-like motif with Leu/Pro (L/P) requirements for suppressor activity. The Inner Mongolian Potato leafroll virus (PLRV) P0 protein (P0(PL-IM)) has an unusual F-box-like motif that contains a Trp/Gly (W/G) sequence and an additional GW/WG-like motif (G139/W140/G141) that is lacking in other P0 proteins. We used Agrobacterium infiltration-mediated RNA silencing assays to establish that P0(PL-IM) has a strong suppressor activity. Mutagenesis experiments demonstrated that the P0(PL-IM) F-box-like motif encompasses amino acids 76-LPRHLHYECLEWGLLCG THP-95, and that the suppressor activity is abolished by L76A, W87A, or G88A substitution. The suppressor activity is also weakened substantially by mutations within the G139/W140/G141 region and is eliminated by a mutation (F220R) in a C-terminal conserved sequence of P0(PL-IM). As has been observed with other P0 proteins, P0(PL-IM) suppression is correlated with reduced accumulation of the host AGO1-silencing complex protein. However, P0(PL-IM) fails to bind SKP1, which functions in a proteasome pathway that may be involved in AGO1 degradation. These results suggest that P0(PL-IM) may suppress RNA silencing by using an alternative pathway to target AGO1 for degradation. Our results help improve our understanding of the molecular mechanisms involved in PLRV infection.

  19. A deep intronic CLRN1 (USH3A) founder mutation generates an aberrant exon and underlies severe Usher syndrome on the Arabian Peninsula.

    Science.gov (United States)

    Khan, Arif O; Becirovic, Elvir; Betz, Christian; Neuhaus, Christine; Altmüller, Janine; Maria Riedmayr, Lisa; Motameny, Susanne; Nürnberg, Gudrun; Nürnberg, Peter; Bolz, Hanno J

    2017-05-03

    Deafblindness is mostly due to Usher syndrome caused by recessive mutations in the known genes. Mutation-negative patients therefore either have distinct diseases, mutations in yet unknown Usher genes or in extra-exonic parts of the known genes - to date a largely unexplored possibility. In a consanguineous Saudi family segregating Usher syndrome type 1 (USH1), NGS of genes for Usher syndrome, deafness and retinal dystrophy and subsequent whole-exome sequencing each failed to identify a mutation. Genome-wide linkage analysis revealed two small candidate regions on chromosome 3, one containing the USH3A gene CLRN1, which has never been associated with Usher syndrome in Saudi Arabia. Whole-genome sequencing (WGS) identified a homozygous deep intronic mutation, c.254-649T > G, predicted to generate a novel donor splice site. CLRN1 minigene-based analysis confirmed the splicing of an aberrant exon due to usage of this novel motif, resulting in a frameshift and a premature termination codon. We identified this mutation in an additional two of seven unrelated mutation-negative Saudi USH1 patients. Locus-specific markers indicated that c.254-649T > G CLRN1 represents a founder allele that may significantly contribute to deafblindness in this population. Our finding underlines the potential of WGS to uncover atypically localized, hidden mutations in patients who lack exonic mutations in the known disease genes.

  20. Non-Watson Crick base pairs might stabilize RNA structural motifs in ...

    Indian Academy of Sciences (India)

    Watson Crick base pairs, internal loops and pseudoknots have been the highlighting feature of recent structural determination of RNAs. The recent crystal structure of group-I introns has demonstrated that these might constitute RNA structural ...